Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Halley AI

company
Verified
https://halleyai.ai/
Activity Feed

AI & ML interests

Text Generation & Chat Assistants; Model Compression & Quantization (Q4/Q6/Q8, gs32); Inference & Serving (on-prem, low-latency); RAG / Retrieval; Agents & Tool Use; Distillation / LoRA / Fine-tuning

Sebastian Stavar's profile picture Mark Conroy's profile picture Taylor Galvan's profile picture Josie Tran's profile picture

halley-ai 's models 9

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-5bit-gs32

Text Generation • 80B • Updated Sep 19 • 39 • 1

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-6bit-gs64

Text Generation • 80B • Updated Sep 19 • 28 • 1

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-4bit-gs64

Text Generation • 80B • Updated Sep 19 • 43 • 1

halley-ai/gpt-oss-120b-MLX-bf16

Text Generation • 117B • Updated Sep 8 • 198 • 2

halley-ai/gpt-oss-120b-MLX-8bit-gs32

Text Generation • 117B • Updated Sep 8 • 85 • 1

halley-ai/gpt-oss-120b-MLX-6bit-gs64

Text Generation • 117B • Updated Sep 8 • 74 • 1

halley-ai/gpt-oss-20b-MLX-5bit-gs32

Text Generation • 21B • Updated Sep 8 • 29 • 1

halley-ai/gpt-oss-20b-MLX-6bit-gs32

Text Generation • 21B • Updated Aug 18 • 26 • 1

halley-ai/gpt-oss-20b-MLX-4bit-gs32

Text Generation • 21B • Updated Aug 18 • 34 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs