Unsloth AI

Team

company

Verified

https://unsloth.ai

UnslothAI

unslothai

unsloth

AI & ML interests

Open Source AI 🦥

Recent Activity

danielhanchen new activity about 7 hours ago

unsloth/DeepSeek-V3.2-GGUF:Does this support DSA/lighting attention?

danielhanchen new activity about 7 hours ago

unsloth/DeepSeek-V3.2-GGUF:Already testing it

danielhanchen new activity about 11 hours ago

unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF:New Refresh with added Tool Calling in calibration dataset and improved imatrix

unsloth's collections (30)

Unsloth Dynamic 2.0 Quants

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance.

unsloth/GLM-4.7-Flash-GGUF

Text Generation • 30B • Updated 7 days ago • 265k • 373
unsloth/Kimi-K2.5-GGUF

1T • Updated 2 days ago • 8.75k • 120
unsloth/Nemotron-3-Nano-30B-A3B-GGUF

Text Generation • 32B • Updated about 1 month ago • 96.9k • 236
unsloth/GLM-4.7-GGUF

Text Generation • 358B • Updated Dec 27, 2025 • 113k • 190

Qwen3-VL

Qwen's new multimodal vision models in GGUF, safetensors, and dynamic Unsloth formats.

unsloth/Qwen3-VL-30B-A3B-Instruct-GGUF

Image-Text-to-Text • 31B • Updated 29 days ago • 164k • 76
unsloth/Qwen3-VL-30B-A3B-Thinking-GGUF

Image-Text-to-Text • 31B • Updated 29 days ago • 25k • 32
unsloth/Qwen3-VL-4B-Instruct-GGUF

Image-Text-to-Text • 4B • Updated Oct 31, 2025 • 21.4k • 36
unsloth/Qwen3-VL-4B-Thinking-GGUF

Image-Text-to-Text • 4B • Updated Oct 31, 2025 • 5.1k • 19

Ministral 3

Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes.

unsloth/Ministral-3-14B-Instruct-2512-GGUF

14B • Updated Dec 4, 2025 • 20.4k • 59
unsloth/Ministral-3-14B-Reasoning-2512-GGUF

14B • Updated Dec 4, 2025 • 13.2k • 35
unsloth/Ministral-3-8B-Instruct-2512-GGUF

8B • Updated Dec 4, 2025 • 12.8k • 17
unsloth/Ministral-3-8B-Reasoning-2512-GGUF

8B • Updated Dec 4, 2025 • 5.17k • 8

DeepSeek-V3.1

DeepSeek's new 3.1 update to their V3 models!

unsloth/DeepSeek-V3.1-Terminus-GGUF

671B • Updated Sep 24, 2025 • 9.29k • 67
unsloth/DeepSeek-V3.1-GGUF

671B • Updated Sep 22, 2025 • 9.27k • 93
unsloth/DeepSeek-V3.1

Text Generation • 685B • Updated Aug 21, 2025 • 5 • 3
unsloth/DeepSeek-V3.1-BF16

Text Generation • 684B • Updated Aug 21, 2025 • 250 • 1

Embedding Models

Run or fine-tune embedding models with Unsloth.

unsloth/embeddinggemma-300m

Sentence Similarity • 0.3B • Updated 8 days ago • 9.42k • 8
unsloth/embeddinggemma-300m-GGUF

Sentence Similarity • 0.3B • Updated Sep 4, 2025 • 4.92k • 48
unsloth/Qwen3-Embedding-0.6B

Feature Extraction • 0.6B • Updated 8 days ago • 935 • 3
unsloth/Qwen3-Embedding-4B

Feature Extraction • 4B • Updated 8 days ago • 696 • 1

Gemma 3

All versions of Google's new multimodal models, including QAT, in 1B, 4B, 12B, and 27B sizes. Available in GGUF, dynamic 4-bit, and 16-bit formats.

unsloth/gemma-3-270m-it-GGUF

Text Generation • 0.3B • Updated Aug 15, 2025 • 20.8k • 147
unsloth/gemma-3-270m-it-qat-GGUF

Text Generation • 0.3B • Updated Aug 15, 2025 • 5.63k • 11
unsloth/gemma-3-270m-it

Text Generation • 0.3B • Updated Aug 14, 2025 • 24k • 22
unsloth/gemma-3-270m-it-unsloth-bnb-4bit

Text Generation • 0.3B • Updated Aug 14, 2025 • 14.1k • 5

Gemma 3n

Google's Gemma 3n models in all versions, including Dynamic GGUF, 4-bit, and 16-bit formats!

unsloth/gemma-3n-E4B-it-GGUF

Image-Text-to-Text • 7B • Updated Jun 30, 2025 • 19.7k • 186
unsloth/gemma-3n-E2B-it-GGUF

Image-Text-to-Text • 4B • Updated Jul 17, 2025 • 23.4k • 57
unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit

Image-Text-to-Text • 8B • Updated Jul 11, 2025 • 18.5k • 9
unsloth/gemma-3n-E4B-unsloth-bnb-4bit

Image-Text-to-Text • 8B • Updated Jul 11, 2025 • 1.04k • 4

Phi-4 (All Versions)

Microsoft's Phi-4 models, including Reasoning, Reasoning Plus, and mini. Includes Dynamic 2.0 GGUF, 4-bit, and 16-bit versions, plus Unsloth's bug fixes.

unsloth/Phi-4-reasoning-plus-GGUF

Text Generation • 15B • Updated May 1, 2025 • 3.6k • 77
unsloth/Phi-4-mini-reasoning-GGUF

Text Generation • 4B • Updated May 1, 2025 • 5.44k • 58
unsloth/Phi-4-reasoning-GGUF

Text Generation • 15B • Updated May 1, 2025 • 1.42k • 19
unsloth/phi-4-GGUF

Text Generation • 15B • Updated Jan 13, 2025 • 3.49k • 180

Deepseek V3 (All Versions)

DeepSeek-V3-0324 and V3, available in original and Dynamic GGUF formats, with 2-bit to 8-bit quantized versions.

unsloth/DeepSeek-V3-0324-GGUF-UD

Text Generation • 671B • Updated Apr 28, 2025 • 1.04k • 21
unsloth/DeepSeek-V3-0324-GGUF

Text Generation • 671B • Updated May 22, 2025 • 3.98k • 197
unsloth/DeepSeek-V3-0324

Text Generation • 684B • Updated Apr 21, 2025 • 12 • 7
unsloth/DeepSeek-V3-0324-BF16

Text Generation • 684B • Updated Jul 14, 2025 • 28.2k • 4

Mistral Small 3 (All Versions)

A collection of Mistral's new Small 3.2 and 3 models including GGUF, 4-bit and more!

unsloth/Mistral-Small-3.2-24B-Instruct-2506-GGUF

Image-Text-to-Text • 24B • Updated Aug 26, 2025 • 48.3k • 153
unsloth/Mistral-Small-3.2-24B-Instruct-2506

Image-Text-to-Text • 24B • Updated Aug 26, 2025 • 2.57k • 11
unsloth/Mistral-Small-3.2-24B-Instruct-2506-FP8

Image-Text-to-Text • Updated Jun 21, 2025 • 205 • 6
unsloth/Mistral-Small-3.2-24B-Instruct-2506-unsloth-bnb-4bit

Image-Text-to-Text • 25B • Updated Jun 23, 2025 • 2.56k • 12

Llama 3.3 (All Versions)

Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions.

unsloth/Llama-3.3-70B-Instruct-GGUF

Text Generation • 71B • Updated May 10, 2025 • 8.75k • 92
unsloth/Llama-3.3-70B-Instruct

Text Generation • 71B • Updated Nov 25, 2025 • 2.73k • 48
unsloth/Llama-3.3-70B-Instruct-bnb-4bit

Text Generation • 71B • Updated Nov 25, 2025 • 7.26k • 52

Qwen QwQ-32B Collection

Qwen's reasoning models including QwQ (32B) & QVQ (72B) in formats: GGUF, dynamic 4-bit and 16-bit original versions.

unsloth/QwQ-32B-GGUF

Text Generation • 33B • Updated Apr 27, 2025 • 1.39k • 86
unsloth/QwQ-32B-unsloth-bnb-4bit

Text Generation • 34B • Updated Mar 7, 2025 • 276 • 47
unsloth/QwQ-32B

Text Generation • 33B • Updated Apr 27, 2025 • 11 • 17
unsloth/QwQ-32B-bnb-4bit

Text Generation • 34B • Updated Mar 5, 2025 • 106 • 4

Llama 3.2 Vision

Meta's Llama 3.2 vision models in 11B and 90B sizes. Includes 4-bit bnb and original versions.

unsloth/Llama-3.2-11B-Vision-Instruct

Image-to-Text • 11B • Updated Dec 10, 2024 • 27.5k • 88
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit

Image-to-Text • 11B • Updated Dec 10, 2024 • 6.73k • 80
unsloth/Llama-3.2-11B-Vision

Image-to-Text • 11B • Updated Nov 22, 2024 • 495 • 34
unsloth/Llama-3.2-11B-Vision-bnb-4bit

Image-to-Text • 11B • Updated Nov 22, 2024 • 189 • 16

Llama 3.1 Collection

Meta's Llama 3.1 models including 8B, 70B, 405B. Includes 4-bit bnb and original versions.

unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit

Text Generation • 8B • Updated Feb 15, 2025 • 242k • 92
unsloth/Llama-3.1-8B-Instruct-unsloth-bnb-4bit

Text Generation • 8B • Updated Feb 15, 2025 • 44.9k • 4
unsloth/Meta-Llama-3.1-8B-Instruct-unsloth-bnb-4bit

Text Generation • 8B • Updated Feb 15, 2025 • 149k • 4
unsloth/Meta-Llama-3.1-8B-bnb-4bit

Text Generation • 8B • Updated Feb 15, 2025 • 49.7k • 109

Load 4bit models 4x faster

Native bitsandbytes 4-bit pre-quantized models.

unsloth/Llama-3.2-3B-bnb-4bit

Text Generation • 3B • Updated Jun 2, 2025 • 21.4k • 21
unsloth/Meta-Llama-3.1-8B-bnb-4bit

Text Generation • 8B • Updated Feb 15, 2025 • 49.7k • 109
unsloth/llama-3-8b-Instruct-bnb-4bit

Text Generation • 8B • Updated Nov 22, 2024 • 54k • 133
unsloth/gemma-2-9b-bnb-4bit

Text Generation • 10B • Updated Jul 22, 2025 • 8.16k • 31

Unsloth Diffusion GGUFs

Find GGUFs and other variants of the diffusion-based Qwen-Image and FLUX models.

unsloth/Qwen-Image-2512-GGUF

Text-to-Image • 20B • Updated 24 days ago • 121k • 280
unsloth/LTX-2-GGUF

Image-to-Video • 19B • Updated 8 days ago • 25.6k • 91
unsloth/Z-Image-GGUF

Text-to-Image • 6B • Updated 3 days ago • 2.46k • 49
unsloth/FLUX.2-klein-9B-GGUF

Image-to-Image • 9B • Updated 14 days ago • 44.2k • 68

gpt-oss

OpenAI's gpt-oss-20b and gpt-oss-120b are here! The powerful open models are available in GGUF, original, and 4-bit formats.

unsloth/gpt-oss-20b-GGUF

Text Generation • 21B • Updated Dec 19, 2025 • 126k • 554
unsloth/gpt-oss-120b-GGUF

Text Generation • 117B • Updated Aug 25, 2025 • 81.6k • 201
unsloth/gpt-oss-20b-unsloth-bnb-4bit

Text Generation • 21B • Updated Aug 8, 2025 • 146k • 35
unsloth/gpt-oss-120b-unsloth-bnb-4bit

Text Generation • 117B • Updated Aug 8, 2025 • 20.1k • 12

Qwen3

Qwen's new Qwen3 models in Unsloth Dynamic 2.0, GGUF, 4-bit, and 16-bit safetensors formats. Includes 128K context length variants.

unsloth/Qwen3-30B-A3B-Instruct-2507-GGUF

31B • Updated Jul 31, 2025 • 34.3k • 282
unsloth/Qwen3-4B-Instruct-2507-GGUF

4B • Updated Aug 20, 2025 • 59k • 136
unsloth/Qwen3-4B-Thinking-2507-GGUF

4B • Updated Sep 11, 2025 • 11.8k • 88
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF

Text Generation • 480B • Updated Jul 31, 2025 • 3.52k • 166

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.

unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF

Text Generation • 8B • Updated Jun 16, 2025 • 50.4k • 367
unsloth/DeepSeek-R1-0528-GGUF

Text Generation • 671B • Updated Jun 15, 2025 • 3.64k • 193
unsloth/DeepSeek-R1-0528-Qwen3-8B-unsloth-bnb-4bit

Text Generation • 8B • Updated Jun 10, 2025 • 6.36k • 13
unsloth/DeepSeek-R1-0528

Text Generation • 685B • Updated Jun 10, 2025 • 23 • 15

Qwen3-Coder

The Qwen3-Coder models deliver SOTA advancements in agentic coding and code tasks. Includes Qwen3-Coder-480B-A35B.

unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF

Text Generation • 31B • Updated about 13 hours ago • 107k • 425
unsloth/Qwen3-Coder-30B-A3B-Instruct-1M-GGUF

Text Generation • 31B • Updated Aug 5, 2025 • 12.8k • 139
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF

Text Generation • 480B • Updated Jul 31, 2025 • 3.52k • 166
unsloth/Qwen3-Coder-480B-A35B-Instruct-1M-GGUF

Text Generation • 480B • Updated Jul 23, 2025 • 1.72k • 42

Granite 4.0

IBM's new Granite-4.0 models! Run Dynamic GGUFs or fine-tune with Unsloth.

unsloth/granite-4.0-350m-GGUF

0.4B • Updated Oct 28, 2025 • 1.08k • 4
unsloth/granite-4.0-h-350m-GGUF

0.3B • Updated Oct 28, 2025 • 1.4k • 8
unsloth/granite-4.0-h-1b-GGUF

1B • Updated Oct 28, 2025 • 1.87k • 14
unsloth/granite-4.0-1b-GGUF

2B • Updated Oct 28, 2025 • 847 • 3

Llama 4

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth!

unsloth/Llama-4-Scout-17B-16E-Instruct-GGUF

Image-to-Text • 108B • Updated Jun 17, 2025 • 21.8k • 131
unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF

Image-to-Text • 401B • Updated Jun 18, 2025 • 4.98k • 43
unsloth/Llama-4-Scout-17B-16E-Instruct

Image-to-Text • 109B • Updated Jun 17, 2025 • 618 • 56
unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-bnb-4bit

Image-to-Text • 112B • Updated Apr 12, 2025 • 1.39k • 80

Unsloth 4-bit Dynamic Quants

Unsloth's Dynamic 4-bit Quants selectively skip quantizing certain parameters, greatly improving accuracy while using <10% more VRAM than BnB 4-bit.

unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit

Text Generation • 5B • Updated Jul 18, 2025 • 37.7k • 36
unsloth/DeepSeek-R1-Distill-Qwen-14B-unsloth-bnb-4bit

Text Generation • 15B • Updated Feb 14, 2025 • 2.83k • 30
unsloth/DeepSeek-R1-Distill-Qwen-7B-unsloth-bnb-4bit

Text Generation • 5B • Updated Feb 14, 2025 • 3.24k • 24
unsloth/gemma-3-12b-it-unsloth-bnb-4bit

Image-to-Text • 12B • Updated May 12, 2025 • 40.4k • 24
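
To give a feel for the "<10% more VRAM" figure in the Dynamic 4-bit description above, here is a rough back-of-the-envelope sketch. It assumes the skipped parameters stay in 16-bit while everything else is stored in 4-bit, and it ignores quantization constants, activations, and the KV cache. The function names are hypothetical, and this is not Unsloth's actual selection method, only the arithmetic implied by mixing two precisions.

```python
def mixed_precision_overhead(skip_fraction: float,
                             low_bits: int = 4,
                             high_bits: int = 16) -> float:
    """Relative extra weight memory versus quantizing every parameter to low_bits.

    skip_fraction is the share of parameters left unquantized (assumed to be
    kept at high_bits); the rest is assumed to be stored at low_bits.
    """
    if not 0.0 <= skip_fraction <= 1.0:
        raise ValueError("skip_fraction must be between 0 and 1")
    # Average number of bits per parameter across the whole model.
    avg_bits = (1.0 - skip_fraction) * low_bits + skip_fraction * high_bits
    return avg_bits / low_bits - 1.0


def max_skip_fraction(overhead_budget: float,
                      low_bits: int = 4,
                      high_bits: int = 16) -> float:
    """Largest share of parameters that can stay at high_bits within the budget."""
    return overhead_budget * low_bits / (high_bits - low_bits)


if __name__ == "__main__":
    # With 4-bit and 16-bit storage the overhead is 3x the skipped fraction.
    print(f"overhead at 2% skipped: {mixed_precision_overhead(0.02):.1%}")
    print(f"max skipped share for a 10% budget: {max_skip_fraction(0.10):.2%}")
```

Under these assumptions, keeping roughly 3% of the parameters in 16-bit is already enough to reach a 10% increase in weight memory, which suggests that only a small share of parameters can be left unquantized within that budget.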

Text-to-Speech (TTS) models

A collection of 4-bit, Dynamic 4-bit, and 16-bit voice models including Sesame-CSM, OpenAI's Whisper, and Orpheus. Fine-tune them with Unsloth now!

unsloth/orpheus-3b-0.1-ft-GGUF

Text-to-Speech • 3B • Updated Jul 9, 2025 • 1.2k • 11
unsloth/orpheus-3b-0.1-ft-unsloth-bnb-4bit

Text-to-Speech • 3B • Updated Mar 24, 2025 • 31.9k • 16
unsloth/csm-1b

Text-to-Speech • 2B • Updated May 15, 2025 • 6.41k • 19
unsloth/whisper-large-v3

Automatic Speech Recognition • 2B • Updated May 14, 2025 • 5.65k • 14

Llama 3.2

Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions.

unsloth/Llama-3.2-1B-Instruct-GGUF

Text Generation • 1B • Updated May 9, 2025 • 62.7k • 52
unsloth/Llama-3.2-1B-Instruct

Text Generation • 1B • Updated May 9, 2025 • 89.9k • 87
unsloth/Llama-3.2-1B-Instruct-unsloth-bnb-4bit

Text Generation • 0.8B • Updated Apr 26, 2025 • 55.7k • 4
unsloth/Llama-3.2-1B-Instruct-bnb-4bit

Text Generation • 1B • Updated Jan 23, 2025 • 22.5k • 22

Qwen2.5-VL (All Versions)

All versions of Qwen2.5-VL including the new 32B version and 4-bit, 16-bit and more!

unsloth/Qwen2.5-VL-3B-Instruct-GGUF

Image-Text-to-Text • 3B • Updated May 12, 2025 • 12.5k • 19
unsloth/Qwen2.5-VL-7B-Instruct-GGUF

Image-Text-to-Text • 8B • Updated May 12, 2025 • 74.5k • 137
unsloth/Qwen2.5-VL-32B-Instruct-GGUF

Image-Text-to-Text • 33B • Updated May 12, 2025 • 566 • 7
unsloth/Qwen2.5-VL-72B-Instruct-GGUF

Image-Text-to-Text • 73B • Updated May 18, 2025 • 1.03k • 7

Vision/multimodal Models

Collection of the most popular vision models including Llama 3.2, LLaVA, Qwen2 VL, Pixtral, PaliGemma and more!

unsloth/Llama-3.2-11B-Vision-Instruct

Image-to-Text • 11B • Updated Dec 10, 2024 • 27.5k • 88
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit

Image-to-Text • 11B • Updated Dec 10, 2024 • 6.73k • 80
unsloth/Llama-3.2-11B-Vision-Instruct-unsloth-bnb-4bit

Image-to-Text • 11B • Updated Dec 4, 2024 • 4.71k • 28
unsloth/Qwen2-VL-7B-Instruct-bnb-4bit

Image-Text-to-Text • 9B • Updated Nov 22, 2024 • 2.06k • 6

Qwen 2.5 Coder

Complete collection of the code-specific Qwen2.5 model series in bnb 4-bit, 16-bit, and GGUF formats.

unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF

33B • Updated Nov 15, 2024 • 1.37k • 74
unsloth/Qwen2.5-Coder-14B-Instruct-128K-GGUF

15B • Updated Nov 14, 2024 • 1.28k • 34
unsloth/Qwen2.5-Coder-7B-Instruct-128K-GGUF

8B • Updated Nov 14, 2024 • 2.07k • 20
unsloth/Qwen2.5-Coder-3B-Instruct-128K-GGUF

3B • Updated Nov 15, 2024 • 620 • 14

Qwen 2.5

unsloth/Qwen2.5-7B-Instruct-bnb-4bit

Text Generation • 8B • Updated Apr 28, 2025 • 71k • 20
unsloth/Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Apr 28, 2025 • 50.6k • 22
unsloth/Qwen2.5-14B-bnb-4bit

Text Generation • 15B • Updated Apr 28, 2025 • 1.32k • 5
unsloth/Qwen2.5-7B-bnb-4bit

Text Generation • 8B • Updated Apr 28, 2025 • 5.44k • 6

4bit Instruct Models

unsloth/Llama-3.2-3B-Instruct-bnb-4bit

Text Generation • 3B • Updated Jun 2, 2025 • 28.8k • 33
unsloth/Llama-3.2-1B-Instruct-bnb-4bit

Text Generation • 1B • Updated Jan 23, 2025 • 22.5k • 22
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit

Image-to-Text • 11B • Updated Dec 10, 2024 • 6.73k • 80
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit

Text Generation • 8B • Updated Feb 15, 2025 • 242k • 92

Team members 2