New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance.
AI & ML interests
Open Source AI 🦥
Recent Activity
View all activity
Qwen's new multimodal vision models in GGUF, safetensor, and dynamic Unsloth formats.
-
unsloth/Qwen3-VL-30B-A3B-Instruct-GGUF
Image-Text-to-Text • 31B • Updated • 164k • 76 -
unsloth/Qwen3-VL-30B-A3B-Thinking-GGUF
Image-Text-to-Text • 31B • Updated • 25k • 32 -
unsloth/Qwen3-VL-4B-Instruct-GGUF
Image-Text-to-Text • 4B • Updated • 21.4k • 36 -
unsloth/Qwen3-VL-4B-Thinking-GGUF
Image-Text-to-Text • 4B • Updated • 5.1k • 19
Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes.
DeepSeek's new 3.1 update to their V3 models!
Run or fine-tune embedding models with Unsloth.
-
unsloth/embeddinggemma-300m
Sentence Similarity • 0.3B • Updated • 9.42k • • 8 -
unsloth/embeddinggemma-300m-GGUF
Sentence Similarity • 0.3B • Updated • 4.92k • 48 -
unsloth/Qwen3-Embedding-0.6B
Feature Extraction • 0.6B • Updated • 935 • 3 -
unsloth/Qwen3-Embedding-4B
Feature Extraction • 4B • Updated • 696 • 1
All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats.
-
unsloth/gemma-3-270m-it-GGUF
Text Generation • 0.3B • Updated • 20.8k • 147 -
unsloth/gemma-3-270m-it-qat-GGUF
Text Generation • 0.3B • Updated • 5.63k • 11 -
unsloth/gemma-3-270m-it
Text Generation • 0.3B • Updated • 24k • 22 -
unsloth/gemma-3-270m-it-unsloth-bnb-4bit
Text Generation • 0.3B • Updated • 14.1k • 5
Google Gemma 3n models, all versions including Dynamic GGUF, 4-bit, 16-bit and formats!
-
unsloth/gemma-3n-E4B-it-GGUF
Image-Text-to-Text • 7B • Updated • 19.7k • 186 -
unsloth/gemma-3n-E2B-it-GGUF
Image-Text-to-Text • 4B • Updated • 23.4k • 57 -
unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit
Image-Text-to-Text • 8B • Updated • 18.5k • 9 -
unsloth/gemma-3n-E4B-unsloth-bnb-4bit
Image-Text-to-Text • 8B • Updated • 1.04k • 4
Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes
Deepseek-V3-0324 and V3 - available in original, and Dynamic GGUF formats, with support for 2-8-bit quantized versions.
A collection of Mistral's new Small 3.2 and 3 models including GGUF, 4-bit and more!
-
unsloth/Mistral-Small-3.2-24B-Instruct-2506-GGUF
Image-Text-to-Text • 24B • Updated • 48.3k • 153 -
unsloth/Mistral-Small-3.2-24B-Instruct-2506
Image-Text-to-Text • 24B • Updated • 2.57k • • 11 -
unsloth/Mistral-Small-3.2-24B-Instruct-2506-FP8
Image-Text-to-Text • Updated • 205 • 6 -
unsloth/Mistral-Small-3.2-24B-Instruct-2506-unsloth-bnb-4bit
Image-Text-to-Text • 25B • Updated • 2.56k • 12
Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions.
Qwen's reasoning models including QwQ (32B) & QVQ (72B) in formats: GGUF, dynamic 4-bit and 16-bit original versions.
Meta's Llama 3.2 vision models 11B and 90B. Include 4-bit bnb and original versions.
-
unsloth/Llama-3.2-11B-Vision-Instruct
Image-to-Text • 11B • Updated • 27.5k • 88 -
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-to-Text • 11B • Updated • 6.73k • 80 -
unsloth/Llama-3.2-11B-Vision
Image-to-Text • 11B • Updated • 495 • 34 -
unsloth/Llama-3.2-11B-Vision-bnb-4bit
Image-to-Text • 11B • Updated • 189 • 16
Meta's Llama 3.1 models including 8B, 70B, 405B. Includes 4-bit bnb and original versions.
-
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
Text Generation • 8B • Updated • 242k • 92 -
unsloth/Llama-3.1-8B-Instruct-unsloth-bnb-4bit
Text Generation • 8B • Updated • 44.9k • 4 -
unsloth/Meta-Llama-3.1-8B-Instruct-unsloth-bnb-4bit
Text Generation • 8B • Updated • 149k • 4 -
unsloth/Meta-Llama-3.1-8B-bnb-4bit
Text Generation • 8B • Updated • 49.7k • 109
Native bitsandbytes 4bit pre quantized models
-
unsloth/Llama-3.2-3B-bnb-4bit
Text Generation • 3B • Updated • 21.4k • 21 -
unsloth/Meta-Llama-3.1-8B-bnb-4bit
Text Generation • 8B • Updated • 49.7k • 109 -
unsloth/llama-3-8b-Instruct-bnb-4bit
Text Generation • 8B • Updated • 54k • 133 -
unsloth/gemma-2-9b-bnb-4bit
Text Generation • 10B • Updated • 8.16k • 31
Find GGUFs and other variants of diffusion based Qwen-Image and FLUX models.
OpenAI's gpt-oss-20b and gpt-oss-120b is here! The powerful open models are available in GGUF, original & 4-bit formats.
-
unsloth/gpt-oss-20b-GGUF
Text Generation • 21B • Updated • 126k • 554 -
unsloth/gpt-oss-120b-GGUF
Text Generation • 117B • Updated • 81.6k • 201 -
unsloth/gpt-oss-20b-unsloth-bnb-4bit
Text Generation • 21B • Updated • 146k • 35 -
unsloth/gpt-oss-120b-unsloth-bnb-4bit
Text Generation • 117B • Updated • 20.1k • 12
Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants.
DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.
-
unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
Text Generation • 8B • Updated • 50.4k • 367 -
unsloth/DeepSeek-R1-0528-GGUF
Text Generation • 671B • Updated • 3.64k • 193 -
unsloth/DeepSeek-R1-0528-Qwen3-8B-unsloth-bnb-4bit
Text Generation • 8B • Updated • 6.36k • 13 -
unsloth/DeepSeek-R1-0528
Text Generation • 685B • Updated • 23 • 15
The Qwen3-Coder models deliver SOTA advancements in agentic coding and code tasks. Includes Qwen3-Coder-480B-A35B.
-
unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF
Text Generation • 31B • Updated • 107k • 425 -
unsloth/Qwen3-Coder-30B-A3B-Instruct-1M-GGUF
Text Generation • 31B • Updated • 12.8k • 139 -
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF
Text Generation • 480B • Updated • 3.52k • 166 -
unsloth/Qwen3-Coder-480B-A35B-Instruct-1M-GGUF
Text Generation • 480B • Updated • 1.72k • 42
IBM's new Granite-4.0 models! Run Dynamic GGUFs or fine-tune with Unsloth.
Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth!
-
unsloth/Llama-4-Scout-17B-16E-Instruct-GGUF
Image-to-Text • 108B • Updated • 21.8k • 131 -
unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF
Image-to-Text • 401B • Updated • 4.98k • 43 -
unsloth/Llama-4-Scout-17B-16E-Instruct
Image-to-Text • 109B • Updated • 618 • 56 -
unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-bnb-4bit
Image-to-Text • 112B • Updated • 1.39k • 80
Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit
-
unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
Text Generation • 5B • Updated • 37.7k • 36 -
unsloth/DeepSeek-R1-Distill-Qwen-14B-unsloth-bnb-4bit
Text Generation • 15B • Updated • 2.83k • 30 -
unsloth/DeepSeek-R1-Distill-Qwen-7B-unsloth-bnb-4bit
Text Generation • 5B • Updated • 3.24k • 24 -
unsloth/gemma-3-12b-it-unsloth-bnb-4bit
Image-to-Text • 12B • Updated • 40.4k • 24
A collection of 4-bit, Dynamic 4-bit and 16-bit voice models including Sesame-CSM, OpenAI's Whisper, Orpheus. Fine-tune them with Unsloth now!
Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions.
-
unsloth/Llama-3.2-1B-Instruct-GGUF
Text Generation • 1B • Updated • 62.7k • 52 -
unsloth/Llama-3.2-1B-Instruct
Text Generation • 1B • Updated • 89.9k • 87 -
unsloth/Llama-3.2-1B-Instruct-unsloth-bnb-4bit
Text Generation • 0.8B • Updated • 55.7k • 4 -
unsloth/Llama-3.2-1B-Instruct-bnb-4bit
Text Generation • 1B • Updated • 22.5k • 22
All versions of Qwen2.5-VL including the new 32B version and 4-bit, 16-bit and more!
-
unsloth/Qwen2.5-VL-3B-Instruct-GGUF
Image-Text-to-Text • 3B • Updated • 12.5k • 19 -
unsloth/Qwen2.5-VL-7B-Instruct-GGUF
Image-Text-to-Text • 8B • Updated • 74.5k • 137 -
unsloth/Qwen2.5-VL-32B-Instruct-GGUF
Image-Text-to-Text • 33B • Updated • 566 • 7 -
unsloth/Qwen2.5-VL-72B-Instruct-GGUF
Image-Text-to-Text • 73B • Updated • 1.03k • 7
Collection of the most popular vision models including Llama 3.2, LlaVa, Qwen2 VL, Pixtral, PaliGemma and more!
-
unsloth/Llama-3.2-11B-Vision-Instruct
Image-to-Text • 11B • Updated • 27.5k • 88 -
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-to-Text • 11B • Updated • 6.73k • 80 -
unsloth/Llama-3.2-11B-Vision-Instruct-unsloth-bnb-4bit
Image-to-Text • 11B • Updated • 4.71k • 28 -
unsloth/Qwen2-VL-7B-Instruct-bnb-4bit
Image-Text-to-Text • 9B • Updated • 2.06k • 6
Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats.
-
unsloth/Llama-3.2-3B-Instruct-bnb-4bit
Text Generation • 3B • Updated • 28.8k • 33 -
unsloth/Llama-3.2-1B-Instruct-bnb-4bit
Text Generation • 1B • Updated • 22.5k • 22 -
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-to-Text • 11B • Updated • 6.73k • 80 -
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
Text Generation • 8B • Updated • 242k • 92
New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance.
Find GGUFs and other variants of diffusion based Qwen-Image and FLUX models.
Qwen's new multimodal vision models in GGUF, safetensor, and dynamic Unsloth formats.
-
unsloth/Qwen3-VL-30B-A3B-Instruct-GGUF
Image-Text-to-Text • 31B • Updated • 164k • 76 -
unsloth/Qwen3-VL-30B-A3B-Thinking-GGUF
Image-Text-to-Text • 31B • Updated • 25k • 32 -
unsloth/Qwen3-VL-4B-Instruct-GGUF
Image-Text-to-Text • 4B • Updated • 21.4k • 36 -
unsloth/Qwen3-VL-4B-Thinking-GGUF
Image-Text-to-Text • 4B • Updated • 5.1k • 19
OpenAI's gpt-oss-20b and gpt-oss-120b is here! The powerful open models are available in GGUF, original & 4-bit formats.
-
unsloth/gpt-oss-20b-GGUF
Text Generation • 21B • Updated • 126k • 554 -
unsloth/gpt-oss-120b-GGUF
Text Generation • 117B • Updated • 81.6k • 201 -
unsloth/gpt-oss-20b-unsloth-bnb-4bit
Text Generation • 21B • Updated • 146k • 35 -
unsloth/gpt-oss-120b-unsloth-bnb-4bit
Text Generation • 117B • Updated • 20.1k • 12
Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes.
Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants.
DeepSeek's new 3.1 update to their V3 models!
DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.
-
unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
Text Generation • 8B • Updated • 50.4k • 367 -
unsloth/DeepSeek-R1-0528-GGUF
Text Generation • 671B • Updated • 3.64k • 193 -
unsloth/DeepSeek-R1-0528-Qwen3-8B-unsloth-bnb-4bit
Text Generation • 8B • Updated • 6.36k • 13 -
unsloth/DeepSeek-R1-0528
Text Generation • 685B • Updated • 23 • 15
Run or fine-tune embedding models with Unsloth.
-
unsloth/embeddinggemma-300m
Sentence Similarity • 0.3B • Updated • 9.42k • • 8 -
unsloth/embeddinggemma-300m-GGUF
Sentence Similarity • 0.3B • Updated • 4.92k • 48 -
unsloth/Qwen3-Embedding-0.6B
Feature Extraction • 0.6B • Updated • 935 • 3 -
unsloth/Qwen3-Embedding-4B
Feature Extraction • 4B • Updated • 696 • 1
The Qwen3-Coder models deliver SOTA advancements in agentic coding and code tasks. Includes Qwen3-Coder-480B-A35B.
-
unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF
Text Generation • 31B • Updated • 107k • 425 -
unsloth/Qwen3-Coder-30B-A3B-Instruct-1M-GGUF
Text Generation • 31B • Updated • 12.8k • 139 -
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF
Text Generation • 480B • Updated • 3.52k • 166 -
unsloth/Qwen3-Coder-480B-A35B-Instruct-1M-GGUF
Text Generation • 480B • Updated • 1.72k • 42
All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats.
-
unsloth/gemma-3-270m-it-GGUF
Text Generation • 0.3B • Updated • 20.8k • 147 -
unsloth/gemma-3-270m-it-qat-GGUF
Text Generation • 0.3B • Updated • 5.63k • 11 -
unsloth/gemma-3-270m-it
Text Generation • 0.3B • Updated • 24k • 22 -
unsloth/gemma-3-270m-it-unsloth-bnb-4bit
Text Generation • 0.3B • Updated • 14.1k • 5
IBM's new Granite-4.0 models! Run Dynamic GGUFs or fine-tune with Unsloth.
Google Gemma 3n models, all versions including Dynamic GGUF, 4-bit, 16-bit and formats!
-
unsloth/gemma-3n-E4B-it-GGUF
Image-Text-to-Text • 7B • Updated • 19.7k • 186 -
unsloth/gemma-3n-E2B-it-GGUF
Image-Text-to-Text • 4B • Updated • 23.4k • 57 -
unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit
Image-Text-to-Text • 8B • Updated • 18.5k • 9 -
unsloth/gemma-3n-E4B-unsloth-bnb-4bit
Image-Text-to-Text • 8B • Updated • 1.04k • 4
Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth!
-
unsloth/Llama-4-Scout-17B-16E-Instruct-GGUF
Image-to-Text • 108B • Updated • 21.8k • 131 -
unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF
Image-to-Text • 401B • Updated • 4.98k • 43 -
unsloth/Llama-4-Scout-17B-16E-Instruct
Image-to-Text • 109B • Updated • 618 • 56 -
unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-bnb-4bit
Image-to-Text • 112B • Updated • 1.39k • 80
Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes
Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit
-
unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
Text Generation • 5B • Updated • 37.7k • 36 -
unsloth/DeepSeek-R1-Distill-Qwen-14B-unsloth-bnb-4bit
Text Generation • 15B • Updated • 2.83k • 30 -
unsloth/DeepSeek-R1-Distill-Qwen-7B-unsloth-bnb-4bit
Text Generation • 5B • Updated • 3.24k • 24 -
unsloth/gemma-3-12b-it-unsloth-bnb-4bit
Image-to-Text • 12B • Updated • 40.4k • 24
Deepseek-V3-0324 and V3 - available in original, and Dynamic GGUF formats, with support for 2-8-bit quantized versions.
A collection of 4-bit, Dynamic 4-bit and 16-bit voice models including Sesame-CSM, OpenAI's Whisper, Orpheus. Fine-tune them with Unsloth now!
A collection of Mistral's new Small 3.2 and 3 models including GGUF, 4-bit and more!
-
unsloth/Mistral-Small-3.2-24B-Instruct-2506-GGUF
Image-Text-to-Text • 24B • Updated • 48.3k • 153 -
unsloth/Mistral-Small-3.2-24B-Instruct-2506
Image-Text-to-Text • 24B • Updated • 2.57k • • 11 -
unsloth/Mistral-Small-3.2-24B-Instruct-2506-FP8
Image-Text-to-Text • Updated • 205 • 6 -
unsloth/Mistral-Small-3.2-24B-Instruct-2506-unsloth-bnb-4bit
Image-Text-to-Text • 25B • Updated • 2.56k • 12
Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions.
-
unsloth/Llama-3.2-1B-Instruct-GGUF
Text Generation • 1B • Updated • 62.7k • 52 -
unsloth/Llama-3.2-1B-Instruct
Text Generation • 1B • Updated • 89.9k • 87 -
unsloth/Llama-3.2-1B-Instruct-unsloth-bnb-4bit
Text Generation • 0.8B • Updated • 55.7k • 4 -
unsloth/Llama-3.2-1B-Instruct-bnb-4bit
Text Generation • 1B • Updated • 22.5k • 22
Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions.
All versions of Qwen2.5-VL including the new 32B version and 4-bit, 16-bit and more!
-
unsloth/Qwen2.5-VL-3B-Instruct-GGUF
Image-Text-to-Text • 3B • Updated • 12.5k • 19 -
unsloth/Qwen2.5-VL-7B-Instruct-GGUF
Image-Text-to-Text • 8B • Updated • 74.5k • 137 -
unsloth/Qwen2.5-VL-32B-Instruct-GGUF
Image-Text-to-Text • 33B • Updated • 566 • 7 -
unsloth/Qwen2.5-VL-72B-Instruct-GGUF
Image-Text-to-Text • 73B • Updated • 1.03k • 7
Qwen's reasoning models including QwQ (32B) & QVQ (72B) in formats: GGUF, dynamic 4-bit and 16-bit original versions.
Collection of the most popular vision models including Llama 3.2, LlaVa, Qwen2 VL, Pixtral, PaliGemma and more!
-
unsloth/Llama-3.2-11B-Vision-Instruct
Image-to-Text • 11B • Updated • 27.5k • 88 -
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-to-Text • 11B • Updated • 6.73k • 80 -
unsloth/Llama-3.2-11B-Vision-Instruct-unsloth-bnb-4bit
Image-to-Text • 11B • Updated • 4.71k • 28 -
unsloth/Qwen2-VL-7B-Instruct-bnb-4bit
Image-Text-to-Text • 9B • Updated • 2.06k • 6
Meta's Llama 3.2 vision models 11B and 90B. Include 4-bit bnb and original versions.
-
unsloth/Llama-3.2-11B-Vision-Instruct
Image-to-Text • 11B • Updated • 27.5k • 88 -
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-to-Text • 11B • Updated • 6.73k • 80 -
unsloth/Llama-3.2-11B-Vision
Image-to-Text • 11B • Updated • 495 • 34 -
unsloth/Llama-3.2-11B-Vision-bnb-4bit
Image-to-Text • 11B • Updated • 189 • 16
Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats.
Meta's Llama 3.1 models including 8B, 70B, 405B. Includes 4-bit bnb and original versions.
-
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
Text Generation • 8B • Updated • 242k • 92 -
unsloth/Llama-3.1-8B-Instruct-unsloth-bnb-4bit
Text Generation • 8B • Updated • 44.9k • 4 -
unsloth/Meta-Llama-3.1-8B-Instruct-unsloth-bnb-4bit
Text Generation • 8B • Updated • 149k • 4 -
unsloth/Meta-Llama-3.1-8B-bnb-4bit
Text Generation • 8B • Updated • 49.7k • 109
Native bitsandbytes 4bit pre quantized models
-
unsloth/Llama-3.2-3B-bnb-4bit
Text Generation • 3B • Updated • 21.4k • 21 -
unsloth/Meta-Llama-3.1-8B-bnb-4bit
Text Generation • 8B • Updated • 49.7k • 109 -
unsloth/llama-3-8b-Instruct-bnb-4bit
Text Generation • 8B • Updated • 54k • 133 -
unsloth/gemma-2-9b-bnb-4bit
Text Generation • 10B • Updated • 8.16k • 31
-
unsloth/Llama-3.2-3B-Instruct-bnb-4bit
Text Generation • 3B • Updated • 28.8k • 33 -
unsloth/Llama-3.2-1B-Instruct-bnb-4bit
Text Generation • 1B • Updated • 22.5k • 22 -
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-to-Text • 11B • Updated • 6.73k • 80 -
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
Text Generation • 8B • Updated • 242k • 92