# Nanbeige4.1-3B-GGUF

This model was converted to GGUF format from [Nanbeige/Nanbeige4.1-3B](https://huggingface.co/Nanbeige/Nanbeige4.1-3B) using GGUF Forge.

## Quants

The following quants are available: `Q2_K`, `Q3_K_S`, `Q3_K_M`, `Q3_K_L`, `Q4_0`, `Q4_K_S`, `Q4_K_M`, `Q5_0`, `Q5_K_S`, `Q5_K_M`, `Q6_K`, `Q8_0`.
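As a minimal usage sketch, you can fetch a single quant and run it with llama.cpp. The filename pattern below is an assumption (GGUF repos usually follow `<model>-<quant>.gguf`); check this repo's file list for the exact names.

```bash
# Download only the Q4_K_M quant from this repo
# (the "*Q4_K_M*.gguf" pattern is an assumption; verify against the repo's files).
huggingface-cli download Akicou/Nanbeige4.1-3B-GGUF \
  --include "*Q4_K_M*.gguf" --local-dir ./models

# Run it with llama.cpp's CLI (llama-cli ships with recent llama.cpp builds).
llama-cli -m ./models/Nanbeige4.1-3B-Q4_K_M.gguf -p "Hello," -n 64
```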

## Ollama Support

All quants ship as single (unsharded) GGUF files: any sharded output is merged into one file after quantization, so each quant can be loaded by Ollama directly.
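A minimal sketch of loading one of these files in Ollama; the quant filename and the model name `nanbeige4.1-3b` are placeholders, not names defined by this repo.

```bash
# Point a Modelfile at a downloaded quant (the path is a placeholder).
cat > Modelfile <<'EOF'
FROM ./Nanbeige4.1-3B-Q4_K_M.gguf
EOF

# Register the model with Ollama and run it.
ollama create nanbeige4.1-3b -f Modelfile
ollama run nanbeige4.1-3b "Hello!"
```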

## Conversion Stats

| Metric | Value |
|---|---|
| Job ID | 10e7461c-b7b7-4cfd-bafe-4c1e9213fefb |
| GGUF Forge Version | v6.2 |
| Total Time | 36.5 min |
| Avg Time per Quant | 1.0 min |

### Step Breakdown

- Download: 1.2 min
- FP16 Conversion: 47.1 s
- Quantization: 34.4 min
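These three stages match the standard llama.cpp conversion pipeline. A hedged sketch of the equivalent manual steps follows; paths and the quant choice are illustrative, and GGUF Forge's actual invocation may differ.

```bash
# 1. Download the source model from Hugging Face (the "Download" stage).
huggingface-cli download Nanbeige/Nanbeige4.1-3B --local-dir ./Nanbeige4.1-3B

# 2. Convert the HF checkpoint to an FP16 GGUF with llama.cpp's converter
#    (the "FP16 Conversion" stage).
python convert_hf_to_gguf.py ./Nanbeige4.1-3B \
  --outtype f16 --outfile ./Nanbeige4.1-3B-f16.gguf

# 3. Quantize the FP16 file, repeated once per quant type
#    (the "Quantization" stage; Q4_K_M shown as one example).
llama-quantize ./Nanbeige4.1-3B-f16.gguf ./Nanbeige4.1-3B-Q4_K_M.gguf Q4_K_M
```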

## 🚀 Convert Your Own Models

Want to convert more models to GGUF?

👉 [gguforge.com](https://gguforge.com) is a free hosted GGUF conversion service. Log in with Hugging Face and request conversions instantly!

## Links

- 🌐 Free Hosted Service: [gguforge.com](https://gguforge.com)
- 🛠️ Self-host GGUF Forge: GitHub
- 📦 llama.cpp (quantization engine): GitHub
- 💬 Community & Support: Discord

*Converted automatically by GGUF Forge v6.2*
