Nanbeige4.1-3B-GGUF
This model was converted to GGUF format from Nanbeige/Nanbeige4.1-3B using GGUF Forge.
Quants
The following quants are available: Q2_K, Q3_K_M, Q3_K_S, Q3_K_L, Q4_K_S, Q4_K_M, Q4_0, Q5_K_S, Q5_K_M, Q6_K, Q8_0, Q5_0
Ollama Support
Full Ollama support is provided by merging any sharded GGUF output into a single file after quantization.
Conversion Stats
| Metric | Value |
|---|---|
| Job ID | 10e7461c-b7b7-4cfd-bafe-4c1e9213fefb |
| GGUF Forge Version | v6.2 |
| Total Time | 36.5min |
| Avg Time per Quant | 1.0min |
Step Breakdown
- Download: 1.2min
- FP16 Conversion: 47.1s
- Quantization: 34.4min
🚀 Convert Your Own Models
Want to convert more models to GGUF?
👉 gguforge.com — Free hosted GGUF conversion service. Login with HuggingFace and request conversions instantly!
Links
- 🌐 Free Hosted Service: gguforge.com
- 🛠️ Self-host GGUF Forge: GitHub
- 📦 llama.cpp (quantization engine): GitHub
- 💬 Community & Support: Discord
Converted automatically by GGUF Forge v6.2
- Downloads last month
- 472
Hardware compatibility
Log In
to add your hardware
2-bit
3-bit
4-bit
5-bit
6-bit
8-bit
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support