Nanbeige4.1-3B-GGUF

This model was converted to GGUF format from Nanbeige/Nanbeige4.1-3B using GGUF Forge.

Quants

The following quants are available: Q2_K, Q3_K_S, Q3_K_M, Q3_K_L, Q4_0, Q4_K_S, Q4_K_M, Q5_0, Q5_K_S, Q5_K_M, Q6_K, Q8_0.

Ollama Support

Full Ollama support is provided by merging any sharded GGUF output into a single file after quantization.
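
As a rough illustration of using one of these single-file quants with Ollama, the sketch below picks a quant file from a repository file listing and builds a minimal Modelfile. The file-naming pattern (quant label embedded in the `.gguf` filename), the example filenames, and the helper names `pick_quant_file`, `build_modelfile`, and `download_quant` are assumptions for illustration, not something this card specifies. Check the repository's actual file list before relying on them.

```python
def pick_quant_file(filenames, quant):
    """Return the first .gguf filename whose name contains the quant label.

    Assumption: quant labels such as "Q4_K_M" appear verbatim in filenames.
    """
    wanted = quant.lower()
    for name in filenames:
        lower = name.lower()
        if lower.endswith(".gguf") and wanted in lower:
            return name
    raise ValueError(f"no .gguf file found for quant {quant!r}")


def build_modelfile(gguf_path):
    """Build a minimal Ollama Modelfile that points at a local GGUF file."""
    return f"FROM {gguf_path}\n"


def download_quant(repo_id, quant):
    """Download one quant from the Hub (needs network and huggingface_hub)."""
    # Imported lazily so the offline helpers above work without the package.
    from huggingface_hub import hf_hub_download, list_repo_files

    filename = pick_quant_file(list_repo_files(repo_id), quant)
    return hf_hub_download(repo_id=repo_id, filename=filename)


if __name__ == "__main__":
    # Hypothetical filenames, used only to show the selection logic offline.
    files = ["README.md", "model-Q4_K_M.gguf", "model-Q8_0.gguf"]
    print(pick_quant_file(files, "Q4_K_M"))
    print(build_modelfile("./model-Q4_K_M.gguf"), end="")
```

The generated Modelfile can then be registered with `ollama create <name> -f Modelfile`, where `<name>` is whatever local model name you choose.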

Conversion Stats

Metric	Value
Job ID	`10e7461c-b7b7-4cfd-bafe-4c1e9213fefb`
GGUF Forge Version	v6.2
Total Time	36.5 min
Avg Time per Quant	1.0 min

Step Breakdown

Download: 1.2 min
FP16 Conversion: 47.1 s
Quantization: 34.4 min
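
For readers who want to sanity-check these figures, the short sketch below adds up the three listed steps and divides the quantization time across the 12 quants named on this card. It is only arithmetic on the reported numbers. How GGUF Forge itself computes "Total Time" and "Avg Time per Quant" is not stated here, so any gap between these results and the reported values may simply reflect a different basis (for example, steps not listed in the breakdown).

```python
# Quant labels as listed on this card.
QUANTS = [
    "Q2_K", "Q3_K_S", "Q3_K_M", "Q3_K_L", "Q4_0", "Q4_K_S",
    "Q4_K_M", "Q5_0", "Q5_K_S", "Q5_K_M", "Q6_K", "Q8_0",
]

# Reported step durations, converted to minutes.
download_min = 1.2
fp16_min = 47.1 / 60  # reported in seconds
quant_min = 34.4
reported_total_min = 36.5

# Sum of the listed steps and the part of the total they do not cover.
steps_total_min = download_min + fp16_min + quant_min
unaccounted_min = reported_total_min - steps_total_min

# Quantization time spread evenly over the listed quants.
per_quant_min = quant_min / len(QUANTS)

print(f"sum of listed steps: {steps_total_min:.2f} min")
print(f"not covered by breakdown: {unaccounted_min * 60:.1f} s")
print(f"quantization time / {len(QUANTS)} quants: {per_quant_min:.2f} min")
```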

🚀 Convert Your Own Models

Want to convert more models to GGUF?

👉 gguforge.com: a free hosted GGUF conversion service. Log in with Hugging Face and request conversions instantly!

Links

🌐 Free Hosted Service: gguforge.com
🛠️ Self-host GGUF Forge: GitHub
📦 llama.cpp (quantization engine): GitHub
💬 Community & Support: Discord

Converted automatically by GGUF Forge v6.2

Downloads last month: 472

Format: GGUF
Model size: 4B params
Architecture: llama

Model tree for Akicou/Nanbeige4.1-3B-GGUF

Base model: Nanbeige/Nanbeige4-3B-Base
Finetuned: Nanbeige/Nanbeige4.1-3B
Quantized: this model (one of 41 quantizations)