Nanbeige/Nanbeige4.1-3B
arxiv: 2602.13367
License: apache-2.0
Chain-of-Thought or Chain-of-Mimicry? The Over-SFT problem in Nanbeige 4.1-3B aka "I_Should_X"
#26 by srs6901 - opened 1 day ago
Discussion
srs6901
1 day ago
This comment has been hidden (marked as Resolved)
srs6901
1 day ago
This comment has been hidden (marked as Resolved)