Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
internlm
/
CapRL-3B
like
45
Follow
Intern Large Models
851
Image-Text-to-Text
Transformers
Safetensors
internlm/CapRL-2M
English
qwen2_5_vl
image-to-text
multimodal
image caption
captioning
conversational
text-generation-inference
arxiv:
2509.22647
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
CapRL-3B
/
assets
16.5 MB
2 contributors
History:
2 commits
yuhangzang
Upload performance_update.png
eeb96d4
verified
about 2 months ago
comparison.png
Safe
3.41 MB
xet
Add files using upload-large-folder tool
2 months ago
info_caprl.png
Safe
3.9 MB
xet
Add files using upload-large-folder tool
2 months ago
info_caprl2.png
2.85 MB
xet
Add files using upload-large-folder tool
2 months ago
natural_caprl.png
Safe
4.21 MB
xet
Add files using upload-large-folder tool
2 months ago
performance.png
Safe
147 kB
xet
Add files using upload-large-folder tool
2 months ago
performance_update.png
Safe
140 kB
xet
Upload performance_update.png
about 2 months ago
teaser.png
Safe
1.88 MB
xet
Add files using upload-large-folder tool
2 months ago