Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
AudioVisual-Caption
/
ASID-Captioner-3B
like
1
Follow
ASID-Caption
3
Image-Text-to-Text
Transformers
Safetensors
English
qwen2_5_omni
video-captioning
audiovisual
qwen2.5-omni
instruction-tuning
attribute-structured
quality-verified
conversational
arxiv:
2602.13013
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
ASID-Captioner-3B
Commit History
Update README.md
59a00fb
verified
lyhisme
commited on
about 9 hours ago
Update README.md
c0c0dcc
verified
lyhisme
commited on
about 9 hours ago
Update README.md
4b41a1b
verified
lyhisme
commited on
9 days ago
Update README.md
acdea8c
verified
lyhisme
commited on
11 days ago
Update README.md
a5824a3
verified
lyhisme
commited on
13 days ago
Update README.md
0b1dd39
verified
lyhisme
commited on
13 days ago
Upload folder using huggingface_hub
393feb7
verified
lyhisme
commited on
15 days ago
initial commit
6abb918
verified
lyhisme
commited on
15 days ago