Hugging Face
Models
9,538
Sort: Trending
Active filters:
image-to-text
stepfun-ai/GELab-Zero-4B-preview • Image-to-Text • 4B • Updated 6 days ago • 673 • 90
datalab-to/chandra • Image-to-Text • 9B • Updated Oct 21 • 89.3k • 406
lightonai/LightOnOCR-1B-1025 • Image-to-Text • Updated 13 days ago • 15.2k • 179
Salesforce/blip-image-captioning-base • Image-to-Text • Updated Feb 3 • 2.38M • 821
monkt/paddleocr-onnx • Image-to-Text • Updated Oct 7 • 24
nvidia/nemotron-ocr-v1 • Image-to-Text • Updated 27 days ago • 399 • 43
XiaomiMiMo/MiMo-Embodied-7B • Image-to-Text • 8B • Updated 16 days ago • 982 • 47
allenai/olmOCR-2-7B-1025-FP8 • Image-to-Text • 8B • Updated Oct 22 • 536k • 153
thesby/Qwen3-VL-8B-NSFW-Caption-V4.5 • Image-to-Text • 9B • Updated 30 days ago • 15.5k • 41
xtuner/llava-llama-3-8b-v1_1-gguf • Image-to-Text • 8B • Updated Apr 30, 2024 • 3.27k • 220
VLM2Vec/VLM2Vec-V2.0 • Image-to-Text • Updated Jul 13 • 10.1k • 19
allenai/olmOCR-2-7B-1025 • Image-to-Text • 8B • Updated Oct 22 • 31.8k • 88
shkb/MemeLeak • Image-to-Text • 9B • Updated 4 days ago • 100 • 2
team-lucid/trocr-small-korean • Image-to-Text • 54.5M • Updated Jul 1, 2023 • 563 • 18
deepghs/paddleocr • Image-to-Text • Updated 20 days ago • 12
SawanStack/gpt2-image-captioning-onnx • Image-to-Text • Updated Nov 13, 2023 • 8 • 1
OleehyO/TexTeller • Image-to-Text • 0.3B • Updated Jun 22, 2024 • 7.3k • 38
breezedeus/pix2text-mfr • Image-to-Text • Updated May 5, 2024 • 164k • 47
GnanaPrasath/ocr_tamil • Image-to-Text • Updated Feb 14, 2024 • 19
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit • Image-to-Text • 6B • Updated Dec 10, 2024 • 512k • 80
unsloth/Llama-3.2-11B-Vision-Instruct • Image-to-Text • 11B • Updated Dec 10, 2024 • 22.1k • 86
Vikhrmodels/Vikhr-2-VL-2b-Instruct-experimental • Image-to-Text • 2B • Updated Nov 3, 2024 • 30 • 20
HuggingFaceTB/SmolVLM-256M-Base • Image-to-Text • 0.3B • Updated Jan 20 • 7.8k • 18
enalis/scold • Image-to-Text • Updated Oct 29 • 49 • 7
sbintuitions/sarashina2-vision-8b • Image-to-Text • 8B • Updated Mar 27 • 7.9k • 10
infly/INF-AZ-7B-0524 • Image-to-Text • 8B • Updated May 25 • 31 • 3
MeissonFlow/Muddit • Image-to-Text • Updated Jul 30 • 5
helizac/dots.ocr-4bit • Image-to-Text • 2B • Updated Aug 6 • 515 • 28
allenai/olmOCR-7B-0825 • Image-to-Text • 8B • Updated Oct 22 • 1.1k • 60
mradermacher/dunhuang-qwen2.5-vl-7b-GGUF • Image-to-Text • 8B • Updated Sep 28 • 187 • 1