Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
1
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
Reset Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
sentence-transformers
Safetensors
ONNX
GGUF
Transformers.js
MLX
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 12
Inference Providers
Groq
Novita
Nebius AI
Cerebras
SambaNova
Nscale
fal
Hyperbolic
+ 11
Apply filters
Models
9,032
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
image-to-text, transformers
Clear all
stepfun-ai/GELab-Zero-4B-preview
Image-to-Text
•
4B
•
Updated
7 days ago
•
703
•
92
datalab-to/chandra
Image-to-Text
•
9B
•
Updated
Oct 21
•
89.9k
•
407
Salesforce/blip-image-captioning-base
Image-to-Text
•
Updated
Feb 3
•
2.38M
•
822
lightonai/LightOnOCR-1B-1025
Image-to-Text
•
Updated
14 days ago
•
14.6k
•
179
allenai/olmOCR-2-7B-1025-FP8
Image-to-Text
•
8B
•
Updated
Oct 22
•
601k
•
153
XiaomiMiMo/MiMo-Embodied-7B
Image-to-Text
•
8B
•
Updated
17 days ago
•
988
•
47
thesby/Qwen3-VL-8B-NSFW-Caption-V4.5
Image-to-Text
•
9B
•
Updated
about 1 month ago
•
15.9k
•
43
VLM2Vec/VLM2Vec-V2.0
Image-to-Text
•
Updated
Jul 13
•
10.2k
•
19
allenai/olmOCR-2-7B-1025
Image-to-Text
•
8B
•
Updated
Oct 22
•
32.9k
•
88
shkb/MemeLeak
Image-to-Text
•
9B
•
Updated
5 days ago
•
101
•
2
prithivMLmods/LightOnOCR-1B-1025-AIO-GGUF
Image-to-Text
•
0.8B
•
Updated
about 23 hours ago
•
91
•
2
Salesforce/blip-image-captioning-large
Image-to-Text
•
0.5B
•
Updated
Feb 3
•
1.08M
•
1.44k
team-lucid/trocr-small-korean
Image-to-Text
•
54.5M
•
Updated
Jul 1, 2023
•
543
•
18
SawanStack/gpt2-image-captioning-onnx
Image-to-Text
•
Updated
Nov 13, 2023
•
8
•
1
OleehyO/TexTeller
Image-to-Text
•
0.3B
•
Updated
Jun 22, 2024
•
7.36k
•
38
breezedeus/pix2text-mfr
Image-to-Text
•
Updated
May 5, 2024
•
164k
•
47
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-to-Text
•
6B
•
Updated
Dec 10, 2024
•
517k
•
80
unsloth/Llama-3.2-11B-Vision-Instruct
Image-to-Text
•
11B
•
Updated
Dec 10, 2024
•
21k
•
86
Vikhrmodels/Vikhr-2-VL-2b-Instruct-experimental
Image-to-Text
•
2B
•
Updated
Nov 3, 2024
•
29
•
20
HuggingFaceTB/SmolVLM-256M-Base
Image-to-Text
•
0.3B
•
Updated
Jan 20
•
6.87k
•
18
enalis/scold
Image-to-Text
•
Updated
Oct 29
•
51
•
7
sbintuitions/sarashina2-vision-8b
Image-to-Text
•
8B
•
Updated
Mar 27
•
4.99k
•
10
infly/INF-AZ-7B-0524
Image-to-Text
•
8B
•
Updated
May 25
•
33
•
3
helizac/dots.ocr-4bit
Image-to-Text
•
2B
•
Updated
Aug 6
•
509
•
28
allenai/olmOCR-7B-0825
Image-to-Text
•
8B
•
Updated
Oct 22
•
1.11k
•
60
mradermacher/dunhuang-qwen2.5-vl-7b-GGUF
Image-to-Text
•
8B
•
Updated
Sep 28
•
226
•
1
Disty0/Qwen3-VL-32B-Instruct-SDNQ-uint4-svd-r32
Image-to-Text
•
18B
•
Updated
Oct 29
•
39
•
1
sbintuitions/sarashina2.2-vision-3b
Image-to-Text
•
4B
•
Updated
18 days ago
•
2.4k
•
13
Float16-cloud/typhoon-ocr1.5-2b-int8
Image-to-Text
•
Updated
15 days ago
•
41
•
2
suv11235/olmOCR-7B-grpo-v3
Image-to-Text
•
8B
•
Updated
7 days ago
•
17
•
1
Previous
1
2
3
...
100
Next