jason500's Collections
MMLM
Updated Mar 24
allenai/Molmo-7B-O-0924 • Image-Text-to-Text • 8B • Updated Oct 9 • 1.62k • 162
allenai/Molmo-7B-D-0924 • Image-Text-to-Text • 8B • Updated 8 days ago • 37.8k • 559
zai-org/cogvlm2-llama3-caption • Video-Text-to-Text • 13B • Updated May 14 • 553 • 108
mistralai/Pixtral-12B-Base-2409 • Updated Jul 28 • 33 • 105
mistralai/Pixtral-12B-2409 • Updated Jul 28 • 6.23k • 674
zai-org/glm-4v-9b • 14B • Updated Mar 3 • 103k • 264
OpenGVLab/InternVL-Chat-V1-2-SFT-Data • Viewer • Updated Sep 20, 2024 • 573k • 1.02k • 29
weic22/InstructSeg • 3B • Updated Dec 19, 2024 • 8 • 3
mistralai/Mistral-Small-3.1-24B-Instruct-2503 • 24B • Updated 1 day ago • 84.6k • 1.33k