-
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
Paper • 2403.16422 • Published • 1 -
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models
Paper • 2403.02246 • Published • 1 -
ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration
Paper • 2504.08591 • Published • 18 -
Minthy/ToriiGate-v0.4-7B
Image-Text-to-Text • 8B • Updated • 2.24k • 81
Sam Flin PRO
sflindrs
AI & ML interests
None yet
Recent Activity
liked a Space 17 days ago
jbilcke-hf/ai-comic-factory liked a Space 17 days ago
baidu/ERNIE-Image-Turbo liked a Space 17 days ago
black-forest-labs/FLUX.2-devOrganizations
None yet
Captioning
- Runtime errorAgents17
CogVLMv1 Captionner
⚙17Generate a detailed image description
-
sdasd112132/Vision-8B-MiniCPM-2_5-Uncensored-and-Detailed-4bit
Visual Question Answering • 9B • Updated • 23 • 32 - RunningFeatured561
Vision Arena (Testing VLMs side-by-side)
🖼561Explore Vision Arena visual AI demo online
-
dphn/dolphin-vision-72b
Text Generation • 73B • Updated • 50 • 134
Favorites
-
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
Paper • 2403.16422 • Published • 1 -
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models
Paper • 2403.02246 • Published • 1 -
ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration
Paper • 2504.08591 • Published • 18 -
Minthy/ToriiGate-v0.4-7B
Image-Text-to-Text • 8B • Updated • 2.24k • 81
flux
Captioning
- Runtime errorAgents17
CogVLMv1 Captionner
⚙17Generate a detailed image description
-
sdasd112132/Vision-8B-MiniCPM-2_5-Uncensored-and-Detailed-4bit
Visual Question Answering • 9B • Updated • 23 • 32 - RunningFeatured561
Vision Arena (Testing VLMs side-by-side)
🖼561Explore Vision Arena visual AI demo online
-
dphn/dolphin-vision-72b
Text Generation • 73B • Updated • 50 • 134