R
Romanoffalex
AI & ML interests
None yet
Organizations
None yet
CV models
-
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text β’ Updated β’ 153k β’ 477 -
baidu/ERNIE-4.5-VL-28B-A3B-Base-PT
Image-Text-to-Text β’ 29B β’ Updated β’ 16 β’ 37 -
facebook/dinov3-vit7b16-pretrain-lvd1689m
Image Feature Extraction β’ Updated β’ 25.5k β’ 215 -
baidu/ERNIE-4.5-VL-28B-A3B-Thinking
Image-Text-to-Text β’ 30B β’ Updated β’ 1.17k β’ 523
Upscalers
video gan
Graphic gan
- Running on ZeroFeatured935
OminiControl
π935Generate new images from a photo and text prompt
- Running on ZeroFeatured2.07k
PuLID-FLUX
π€2.07kGenerate custom images from text and a reference photo
- Running662
PR Puppet Sora
π662Generate AI videos from text prompts
-
genmo/mochi-1-preview
Text-to-Video β’ Updated β’ 8.93k β’ β’ 1.31k
llm ru
- Paused47
Saiga 13b Q4_1 llama.cpp Retrieval QA
π47Upload files and ask questions based on their content
-
Deci/DeciLM-7B
Text Generation β’ 7B β’ Updated β’ 1.75k β’ 226 - Running89
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks
π89Evaluate multilingual models using FineTasks
workflow
Audio
-
stabilityai/stable-audio-open-1.0
Text-to-Audio β’ Updated β’ 30.3k β’ 1.42k -
laion/emonet-face-binary
Preview β’ Updated β’ 90 β’ 3 -
laion/emonet-face-hq
Viewer β’ Updated β’ 2.5k β’ 214 β’ 2 - Running on A100236
Omnilingual ASR Media Transcription
π236Transcribe audio/video to text in many languages
3d
dataset
Llms alfa test
- Running on Zero1.12k
OOTDiffusion
π₯Ό1.12kHigh-quality virtual try-on ~ Your cyber fitting room
-
stabilityai/stable-diffusion-3-medium
Text-to-Image β’ Updated β’ 7.19k β’ β’ 4.92k -
rain1011/pyramid-flow-sd3
Text-to-Video β’ Updated β’ 837 -
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5
Text Generation β’ 50B β’ Updated β’ 67.2k β’ 227
Codex model
workflow
CV models
-
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text β’ Updated β’ 153k β’ 477 -
baidu/ERNIE-4.5-VL-28B-A3B-Base-PT
Image-Text-to-Text β’ 29B β’ Updated β’ 16 β’ 37 -
facebook/dinov3-vit7b16-pretrain-lvd1689m
Image Feature Extraction β’ Updated β’ 25.5k β’ 215 -
baidu/ERNIE-4.5-VL-28B-A3B-Thinking
Image-Text-to-Text β’ 30B β’ Updated β’ 1.17k β’ 523
Audio
-
stabilityai/stable-audio-open-1.0
Text-to-Audio β’ Updated β’ 30.3k β’ 1.42k -
laion/emonet-face-binary
Preview β’ Updated β’ 90 β’ 3 -
laion/emonet-face-hq
Viewer β’ Updated β’ 2.5k β’ 214 β’ 2 - Running on A100236
Omnilingual ASR Media Transcription
π236Transcribe audio/video to text in many languages
Upscalers
3d
video gan
dataset
Graphic gan
- Running on ZeroFeatured935
OminiControl
π935Generate new images from a photo and text prompt
- Running on ZeroFeatured2.07k
PuLID-FLUX
π€2.07kGenerate custom images from text and a reference photo
- Running662
PR Puppet Sora
π662Generate AI videos from text prompts
-
genmo/mochi-1-preview
Text-to-Video β’ Updated β’ 8.93k β’ β’ 1.31k
Llms alfa test
- Running on Zero1.12k
OOTDiffusion
π₯Ό1.12kHigh-quality virtual try-on ~ Your cyber fitting room
-
stabilityai/stable-diffusion-3-medium
Text-to-Image β’ Updated β’ 7.19k β’ β’ 4.92k -
rain1011/pyramid-flow-sd3
Text-to-Video β’ Updated β’ 837 -
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5
Text Generation β’ 50B β’ Updated β’ 67.2k β’ 227
llm ru
- Paused47
Saiga 13b Q4_1 llama.cpp Retrieval QA
π47Upload files and ask questions based on their content
-
Deci/DeciLM-7B
Text Generation β’ 7B β’ Updated β’ 1.75k β’ 226 - Running89
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks
π89Evaluate multilingual models using FineTasks