Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ZKong
's Collections
PyWheels
pose
dataset
Segment
hunyuan-video
Z-Image
tts
ocr
VL
qwen image
upscale
vae
wan2.2
qwen
sound
flux-kontext
image-process
prompt
面部AI
encoder
video
translate
motionCapture
flux
3D
image
audio
audio
updated
Jul 16
Upvote
-
google-t5/t5-base
Translation
•
0.2B
•
Updated
Feb 14, 2024
•
2.03M
•
•
760
stabilityai/stable-audio-open-1.0
Text-to-Audio
•
Updated
Jun 19
•
37.2k
•
1.37k
Kijai/MMAudio_safetensors
Updated
Dec 11, 2024
•
64
nvidia/bigvgan_v2_44khz_128band_512x
Audio-to-Audio
•
Updated
Sep 5, 2024
•
314k
•
63
hexgrad/Kokoro-82M
Text-to-Speech
•
Updated
Apr 10
•
3.22M
•
•
5.47k
mistralai/Voxtral-Mini-3B-2507
5B
•
Updated
Jul 28
•
476k
•
602
mistralai/Voxtral-Small-24B-2507
Audio-Text-to-Text
•
24B
•
Updated
7 days ago
•
8.91k
•
442
Upvote
-
Share collection
View history
Collection guide
Browse collections