Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ZKong 's Collections
PyWheels
pose
dataset
Segment
hunyuan-video
Z-Image
tts
ocr
VL
qwen image
upscale
vae
wan2.2
qwen
sound
flux-kontext
image-process
prompt
面部AI
encoder
video
translate
motionCapture
flux
3D
image
audio

audio

updated Jul 16
Upvote
-

  • google-t5/t5-base

    Translation • 0.2B • Updated Feb 14, 2024 • 2.03M • • 760

  • stabilityai/stable-audio-open-1.0

    Text-to-Audio • Updated Jun 19 • 37.2k • 1.37k

  • Kijai/MMAudio_safetensors

    Updated Dec 11, 2024 • 64

  • nvidia/bigvgan_v2_44khz_128band_512x

    Audio-to-Audio • Updated Sep 5, 2024 • 314k • 63

  • hexgrad/Kokoro-82M

    Text-to-Speech • Updated Apr 10 • 3.22M • • 5.47k

  • mistralai/Voxtral-Mini-3B-2507

    5B • Updated Jul 28 • 476k • 602

  • mistralai/Voxtral-Small-24B-2507

    Audio-Text-to-Text • 24B • Updated 7 days ago • 8.91k • 442
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs