Spaces:

learnmlf
/

Acfoley

Sleeping

App Files Files Community

Acfoley / README.md

ZJUCQR

Add hf_AC audio generation demo

e2bca25 9 days ago

preview code

raw

history blame contribute delete

1.84 kB

A newer version of the Gradio SDK is available: 6.1.0

Upgrade

metadata

title: hf_AC Audio Foley Generator
emoji: 🎵
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 5.42.0
app_file: app.py
pinned: false
license: mit

🎵 hf_AC Audio Foley Generator

A Gradio demo for generating synchronized audio from videos using the hf_AC (Audio-Conditioned Foley) model. This application allows you to upload a video and generate matching audio content based on text descriptions.

Features

Video-to-Audio Generation: Upload a video and generate synchronized audio
Text-Guided Generation: Use text prompts to describe the desired audio
Customizable Parameters: Adjust duration, CFG strength, and other generation parameters
Real-time Processing: Generate audio in real-time with GPU acceleration

How to Use

Load Model: The model will automatically load when you start the app
Upload Video: Choose a video file (MP4 format recommended)
Describe Audio: Write a text description of the audio you want to generate
Generate: Click the generate button and wait for the audio to be created
Download: Listen to and download the generated audio

Example Prompts

"Crackling fireplace with gentle flames"
"Ocean waves crashing on rocky shore"
"Busy city street with car horns and chatter"
"Forest ambience with bird songs and rustling leaves"
"Keyboard typing in a quiet office"

Model Information

This demo uses the hf_AC model, which is designed for audio-visual synchronization and generation. The model can generate high-quality audio that matches the visual content and text descriptions.

Technical Details

Framework: PyTorch, Gradio
Model: hf_AC (Audio-Conditioned Foley)
Audio Format: WAV, 44.1kHz
Video Support: MP4, various resolutions
Processing: GPU-accelerated when available