---
title: hf_AC Audio Foley Generator
emoji: 🎵
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 5.42.0
app_file: app.py
pinned: false
license: mit
---
# 🎵 hf_AC Audio Foley Generator
A Gradio demo that generates synchronized audio for videos using the hf_AC (Audio-Conditioned Foley) model. Upload a video, describe the sound you want in text, and the app generates matching audio.
## Features
- **Video-to-Audio Generation**: Upload a video and generate synchronized audio
- **Text-Guided Generation**: Use text prompts to describe the desired audio
- **Customizable Parameters**: Adjust duration, CFG strength, and other generation parameters
- **GPU-Accelerated Processing**: Generation runs on a GPU when one is available
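
As a rough illustration of what the CFG strength parameter usually controls, here is a minimal sketch. It assumes "CFG" stands for classifier-free guidance, as it does in most generative models; this README does not confirm how hf_AC applies it. The function name `apply_cfg` and the plain-list inputs are hypothetical and used only for illustration.

```python
def apply_cfg(cond, uncond, cfg_strength):
    """Blend conditional and unconditional predictions (classifier-free guidance).

    Assumption: this is the standard CFG formula, not code taken from hf_AC.
    cfg_strength = 0 returns the unconditional prediction,
    cfg_strength = 1 returns the conditional prediction,
    larger values push the result further toward the text condition.
    """
    return [u + cfg_strength * (c - u) for c, u in zip(cond, uncond)]


# Toy predictions standing in for model outputs
conditional = [1.0, 2.0]
unconditional = [0.0, 1.0]
guided = apply_cfg(conditional, unconditional, cfg_strength=2.0)
```

Under this assumption, raising CFG strength makes the output follow the text prompt more closely, typically at some cost to variety.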
## How to Use
1. **Load Model**: The model loads automatically when the app starts
2. **Upload Video**: Choose a video file (MP4 format recommended)
3. **Describe Audio**: Write a text description of the audio you want to generate
4. **Generate**: Click the generate button and wait for the audio to be created
5. **Download**: Listen to the generated audio and download it
## Example Prompts
- "Crackling fireplace with gentle flames"
- "Ocean waves crashing on rocky shore"
- "Busy city street with car horns and chatter"
- "Forest ambience with bird songs and rustling leaves"
- "Keyboard typing in a quiet office"
## Model Information
This demo uses the hf_AC model, which is designed for audio-visual synchronization and generation. The model generates audio conditioned on both the visual content of the video and the text description.
## Technical Details
- **Framework**: PyTorch, Gradio
- **Model**: hf_AC (Audio-Conditioned Foley)
- **Audio Format**: WAV, 44.1kHz
- **Video Support**: MP4, various resolutions
- **Processing**: GPU-accelerated when available
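
To make the audio format above concrete, here is a minimal sketch of writing a 44.1 kHz WAV file with Python's standard `wave` module. It is not the app's actual export code: the helper name `write_wav`, the mono channel layout, and the 16-bit sample width are assumptions, and a sine tone stands in for generated audio.

```python
import io
import math
import struct
import wave

SAMPLE_RATE = 44_100  # 44.1 kHz, as listed under Audio Format


def write_wav(samples, sample_rate=SAMPLE_RATE):
    """Encode float samples in [-1.0, 1.0] as WAV bytes.

    Assumptions (not stated in this README): mono, 16-bit PCM.
    """
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)            # mono (assumed)
        wav.setsampwidth(2)            # 2 bytes per sample = 16-bit (assumed)
        wav.setframerate(sample_rate)
        # Clip to the valid range, then scale to signed 16-bit integers
        clipped = (max(-1.0, min(1.0, s)) for s in samples)
        frames = b"".join(struct.pack("<h", int(s * 32767)) for s in clipped)
        wav.writeframes(frames)
    return buffer.getvalue()


# One second of a 440 Hz tone as a stand-in for model output
tone = [0.5 * math.sin(2 * math.pi * 440 * n / SAMPLE_RATE)
        for n in range(SAMPLE_RATE)]
wav_bytes = write_wav(tone)
```

The resulting bytes can be written to a `.wav` file or read back with `wave.open` to confirm the sample rate and frame count.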