Efficient MoE-based LLM Collection Mixture-of-Experts Large Language Models with Advanced Quantization • 4 items • Updated 1 day ago • 18
Efficient Large Vision-Language Model Collection ERGO: LVLM trained with RL on efficiency objectives; https://github.com/nota-github/ERGO • 3 items • Updated 1 day ago • 20
Shortened LLaMA: A Simple Depth Pruning for Large Language Models Paper • 2402.02834 • Published Feb 5, 2024 • 17
EdgeFusion: On-Device Text-to-Image Generation Paper • 2404.11925 • Published Apr 18, 2024 • 23
EdgeFusion: On-Device Text-to-Image Generation Paper • 2404.11925 • Published Apr 18, 2024 • 23
Automatic Neural Network Pruning that Efficiently Preserves the Model Accuracy Paper • 2111.09635 • Published Nov 18, 2021 • 1
Running Featured 114 Compressed Stable Diffusion 🌟 114 Compare image generation results from original and compressed AI models
On Architectural Compression of Text-to-Image Diffusion Models Paper • 2305.15798 • Published May 25, 2023 • 5