Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs Paper • 2603.16932 • Published 10 days ago • 53
BBQ-to-Image: Numeric Bounding Box and Qolor Control in Large-Scale Text-to-Image Models Paper • 2602.20672 • Published 29 days ago • 9
Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions Paper • 2511.06876 • Published Nov 10, 2025 • 28