Room Vibe Check
Generate a vibe profile for your room from a photo
Everyone's room says something about them. The stack of unread books. The fairy lights. The gaming chair next to a yoga mat. What if AI could read that vibe and roast you for it?
Room Vibe Check does exactly that. Upload a photo of your room, pick Roast or Kind mode, and get a personality profile for your space.
Toggle between Roast mode (brutally funny) and Kind mode (warm and encouraging) for very different results from the same photo.
| Agent | Model | Size | Job |
|---|---|---|---|
| Room Analyzer | Florence-2-large | 0.8B | See what's in the room |
| Vibe Generator | Qwen2.5-3B-Instruct | 3.0B | Generate the vibe profile |
| Total | 3.8B |
Stage 1: Florence-2 runs a more-detailed captioning task and OCR on the photo. It describes the room and reads any visible text — posters, book titles, signs on the wall. This gives the vibe generator rich material to work with.
Stage 2: Qwen2.5-3B takes that description plus the selected mode (Roast/Kind) and generates structured JSON with the vibe profile, score, objects, one-liner, and tip.
Two specialized models instead of one general-purpose model. Florence-2 is a vision specialist — small and fast. Qwen2.5-3B handles the creative writing. Together they're 3.8B, well under the 4B limit.
At only 0.8B parameters, Florence-2-large produces surprisingly detailed room descriptions. It catches things like "a guitar leaning against the wall" or "fairy lights draped over the headboard" — exactly the kind of details that make vibe readings funny and specific.
The hardest part wasn't the tech — it was getting Qwen2.5-3B to be genuinely funny in Roast mode without being mean, and genuinely warm in Kind mode without being generic. The system prompts went through several iterations. The key was giving it a persona ("You are a brutally funny interior design critic") rather than just instructions ("Be funny").
Qwen2.5-3B occasionally produces malformed JSON. The fix: parse what you can, retry once if needed, and fall back to showing the raw room description if all else fails. Users would rather see partial results than an error screen.
Both models load on first request, not at startup. HF Spaces has a build timeout — if model download exceeds it, the Space never starts. Lazy loading means instant boot, models download when someone actually clicks "Check My Vibe."
Try Room Vibe Check on Hugging Face Spaces
Built for the Build Small Hackathon 2026 — Thousand Token Wood track. 3.8B total parameters, no cloud APIs, full agent trace transparency.
Generate a vibe profile for your room from a photo