Visual Document Retrieval
π
2
Demo for multimodal embedding models
Edit images by providing prompts and noise settings
Detect objects in images or videos
Generate personalized portraits with your face and desired poses
Generate captions for music audio
Chat with an AI assistant using text and images
Create a custom story with characters and plot
BLIP2 (cutting edge image captioning) in π€transformers