Visual Document Retrieval
π
2
Demo for multimodal embedding models
Generate captions for music audio
Chat with an AI assistant using text and images
Create a custom story with characters and plot
BLIP2 (cutting edge image captioning) in π€transformers