Open-source project
groundingLMM
by mbzuai-oryx
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM) is a first-of-its-kind model that generates natural language responses seamlessly integrated with object segmentation masks.
959 stars
An open-source project. Explore the code and self-host it from GitHub.