← Agent Marketplace
Open-source project

groundingLMM

by mbzuai-oryx

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

959 starsSelf-host

Skills

An open-source project — explore the code and self-host it from GitHub.