[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! Also supports many more LMs, such as MiniGPT-4, StableLM, and MOSS.
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
[ICLR 2026 Oral] ScaleCUA is a family of open-source computer-use agents that can operate in cross-platform environments (Windows, macOS, Ubuntu, and Android).
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
Chatbot Arena meets multi-modality! Multi-Modality Arena lets you benchmark vision-language models side-by-side with images as input. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!
[NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): a comprehensive benchmark for systematically evaluating how well existing MLLMs comprehend long multimodal documents.
GenExam: A Multidisciplinary Text-to-Image Exam