OpenGVLab

community

https://github.com/opengvlab

opengvlab

OpenGVLab

Request to join this org

AI & ML interests

Computer Vision

Organization Card

About org cards

OpenGVLab

Welcome to OpenGVLab! We are a research group from Shanghai AI Lab focused on Vision-Centric AI research. The GV in our name, OpenGVLab, means general vision, a general understanding of vision, so little effort is needed to adapt to new vision-based tasks.

Models

InternVL: a pioneering open-source alternative to GPT-4V.
InternImage: a large-scale vision foundation models with deformable convolutions.
InternVideo: large-scale video foundation models for multimodal understanding.
VideoChat: an end-to-end chat assistant for video comprehension.
All-Seeing-Project: towards panoptic visual recognition and understanding of the open world.

Datasets

ShareGPT4o: a groundbreaking large-scale resource that we plan to open-source with 200K meticulously annotated images, 10K videos with highly descriptive captions, and 10K audio files with detailed descriptions.
InternVid: a large-scale video-text dataset for multimodal understanding and generation.

Benchmarks

MVBench: a comprehensive benchmark for multimodal video understanding.

Collections 10

spaces 9

InternVL

MVBench Leaderboard

ControlLLM

Running on Zero

VideoMamba

VideoChat2

VideoChat: Chat-Centric Video Understanding

models 68

OpenGVLab/InternVL2-40B

Image-Text-to-Text • Updated about 14 hours ago • 14 • 6

OpenGVLab/InternVL2-1B

Image-Text-to-Text • Updated about 14 hours ago • 8 • 1

OpenGVLab/InternVL-Chat-V1-5-AWQ

Image-Text-to-Text • Updated about 14 hours ago • 2.75k • 9

OpenGVLab/InternVL-Chat-V1-5-Int8

Image-Text-to-Text • Updated about 14 hours ago • 4.26k • 58

OpenGVLab/InternVL-Chat-V1-5

Image-Text-to-Text • Updated about 14 hours ago • 32.9k • 377

OpenGVLab/Mini-InternVL-Chat-4B-V1-5

Image-Text-to-Text • Updated about 14 hours ago • 22.9k • 51

OpenGVLab/Mini-InternVL-Chat-2B-V1-5

Image-Text-to-Text • Updated about 14 hours ago • 24.1k • 51

OpenGVLab/InternVL2-26B

Image-Text-to-Text • Updated about 14 hours ago • 1.41k • 46

OpenGVLab/InternVL2-8B

Image-Text-to-Text • Updated about 14 hours ago • 1.3k • 18

OpenGVLab/InternVL2-4B

Image-Text-to-Text • Updated about 14 hours ago • 526 • 6

datasets 17

OpenGVLab/VideoChat2-IT

Viewer • Updated 10 days ago • 1.82M • 132 • 33

OpenGVLab/MVBench

Viewer • Updated 12 days ago • 4k • 238 • 15

OpenGVLab/GUI-Odyssey

Viewer • Updated 14 days ago • 7.74k • 9 • 2

OpenGVLab/MMT-Bench

Viewer • Updated 15 days ago • 30k • 6 • 1

OpenGVLab/MM-NIAH

Viewer • Updated 22 days ago • 3.52k • 5 • 9

OpenGVLab/InternVid-Full

Viewer • Updated Jun 5 • 47.6M • 33 • 7

OpenGVLab/ShareGPT-4o

Viewer • Updated May 29 • 59.4k • 183 • 93

OpenGVLab/CRPE

Viewer • Updated Mar 21 • 544 • 2 • 5

OpenGVLab/Region-Evaluation-Data

Preview • Updated Mar 21 • 3 • 1

OpenGVLab/AS-Core

Preview • Updated Mar 21 • 3 • 5