LLaVA-Mini

LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.

video

View on GitHub

572 Stars

33 Forks

572 Watchers

Python Language

apache-2.0 License

100 SrcLog Score

Cost to Build

$2.49M

Market Value

$8.25M

How is this calculated?

Growth over time

3 data points · 2025-08-12 → 2026-04-20

Stars Forks Watchers

💬

How do you feel about this project?

Ask AI about LLaVA-Mini

Question copied to clipboard

What is the ictnlp/LLaVA-Mini GitHub project? Description: "LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner. ". Written in Python. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone LLaVA-Mini

Clone via HTTPS

git clone https://github.com/ictnlp/LLaVA-Mini.git

Clone via SSH

[email protected]:ictnlp/LLaVA-Mini.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the LLaVA-Mini issue tracker:

Open GitHub Issues

Similar to LLaVA-Mini

video.js ijkplayer iina FFmpeg mpv JiaoZiVideoPlayer GSYVideoPlayer WWDC coursera-dl openFrameworks mediaelement digital_video_introduction moviepy ScreenToGif hls.js gifify clappr PictureSelector butter-desktop NewPipe PeerTube lux channels rust-learning DPlayer lightGallery jitsi-meet Kotlin-Tutorials boxing AndroidVideoCache