Categories
14 pages
MLLM
An overview of adaptation layers in multimodal large language models.
VITA: Towards Open-Source Interactive Omni Multimodal LLM
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?