Homepage
Delving into the latent unknown.
Mao Song's technical blog covering machine learning, large language models (LLMs), deep learning research, and AI innovations.
News
- Turn this blog framework into a blog template
A blog template for research articles and blog posts.
- Update the blog framework from Hugo to Astro
I migrated the blog framework from Hugo to Astro. The blog is now more flexible and easier to maintain.
- Investigating Redundancy in Multimodal Large Language Models with Multiple Vision Encoders
We invalidate the hypothesis that 'adding more vision encoders will always improve the performance of multimodal large language models'. Accepted at ICLR 2026.
Latest
- LLM Overview of Flash Attention series
The authors propose FlashAttention, a method that improves the efficiency of attention computation by reducing the memory access overhead of multi-head attention.
- MLLM Overview of Multimodal Large Language Models
- Overview of unified MLLM
- MLLM Overview of Keye-VL series