Mao Song(毛松)'s Homepage

Delving into the Latent Unknown.

MLLM

Notes on Aya Vision

Aya Vision comes in two sizes, 8B and 32B, and supports 23 languages.
March 17, 2025
1 min read
multilingual
LLM   MLLM

Notes on Gemma3

Notes on Gemma3 technical report
March 15, 2025
4 min read
long context
MLLM

Overview of Qwen-VL series

Overview of Qwen-VL series
March 9, 2025
1 min read
Qwen
LLM   Reasoning

Notes on QwQ-32B

Notes on QwQ-32B
March 8, 2025
1 min read
Qwen Reasoning
LLM   Math

compression is intelligence

Understanding large language models from the perspective that compression is intelligence.
March 6, 2025
2 min read
Compression
© 2020 - 2026 Mao Song(毛松)'s Homepage