Skip to main content
Avatar

Mao Song(毛松)'s Homepage

Delving into the Latent Unknown.

  1. Bilibili
  2. Google Scholar
  1. Home
  2. Archives
  3. Search
  4. Tags
  5. About
    1. Dark Mode

Categories

LLM LeetCode MLLM Infra Math Tutorial Machine Learning Reasoning RL NLP Terminal Agent Deep Learning RAG Unified MLLM 随笔

Tags

MoE Qwen Medium Reasoning Attention Google Deepseek DFS Position Encoding Matrix Array DP RL String Transformer Tree BFS Bit Manipulation Hard Kimi

Archives

2026 19
2025 102
2024 51
2023 1
MLLM

Notes on Qwen2.5 omni

Academic notes on Qwen2.5 omni
April 1, 2025
6 min read
Qwen omni audio
MLLM   Machine Learning

Understanding Sigmoid Loss in SigLip

Understanding Sigmoid Loss in SigLip
March 28, 2025
2 min read
loss
MLLM

Notes on Aya Vision

Aya Vision包含8B, 32B两个size,支持23种语言
March 17, 2025
1 min read
multilingual
Notes on Gemma3
LLM   MLLM

Notes on Gemma3

Notes on Gemma3 technical report
March 15, 2025
4 min read
long context
MLLM

Overview of Qwen-VL series

Overview of Qwen-VL series
March 9, 2025
1 min read
Qwen
1 … 22 23 24 … 35
© 2020 - 2026 Mao Song(毛松)'s Homepage
Built with Hugo
Theme Stack designed by Jimmy