Skip to main content
Avatar

Mao Song(毛松)'s Homepage

Delving into the Latent Unknown.

  1. Bilibili
  2. Google Scholar
  1. Home
  2. Archives
  3. Search
  4. Tags
  5. About
    1. Dark Mode

Categories

LLM LeetCode MLLM Infra Tutorial Machine Learning Reasoning Math NLP Terminal Deep Learning RAG RL Unified MLLM 随笔

Tags

MoE Qwen Medium Attention Reasoning Google Deepseek DFS Position Encoding Matrix Array DP RL String Transformer Tree BFS Bit Manipulation Hard Distributed Training

Archives

2026 12
2025 102
2024 51
2023 1
Infra   Tutorial

Distributed training--Basic

Basic concepts in distributed training
May 12, 2025
9 min read
distributed training
LLM

Notes on LLaMA4 blog

LLaMA4 blog阅读笔记
April 30, 2025
3 min read
LLaMA
LLM

Notes on Qwen3 blog

Qwen3系列LLM发布
April 29, 2025
3 min read
Qwen
MLLM

Data mixture in MLLM

MLLM训练数据配比简单总结
April 25, 2025
2 min read
data selection dataset
随笔

随笔-身体健康

疾病缠身才明白身体健康的重要性
April 23, 2025
2 min read
身体健康
1 … 19 20 21 … 34
© 2020 - 2026 Mao Song(毛松)'s Homepage
Built with Hugo
Theme Stack designed by Jimmy