Mao Song(毛松)'s Homepage

Delving into the Latent Unknown.

Math   RL

(RL series 3) Policy evaluation

This section explains how to solve for the value function and the Q-function.
March 18, 2026
3 min read
tutorial
Math   RL

(RL series 2) Bellman Equation

This section introduces the concepts behind the Bellman equation.
March 18, 2026
4 min read
tutorial
Math   RL

(RL series 1) Reinforcement Learning basic definitions

This section introduces the basic concepts and definitions of RL.
March 18, 2026
3 min read
MDP tutorial
Math

Fixed Point Theorem

The fixed-point theorem.
March 9, 2026
2 min read
Infra

Notes on roofline model

The roofline model is the theoretical basis for infra analysis and offers guidance for algorithm design and optimization.
February 26, 2026
4 min read
roofline GPU
© 2020 - 2026 Mao Song(毛松)'s Homepage