Categories
2025
Notes on Seed1.6
Notes on V-Triune
Notes on Magistral
Notes on SmolLM3
Notes on GLM-4.1V-Thinking
Notes on Qwen2.5-1M
Notes on Qwen2.5
Dual Chunk Attention
Notes on Qwen2
Notes on Qwen1.5
Notes on YaRN
Notes on Qwen-LLM
Hands on LLM(2) Transformer
Unified perspective on dLLM and LLM
Relationship between MLE and KL divergence
Notes on MiMo-VL
Hands on LLM(1) Tokenizer
Notes on attention bias
Notes on Position encoding
Notes on Qwen3
Notes on Seed1.5-VL
分布式训练:如何训练一个模型
分布式训练:参数量与计算量分析
Distributed training--Basic
Notes on LLaMA4 blog
Notes on Qwen3 blog
Data mixture in MLLM
随笔-身体健康
Notes on VAPO
Notes on VC-PPO
Notes on DAPO
Notes on Qwen2.5 omni
Understanding Sigmoid Loss in SigLip
Notes on Aya Vision
Notes on Gemma3

Overview of Qwen-VL series
Notes on QwQ-32B
compression is intelligence
Notes on Qwen2.5 VL
Git authentication error
Notes on Kimi k1.5
Screen usage
2024
Notes on Phi-4
An overview of adaption layer in multimodal large language models.
592. Fraction Addition and Subtraction
506. Relative Ranks
664. Strange Printer
264. Ugly Number II
Notes on VITA
78. Subsets
2812. Find the Safest Path in a Grid
1219. Path with Maximum Gold
861. Score After Flipping Matrix
786. K-th Smallest Prime Fraction
3075. Maximize Happiness of Selected Children
3068. Find the Maximum Sum of Node Values
ROUGE (Recall-Oriented Understudy)
506. Relative Ranks
2816. Double a Number Represented as a Linked List
2487. Remove Nodes From Linked List
237. Delete Node in a Linked List
881. Boats to Save People
165. Compare Version Numbers
Notes on t-SNE
MiniGPT-4-Enhancing Vision-Language Understanding with Advanced Large Language Models
Formal Algorithms for Transformer
1915. Number of Wonderful Substrings
2441. Largest Positive Integer That Exists With Its Negative
2000. Reverse Prefix of Word
834. Sum of Distances in Tree
2997. Minimum Number of Operations to Make Array XOR Equal to K
Regularization methods in deep learning
514. Freedom Trail
1289. Minimum Falling Path Sum II
BLEU (Bilingual Evaluation Understudy)
2370. Longest Ideal Subsequence
1137. N-th Tribonacci Number
310. Minimum Height Trees
752. Open the Lock
Notes on Llama3
1971. Find if Path Exists in Graph
1992. Find All Groups of Farmland
200. Number of Islands
463. Island Perimeter
Practical advice for analysis of large, complex data sets
988. Smallest String Starting From Leaf
623. Add One Row to Tree
405. Convert a Number to Hexadecimal
404. Sum of Left Leaves
129. Sum Root to Leaf Numbers
What's next for AI agentic workflows

Notes on RAG
