0. 示例博客
Published:
这是一个示例博客文章。
Published:
这是一个示例博客文章。
Published:
About Position Embedding: Sinusoidal PE & RoPE
Published:
Sinusoidal PE vs. RoPE
Published:
Some of Other Thoughts
Published:
Introduction of Proximal Policy Optimization (PPO)
Published:
Introduction of Group Relative Policy Optimization (GRPO)
Published:
Introduction of KL Divergence, entropy, cross entropy, and the difference between Forward KL and Reverse KL.
Published:
Derivation of common loss functions from a unified MLE / NLL perspective: MSE, MAE, CE, and beyond.
Published:
Deriving MSE, MAE, KL, and InfoNCE losses from the Information Bottleneck framework, unified via mutual information.
Published:
On-policy distillation (OPD): combining SFT’s dense supervision with RL’s on-policy property, plus a tour of self-distillation works (OPSD, SDFT, SDPO, CRISP, ExOPD, GAD).
Published:
A visual companion to blog 8. Image panels collected from an external author’s note (watermarks preserved).
Short description of portfolio item number 1
Short description of portfolio item number 2 
Published in Journal 1, 2009
This paper is about the number 1. The number 2 is left for future work.
Recommended citation: Your Name, You. (2009). "Paper Title Number 1." Journal 1. 1(1).
Download Paper | Download Slides | Download Bibtex
Published in Journal 1, 2010
This paper is about the number 2. The number 3 is left for future work.
Recommended citation: Your Name, You. (2010). "Paper Title Number 2." Journal 1. 1(2).
Download Paper | Download Slides
Published in Journal 1, 2015
This paper is about the number 3. The number 4 is left for future work.
Recommended citation: Your Name, You. (2015). "Paper Title Number 3." Journal 1. 1(3).
Download Paper | Download Slides
Published in GitHub Journal of Bugs, 2024
This paper is about fixing template issue #693.
Recommended citation: Your Name, You. (2024). "Paper Title Number 3." GitHub Journal of Bugs. 1(3).
Download Paper
Published in GitHub Journal of Bugs, 2024
This paper is about a famous math equation, \(E=mc^2\)
Recommended citation: Your Name, You. (2024). "Paper Title Number 3." GitHub Journal of Bugs. 1(3).
Download Paper
Published:
This is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.