2021
an archive of posts from this year
| Jul 4, 2021 | a post with redirect |
|---|---|
| Jul 4, 2021 | a post with diagrams |
| Jun 24, 2021 | Optimal offline RL with the unified model-based framework |
| May 22, 2021 | a distill-style blog post |
| Apr 27, 2021 | A Brief Summary of Upper Bounds for Bandit Problems |
| Mar 15, 2021 | A Brief Introduction to Influence Funtion Technique |
| Mar 4, 2021 | Variance Reduction Technique for Optimal Offline RL |
| Mar 3, 2021 | Why can't we surpass the speed of light? Einstein tells you |
| Feb 28, 2021 | TMIS (Plug-in) estimator is statistically efficient for Tabular OPE |