Ming Yin's Blog

A place to explain and share my ideas and thoughts

a post with diagrams

an example of a blog post with diagrams

3 min read · July 4, 2021

2021
Optimal offline RL with the unified model-based framework

A model-based framework combined with the singleton absorbing MDP technique achieves the optimal rate for several challenging offline tasks.

4 min read · June 24, 2021

2021
a distill-style blog post

an example of a distill-style blog post and main elements

7 min read · May 22, 2021

2021
A Brief Summary of Upper Bounds for Bandit Problems

This post summarizes the regret analysis of the Exploration-First algorithm and the Upper Confidence Bound (UCB) algorithm for multi-armed bandit (MAB) problems, and of the LinUCB algorithm for linear bandits.

6 min read · April 27, 2021

2021
A Brief Introduction to the Influence Function Technique

The influence function technique is powerful in that it provides a way to calculate the efficiency bound for semiparametric estimation problems.

3 min read · March 15, 2021

2021