-
TMIS (Plug-in) estimator is statistically efficient for Tabular OPE
Surprisingly, Monte Carlo on-policy estimator is actually statistically inefficient.
-
Some nice images
Surprisingly, Monte Carlo on-policy estimator is actually statistically inefficient.