Ming Yin
Welcome to my homepage!
mingyin0312 at gmail dot com
Incoming Assistant Professor at Georgia Tech CSE
Senior Research Scientist at Snowflake AI
Postdoctoral Associate at Princeton ECE with Mengdi Wang
Ph.D.s in Computer Science & Statistics with Yu-Xiang Wang
B.S., University of Science and Technology of China, Mathematics
I have also spent time at Amazon AWS AI during the summers. My research focuses on understanding the foundations of sequential decision-making and applying them to real-world challenges in AI and science. For an overview, visit my Research page! I also (very occasionally) share insights and summarize key ideas from my research, along with topics of personal interest, on the Blog page. For a full list of my publications, check out the Publications page.
Always excited to exchange ideas at the frontier of AI :)
News
| Aug 19, 2025 |
See my posts on implementing DPO and GRPO from scratch! |
|---|---|
| Apr 13, 2025 |
CRISPR-GPT is accepted to Nature Biomedical Engineering |
| Mar 14, 2025 | I delievered a presentation at the nsf cps annual pi meeting! |
| Feb 17, 2025 | Invited as an Area Chair for NeurIPS-25. |
| Feb 12, 2025 | Invited to speak at the prestigious 8th Workshop on Cognition & Control in March at Gainesville! |
| Feb 10, 2025 | I will attend NSF Cyber Physical Systems PI Meeting Mar 13-14 at Nashville! |
| Jan 21, 2025 |
Excited to receive the Rising Star Award at CPAL! |
| Jan 19, 2025 |
Excited to receive the Rising Star Award at AI Symposium at KAUST! |
| Jan 14, 2025 |
Our MMMU benchmark is featured by Nature news article on AI testing/evaluation! |
| Jan 7, 2025 |
A personal thought on Reinforcement Learning for LLMs! |
| Jan 6, 2025 | Call for Workshop&Tutorial at UMAP! Topics on User Modeling in the Era of GenAI welcomed!! |
| Dec 30, 2024 |
My RL review literature is finally out — just in time to close out 2024 on a high note |
| Dec 12, 2024 | Invited as an Area Chair for ICML-25. |
| Oct 28, 2024 | I will give a tutorial at AAAI-25 on “Essential Theories and Techniques for Offline RL”. |
| Oct 3, 2024 | Happy to serve as the Workshop&Tutorial Co-chair for ACM UMAP, Aera Chair for AISTATS-25. |
| Sep 26, 2024 |
Four papers + One Dataset paper accetped to NeurIPS-24 |
| Sep 18, 2024 | Invited to talk at Young Research Workshop at Cornell ORIE! |
| Sep 1, 2024 | A new blog explaining Flow macthing for Generative AI! |
| May 6, 2024 | Happy to serve as an Area Chair for NeurIPS-24. |
| May 1, 2024 | Two papers accepted to icml 2024! |
| Apr 9, 2024 | Two papers accepted to (ISIT24) IEEE International Symposium on Information Theory! |
| Mar 20, 2024 | '’General Function Approximation in Nonstationary RL’’ accepted to IEEE Journal of JSAIT! |
| Feb 28, 2024 | MMMU has been selected as CVPR-24 Oral & Best Paper Finalist (top 0.2%)! |
| Feb 12, 2024 | Happy to review for the first Conference on Language Modeling. |
| Dec 1, 2023 |
Releasing MMMU, a Massive Multimodal Understand&Reasoning Benchmark for Expert AGI! |
| Oct 7, 2023 | Invited review for Journal of ASA and Conference on Learning Theory [COLT-24]. |
| Oct 7, 2023 |
TheoremQA accepted to EMNLP-23 main |
| Sep 21, 2023 |
Posterior Sampling with Delayed Feedback RL accpeted to NeurIPS-23 |
| Sep 11, 2023 | Invited to join Young Researchers Workshop at Cornell University Oct 1 - Oct 3. |
| Aug 10, 2023 | Happy to be part of the PC for MATH-AI WS, AI4science WS and review for EMNLP23, ICLR24. |
| Jul 1, 2023 | Invited review for Machine Learning by Springer. |
| Jun 2, 2023 | Excited to release TheoremQA to eval LLMs’ capabilities in Math, Physics, EE&CS and Finance! |
| Jun 1, 2023 | Happy to serve for IL with Implicit Human Feedback Workshop @ ICML-23 as part of the PC. |
| May 16, 2023 | Invited for a community committee choice session talk at 2023 Informs Annual Meeting. |
| May 16, 2023 | Invited to join the Review Board for ACM/IMS Journal of Data Science (3-year appointment). |
| May 15, 2023 | Happy to serve for the Neural Compression Workshop @ ICML-23 as part of the PC. |
| May 8, 2023 |
The new non-uniform misspecified linear bandit paper accepted to UAI-23! |
| Apr 24, 2023 | Two papers accepted to icml 2023! |
| Feb 27, 2023 | Happy to serve as an Area Chair for NeurIPS-23. |
| Feb 15, 2023 |
Excited to receive Graduation Day Award at ITA-23 |
| Jan 22, 2023 |
Parametric Differentiable Offline RL paper accpted to ICLR-23! |
| Jan 20, 2023 | I will participate in the ITA2023 in SD! Looking forward to meeting old and new friends! |
| Jan 18, 2023 | Happy to be a reviewer for ICML-2023, UAI-2023. |
| Nov 21, 2022 |
Instance-dependent Offline RL paper accpted to AAAI-23! Congrats to my JHU coauthors |
| Sep 16, 2022 |
Had a wonderful summer at AWS AI with Rasool Fakoor (and also Alex Smola) |
| Aug 29, 2022 | Happy to be part of the PC for the 3rd (Launchpad) Offline RL Workshop at NeurIPS 2022. |
| Aug 16, 2022 | Happy to be a reviewer for ICLR-2023, AAAI-23 and AISTATS-23. |
| May 15, 2022 | Low-switching RL paper accepted to ICML22 and Offline SSP paper accepted to UAI22! |
| Mar 31, 2022 | Find this thought-provoking blog by John Schulman! Great attitude!! |
| Mar 30, 2022 | Happy to be a reviewer for NeurIPS-2022. |
| Feb 13, 2022 | Happy to review for the new venue Transactions on Machine Learning Research (TMLR)! |
| Jan 21, 2022 |
Optimal Offline Linear Representation RL paper accepted to ICLR-2022! |
| Dec 19, 2021 | Happy to be a reviewer for ICML-2022. |
| Sep 29, 2021 | Three papers accepted to neurips 2021! |
| Aug 20, 2021 | I participate in the 2nd Offline DeepRL workshop (NeurIPS 21) as part of the PC. |
| Jul 12, 2021 | Happy to be a reviewer for ICLR-2022, AISTATS-2022. |
| Jun 25, 2021 | There is a new blog to explain the new Optimal Uniform OPE paper! |
| May 13, 2021 | I participate in the RL theory workshop (ICML 21) as part of the PC. |
| Apr 5, 2021 | Happy to be a reviewer for NeurIPS-2021. |
| Jan 28, 2021 |
Oral acceptance of Uniform OPE paper to AISTATS-2021! |
Selected publications
-
Preprint
-
PreprintToward Scientific Reasoning in LLMs: Training from Expert Discussions via Reinforcement LearningarXiv preprint arXiv:2505.19501, 2025
-
STSOn the Statistical Complexity for Offline and Low-Adaptive Reinforcement Learning with StructuresStatistical Science Journal, 2025
-
NBMECRISPR-GPT: LLM Agents for Automated Design of Gene-Editing ExperimentsNature Biomedical Engineering, 2025
-
NeurIPSA Theoretical Perspective for Speculative Decoding AlgorithmAdvances in Neural Information Processing Systems, 2024
-
NeurIPSNetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network SimulationAdvances in Neural Information Processing Systems (Datasets and Benchmarks Track), 2024
-
NeurIPSTransfer Q*: Principled Decoding for LLM AlignmentAdvances in Neural Information Processing Systems, 2024