MIE Research Seminar: Bridging Online and Offline Learning Towards Improved Data-Driven Decision Making

March 1, 2023 @ 10:00 am - 11:00 am

Topic: Bridging Online and Offline Learning Towards Improved Data-Driven Decision Making

Speaker: Yunzong Xu, MIT

Abstract

Machine learning is playing an increasingly important role in decision making, with key applications ranging from dynamic pricing and recommendation systems to personalized medicine and clinical trials. While supervised machine learning traditionally excels at making predictions based on i.i.d. offline data, many modern decision-making tasks require making sequential decisions based on data collected online. This discrepancy gives rise to the important challenge of bridging offline supervised learning and online interactive learning to unlock the full potential of data-driven decision making.
In the main part of this talk, we consider the challenge of reducing difficult online decision-making problems to well-understood offline supervised learning problems. Focusing on contextual bandits, a core class of online decision-making problems, we present the first optimal and efficient reduction from contextual bandits to offline regression. A remarkable consequence of our results is that advances in offline regression immediately translate to contextual bandits, statistically and computationally. We illustrate the advantages of our results through new guarantees in complex operational environments and experiments on real-world datasets. We also discuss the extensions of our results to more challenging setups, including reinforcement learning in large state spaces.
After the main part, I will provide an overview of my additional work and broader research agenda on bridging online and offline learning towards improved data-driven decision making. I will highlight the importance of problem structures and discuss the exciting opportunities for the operations research community.

Speaker Bio

Yunzong Xu is a fifth-year PhD student in the Institute for Data, Systems, and Society at MIT, advised by Prof. David Simchi-Levi. His research lies at the intersection of machine learning and operations research. His current research interests include data-driven decision making, reinforcement learning, bandit learning, statistical learning, econometrics, and causal inference, with applications to e-commerce, supply chains, and healthcare. Over the course of his PhD, his research has been recognized by multiple paper awards from the INFORMS George Nicholson Paper Competition, the Applied Probability Society, and other competitions and organizations. His industrial experience includes an internship at Microsoft Research on reinforcement learning, as well as an ongoing research collaboration with IBM and Boston Scientific on healthcare inventory management. Prior to joining MIT, he received dual bachelor’s degrees in information systems and mathematics from Tsinghua University in 2018.