MIE Research Seminar: Bridging Online and Offline Learning Towards Improved Data-Driven Decision Making

March 1, 2023 @ 10:00 am - 11:00 am EST

Speaker: Yunzong Xu, MIT

Abstract

Machine learning is playing an increasingly important role in decision making, with key applications ranging from dynamic pricing and recommendation systems to personalized medicine and clinical trials. While supervised machine learning traditionally excels at making predictions based on i.i.d. offline data, many modern decision-making tasks require making sequential decisions based on data collected online. This discrepancy gives rise to important challenges in bridging offline supervised learning and online interactive learning to unlock the full potential of data-driven decision making.

In the main part of this talk, we consider the challenge of reducing difficult online decision-making problems to well-understood offline supervised learning problems. Focusing on contextual bandits, a core class of online decision-making problems, we present the first optimal and efficient reduction from contextual bandits to offline regression. A remarkable consequence of our results is that advances in offline regression immediately translate to contextual bandits, statistically and computationally. We illustrate the advantages of our results through new guarantees in complex operational environments and experiments on real-world datasets. We also discuss extensions of our results to more challenging setups, including reinforcement learning in large state spaces.

After the main part, I will provide an overview of my additional work and broader research agenda on bridging online and offline learning towards improved data-driven decision making. I will highlight the importance of problem structures and discuss the exciting opportunities for the operations research community.

Speaker Bio

Yunzong Xu is a fifth-year PhD student in the Institute for Data, Systems, and Society at MIT, advised by Prof. David Simchi-Levi. His research lies at the intersection of machine learning and operations research. His current research interests include data-driven decision making, reinforcement learning, bandit learning, statistical learning, econometrics and causal inference, with applications to e-commerce, supply chains, and healthcare. Over the course of his PhD, his research has been recognized by multiple paper awards from the INFORMS George Nicholson Paper Competition, the Applied Probability Society, and other competitions and organizations. His industrial experience includes an internship at Microsoft Research on reinforcement learning, as well as an ongoing research collaboration with IBM and Boston Scientific on healthcare inventory management. Prior to joining MIT, he received dual bachelor’s degrees in information systems and mathematics from Tsinghua University in 2018.