Session
SC1 - AI3: Data matters

Presentations
Optimizing data collection for machine learning
1University of Ottawa; 2NVIDIA Corporation; 3University of Toronto; 4Vector Institute

Deep learning systems use huge training data sets to reach desired performance, but over- or under-collecting training data can incur unnecessary costs and workflow delays. We propose and solve an optimal data collection problem that incorporates performance targets, collection costs, a time horizon, and penalties. Experiments on six deep learning tasks show that we reduce the risk of failing to meet performance targets by over 2x compared with existing estimation-based heuristics.

Policy learning with adaptively collected data
1Hong Kong University of Science and Technology; 2Chicago University; 3Stanford University; 4New York University

Learning optimal policies from historical data enables personalization in many applications. Adaptive data collection is becoming more common because it improves inferential efficiency and optimizes operational performance, but adaptivity complicates ex post policy learning. Our work complements the literature by learning policies from adaptively collected data. We propose an algorithm with a proven finite-sample regret bound that is minimax optimal, matching our established lower bound.

Quality vs. quantity of data in contextual decision-making: exact analysis under newsvendor loss
Columbia University, United States of America

We study the performance implications of the quality and quantity of data in contextual decision-making. We focus on the newsvendor loss and consider a data-driven model in which outcomes observed in similar contexts have similar distributions. We exactly characterize the worst-case regret of a classical class of kernel policies. Our exact analysis unveils new structural insights into the learning behavior of these policies that cannot be obtained through state-of-the-art general-purpose bounds.
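To make the newsvendor setting concrete, the sketch below shows the standard newsvendor loss and a kernel-weighted decision rule of the kind the last abstract studies: demands observed in similar contexts receive higher weight, and the order quantity is the critical-ratio quantile of the weighted empirical demand distribution. This is a minimal illustration, not the authors' method; the Gaussian kernel, the one-dimensional context, and all function names and cost parameters are assumptions for the example.

```python
import numpy as np

def newsvendor_loss(q, d, b=1.0, h=1.0):
    """Newsvendor loss for order quantity q and realized demand d.
    b: per-unit underage (lost-sale) cost; h: per-unit overage (holding) cost."""
    return b * np.maximum(d - q, 0.0) + h * np.maximum(q - d, 0.0)

def kernel_newsvendor_policy(x, contexts, demands, bandwidth=0.5, b=1.0, h=1.0):
    """Order quantity for a new context x from past (context, demand) pairs."""
    # Gaussian kernel weights: observations from similar contexts count more.
    w = np.exp(-0.5 * ((contexts - x) / bandwidth) ** 2)
    w = w / w.sum()
    # The loss-minimizing order is the critical-ratio quantile b/(b+h)
    # of the kernel-weighted empirical demand distribution.
    tau = b / (b + h)
    order = np.argsort(demands)
    cum_w = np.cumsum(w[order])
    return demands[order][np.searchsorted(cum_w, tau)]

# Example: identical contexts give equal weights, so the policy returns
# the median demand when underage and overage costs are symmetric.
q = kernel_newsvendor_policy(0.0, np.array([0.0, 0.0, 0.0]),
                             np.array([1.0, 2.0, 3.0]))  # → 2.0
```

With asymmetric costs (e.g. b > h) the critical ratio exceeds 1/2, so the policy orders a higher quantile of demand, which matches the classical newsvendor intuition of over-ordering when stockouts are costlier than leftovers.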