Targeted Active Learning for Bayesian Decision-Making
Filstroff, Louis; Sundin, Iiris; Mikkola, Petrus; Tiulpin, Aleksei; Kylmäoja, Juuso; Kaski, Samuel (2024-06-12)
Filstroff, Louis
Sundin, Iiris
Mikkola, Petrus
Tiulpin, Aleksei
Kylmäoja, Juuso
Kaski, Samuel
12.06.2024
Filstroff, L., Sundin, I., Mikkola, P., Tiulpin, A., Kylmäoja, J. & Kaski, S. (2024). Targeted Active Learning for Bayesian Decision-Making. Transactions on machine learning research. https://openreview.net/forum?id=KxPjuiMgmm
https://creativecommons.org/licenses/by/4.0/
CC BY 4.0. Licensed under Creative Commons Attribution 4.0. International.
https://creativecommons.org/licenses/by/4.0/
CC BY 4.0. Licensed under Creative Commons Attribution 4.0. International.
https://creativecommons.org/licenses/by/4.0/
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:oulu-202504112560
https://urn.fi/URN:NBN:fi:oulu-202504112560
Tiivistelmä
Abstract
Active learning is usually applied to acquire labels of informative data points in supervised learning, to maximize accuracy in a sample-efficient way. However, maximizing the supervised learning accuracy is not the end goal when the results are used for decision-making, for example in personalized medicine or economics. We argue that when acquiring samples sequentially, the common practice of separating learning and decision-making is sub-optimal, and we introduce an active learning strategy that takes the down-the-line decision problem into account. Specifically, we adopt a Bayesian experimental design approach, in which the proposed acquisition criterion maximizes the expected information gain on the posterior distribution of the optimal decision. We compare our targeted active learning strategy to existing alternatives on both simulated and real data and show improved performance in decision-making accuracy.
Active learning is usually applied to acquire labels of informative data points in supervised learning, to maximize accuracy in a sample-efficient way. However, maximizing the supervised learning accuracy is not the end goal when the results are used for decision-making, for example in personalized medicine or economics. We argue that when acquiring samples sequentially, the common practice of separating learning and decision-making is sub-optimal, and we introduce an active learning strategy that takes the down-the-line decision problem into account. Specifically, we adopt a Bayesian experimental design approach, in which the proposed acquisition criterion maximizes the expected information gain on the posterior distribution of the optimal decision. We compare our targeted active learning strategy to existing alternatives on both simulated and real data and show improved performance in decision-making accuracy.
Kokoelmat
- Avoin saatavuus [38865]