Abstract
Reinforcement learning (RL) can assist in medical decision making using patient data collected in electronic health record (EHR) systems. RL, a type of machine learning, can use these data to develop treatment policies. However, RL models are typically trained using imperfect retrospective EHR data. Therefore, if care is not taken in training, RL policies can propagate existing bias in healthcare. Literature that considers and addresses the issues of bias and fairness in sequential decision making are reviewed. The major themes to mitigate bias that emerge relate to (1) data management; (2) algorithmic design; and (3) clinical understanding of the resulting policies.
Original language | English |
---|---|
Article number | 52 |
Journal | ACM Computing Surveys |
Volume | 56 |
Issue number | 2 |
DOIs | |
State | Published - Sep 15 2023 |
Funding
The research is partially supported by Science Alliance, The University of Tennessee, and the Laboratory Directed Research and Development Program of Oak Ridge National Laboratory, managed by UT-Battelle, LLC, for the U.S. Department of Energy.
Keywords
- Reinforcement learning
- algorithmic bias
- bias management
- electronic health records
- treatment planning