TY - JOUR
T1 - Machine learning and deep learning tools for the automated capture of cancer surveillance data
AU - Hsu, Elizabeth
AU - Hanson, Heidi
AU - Coyle, Linda
AU - Stevens, Jennifer
AU - Tourassi, Georgia
AU - Penberthy, Lynne
N1 - Publisher Copyright:
© 2024 Published by Oxford University Press.
PY - 2024/8/1
Y1 - 2024/8/1
N2 - The National Cancer Institute and the Department of Energy strategic partnership applies advanced computing and predictive machine learning and deep learning models to automate the capture of information from unstructured clinical text for inclusion in cancer registries. Applications include extraction of key data elements from pathology reports, determination of whether a pathology or radiology report is related to cancer, extraction of relevant biomarker information, and identification of recurrence. With the growing complexity of cancer diagnosis and treatment, capturing essential information with purely manual methods is increasingly difficult. These new methods for applying advanced computational capabilities to automate data extraction represent an opportunity to close critical information gaps and create a nimble, flexible platform on which new information sources, such as genomics, can be added. This will ultimately provide a deeper understanding of the drivers of cancer and outcomes in the population and increase the timeliness of reporting. These advances will enable better understanding of how real-world patients are treated and the outcomes associated with those treatments in the context of our complex medical and social environment.
AB - The National Cancer Institute and the Department of Energy strategic partnership applies advanced computing and predictive machine learning and deep learning models to automate the capture of information from unstructured clinical text for inclusion in cancer registries. Applications include extraction of key data elements from pathology reports, determination of whether a pathology or radiology report is related to cancer, extraction of relevant biomarker information, and identification of recurrence. With the growing complexity of cancer diagnosis and treatment, capturing essential information with purely manual methods is increasingly difficult. These new methods for applying advanced computational capabilities to automate data extraction represent an opportunity to close critical information gaps and create a nimble, flexible platform on which new information sources, such as genomics, can be added. This will ultimately provide a deeper understanding of the drivers of cancer and outcomes in the population and increase the timeliness of reporting. These advances will enable better understanding of how real-world patients are treated and the outcomes associated with those treatments in the context of our complex medical and social environment.
UR - http://www.scopus.com/inward/record.url?scp=85200712691&partnerID=8YFLogxK
U2 - 10.1093/jncimonographs/lgae018
DO - 10.1093/jncimonographs/lgae018
M3 - Article
C2 - 39102883
AN - SCOPUS:85200712691
SN - 1052-6773
VL - 2024
SP - 145
EP - 151
JO - Journal of the National Cancer Institute - Monographs
JF - Journal of the National Cancer Institute - Monographs
IS - 65
ER -