Abstract
The National Cancer Institute and the Department of Energy strategic partnership applies advanced computing and predictive machine learning and deep learning models to automate the capture of information from unstructured clinical text for inclusion in cancer registries. Applications include extraction of key data elements from pathology reports, determination of whether a pathology or radiology report is related to cancer, extraction of relevant biomarker information, and identification of recurrence. With the growing complexity of cancer diagnosis and treatment, capturing essential information with purely manual methods is increasingly difficult. These new methods for applying advanced computational capabilities to automate data extraction represent an opportunity to close critical information gaps and create a nimble, flexible platform on which new information sources, such as genomics, can be added. This will ultimately provide a deeper understanding of the drivers of cancer and outcomes in the population and increase the timeliness of reporting. These advances will enable better understanding of how real-world patients are treated and the outcomes associated with those treatments in the context of our complex medical and social environment.
| Original language | English |
|---|---|
| Pages (from-to) | 145-151 |
| Number of pages | 7 |
| Journal | Journal of the National Cancer Institute - Monographs |
| Volume | 2024 |
| Issue number | 65 |
| DOIs | |
| State | Published - Aug 1 2024 |
Funding
This work was supported in part by the Joint Design of Advanced Computing Solutions for Cancer (JDACS4C) program established by the US Department of Energy (DOE) and the National Cancer Institute (NCI) of the National Institutes of Health. This work was performed under the auspices of the DOE by Los Alamos National Laboratory under Contract DE-AC5206NA25396 and Oak Ridge National Laboratory under Contract DE-AC05-00OR22725. This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the DOE under Contract No. DE-AC05-00OR22725. IMS is supported under Contract HHSN261201500003B/Task Order 75N91020F00001. This work was supported in part by the Joint Design of Advanced Computing Solutions for Cancer (JDACS4C) program established by the US Department of Energy (DOE) and the National Cancer Institute (NCI) of the National Institutes of Health. This work was performed under the auspices of the DOE by Los Alamos National Laboratory under Contract DE-AC5206NA25396 and Oak Ridge National Laboratory under Contract DE-AC05-00OR22725. This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the DOE under Contract No. DE-AC05- 00OR22725. IMS is supported under Contract HHSN261201500003B/ Task Order 75N91020F00001 The authors gratefully acknowledge the contributions of staff in the Surveillance Research Program at NCI, ORNL, IMS, and the SEER cancer registries in supporting the MOSSAIC work. The views expressed in this commentary are those of the authors and should not be interpreted to reflect the views or official policies of the National Cancer Institute, Department of Energy, or their contractors.
Fingerprint
Dive into the research topics of 'Machine learning and deep learning tools for the automated capture of cancer surveillance data'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver