Characterizing mammography reports for health analytics

Carlos C. Rojas, Robert M. Patton, Barbara G. Beckerman

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

As massive collections of digital health data are becoming available, the opportunities for large-scale automated analysis increase. In particular, the widespread collection of detailed health information is expected to help realize a vision of evidence-based public health and patient-centric health care. Within such a framework for large scale health analytics we describe the transformation of a large data set of mostly unlabeled and free-text mammography data into a searchable and accessible collection, usable for analytics. We also describe several methods to characterize and analyze the data, including their temporal aspects, using information retrieval, supervised learning, and classical statistical techniques. We present experimental results that demonstrate the validity and usefulness of the approach, since the results are consistent with the known features of the data, provide novel insights about it, and can be used in specific applications. Additionally, based on the process of going from raw data to results from analysis, we present the architecture of a generic system for health analytics from clinical notes.

Original languageEnglish
Pages (from-to)1197-1210
Number of pages14
JournalJournal of Medical Systems
Volume35
Issue number5
DOIs
StatePublished - Oct 2011

Funding

Prepared by Oak Ridge National Laboratory, P. O. Box 2008, Oak Ridge, Tennessee, 37831-6285, managed by UT-Battelle, LLC, for the U.S. Department of Energy Under contract DE-AC05-00OR22725. Research partially sponsored by the Laboratory Directed Research and Development Program of Oak Ridge National Laboratory, LDRD #5327. This manuscript has been authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the U.S. Department of Energy.

FundersFunder number
UT-Battelle
U.S. Department of EnergyDE-AC05-00OR22725
Oak Ridge National Laboratory
Laboratory Directed Research and Development5327

    Keywords

    • Clinical notes
    • Mammography reports
    • Temporal analysis
    • Text analysis

    Fingerprint

    Dive into the research topics of 'Characterizing mammography reports for health analytics'. Together they form a unique fingerprint.

    Cite this