Healthcare text mining projects: mining clinical narratives and patient-generated data
We currently run a number of projects to extract various structured data from unstructured clinical narratives and electronic healthcare records (EHRs). In previous projects we have developed a combination of rule-based and machine-learning methods to identify diseses that a patient has or does not have (“disease status”), including identification of various co-morbidities, problems, tests and treatments. We also work on the extraction of medication-related information (such as medication name, dosage, reason for taking, frequency, duration etc.) and clinical temporal text mining. These tasks were assessed as part of an international text mining challenge in the clinical/health-care domain: for more detail, see here, where we have showed continuous success.
Another strand in healthcare text mining is the extraction of subjective information from patient-generated data, such as tweets, blogs or patient’s narratives. We have also done some work on analysis of suicide notes (as part of the i2b2 challenges).
In collaboration with The Christie Hospital and the University of Salford, we are running “A study using techniques from clinical text mining to compare the narrative experiences of patients with medulloblastoma with factors identified from their hospital records”. This project aims to capture the narrative experiences of patients and compare them to the themes that are identified by text mining of the Christie Hospital health records. The findings are intended to provide an evidence-base for clinical service development. This work is funded by The Christie Charity Fund (£25K), and is part of Azad’s PhD.
As a continuation of The Christie Hospital’s project, we are part of a project led by the University of Salford (Prof Tony Long) on “Systematic analysis of healthcare records and the narrative experiences of children with tumours of the central nervous system and their carers – informing the evolution of selfesteem and healthrelated outcomes for future targeted interventions”. This project is funded by the Kidscan Charity (£62K).
The NIHR-funded project on “Enhanced occupational therapy interventions for children and adolescents with central nervous system tumours” aims to use advanced text-mining techniques to support patient involvement and decision making in clinical practice. It is a £250K collaboration with Royal Manchester Children Hospital and University of Salford, due to start in 2013.
Our team currently inolves A. Dehghan, G. Karystianis, J. Keane, S. Stivaros, E. Estlin, A. Kovacevic (collaborator), M. Filannino, G. Nenadic. More information on healthcare text mining tools is available on our HECTA pages.