Evaluation of a Natural Language Processing Model to Identify and Characterize Patients in the United States With High-Risk Non–Muscle-Invasive Bladder Cancer.

Evaluation of a Natural Language Processing Model to Identify and Characterize Patients in the United States With High-Risk Non–Muscle-Invasive Bladder Cancer

Challenge

Identifying and characterizing patients with non-muscle invasive bladder cancer using unstructured EHR data is methodologically challenging, and NLP approaches had not been validated for this specific oncology application at scale in real-world US data.

Solution

Target RWE researchers developed and evaluated an NLP model applied to EHR data from the Komodo Healthcare Map to identify and characterize NMIBC patients, validating model performance for extracting clinically meaningful patient profiles from unstructured clinical notes.

Impact

Demonstrating that NLP can reliably identify and characterize NMIBC patients from real-world EHR data expands Target RWE's capability to study rare oncology populations, enabling oncology drug development partners to access previously inaccessible patient cohorts.