Derivation and validation of text search algorithms for renal and adrenal lesion identification in radiology text reports
Keywords:health administrative database; text search algorithms; renal mass; adrenal mass
Introduction: Most cohort studies are limited by sampling and accrual bias. The capability to detect specific lesions identified in radiological text reports could eliminate these biases and benefit patient care, clinical research, and trial recruitment. This study derived and internally validated text search algorithms to identify four common urological lesions (solid renal masses, complex renal cysts, adrenal masses, and simple renal cysts) using radiology text reports.
Methods: A simple random sample of 10 000 abdominal ultrasound (US) and computed tomography (CT) reports was drawn from our hospital’s data warehouse. Reports were manually reviewed to determine the true status of the four lesions. Using commonly available software, we created logistic regression models having as predictors the status of a priori selected text terms in the report. We used bootstrap sampling with 95th percentile thresholds to select variables for the final models, which were modified into point systems. A second independent, random sample of 2855 reports, stratified by the number of points for each abnormality, was reviewed in a blinded fashion to measure the accuracy of each lesion’s point system.
Results: The prevalence of solid renal mass, complex renal cyst, adrenal mass, and simple renal cyst, was 2.0%, 1.7%, 3.2%, and 20.0%, respectively. Each model contained between one and five text terms with c-statistics ranging between 0.66 and 0.90. In the independent validation, the scoring systems accurately predicted the probability that a text report cited the four lesions.
Conclusions: Textual radiology reports can be analyzed using common statistical software to accurately determine the probability that important abnormalities of the kidneys or adrenal glands exist. These methods can be used for case identification or epidemiological studies.
How to Cite
You, the Author(s), assign your copyright in and to the Article to the Canadian Urological Association. This means that you may not, without the prior written permission of the CUA:
- Post the Article on any Web site
- Translate or authorize a translation of the Article
- Copy or otherwise reproduce the Article, in any format, beyond what is permitted under Canadian copyright law, or authorize others to do so
- Copy or otherwise reproduce portions of the Article, including tables and figures, beyond what is permitted under Canadian copyright law, or authorize others to do so.
The CUA encourages use for non-commercial educational purposes and will not unreasonably deny any such permission request.
You retain your moral rights in and to the Article. This means that the CUA may not assert its copyright in such a way that would negatively reflect on your reputation or your right to be associated with the Article.
The CUA also requires you to warrant the following:
- That you are the Author(s) and sole owner(s), that the Article is original and unpublished and that you have not previously assigned copyright or granted a licence to any other third party;
- That all individuals who have made a substantive contribution to the article are acknowledged;
- That the Article does not infringe any proprietary right of any third party and that you have received the permissions necessary to include the work of others in the Article; and
- That the Article does not libel or violate the privacy rights of any third party.