NLP Performance in Clinical Notes: Addressing Data Limitations and System Overfitting

Written by nlp | Published 2025/04/01
Tech Story Tags: social-support-extraction | social-isolation-detection | nlp-in-psychiatry | electronic-health-records | clinical-nlp-applications | sdoh-in-psychiatry | psychiatric-notes-mining | psychiatric-encounter-notes

TLDRThere were insufficient instances in the notes of the emotional support subcategories to evaluate the NLP systems.via the TL;DR App

Table of Links

Abstract and 1. Introduction

2 Data

2.1 Data Sources

2.2 SS and SI Categories

3 Methods

3.1 Lexicon Creation and Expansion

3.2 Annotations

3.3 System Description

4 Results

4.1 Demographics and 4.2 System Performance

5 Discussion

5.1 Limitations

6 Conclusion, Reproducibility, Funding, Acknowledgments, Author Contributions, and References

SUPPLEMENTARY

Guidelines for Annotating Social Support and Social Isolation in Clinical Notes

Other Supervised Models

5.1 Limitations

Several limitations should be noted. There were insufficient instances in the notes of the emotional support subcategories to evaluate the NLP systems. Emotional support (and lack thereof) is an important and distinct fine-grained category that would ideally be identified in the notes. Second, the RBS was designed with specific lexicons from manual review at MSHS and WCM, may have experienced overfitting and led to an inflated f-score. It would be beneficial to validate these NLP systems on clinical notes from different EHR systems. Other healthcare systems that implement a lexicon-based rules approach will need to perform site-specific template removal to avoid the problem of false-positives. With fine-tuning, the LLM approach may have been able to correctly interpret the templates; however, because the templates were removed from the notes before the annotation process, this was not assessed.

This paper is available on arxiv under CC BY 4.0 DEED license.

Authors:

(1) Braja Gopal Patra, Weill Cornell Medicine, New York, NY, USA and co-first authors;

(2) Lauren A. Lepow, Icahn School of Medicine at Mount Sinai, New York, NY, USA and co-first authors;

(3) Praneet Kasi Reddy Jagadeesh Kumar. Weill Cornell Medicine, New York, NY, USA;

(4) Veer Vekaria, Weill Cornell Medicine, New York, NY, USA;

(5) Mohit Manoj Sharma, Weill Cornell Medicine, New York, NY, USA;

(6) Prakash Adekkanattu, Weill Cornell Medicine, New York, NY, USA;

(7) Brian Fennessy, Icahn School of Medicine at Mount Sinai, New York, NY, USA;

(8) Gavin Hynes, Icahn School of Medicine at Mount Sinai, New York, NY, USA;

(9) Isotta Landi, Icahn School of Medicine at Mount Sinai, New York, NY, USA;

(10) Jorge A. Sanchez-Ruiz, Mayo Clinic, Rochester, MN, USA;

(11) Euijung Ryu, Mayo Clinic, Rochester, MN, USA;

(12) Joanna M. Biernacka, Mayo Clinic, Rochester, MN, USA;

(13) Girish N. Nadkarni, Icahn School of Medicine at Mount Sinai, New York, NY, USA;

(14) Ardesheer Talati, Columbia University Vagelos College of Physicians and Surgeons, New York, NY, USA and New York State Psychiatric Institute, New York, NY, USA;

(15) Myrna Weissman, Columbia University Vagelos College of Physicians and Surgeons, New York, NY, USA and New York State Psychiatric Institute, New York, NY, USA;

(16) Mark Olfson, Columbia University Vagelos College of Physicians and Surgeons, New York, NY, USA, New York State Psychiatric Institute, New York, NY, USA, and Columbia University Irving Medical Center, New York, NY, USA;

(17) J. John Mann, Columbia University Irving Medical Center, New York, NY, USA;

(18) Alexander W. Charney, Icahn School of Medicine at Mount Sinai, New York, NY, USA;

(19) Jyotishman Pathak, Weill Cornell Medicine, New York, NY, USA.


Written by nlp | Natural Language Processing. I am Processing Natural Language, naturally. We publish trending research and blogs.
Published by HackerNoon on 2025/04/01