CliNER 2.0: Accessible and Accurate Clinical Concept Extraction

Willie Boag; Elena Sergeeva; Saurabh Kulshreshtha; Peter Szolovits; Anna Rumshisky; Tristan Naumann

CliNER 2.0: Accessible and Accurate Clinical Concept Extraction

Willie Boag ,
Elena Sergeeva ,
Saurabh Kulshreshtha ,
Peter Szolovits ,
Anna Rumshisky ,
Tristan Naumann

March 2018

Download BibTex

Clinical notes often describe important aspects of a patient’s stay and are therefore critical to medical research. Clinical concept extraction (CCE) of named entities-such as problems, tests, and treatments-aids in forming an understanding of notes and provides a foundation for many downstream clinical decision-making tasks. Historically, this task has been posed as a standard named entity recognition (NER) sequence tagging problem, and solved with feature-based methods using handengineered domain knowledge. Recent advances, however, have demonstrated the efficacy of LSTM-based models for NER tasks, including CCE. This work presents CliNER 2.0, a simple-to-install, open-source tool for extracting concepts from clinical text. CliNER 2.0 uses a word-and character-level LSTM model, and achieves state-of-the-art performance. For ease of use, the tool also includes pre-trained models available for public use.