Kevers, Laurent
[UCL]
Medori, Julia
[UCL]
This paper addresses the semi-automatic encoding of patient discharge summaries into medical classifications such as ICD-9-CM. The methods detailed here are symbolic approaches that allow unannotated corpora to be processed without any machine learning. The first method is based on the morphological analysis (MA) of medical terms extracted with hand-crafted linguistic resources. The second (ELP) relies on the automatic extraction of variants of ICD-9-CM code labels. Each method was evaluated on a set of 19,692 French discharge summaries from a General Internal Medicine unit. Depending on the number of suggested classes, the MA method achieved a maximum F-measure of 28.00 and a maximum recall of 46.13%. For the ELP method, the best F-measure was 29.43 and the maximum recall was 52.74%. Combining the two methods raised the best recall to 60.21% and the maximum F-measure to 31.64.
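The recall and F-measure figures above can be read against the following minimal sketch, which scores a set of suggested codes for a single summary and shows why merging two suggestion lists can raise recall. It is an illustration only: the function name, the ICD-9-CM codes, and the use of a plain set union to combine the MA and ELP suggestions are all assumptions, and the abstract does not state how the authors combined the methods or averaged scores over the corpus.

```python
def precision_recall_f1(suggested, reference):
    """Return (precision, recall, F1) for one summary.

    `suggested` holds the codes proposed by a method, `reference` the codes
    assigned by human coders. Both are treated as sets (hypothetical helper).
    """
    suggested, reference = set(suggested), set(reference)
    true_positives = len(suggested & reference)
    precision = true_positives / len(suggested) if suggested else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical codes for one discharge summary (not taken from the paper).
reference = {"428.0", "401.9", "250.00"}
ma_codes = {"428.0", "486"}            # suggestions from the MA method
elp_codes = {"401.9", "486", "585.9"}  # suggestions from the ELP method

# Assumed combination strategy: a plain union of both suggestion sets.
combined = ma_codes | elp_codes

for name, codes in [("MA", ma_codes), ("ELP", elp_codes), ("MA+ELP", combined)]:
    p, r, f = precision_recall_f1(codes, reference)
    print(f"{name}: precision={p:.2f} recall={r:.2f} F1={f:.2f}")
```

In this toy case each method alone recovers one of the three reference codes, while the union recovers two, so recall rises. Precision need not improve in the same way, since a union also accumulates the wrong suggestions of both methods.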
Bibliographic reference
Kevers, Laurent ; Medori, Julia. Symbolic classification methods for patient discharge summaries encoding into ICD. Advances in Natural Language Processing. 7th International Conference on NLP, IceTAL 2010 (Reykjavik, Iceland, 16-18 August 2010). In: Loftsson, H.; Rognvaldsson, E.; Helgadottir, S., Advances in Natural Language Processing. 7th International Conference on NLP, IceTAL 2010, Springer-Verlag, 2010, p. 197-208
Permanent URL
http://hdl.handle.net/2078.1/67342