Logo des Repositoriums
 
Textdokument

Comparing Link Grammars and Dependency Grammars for parsing German histological reports

Vorschaubild nicht verfügbar

Volltext URI

Dokumententyp

Zusatzinformation

Datum

2022

Zeitschriftentitel

ISSN der Zeitschrift

Bandtitel

Verlag

Gesellschaft für Informatik, Bonn

Zusammenfassung

The availability of structured data is becoming an increasingly critical factor in medical research. Still, pathologists in Germany document their findings in running text instead of in a structured form. In order to obtain structured data from these report texts, hey have to be converted to a more useful form. Link Grammars (LGs) and Dependency Grammars (DGs) both can be used to parse the texts. Hence, LGs and DGs can be used for information extraction on histological reports. This paper aims to compare LGs and DGs, to show why DGs are superior and to evaluate the performance of a DG parser on a corpus of 200 histological reports randomly selected from breast biopsy reports. The DG parser achieved an Unlabelled Attachment Score of 96, a Labelled Accuracy of 95 and a Labelled Attachment Score of 93. Further evaluation shows that the occurrence of medical words which have not been part of the training data does not affect the parsers performance.

Beschreibung

Dörenberg, Julian (2022): Comparing Link Grammars and Dependency Grammars for parsing German histological reports. SKILL 2022. Gesellschaft für Informatik, Bonn. PISSN: 1614-3213. ISBN: 978-3-88579-752-4. pp. 153-163. Natural Language Processing. Hamburg. 29.-30. September 2022

Zitierform

DOI

Tags