Assessing the quality of natural language text data
Author:
Abstract
We follow an empirical approach from data quality toward text quality, where the expectations of the consumer, human or machine, take the centre stage. We try to obtain numerical text quality statements which must be interpreted for the expectations of the user and suitability for automatic natural language processing (NLP) separately. We state that apart from text accessibility today only representational text quality metrics can be derived and computed automatically. Interestingly, text quality for NLP traces back to questions of text representation.
- Citation
- BibTeX
Sonntag, D.,
(2004).
Assessing the quality of natural language text data.
In:
Dadam, P. & Reichert, M.
(Hrsg.),
Informatik 2004 – Informatik verbindet – Band 1, Beiträge der 34. Jahrestagung der Gesellschaft für Informatik e.V. (GI).
Bonn:
Gesellschaft für Informatik e.V..
(S. 259-263).
@inproceedings{mci/Sonntag2004,
author = {Sonntag, Daniel},
title = {Assessing the quality of natural language text data},
booktitle = {Informatik 2004 – Informatik verbindet – Band 1, Beiträge der 34. Jahrestagung der Gesellschaft für Informatik e.V. (GI)},
year = {2004},
editor = {Dadam, Peter AND Reichert, Manfred} ,
pages = { 259-263 },
publisher = {Gesellschaft für Informatik e.V.},
address = {Bonn}
}
author = {Sonntag, Daniel},
title = {Assessing the quality of natural language text data},
booktitle = {Informatik 2004 – Informatik verbindet – Band 1, Beiträge der 34. Jahrestagung der Gesellschaft für Informatik e.V. (GI)},
year = {2004},
editor = {Dadam, Peter AND Reichert, Manfred} ,
pages = { 259-263 },
publisher = {Gesellschaft für Informatik e.V.},
address = {Bonn}
}
Dateien | Groesse | Format | Anzeige | |
---|---|---|---|---|
GI-Proceedings.50-55.pdf | 334.8Kb | View/ |
Haben Sie fehlerhafte Angaben entdeckt? Sagen Sie uns Bescheid: Send Feedback
More Info
ISBN: 3-88579-379-2
ISSN: 1617-5468
xmlui.MetaDataDisplay.field.date: 2004
Language:
(en)

Content Type: Text/Conference Paper