KoGra-DB: Using MapReduce for language corpora
dc.contributor.author | Schneider, Roman | |
dc.contributor.editor | Horbach, Matthias | |
dc.date.accessioned | 2019-03-07T09:31:47Z | |
dc.date.available | 2019-03-07T09:31:47Z | |
dc.date.issued | 2013 | |
dc.description.abstract | Linguistic query systems are special-purpose IR applications. We present a novel state-of-the-art approach for the efficient exploitation of very large linguistic corpora, combining the advantages of relational database management systems (RDBMS) with the functional MapReduce programming model. Our implementation uses the German DEREKO reference corpus with multi-layer linguistic annotations and several types of text-specific metadata, but the proposed strategy is language-independent and adaptable to large-scale multilingual corpora. | en |
dc.identifier.isbn | 978-3-88579-614-5 | |
dc.identifier.pissn | 1617-5468 | |
dc.identifier.uri | https://dl.gi.de/handle/20.500.12116/20655 | |
dc.language.iso | en | |
dc.publisher | Gesellschaft für Informatik e.V. | |
dc.relation.ispartof | INFORMATIK 2013 – Informatik angepasst an Mensch, Organisation und Umwelt | |
dc.relation.ispartofseries | Lecture Notes in Informatics (LNI) - Proceedings, Volume P-220 | |
dc.title | KoGra-DB: Using MapReduce for language corpora | en |
dc.type | Text/Conference Paper | |
gi.citation.endPage | 142 | |
gi.citation.publisherPlace | Bonn | |
gi.citation.startPage | 140 | |
gi.conference.date | 16-20 September 2013 | |
gi.conference.location | Koblenz | |
gi.conference.sessiontitle | Regular Research Papers |
Files
Original bundle
- Name: 140.pdf
- Size: 166.56 KB
- Format: Adobe Portable Document Format