KoGra-DB: Using MapReduce for language corpora

dc.contributor.author	Schneider, Roman
dc.contributor.editor	Horbach, Matthias
dc.date.accessioned	2019-03-07T09:31:47Z
dc.date.available	2019-03-07T09:31:47Z
dc.date.issued	2013
dc.description.abstract	Linguistic query systems are special-purpose information retrieval (IR) applications. We present a novel approach for the efficient exploitation of very large linguistic corpora, combining the advantages of relational database management systems (RDBMS) with the functional MapReduce programming model. Our implementation uses the German DEREKO reference corpus with multi-layer linguistic annotations and several types of text-specific metadata, but the proposed strategy is language-independent and adaptable to large-scale multilingual corpora.	en
dc.identifier.isbn	978-3-88579-614-5
dc.identifier.pissn	1617-5468
dc.identifier.uri	https://dl.gi.de/handle/20.500.12116/20655
dc.language.iso	en
dc.publisher	Gesellschaft für Informatik e.V.
dc.relation.ispartof	INFORMATIK 2013 – Informatik angepasst an Mensch, Organisation und Umwelt
dc.relation.ispartofseries	Lecture Notes in Informatics (LNI) - Proceedings, Volume P-220
dc.title	KoGra-DB: Using MapReduce for language corpora	en
dc.type	Text/Conference Paper
gi.citation.endPage	142
gi.citation.publisherPlace	Bonn
gi.citation.startPage	140
gi.conference.date	16-20 September 2013
gi.conference.location	Koblenz
gi.conference.sessiontitle	Regular Research Papers
