Datenbank Spektrum 13(1) - März 2013

https://dl.gi.de/handle/20.500.12116/11556

Autor*innen mit den meisten Dokumenten

Härder, Theo

Kemper, Alfons

Lehner, Wolfgang

Boehm, Matthias

Dannecker, Lars

Auflistung nach:

1 - 10 von 11

Zeitschriftenartikel
Dr. Dean Jacobs
(Datenbank-Spektrum: Vol. 13, No. 1, 2013) Kemper, Alfons; Lehner, Wolfgang
Zeitschriftenartikel
Datenmanagement und -exploration an der RWTH Aachen
(Datenbank-Spektrum: Vol. 13, No. 1, 2013) Seidl, Thomas
Der Lehrstuhl für Informatik 9 (Datenmanagement und -exploration) an der RWTH Aachen beschäftigt sich mit Data Mining- und Datenbanktechnologien für multimediale und räumlich-zeitliche Daten in ingenieur-, natur-, lebens-, wirtschafts- und sozialwissenschaftlichen Anwendungen. Sowohl die große Menge an Daten als auch die Komplexität der einzelnen Objekte bergen unterschiedliche Herausforderungen für die Analyse und Exploration realer Daten, denen wir mit der Entwicklung neuer effektiver sowie effizienter Konzepte für Datenanalyse und Datenmanagement begegnen.
Zeitschriftenartikel
Compilation of Query Languages into MapReduce
(Datenbank-Spektrum: Vol. 13, No. 1, 2013) Sauer, Caetano; Härder, Theo
The introduction of MapReduce as a tool for Big Data Analytics, combined with the new requirements of emerging application scenarios such as the Web 2.0 and scientific computing, has motivated the development of data processing languages which are more flexible and widely applicable than SQL. Based on the Big Data context, we discuss the points in which SQL is considered too restrictive. Furthermore, we provide a qualitative evaluation of how recent query languages overcome these restrictions. Having established the desired characteristics of a query language, we provide an abstract description of the compilation into the MapReduce programming model, which, up to minor variations, is essentially the same in all approaches. Given the requirements of query processing, we introduce simple generalizations of the model, which allow the reuse of well-established query evaluation techniques, and discuss strategies to generate optimized MapReduce plans.
Zeitschriftenartikel
Inkrementelle Neuberechnungen in MapReduce
(Datenbank-Spektrum: Vol. 13, No. 1, 2013) Schildgen, Johannes; Jörg, Thomas; Deßloch, Stefan
Das MapReduce-Programmiermodell ermöglicht die skalierbare Analyse und Transformation großer Datenmengen. Wir stellen das auf MapReduce basierende Marimba-Framework zur einfachen Entwicklung von inkrementellen, selbstwartbaren Programmen vor, welche bei Änderung von Quelldaten eine vollständige Wiederholung des MapReduce-Jobs vermeiden. Marimba wird anhand mehrerer Anwendungen illustriert und durch Leistungsmessungen evaluiert.
Zeitschriftenartikel
Efficient OR Hadoop: Why Not Both?
(Datenbank-Spektrum: Vol. 13, No. 1, 2013) Dittrich, Jens; Richter, Stefan; Schuh, Stefan
In this article, we give an overview of research related to Big Data processing in Hadoop going on at the Information Systems Group at Saarland University. We discuss how to make Hadoop efficient. We briefly survey three of our projects in this context: Hadoop++, Trojan Layouts, and HAIL.
Zeitschriftenartikel
Editorial
(Datenbank-Spektrum: Vol. 13, No. 1, 2013) Härder, Theo
Zeitschriftenartikel
Bericht vom Herbsttreffen der GI-Fachgruppe Datenbanksysteme
(Datenbank-Spektrum: Vol. 13, No. 1, 2013) Kemper, Alfons; Mühlbauer, Tobias; Neumann, Thomas; Reiser, Angelika; Rödiger, Wolf
Zeitschriftenartikel
News
(Datenbank-Spektrum: Vol. 13, No. 1, 2013)
Zeitschriftenartikel
Parallel Entity Resolution with Dedoop
(Datenbank-Spektrum: Vol. 13, No. 1, 2013) Kolb, Lars; Rahm, Erhard
We provide an overview of Dedoop (Deduplication with Hadoop), a new tool for parallel entity resolution (ER) on cloud infrastructures. Dedoop supports a browser-based specification of complex ER strategies and provides a large library of blocking and matching approaches. To simplify the configuration of ER strategies with several similarity metrics, training-based machine learning approaches can be employed with Dedoop. Specified ER strategies are automatically translated into MapReduce jobs for parallel execution on different Hadoop clusters. For improved performance, Dedoop supports redundancy-free multi-pass blocking as well as advanced load balancing approaches. To illustrate the usefulness of Dedoop, we present the results of a comparative evaluation of different ER strategies on a challenging real-world dataset.
Zeitschriftenartikel
Dissertationen
(Datenbank-Spektrum: Vol. 13, No. 1, 2013)

Autor*innen mit den meisten Dokumenten

Härder, Theo

Kemper, Alfons

Lehner, Wolfgang

Boehm, Matthias

Dannecker, Lars

Neueste Veröffentlichungen

Treffer pro Seite

Sortieroptionen