Auflistung Datenbank Spektrum 12(3) - November 2012 nach Erscheinungsdatum
1 - 10 von 12
Treffer pro Seite
Sortieroptionen
- ZeitschriftenartikelData Management Challenges in Next Generation Sequencing(Datenbank-Spektrum: Vol. 12, No. 3, 2012) Wandelt, Sebastian; Rheinländer, Astrid; Bux, Marc; Thalheim, Lisa; Haldemann, Berit; Leser, UlfSince the early days of the Human Genome Project, data management has been recognized as a key challenge for modern molecular biology research. By the end of the nineties, technologies had been established that adequately supported most ongoing projects, typically built upon relational database management systems. However, recent years have seen a dramatic increase in the amount of data produced by typical projects in this domain. While it took more than ten years, approximately three billion USD, and more than 200 groups worldwide to assemble the first human genome, today’s sequencing machines produce the same amount of raw data within a week, at a cost of approximately 2000 USD, and on a single device. Several national and international projects now deal with (tens of) thousands of genomes, and trends like personalized medicine call for efforts to sequence entire populations. In this paper, we highlight challenges that emerge from this flood of data, such as parallelization of algorithms, compression of genomic sequences, and cloud-based execution of complex scientific workflows. We also point to a number of further challenges that lie ahead due to the increasing demand for translational medicine, i.e., the accelerated transition of biomedical research results into medical practice.
- ZeitschriftenartikelTowards Large-Scale Meteorological Data Services: A Case Study(Datenbank-Spektrum: Vol. 12, No. 3, 2012) Misev, Dimitar; Baumann, Peter; Seib, JürgenMeteorological data contribute significantly to “Big Data”, handling multi-dimensional raster data cubes up to 5-D and with single cubes up to multi-Petabyte sizes. Due to the lack of support for raster data, traditionally file-based implementations have been used for serving such data to the community, rather than databases. Array databases overcome this by providing storage and query support.In this paper, we present a case study conducted by Deutscher Wetterdienst (DWD) where extraction and processing of gridded meteorological data sets has been investigated hands-on. Following a brief introduction of the rasdaman DBMS used, we present the database schema used and a series of array queries, selected according to their practical importance in weather forecast services. We discuss several issues that have come up, such as null values and time modeling, and how they have been addressed. To the best of our knowledge, this is the first non-academic deployment of an array database for up to 5-D data sets.
- ZeitschriftenartikelDer 175. Datenbankstammtisch an der HTW Dresden(Datenbank-Spektrum: Vol. 12, No. 3, 2012) Wloka, Uwe; Gräfe, Gunter
- ZeitschriftenartikelDie deutsche DB-Community trauert um Hagen Höpfner (25.01.1977–01.10.2012)(Datenbank-Spektrum: Vol. 12, No. 3, 2012)
- ZeitschriftenartikelBericht über den 24. GI-Workshop “Grundlagen von Datenbanken”(Datenbank-Spektrum: Vol. 12, No. 3, 2012) Schmitt, Ingo; Höpfner, Hagen
- ZeitschriftenartikelScientific Workflows and Provenance: Introduction and Research Opportunities(Datenbank-Spektrum: Vol. 12, No. 3, 2012) Cuevas-Vicenttín, Víctor; Dey, Saumen; Köhler, Sven; Riddle, Sean; Ludäscher, BertramScientific workflows are becoming increasingly popular for compute-intensive and data-intensive scientific applications. The vision and promise of scientific workflows includes rapid, easy workflow design, reuse, scalable execution, and other advantages, e.g., to facilitate “reproducible science” through provenance (e.g., data lineage) support. However, as described in the paper, important research challenges remain. While the database community has studied (business) workflow technologies extensively in the past, most current work in scientific workflows seems to be done outside of the database community, e.g., by practitioners and researchers in the computational sciences and eScience. We provide a brief introduction to scientific workflows and provenance, and identify areas and problems that suggest new opportunities for database research.
- ZeitschriftenartikelHandling Big Data in Astronomy and Astrophysics: Rich Structured Queries on Replicated Cloud Data with XtreemFS(Datenbank-Spektrum: Vol. 12, No. 3, 2012) Enke, Harry; Partl, Adrian; Reinefeld, Alexander; Schintke, FlorianWith recent observational instruments and survey campaigns in astrophysics, efficient analysis of big structured data becomes more and more relevant. While providing good query expressiveness and data analysis capabilities through SQL, off-the-shelf RDBMS are yet not well prepared to handle high volume scientific data distributed across several nodes, neither for fast data ingest nor for fast spatial queries. Our SQL query parser and job manager performs query reformulation to spread queries to data nodes, gathering outputs on a head node and providing them again to the shards for subsequent processing steps. We combine this data analysis architecture with the cloud data storage component XtreemFS for automatic data replication to improve the availability and access latency. With our solution we perform rich structured data analysis expressed using SQL on large amounts of structured astrophysical data distributed across numerous storage nodes in parallel. The cloud storage virtualization with XtreemFS provides elasticity and reproducibility of scientific analysis tasks through its snapshot capability.
- ZeitschriftenartikelCityPlot: Colored ER Diagrams to Visualize Structure and Contents of Databases(Datenbank-Spektrum: Vol. 12, No. 3, 2012) Dugas, Martin; Vossen, GottfriedCityPlot generates an extended version of a traditional entity-relationship diagram for a database. It is intended to provide a combined view of database structure and contents. The graphical output resembles the metaphor of a city. Data points are visualized according to data type and completeness. An open source reference implementation is available from http://cran.r-project.org/.
- ZeitschriftenartikelDissertationen(Datenbank-Spektrum: Vol. 12, No. 3, 2012)
- ZeitschriftenartikelNews(Datenbank-Spektrum: Vol. 12, No. 3, 2012)