Efficient Storage and Analysis of Genome Data in Databases

dc.contributor.author: Dorok, Sebastian
dc.contributor.author: Breß, Sebastian
dc.contributor.author: Teubner, Jens
dc.contributor.author: Läpple, Horstfried
dc.contributor.author: Saake, Gunter
dc.contributor.author: Markl, Volker
dc.contributor.editor: Mitschang, Bernhard
dc.contributor.editor: Nicklas, Daniela
dc.contributor.editor: Leymann, Frank
dc.contributor.editor: Schöning, Harald
dc.contributor.editor: Herschel, Melanie
dc.contributor.editor: Teubner, Jens
dc.contributor.editor: Härder, Theo
dc.contributor.editor: Kopp, Oliver
dc.contributor.editor: Wieland, Matthias
dc.date.accessioned: 2017-06-20T20:24:32Z
dc.date.available: 2017-06-20T20:24:32Z
dc.date.issued: 2017
dc.description.abstract: Genome-analysis enables researchers to detect mutations within genomes and deduce their consequences. Researchers need reliable analysis platforms to ensure reproducible and comprehensive analysis results. Database systems provide vital support to implement the required sustainable procedures. Nevertheless, they are not used throughout the complete genome-analysis process, because (1) database systems suffer from high storage overhead for genome data and (2) they introduce overhead during domain-specific analysis. To overcome these limitations, we integrate genome-specific compression into database systems using a specialized database schema. Thus, we can reduce the storage overhead to 30%. Moreover, we can exploit genome-data characteristics during query processing allowing us to analyze real-world data sets up to five times faster than specialized analysis tools and eight times faster than a straightforward database approach. [en]
dc.identifier.isbn: 978-3-88579-659-6
dc.identifier.pissn: 1617-5468
dc.language.iso: en
dc.publisher: Gesellschaft für Informatik, Bonn
dc.relation.ispartof: Datenbanksysteme für Business, Technologie und Web (BTW 2017)
dc.relation.ispartofseries: Lecture Notes in Informatics (LNI) - Proceedings, Volume P-265
dc.subject: main-memory database systems
dc.subject: genome analysis
dc.subject: variant calling
dc.title: Efficient Storage and Analysis of Genome Data in Databases [en]
dc.type: Text/Conference Paper
gi.citation.endPage: 442
gi.citation.startPage: 423
gi.conference.date: March 6-10, 2017
gi.conference.location: Stuttgart
gi.conference.sessiontitle: Scientific Data and Hardware

Files

Original bundle
Name: paper28.pdf
Size: 640.07 KB
Format: Adobe Portable Document Format