Data Management Challenges in Next Generation Sequencing

dc.contributor.author: Wandelt, Sebastian
dc.contributor.author: Rheinländer, Astrid
dc.contributor.author: Bux, Marc
dc.contributor.author: Thalheim, Lisa
dc.contributor.author: Haldemann, Berit
dc.contributor.author: Leser, Ulf
dc.date.accessioned: 2018-01-10T13:18:44Z
dc.date.available: 2018-01-10T13:18:44Z
dc.date.issued: 2012
dc.description.abstract: Since the early days of the Human Genome Project, data management has been recognized as a key challenge for modern molecular biology research. By the end of the nineties, technologies had been established that adequately supported most ongoing projects, typically built upon relational database management systems. However, recent years have seen a dramatic increase in the amount of data produced by typical projects in this domain. While it took more than ten years, approximately three billion USD, and more than 200 groups worldwide to assemble the first human genome, today’s sequencing machines produce the same amount of raw data within a week, at a cost of approximately 2000 USD, and on a single device. Several national and international projects now deal with (tens of) thousands of genomes, and trends like personalized medicine call for efforts to sequence entire populations. In this paper, we highlight challenges that emerge from this flood of data, such as parallelization of algorithms, compression of genomic sequences, and cloud-based execution of complex scientific workflows. We also point to a number of further challenges that lie ahead due to the increasing demand for translational medicine, i.e., the accelerated transition of biomedical research results into medical practice.
dc.identifier.pissn: 1610-1995
dc.identifier.uri: https://dl.gi.de/handle/20.500.12116/11664
dc.publisher: Springer
dc.relation.ispartof: Datenbank-Spektrum: Vol. 12, No. 3
dc.relation.ispartofseries: Datenbank-Spektrum
dc.title: Data Management Challenges in Next Generation Sequencing
dc.type: Text/Journal Article
gi.citation.endPage: 171
gi.citation.startPage: 161

Files