Logo des Repositoriums
 

Iterative Computation of Connected Graph Components with MapReduce

dc.contributor.authorKolb, Lars
dc.contributor.authorSehili, Ziad
dc.contributor.authorRahm, Erhard
dc.date.accessioned2018-01-10T13:19:39Z
dc.date.available2018-01-10T13:19:39Z
dc.date.issued2014
dc.description.abstractThe use of the MapReduce framework for iterative graph algorithms is challenging. To achieve high performance it is critical to limit the amount of intermediate results as well as the number of necessary iterations. We address these issues for the important problem of finding connected components in large graphs. We analyze an existing MapReduce algorithm, CC-MR, and present techniques to improve its performance including a memory-based connection of subgraphs in the map phase. Our evaluation with several large graph datasets shows that the improvements can substantially reduce the amount of generated data by up to a factor of 8.8 and runtime by up to factor of 3.5.
dc.identifier.pissn1610-1995
dc.identifier.urihttps://dl.gi.de/handle/20.500.12116/11710
dc.publisherSpringer
dc.relation.ispartofDatenbank-Spektrum: Vol. 14, No. 2
dc.relation.ispartofseriesDatenbank-Spektrum
dc.subjectConnected graph components
dc.subjectHadoop
dc.subjectMapReduce
dc.subjectTransitive closure
dc.titleIterative Computation of Connected Graph Components with MapReduce
dc.typeText/Journal Article
gi.citation.endPage117
gi.citation.startPage107

Dateien