Post-Debugging in Large Scale Big Data Analytic Systems
dc.contributor.author | Bergen, Eduard | |
dc.contributor.author | Edlich, Stefan | |
dc.contributor.editor | Mitschang, Bernhard | |
dc.contributor.editor | Nicklas, Daniela | |
dc.contributor.editor | Leymann, Frank | |
dc.contributor.editor | Schöning, Harald | |
dc.contributor.editor | Herschel, Melanie | |
dc.contributor.editor | Teubner, Jens | |
dc.contributor.editor | Härder, Theo | |
dc.contributor.editor | Kopp, Oliver | |
dc.contributor.editor | Wieland, Matthias | |
dc.date.accessioned | 2017-06-21T11:24:45Z | |
dc.date.available | 2017-06-21T11:24:45Z | |
dc.date.issued | 2017 | |
dc.description.abstract | Data scientists often need to fine tune and resubmit their jobs when processing a large quantity of data in big clusters because of a failed behavior of currently executed jobs. Consequently, data scientists also need to filter, combine, and correlate large data sets. Hence, debugging a job locally helps data scientists to figure out the root cause and increases efficiency while simplifying the working process. Discovering the root cause of failures in distributed systems involve a different kind of information such as the operating system type, executed system applications, the execution state, and environment variables. In general, log files contain this type of information in a cryptic and large structure. Data scientists need to analyze all related log files to get more insights about the failure and this is cumbersome and slow. Another possibility is to use our reference architecture. We extract remote data and replay the extraction on the developer’s local debugging environment. | en |
dc.identifier.isbn | 978-3-88579-660-2 | |
dc.identifier.pissn | 1617-5468 | |
dc.language.iso | en | |
dc.publisher | Gesellschaft für Informatik e.V. | |
dc.relation.ispartof | Datenbanksysteme für Business, Technologie und Web (BTW 2017) - Workshopband | |
dc.relation.ispartofseries | Lecture Notes in Informatics (LNI) - Proceedings, Volume P-266 | |
dc.subject | Software debugging | |
dc.subject | Bug detection | |
dc.subject | localization and diagnosis | |
dc.subject | Java Virtual Machine | |
dc.subject | JVMTI | |
dc.subject | Bytecode instrumentation | |
dc.subject | Apache Flink | |
dc.subject | Application-level failures | |
dc.title | Post-Debugging in Large Scale Big Data Analytic Systems | en |
dc.type | Text/Conference Paper | |
gi.citation.endPage | 74 | |
gi.citation.publisherPlace | Bonn | |
gi.citation.startPage | 65 | |
gi.conference.date | 6.-10. März 2017 | |
gi.conference.location | Stuttgart | |
gi.conference.sessiontitle | Workshop Big Data Management Systems in Business and Industrial Applications (BigBIA17) |
Dateien
Originalbündel
1 - 1 von 1