P266 - BTW2017 - Datenbanksysteme für Business, Technologie und Web - Workshopband

https://dl.gi.de/handle/20.500.12116/21091

Autor*innen mit den meisten Dokumenten

Ritter, Norbert

Gessert, Felix

Wingerath, Wolfram

Endres, Markus

Friedrich, Steffen

Auflistung nach:

1 - 10 von 47

Konferenzbeitrag
Workshop Big (and small) Data in Science and Humanities (BigDS17)
(Datenbanksysteme für Business, Technologie und Web (BTW 2017) - Workshopband, 2017) Groß, Anika; König-Ries, Birgitta; Reimann, Peter; Seeger, Bernhard
Konferenzbeitrag
Post-Debugging in Large Scale Big Data Analytic Systems
(Datenbanksysteme für Business, Technologie und Web (BTW 2017) - Workshopband, 2017) Bergen, Eduard; Edlich, Stefan
Data scientists often need to fine tune and resubmit their jobs when processing a large quantity of data in big clusters because of a failed behavior of currently executed jobs. Consequently, data scientists also need to filter, combine, and correlate large data sets. Hence, debugging a job locally helps data scientists to figure out the root cause and increases efficiency while simplifying the working process. Discovering the root cause of failures in distributed systems involve a different kind of information such as the operating system type, executed system applications, the execution state, and environment variables. In general, log files contain this type of information in a cryptic and large structure. Data scientists need to analyze all related log files to get more insights about the failure and this is cumbersome and slow. Another possibility is to use our reference architecture. We extract remote data and replay the extraction on the developer’s local debugging environment.
Konferenzbeitrag
A Deep Learning-based Approach for Banana Leaf Diseases Classification
(Datenbanksysteme für Business, Technologie und Web (BTW 2017) - Workshopband, 2017) Amara, Jihen; Bouaziz, Bassem; Algergawy, Alsayed
Plant diseases are important factors as they result in serious reduction in quality and quantity of agriculture products. Therefore, early detection and diagnosis of these diseases are important. To this end, we propose a deep learning-based approach that automates the process of classifying ba- nana leaves diseases. In particular, we make use of the LeNet architecture as a convolutional neural network to classify image data sets. The preliminary results demonstrate the effectiveness of the proposed approach even under challenging conditions such as illumination, complex background, different resolution, size, pose, and orientation of real scene images.
Konferenzbeitrag
Experiences with the Model-based Generation of Big Data Pipelines
(Datenbanksysteme für Business, Technologie und Web (BTW 2017) - Workshopband, 2017) Eichelberger, Holger; Qin, Cui; Schmid, Klaus
Developing Big Data applications implies a lot of schematic or complex structural tasks, which can easily lead to implementation errors and incorrect analysis results. In this paper, we present a model-based approach that supports the automatic generation of code to handle these repetitive tasks, enabling data engineers to focus on the functional aspects without being distracted by technical issues. In order to identify a solution, we analyzed different Big Data stream-processing frameworks, extracted a common graph-based model for Big Data streaming applications and de- veloped a tool to graphically design and generate such applications in a model-based fashion (in this work for Apache Storm). Here, we discuss the concepts of the approach, the tooling and, in particular, experiences with the approach based on feedback of our partners.
Konferenzbeitrag
Mining Industrial Logs for System Level Insights
(Datenbanksysteme für Business, Technologie und Web (BTW 2017) - Workshopband, 2017) Czora, Sebastian; Dix, Marcel; Fromm, Hansjörg; Klöpper, Benjamin; Schmitz, Björn
Industrial systems are becoming more and more complex and expensive to operate. Companies are making considerable efforts to increase operational efficiency and eliminate unplanned downtime of their equipment. Condition monitoring has been applied to improve equipment availability and reliability. Most of the condition monitoring applications, however, focus on single components, not on entire systems. The objective of this research was to demonstrate that a combination of visual analytics and association rule mining can be successfully used in a condition monitoring context on system level.
Editiertes Buch
Datenbanksysteme für Business, Technologie und Web (BTW 2017) - Workshopband
(2017)
Konferenzbeitrag
BTW 2017 Data Science Challenge (SDSC17)
(Datenbanksysteme für Business, Technologie und Web (BTW 2017) - Workshopband, 2017) Waizenegger, Tim
Konferenzbeitrag
Multimedia Similarity Search
(Datenbanksysteme für Business, Technologie und Web (BTW 2017) - Workshopband, 2017) Seidl, Thomas
Konferenzbeitrag
Understanding Trending Topics in Twitter
(Datenbanksysteme für Business, Technologie und Web (BTW 2017) - Workshopband, 2017) Kahlert, Roland; Liebeck, Matthias; Cornelius, Joseph
Many events, for instance in sports, political events, and entertainment, happen all over the globe all the time. It is difficult and time consuming to notice all these events, even with the help of different news sites. We use tweets from Twitter to automatically extract information in order to understand hashtags of real-world events. In our paper, we focus on the topic identification of a hashtag, analyze the expressed positive, neutral, and negative sentiments of users, and further investigate the expressed emotions. We crawled English tweets from 24 hashtags and report initial investigation results.
Konferenzbeitrag
Vergleich und Evaluation von RDF-on-Hadoop-Lösungen
(Datenbanksysteme für Business, Technologie und Web (BTW 2017) - Workshopband, 2017) Amann, Wolfgang
Mit der steigenden Anzahl von Daten, welche in Form des Resource Description Frame- work (RDF) veröffentlicht werden entsteht eine Menge von Daten, bei der Datenoperationen nicht mehr von einem einzelnen Rechner zu bewältigen sind. In dieser Arbeit werden Systeme vorgestellt, welche zur Lösung dieses Problems das Hadoop-Framework ausschließlich bzw. in Kombination mit anderen Big-Data-Frameworks nutzen. Danach werden mit PigSPARQL und Rya zwei dieser Ansätze, welche exemplarisch für die neuere Entwicklung dieser RDF-on-Hadoop-Systeme stehen, anhand der Benchmark-Queries der Waterloo SPARQL Diversity Test Suite auf spezifische Stärken und Schwächen analysiert.

Autor*innen mit den meisten Dokumenten

Ritter, Norbert

Gessert, Felix

Wingerath, Wolfram

Endres, Markus

Friedrich, Steffen

Neueste Veröffentlichungen

Treffer pro Seite

Sortieroptionen