Auflistung nach Autor:in "Saake, Gunter"
1 - 10 von 109
Treffer pro Seite
Sortieroptionen
- KonferenzbeitragAnalysis strategies for software product lines: A classification and survey(Software-engineering and management 2015, 2015) Thüm, Thomas; Apel, Sven; Kästner, Christian; Schaefer, Ina; Saake, GunterSoftware-product-line engineering enables the efficient development of similar software products. Instead of developing each product from scratch, products are generated from common artifacts. However, the product generation is a challenge for the analysis of correctness properties. Applying traditional analysis techniques, such as type checking and model checking, to each product involves redundant effort and is often not feasible due to the combinatorial explosion of products. Approaches to scale analysis techniques to product lines have been presented in unrelated research
- KonferenzbeitragApplying stratosphere for big data analytics(Datenbanksysteme für Business, Technologie und Web (BTW) 2046, 2013) Leich, Marcus; Adamek, Jochen; Schubotz, Moritz; Heise, Arvid; Rheinländer, Astrid; Markl, VolkerAnalyzing big data sets as they occur in modern business and science applications requires query languages that allow for the specification of complex data processing tasks. Moreover, these ideally declarative query specifications have to be optimized, parallelized and scheduled for processing on massively parallel data processing platforms. This paper demonstrates the application of Stratosphere to different kinds of Big Data Analytics tasks. Using examples from different application domains, we show how to formulate analytical tasks as Meteor queries and execute them with Stratosphere. These examples include data cleansing and information extraction tasks, and a correlation analysis of microblogging and stock trade volume data that we describe in detail in this paper.
- TextdokumentThe Best of Both Worlds: Combining Hand-Tuned and Word-Embedding-Based Similarity Measures for Entity Resolution(BTW 2019, 2019) Chen, Xiao; Campero Durand, Gabriel; Zoun, Roman; Broneske, David; Li, Yang; Saake, GunterRecently word embedding has become a beneficial technique for diverse natural language processing tasks, especially after the successful introduction of several popular neural word embedding models, such as word2vec, GloVe, and FastText. Also entity resolution, i.e., the task of identifying digital records that refer to the same real-world entity, has been shown to benefit from word embedding. However, the use of word embeddings does not lead to a one-size-fits-all solution, because it cannot provide an accurate result for those values without any semantic meaning, such as numerical values. In this paper, we propose to use the combination of general word embedding with traditional hand-picked similarity measures for solving ER tasks, which aims to select the most suitable similarity measure for each attribute based on its property. We provide some guidelines on how to choose suitable similarity measures for different types of attributes and evaluate our proposed hybrid method on both synthetic and real datasets. Experiments show that a hybrid method reliant on correctly selecting required similarity measures can outperform the method of purely adopting traditional or word-embedding-based similarity measures.
- KonferenzbeitragBestimmung der semantischen Eigenschaften von Datenstromsystemen durch Black-Box-Tests(Datenbanksysteme für Business, Technologie und Web (BTW) 2013 - Workshopband, 2013) Lauterwald, Frank; Pollner, Niko; Meyer-Wegener, KlausDie Semantik von Datenstromsystemen (DSS) ist bislang nicht standardisiert. Für Anwendungsentwickler ist es jedoch wichtig zu wissen, wie sich ein bestimmtes System in einer bestimmten Situation verhält. Ebenso bedeutsam ist das Verhalten für föderierte Datenstromsysteme, die Anfragen automatisch auf verschiedene DSS verteilen. Als Hilfsmittel zur Beschreibung können semantische Modelle dienen. Diese werden parametrisiert und können durch verschiedene Parameterwerte das Verhalten verschiedener Systeme nachbilden. Da bisher auch kein allgemein anerkanntes Modell zur Beschreibung von DSS existiert, muss man sich möglicherweise mit verschiedenen Modellen auseinandersetzen. Daher wäre es hilfreich, die Bestimmung der jeweiligen Parameterwerte weitgehend zu automatisieren, wozu dieser Beitrag eine geeignete Evaluationsumgebung vorstellt. Diese vergleicht die Ausgaben eines DSS mit allen Vorhersagen, die ein Modell für verschiedene Parameter machen kann. Stimmen die Ergebnisse überein, sind die Parameter gefunden. Erfahrungen damit und Beschränkungen dieses Ansatzes werden diskutiert.
- KonferenzbeitragBridging the gap between variability in client application and database schema(Datenbanksysteme in Business, Technologie und Web (BTW) – 13. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme" (DBIS), 2009) Siegmund, Norbert; Kästner, Christian; Rosenmüller, Marko; Heidenreich, Florian; Apel, Sven; Saake, GunterDatabase schemas are used to describe the logical design of a database. Diverse groups of users have different views on the global schema which leads to different local schemas. Research has focused on view integration to generate a global, consistent sch
- ZeitschriftenartikelBTW 2013 – Zwischen wissenschaftlicher Geschichte und moderner Herausforderung(Datenbank-Spektrum: Vol. 13, No. 2, 2013) Köppen, Veit; Schäler, Martin; Grebhahn, Alexander; Saake, Gunter
- KonferenzbeitragCDIM - call for papers(Datenbanksysteme für Business, Technologie und Web (BTW) 2013 - Workshopband, 2013) Nürnberger, Andreas; Balke, Wolf-TiloThe first CDIM workshop on crowd-enabled data and information management was held in conjunction with 15th GI-Fachtagung Datenbanksysteme für Business, Technologie und Web (BTW), Magdeburg, Germany, 2013.
- KonferenzbeitragComposition methods for link discovery(Datenbanksysteme für Business, Technologie und Web (BTW) 2029, 2013) Hartung, Michael; Groß, Anika; Rahm, ErhardThe Linked Open Data community publishes an increasing number of data sources on the so-called Data Web and interlinks them to support data integration applications. We investigate how the composition of existing links and mappings can help discovering new links and mappings between LOD sources. Often there will be many alternatives for composition so that the problem arises which paths can provide the best linking results with the least computation effort. We therefore investigate different methods to select and combine the most suitable mapping paths. We also propose an approach for selecting and composing individual links instead of entire mappings. We comparatively evaluate the methods on several real-world linking problems from the LOD cloud. The results show the high value of reusing and composing existing links as well as the high effectiveness of our methods.
- KonferenzbeitragCompositional Analyses of Highly-Configurable Systems with Feature-Model Interfaces(Software Engineering 2017, 2017) Schröter, Reimar; Krieter, Sebastian; Thüm, Thomas; Benduhn, Fabian; Saake, GunterToday’s software systems are often customizable by means of load-time or compile-time configuration options. These options are typically not independent and their dependencies can be specified by means of feature models. As many industrial systems contain thousands of options, the maintenance and utilization of feature models is a challenge for all stakeholders. In the last two decades, numerous approaches have been presented to support stakeholders in analyzing feature models. Such analyses are commonly reduced to satisfiability problems, which suffer from the growing number of options. While first attempts have been made to decompose feature models into smaller parts, they still require to compose all parts for analyses. We proposed the concept of a feature-model interface that only consists of a subset of features and hides all other features and dependencies. Based on a formalization of feature-model interfaces, we proved compositionality properties. We evaluated feature-model interfaces using a three-month history of an industrial fea- ture model with 18,616 features. Our results indicate performance benefits especially under evolution as often only parts of the feature model need to be analyzed again.
- KonferenzbeitragConcept for a web based support of the development process(Datenbanksysteme für Business, Technologie und Web (BTW) 2013 - Workshopband, 2013) Oellrich, Marc; Mantwill, FrankDuring the last years there have been some developments in the internet which might support the product development process. Some ideas at the beginning of the millennium have shown that web based systems can raise the efficiency, but the possibilities are nowadays much higher. While at that time representations have been only in a static state, they can now be handled much more user-friendly and get accepted like shown with Wikipedia or Facebook. Interesting further opportunities are given by Open Innovation, where problems are solved by a big amount of online users. This concept will show principal components of an integrated web based system, which supports the development methodological approach, reduces the workload to collect and enter redundant data, allows collaborative and partially asynchronous cooperation and will contribute to determine and map the knowledge and experience of the employees, what should lead to a higher ability\&nbError: Illegal entry in bfrange block in ToUnicode CMap sp;to compete and gives a clear competitive advantage.