Explore FREDDY: Fast Word Embeddings in Database Systems
Word embeddings encode many semantic as well as syntactic features and are therefore useful in many tasks, especially in Natural Language Processing and Information Retrieval. FREDDY (Fast woRd EmbedDings Database sYstems), an extended PostgreSQL database system, allowing the user to analyze structured knowledge in ...
BTW2019 - Datenbanksysteme für Business, Technologie und Web
Understanding Trolls with Efficient Analytics of Large Graphs in Neo4j
Analytics of large graph data sets has become an important means of understanding and influencing the world. The use of graph database technology in the International Consortium of Investigative Journalists’ (ICIJ) investigation of the Panama Papers and Paradise Papers or in cancer research illustrates how analysing ...
Ganzheitliches Metadatenmanagement im Data Lake: Anforderungen, IT-Werkzeuge und Herausforderungen in der Praxis
Data lakes have become established in industrial practice as platforms for storing and analyzing all kinds of (raw) data. Extended requirements regarding governance and self-service make metadata management in the data lake a critical success factor. So far, however, there are only a few ...
Perceptual Relational Attributes: Navigating and Discovering Shared Perspectives from User-Generated Reviews
Effectively modelling and querying experience items like movies, books, or games in databases is challenging because these items are better described by their resulting user experience or perceived properties than by factual attributes. However, such information is often subjective, disputed, or unclear. Thus, social ...
In-Database Machine Learning: Gradient Descent and Tensor Algebra for Main Memory Database Systems
Machine learning tasks such as regression, clustering, and classification are typically performed outside of database systems using dedicated tools, necessitating the extraction, transformation, and loading of data. We argue that database systems when extended to enable automatic differentiation, gradient descent, and ...
IBM Cloud Databases: Turning Open Source Databases Into Cloud Services
Databases in all their forms are the backbone of most applications, running in the Cloud or on-premise. This creates a large demand for hosted, as-a-service database systems that are used either by Cloud applications, or even by on-premise applications. Through this demand, two types of offerings were created: new ...
Eliminating the Bandwidth Bottleneck of Central Query Dispatching Through TCP Connection Hand-Over
In scale-out database architectures, client queries must be routed to individual backend database servers for processing. In dynamic database systems, where backend servers join and leave a cluster or data partitions move between servers, clients do not know which server to send queries to. Using a central dispatcher, ...
The Best of Both Worlds: Combining Hand-Tuned and Word-Embedding-Based Similarity Measures for Entity Resolution
Recently, word embedding has become a beneficial technique for diverse natural language processing tasks, especially after the successful introduction of several popular neural word embedding models, such as word2vec, GloVe, and FastText. Also entity resolution, i.e., the task of identifying digital records that refer to ...
Graph Data Transformations in Gradoop
The analysis of graph data using graph database and distributed graph processing systems has gained significant interest. However, relatively little effort has been devoted to preparing the graph data for analysis, in particular to transform and integrate data from different sources. To support such ETL processes for ...