Scalable DB+IR Technology: Processing Probabilistic Datalog with HySpirit

Frommholz, Ingo; Roelleke, Thomas

Journal Article

Document Type

Text/Journal Article

Date

2016

Authors

Frommholz, Ingo

Roelleke, Thomas

Source

Datenbank-Spektrum: Vol. 16, No. 1

Publisher

Springer

Abstract

Probabilistic Datalog (PDatalog, proposed in 1995) is a probabilistic variant of Datalog and a nice conceptual idea to model Information Retrieval in a logical, rule-based programming paradigm. Making PDatalog work in real-world applications requires more than probabilistic facts and rules, and the semantics associated with the evaluation of the programs. We report in this paper some of the key features of the HySpirit system required to scale the execution of PDatalog programs. Firstly, there is the requirement to express probability estimation in PDatalog. Secondly, fuzzy-like predicates are required to model vague predicates (e.g. vague match of attributes such as age or price). Thirdly, to handle large data sets there are scalability issues to be addressed, and therefore HySpirit provides probabilistic relational indexes and parallel and distributed processing. The main contribution of this paper is a consolidated view on the methods of the HySpirit system to make PDatalog applicable in real-scale applications that involve a wide range of requirements typical for data (information) management and analysis.

Frommholz, Ingo; Roelleke, Thomas (2016): Scalable DB+IR Technology: Processing Probabilistic Datalog with HySpirit. Datenbank-Spektrum: Vol. 16, No. 1. Springer. PISSN: 1610-1995. pp. 39-48

Keywords

DB+IR, HySpirit, Probabilistic Datalog, Scalability

Collections

Datenbank Spektrum 16(1) - March 2016
