Browsing by keyword "Metadata"
- Journal article: Collecting and visualizing data lineage of Spark jobs (Datenbank-Spektrum: Vol. 21, No. 3, 2021). Schoenenwald, Alexander; Kern, Simon; Viehhauser, Josef; Schildgen, Johannes. Metadata management constitutes a key prerequisite for enterprises as they engage in data analytics and governance. Today, however, the context of data is often only manually documented by subject matter experts, and lacks completeness and reliability due to the complex nature of data pipelines. Thus, collecting data lineage (describing the origin, structure, and dependencies of data) in an automated fashion increases the quality of the provided metadata and reduces manual effort, making it critical for the development and operation of data pipelines. In our practice report, we propose an end-to-end solution that digests lineage via (Py-)Spark execution plans. We build upon the open-source component Spline, allowing us to reliably consume lineage metadata and identify interdependencies. We map the digested data into an expandable data model, enabling us to extract graph structures for both coarse- and fine-grained data lineage. Lastly, our solution visualizes the extracted data lineage via a modern web app, and integrates with BMW Group's soon-to-be open-sourced Cloud Data Hub.
- Conference paper: Complexities of Identity Provenance Metadata (Open Identity Summit 2021, 2021). Semančík, Radovan. Data provenance information is an important part of personal data protection mechanisms. However, the capabilities of existing identity management systems are severely limited when it comes to maintaining and processing data provenance information. This paper describes an effort to design and implement a capability to process provenance information in midPoint, an open source identity management and governance system. The solution used value metadata for the storage and processing of provenance information. The resulting prototype was fully integrated into the midPoint code base. The solution dealt with all layers of provenance information processing, from data acquisition to the user interface. The prototype uncovered a relation between provenance information and other metadata types, as well as the potential use of provenance-enriched metadata in conjunction with data protection mechanisms.
- Journal article: Einsatzpotenziale von XML in Business-Intelligence-Systemen (Wirtschaftsinformatik: Vol. 46, No. 1, 2004). Schwalm, Stephan; Bange, Carsten. This article describes current applications of XML in the context of business intelligence (BI) systems and emphasizes the potentials and synergies of XML and BI. XML applications can be found at all levels of BI systems. The article introduces the application of core standards as well as the possibilities of BI-specific standards. The discussion of the impact of XML on BI systems follows the dimensions of externalisation, integration, standardisation and rationalisation.
- Conference paper: NFDI4Energy Task Area 4: FAIR Data for Energy System Research (INFORMATIK 2023 - Designing Futures: Zukünfte gestalten, 2023). Wein, Amanda; Reinkensmeier, Jan; Weidlich, Anke; Lilliestam, Johan; Hagenmeyer, Veit; Lehnhoff, Sebastian. The NFDI4Energy consortium will create a research data infrastructure for energy system research, emphasizing the openness and FAIRness of data and models in this research domain. Within the consortium, Task Area 4 focuses on the development of resources and services that will provide a semantic layer for the overall platform built by NFDI4Energy. The team of this Task Area will produce artifacts including a domain ontology, metadata standards, a knowledge graph, a Persistent Identifier service, and integration infrastructure to join these artifacts to the NFDI4Energy platform.
- Journal article: Repository Systems. Teil 1: Mehrstufigkeit und Entwicklungsumgebung (Part 2 appears in the following issue) (Informatik-Spektrum: Vol. 22, No. 4, 1999). Ortner, Erich. Repositories are computer-assisted information systems about the information processing of an enterprise. They are also called data dictionary, development database, information resource dictionary system, catalog, or meta information system. With repositories, the documentation of the resource areas of information processing, such as the user and operator organization, the applications, the data resources, the base systems, the communication system, and the hardware, can be recorded in a structured way on a metalanguage level. In addition, various relationships between the documentation object types, within and between the resource areas, can be specified to improve the administration of the description data. The article first explains some fundamental concepts of the use and structure of repositories. It then turns to the organization of application development with the help of such systems, showing the path from individual areas of use to a comprehensive, enterprise-wide usage strategy for repositories. Finally, it discusses the tasks that arise sporadically or permanently when selecting, deploying, and operating repository systems, as well as the emerging evolution of these systems.
- Conference paper: Semi-automatic extraction of metadata from old geological maps (INFORMATIK 2023 - Designing Futures: Zukünfte gestalten, 2023). Bürgl, Kim; Müller, Lydia. Geological maps communicate geological information efficiently. Old geological maps were stored as paper maps and thus need to be digitized when integrating them into digital geographic information systems. Metadata is required to find relevant maps quickly. However, metadata is usually created manually with a lot of effort. We present work in progress on a semi-automated approach for extracting metadata from maps. The results show that it significantly lowers the manual effort needed to extract the location and at least improves the experience of manual annotation with respect to date metadata.