Continuous Image Classification on Data Streams using Contrastive Learning and Cluster Analysis

Schliebitz, Andreas

Textdokument

Continuous Image Classification on Data Streams using Contrastive Learning and Cluster Analysis

Dokumententyp

Text

Dateien

B1-paper.pdf (193.24 KB)

Datum

2023

Autor:innen

Schliebitz, Andreas

Quelle

DC@KI2023: Proceedings of Doctoral Consortium at KI 2023

Doctoral Consortium at KI 2023

Verlag

Gesellschaft für Informatik e.V.

Zusammenfassung

This PhD proposal presents a concept for an AI-based computer vision system trained on image data streams for real-time classification of unlabeled objects. The required AI models are trained through an underexplored combination of self-supervised and incremental learning. Emphasis will be placed on contrastive learning using the SimCLR framework and its successors. The need for such a system is motivated by the observation that supervised learning approaches often require large labeled datasets. The labeling process, which is usually performed manually, is not only time-consuming but inherently prone to errors. For sufficiently large image data streams, timely labeling of samples becomes impossible leading to sporadic data annotation cycles and the capture of only temporarily representative features. Such an approach might also render the resulting classifier vulnerable to domain shift and concept drift. The image data stream used in this proposal consists of unlabeled color images of clean potatoes, which are to be sorted into several defect classes by a self-supervised classifier. Contrastive transfer learning is performed on this image data stream for the selection of a feature extractor. In this approach, different pre-trained backbone architectures are adapted and evaluated using the SimCLR framework. The classifiers are evaluated based on their generated feature vectors using cluster analysis. This involves searching for novel evaluation methods that do not require labels and are more suitable for judging model performance than existing methods. Furthermore, by clustering the feature vectors, an automatic and adaptive classification might be achievable without the use of labels. In a subsequent step, the self-supervised classifiers are continuously improved using incremental learning methods. For this the models are incrementally trained on the image data stream over a longer period of time. Potential adjustments to the data stream could increase the classifier’s accuracy as well as make it more robust to domain adaptation problems. A final validation of the incrementally self-learning classification system can be performed with smaller, manually annotated datasets.

Schliebitz, Andreas (2023): Continuous Image Classification on Data Streams using Contrastive Learning and Cluster Analysis. DC@KI2023: Proceedings of Doctoral Consortium at KI 2023. DOI: 10.18420/ki2023-dc-04. Gesellschaft für Informatik e.V.. pp. 30-39. Doctoral Consortium at KI 2023. Berlin. 45195

Schlagwörter

Feature Learning, Self-Supervised Learning, Contrastive Learning, Incremental Learning, Image Classification, Data Stream Processing, Cluster Analysis, Computer Vision

DOI

10.18420/ki2023-dc-04

Sammlungen

KI 2023 - Doctoral Consortium

Komplettanzeige

Continuous Image Classification on Data Streams using Contrastive Learning and Cluster Analysis

Volltext URI

Dokumententyp

Dateien

Zusatzinformation

Datum

Autor:innen

Zeitschriftentitel

ISSN der Zeitschrift

Bandtitel

Quelle

Verlag

Zusammenfassung

Beschreibung

Schlagwörter

Zitierform

DOI

Tags

Sammlungen