Auflistung nach Schlagwort "Active learning"
1 - 2 von 2
Treffer pro Seite
Sortieroptionen
- ZeitschriftenartikelAccounting for Task-Difficulty in Active Multi-Task Robot Control Learning(KI - Künstliche Intelligenz: Vol. 29, No. 4, 2015) Fabisch, Alexander; Metzen, Jan Hendrik; Krell, Mario Michael; Kirchner, FrankContextual policy search is a reinforcement learning approach for multi-task learning in the context of robot control learning. It can be used to learn versatilely applicable skills that generalize over a range of tasks specified by a context vector. In this work, we combine contextual policy search with ideas from active learning for selecting the task in which the next trial will be performed. Moreover, we use active training set selection for reducing detrimental effects of exploration in the sampling policy. A core challenge in this approach is that the distribution of the obtained rewards may not be directly comparable between different tasks. We propose the novel approach PUBSVE for estimating a reward baseline and investigate empirically on benchmark problems and simulated robotic tasks to which extent this method can remedy the issue of non-comparable reward.
- ZeitschriftenartikelActive and Incremental Learning with Weak Supervision(KI - Künstliche Intelligenz: Vol. 34, No. 2, 2020) Brust, Clemens-Alexander; Käding, Christoph; Denzler, JoachimLarge amounts of labeled training data are one of the main contributors to the great success that deep models have achieved in the past. Label acquisition for tasks other than benchmarks can pose a challenge due to requirements of both funding and expertise. By selecting unlabeled examples that are promising in terms of model improvement and only asking for respective labels, active learning can increase the efficiency of the labeling process in terms of time and cost. In this work, we describe combinations of an incremental learning scheme and methods of active learning. These allow for continuous exploration of newly observed unlabeled data. We describe selection criteria based on model uncertainty as well as expected model output change (EMOC). An object detection task is evaluated in a continuous exploration context on the PASCAL VOC dataset. We also validate a weakly supervised system based on active and incremental learning in a real-world biodiversity application where images from camera traps are analyzed. Labeling only 32 images by accepting or rejecting proposals generated by our method yields an increase in accuracy from 25.4 to 42.6%.