Logo des Repositoriums
 

Deep Convolutional Neural Networks for Pose Estimation in Image-Graphics Search

dc.contributor.authorEberts, Markus
dc.contributor.authorUlges, Adrian
dc.contributor.editorEibl, Maximilian
dc.contributor.editorGaedke, Martin
dc.date.accessioned2017-08-28T23:49:29Z
dc.date.available2017-08-28T23:49:29Z
dc.date.issued2017
dc.description.abstractDeep Convolutional Neural Networks (CNNs) have recently been highly successful in various image understanding tasks, ranging from object category recognition over image classification to scene segmentation. We employ CNNs for pose estimation in a cross-modal retrieval system, which -given a photo of an object -allows users to retrieve the best match from a repository of 3D models. As our system is supposed to display retrieved 3D models from the same perspective as the query image (potentially with virtual objects blended over), the pose of the object relative to the camera needs to be estimated. To do so, we study two CNN models. The first is based on end-to-end learning, i.e. a regression neural network directly estimates the pose. The second uses transfer learning with a very deep CNN pre-trained on a large-scale image collection. In quantitative experiments on a set of 3D models and real-world photos of chairs, we compare both models and show that while the end-to-end learning approach performs well on the domain it was trained on (graphics) it suffers from the capability to generalize to a new domain (photos). The transfer learning approach on the other hand handles this domain drift much better, resulting in an average angle deviation from the ground truth angle of about 14 degrees on photos.en
dc.identifier.doi10.18420/in2017_92
dc.identifier.isbn978-3-88579-669-5
dc.identifier.pissn1617-5468
dc.language.isoen
dc.publisherGesellschaft für Informatik, Bonn
dc.relation.ispartofINFORMATIK 2017
dc.relation.ispartofseriesLecture Notes in Informatics (LNI) - Proceedings, Volume P-275
dc.subjectpose estimation
dc.subjectimage retrieval
dc.subjectdeep learning
dc.subjecttransfer learning
dc.titleDeep Convolutional Neural Networks for Pose Estimation in Image-Graphics Searchen
gi.citation.endPage914
gi.citation.startPage907
gi.conference.date25.-29. September 2017
gi.conference.locationChemnitz
gi.conference.sessiontitleSENSYBLE – Smart Systems for Better Living Environments

Dateien

Originalbündel
1 - 1 von 1
Lade...
Vorschaubild
Name:
B11-5.pdf
Größe:
11.98 MB
Format:
Adobe Portable Document Format