Logo des Repositoriums
 

Fast and accurate creation of annotated head pose image test beds as prerequisite for training neural networks

dc.contributor.authorKowerko, Danny
dc.contributor.authorManthey, Robert
dc.contributor.authorHeinz, Marcel
dc.contributor.authorKronfeld, Thomas
dc.contributor.authorBrunnett, Guido
dc.contributor.editorEibl, Maximilian
dc.contributor.editorGaedke, Martin
dc.date.accessioned2017-08-28T23:47:44Z
dc.date.available2017-08-28T23:47:44Z
dc.date.issued2017
dc.description.abstractIn this paper we present an experimental setup consisting of 36 cameras on 4 height levels covering more than half space around a centrally sitting person. The synchronous image release allows to build a 3D model of the human torso in this position. Using this so-called body scanner we recorded 36 different positions giving in total 1296 images in several minutes obtaining tens to hundreds of different pitch-roll-yaw head pose combinations with very high precision of less than +-5. From annotation of 7 facial keypoints (ears, eyes, nose, corners of the mouth) in the 36 calculated 3D models of a human head/upper body, we automatically get 1296 x 7 2D facial landmark points saving a factor 36 in annotation time. The projection of the 3D model to the camera provides a foreground/background separation mask of the person in each image usable for data set augmentation e.g. by inserting different backgrounds (required for training convolutional neural networks, CNNs). Moreover, we utilize our 3D model in combination with textures to create realistic images of the pitch-roll-yaw range not assessed in experiments. This interpolation is ad hoc applicable to a subset of 10 central out of 36 total camera views where fine-grained interpolation of head poses is possible. Using interpolation and background masks for background exchange enables us to augment the data set easily by a factor of 1000 or more knowing precisely pitch, roll, yaw and the 7 annotated facial keypoints in each image.en
dc.identifier.doi10.18420/in2017_221
dc.identifier.isbn978-3-88579-669-5
dc.identifier.pissn1617-5468
dc.language.isoen
dc.publisherGesellschaft für Informatik, Bonn
dc.relation.ispartofINFORMATIK 2017
dc.relation.ispartofseriesLecture Notes in Informatics (LNI) - Proceedings, Volume P-275
dc.subjecthuman pose
dc.subjecthead pose
dc.subject3D body scanner
dc.subjectimage annotation
dc.subjecttest bed
dc.subjectpitch
dc.subjectroll
dc.subjectyaw
dc.titleFast and accurate creation of annotated head pose image test beds as prerequisite for training neural networksen
gi.citation.endPage2229
gi.citation.startPage2221
gi.conference.date25.-29. September 2017
gi.conference.locationChemnitz
gi.conference.sessiontitleDeep Learning in heterogenen Datenbeständen

Dateien

Originalbündel
1 - 1 von 1
Lade...
Vorschaubild
Name:
B29-8.pdf
Größe:
25.69 MB
Format:
Adobe Portable Document Format