What Kind of Questions Do Developers Ask on Stack Overflow? A Comparison of Automated Approaches to Classify Posts Into Question Categories

dc.contributor.author: Beyer, Stefanie
dc.contributor.author: Macho, Christian
dc.contributor.author: Di Penta, Massimiliano
dc.contributor.author: Pinzger, Martin
dc.contributor.editor: Koziolek, Anne
dc.contributor.editor: Schaefer, Ina
dc.contributor.editor: Seidl, Christoph
dc.date.accessioned: 2020-12-17T11:57:56Z
dc.date.available: 2020-12-17T11:57:56Z
dc.date.issued: 2021
dc.description.abstract (en): This paper has been published in the journal Empirical Software Engineering, 2020. Stack Overflow (SO) is among the most popular question-and-answer sites used by developers. Labeling posts with tags is one of the features that facilitate searching and browsing SO posts. However, existing tags mainly refer to technological aspects, not to the purpose of a question. In this paper, we argue that tagging posts with their purpose can help developers find the posts that answer their question. We first present a harmonization of existing taxonomies of question categories, which represent the purpose of a question, into seven categories. Next, we present two approaches to automate the classification of posts into the seven question categories, one using regular expressions and one using machine learning. Evaluating both approaches on an independent test set, we found that our regular expressions outperform machine learning. Applying the regular expressions to posts related to Android app development showed that the categories API USAGE, CONCEPTUAL, and DISCREPANCY are most frequently assigned. By integrating our approach into SO, posts could be manually tagged with our categories, which would allow developers to search posts by question category.
dc.identifier.doi: 10.18420/SE2021_03
dc.identifier.isbn: 978-3-88579-704-3
dc.identifier.pissn: 1617-5468
dc.identifier.uri: https://dl.gi.de/handle/20.500.12116/34525
dc.language.iso: en
dc.publisher: Gesellschaft für Informatik e.V.
dc.relation.ispartof: Software Engineering 2021
dc.relation.ispartofseries: Lecture Notes in Informatics (LNI) - Proceedings, Volume P-310
dc.subject: Stack Overflow
dc.subject: Classification
dc.subject: Question Categories
dc.subject: Program Understanding
dc.title (en): What Kind of Questions Do Developers Ask on Stack Overflow? A Comparison of Automated Approaches to Classify Posts Into Question Categories
dc.type: Text/ConferencePaper
gi.citation.endPage: 28
gi.citation.publisherPlace: Bonn
gi.citation.startPage: 27
gi.conference.date: 22-26 February 2021
gi.conference.location: Braunschweig/Virtual

Files

Original bundle
1 - 1 of 1
Name: B1-02.pdf
Size: 42.22 KB
Format: Adobe Portable Document Format