Logo des Repositoriums
 

Semantic Code Search with Neural Bag-of-Words and Graph Convolutional Networks

dc.contributor.authorSieper, Anna Abad
dc.contributor.authorAmarkhel, Omar
dc.contributor.authorDiez, Savina
dc.contributor.authorPetrak, Dominic
dc.contributor.editorBecker, Michael
dc.date.accessioned2021-03-09T10:32:32Z
dc.date.available2021-03-09T10:32:32Z
dc.date.issued2020
dc.description.abstractSoftware developers are often confronted with tasks for which there are widespread solution patterns. Searching for solutions using natural language queries often leads to unsatisfying results. Github, Microsoft Research and Weights & Biases created the CodeSearchNet Challenge to address this problem. Its goal is to develop code search approaches that return the code that best matches a natural language query. In this paper, we investigate two different approaches in this context. First, a Neural Bag-of-Words encoder using TF-IDF weighting and second, a Graph Convolutional Network which includes the call hierarchy in a target method’s representation. In our experiments we were able to improve the Neural Bag-of-Words models, whose results were published in the CodeSearchNet Challenge. Our Neural Bag-of-Words encoder improves the MRR by 4.38% for Python and 4.98% for Java. The Graph Convolutional Network did not improve the results over of the Neural Bag-of-Words model.en
dc.identifier.isbn978-3-88579-750-0
dc.identifier.pissn1614-3213
dc.identifier.urihttps://dl.gi.de/handle/20.500.12116/35781
dc.language.isoen
dc.publisherGesellschaft für Informatik e.V.
dc.relation.ispartofSKILL 2020 - Studierendenkonferenz Informatik
dc.relation.ispartofseriesLecture Notes in Informatics (LNI) - Seminars, Volume S-16
dc.subjectSemantic Code Search
dc.subjectGraph Convolutional Network
dc.subjectNeural Bag-of-Words
dc.subjectCode-SearchNet Challenge
dc.titleSemantic Code Search with Neural Bag-of-Words and Graph Convolutional Networksen
dc.typeText/Conference Paper
gi.citation.endPage
gi.citation.publisherPlaceBonn
gi.citation.startPage103
gi.conference.date30.09/01.10.2020
gi.conference.sessiontitleText Mining

Dateien

Originalbündel
1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
SKILL2020-08.pdf
Größe:
236.17 KB
Format:
Adobe Portable Document Format