Semantic Code Search with Neural Bag-of-Words and Graph Convolutional Networks
dc.contributor.author | Sieper, Anna Abad | |
dc.contributor.author | Amarkhel, Omar | |
dc.contributor.author | Diez, Savina | |
dc.contributor.author | Petrak, Dominic | |
dc.contributor.editor | Becker, Michael | |
dc.date.accessioned | 2021-03-09T10:32:32Z | |
dc.date.available | 2021-03-09T10:32:32Z | |
dc.date.issued | 2020 | |
dc.description.abstract | Software developers are often confronted with tasks for which there are widespread solution patterns. Searching for solutions using natural language queries often leads to unsatisfactory results. GitHub, Microsoft Research and Weights & Biases created the CodeSearchNet Challenge to address this problem. Its goal is to develop code search approaches that return the code that best matches a natural language query. In this paper, we investigate two different approaches in this context: first, a Neural Bag-of-Words encoder using TF-IDF weighting, and second, a Graph Convolutional Network that includes the call hierarchy in a target method’s representation. In our experiments, we were able to improve on the Neural Bag-of-Words models whose results were published in the CodeSearchNet Challenge. Our Neural Bag-of-Words encoder improves the mean reciprocal rank (MRR) by 4.38% for Python and 4.98% for Java. The Graph Convolutional Network did not improve on the results of the Neural Bag-of-Words model. | en |
dc.identifier.isbn | 978-3-88579-750-0 | |
dc.identifier.pissn | 1614-3213 | |
dc.identifier.uri | https://dl.gi.de/handle/20.500.12116/35781 | |
dc.language.iso | en | |
dc.publisher | Gesellschaft für Informatik e.V. | |
dc.relation.ispartof | SKILL 2020 - Studierendenkonferenz Informatik | |
dc.relation.ispartofseries | Lecture Notes in Informatics (LNI) - Seminars, Volume S-16 | |
dc.subject | Semantic Code Search | |
dc.subject | Graph Convolutional Network | |
dc.subject | Neural Bag-of-Words | |
dc.subject | CodeSearchNet Challenge | |
dc.title | Semantic Code Search with Neural Bag-of-Words and Graph Convolutional Networks | en |
dc.type | Text/Conference Paper | |
gi.citation.endPage | ||
gi.citation.publisherPlace | Bonn | |
gi.citation.startPage | 103 | |
gi.conference.date | 30.09/01.10.2020 | |
gi.conference.sessiontitle | Text Mining |