Title |
Semantic Approaches to Software Component Retrieval with English Queries |
Authors |
Huijing Deng and Grzegorz Chrupała |
Abstract |
Enabling code reuse is an important goal in software engineering, and it depends crucially on effective code search interfaces. We propose to ground word meanings in source code and to use such language-code mappings to build a search engine for programming library code where users can pose queries in English. We exploit the fact that there are large programming language libraries which are documented both via formally specified function or method signatures and via descriptions written in natural language. Automatically learned associations between words in descriptions and items in signatures allow us to use queries formulated in English to retrieve methods which are not documented via natural language descriptions, based only on their signatures. We show that the rankings returned by our model substantially outperform a strong term-matching baseline. |
Topics |
Statistical and Machine Learning Methods, Semantics |
Full paper |
Semantic Approaches to Software Component Retrieval with English Queries |
Bibtex |
@InProceedings{DENG14.106,
  author = {Huijing Deng and Grzegorz Chrupała},
  title = {Semantic Approaches to Software Component Retrieval with English Queries},
  booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)},
  year = {2014},
  month = {may},
  date = {26-31},
  address = {Reykjavik, Iceland},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-8-4},
  language = {english}
} |