Evaluating How Developers Use General-Purpose Web-Search for Code Retrieval

by   Md Masudur Rahman, et al.

Search is an integral part of a software development process. Developers often use search engines to look for information during development, including reusable code snippets, API understanding, and reference examples. Developers tend to prefer general-purpose search engines like Google, which are often not optimized for code related documents and use search strategies and ranking techniques that are more optimized for generic, non-code related information. In this paper, we explore whether a general purpose search engine like Google is an optimal choice for code-related searches. In particular, we investigate whether the performance of searching with Google varies for code vs. non-code related searches. To analyze this, we collect search logs from 310 developers that contains nearly 150,000 search queries from Google and the associated result clicks. To differentiate between code-related searches and non-code related searches, we build a model which identifies the code intent of queries. Leveraging this model, we build an automatic classifier that detects a code and non-code related query. We confirm the effectiveness of the classifier on manually annotated queries where the classifier achieves a precision of 87 recall of 86 automatically annotate all the queries in the dataset. Analyzing this dataset, we observe that code related searching often requires more effort (e.g., time, result clicks, and query modifications) than general non-code search, which indicates code search performance with a general search engine is less effective.


page 1

page 2

page 3

page 4


Readability and Understandability of Snippets Recommended by General-purpose Web Search Engines: a Comparative Study

Developers often search for reusable code snippets on general-purpose we...

An Empirical Study of Software Exceptions in the Field using Search Logs

Software engineers spend a substantial amount of time using Web search t...

StacerBot: A Stacktrace Search Engine for Stack Overflow

We as software developers or researchers very often get stacktrace error...

CoNCRA: A Convolutional Neural Network Code Retrieval Approach

Software developers routinely search for code using general-purpose sear...

Generating Clarifying Questions for Query Refinement in Source Code Search

In source code search, a common information-seeking strategy involves pr...

PSDoodle: Fast App Screen Search via Partial Screen Doodle

Searching through existing repositories for a specific mobile app screen...

Seasonal Web Search Query Selection for Influenza-Like Illness (ILI) Estimation

Influenza-like illness (ILI) estimation from web search data is an impor...

Please sign up or login with your details

Forgot password? Click here to reset