This page lists tidbits of code or other electronic resources which are available for download.
See my GitHub page.
Open Library Query-Click Dataset
— This is an evaluation dataset collected during an analysis of search behavior from Open Library server logs. The dataset consists of 22,622 frequently submitted queries and their associated clicks plus a collection of 46,561,553 Open Library metadata records crawled on November 30, 2011. For more information, see the README.
Searcher Frustration User Study Data — This is a dataset collected during a user study of frustration during web search at the University of Massachusetts Amherst in October 2009. The study consists of query logs and sensor readings for thirty participants. For more information, see the README.
This is available under an Open Database/Database Content license. Feel free to use, redistribute, and modify the dataset, but make sure to make it available under the same license and to give due attribution in any public use of the dataset.
Task-Aware Query Recommendation Data — A dataset that consists of the search contexts and query recommendations used in Task-Aware Query Recommendaiton (Feild & Allan, SIGIR'13). See the README for more information. You may also be interested in a refinement of term-query graph query recommendation algorithm we used.