Skip to main content Accessibility help
×
Hostname: page-component-848d4c4894-nr4z6 Total loading time: 0 Render date: 2024-04-30T13:20:26.802Z Has data issue: false hasContentIssue false

7 - Assessing web search engines: a webometric approach

Published online by Cambridge University Press:  08 June 2018

Mike Thelwall
Affiliation:
University of Wolverhampton, UK
Get access

Summary

Introduction

Information Retrieval (IR) research typically evaluates search systems in terms of the standard precision, recall and F-measures to weight the relative importance of precision and recall (e.g. van Rijsbergen, 1979). All of these assess the extent to which the system returns good matches for a query. In contrast, webometric measures are designed specifically for web search engines and are designed to monitor changes in results over time and various aspects of the internal logic of the way in which search engine select the results to be returned. This chapter introduces a range of webometric measurements and illustrates them with case studies of Google, Bing and Yahoo! This is a very fertile area for simple and complex new investigations into search engine results.

The modern commercial web search engine is a highly complex system (Arasu et al., 2001) with vast social and commercial significance (Van Couvering, 2004, 2007). Although they can be evaluated to some extent with traditional IR measures like precision and recall, web search engines behave differently from traditional IR systems in many respects. Three key differences are the importance of rank order in the results; the limitation to 1,000 matches per query; and the goal of delivering relevant and useful results rather than technically accurate matches (in the sense of: Bar-Ilan and Peritz, 2008). In response, evaluation metrics have been developed to measure new characteristics of web search engines, such as the effectiveness of the rank order of the results (Zaragoza, Cambazoglu and Baeza-Yates, 2010), mean average precision (Turpin and Scholer, 2006) and discounted cumulative gain (Jarvelin and Kekalainen, 2002). A broad common goal is to assess the extent to which any web search engine delivers good and relevant results to users. For some information scientists, however, IR goals are not sufficient with regard to web search engines, for two reasons.

First, search engines like Google are so important in academia and daily life that it is important for information professionals to understand something of how they work and what their limitations are.

Type
Chapter
Information
Innovations in Information Retrieval
Perspectives for theory and practice
, pp. 135 - 146
Publisher: Facet
Print publication year: 2011

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×