5 - Quality assurance and cataloguing

Published online by Cambridge University Press:  08 June 2018

Introduction

Quality assurance is an essential component of any web archiving programme. All collection methods involve some degree of automation, and it is therefore vital to ensure that the selection policy and collection list are actually being implemented successfully. The nature and degree of quality assurance that is required or practical will depend upon the needs and resources of the collecting agency, and on the selection approaches and collection methods employed. In general, the greater the scale of collection undertaken, the more basic the level of quality assurance that can realistically be employed. There is therefore invariably a trade-off between the number of resources that can be collected and the quality control that can be applied to them, and a policy decision is required as to the minimum acceptable level of assurance.

Whatever the level of detail at which it is applied, any quality assurance process should follow the basic model illustrated in Figure 5.1 overleaf.

This chapter describes these processes in detail, and identifies some of the most commonly encountered problems and their possible solutions. It also discusses the cataloguing of archived websites. Some form of catalogue description is required in order to manage any archival collection, and make it accessible to users. Although cataloguing may take place at various stages in the web archiving process, it is included here because an important element of quality assurance is to ensure that all necessary cataloguing is accurate and complete.

Pre-collection testing

Pre-collection testing is concerned with identifying, in advance of acquisition, potential issues that may affect the quality of collected content. It is clearly desirable to identify and resolve as many potential problems as possible prior to collection, thereby minimizing the extent of post-collection testing required. Pre-collection testing will typically include two approaches: resource analysis and test collection.

Resource analysis

This involves the manual or automated analysis of the target web resource, in order to identify the appropriate collection method and any issues that are likely to arise during collection. At the most basic level, it will be necessary to determine whether the website is static or dynamic in nature and, if the latter, whether all of the target resources are linked or only available through database queries.
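
As a rough illustration of what an automated first pass at this static/dynamic distinction might look like, the following Python sketch flags URLs that are likely to be dynamically generated. It is only a heuristic under stated assumptions: the list of script-like extensions, the function names `looks_dynamic` and `summarise`, and the example URLs are all hypothetical and do not come from the chapter. A query string or a server-side script extension is treated as a hint of dynamic content, not as proof, and content reachable only through database queries cannot be detected this way.

```python
from urllib.parse import urlparse

# Assumption: extensions that often (not always) indicate server-side generation.
DYNAMIC_EXTENSIONS = (".php", ".asp", ".aspx", ".jsp", ".cgi", ".cfm")


def looks_dynamic(url: str) -> bool:
    """Heuristic: a query string or script-like extension suggests dynamic content."""
    parsed = urlparse(url)
    if parsed.query:
        return True
    return parsed.path.lower().endswith(DYNAMIC_EXTENSIONS)


def summarise(urls):
    """Split a list of URLs into likely-static and likely-dynamic groups."""
    report = {"static": [], "dynamic": []}
    for url in urls:
        key = "dynamic" if looks_dynamic(url) else "static"
        report[key].append(url)
    return report


if __name__ == "__main__":
    # Hypothetical URLs gathered from a site's link structure.
    sample = [
        "http://example.org/about.html",
        "http://example.org/search?q=archive",
        "http://example.org/catalogue/item.php",
    ]
    for group, members in summarise(sample).items():
        print(group, len(members))
```

A report of this kind could help decide which parts of a site warrant closer manual inspection before a collection method is chosen; it does not replace that inspection.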

Type: Chapter
In: Archiving Websites: a practical guide for information management professionals, pp. 69-81
Publisher: Facet
Print publication year: 2006
