Skip to main content Accessibility help
×
Hostname: page-component-77c89778f8-swr86 Total loading time: 0 Render date: 2024-07-16T09:26:57.218Z Has data issue: false hasContentIssue false

5 - Enriching

Published online by Cambridge University Press:  10 September 2022

Get access

Summary

Learning outcomes of this chapter

  • • Getting value out of non-structured metadata

  • • Understanding current possibilities and limits of named-entity recognition

  • • Being aware of the information versus non-information debate

  • • Case study: enriching the British Museum metadata

Introduction

As a metadata enthusiast, one may be struck with a sense of nostalgia when looking at catalogues from the 19th or early 20th century. The aesthetics of wooden cabinets and the beautiful handwriting make us realize how our profession has evolved over the last century. Apart from the physical appearance of the catalogue, it is interesting to take some time to read a couple of the entries. Very often, you will notice how lengthy some of the descriptions are, when compared with our contemporary descriptive practices.

With the professionalization of documentation practices in libraries, archives and museums, the entry forms for metadata became increasingly structured. Gradually, the lengthy narratives were sliced up in smaller information units, allowing a more standardized approach. The development of controlled vocabularies, as described in the previous chapter, is a good example of how a more structured approach towards subject access was rolled out across institutions.

This process of the increasing structuring of metadata illustrates well the gradual move from the narrative towards the database, as described by Manovich (2001) in his seminal work The Language of New Media. As humans, we make sense of the world through stories, told through lengthy descriptions. This explains why museums often offer audioguides, which explain in detail the context of a specific object.

Unfortunately, a computer cannot do much with a narrative. The more structured and well delineated metadata are, the better we can manage them with our IT tools. For example, the previous two chapters focused on how more value can be derived from metadata through cleaning and reconciliation. Both operations allow you to leverage your existing metadata. They do have one important constraint: they can only be applied upon structured metadata.

One can only apply cleaning and reconciliation to strings of characters with well delineated boundaries. Data cleaning can be very powerful once you know the specific format of a field. For example, if you know that an address has to be encoded by first mentioning the street name and only afterwards the house number, you can check whether this condition is met.

Type
Chapter
Information
Linked Data for Libraries, Archives and Museums
How to clean, Link and Publish your Metadata
, pp. 159 - 196
Publisher: Facet
Print publication year: 2015

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×