Book contents
- Frontmatter
- Content
- Acknowledgements
- 1 Introduction
- 2 What is a thesaurus?
- 3 Tools for subject access and retrieval
- 4 What a thesaurus is used for
- 5 Why use a thesaurus?
- 6 Types of thesaurus
- 7 The format of a thesaurus
- 8 Building a thesaurus 1: vocabulary collection
- 9 Vocabulary control 1: selection of terms
- 10 Vocabulary control 2: form of entry
- 11 Building a thesaurus 2: term extraction from document titles
- 12 Building a thesaurus 3: vocabulary analysis
- 13 The thesaural relationships
- 14 Building a thesaurus 4: introducing internal structure
- 15 Building a thesaurus 5: imposing hierarchy
- 16 Building a thesaurus 6: compound subjects and citation order
- 17 Building a thesaurus 7: conversion of the taxonomy to alphabetical format
- 18 Building a thesaurus 8: creating the thesaurus records
- 19 Managing and maintaining the thesaurus: thesaurus software
- 20 Conclusion
- Glossary
- Bibliography
- Appendix 1 Sample titles for thesaurus vocabulary
- Appendix 2 Sample terms for the thesaurus
- Appendix 3 Facets at stage 1 of analysis
- Appendix 4 Facets at stage 2 of analysis
- Appendix 5 Completed systematic display
- Appendix 6 Thesaurus entries for sample page
- Index
8 - Building a thesaurus 1: vocabulary collection
Published online by Cambridge University Press: 09 June 2018
- Frontmatter
- Content
- Acknowledgements
- 1 Introduction
- 2 What is a thesaurus?
- 3 Tools for subject access and retrieval
- 4 What a thesaurus is used for
- 5 Why use a thesaurus?
- 6 Types of thesaurus
- 7 The format of a thesaurus
- 8 Building a thesaurus 1: vocabulary collection
- 9 Vocabulary control 1: selection of terms
- 10 Vocabulary control 2: form of entry
- 11 Building a thesaurus 2: term extraction from document titles
- 12 Building a thesaurus 3: vocabulary analysis
- 13 The thesaural relationships
- 14 Building a thesaurus 4: introducing internal structure
- 15 Building a thesaurus 5: imposing hierarchy
- 16 Building a thesaurus 6: compound subjects and citation order
- 17 Building a thesaurus 7: conversion of the taxonomy to alphabetical format
- 18 Building a thesaurus 8: creating the thesaurus records
- 19 Managing and maintaining the thesaurus: thesaurus software
- 20 Conclusion
- Glossary
- Bibliography
- Appendix 1 Sample titles for thesaurus vocabulary
- Appendix 2 Sample terms for the thesaurus
- Appendix 3 Facets at stage 1 of analysis
- Appendix 4 Facets at stage 2 of analysis
- Appendix 5 Completed systematic display
- Appendix 6 Thesaurus entries for sample page
- Index
Summary
Having decided that you need to build your own thesaurus, the first stage in the process is to gather the vocabulary. There are several ways in which you can do this, and which method you choose may depend on the use to which the thesaurus will be put. Further on in this chapter we will look at the factors that may affect this decision. You do need to keep in mind the fact that the thesaurus will be used for indexing, object description, or document management of some sort, and not for making a theoretical study of the subject itself. You should therefore always consider whether individual terms are useful for the purpose of information retrieval, and whether they correspond to the material to be organized.
There are a number of potential sources that can be mined for terms. They fall into two groups: sources of actual terms and sources of document titles. Collecting document titles is an essential part of assembling vocabulary, but some additional work will be required to derive terms from the titles.
Existing vocabularies
The most obvious and most accessible source of terminology will be published vocabulary tools. These can be divided into two categories:
• vocabularies for indexing and information retrieval
• dictionaries, glossaries and word lists for study and reference.
The first category includes such things as classification schemes, subject heading lists, keyword lists, taxonomies and other thesauri – in fact, all the kinds of tools we considered in Chapter 3. These have the advantage that the terms in the vocabulary will be intended for document (or object) description, so that they will be of the kind and form that are required for a thesaurus. Vocabularies may also give a sense of the structure of the subject. The disadvantage, particularly for technical subjects, is that the meaning of terms may not be provided.
The second group embraces general and subject specialist encyclopaedias, and subject-specific dictionaries, glossaries and word lists intended for use in subject work. These tools have the advantage that the terms are accompanied by definitions which may be necessary at the stage of analysing and organizing the vocabulary.
- Type
- Chapter
- Information
- Essential Thesaurus Construction , pp. 58 - 69Publisher: FacetPrint publication year: 2006