Skip to main content Accessibility help
×
Hostname: page-component-848d4c4894-2xdlg Total loading time: 0 Render date: 2024-07-07T14:49:32.876Z Has data issue: false hasContentIssue false

11 - Categorizing syntactic constructions in a corpus*

Published online by Cambridge University Press:  05 June 2014

Nicholas Smith
Affiliation:
School of Education, University of Leicester, UK
Elena Seoane
Affiliation:
Department of English, French and German, University of Vigo, Spain
Manfred Krug
Affiliation:
Otto-Friedrich-Universität Bamberg, Germany
Julia Schlüter
Affiliation:
Otto-Friedrich-Universität Bamberg, Germany
Get access

Summary

Introduction

To arrive at a thorough description of the usage of a grammatical construction in a corpus involves a number of stages. Minimally it will require: (a) retrieval of a set of valid instances of the construction from the corpus; (b) categorization of those instances according to linguistic and/or extra-linguistic features; and (c) quantitative and/or qualitative analysis of the instances and their associated categories. Steps (a), (b) and (c) need not be linear – one may, for instance, wish to revisit the retrieval after looking at the quantitative results – but each step is nevertheless required.

Our objective here is to demonstrate and discuss step (b) – categorization of a target structure. We take as a case study the English passive construction, more specifically the long passive, i.e. passives with an overt agent by-phrase, as in John was arrested by the police (Biber et al. 1999: 154). The kinds of coding we describe generally have to be inserted by hand, although like Sebastian Hoffmann (see Chapter 10, this volume) we recommend using a part-of-speech annotated version of the corpus to facilitate retrieval of the target data.

Type
Chapter
Information
Publisher: Cambridge University Press
Print publication year: 2013

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Hoffmann, Sebastian, Evert, Stefan, Smith, Nicholas, Lee, David and Berglund-Prytz, Ylva 2008. Corpus linguistics with BNCweb: a practical guide. Frankfurt: Peter Lang. Chapter 9.Google Scholar
Smith, Nicholas, Rayson, Paul and Hoffmann, Sebastian 2008. ‘Corpus tools and methods, today and tomorrow: incorporating linguists’ manual annotations’, Literary and Linguistic Computing 23(2): 163–180.CrossRefGoogle Scholar
Seoane, Elena 2009. ‘Syntactic complexity, discourse status and animacy as determinants of grammatical variation in Modern English’, English Language and Linguistics 13(3): 365–384.CrossRefGoogle Scholar

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×