Skip to main content Accessibility help
×
Hostname: page-component-5c6d5d7d68-tdptf Total loading time: 0 Render date: 2024-08-15T19:29:55.088Z Has data issue: false hasContentIssue false

12 - Measuring text reuse in the news industry

from Part VII - Information studies

Published online by Cambridge University Press:  17 November 2010

Lionel Bently
Affiliation:
University of Cambridge
Jennifer Davis
Affiliation:
University of Cambridge
Jane C. Ginsburg
Affiliation:
Columbia University, New York
Get access

Summary

Introduction

The activity of text reuse describes the situation in which pre-existing written material is consciously used again during the creation of a new text or version. This might include the reuse of an entire document (e.g. in the case of duplicate web pages), or smaller segments (e.g. chunks, paragraphs and sentences) from one or more existing texts. From the author's perspective, the process of reuse involves ‘finding the relevant material, modifying it as needed and stitching the pieces together’. This may involve a process of text rewriting (or editing), with the author reusing existing material with (or without) permission from the owner. From the reader's perspective, text reuse can be cast as a problem of text analysis or attribution: given two texts is it possible to determine, within an acceptable degree of probability, whether one text is derived from the other? Identifying text reuse can be difficult due to the degree of textual transformation that can occur, from simple cut-and-paste reuse to more complex cases involving paraphrasing and summarisation making the revised version appear very different to the original text. One might add to this that recent advances in technology are also making the activity of text reuse easier. For example, the search engine Google indexes and makes easily accessible billions of web pages on a diverse range of topics and in many different languages. Being able to discover such documents may promote their use as a basis for new texts.

Type
Chapter
Information
Copyright and Piracy
An Interdisciplinary Critique
, pp. 247 - 259
Publisher: Cambridge University Press
Print publication year: 2010

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×