Hostname: page-component-8448b6f56d-gtxcr Total loading time: 0 Render date: 2024-04-23T14:07:08.259Z Has data issue: false hasContentIssue false

Statistical signals of copying are robust to time- and space-averaging

Published online by Cambridge University Press:  13 April 2023

Mason Youngblood*
Affiliation:
Minds and Traditions Group, Max Planck Institute for Geoanthropology, Jena, Germany
Helena Miton
Affiliation:
Minds and Traditions Group, Max Planck Institute for Geoanthropology, Jena, Germany Santa Fe Institute, Santa Fe, New Mexico, USA
Olivier Morin
Affiliation:
Minds and Traditions Group, Max Planck Institute for Geoanthropology, Jena, Germany Institut Jean Nicod, ENS, EHESS, PSL University, CNRS, Paris, France
*
*Corresponding author. E-mail: masonyoungblood@gmail.com

Abstract

Cattle brands (ownership marks left on animals) are subject to forces influencing other graphic codes: the copying of constituent parts, pressure for distinctiveness and pressure for complexity. The historical record of cattle brands in some US states is complete owing to legal registration, providing a unique opportunity to assess how sampling processes leading to time- and space-averaging influence our ability to make inferences from limited datasets in fields like archaeology. In this preregistered study, we used a dataset of ~81,000 Kansas cattle brands (1990–2016) to explore two aspects: (1) the relative influence of copying, pressure for distinctiveness and pressure for complexity on the creation and diffusion of brand components; and (2) the effects of time- and space-averaging on statistical signals. By conducting generative inference with an agent-based model, we found that the patterns in our data are consistent with copying and pressure for intermediate complexity. In addition, by comparing mixed and structured datasets, we found that these statistical signals of copying are robust to, and possibly boosted by, time- and space-averaging.

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press

Social media summary: Cattle brands provide clues about how time- and space-averaging might affect statistical signals in archaeological data.

1. Introduction

Cattle branding, or the use of hot irons to leave marks on animals, is a common practice that dates back to at least ~1500 BCE (Insoll et al., Reference Insoll, Clack and Rege2015; van der Moezel, Reference van der Moezel2016; Wolfenstine, Reference Wolfenstine1970). Cattle brands are emblems – simple graphic codes that do not encode language (Morin et al., Reference Morin, Kelly and Winters2020). In North America, cattle brands were introduced by the Spanish in the sixteenth century (Brand, Reference Brand1961) and are applied to one part of the animals’ body to denote ownership and deter theft. Today, cattle branding in most parts of the USA is regulated by state organisations that register new brands and ensure that they are unique. Cattle brands in the USA are usually composed of two to four components – constituent parts that include letters, numbers and symbols (Figure 1). Letters often correspond to initials or ranch names, but sometimes the reverse is true and ranches (and even towns) are named after cattle brands (Lombard & Du Plessis, Reference Lombard and Du Plessis2016, 2019). The symbols in brands are more arbitrary and include everything from punctuation marks to state outlines and animals.

In any symbolic system used for communication, including simple emblems like cattle brands, there needs to be a way to distinguish symbols from one another. If more distinctive symbols are less likely to be confused with other symbols, then pressure for distinctiveness should cause symbolic systems to become more effective over time. In the evolution of writing and speech, for example, pressure for distinctiveness is thought to have been fundamental in maximising the signal space for better expressivity (de Boer, Reference de Boer2005, Reference de Boer2015; Flemming, Reference Flemming2002; Liljencrants & Lindblom, Reference Liljencrants and Lindblom1972; Smith & Kirby, Reference Smith and Kirby2008). Distinctiveness is probably an important consideration when ranchers create new cattle brands, since differentiating animals from one another is their primary function (Gracy II, Reference Gracy2016). Distinctiveness is interesting to us chiefly because it cannot be maximised without countering two important mechanisms that also shape emblems: copying and pressure for reduced complexity.

When graphic elements are copied the resulting emblems are less distinctive than they would have been if they were independently created. Successful graphic forms, when they spread to new users, tend to become less informative as a result. The designs that ancient Greek city-states minted on their coins were often adopted by entirely different cities. When this happened, the amount of information that coins carried about the cities that minted them declined: the same emblem became a poorer indicator of its city of origin (Pavlek et al., Reference Pavlek, Winters and Morin2019). Likewise, the copying of heraldic coats of arms in medieval Europe decreased the distinctiveness of copied arms (Morin & Miton, Reference Morin and Miton2018).

Pressure for reduced complexity, which occurs because simpler symbols are easier to perceive, learn and produce (Miton & Morin, Reference Miton and Morin2021; Smith, Reference Smith2020), can also reduce the distinctiveness of emblems. One possible definition of complexity is the number of ways that a shape could vary from its actual form (van der Helm, Reference van der Helm2014). Intuitively, there are relatively few ways to transform the letter ‘O’ into another letter without adding to it, whereas a character like 尴 has many more degrees of freedom. Complex shapes are more likely to be distinctive because they can vary along a greater number of dimensions (Miton & Morin, Reference Miton and Morin2021). This notion predicts (among other things) that written letters should be more complex in larger scripts (e.g. Devanagari vs. Greek), a phenomenon that has been shown using a variety of metrics for visual complexity (Chang et al., Reference Chang, Chen and Perfetti2018; Miton & Morin, Reference Miton and Morin2021). Graphic codes generally evolve to have reduced complexity over time (Kelly et al., Reference Kelly, Winters, Miton and Morin2021; Tamariz & Kirby, Reference Tamariz and Kirby2015), although this tendency is not universal (Miton & Morin, Reference Miton and Morin2019; Tran et al., Reference Tran, Waring, Atmaca and Beheim2021).

1.1. First aim: distinctiveness, copying and complexity

Variation in cattle brand designs is also likely to be shaped by pressure for distinctiveness, copying and complexity, but the relative importance of these forces is unknown. We make a distinction between distinctiveness (outcome at the level of brands) and pressure for distinctiveness (process at the level of components). In our model, pressure for distinctiveness occurs when the rarest components have the highest probability of being included in brand designs. This component-level process is one of several factors (including complexity, as described above) that affect the resulting distinctiveness of brands.

State regulations forbid the exact copying of another rancher's brand: each ranch must register a brand that is unique in one way or another – meaning that its component symbols, the way they are rotated or arranged, their location on the animal, or all of the above, are unique.Footnote 1 However, they do not forbid the copying of components or even sets of components, as long as the result is not an identical replication of another brand. Ranchers may copy components out of convenience from published brand books, use common components because of standardised design or reading conventions (e.g. from branding manuals), or modify existing brands with new components (e.g. as observed among pastoralist groups and cattle thieves) (Arnold, Reference Arnold1965; Landais, Reference Landais2001; Thomas, Reference Thomas1967; Wolfenstine, Reference Wolfenstine1970). Among East African pastoralists, the copying of brands within male lineages is thought to explain the high level of sharing between groups, and possibly even the occurrence of contemporary brands in rock art in the area (Lynch & Robbins, Reference Lynch and Robbins1977). Other aspects of cattle branding, such as the branding rituals themselves, are also known to be copied (Bird Rondeau, Reference Bird Rondeau2013).

As for complexity, operationalised here as the number of components in a brand, the peculiar technology of cattle branding imposes constraints of its own. In the context of cattle branding, designs that are too complex may be more difficult to physically blacksmith, more difficult to read from a distance, and more painful for the animal (Newman, Reference Newman2007; Stamp, Reference Stamp2013). On the other hand, pressure for reduced complexity in cattle brands is probably limited by the fact that simpler brands are more easily modified by cattle thieves (Adams, Reference Adams1906; Gracy II, Reference Gracy2016). Cattle theft remains a significant problem (Keen, Reference Keen2013), and while other technologies such as radio-frequency identification and ear-tagging are less painful for the animals, they are also less effective in deterring theft (American Association of Bovine Practitioners, 2020).

The first aim of this study was to infer the relative influence of pressure for distinctiveness, copying and complexity in the cultural evolution of cattle brands in the US state of Kansas between 1990 and 2016. To do this, we conducted generative inference by fitting an agent-based model (ABM) of cattle brand invention, in which simulated ranchers create new brands under varying levels of copying or pressure for distinctiveness, and pressure for complexity, to the spatiotemporal dynamics of brands and their constituent components. We used a simplified version of the model that we originally preregistered and made no specific hypotheses. The original preregistered analysis and hypotheses can be found in the first version of the preprint,Footnote 2 and a justification for departure from preregistration is included in the SI.

1.2. Second aim: time- and space-averaging

The second aim of this study was to use the high temporal and spatial resolution of our data to assess the influence of time- and space-averaging on the predictability of data. Cultural transmission processes are most easily differentiated from one another by their spatiotemporal dynamics (Barrett, Reference Barrett2019; Wilder & Kandler, Reference Wilder and Kandler2015). Time- and space-averaging, or the mixing of temporal or spatial structure in data (Perreault, Reference Perreault2019), disrupts such dynamics and makes it much more difficult to accurately infer processes from patterns in data (Garvey, Reference Garvey2018). Averaging, also referred to as mixing, is usually an organic result of how artefacts are deposited in the archaeological record, but it is also sometimes imposed by researchers for analytical convenience (Lyman, Reference Lyman2003; Perreault, Reference Perreault2018). In some cases, time averaging may dilute evidence for other processes and increase the fit of data to neutral models (Brantingham, Reference Brantingham2003), as empirically observed in fossil assemblages (Tomašových & Kidwell, Reference Tomašových and Kidwell2010). However, simulation-based evidence suggests that time-averaging can sometimes make it more difficult to correctly detect random copying depending on the methods used (Madsen, Reference Madsen2012; Porčić, Reference Porčić2015; Premo, Reference Premo2014). The effects of space-averaging are less well understood, but there is some empirical evidence that time- and space-averaging affects statistical signals in similar ways (Lyman, Reference Lyman2003).

We investigated the effect of time- and space-averaging on signals of copying at the level of brand components (i.e. subunits). In cultural systems where variants are composed of subunits, copying should leave statistical signals analogous to linkage disequilibrium (i.e. genetic alleles being non-randomly associated; Lewontin, Reference Lewontin1995) or to pointwise mutual information (i.e. one item carrying information about the presence of another; Shannon, Reference Shannon1951). For example, if components are sometimes copied wholesale, or together in groups, then the co-occurrence of components in brands will be higher than expected by chance (Morin & Miton, Reference Morin and Miton2018). The shuffling model is a modified derivation of pointwise mutual information that predicts the prevalence of variants assuming that their subunits contain no information about one another (Morin & Miton, Reference Morin and Miton2018). In other words, the shuffling model predicts what the prevalence of brands would be if all components in the population were randomly copied proportional to their current frequencies, with no wholesale copying. In their study of heraldic coats of arms, Morin and Miton found that their shuffling model managed to predict the popularity of coats of arms to a substantial degree. They also found that the presence of heraldic coats of arms is better predicted by a random copying model when it is applied to space-averaged data from all of Europe rather than from individual provinces (Morin & Miton, Reference Morin and Miton2018). In this case, space-mixing might disrupt local clusters of design elements that result from wholesale copying of coats of arms within families, thus improving the predictions of a model that assumes random copying of individual design elements.

Do time- and space-mixing boost signals of random copying? To answer this question, we compared the performance of an adapted version of Morin and Miton's shuffling model (Morin & Miton, Reference Morin and Miton2018) on structured, time-mixed and space-mixed datasets. For each of these datasets, we first used the shuffling model to calculate the predicted prevalence of every possible combination of components. Then, we used linear mixed models to compare the accuracy of these predictions after accounting for factors like complexity. We hypothesised that the shuffling model would perform better on the time- and space-mixed datasets compared with structured datasets. The structured datasets built for this goal were also used to test whether brands that are closer to each other in time tend to be more similar.

2. Methods

2.1 Data

The data for this study come from the Brands Program of the Kansas Department of Agriculture, which has periodically published books containing all registered cattle brands in the state since 1941.Footnote 3 In the early 1990s Kansas’ Brands Program began to use a 13-digit coding system to catalogue all of their cattle brands, which has been used in the seven most recent brand books: 1990, 1998, 2003, 2008, 2014, 2015 and 2016. Kansas’ coding system includes information about brands’ components, angles of rotation and location on the body. Of the brand books that include this coding system, only the latter four are available in original PDF formats that can be accurately transcribed with optical character recognition (OCR). The three earlier books are only available as scans of the printed books, and thus have noise and artefacts that interfere with OCR. We used a custom neural network in Tesseract OCR to automatically extract the cattle brand codes and zip codes from 2008, 2014, 2015 and 2016, and manually transcribed the brand book from 1990 to maximise the temporal depth of our data. 1998 and 2003 were excluded owing to time and labour constraints. The estimated error rate in transcription, based on a manual check of 1500 brands, was 0% for brand codes and 0.4% for locations. Details about the automated and manual transcription can be seen in the data GitHub.Footnote 4

Each brand code has 13 digits, composed of four three-digit component codes and one single-digit location code. The three-digit component codes contain the abbreviation for the component and a number denoting its angle of rotation. Kansas’ coding system has some redundancies, such as ‘\’ and ‘/’ having separate abbreviations despite being rotated versions of the same component. We collapsed and converted redundant abbreviations into single categories with corrected angles of rotation. Kansas’ Brands Program only allows duplicated cattle brands if they are located on different parts of the animal. Duplicated brand codes (as opposed to the cattle brands themselves) can also occur when the same components are combined in a different arrangement. We chose to remove duplicate brand codes from within the same zip code, which usually occur when a family or ranch has registered a single brand multiple times for different locations, but we chose to keep duplicate brand codes from different zip codes, which usually correspond to different arrangements of the same components. In total, we have 81,063 brand codes from 1990 to 2016, comprised 103 unique components (details in the code GitHubFootnote 5).

2.2. Generative inference (first aim)

The ABM simulates ranchers creating new brands every year based on the components present in the brands used by ranchers in the entire state, as well as the dual constraint of simplicity/complexity. Prior simulations showed that including angles of rotation in the model yielded results incompatible with the real data, so we chose to exclude them from our analysis (Figure S4).

New brands are created by sampling components from existing brands within the entire state. All of the components are compiled into a frequency table that is used for weighted random sampling. The probability P(x) that a rancher uses a particular component x is based on the frequency of x raised to an exponent C, normalised against the probability of using other components in the population:

$$P( x ) = \displaystyle{{F_x^C } \over {\mathop \sum \nolimits_{y = 1}^n F_y^C }}$$

Negative values of C cause the rarest components to have the highest probability of adoption, thus leading to more distinctive brands. We operationalise this situation (C < 1), where the probability of adoption is negatively related to frequency, as pressure for distinctiveness. C = 0 produces completely random brands independent of component frequencies. Positive values of C cause the common components to have the highest probability of adoption and introduces redundancy in brand designs. We operationalise this situation (C > 1), where the probability of adoption is positively correlated with frequency, as copying. A value of C = 1 leads to random (i.e. unbiased) copying, where components are used proportional to their current frequencies. The parameterisation of C in this model is directly based on standard models of anticonformity and conformity in cultural transmission (Crema et al., Reference Crema, Edinborough, Kerig and Shennan2014, Reference Crema, Kandler and Shennan2016; Lachlan et al., Reference Lachlan, Ratmann and Nowicki2018; Youngblood et al., Reference Youngblood, Stubbersfield, Morin, Glassman and Acerbi2021; Youngblood & Lahti, Reference Youngblood and Lahti2022), and a full comparison can be seen in Figure S3. To simulate pressure for simplicity/complexity, the number of components in each new brand is drawn from a Poisson distribution where λ is the parameter of interest (henceforth complexity; see Figure S1 for examples). At lower values of λ (e.g. 0.5) one-components brands are much more common, and at higher values of λ (e.g. 15) four-components brands are much more common. An intermediate value of λ (e.g. 3) leads to an equal probability of two- and three-component brands and a low probability of one- and four- component brands. No new components were added to Kansas’ standardised set during our study period, so we do not simulate the innovation of new component types.

At the beginning of each year in the ABM a set of N new brands is created, where N new is the average number of new brands that appear each year in the observed data. Each new brand is assigned a zip code, where the probability of each zip code is proportional to its frequency in all years of the observed data. After new brands are created a set of N old random brands is removed, where N old is the average number of brands that disappear each year in the observed data.

The ABM is initialised with the data from 1990 and runs through every year until 2016. In order to fit the parameters of the ABM to the observed data we calculated the following set of summary statistics from 2008, 2014, 2015 and 2016:

  1. 1. proportion of components that are the most common type;

  2. 2. proportion of components that are the most rare type;

  3. 3. Shannon's diversity index of components;

  4. 4. Simpson's diversity index of components;

  5. 5. Jaccard index of components (zip codes);

  6. 6. Morisita–Horn index of components (zip codes);

  7. 7. Jaccard index of components (counties);

  8. 8. Morisita–Horn index of components (counties);

  9. 9. mean Levenshtein distance between brands (from random 10%).

Note that these summary statistics are calculated using all of the brands that are present in a given year (e.g. 2014 includes all brands created before and during 2014 that have not yet been removed from the population). This is process is conducted separately for the real and simulated data so that the two can be compared. These summary statistics were chosen because they capture the overall diversity of components with an emphasis on both common (1 and 4) and rare (2 and 3) variants, the spatial diversity of components at the level of both zip codes (5 and 6) and counties (7 and 8), and a proxy for the average perceptual distance between brands (9). For all diversity metrics we calculated their Hill number counterparts, because they are measured on the same scale (Roswell et al., Reference Roswell, Dushoff and Winfree2021) and better account for relative abundance (Chao et al., Reference Chao, Gotelli, Hsieh, Sander, Ma, Colwell and Ellison2014). Shannon's diversity index emphasises rarer types whereas Simpson's diversity index emphasises more common types. The Jaccard and Morisita–Horn indices were similarly chosen for their complementarity. The Morisita–Horn index is a commonly used abundance-based beta diversity index (Chao et al., Reference Chao, Chazdon, Colwell and Shen2006), whereas the Jaccard index is the most robust of the incidence-based beta diversity indices to sampling error (Schroeder & Jenkins, Reference Schroeder and Jenkins2018). We calculated beta diversity at both the zip code and county level to assess spatial diversity at two different resolutions. The mean Levenshtein distance (a measure of edit distance), or the minimum number of insertions, deletions and substitutions required to convert one sequence to another, was calculated from a random 10% of brands.

The same summary statistics were calculated from the real data from 2008, 2014, 2015 and 2016. Then, the random forest version of approximate Bayesian computation (ABC) was conducted using the abcrf package in R (Raynal et al., Reference Raynal, Marin, Pudlo, Ribatet, Robert and Estoup2019), with the following steps:

  1. 1. 500,000 iterations of the ABM were run to generate simulated summary statistics for different values of λ and C.

  2. 2. The output of these simulations were combined into a reference table with the simulated summary statistics as predictor variables, and the parameter values as outcome variables. λ was log-transformed because it is non-negative, as recommended for non-linear regression-based ABC (Blum & François, Reference Blum and François2010; Sisson et al., Reference Sisson, Fan and Beaumont2018).

  3. 3. A random forest of 1000 regression trees was constructed for each parameter using bootstrap samples of 80% of the data from the reference table. Random forest parameters were tuned using a random 10% of the data with the tuneRanger package in R (λ, split features = 6 and minimum node size = 3; C, split features = 17 and minimum node size = 3) (Probst et al., Reference Probst, Wright and Boulesteix2018).

  4. 4. Each trained random forest was provided with the observed summary statistics, and each regression tree was used to generate posterior distributions for both parameters.

Calculating all nine summary statistics for each of the four years means that we conducted ABC with 36 summary statistics in total. Random forest ABC is robust to the number of summary statistics (Pudlo et al., Reference Pudlo, Marin, Estoup, Cornuet, Gautier and Robert2016) so we did not have to reduce the dimensionality prior to inference (Blum et al., Reference Blum, Nunes, Prangle and Sisson2013). As an additional robustness check, we fit the agent-based model to simulated data from two combinations of known parameter values: (1) λ = 3 and C = −1 (intermediate complexity and pressure for distinctiveness); and (2) λ = 3 and C = 1 (intermediate complexity and random copying). In both cases, the random forest ABC recovered the correct input parameters (Figures S6 and S7). Visualisations of model output for the SI were conducted with principal component analysis and uniform manifold approximation and projection (McInnes et al., Reference McInnes, Healy and Melville2018).

C was assigned a normal prior distribution with a mean of 0 and standard deviation of 2. λ was assigned a gamma prior distribution with a shape parameter of 0.9 and a rate parameter of 0.2. This distribution was chosen because, based on 200,000 samples, it includes a relatively even spread across the component probabilities shown in Figure S1 while allowing for some extreme values (minimum of ~0, maximum of 75).

2.3. Shuffling model (second aim)

The shuffling model takes a set of brands and calculates a predicted prevalence for every possible combination of the components within those brands. When it performs well, the shuffling model assigns a higher predicted prevalence to brands that actually exist in the data. We predict that this will be more true for time- and space-mixed datasets than for the original, structured datasets. According to the shuffling model, the predicted prevalence of a brand is simply the product of the frequency of its components in a specific subset of data. For example, the predicted prevalence (S) of a brand with three components A, B and C would be:

$$S_{ABC} = F_A \times F_B \times F_C \times N_{other}$$

where FA is the frequency of A among brands that do not include B or C, FB is the frequency of B among brands that do not include A or C, FC is the frequency of C among brands that do not include A or B and N other is the total number of brands excluding the focal brand.

To apply this model, we first needed to generate structured and mixed datasets. The structured datasets were split into groups according to time, space and complexity. For time, brands were grouped into two age groups: the ‘old’ group (O) included all brands that appear in the 1990 brand book, and the ‘young’ group (Y) included all brands that appear in later brand books. For space, brands were grouped into four rectangular areas of roughly equal size following county borders: northwest (NW), southwest (SW), northeast (NE) and southeast (SE) (see Figure S2). For complexity, brands were grouped into two categories: two-component and three-component brands. One-component brands were excluded because they lack the combinatorics required for the shuffling model, and four-component brands were excluded because they are relatively rare in the dataset (~2%). This structure was used to split the data into 16 subsets, each of which represents a unique combination of time, space and complexity (e.g. O-NE-2 is old northeastern brands with two components).

Once we had structured datasets we generated time- and space-mixed datasets to test our hypotheses about the effects of time- and space- averaging. Each mixed dataset has as many brands as the structured dataset that it imitates. For time-mixed datasets, brands were randomly drawn from the same area independently of time. The time-mixed O-NE-2 dataset was constructed by randomly sampling n two-component brands from northeast Kansas across all time periods, where n is the number of brands in the structured subset. For space-mixed datasets, brands were randomly drawn from the same time dataset independently of space. The space-mixed O-NE-2 dataset was constructed by randomly sampling old two-component brands from all of Kansas.

For each structured and mixed dataset, we inventoried all of the components present in the brands, listed all of their possible combinations, and used the shuffling model to assign a predicted prevalence to each combination. For each possible combination we noted whether it was actually present in the dataset or not.

To compare the accuracy of the shuffling model when applied to the structured and mixed datasets, we built two linear mixed effects models with predicted prevalence as the outcome variable: one with the structured and time-mixed datasets, and the other with the structured and space-mixed datasets. Whether the dataset was mixed (MIXED: 0 = N, Y = 1), whether the combination of components was actually present in the data (ACTUAL: 0 = N, Y = 1) and complexity (COMPLEXITY: 2 = 0, 3 = 1) were potential fixed effects. Space, time and the specific combination of components were potential varying intercepts. The shuffling model assigns a predicted prevalence of 0 when a component occurs multiple times in a target brand or only occurs in a single brand in the dataset, so all combinations with a predicted prevalence of 0 were dropped from the modelling. The remaining predicted prevalence values fit a lognormal distribution so we log-transformed them prior to modelling. Model choice was conducted using Akaike's information criterion (AIC) calculated from frequentist models fit with the lme4 package in R (Bates et al., Reference Bates, Mächler, Bolker and Walker2015). The best fitting versions of each model were run as Bayesian models in Stan using the brms package in R (Bürkner, Reference Bürkner2017), using mean field approximation with 5000 samples from 50,000 iterations.

Our first prediction was that ACTUAL = 1 would have a positive effect on the predicted prevalence of a combination of components, which would indicate that the shuffling model works better than chance. Our second prediction was that there would be an informative and significantly positive interaction term ACTUAL*MIXED, or in other words that the predicted prevalence differential between ACTUAL = 1 and ACTUAL = 0 is more important when MIXED = 1. This would indicate that the performance of the shuffling model is higher in mixed datasets. Finally, complexity was included as a control variable because three-component brands have an additional multiplied proportion that probably reduces their prevalence values.

2.4. Temporal distance (supplemental analysis)

The structured datasets used for the shuffling model were also used to test whether brands that are closer to each other in time tend to be more similar. First, we pooled together both the old and young brands from each spatial subset (e.g. O-NE-2 + Y-NE-2 = NE-2) and calculated the Levenshtein distance (LD) between each pair of brands. As a reminder, old brands are those that appear in the 1990 brand book and young brands are those that appear in later brand books but not in the 1990 issue. Then, we ran a linear mixed effects model with LD as the outcome variable, whether the brands are both old (OO), both young (YY) or one of each (OY) as the predictor variable, and the identities of the compared brands as random effects (i.e. one intercept for the first brand and one for the second). We predicted that pairs of brands that are OY would have a higher LD relative to OO and YY, or, in other words, that brands from different time periods would be more distinct from one another.

3. Results

3.1. Generative inference (first aim)

The posterior distributions and estimates of the two parameters can be seen in Figure 2 and Table 1, respectively. The posterior for complexity (λ) has a median of 1.55 and a relatively tight 95% confidence interval (CI), suggesting that the data are consistent with pressure for intermediate complexity. As a reminder, λ controls the shape of the Poisson distribution from which the number of components in new brands is drawn, so a value of 1.55 gives two components a slightly higher probability than three components (Figure S1). The posterior for copying strength (C) has a median of 0.92 with a relatively tight 95% CI, suggesting that the data is consistent with copying rather than pressure for distinctiveness, at a level slightly below random copying (C = 1). Posterior simulations demonstrate that the fitted parameter values produce output close to the observed data (Figure S5).

Figure 1. Examples of two cattle brands registered in the US state of Kansas in 1884 (https://www.kansasmemory.org/item/309140/). Both brands are composed of two components: on the left a stylised ‘N’ and ‘P’, and on the right a rotated ‘(‘ and ‘S’).

Figure 2. The posterior distributions for the two parameters of the ABM computed with random forest ABC, plotted against the priors (dotted).

Table 1. The median estimates, 95% quantile-based credible intervals, and root mean square log/logit-transformed errors for each parameter. λ controls the shape of the Poisson distribution from which the number of components in new brands is drawn, and C is the exponent to which the components frequencies are raised.

3.2. Shuffling model (second aim)

Brand appears to be the only grouping variable that explains a high level of variance in the data (time-mixed, ICCbrand = 0.926, ICCspace = 0.021, ICCtime = 0.009; space-mixed, ICCbrand = 0.923, ICCspace = 0.003, ICCtime = 0.002), so we included it as a varying intercept in both models. Adding complexity as a control variable improved model fit (time-mixed, ΔAIC = 5120; space-mixed, ΔAIC = 4852). Adding ACTUAL as a fixed effect improved model fit (time-mixed, ΔAIC = 1377; space-mixed: ΔAIC = 2388) and adding ACTUAL*MIXED as an interaction further improved model fit (time-mixed, ΔAIC = 23; space-time, ΔAIC = 403). The best fitting models for both the time-mixed and space-mixed datasets had the following specification:

$$\log ( S ) \;\approx \;{\rm Normal}\;( {\mu_i, \;\;\sigma } ) $$
$$\mu _i = \alpha _{BRAND[ i ] } + ( {\beta_C \times COMPLEXITY} ) + ( \beta _A \times ACTUAL) + ( \beta _M \times MIXED) + ( \beta _{AM} \times ACTUAL\;\times MIXED) $$
$$\alpha _j\;\approx \;{\rm Normal}\;( {0, \;\;1} ) $$
$$\beta _{C, \;A, \;M, \;AM}\;\approx \;{\rm Normal}\;( {0, \;\;0.5} ) $$
$$\sigma \;\approx \;{\rm Exponential}\;( 3 ) $$

The results of the best fitting models can be seen in Table 2.

Table 2. The mean estimates and 95% credible intervals for each parameter in the best fitting models for the time-mixed (left) and space-mixed (right) subsets

As predicted, we found that both the effect of ACTUAL and the interaction between ACTUAL and MIXED are positive, indicating that the shuffling model works better than chance and performs better in both time- and space-mixed datasets. Figure 3 demonstrates the accuracy of the shuffling model's predictions when applied to a structured, time-mixed and space-mixed dataset (i.e. Y-SE-2, which has the highest sample size). The effect is subtle, but when applied to mixed datasets (right panel) the shuffling model has more true and fewer false positives, apparently because more of the possible combinations of common components are represented.

Figure 3. The accuracy of the shuffling model's predictions when applied to a structured (left panel), time-mixed and space-mixed (right panel) version of Y-SE-2, the dataset with the highest sample size. Each point above the diagonal is a unique combination of components, and components on both axes are ranked by their commonness in the entire dataset. Green is a true positive (exists and S ≥ 1), blue is a true negative (does not exist and S < 1), orange is a false positive (does not exist and S ≥ 1), and yellow is a false negative (exists and S < 1).

We found that complexity has the expected negative effect on predicted prevalence. The R-hat values are all equal to 1, and the effective sample sizes are all greater than 1000, showing that both models have converged. The results of a frequentist version of the best fitting model can be seen in Table S2.

As a follow-up, we repeated our generative inference analysis after randomly mixing the years and locations of the real brands (not preregistered) and found that the detected signals of copying were robust to time- and space-mixing (Figure S8).

3.3. Temporal distance (supplementary aim)

The temporal distance model had the following specification:

$$LD\;\approx \;{\rm Binomial}\;( {3, \;\;p_i} ) $$
$$logit( p_i) = \alpha _{BRAND_A[ i ] } + \delta _{BRAND_B[ i ] } + ( \beta _{OO} \times OO) + ( \beta _{YY} \times YY) $$
$$\alpha _j{\rm \;}\approx {\rm \;Normal}\;( {0, \;{\rm \;}1} ) $$
$$\delta _k\;\approx \;{\rm Normal}\;( {0, \;\;1} ) $$
$$\beta _{OO, YY}{\rm \;}\approx {\rm \;Normal}\;( {0, \;{\rm \;}0.5} ) $$

The results of the temporal distance model can be seen in Table 3. As predicted, we found that the effect of compared brands being from the same time period (e.g. OO or YY) on their Levenshtein distance was negative. In other words, we found that brands from different time periods are more distinct from one another. The R-hat values are all equal to 1, and the effective sample sizes are all greater than 1000, showing that the model has converged.

Table 3. The mean estimates and 95% credible intervals for each parameter in the best fitting model.

4. Discussion

Based on the results of generative inference, the spatiotemporal dynamics of Kansas cattle brands between 1990 and 2016 are consistent with copying of components across the entire state. Ranchers do not maximise the distinctiveness of their brands beyond the basic constraint of uniqueness, but they do appear to create brands of intermediate complexity. In support of our hypothesis, we found that time- and space-averaging improves the ability of the shuffling model, which assumes random copying at the level of components, to predict the presence and absence of brands in our data. This finding is supported by the fact that our generative inference results remain consistent with copying after the years and locations of brands have been shuffled. Finally, brands from the same time period are more similar to one another than brands from different time periods, as would be expected when copying is present.

Our finding that signals of copying are robust to, and possibly boosted by, time- and space-averaging is significant in light of the previous literature on this topic. Several simulation studies have found that time-averaging makes it more difficult to correctly detect random copying using classic neutrality tests like Slatkin's exact test (Madsen, Reference Madsen2012; Porčić, Reference Porčić2015; Premo, Reference Premo2014). Such tests are very sensitive to evenness, and thus the long tail of distributions (Hill, Reference Hill1997) and departures from neutrality induced by time-averaging appear to be due to the overrepresentation of rare and fleeting variants (Premo, Reference Premo2014). Previous simulation studies have assumed that an infinite number of new variants are possible, though, and time-averaging may not have this effect in systems like cattle brands where variation is bounded because new variants are produced from a finite set of constituent parts. Additionally, Premo (Reference Premo2014) found that if the entire frequency distribution of variants is considered, then signals of random copying appear to be robust to time-averaging (or even slightly boosted by it, even at low sample sizes). This is consistent with our findings, as well as empirical evidence that time-averaging boosts signals of neutral evolution in fossil assemblages (Tomašových & Kidwell, Reference Tomašových and Kidwell2010).

Figure 4 provides an example of the logic of averaging boosting signals of random copying, with a basic simulation of cultural transmission over three timesteps. In t 1 the two cultural variants A and B are equally represented in the population. In t 2 and t 3 a conformity bias (C = 2) causes B to increase in frequency, as shown in the plot of the full population in t 3 in the top right. If we only have access to a subsample from t 1 to t 3, though, the frequency of the rare variant (A) is higher and the frequency of the common variant (B) is lower (see bottom right), bringing them closer to the expectation under random copying (dashed line). If we simulate this with our cattle brand ABM, using the same starting point and temporal dynamics as our main analysis, we see a similar pattern (Figure 4). The orange and blue lines show the simulated frequency distributions of components in 2016 when C = 0 (frequency information does not matter) and when C = 10 (extreme conformity). If we take random subsamples of the same size from all years, shown in yellow and green, it brings the frequency distributions closer to the expectation under random copying, shown in black. When C = 0 the time-mixing boosts the frequencies of common components at the expense of rare ones, and when C = 10 it boosts the frequencies of rare components at the expense of common ones.

Figure 4. (a) A basic simulation of cultural transmission over three timesteps, with a fully-connected population of 30 agents transmitting two variants (A and B) with a conformity bias (C = 2). A and B are equally represented in the first timestep. The bar plot in the top right shows the frequencies of A and B in t 3, and the bar plot in the top left shows the frequencies of A and B in a time-mixed subsample of the same size from t 1 to t 3. The dashed lines show the expected frequencies under random copying. (b) The simulated component frequency distributions from the cattle brand ABM under several conditions (five iterations each). Orange and blue are the frequency distributions from 2016 when C = 0 (frequency information does not matter) and C = 10 (extreme conformity), respectively. Yellow and green are the frequency distributions from time-mixed subsamples of the same size collected across all years when C = 0 and C = 10, respectively. The black dashed line is the frequency distribution expected when C = 1 (random copying).

There are several important differences between cattle brands and typical archaeological datasets that should be highlighted. First, the relative time-scale of averaging in our study is much smaller than that typically seen in archaeological assemblages. The effects of time-averaging generally increase with the aggregation window (Perreault, Reference Perreault2018; Wilder & Kandler, Reference Wilder and Kandler2015), with the onset of effects occurring when the window is about the same as the average lifetime of variants (Madsen, Reference Madsen2012). Cattle brands are often inherited within families and can have relatively long use-lives, and we suspect that the ~25-year window of our study is close to the average lifetime of a cattle brand. Our study shows that time-averaging can have a significant impact on the detectability of copying with a much narrower time window than previously assumed. Second, time-averaging has bigger effects when innovation rates are high (Madsen, Reference Madsen2012), and no new component types were innovated during the course of our study period. Combined with the relatively small averaging window, we think that this explains the subtlety of the observed effects of time- and space-averaging (see Table 2 and Figure 3). Finally, the time- and space-averaging in our study occurred at the level of brands, while copying and pressure for distinctiveness were modelled at the level of components. In previous studies, models have assumed that cultural variants are copied discretely as types at the same level of analysis at which time- and space-averaging occurs (Madsen, Reference Madsen2012; Porčić, Reference Porčić2015; Premo, Reference Premo2014). If we use pottery traditions as an example, a ‘type’ is actually a complex combination of different techniques or design elements. Copying probably occurs at the level of these techniques and elements (or combinations of them), whereas time- and space-averaging occur at the level of the artefacts. We think that continuous cultural variation may contain additional information that is affected differently by averaging or missing data (Premo, Reference Premo2021).

Supplementary material

To view supplementary material for this article, please visit https://doi.org/10.1017/ehs.2023.5

Acknowledgments

We would like to thank Lilia Kurnosova, Elia Weber and Agnes Dietrich for their assistance with manual coding of cattle brands, and members of our lab, the Minds and Traditions Research Group at the Max Planck Institute for Geoanthropology, for feedback on earlier versions of this manuscript.

Author contributions

M.Y. transcribed the data with optical character recognition and manual coding with research assistants. M.Y. designed and conducted the agent-based modelling with feedback from H.M. and O.M. O.M. and M.Y. designed the time- and space-averaging analysis and M.Y. conducted it. M.Y., H.M., and O.M. wrote the preregistration and manuscript.

Financial support

This work has received funding from the Max Planck Society and from the ‘Frontiers in Cognition’ EUR grant, ANR-17-EURE-0017 EUR.

Conflicts of interest

We declare no conflicts of interest.

Data and code availability

The preregistration document is available on OSF (https://osf.io/d4qv3), the data and transcription methodology are available on our data GitHub (https://github.com/masonyoungblood/cattle_brand_data), and the agent-based model is available on our code GitHub (https://github.com/masonyoungblood/CattleBrandABM).

References

Adams, A. (1906). Cattle brands: A collection of western camp-fire stories. https://www.gutenberg.org/ebooks/12281Google Scholar
American Association of Bovine Practitioners (2020). Summary: AABP Animal Welfare Committee Branding Working Group activities.Google Scholar
Arnold, O. (1965). Irons in the fire: Cattle brand lore. Abelard-Schuman.Google Scholar
Barrett, B. J. (2019). Equifinality in empirical studies of cultural transmission. Behavioural Processes, 161, 129138. https://doi.org/10.1016/j.beproc.2018.01.011CrossRefGoogle ScholarPubMed
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 148. https://doi.org/10.18637/jss.v067.i01CrossRefGoogle Scholar
Bird Rondeau, J. J. (2013). Communicating social identity: The sensory performance of cattle branding. University of Calgary.Google Scholar
Blum, M. G. B., & François, O. (2010). Non-linear regression models for approximate Bayesian computation. Statistical Computing, 20, 6373. https://doi.org/10.1007/s11222-009-9116-0CrossRefGoogle Scholar
Blum, M. G. B., Nunes, M. A., Prangle, D., & Sisson, S. A. (2013). A comparative review of dimension reduction methods in approximate Bayesian computation. Statistical Science, 28(2), 189208.CrossRefGoogle Scholar
Brand, D. D. (1961). The early history of the range cattle industry in northern Mexico. Agricultural History, 35(3), 132139.Google Scholar
Brantingham, P. J. (2003). A neutral model of stone raw material procurement. American Antiquity, 68(3), 487509. https://doi.org/10.2307/3557105CrossRefGoogle Scholar
Bürkner, P. C. (2017). brms: An R package for Bayesian multilevel models using Stan. Journal of Statistical Software, 80(1). https://doi.org/10.18637/jss.v080.i01CrossRefGoogle Scholar
Chang, L. Y., Chen, Y. C., & Perfetti, C. A. (2018). GraphCom: A multidimensional measure of graphic complexity applied to 131 written languages. Behavior Research Methods, 50(1), 427449. https://doi.org/10.3758/s13428-017-0881-yCrossRefGoogle ScholarPubMed
Chao, A., Chazdon, R. L., Colwell, R. K., & Shen, T. J. (2006). Abundance-based similarity indices and their estimation when there are unseen species in samples. Biometrics, 62(2), 361371. https://doi.org/10.1111/j.1541-0420.2005.00489.xCrossRefGoogle ScholarPubMed
Chao, A., Gotelli, N. J., Hsieh, T. C., Sander, E. L., Ma, K. H., Colwell, R. K., & Ellison, A. M. (2014). Rarefaction and extrapolation with Hill numbers: A framework for sampling and estimation in species diversity studies. Ecological Monographs, 84(1), 4567. https://doi.org/10.1890/13-0133.1CrossRefGoogle Scholar
Crema, E. R., Edinborough, K., Kerig, T., & Shennan, S. J. (2014). An approximate Bayesian computation approach for inferring patterns of cultural evolutionary change. Journal of Archaeological Science, 50, 160170. https://doi.org/10.1016/j.jas.2014.07.014CrossRefGoogle Scholar
Crema, E. R., Kandler, A., & Shennan, S. (2016). Revealing patterns of cultural transmission from frequency data: Equilibrium and non-equilibrium assumptions. Scientific Reports, 6(December), 110. https://doi.org/10.1038/srep39122CrossRefGoogle ScholarPubMed
de Boer, B. (2005). Evolution of speech and its acquisition. Adaptive Behavior, 13(4), 281292. https://doi.org/10.1177/105971230501300405CrossRefGoogle Scholar
de Boer, B. (2015). Biology, culture, evolution and the cognitive nature of sound systems. Journal of Phonetics, 53, 7987. https://doi.org/10.1016/j.wocn.2015.07.001CrossRefGoogle Scholar
Flemming, E. S. (2002). Auditory representations in phonology. Routledge.Google Scholar
Garvey, R. (2018). Current and potential roles of archaeology in the development of cultural evolutionary theory. Philosophical Transactions of the Royal Society B: Biological Sciences, 373(1743). https://doi.org/10.1098/rstb.2017.0057CrossRefGoogle ScholarPubMed
GracyII, D. B. (2016). A cowman's-eye view of the information ecology of the Texas cattle industry from the Civil War to World War I. Information & Culture, 51(2), 164191. https://doi.org/10.7560/IC51202Google Scholar
Hill, M. O. (1997). An evenness statistic based on the abundance-weighted variance of species proportions. Oikos, 79(2), 413416. https://doi.org/10.2307/3546027CrossRefGoogle Scholar
Insoll, T., Clack, T., & Rege, O. (2015). Mursi ox modification in the Lower Omo Valley and the interpretation of cattle rock art in Ethiopia. Antiquity, 89(343), 91105. https://doi.org/10.15184/aqy.2014.31CrossRefGoogle Scholar
Keen, J. (2013, April). Cattle thefts spur new interest in livestock branding. USA Today. https://eu.usatoday.com/story/news/nation/2013/04/13/cattle-theft-branding/2079323/Google Scholar
Kelly, P., Winters, J., Miton, H., & Morin, O. (2021). The predictable evolution of letter shapes: An emergent script of West Africa recapitulates historical change in writing systems. Current Anthropology, 62(6). https://doi.org/10.1086/717779CrossRefGoogle Scholar
Lachlan, R. F., Ratmann, O., & Nowicki, S. (2018). Cultural conformity generates extremely stable traditions in bird song. Nature Communications, 9. https://doi.org/10.1038/s41467-018-04728-1CrossRefGoogle ScholarPubMed
Landais, E. (2001). The marking of livestock in traditional pastoral societies. Revue Scientifique et Technique, 20(2), 463479. http://dx.doi.org/10.20506/rst.20.2.1286Google ScholarPubMed
Lewontin, R. C. (1995). The detection of linkage disequilibrium in molecular sequence data. Genetics, 140(1), 377388. https://doi.org/10.1093/genetics/140.1.377CrossRefGoogle ScholarPubMed
Liljencrants, J., & Lindblom, B. (1972). Numerical simulation of vowel quality systems: The role of perceptual contrast. Language, 48(4), 839862. https://doi.org/10.2307/411991CrossRefGoogle Scholar
Lombard, C. G., & Du Plessis, T. (2016). Beyond the branding iron: Cattle brands as heritage place names in the state of Montana. Names, 64(4), 224233. https://doi.org/10.1080/00277738.2016.1223119CrossRefGoogle Scholar
Lombard, C. G., & Du Plessis, T. (2019). The socio-onomastic features of American cattle brands. Names. https://doi.org/10.1080/00277738.2018.1490517CrossRefGoogle Scholar
Lyman, L. R. (2003). The influence of time averaging and space averaging on the application of foraging theory in zooarchaeology. Journal of Archaeological Science, 30(5), 595610. https://doi.org/10.1016/S0305-4403(02)00236-4CrossRefGoogle Scholar
Lynch, M., & Robbins, L. H. (1977). Animal brands and the interpretation of rock art in East Africa. Current Anthropology, 18(3), 538539.CrossRefGoogle Scholar
Madsen, M. E. (2012). Unbiased cultural transmission in time-averaged archaeological assemblages. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.2037622CrossRefGoogle Scholar
McInnes, L., Healy, J., & Melville, J. (2018). UMAP: Uniform manifold approximation and projection for dimension reduction. ArXiv. https://arxiv.org/abs/1802.03426Google Scholar
Miton, H., & Morin, O. (2019). When iconicity stands in the way of abbreviation: No Zipfian effect for figurative signals. PLoS ONE, 14(8), 119. https://doi.org/10.1371/journal.pone.0220793CrossRefGoogle ScholarPubMed
Miton, H., & Morin, O. (2021). Graphic complexity in writing systems. Cognition, 214, 104771. https://doi.org/10.1016/j.cognition.2021.104771CrossRefGoogle ScholarPubMed
Morin, O., Kelly, P., & Winters, J. (2020). Writing, graphic codes, and asynchronous communication. Topics in Cognitive Science, 12(2), 727743. https://doi.org/10.1111/tops.12386CrossRefGoogle ScholarPubMed
Morin, O., & Miton, H. (2018). Detecting wholesale copying in cultural evolution. Evolution and Human Behavior, 39(4), 392401. https://doi.org/10.1016/j.evolhumbehav.2018.03.004CrossRefGoogle Scholar
Newman, R. (2007). A guide to best practice husbandry in beef cattle: Branding, castrating and dehorning. Meat & Livestock Australia.Google Scholar
Pavlek, B., Winters, J., & Morin, O. (2019). Ancient coin designs encoded increasing amounts of economic information over centuries. Journal of Anthropological Archaeology, 56, 101103. https://doi.org/10.1016/j.jaa.2019.101103CrossRefGoogle Scholar
Perreault, C. (2018). Time-averaging slows down rates of change in the archaeological record. Journal of Archaeological Method and Theory, 25(3), 953964. https://doi.org/10.1007/s10816-018-9364-4CrossRefGoogle Scholar
Perreault, C. (2019). The Quality of the Archaeological Record. The University of Chicago Press.CrossRefGoogle Scholar
Porčić, M. (2015). Exploring the effects of assemblage accumulation on diversity and innovation rate estimates in neutral, conformist, and anti-conformist models of cultural transmission. Journal of Archaeological Method and Theory, 22(4), 10711092.CrossRefGoogle Scholar
Premo, L. S. (2014). Cultural transmission and diversity in time-averaged assemblages. Current Anthropology, 55(1), 105114. https://doi.org/10.1086/674873CrossRefGoogle Scholar
Premo, L. S. (2021). Population size limits the coefficient of variation in continuous traits affected by proportional copying error (and why this matters for studying cultural transmission). Journal of Archaeological Method and Theory, 28(2), 512534. https://doi.org/10.1007/s10816-020-09464-9CrossRefGoogle Scholar
Probst, P., Wright, M., & Boulesteix, A.-L. (2018). Hyperparameters and tuning strategies for random forest. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery. https://doi.org/10.1002/widm.1301Google Scholar
Pudlo, P., Marin, J. M., Estoup, A., Cornuet, J. M., Gautier, M., & Robert, C. P. (2016). Reliable ABC model choice via random forests. Bioinformatics, 32(6), 859866. https://doi.org/10.1093/bioinformatics/btv684CrossRefGoogle ScholarPubMed
Raynal, L., Marin, J.-M., Pudlo, P., Ribatet, M., Robert, C. P., & Estoup, A. (2019). ABC random forests for Bayesian parameter inference. Bioinformatics, 35(10), 17201728. https://doi.org/10.1093/bioinformatics/bty867CrossRefGoogle ScholarPubMed
Roswell, M., Dushoff, J., & Winfree, R. (2021). A conceptual guide to measuring species diversity. Oikos, 130(3), 321338. https://doi.org/10.1111/oik.07202CrossRefGoogle Scholar
Schroeder, P. J., & Jenkins, D. G. (2018). How robust are popular beta diversity indices to sampling error. Ecosphere, 9(2). https://doi.org/10.1002/ecs2.2100CrossRefGoogle Scholar
Shannon, C. E. (1951). Prediction and entropy of printed English. Bell System Technical Journal, 30(1), 5064. https://doi.org/10.1002/j.1538-7305.1951.tb01366.xCrossRefGoogle Scholar
Sisson, S. A., Fan, Y., & Beaumont, M. A. (2018). Handbook of approximate Bayesian computation. CRC Press. https://doi.org/10.1201/9781315117195CrossRefGoogle Scholar
Smith, K. (2020). How culture and biology interact to shape language and the language faculty. Topics in Cognitive Science, 12(2), 690712. https://doi.org/10.1111/tops.12377CrossRefGoogle ScholarPubMed
Smith, K., & Kirby, S. (2008). Cultural evolution: Implications for understanding the human language faculty and its evolution. Philosophical Transactions of the Royal Society B, 363(1509), 35913603. https://doi.org/10.1098/rstb.2008.0145CrossRefGoogle ScholarPubMed
Stamp, J. (2013, April). Decoding the range: The secret language of cattle branding. Smithsonian Magazine. https://www.smithsonianmag.com/arts-culture/decoding-the-range-the-secret-language-of-cattle-branding-45246620/Google Scholar
Tamariz, M., & Kirby, S. (2015). Culture: Copying, compression, and conventionality. Cognitive Science, 39(1), 171183. https://doi.org/10.1111/cogs.12144CrossRefGoogle ScholarPubMed
Thomas, F. J. (1967). Mission cattle brands. Ten Finger Press.Google Scholar
Tomašových, A., & Kidwell, S. M. (2010). The effects of temporal resolution on species turnover and on testing metacommunity models. American Naturalist, 175(5), 587606. https://doi.org/10.1086/651661CrossRefGoogle ScholarPubMed
Tran, N.-H., Waring, T., Atmaca, S., & Beheim, B. A. (2021). Entropy trade-offs in artistic design: A case study of Tamil kolam. Evolutionary Human Sciences, 3(e23), 114. https://doi.org/10.1017/ehs.2021.14CrossRefGoogle Scholar
van der Helm, P. A. (2014). Simplicity in vision: A multidisciplinary account of perceptual organization. Cambridge University Press.CrossRefGoogle Scholar
van der Moezel, K. V. J. (2016). Of marks and meaning: A palaeographic, semiotic–cognitive, and comparative analysis of the identity marks from Deir El-Medina. Universiteit Leiden.Google Scholar
Wilder, B., & Kandler, A. (2015). Inference of cultural transmission modes based on incomplete information. Human Biology, 87(3), 193204.CrossRefGoogle ScholarPubMed
Wolfenstine, M. R. (1970). Manual of Brands and marks. University of Oklahoma Press.Google Scholar
Youngblood, M., & Lahti, D. (2022). Content bias in the cultural evolution of house finch song. Animal Behaviour, 185, 3748. https://doi.org/10.1016/j.anbehav.2021.12.012CrossRefGoogle Scholar
Youngblood, M., Stubbersfield, J. M., Morin, O., Glassman, R., & Acerbi, A. (2021). Negativity bias in the spread of voter fraud conspiracy theory tweets during the 2020 US election. PsyArXiv. https://doi.org/10.31234/osf.io/2jksgGoogle Scholar
Figure 0

Figure 1. Examples of two cattle brands registered in the US state of Kansas in 1884 (https://www.kansasmemory.org/item/309140/). Both brands are composed of two components: on the left a stylised ‘N’ and ‘P’, and on the right a rotated ‘(‘ and ‘S’).

Figure 1

Figure 2. The posterior distributions for the two parameters of the ABM computed with random forest ABC, plotted against the priors (dotted).

Figure 2

Table 1. The median estimates, 95% quantile-based credible intervals, and root mean square log/logit-transformed errors for each parameter. λ controls the shape of the Poisson distribution from which the number of components in new brands is drawn, and C is the exponent to which the components frequencies are raised.

Figure 3

Table 2. The mean estimates and 95% credible intervals for each parameter in the best fitting models for the time-mixed (left) and space-mixed (right) subsets

Figure 4

Figure 3. The accuracy of the shuffling model's predictions when applied to a structured (left panel), time-mixed and space-mixed (right panel) version of Y-SE-2, the dataset with the highest sample size. Each point above the diagonal is a unique combination of components, and components on both axes are ranked by their commonness in the entire dataset. Green is a true positive (exists and S ≥ 1), blue is a true negative (does not exist and S < 1), orange is a false positive (does not exist and S ≥ 1), and yellow is a false negative (exists and S < 1).

Figure 5

Table 3. The mean estimates and 95% credible intervals for each parameter in the best fitting model.

Figure 6

Figure 4. (a) A basic simulation of cultural transmission over three timesteps, with a fully-connected population of 30 agents transmitting two variants (A and B) with a conformity bias (C = 2). A and B are equally represented in the first timestep. The bar plot in the top right shows the frequencies of A and B in t3, and the bar plot in the top left shows the frequencies of A and B in a time-mixed subsample of the same size from t1 to t3. The dashed lines show the expected frequencies under random copying. (b) The simulated component frequency distributions from the cattle brand ABM under several conditions (five iterations each). Orange and blue are the frequency distributions from 2016 when C = 0 (frequency information does not matter) and C = 10 (extreme conformity), respectively. Yellow and green are the frequency distributions from time-mixed subsamples of the same size collected across all years when C = 0 and C = 10, respectively. The black dashed line is the frequency distribution expected when C = 1 (random copying).

Supplementary material: PDF

Youngblood et al. supplementary material

Youngblood et al. supplementary material

Download Youngblood et al. supplementary material(PDF)
PDF 1.2 MB