Hostname: page-component-848d4c4894-sjtt6 Total loading time: 0 Render date: 2024-06-30T09:10:35.617Z Has data issue: false hasContentIssue false

Evolutionary Traits and Genomic Surveillance of SARS-CoV-2 in South America

Published online by Cambridge University Press:  01 January 2024

Pablo A. Ortiz-Pineda*
Affiliation:
Laboratory of Molecular Biology and Genomics, InnovaGen Foundation, Popayán, Colombia
Carlos H. Sierra-Torres
Affiliation:
Laboratory of Molecular Biology and Genomics, InnovaGen Foundation, Popayán, Colombia Human Genetics Laboratory, Department of Physiological Sciences, Faculty of Health Sciences, University of Cauca, Popayán, Colombia
*
Correspondence should be addressed to Pablo A. Ortiz-Pineda; portiz@fig.org.co

Abstract

Since the zoonotic event from which SARS-CoV-2 started infecting humans late in 2019, the virus has caused more than 5 million deaths and has infected over 500 million people around the world. The pandemic has had a severe impact on social and economic activities, with greater repercussions in low-income countries. South America, with almost 5% of the world’s population, has reckoned with almost a fifth of the total people infected and more than 26% (>1/4) of the deceased. Fortunately, the full genome structure and sequence of SARS-CoV-2 have been rapidly obtained and studied thanks to all the scientific efforts and data sharing around the world. Such molecular analysis of SARS-CoV-2 dynamics showed that rates of mutation, similar to other members of the Coronaviridae family, along with natural selection forces, could result in the emergence of new variants; few of them might be of high consequence. However, this is a serious threat to controlling the pandemic and, of course, enduring the process of returning to normalization with the implicit monetary cost of such a contingency. The lack of updated knowledge in South America justifies the need to develop a structured genomic surveillance program of current and emerging SARS-CoV-2 variants. The modeling of the molecular events and microevolution of the virus will contribute to making better decisions on public health management of the pandemic and developing accurate treatments and more efficient vaccines.

Type
Review Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © 2022 Pablo A. Ortiz-Pineda and Carlos H. Sierra-Torres.

1. Introduction

SARS-CoV-2 is a new single-stranded RNA virus that was reported to cause coronavirus disease (COVID-19) early in December 2019 and was declared a pandemic by the World Health Organization (WHO) in March 2020 [Reference Khailany, Safdar and Ozaslan1]. This virus is actually causing the fourth wave of massive infections in several countries in which the most affected are the United States, India, Brazil, Russia, France, United Kingdom, Turkey, Argentina, Colombia, and Spain, according to WHO and worldwide news. By August 2021, WHO reported that more than 250 million people around the world had been infected with SARS-CoV-2 and more than 5.1 million people had died from COVID-19 (https://www.who.int/home). This pandemic is showing a significant adverse impact on social and economic activities and as expected, it has been most noticeable in several low-income countries (LICs) [Reference Brasher, Chikumba, McDougall, Jannicke and Angela2, Reference Uddin, Mustafa and Rizvi3].

Coronaviruses are enveloped positive-sense RNA viruses that contain the largest RNA genomes known to date within the Coronaviridae family [Reference Khailany, Safdar and Ozaslan1, Reference Abdelrahman, Li and Wang4]. SARS-CoV-2 is a very close relative of the SARS-CoV virus, which caused its own epidemic between November 2002 and May 2004. The SARS-CoV epidemic resulted in 8,096 people infected and 774 people dead, with a fatality rate of between 10 and 16% [Reference Luk, Li, Fung, Lau and Woo5, Reference Anderson, Fraser and Ghani6]. Another well-known family member is MERS-CoV, a virus responsible for the current epidemic of Middle East Respiratory Syndrome (MERS). Commonly known as camel flu, it began in 2012, infected 2,040 people, and showed a fatality rate estimated at 34% [Reference Mackay and Arden7]. SARS-CoV, SARS-CoV-2, and MERS-CoV belong to the Betacoronaviridae subfamily. SARS-CoV-2 shares a 79.6% sequence identity with SARS-CoV [Reference Zhou, Yang and Wang8] and 50% with MERS-CoV [Reference Lu, Zhao and Li9]. It is also notable that SARS-CoV-2 reveals a 96.2% homology with BatCov RaTG13, a bat coronavirus, which might suggest its zoonotic origins [Reference Zhou, Yang and Wang8]. A hypothesis of an intermediate host mammal, the pangolin, was considered; but maximum-likelihood phylogenies and experimental approaches showed that the pangolin virus has a very low affinity for the human ACE2 receptor, suggesting a direct transmission from bats [Reference Giovanetti, Benedetti and Campisi10, Reference Liu, Jiang and Wan11].

Coronavirus ssRNA genomes are fast-changing entities with relatively high mutation rates in the virus superkingdom. Genetic surveillance and real-time tracking of such diversity are necessary because changes in the SARS-CoV-2 sequence are closely related to epidemiology impact: (1) the infectivity rate, (2) the mortality rate of infected people, and (3) the effectiveness of existing vaccines [Reference Yuen, Ye, Fung, Chan and Jin12, Reference Kadam, Sukhramani, Bishnoi, Pable and Barvkar13].

In an effort never seen before in the history of science, the SARS-CoV-2 RNA genome was sequenced rapidly and in huge numbers to facilitate both diagnostic testing and genetic surveillance of the virus [Reference Oliveira, Oliveira, Sabino and Okay14]. Additionally, studying the molecular aspects of the disease and the virus lets researchers understand the key molecular events that resulted in the pandemic, allowing them to make decisions and prognoses about what might happen to humans. This includes critical questions such as how is SARS-CoV-2 transmitted; how pathogenic is the virus in tertiary and quaternary spreading to humans; what are the mechanisms of virus shedding in asymptomatic and presymptomatic people; how COVID-19 should be diagnosed with effective and available procedures; how COVID-19 should be treated; what is the origin of SARS-CoV-2 and COVID-19 [Reference Yuen, Ye, Fung, Chan and Jin12].

Genomic surveillance in Latin America is not only important but urgent, especially because the rate of infections is the highest proportional to the population. There are approximately 422 million people in South America, which roughly corresponds to 5% of the world population. Approximately 40 million out of the 250 million people infected are inhabitants of South America. This is more than 17% of infections worldwide and is the same number of infected people in India, a country with 1.37 billion people (https://www.worldometers.info/coronavirus/).

Another perspective to better understand the severity of the pandemic in South America is a comparison of the tested positive cases per million inhabitants. In Argentina (>116 k), Uruguay (>110 k), Brazil (>102 k), Colombia (>97 k), Chile (>90 k), Peru (>67 k), and Paraguay (>63 k), the prevelance of infection was 3 to 5 times greater than the proportion of infections in India (>23 k) and with approximately 30% of the corresponding population, according to the statistics from the World Health Organization (https://www.who.int/home). Of course, these high numbers of infectious rates are the result of the wrong, insufficient, and neglected policies administered in each country.

Latin America has limited economic resources not only in infrastructure but also in competent people to process, analyze, and decode these data from a public health perspective. In LICs, such limitations constrain the performance of genomic surveillance compared to those of developed countries [Reference Inzaule, Tessema, Kebede, Ogwell Ouma and Nkengasong15Reference Laiton-Donato, Villabona-Arenas and Usme-Ciro17]. These limitations are reflected in the 122,120 genomes reported to GISAID (global initiative on sharing all influenza data) from South America. This figure represents only 2.3% of the more than 5 million genomes reported; it should be noted that Brazil submitted 73,801 (61%) of the total.

In Colombia, over 8,000 genome sequences have been reported, representing 6% of all South American sequences and 0.14% of all GISAID sequences [Reference Elbe and Buckland-Merrett18]. Venezuela, a country with limited epidemiological information, has only 172 sequences, corresponding to 0.14% of South American information and 0.0032% of world genomes. These gaps created blind spots for fully understanding the SARS-CoV-2 emerging diversity, limiting our ability to develop appropriate vaccines and efficient treatments.

The primary purpose of this review is to provide specific information about the sources of genomic diversity and the dynamics of viral microevolution that might help to understand the epidemiology and virulence of the SARS-CoV-2 and possible future coronaviruses. Moreover, we call for urgent genomic surveillance in Latin America, a region with particular SARS-CoV-2 dynamics currently spreading global and endemic variants that might have serious implications for virulence (pathology) and people’s health.

2. Molecular Aspects and Microevolution of the SARS-CoV-2

According to the NCBI reference sequence (NC_045512.2), the SARS-CoV-2 RNA genome contains nearly 29,903 base pairs (bp) [Reference Wu, Zhao and Yu19]. It is composed of 8 open reading frames (ORFs) and 4 structural genes: (1) the spike glycoprotein (S), typical of the Coronaviridae family, (2) the envelope protein (E), (3) the membrane protein (M), and (4) the nucleocapsid protein (N) distributed along the RNA genome and interlaced with the ORFs. Additionally, ORF1ab (21,555 nt) occupies 72% of the whole genome and encodes 16 nonstructural proteins (nsp) involved in proteolysis, replication, and adaptation to a new host [Reference Naqvi, Fatima and Mohammad20]. The remaining ORFs (3a, 3b, 6, 7a, 7b, 8, 10) are accessory proteins involved in virion assembly but with functions still under investigation [Reference Rahimi, Mirzazadeh and Tavakolpour21].

There are two main concerns about the source of variation in the SARS-CoV-2 genomes. First, we must understand that the more the population is infected, the higher the chances of finding a mutation are. As there are many copies of the virus, there is a proportional probability of a mutation occurring that would increase its virulence. This can be evidenced by the appearance of new variants in the middle of waves of infections in Europe, South Africa, the United States, Brazil, and now India [Reference Makoni22Reference Voloch, da Silva Francisco and de Almeida24]. The second aspect of concern is related to the natural process of emerging mutations. All DNA or RNA organisms are susceptible to nucleotide changes (substitutions, insertions, deletions, etc.) that could produce phenotypical changes [Reference Rehman, Shafique, Ihsan and Liu25]. Most of the changes are neutral (synonym mutations) and do not affect the phenotype. Some others might damage the viral structure or the capacity of the virus to reinfect, but natural selection gets rid of them. In contrast, the last few mutations are the ones that confer an adaptive response and make them dominant by the phenotype associated with them (i.e., more infectious) [Reference Volz, O’Toole and Bull26, Reference Becerra-Flores and Cardozo27]. For example, gaining affinity for the human angiotensin-converting enzyme 2 (ACE2) receptor opens the door for SARS-CoV-2 in human cells [Reference Kai and Kai28]. Humans are not immune to this source of variation; although the rate of change is much slower, some changes in the ACE2 gene translate into ACE2 protein polymorphism, which is now evidence of susceptibility to SARS-CoV-2 infection and COVID-19 severity [Reference Ragia and Manolopoulos29Reference Dong, Zhang and Ma32].

In the year and a half since the virus has been spreading around the world, there have been reports and evidence that new variants are appearing around the world. According to the WHO and the US CDC (Centers for Disease Control), variations can be divided into three categories: (1) VOI (variants of interest), (2) VOC (variants of concern), and (3) VOHC (variants of high consequence) [33]. VOI are the ones with specific genetic markers that have been associated with changes in virulence. VOCs are variants for which there is experimental evidence of an increase in virulence (pathogenicity) and/or vaccine/treatment resistance. VOHC are the ones that pose a serious health threat, making available treatments useless [33]. Finally, the variants recently discovered with significant spreading capacities and with a potential menace are called variants under investigation (VUI). All of these variants are the result of evolutionary force imbalances that ponder how to increase diversity.

Genetic drift and selection pressures will limit the diversity of SARS-CoV-2 by limiting the virus’s spread or survival due to random events or biosafety protocols. In contrast, the source of variation in any virus comes from mutation and recombination occurring in every event of infection and replication [Reference Rehman, Shafique, Ihsan and Liu25, Reference Domingo34].

Cumulative mutations in SARS-CoV-2 are directly correlated with five pandemic alarms: (1) an increase in disease severity and mortality risk; (2) the speed of transmission from human to human; (3) the effectiveness of available vaccines; (4) the effectiveness of diagnostic tests; (5) susceptibility to treatments [Reference Yuen, Ye, Fung, Chan and Jin12, Reference Choudhary, Sreenivasulu, Mitra, Misra and Sharma35Reference Muñoz37]. All these trepidations are examined under genomic surveillance, which consists of deep analysis to determine how sequence changes reflect variations in the phenotype that modify the mechanism to detect and neutralize the infection.

3. Nomenclature of SARS-CoV-2

Due to the fast spread of diversity observed in SARS-CoV-2, the need to name the lineages has emerged. This is evident in the nomenclature used by the main databases or organizations in charge of surveillance, like WHO, GISAID, and Nextstrain. Initially, variants were named according to the place where they were first reported (i.e., the UK, South Africa, Brazil, or India). A dynamic method was later proposed based on the use of a phylogenetic framework to track lineages that contribute the most to an active spread [Reference Rambaut, Holmes and O’Toole38]. The two main databases for SARS-CoV-2 genomes and genomic epidemiology, GISAID and Nextstrain, respectively, settled on their own nomenclature [Reference Elbe and Buckland-Merrett18, Reference Hadfield, Megill and Bell39]. To make it easier and more practical to be discussed by nonscientific audiences, the World Health Organization recommended on May 30, 2021, using letters of the Greek Alphabet to denote variants of concern and interest [40]. Table 1 compiles lineages, the place where they were first identified, and important characteristics of variants. Until June 21, 2021, the group of VOCs was comprised of five variants: Alpha (UK B.1.1.7), Beta (South Africa B.1.351), Gamma (Brazil P.1), Delta (India B.1.617.2), and Omicron (South Africa B.1.1.529). Also, there are two VOIs under research: Lambda (Peru C.37) and Mu (Colombia B.1.621). Until now, no VOHC has been reported.

Table 1 Nomenclature of SARS-CoV-2 variants and noticeable pathological characteristics.

UI: under investigation. Our own source. Data from OMS, CDC, ECDC, and Outbreak.info.

When comparing the variants, it is noticeable that a large number of all mutations are located in the spike (S) protein. Being the S protein of the coronavirus, the main determinant of host and tissue tropism, it is also the target of vaccines, neutralizing antibodies and inhibitors of virus entry [Reference Pomplun41, Reference Hu, He and Gao42]. Furthermore, mRNA vaccines currently in use worldwide were designed to transcribe fragments of this protein. The S gene, which consists of 3,821 nucleotides coding for a protein of 1,273 amino acids, forms a trimeric spike (subunits S1, S2, and S2′) on the virion surface and plays an essential role in viral entry [Reference Plante, Liu and Liu43]. Coronaviruses may use two different pathways to enter the host cell: (1) the protease-mediated cell surface pathway and (2) the endosomal pathway [Reference Hu, He and Gao42, Reference Zhou, Vedantham and Lu44]. According to Hu and colleagues, the spike proteins of several coronaviruses are cleaved by host proteases in the S1 subunit for receptor binding and the S2 subunit for membrane fusion in the entry step of infection [Reference Ragia and Manolopoulos29, Reference Hu, He and Gao42]. Similar to SARS-CoV, the cellular receptor for SARS-CoV-2 is angiotensin-converting enzyme 2 (ACE2). However, the SARS-CoV-2 S protein has up to a 20-fold higher affinity for ACE2 than its counterpart, the SARS-CoV spike protein [Reference Kai and Kai28].

A recent study showed that a new serine protease (elastase-2) was introduced when a glycine replaced aspartic acid at position 614 of the spike protein [Reference Bhattacharyya, Das and Ghosh45]. Experimental studies have shown that it enhances viral replication in human lung epithelial cells and primary human airway tissues by increasing the infectivity and stability of virions. This is currently the dominant mutation in all VOC and VOI [Reference Volz, O’Toole and Bull26, Reference Plante, Liu and Liu43, Reference Mullen, Tsueng and Abdel Latif46]. It is still uncertain whether the D614G (Asp614Gly) mutation affects the antigenic properties of protein S, although there is the possibility of positive natural selection. With the gain of transmissibility and the absence of preexisting immunity in the general population, the chances for this mutation to disappear seem remote, and it is not known whether SARS-CoV-2 is fully adapted for efficient growth in human cells [Reference Giovanetti, Benedetti and Campisi10]. Supported by robust sequencing and correlation with abundance, with reference to all sequences obtained at specific periods of time, one can determine if a new mutation is likely to become dominant.

4. The SARS-CoV-2 Mutation Rate

Because of the chemistry of the molecule, single-stranded RNA viruses have, on average, a higher mutation rate than DNA viruses. However, large coronavirus genomes such as SARS-CoV-2 are relatively stable thanks to a proofreading mechanism that operates during replication (nsp14 exonuclease) [Reference Giovanetti, Benedetti and Campisi10, Reference van Dorp, Acman and Richard47]. Still, many mutations occur, generating stable variants like de D614G (Asp614Gly) that become dominant in about 6 months. Studies have shown that mutations occur more frequently in some regions of the SARS-CoV-2 genome [Reference Rahimi, Mirzazadeh and Tavakolpour21, Reference Roy, Mandal and Mondal48]. When measuring point mutations in the whole viral genome, Roy and colleagues determined the mutation frequency in μ = 9.4 × 10−6, which corresponds to 20,163 polymorphisms detected/(29,903 nucleotide genome size × 71,703 sequences analyzed in the dataset). When measuring only the nsp region (ORF1ab), the rate was 8.78 × 10−6. Within this region, the genes nsp1 and nsp2 have the highest rates of mutation at 1.12 × 10−5 and 1.08 × 10−5, respectively. These two are leader genes to inhibit immune response: nsp1 inhibits protein translation by blocking 40S ribosome and interferon (IFN) signaling, while nsp2 inhibits prohibitins 1 and 2 to disrupt the cellular environment [Reference Yoshimoto49]. Mutations in these two genes might confer novel viral outcomes to evade the host’s immunogenic response.

Compared to other studies, the mutation rate in the SARS-CoV genome of family members was estimated to be 0.80–2.38 × 10−3 nucleotide substitutions per site/year, which is in the same order of magnitude as other RNA viruses when a time scale is involved [Reference Zhao, Li and Wu50]. For SARS-CoV-2, the moderate accumulation of changes observed in a year was approximately 6 × 10−4 nucleotides/genome/year [Reference van Dorp, Acman and Richard47]. Compared to those high rates, accessory genes ORF7a and ORF3a are 1.37 × 10−5 and 1.35 × 10−5 nucleotides/genome/year, respectively. ORF7a, an accessory protein, is thought to be involved in viral assembly or budding events specific to SARS-CoVs [Reference Yoshimoto49]. The accumulation of nonsynonymous mutations in this gene may provide new molecular options for increasing virulence efficiency [Reference Roy, Mandal and Mondal48]. On the other hand, ORF3a has been reported to have proapoptotic activity through mitochondrial damage and activating inflammatory responses of host cells [Reference Yue, Nabar and Shi51]. The high rate of mutations in this gene could be interpreted as a devious strategy of the virus to finish its life cycle and kill the host cell. Genomic data can show how viral pathogens have responded to different forces of natural selection. In a model of codons, natural selection acting over any locus can be estimated using the proportion of nonsynonymous (dN) and synonymous (dS) mutations (dN/dS). Values of dN/dS > 1 can be interpreted as a positive natural selection because diversity arises. In contrast, negative selection or selective removal of the alleles that are deleterious results in dN/dS < 1 [Reference Roy, Mandal and Mondal48, Reference Kryazhimskiy and Plotkin52].

When comparing genes within the SARS-CoV-2 genome, recent studies have found that all nsp genes (except nsp11), S (spike), and M (membrane) are under negative (purifying) selection (dN/dS < 1) [Reference Roy, Mandal and Mondal48]. This is reasonable because these are the genes (nsps, S and M) in which the host immune response is active. On the other hand, accessory proteins ORFs 3a, 6, 7a, 8, and 10, structural proteins E (envelope), and N (nucleocapsid) genes are under positive selection (dN/dS > 1), having different forms, arise in the possible new variants.

5. Mutations Related to VOC and VOI

To understand what is happening with the distribution of variants in South America, it is necessary to first understand what set of mutations is linked to each variant. Based on the SARS-CoV-2 genome data from GISAID, the platform Outbreak.info, updated daily, displays graphic information that correlates the presence of certain detected mutations with VOI and VOCs [Reference Mullen, Tsueng and Abdel Latif46].

Similar to variant classification, mutations are also considered: mutations under observation (MUO), mutations of interest (MOI), and mutations of concern (MOC), sorted according to experimental observations in phenotypic changes related to virulence (pathogenicity) in SARS-CoV-2. We already mentioned D614G (Asp614Gly), a mutation now surviving in all VOI, VOC, and VUO. The current mutation of concern is S: E484K (Glu484Lys). A mutation of G > A in position 23,012 that changes a glutamic acid (E) for Lysine (K). This mutation is present in VOC Beta (B.1.351), Gamma (P.1), and Mu (B.1.621). This mutation increased dominance in Brazil [Reference Ferrareze, Franceschi, Mayer, Caldana, Zimerman and Thompson53] and recent studies have shown reduced neutralization by immune reactions, also called “escape mutations” [Reference Jangra54, Reference Wise55], and higher infectivity when in the presence of other mutations [Reference Khan, Zia and Suleman56].

The following mutations are all of interest (MOI), but their impact resides in the synergic functionality with other mutations. K417N (Lys417Asn) and K417T (Lys417Thr) mutations are G > T substitutions at position 22,811 and A > C substitutions at position 22,810, respectively, that change a Lysine (K) for Asparagine (N) or Threonine (T) and have been found in Beta, Gamma, and Omicron variants, and appear to have higher rates of infectivity [Reference Khan, Zia and Suleman56]. Another mutation under the radar is N501Y (Asn501Tyr). This causes an A to become a T at position 23,063, resulting in asparagine (N) for tyrosine (Y). This mutation is present in Alpha, Beta, Gamma, Omicron, and Mu. This last variant was first identified in Colombia and rapidly spread in the northwest region of South America. It is involved in higher infectivity [Reference Khan, Zia and Suleman56] and, mechanistically, the N501Y (Asn501Tyr) substitution improved the affinity of the viral spike protein for cellular receptors, and researchers have suggested this mutation be classified as a MOC [Reference Liu, Liu and Plante57]. To easily visualize MOCs and MOIs with each VOC and VOI, Figure 1 shows the variants currently active in South America and their respective mutations and frequency. Additionally, Table 2 shows the frequency of each variant in every continental country in South America.

Figure 1 Cross reference of variants of concern (VOC) and (VOI) variants of interests (Y-axis) and the occurrence and frequency of point mutations (X axis). Mutations and frequency were measured with >90% prevalence of that specific mutation in at least one lineage. Text in red represent variants/mutations of concern. Yellow: variants/mutations of interest. White text: variants/mutations under surveillance or observation. Graphic modified from outbreak.info website [Reference Mullen, Tsueng and Abdel Latif46] and customized to show only the VOC and VOI currently spreading in South America. By the time of manuscript writing, a few cases of Omicron were present in South America but reached over 10% in South Africa and over 3% in the fourth wave of infections in Europe in the last month. Source: outbreak.info/GISAID.

Table 2 Distribution and frequencies of VOC and VOI in continental South America.

Clades as headers. α = alpha, β = beta, γ = gamma, δ = delta, λ = Lambda, μ = Mu. wt+ = “wild-types”/original variants. N: number of sequences curated in GISAID (29 Nov 2021). Analysis on November 2021, UK: United Kingdom. SA: South Africa, Br: Brazil, In: India, US: United States, Pe: Peru, and Col: Colombia. At the time of manuscript writing, isolated cases of Omicron were reported in Brazil. Countries with * data not available from the last three months. Our own source. Data from: Nextstrain.org.

6. Variant Distribution in South America

To have a broader perspective of what is happening in South America, we can examine the distribution and frequencies of the variants currently identified. Table 2 shows the SARS-CoV-2 variants (VOC and VOI) and frequencies in South American countries together with the amount (number and percentage) of sequences submitted to GISAID by the end of September of 2021. GISAID data shows Delta as the largest widespread, with an almost 90% frequency of all recently sequenced variants [Reference Elbe and Buckland-Merrett18]. Delta increased rapidly from 1% in early June to 19% in late July to 90% in early November. This variant is known for the mutations in the Spike gene: T19R (Thr16Arg), T95I (Thr95Ile), L452R (Leu452Arg), T478K (Thr478Lys), D614G (Asp614Gly), P681R (Pro681Arg), and D950N (Asp950Asn). Similar to variant Mu, they share the mutations T95I (Thr95Ile) and D950N (Asp950Asn) that seem to correspond to the significant increase in infectivity. No data on the severity of the disease, mortality, or vaccine resistance have been linked to these latter mutations yet.

In second place is the Gamma variant, with an almost insignificant 5% compared to the 85% it had in early July. It was first identified on January 6, 2021, in Tokyo, Japan, by travelers from the Brazilian Amazonia [Reference Faria, Claro, Candido, Moyses Franco, Andrade and Coletti58]. It is currently in every continental country in South America. It contains 14 mutations, 10 of which are in the S gene and 5 of those 10 are the mutations: L18F (Leu18Phe), K417T (Lys417Thr), E484K (Glu484Lys), N501Y (Asn501Tyr), and D614G (Asp614Gly). It had reached 96 countries by the end of June 2021. Recent studies have shown a significant reduction in the neutralization of Pfizer and Moderna vaccines in fully dosed people [Reference Garcia-Beltran, Lam and Denis59], and the efficacy of Sinovac vaccines is close to 50%, but exact measurements are still being conducted in studies [Reference Moutinho60].

The third most frequent variant is shared by Lambda (C.37, Nextstrain: 21G) and Mu (B.1.621, Nextstrain: 21H), which are about 3 to 5% frequent each. Lambda (C.37), also known in Latin America as “Variante Andina,” was originally reported in Peru in August 2020 and classified by WHO as VOI on June 14, 2021. The Lambda variant contains six nonsynonymous mutations: G75V (Gly75Val), T76I (Thr76Ile), L452Q (Leu452Gln), F490S (Phe490Ser), D614G (asp614Gly), and T859N (Thr859Asn) in the S gene, plus a novel deletion (Δ246–252) [Reference Robertson61]. Interestingly, by early May of 2021, the Lambda variant (C.37) was close to 93% of the frequency in Peru and reached over 30 countries, including Australia [Reference van Homrigh62]. It is still profuse in 1 out of every 5 cases in Argentina, Bolivia, and Chile. It is not clear if the Lambda variant has implications for vaccine resistance or implications for disease severity, as was speculated in the news.

The other 3% is Mu. It is an outlier of distribution because the variant was found mostly in Colombia, where it counts for 95% of the newly reported cases in June and July 2021 [Reference Laiton-Donato, Franco-Muñoz and Álvarez-Díaz63]; 3% in other South American countries; 2% in the Caribbean islands and the rest of the world where it spread. Inside Colombia, it is still near 9% but has been outspread by the Delta variant. In May 2021, the Mu variant represented 95% of Colombian sequences, and by early August, there was still over 82% [Reference Laiton-Donato, Villabona-Arenas and Usme-Ciro17, Reference Laiton-Donato, Franco-Muñoz and Álvarez-Díaz63]. No evidence of the severity of the disease or symptoms has been reported yet.

The thriving capacity of SARS-CoV-2 to mutate has resulted in, in some cases, convergent evolution and a boost of mutations that are not phylogenetically related. This is, the same mutations are emerging in different places [Reference van Dorp, Acman and Richard47]. As with D950N (Asp950Asn), the mutation G > A at position 24,410 converts an aspartic acid (D) to an asparagine (N). It surfaced almost simultaneously in opposite places of the world: the Delta variant (B.1.617.2) in India and the Mu variant (B.1.621) in Colombia. Studies have shown that in the S gene alone, there is one location with at least 15 recurrent mutations, suggesting convergent evolution and a particular interest in the context of adaptation of the virus to the human host [Reference van Dorp, Acman and Richard47].

Although these data compilations reveal high mutation rates and pressures of selection in favor of the arising diversity of SARS-CoV-2, it is important to mention that these data must be interpreted carefully because some statistical metrics might technically be biased in different ways. On the other hand, the majority of sequences in South America come from just a few research centers or government-funded laboratories in which the collected samples are typically from main urban areas and might not represent the full diversity, especially from regions with difficult access or those near to frontiers with other countries. Secondly, the sequences are being obtained from different platforms. It is well known that PacBio, ION Torrent, and Oxford NanoPore technologies offer longer reads at a quality cost, making them difficult to compare with Illumina or Sanger methods. This highly error-prone platform might show an inaccurate reading of nucleotides as possible point mutations if replicates and controls are not rigorous enough. Third, countries in South America have the lowest speed for updating genomic information in GISAID. By the time this article was written, there was no information about the last three months from Bolivia, Uruguay, Paraguay, and Venezuela.

As a fourth aspect, most of the statistics to measure transmissibility are based on confirmed positive cases; however, only in Germany, studies showed that almost 42% of people are unaware of their current status of infection [Reference Wild64]. An unmeasurable number of people in South America are infected but not officially diagnosed, therefore making the fatality of SARS-CoV-2 much lower and the transmissibility much higher [Reference Irons and Raftery65].

And finally, these molecular dynamics of mutation and variant frequencies are time-sensitive and a time interval as low as 30 days is enough to significantly change the epidemiological landscape. According to GISAID, South American countries show a median of 92 days (equal to 3 months with a range of 27 to 271 days) for the deposition of sequences [Reference Elbe and Buckland-Merrett18]. Compared to the 16 days of developed countries in Europe, South American countries are taking too much time to reach the correct epidemiological and appropriate public measures based on evidence to control the pandemic, thus resulting in massive waves of infection and new variants appearing.

7. Conclusions

This compiled data present irrefutable evidence that Latin America is a special case of the genomic dynamics of SARS-CoV-2. The distribution of all mutations found accounts for over 63% of the whole SARS-CoV-2 genome, representing a high degree of polymorphisms, with mutation and positive natural selection as sources of diversity. This was possible to understand as scientific collaborations were based on data sharing through public databases like GISAID.

However, Latin American reports are somehow incomplete and biased because there are not enough genome sequences generated in proportion to the rates of infection; and the small amount of data reported does not represent all the territory. Additionally, the limited access to samples in geographically challenging locations restrains real-time monitoring of variants’ distribution.

Continuous genomic surveillance to track existing VOC and VOI and the rapid emergence of new variants is critical for guiding and making appropriate public health decisions aimed at limiting the spread and finding effective treatments. Additionally, the phylogenetic information in the evolutionary perspective will give us keys to creating models to predict new epidemics of coronavirus, as has been happening approximately every decade since SARS-CoV in the early 2000s.

This pandemic teaches us that a successful exit from these worldwide circumstances lies in global policies and contributions from all countries. Finally, Latin America urges the world to approach higher standards in technology and human talent to overcome the limitations that are currently happening to track, in real time, the dynamics of COVID-19 and SARS-CoV-2. An immediate solution could be that countries with robust surveillance programs and NGS infrastructure, like Brazil, Chile, and Colombia, start to work with samples provided by neighboring countries to homogenize and keep up the upgraded information with the help of the Pan American Health Organization.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Authors’ Contributions

PAOP contributed to the literature review, construction, and writing of the manuscript. CHST was responsible for literature and manuscript review.

Acknowledgments

The authors express gratitude to members of the Scientific Committee at InnovaGen Foundation: Daniela Carvajal, María V. Londoño, and Yexania Arboleda, for their input and commentaries in preparing this manuscript. Also, special thanks are due to Edna L. Ortiz for the style and language review. Finally, the authors acknowledge the collaboration and support of the Vice Presidency of Research at the University of Cauca. This work is funded by the Colombian Ministry of Science, Technology, and Innovation through funds provided by the General Royalties System (Grant no. BPIN-2020000100086).

References

Khailany, R. A., Safdar, M., and Ozaslan, M., “Genomic characterization of a novel,Gene Reports, vol. 19, 2020.CrossRefGoogle ScholarPubMed
Brasher, C., Chikumba, E., McDougall, R., Jannicke, M.-O., and Angela, E., “The COVID-19 pandemic: effects on low- and middle-income countries,Anesthesia and Analgesia, vol. 131, no. 1, pp. 8692, 2020.Google Scholar
Uddin, M., Mustafa, F., Rizvi, T. A. et al., “SARS-CoV 2/COVID-19: viral genomics, epidemiology, vaccines, and therapeutic interventions,Viruses, vol. 12, no. 5, p. 526, 2020.CrossRefGoogle ScholarPubMed
Abdelrahman, Z., Li, M., and Wang, X., “Comparative review of SARS-CoV 2, SARS-CoV, MERS-CoV, and influenza A respiratory viruses,Frontiers in Immunology, vol. 11, 2020.CrossRefGoogle ScholarPubMed
Luk, H. K. H., Li, X., Fung, J., Lau, S. K. P., and Woo, P. C. Y., “Molecular epidemiology, evolution and phylogeny of SARS coronavirus,Infection, Genetics and Evolution, vol. 71, pp. 2130, 2019.CrossRefGoogle ScholarPubMed
Anderson, R. M., Fraser, C., Ghani, A. C. et al., “Epidemiology, transmission dynamics and control of SARS: the 2002-2003 epidemic,Philosophical Transactions of the Royal Society of London. Series B: Biological Sciences, vol. 359, no. 1447, 2004.CrossRefGoogle ScholarPubMed
Mackay, I. M. and Arden, K. E., “MERS coronavirus: diagnostics, epidemiology and transmission,Virology Journal, vol. 12, no. 1, p. 222, 2015.CrossRefGoogle ScholarPubMed
Zhou, P., Yang, X.-L., Wang, X.-G. et al., “A pneumonia outbreak associated with a new coronavirus of probable bat origin,Nature, vol. 579, no. 7798, pp. 270273, 2020.CrossRefGoogle ScholarPubMed
Lu, R., Zhao, X., Li, J. et al., “Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding,The Lancet, vol. 395, pp. 565574, 2020.CrossRefGoogle ScholarPubMed
Giovanetti, M., Benedetti, F., Campisi, G. et al., “Evolution patterns of SARS-CoV 2: snapshot on its genome variants,Biochemical and Biophysical Research Communications, vol. 538, pp. 8891, 2021.CrossRefGoogle ScholarPubMed
Liu, P., Jiang, J.-Z., Wan, X.-F. et al., “Are pangolins the intermediate host of the 2019 novel coronavirus (SARS-CoV 2)?,PLoS Pathogens, vol. 16, no. 5, Article ID e1008421, 2020CrossRefGoogle ScholarPubMed
Yuen, K.-S., Ye, Z.-W., Fung, S.-Y., Chan, C.-P., and Jin, D.-Y., “SARS-CoV 2 and COVID-19: the most important research questions,Cell & Bioscience, vol. 10, no. 1, p. 40, 2020.CrossRefGoogle ScholarPubMed
Kadam, S. B., Sukhramani, G. S., Bishnoi, P., Pable, A. A., and Barvkar, V. T., “SARS-CoV 2, the pandemic coronavirus: molecular and structural insights,Journal of Basic Microbiology, vol. 61, no. 3, pp. 180202, 2021.CrossRefGoogle ScholarPubMed
Oliveira, B. A., Oliveira, L. C. D., Sabino, E. C., Okay, T. S., “SARS-CoV 2 and the COVID-19 disease: a mini review on diagnostic methods,Revista do Instituto de Medicina Tropical de São Paulo, vol. 62, 2020.CrossRefGoogle ScholarPubMed
Inzaule, S. C., Tessema, S. K., Kebede, Y., Ogwell Ouma, A. E., and Nkengasong, J. N., “Genomic-informed pathogen surveillance in Africa: opportunities and challenges,The Lancet Infectious Diseases, vol. 21, no. 9, pp. e281e289, 2021.CrossRefGoogle ScholarPubMed
Álvarez-Díaz, D. A., Laiton-Donato, K., Franco-Muñoz, C., and Mercado-Reyes, M., “Secuenciación del SARS-CoV 2: la iniciativa tecnológica para fortalecer los sistemas de alerta temprana ante emergencias de salud pública en Latinoamérica y el Caribe,Biomedica, vol. 40, pp. 188197, 2020.CrossRefGoogle Scholar
Laiton-Donato, K., Villabona-Arenas, C. J., Usme-Ciro, J. A. et al., “Genomic epidemiology of severe acute respiratory syndrome coronavirus 2, Colombia,Emerging Infectious Diseases, vol. 26, no. 12, pp. 28542862, 2020.CrossRefGoogle ScholarPubMed
Elbe, S., Buckland-Merrett, G., “Data, disease and diplomacy: GISAID’s innovative contribution to global health,Global Challenges, vol. 1, no. 1, pp. 3346, 2017.CrossRefGoogle ScholarPubMed
Wu, F., Zhao, S., Yu, B. et al., “A new coronavirus associated with human respiratory disease in China,Nature, vol. 579, no. 7798, pp. 265269, 2020.CrossRefGoogle ScholarPubMed
Naqvi, A. A. T., Fatima, K., Mohammad, T. et al., “Insights into SARS-CoV 2 genome, structure, evolution, pathogenesis and therapies: structural genomics approach,Biochimica et Biophysica Acta—Molecular Basis of Disease, vol. 1866, no. 10, Article ID 165878, 2020Google ScholarPubMed
Rahimi, A., Mirzazadeh, A., and Tavakolpour, S., “Genetics and genomics of SARS-CoV 2: a review of the literature with the special focus on genetic diversity and SARS-CoV 2 genome detection,Genomics, vol. 113, no. 1, pp. 12211232, 2021.CrossRefGoogle ScholarPubMed
Makoni, M., “South Africa responds to new SARS-CoV 2 variant,The Lancet, vol. 397, no. 10271, 2021.CrossRefGoogle ScholarPubMed
Wise, J., “Covid-19: new coronavirus variant is identified in UK,BMJ, vol. 371, Article ID m4857, 2020.Google ScholarPubMed
Voloch, C. M., da Silva Francisco, R., de Almeida, L. G. P., et al., “Genomic characterization of a novel SARS-CoV 2 lineage from rio de Janeiro, Brazil,Journal of Virology, vol. 95, no. 10, 2021.CrossRefGoogle ScholarPubMed
Rehman, S. U., Shafique, L., Ihsan, A., and Liu, Q., “Evolutionary trajectory for the emergence of novel coronavirus SARS-CoV 2,Pathogens, vol. 9, no. 3, p. 240, 2020.CrossRefGoogle ScholarPubMed
Volz, E., O’Toole, Á., Bull, M. et al., “Evaluating the effects of SARS-CoV 2 spike mutation D614G on transmissibility and pathogenicity,Cell, vol. 184, 2021.CrossRefGoogle ScholarPubMed
Becerra-Flores, M. and Cardozo, T., “SARS-CoV 2 viral spike G614 mutation exhibits higher case fatality rate,International Journal of Clinical Practice, vol. 74, no. 8, 2020.CrossRefGoogle ScholarPubMed
Kai, H. and Kai, M., “Interactions of coronaviruses with ACE2, angiotensin II, and RAS inhibitors-lessons from available evidence and insights into COVID-19,Hypertension Research, vol. 43, no. 7, pp. 648654, 2020.CrossRefGoogle ScholarPubMed
Ragia, G. and Manolopoulos, V. G., “Assessing COVID-19 susceptibility through analysis of the genetic and epigenetic diversity of ACE2-mediated SARS-CoV 2 entry,Pharmacogenomics, vol. 21, no. 18, pp. 13111329, 2020.CrossRefGoogle ScholarPubMed
Devaux, C. A., Rolain, J.-M., and Raoult, D., “ACE2 receptor polymorphism: susceptibility to SARS-CoV 2, hypertension, multi-organ failure, and COVID-19 disease outcome,Journal of Microbiology, Immunology, and Infection, vol. 53, no. 3, pp. 425435, 2020.CrossRefGoogle ScholarPubMed
Hou, Y., Zhao, J., Martin, W. et al., “New insights into genetic susceptibility of COVID-19: an ACE2 and TMPRSS2 polymorphism analysis,BMC Medicine, vol. 18, no. 1, p. 216, 2020.CrossRefGoogle ScholarPubMed
Dong, M., Zhang, J., Ma, X. et al., “ACE2, TMPRSS2 distribution and extrapulmonary organ injury in patients with COVID-19,Biomedicine & Pharmacotherapy, vol. 131, Article ID 110678, 2020.CrossRefGoogle ScholarPubMed
Centers for Disease Control and Prevention—US-CDC, “SARS-CoV 2 variant classifications and definitions,2021, https://www.cdc.gov/coronavirus/2019-ncov/variants/variant-info.html.Google Scholar
Domingo, E., “Mechanisms of viral emergence,Veterinary Research, vol. 41, no. 6, p. 38, 2010.CrossRefGoogle ScholarPubMed
Choudhary, S., Sreenivasulu, K., Mitra, P., Misra, S., and Sharma, P., “Role of genetic variants and gene expression in the susceptibility and severity of COVID-19,Annals of Laboratory Medicine, vol. 41, no. 2, pp. 129138, 2021.CrossRefGoogle ScholarPubMed
Forni, G. and Mantovani, A., “COVID-19 vaccines: where we stand and challenges ahead,Cell Death & Differentiation, vol. 28, no. 2, pp. 626639, 2021.CrossRefGoogle ScholarPubMed
Muñoz, N., “COVID-19 in latin America: a first glance to the mortality,Colombia Médica, vol. 51, pp. 13, 2020.Google ScholarPubMed
Rambaut, A., Holmes, E. C., O’Toole, Á. et al., “A dynamic nomenclature proposal for SARS-CoV 2 lineages to assist genomic epidemiology,Nature Microbiology, vol. 5, no. 11, pp. 14031407, 2020.CrossRefGoogle ScholarPubMed
Hadfield, J., Megill, C., Bell, S. M. et al., “Nextstrain: real-time tracking of pathogen evolution,Bioinformatics, vol. 34, no. 23, pp. 41214123, 2018.CrossRefGoogle ScholarPubMed
World Health Organization, “World health organization,2021, https://www.who.int/en/activities/tracking-SARS-CoV-2-variants/.Google Scholar
Pomplun, S., “Targeting the SARS-CoV 2-spike protein: from antibodies to miniproteins and peptides,RSC Medicinal Chemistry, vol. 12, no. 2, pp. 197202, 2021.CrossRefGoogle Scholar
Hu, J., He, C.-L., Gao, Q.-Z. et al., “D614G mutation of SARS-CoV 2 spike protein enhances viral infectivity,BioRxiv, 2020.CrossRefGoogle Scholar
Plante, J. A., Liu, Y., Liu, J. et al., “Spike mutation D614G alters SARS-CoV 2 fitness,Nature, vol. 592, no. 7852, pp. 116121, 2021.CrossRefGoogle ScholarPubMed
Zhou, Y., Vedantham, P., Lu, K. et al., “Protease inhibitors targeting coronavirus and filovirus entry,Antiviral Research, vol. 116, pp. 7684, 2015.CrossRefGoogle ScholarPubMed
Bhattacharyya, C., Das, C., Ghosh, A. et al., “SARS-CoV 2 mutation 614G creates an elastase cleavage site enhancing its spread in high AAT-deficient regions,Infection, Genetics and Evolution, vol. 90, Article ID 104760, 2021.CrossRefGoogle ScholarPubMed
Mullen, J. L., Tsueng, G., Abdel Latif, A. et al., “Outbreak.info,2021, https://outbreak.info/.Google Scholar
van Dorp, L., Acman, M., Richard, D. et al., “Emergence of genomic diversity and recurrent mutations in SARS-CoV 2,Infection, Genetics and Evolution, vol. 83, Article ID 104351, 2020.CrossRefGoogle ScholarPubMed
Roy, C., Mandal, S. M., Mondal, S. K. et al., “Trends of mutation accumulation across global SARS-CoV 2 genomes: implications for the evolution of the novel coronavirus,Genomics, vol. 112, no. 6, pp. 53315342, 2020.CrossRefGoogle ScholarPubMed
Yoshimoto, F. K., “The proteins of severe acute respiratory syndrome coronavirus-2 (SARS CoV 2 or n-COV 19), the cause of COVID-19,The Protein Journal, vol. 39, no. 3, pp. 198216, 2020.CrossRefGoogle ScholarPubMed
Zhao, Z., Li, H., Wu, X. et al., “Moderate mutation rate in the SARS coronavirus genome and its implications,BMC Evolutionary Biology, vol. 4, no. 1, p. 21, 2004.CrossRefGoogle ScholarPubMed
Yue, Y., Nabar, N. R., Shi, C.-S. et al., “SARS-coronavirus open reading frame-3a drives multimodal necrotic cell death,Cell Death & Disease, vol. 9, no. 9, p. 904, 2018.CrossRefGoogle ScholarPubMed
Kryazhimskiy, S. and Plotkin, J. B., “The population genetics of dN/dS,PLoS Genetics, vol. 4, no. 12, Article ID e1000304, 2008.CrossRefGoogle ScholarPubMed
Ferrareze, P. A. G., Franceschi, V. B., Mayer, A. D. M., Caldana, G. D., Zimerman, R. A., and Thompson, C. E., “E484K as an innovative phylogenetic event for viral evolution: genomic analysis of the E484K spike mutation in SARS-CoV 2 lineages from Brazil,Infection, Genetics and Evolution, vol. 93, Article ID 104941, 2021.CrossRefGoogle ScholarPubMed
Jangra, S., “The E484K mutation in the SARS-CoV 2 spike protein reduces but does not abolish neutralizing activity of human convalescent and post-vaccination sera,medRxiv, 2021.CrossRefGoogle Scholar
Wise, J., “Covid-19: the E484K mutation and the risks it poses,BMJ, vol. 372, 2021.Google ScholarPubMed
Khan, A., Zia, T., Suleman, M. et al., “Higher infectivity of the SARS-CoV 2 new variants is associated with K417N/T, E484K, and N501Y mutants: an insight from structural data,Journal of Cellular Physiology, vol. 236, no. 10, 2021.CrossRefGoogle ScholarPubMed
Liu, Y., Liu, J., Plante, K. S. et al., “The N501Y spike substitution enhances SARS-CoV 2 transmission,” BioRxiv, 2021.CrossRefGoogle Scholar
Faria, N., Claro, I., Candido, D., Moyses Franco, L., Andrade, P., and Coletti, T., “Genomic characterisation of an emergent SARS-CoV 2 lineage in manaus: preliminary findings,2021, https://virological.org/t/genomic-characterisation-of-an-emergent-sars-cov-2-lineage-in-manaus-preliminary-findings/586.Google Scholar
Garcia-Beltran, W. F., Lam, E. C., Denis, K. S. et al., “Multiple SARS-CoV 2 variants escape neutralization by vaccine-induced humoral immunity,” medRxiv, 2021.CrossRefGoogle Scholar
Moutinho, S., “Chinese COVID-19 vaccine maintains protection in variant-plagued Brazil,Science, vol. 4, 2021.Google Scholar
Robertson, S., “Lambda lineage of SARS-CoV 2 has potential to become variant of concern,2021, https://www.news-medical.net/news/20210627/Lambda-lineage-of-SARS-CoV-2-has-potential-to-become-variant-of-concern.aspx.Google Scholar
van Homrigh, M., “Covid 19 coronavirus: ultra-contagious lambda variant detected in Australia,2021, https://www.nzherald.co.nz/world/covid-19-coronavirus-ultra-contagious-lambda-variant-detected-in-australia/LAUYIDU2L7SSWMX6IQDTOYCX6A/.Google Scholar
Laiton-Donato, K., Franco-Muñoz, C., Álvarez-Díaz, D. A. et al., “Characterization of the emerging B.1.621 variant of interest of SARS-CoV 2,Infection, Genetics and Evolution, vol. 95, Article ID 105038, 2021.CrossRefGoogle ScholarPubMed
Wild, P., “Gutenberg COVID-19 study,2021, https://www.unimedizin-mainz.de/gcs/overview.html?L=1.Google Scholar
Irons, N. J. and Raftery, A. E., “Estimating SARS-CoV 2 infections from deaths, confirmed cases, tests, and random surveys,Proceedings of the National Academy of Sciences, vol. 118, no. 31, 2021.CrossRefGoogle ScholarPubMed
Figure 0

Table 1 Nomenclature of SARS-CoV-2 variants and noticeable pathological characteristics.

Figure 1

Figure 1 Cross reference of variants of concern (VOC) and (VOI) variants of interests (Y-axis) and the occurrence and frequency of point mutations (X axis). Mutations and frequency were measured with >90% prevalence of that specific mutation in at least one lineage. Text in red represent variants/mutations of concern. Yellow: variants/mutations of interest. White text: variants/mutations under surveillance or observation. Graphic modified from outbreak.info website [46] and customized to show only the VOC and VOI currently spreading in South America. By the time of manuscript writing, a few cases of Omicron were present in South America but reached over 10% in South Africa and over 3% in the fourth wave of infections in Europe in the last month. Source: outbreak.info/GISAID.

Figure 2

Table 2 Distribution and frequencies of VOC and VOI in continental South America.