Skip to main content Accessibility help

Molecular marker information from de novo assembled transcriptomes of chilli pepper (Capsicum annuum L.) varieties based on next-generation sequencing technology

  • Yul-Kyun Ahn (a1), Swati Tripathi (a1), Young-Il Cho (a1), Jeong-Ho Kim (a1), Hye-Eun Lee (a1), Do-Sun Kim (a1) and Jong-Gyu Woo (a1)...


Next-generation sequencing technique has been known as a useful tool for de novo transcriptome assembly, functional annotation of genes and identification of molecular markers. This study was carried out to mine molecular markers from de novo assembled transcriptomes of four chilli pepper varieties, the highly pungent ‘Saengryeg 211’ and non-pungent ‘Saengryeg 213’ and variably pigmented ‘Mandarin’ and ‘Blackcluster’. Pyrosequencing of the complementary DNA library resulted in 361,671, 274,269, 279,221, and 316,357 raw reads, which were assembled in 23,607, 19,894, 18,340 and 20,357 contigs, for the four varieties, respectively. Detailed sequence variant analysis identified numerous potential single-nucleotide polymorphisms (SNPs) and simple sequence repeats (SSRs) for all the varieties for which the primers were designed. The transcriptome information and SNP/SSR markers generated in this study provide valuable resources for high-density molecular genetic mapping in chilli pepper and Quantitative trait loci analysis related to fruit qualities. These markers for pepper will be highly valuable for marker-assisted breeding and other genetic studies.


Corresponding author

* Corresponding author. E-mail:


Hide All
Ahn, YK, Tripathi, S, Kim, JH, Cho, YI, Lee, HE, Kim, DS, Woo, JG and Cho, MC (2014) Transcriptome analysis of Capsicum annuum varieties Mandarin and Blackcluster: assembly, annotation and molecular marker discovery. Gene 533: 494499.
Ashrafi, H, Hill, T, Stoffel, K, Kozik, A, Yao, JQ, Chin-Wo, SR and Deynze, AV (2012) De novo assembly of the pepper transcriptome (Capsicum annuum): a benchmark for in silico discovery of SNPs, SSRs and candidate genes. BMC Genomics 13: 571.
Deschamps, S, Llaca, V and May, GD (2012) Genotyping by sequencing in plants. Biology 1: 460483.
Hill, TA, Ashrafi, H, Chin-Wo, SR, Yao, J, Stoffel, K, Truco, MJ, Kozik, A, Michelmore, RW and Van Deynze, A (2013) Characterization of Capsicum annuum genetic diversity and population structure based on parallel polymorphism discovery with a 30 K unigene pepper GeneChip. PLoS One 8: e56200.
Hyten, D, Cannon, S, Song, Q, Weeks, N, Fickus, E, Shoemaker, R, Specht, J, Farmer, A, May, G and Cregan, P (2010) High-throughput SNP discovery through deep resequencing of a reduced representation library to anchor and orient scaffolds in the soybean whole genome sequence. BMC Genomics 11: 38.
Kim, HJ, Baek, KH, Lee, SW, Kim, JE, Lee, BW, Cho, HS, Kim, WT, Choi, D and Hur, CG (2008) Pepper EST database: comprehensive in silico tool for analyzing the chili pepper (Capsicum annuum) transcriptome. BMC Plant Biology 8: 101.
Liu, S, Li, W, Wu, Y, Chen, C and Lei, J (2013) De novo transcriptome assembly in chilli pepper (Capsicum frutescens) to identify genes involved in the biosynthesis of capsaicinoids. PLoS One 8: e48156.
Lu, FH, Cho, MC and Park, YJ (2012) Transcriptome profiling and molecular marker discovery in red pepper, Capsicum annuum L. TF68. Molecular Biology Reports 39: 33273335.
Novaes, E, Drost, DR, Farmerie, WG, Pappas, GJ Jr, Grattapaglia, D, Sederoff, RR and Kirst, M (2008) High-throughput gene and SNP discovery in Eucalyptus grandis, an uncharacterized genome. BMC Genomics 9: 312.
Sonah, H, Deshmukh, RK, Sharma, A, Singh, VP, Gupta, DK, Gacche, RN, Rana, JC, Singh, NK and Sharma, TR (2011) Genome-wide distribution and organization of microsatellites in plants: an insight into marker development in Brachypodium . PLoS One 6: e21298.
Spindel, J, Wright, M, Chen, C, Cobb, J, Gage, J, Harrington, S, Lorieux, M, Ahmadi, N and McCouch, S (2013) Bridging the genotyping gap: using genotyping by sequencing (GBS) to add high-density SNP markers and new value to traditional bi-parental mapping and breeding populations. Theoretical and Applied Genetics 126: 26992716.
Temnykh, S, DeClerck, G, Lukashova, A, Lipovich, L, Cartinhour, S and McCouch, S (2001) Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential. Genome Research 11: 14411452.
Van Tassell, CP, Smith, TP, Matukumalli, LK, Taylor, JF, Schnabel, RD, Lawley, CT, Haudenschild, CD, Moore, SS, Warren, WC and Sonstegard, TS (2008) SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries. Nature Methods 5: 247252.
Yang, T, Bao, SY, Ford, R, Jia, TJ, Guan, JP, He, YH, Sun, XL, Jiang, JY, Hao, JJ, Zhang, XY and Zong, XX (2012) High-throughput novel microsatellite marker of faba bean via next generation sequencing. BMC Genomics 13: 602.
Yu, JN, Won, C, Jun, J, Lim, Y and Kwak, M (2011) Fast and cost-effective mining of microsatellite markers using NGS technology: an example of a Korean water deer Hydropotes inermis argyropus . PLoS One 6: e26933.



Full text views

Total number of HTML views: 0
Total number of PDF views: 0 *
Loading metrics...

Abstract views

Total abstract views: 0 *
Loading metrics...

* Views captured on Cambridge Core between <date>. This data will be updated every 24 hours.

Usage data cannot currently be displayed