1,283
Views
7
CrossRef citations to date
0
Altmetric
Letter to the Editor

Genetic polymorphism of 29 STR loci in the Hunan Han population from China

, , , , , , & show all

Dear Editor,

The Han population accounts for the majority of the Chinese population. Hunan is a large province located at 108°47′–114°15′ E and 30°08′–24°38′ N in Southern China, specifically in the area in the middle reaches of Yangtze River. The province has a total area of 218 400 km2 and a population of 72 million. The Han population of the province is approximately 59.15 million [Citation1]. In this study, we evaluated the allele frequencies and forensic parameters of 29 short tandem repeat (STR) loci by using two commercial kits (Goldeneye™ 20A kit and Investigator HDplex kit). Bloodstain specimens were obtained from 501 unrelated healthy individuals (with informed consent) from the Han population of the Hunan Province, China. The Goldeneye™ 20A kit (PeopleSpot Inc., Beijing, China) includes the amelogenin locus and 19 autosomal loci (i.e. D19S433, D5S818, D21S11, D18S51, D6S1043, D3S1358, D13S317, D7S820, D16S539, CSF1PO, Penta D, vWA, D8S1179, TPOX, Penta E, TH01, D12S391, D2S1338, and FGA). The Investigator HDplex kit (Qiagen, Hilden, Germany) contains the amelogenin locus, one CODIS (D18S51), and 11 Non-CODIS loci (i.e. D2S1360, D3S1744, D4S2366, D5S2500, SE33, D6S474, D7S1517, D8S1132, D10S2325, D12S391 and D21S2055). Both kits contain D18S51 and D12S391, which can be used for sample concordance.

Genomic DNA was extracted from the bloodstain samples by using the Chelex-100 protocol according to the description by Walsh et al. [Citation2]. DNA concentration was diluted to 1.0 ng/µL by adding high-purity water. Goldeneye™ 20A PCR kit was used following the manufacturer's guidelines [Citation1]. Qiagen Investigator HDplex kit was used following the manufacturer's guidelines but reduce the volume to 10 µL [Citation4]. PCR amplification was performed on a GeneAmp® PCR 9700 thermal cycler (Thermo Fisher Scientific, Waltham, MA, USA). Capillary electrophoresis was performed on an AB 3130XL Genetic Analyzer (Thermo Fisher Scientific). 9948a and nuclease-free water were amplified as positive and negative controls, respectively. DNA profiles were analysed with GeneMapper ID 3.2 (Thermo Fisher Scientific).

Genotype frequencies at each locus, power of discrimination (PD), power of exclusion (PE), probability of matching (PM), polymorphism information content (PIC), observed heterozygosity (Ho) and Hardy–Weinberg equilibrium (HWE) were evaluated with modified PowerStat (version 1.2) program [Citation5]. Linkage disequilibrium (LD) analysis was performed using Genepop version 4.2 software package (http://genepop.curtin.edu.au). The allele frequency distributions among groups were compared using Arlequin version 3.5 software package [Citation6]. Pairwise genetic distance among populations was calculated according to Nei's formula by using Phylogeny Inference Package (Phylip) version 3.69 [Citation7]. Neighbour-Joining (NJ) phylogenetic tree was constructed and viewed with TreeView softwar [Citation8].The current study was approved by the Third Xiangya Hospital of Central South University (approval code: 2016-S041).

The allele frequencies and population genetics parameters of the 29 STR loci are summarized in Table S1. We detected 403 alleles across the 29 STR loci, with the largest number of 50 variants at the SE33 locus and the least number of 6 alleles at the TPOX locus. Genotyping results should be combined with the standard DNA allelic ladder to avoid genotyping errors because of the complex internal sequence structure of the SE33 locus [Citation9]. The P-value for Hardy–Weinberg testing did not significantly deviate from HWE after Bonferroni correction (P = 0.05/29 = 0.0017), only D21S2055 (P = 0.000 2) and D7S1517 (P = 0.0012) had minor departures from HWE because of errors in random sampling. Among the 29 STR loci, the SE33 locus was the most informative marker, with a PD value of 0.9930 and a PIC value of 0.9500, whereas the TPOX locus presented the lowest forensic efficiency, with a PD value of 0.7590 and a PIC value of 0.5200. The Ho ranged from 0.5930 (TPOX) to 0.9240 (Penta E). The combined PM and combined PE for the 29 STR loci were 1.34 × 10−36 and 0.999 999 999 998, respectively. Pairwise LD tests demonstrated that only 18 pairs (Table S2) of the loci remained in LD for 406 pairwise comparisons after Bonferroni correction (P = 0.05/406 = 0.00012). These significant pairs in LD may be attributed to random sampling errors because the pairs are located on different autosomal chromosomes or chromosome arms.

Table S3 shows the locus-by-locus population differences (P-value) in the Investigator HDplex kit between the Hunan Han population and previously published data of other ethnic populations at the same loci. Evident differences (P < 0.05) were found between the Hunan Han and the Dutch [Citation10], North Italian [Citation11], Somali [Citation12], Lithuanian (except D3S1744, P = 0.0723) [Citation13] and Swedish (except D3S1744, P = 0.1267 and D8S1132, P = 0.2776) [Citation12] populations. Significant differences were also observed from the Hunan Han population at D2S1360, D3S1744, D10S2325, SE33 and D12S391 with the Hebei Han [Citation4]; at D4S2366, D8S1132, SE33 and D12S391 with the Sichuan Han [Citation14]; and at D3S1744, D10S2325, SE33 and D12S391 with the Shanghai Han [Citation15]. Table S4 shows the genetic distances between Hunan Han and other populations. Figure S1 presents the phylogenetic tree constructed by comparing the allelic frequencies of 11 STR loci in the Investigator HDplex kit and determines the genetic relationships of the Hunan Han population to the other populations. The minimum genetic distance and close genetic relationship were found between the Hunan Han and Shanghai Han (0.0059) populations. The largest distance was observed between the Hunan Han and Somali (0.1752) populations.

Our results for 19 STR loci in the Goldeneye™ 20A kit compared with other populations are also summarized. Table S5 shows that the distribution of alleles for the Hunan Han versus Suzhou Han [Citation16], Henan Han [Citation17], Yunnan Bai [Citation18], Liaoning Manchu [Citation19] and Philippine [Citation20] populations indicated statistically significant differences at 8, 7, 14, 15 and 16 STR loci, respectively. No significant differences were found between the Hunan Han with the Guangdong Han [Citation3] (except D7S820, Penta E and TH01), Huai'an Han [Citation16] (except Penta E), Yangzhou Han [Citation16] (except D19S433, D3S1358 and TH01) and Taizhou Han [Citation16] (except D16S539, Penta D and TH01) populations. Table S6 and Figure S2 present the genetic distances and phylogenetic tree between the Hunan Han and other populations, respectively. The minimum genetic distance and close genetic relationship were detected between the Hunan Han and Guangdong Han (0.0040) populations. The largest distance was found between the Hunan Han and Liaoning Manchu (0.0618) populations.

In summary, data suggested that the use of 29 STR loci could provide highly informative polymorphic data for forensic identification and establish a database for the Hunan Han group in China. The data also demonstrated that the distribution of allele frequencies varied in different areas or even in the same area for different subpopulations. Close genetic distance and genetic relationship were noted among regions separated by small distances because of their geographic proximity and high gene mobility.

This paper strictly follows all the guidelines for publication of population data as required by this target journal.

Supplemental material

1306430_suppl.zip

Download Zip (2.3 MB)

1306430_suppl.zip

Download Zip (1.6 MB)

Additional information

Funding

Graduate Student Education Innovation Projects of Central South University

References

  • Population Census Office under the State Council Department of Population. Employment statistics national bureau of statistics, tabulation of the 2010 population census of the People's Republic of China. Beijing: China Statistics Press; 2012.
  • Walsh PS, Metzger DA, Higuchi R. Chelex 100 as a medium for simple extraction of DNA for PCR-based typing from forensic material. Biotechniques. 1991;10:506–513.
  • Tong D, Chen Y, Ou X, et al. Polymorphism analysis and evaluation of 19 STR loci in the Han population of Southern China. Ann Hum Biol. 2013;40:191–196.
  • Lei L, Xu J, Du Q, et al. Genetic polymorphism of the 26 short tandem repeat loci in the Chinese Hebei Han population using two commercial forensic kits. Mol Biol Rep. 2015;42:217–225.
  • Zhao F, Wu XY, Cai GQ, et al. The application of modified-powerstates software in the forensic biostatistics. Chin J Forensic Med. 2003;18:297–312.
  • Excoffier L, Lischer HE. Arlequin ver 3.5: a new series of programs to performed population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10:564–567.
  • Felsenstein JP. PHYLIP (phylogeny inference package) version 3.69. Seattle (WA): Department of Genome Sciences, University of Washington; 2009.
  • Page RD. TreeView: an application to display phylogenetic trees on personal computers. Comput Appl Biosci. 1996;12:357–358.
  • Hering S, Edelmann J, Dressler J. Sequence variations in the primer binding regions of the highly polymorphic STR system SE33. Int J Legal Med. 2002;116:365–367.
  • Westen AA, Haned H, Grol LJ, et al. Combining results of forensic STR kits: HDplex validation including allelic association and linkage testing with NGM and Identifiler loci. Int J Legal Med. 2012;126:781–789.
  • Turrina S, Ferrian M, Caratti S, et al. Investigator HDplex markers: allele frequencies and mutational events in a North Italian population. Int J Legal Med. 2015;129:731–733.
  • Tillmar AO, Nilsson H, Kling D, et al. Analysis of Investigator HDplex markers in Swedish and Somali populations. Forensic Sci Int Genet. 2013;7:e21–e22.
  • Pepinski W, Niemcunowicz-Janica A, Skawronska M, et al. Polymorphism of 11 non-CODIS STRs in a population sample of Lithuanian minority residing in northeastern Poland. Forensic Sci Int Genet. 2011;5:e37.
  • Chen PY, Zhang J, Luo HB, et al. Substitution mutation induced migration anomaly of a D10S2325 allele on capillary electrophoresis. Int J Legal Med. 2013;127:363–368.
  • Zhang S, Zhang Z, Zhao S, et al. Genetic polymorphisms in 12 autosomal STR in a Shanghai Han population from China. Electrophoresis. 2013;34:613–617.
  • Yin C, Ji Q, Li K, et al. Analysis of 19 STR loci reveals genetic characteristic of eastern Chinese Han population. Forensic Sci Int Genet. 2015;14:108–109.
  • Wang H, Wu D, Feng Z, et al. Genetic polymorphisms of 20 short tandem repeat loci from the Han population in Henan, China. Electrophoresis. 2014;35:1509–1514.
  • Li Y, Hong Y, Li X, et al. Allele frequency of 19 autosomal STR loci in the Bai population from the southwestern region of Mainland China. Electrophoresis. 2015;36:2498–2503.
  • Liu J, Guo L, Qi R, et al. Allele frequencies of 19 autosomal STR loci in Manchu population of China with phylogenetic structure among worldwide populations. Gene. 2013;529:282–287.
  • Rodriguez JJ, Salvador JM, Calacal GC, et al. Allele frequencies of 23 autosomal short tandem repeat loci in the Philippine population. Leg Med (Tokyo). 2015;17:295–297.