Comprehensive genotyping of the USA national maize inbred seed bank - PubMed (original) (raw)

doi: 10.1186/gb-2013-14-6-r55.

Mark J Millard, Jeffrey C Glaubitz, Jason A Peiffer, Kelly L Swarts, Terry M Casstevens, Robert J Elshire, Charlotte B Acharya, Sharon E Mitchell, Sherry A Flint-Garcia, Michael D McMullen, James B Holland, Edward S Buckler, Candice A Gardner

Comprehensive genotyping of the USA national maize inbred seed bank

Maria C Romay et al. Genome Biol. 2013.

Abstract

Background: Genotyping by sequencing, a new low-cost, high-throughput sequencing technology was used to genotype 2,815 maize inbred accessions, preserved mostly at the National Plant Germplasm System in the USA. The collection includes inbred lines from breeding programs all over the world.

Results: The method produced 681,257 single-nucleotide polymorphism (SNP) markers distributed across the entire genome, with the ability to detect rare alleles at high confidence levels. More than half of the SNPs in the collection are rare. Although most rare alleles have been incorporated into public temperate breeding programs, only a modest amount of the available diversity is present in the commercial germplasm. Analysis of genetic distances shows population stratification, including a small number of large clusters centered on key lines. Nevertheless, an average fixation index of 0.06 indicates moderate differentiation between the three major maize subpopulations. Linkage disequilibrium (LD) decays very rapidly, but the extent of LD is highly dependent on the particular group of germplasm and region of the genome. The utility of these data for performing genome-wide association studies was tested with two simply inherited traits and one complex trait. We identified trait associations at SNPs very close to known candidate genes for kernel color, sweet corn, and flowering time; however, results suggest that more SNPs are needed to better explore the genetic architecture of complex traits.

Conclusions: The genotypic information described here allows this publicly available panel to be exploited by researchers facing the challenges of sustainable agriculture through better knowledge of the nature of genetic diversity.

PubMed Disclaimer

Figures

Figure 1

Figure 1

distribution of single-nucleotide polymorphisms (SNPs) across the genome. Distribution of the number of SNPs found in 1 Mb windows across the 10 maize chromosomes. Centromere positions are shown in black.

Figure 2

Figure 2

Identical by state (IBS) distribution across GBS samples. Distribution of IBS values across (A) the 2,815 accessions and (B) for accessions with multiple samples.

Figure 3

Figure 3

B73 network diagram. Network relationships of maize inbred lines with values of IBS greater than 0.97 for B73.

Figure 4

Figure 4

Multidimensional scanning for 2,815 maize inbred lines. Genetic relationships between the maize inbred lines preserved at the NCRPIS germplasm bank visualized using a principal coordinate analysis of the distances matrix. The × and Y axes represent PCo1 and PCo2 respectively. Colors are assigned based on (A) population structure or (B) breeding program. Inbred lines obtained directly from landraces without selection are highlighted in red to serve as reference.

Figure 5

Figure 5

Minor allele frequency (MAF) distribution and percentage of single-nucleotide polymorphisms (SNPs) shared between maize subpopulations. Histogram of MAF distribution over all groups, and cumulative percentage of SNPs shared between different groups of germplasm for each class of MAF. Columns represent the percentage of SNPs in each MAF category; lines represent the percentage of alleles shared between the groups of germplasm at equal or lesser MAF value.

Figure 6

Figure 6

Expired Plant Variety Protection (ExPVP) network diagram and distribution of segregating single-nucleotide polymorphism (SNPs). (A) Network of relationships for the ExPVP inbreds constructed using identical by state (IBS) values greater than 0.9. Each dot (inbred line) has a different color assigned based on the company where it was developed. (B) Distribution of the segregating SNPs between the three heterotic groups that form the three main clusters in the network graph.

Figure 7

Figure 7

Genome-wide pairwise relationships between different genetic diversity measurements. Relationships between nested association mapping (NAM) recombination rate (log10 cM/Mb), average haplotype length (bp), average LD (_r_2), and fixation indexes (Fst) between stiff stalk, non-stiff stalk, and tropical lines at the NAM genetic map bin scale. The numbers indicate the coefficient of determination (_r_2) calculated using Spearman's rank correlation. LD, linkage disequilibrium.

Figure 8

Figure 8

Decline of genome-wide linkage disequilibrium (LD) across all maize inbreds. Mean LD decay measured as pairwise _r_2 between all single-nucleotide polymorphisms in the collection. The red line represents the average value while the darker gray area represents the 50% range of values and light gray 90%.

Figure 9

Figure 9

Genome-wide association study (GWAS) for yellow versus white kernels. GWAS for kernel color on 1,595 maize inbred lines with yellow or white kernels.

Figure 10

Figure 10

Genome-wide association study (GWAS) for sweet versus starchy corn. GWAS for kernel color on 2,145 maize inbred lines with sweet or starchy kernels. SNP, single-nucleotide polymorphism.

Figure 11

Figure 11

Genome-wide association study (GWAS) for growing degree days to silking. GWAS for growing degree days to 50% silking on 2,279 maize inbred lines. NAM, nested association mapping; QTL, quantitative trait loci.

Similar articles

Cited by

References

    1. FAOSTAT. http://faostat.fao.org
    1. Fu H, Dooner HK. Intraspecific violation of genetic colinearity and its implications in maize. Proc Natl Acad Sci USA. 2002;99:9573–9578. doi: 10.1073/pnas.132259199. - DOI - PMC - PubMed
    1. Morgante M, Brunner S, Pea G, Fengler K, Zuccolo A, Rafalski A. Gene duplication and exon shuffling by helitron-like transposons generate intraspecies diversity in maize. Nat Genet. 2005;37:997–1002. doi: 10.1038/ng1615. - DOI - PubMed
    1. Chia J-M, Song C, Bradbury PJ, Costich D, De Leon N, Doebley J, Elshire RJ, Gaut B, Geller L, Glaubitz JC, Gore M, Guill KE, Holland J, Hufford MB, Lai J, Li M, Liu X, Lu Y, McCombie R, Nelson R, Poland J, Prasanna BM, Pyhäjärvi T, Rong T, Sekhon RS, Sun Q, Tenaillon MI, Tian F, Wang J, Xu X. et al.Maize HapMap2 identifies extant variation from a genome in flux. Nat Genet. 2012;44:803–807. doi: 10.1038/ng.2313. - DOI - PubMed
    1. Shull GH. The composition of a field of maize. American Breeders Association Report. 1908;4:296–301.

MeSH terms

Substances

LinkOut - more resources