Genome-wide scan for seed composition provides insights into soybean quality improvement and the impacts of domestication and breeding (original) (raw)
Related papers
A genome-wide association study of seed protein and oil content in soybean
BMC Genomics, 2014
Background: Association analysis is an alternative to conventional family-based methods to detect the location of gene(s) or quantitative trait loci (QTL) and provides relatively high resolution in terms of defining the genome position of a gene or QTL. Seed protein and oil concentration are quantitative traits which are determined by the interaction among many genes with small to moderate genetic effects and their interaction with the environment. In this study, a genome-wide association study (GWAS) was performed to identify quantitative trait loci (QTL) controlling seed protein and oil concentration in 298 soybean germplasm accessions exhibiting a wide range of seed protein and oil content. Results: A total of 55,159 single nucleotide polymorphisms (SNPs) were genotyped using various methods including Illumina Infinium and GoldenGate assays and 31,954 markers with minor allele frequency >0.10 were used to estimate linkage disequilibrium (LD) in heterochromatic and euchromatic regions. In euchromatic regions, the mean LD (r 2 ) rapidly declined to 0.2 within 360 Kbp, whereas the mean LD declined to 0.2 at 9,600 Kbp in heterochromatic regions. The GWAS results identified 40 SNPs in 17 different genomic regions significantly associated with seed protein. Of these, the five SNPs with the highest associations and seven adjacent SNPs were located in the 27.6-30.0 Mbp region of Gm20. A major seed protein QTL has been previously mapped to the same location and potential candidate genes have recently been identified in this region. The GWAS results also detected 25 SNPs in 13 different genomic regions associated with seed oil. Of these markers, seven SNPs had a significant association with both protein and oil.
Plants
Soybean (Glycine max L.) is a globally important source of plant proteins, oils, and amino acids for both humans and livestock. Wild soybean (Glycine soja Sieb. and Zucc.), the ancestor of cultivated soybean, could be a useful genetic source for increasing these components in soybean crops. In this study, 96,432 single-nucleotide polymorphisms (SNPs) across 203 wild soybean accessions from the 180K Axiom® Soya SNP array were investigated using an association analysis. Protein and oil content exhibited a highly significant negative correlation, while the 17 amino acids exhibited a highly significant positive correlation with each other. A genome-wide association study (GWAS) was conducted on the protein, oil, and amino acid content using the 203 wild soybean accessions. A total of 44 significant SNPs were associated with protein, oil, and amino acid content. Glyma.11g015500 and Glyma.20g050300, which contained SNPs detected from the GWAS, were selected as novel candidate genes for th...
Genome Wide Association Study and Genomic Selection of Amino Acid Concentrations in Soybean Seeds
Frontiers in Plant Science
Soybean is a major source of protein for human consumption and animal feed. Releasing new cultivars with high nutritional value is one of the major goals in soybean breeding. To achieve this goal, genome-wide association studies of seed amino acid contents were conducted based on 249 soybean accessions from China, US, Japan, and South Korea. The accessions were evaluated for 15 amino acids and genotyped by sequencing. Significant genetic variation was observed for amino acids among the accessions. Among the 231 single nucleotide polymorphisms (SNPs) significantly associated with variations in amino acid contents, fifteen SNPs localized near 14 candidate genes involving in amino acid metabolism. The amino acids were classified into two groups with five in one group and seven amino acids in the other. Correlation coefficients among the amino acids within each group were high and positive, but the correlation coefficients of amino acids between the two groups were negative. Twenty-five SNP markers associated with multiple amino acids can be used to simultaneously improve multi-amino acid concentration in soybean. Genomic selection analysis of amino acid concentration showed that selection efficiency of amino acids based on the markers significantly associated with all 15 amino acids was higher than that based on random markers or markers only associated with individual amino acid. The identified markers could facilitate selection of soybean varieties with improved seed quality.
Journal of Crop Science and Biotechnology, 2017
Soybean [Glycine max (L.) Merrill] yield and seed fatty acids, protein, and oil content are important traits for which an improved understanding of significant genomic regions would be useful. To accomplish this, a soybean population consisting of 203 F 5 derived recombinant inbred lines (RILs) was developed and genotyped with 11,633 polymorphic single nucleotide polymorphisms (SNPs). Each RIL was grown in a single plot at Knoxville, TN in 2010; followed by replicated, multi-location field trials in 2013 and 2014. The data from 2010, 2013, and 2014 were analyzed together in order to detect quantitative trait loci (QTL) for these traits, and 30 total QTLs were detected. Five QTLs are candidates for confirmed status and one QTL is a candidate for positional confirmation. Many of the genes with mutations in close proximity to the fatty acid QTLs are involved in biological processes for fatty acids and/or lipids and could be considered possible candidate genes. Similarly, genes with mutations in genomic regions near yield, protein, and oil QTLs were plentiful and may contribute to the variation observed in these traits. Except for yield and stearic acid, each trait displayed pleiotropic effects with other traits in this study. Notable are the pleiotropic effects for oleic and linolenic acid on chromosomes 9, 13, and 19. Overall, the findings from this research contribute new information to the genetic understanding of soybean yield and seed fatty acids, protein and oil content. This understanding will be useful in making trait improvements.
Identification of quantitative trait loci controlling soybean seed protein and oil content
PLOS ONE, 2023
Soybean is a major source of seed protein and oil globally with an average composition of 40% protein and 20% oil in the seed. The goal of this study was to identify quantitative trait loci (QTL) conferring seed protein and oil content utilizing a population constructed by crossing an above average protein content line, PI 399084 to another line that had a low protein content value, PI 507429, both from the USDA soybean germplasm collection. The recombinant inbred line (RIL) population, PI 507429 x PI 399084, was evaluated in two replications over four years (2018–2021); the seeds were analyzed for seed protein and oil content using near-infrared reflectance spectroscopy. The recombinant inbred lines and the two parents were re-sequenced using genotyping by sequencing. A total of 12,761 molecular markers, which came from genotyping by sequencing, the SoySNP6k BeadChip and selected simple sequence repeat (SSR) markers from known protein QTL chromosomal regions were used for mapping. ...
Plant biotechnology journal, 2018
The cultivated [Glycine max (L) Merr.] and wild [Glycine soja Siebold & Zucc.] soybean species comprise wide variation in seed composition traits. Compared to wild soybean, cultivated soybean contains low protein, high oil, and high sucrose. In this study, an interspecific population was derived from a cross between G. max (Williams 82) and G. soja (PI 483460B). This recombinant inbred line (RIL) population of 188 lines was sequenced at 0.3× depth. Based on 91 342 single nucleotide polymorphisms (SNPs), recombination events in RILs were defined, and a high-resolution bin map was developed (4070 bins). In addition to bin mapping, quantitative trait loci (QTL) analysis for protein, oil, and sucrose was performed using 3343 polymorphic SNPs (3K-SNP), derived from Illumina Infinium BeadChip sequencing platform. The QTL regions from both platforms were compared, and a significant concordance was observed between bin and 3K-SNP markers. Importantly, the bin map derived from next-generatio...
Frontiers in Plant Science
Soybean (Glycine max [L.] Merr.) is one of the most significant crops in the world in terms of oil and protein. Owing to the rising demand for soybean products, there is an increasing need for improved varieties for more productive farming. However, complex correlation patterns among quantitative traits along with genetic interactions pose a challenge for soybean breeding. Association studies play an important role in the identification of accession with useful alleles by locating genomic sites associated with the phenotype in germplasm collections. In the present study, a genome-wide association study was carried out for seven agronomic and yield-related traits. A field experiment was conducted in 2015/2016 at two locations that include 155 diverse soybean germplasm. These germplasms were genotyped using SoySNP50K Illumina Infinium Bead-Chip. A total of 51 markers were identified for node number, plant height, pods per plant, seeds per plant, seed weight per plant, hundred-grain we...
, oil, palmitic, stearic, oleic, linoleic, and linolenic acids percentage in soybean seeds. The objective of this study was to identify Quantitative Trait Loci (QTL) controlling protein, oil, and fatty acids content in a set of F 5:8 RILs derived from a cross between lines, 'MD 96-5722' and 'Spencer' using 5376 Single Nucleotide Polymorphism (SNP) markers from the Illumina Infinium SoySNP6K BeadChip array. QTL analysis used WinQTL Cart 2.5 software for composite interval mapping (CIM). Identified, were; one protein content QTL on linkage group (LG-) B2 or chromosome (Chr_) 14; 11 QTL associated with oil content on six linkage groups LG-N (Chr_3), LG-A1 (Chr_5), LG-K (Chr_9), LG-F (Chr_13), LG-B2 (Chr_14), and LG-J (Chr_16); and sixteen QTL for five major fatty acids (palmitic, stearic, oleic, linoleic, and linolenic acids) on LG-N (Chr_3), LG-F (Chr_13), LG-B2 (Chr_14), LG-E (Chr_15), LG-J (Chr_16), and LG-G (Chr_18). The SNP markers closely linked to the QTL reported here will be useful for development of cultivars with altered oil and fatty acid compositions in soybean breeding programs.
International Journal of Molecular Sciences
Soybean seeds consist of approximately 40% protein and 20% oil, making them one of the world’s most important cultivated legumes. However, the levels of these compounds are negatively correlated with each other and regulated by quantitative trait loci (QTL) that are controlled by several genes. In this study, a total of 190 F2 and 90 BC1F2 plants derived from a cross of Daepung (Glycine max) with GWS-1887 (G. soja, a source of high protein), were used for the QTL analysis of protein and oil content. In the F2:3 populations, the average protein and oil content was 45.52% and 11.59%, respectively. A QTL associated with protein levels was detected at Gm20_29512680 on chr. 20 with a likelihood of odds (LOD) of 9.57 and an R2 of 17.2%. A QTL associated with oil levels was also detected at Gm15_3621773 on chr. 15 (LOD: 5.80; R2: 12.2%). In the BC1F2:3 populations, the average protein and oil content was 44.25% and 12.14%, respectively. A QTL associated with both protein and oil content wa...