Xinbin Dai - Academia.edu (original) (raw)

Papers by Xinbin Dai

Research paper thumbnail of The CLE53–SUNN genetic pathway negatively regulates arbuscular mycorrhiza root colonization in Medicago truncatula

Journal of Experimental Botany, Apr 20, 2020

Plants and arbuscular mycorrhizal fungi (AMF) engage in mutually beneficial symbioses based on a ... more Plants and arbuscular mycorrhizal fungi (AMF) engage in mutually beneficial symbioses based on a reciprocal exchange of nutrients. The beneficial character of the symbiosis is maintained through a mechanism called autoregulation of mycorrhization (AOM). AOM includes root-to-shoot-to-root signaling; however, the molecular details of AOM are poorly understood. AOM shares many features of autoregulation of nodulation (AON) where several genes are known, including the receptor-like kinase SUPER NUMERIC NODULES (SUNN), root-to-shoot mobile CLAVATA3/ENDOSPERM SURROUNDING REGION (ESR)-RELATED (CLE) peptides, and the hydroxyproline O-arabinosyltransferase ROOT DETERMINED NODULATION1 (RDN1) required for post-translational peptide modification. In this work, CLE53 was identified to negatively regulate AMF symbiosis in a SUNN-and RDN1-dependent manner. CLE53 expression was repressed at low phosphorus, while it was induced by AMF colonization and high phosphorus. CLE53 overexpression reduced AMF colonization in a SUNN-and RDN1 dependent manner, while cle53, rdn1, and sunn mutants were more colonized than the wild type. RNA-sequencing identified 700 genes with SUNN-dependent regulation in AMFcolonized plants, providing a resource for future identification of additional AOM genes. Disruption of AOM genes in crops potentially constitutes a novel route for improving AMF-derived phosphorus uptake in agricultural systems with high phosphorus levels.

Research paper thumbnail of pssRNAMiner: a plant short small RNA regulatory cascade analysis server

Nucleic Acids Research, 2008

In plants, short RNAs including » 21-nt microRNA (miRNA) and 21-nt transacting siRNA (ta-siRNA) c... more In plants, short RNAs including » 21-nt microRNA (miRNA) and 21-nt transacting siRNA (ta-siRNA) compose a 'miRNA ! ta-siRNA ! target gene' cascade pathway that regulates gene expression at the posttranscriptional level. In this cascade, biogenesis of ta-siRNA clusters requires 21-nt intervals (i.e. phasing) and miRNA (phase-initiator) cleavage sites on its TAS transcript. Here, we report a novel web server, pssRNAMiner, which is developed to identify both the clusters of phased small RNAs as well as the potential phase-initiator. To detect phased small RNA clusters, the pssRNAMiner maps input small RNAs against user-specified transcript/ genomic sequences, and then identifies phased small RNA clusters by evaluating P-values of hypergeometric distribution. To identify potential phaseinitiators, pssRNAMiner aligns input phase-initiators with transcripts of TAS candidates using the Smith-Waterman algorithm. Potential cleavage sites on TAS candidates are further identified from complementary regions by weighting the alignment expectation and its distance to detected phased small RNA clusters. The pssRNAMiner web server is freely available at http://bioinfo3.noble.org/pssRNAMiner/.

Research paper thumbnail of Genome-wide analysis of phenylpropanoid defence pathways

Molecular Plant Pathology, Jul 30, 2010

Phenylpropanoids can function as preformed and inducible antimicrobial compounds, as well as sign... more Phenylpropanoids can function as preformed and inducible antimicrobial compounds, as well as signal molecules, in plantmicrobe interactions. Since we last reviewed the field 8 years ago, there has been a huge increase in our understanding of the genes of phenylpropanoid biosynthesis and their regulation, brought about largely by advances in genome technology, from whole-genome sequencing to massively parallel gene expression profiling. Here, we present an overview of the biosynthesis and roles of phenylpropanoids in plant defence, together with an analysis of confirmed and predicted phenylpropanoid pathway genes in the sequenced genomes of 11 plant species. Examples are provided of phylogenetic and expression clustering analyses, and the large body of underlying genomic data is provided through a website accessible from the article.m pp_648 829..846 FUNCTIONS OF PHENYLPROPANOID COMPOUNDS IN PLANT DEFENCE Lignin and lignans Lignin and lignans share monolignols as common precursors (Figs 1 and 2). Lignification makes the cell wall more resistant to mechanical pressure applied during fungal penetration (Bechinger et al., 1999). A lignified cell wall is water resistant and thus less accessible to cell wall-degrading enzymes from plant pathogens. Natural lignin contains three major units: p-hydroxyphenyl (H), guaiacyl (G) and syringyl (S). The structural analysis of lignins induced by elicitor from Rhizosphaera kalkhoffii in spruce revealed that the defence lignins are significantly different from

Research paper thumbnail of TrichOME: A Comparative Omics Database for Plant Trichomes 1(C)(W)(OA)

Plant secretory trichomes have a unique capacity for chemical synthesis and secretion and have be... more Plant secretory trichomes have a unique capacity for chemical synthesis and secretion and have been described as biofactories for the production of natural products. However, until recently, most trichome-specific metabolic pathways and genes involved in various trichome developmental stages have remained unknown. Furthermore, only a very limited amount of plant trichome genomics information is available in scattered databases. We present an integrated "omics" database, TrichOME, to facilitate the study of plant trichomes. The database hosts a large volume of functional omics data, including expressed sequence tag/unigene sequences, microarray hybridizations from both trichome and control tissues, mass spectrometry-based trichome metabolite profiles, and trichome-related genes curated from published literature. The expressed sequence tag/ unigene sequences have been annotated based upon sequence similarity with popular databases (e.g. Gene Ontology, Kyoto Encyclopedia of Genes and Genomes, and Transporter Classification Database). The unigenes, metabolites, curated genes, and probe sets have been mapped against each other to enable comparative analysis. The database also integrates bioinformatics tools with a focus on the mining of trichome-specific genes in unigenes and microarray-based gene expression profiles. TrichOME is a valuable and unique resource for plant trichome research, since the genes and metabolites expressed in trichomes are often underrepresented in regular non-tissue-targeted cDNA libraries. TrichOME is freely available at http://www.planttrichome.org/.

Research paper thumbnail of Comparative Genome Analyses of Plant Rust Pathogen Genomes Reveal a Confluence of Pathogenicity Factors to Quell Host Plant Defense Responses

Plants

Switchgrass rust caused by Puccinia novopanici (P. novopanici) has the ability to significantly a... more Switchgrass rust caused by Puccinia novopanici (P. novopanici) has the ability to significantly affect the biomass yield of switchgrass, an important biofuel crop in the United States. A comparative genome analysis of P. novopanici with rust pathogen genomes infecting monocot cereal crops wheat, barley, oats, maize and sorghum revealed the presence of larger structural variations contributing to their genome sizes. A comparative alignment of the rust pathogen genomes resulted in the identification of collinear and syntenic relationships between P. novopanici and P. sorghi; P. graminis tritici 21–0 (Pgt 21) and P. graminis tritici Ug99 (Pgt Ug99) and between Pgt 21 and P. triticina (Pt). Repeat element analysis indicated a strong presence of retro elements among different Puccinia genomes, contributing to the genome size variation between ~1 and 3%. A comparative look at the enriched protein families of Puccinia spp. revealed a predominant role of restriction of telomere capping prot...

Research paper thumbnail of Insertional mutagenesis of Brachypodium distachyon using the Tnt1 retrotransposable element

The Plant Journal, 2020

SUMMARYBrachypodium distachyon is an annual C3 grass used as a monocot model system in functional... more SUMMARYBrachypodium distachyon is an annual C3 grass used as a monocot model system in functional genomics research. Insertional mutagenesis is a powerful tool for both forward and reverse genetics studies. In this study, we explored the possibility of using the tobacco retrotransposon Tnt1 to create a transposon‐based insertion mutant population in B. distachyon. We developed transgenic B. distachyon plants expressing Tnt1 (R0) and in the subsequent regenerants (R1) we observed that Tnt1 actively transposed during somatic embryogenesis, generating an average of 6.37 insertions per line in a population of 19 independent R1 regenerant plants analyzed. In seed‐derived progeny of R1 plants, Tnt1 segregated in a Mendelian ratio of 3:1 and no new Tnt1 transposition was observed. A total of 126 flanking sequence tags (FSTs) were recovered from the analyzed R0 and R1 lines. Analysis of the FSTs showed a uniform pattern of insertion in all the chromosomes (1–5) without any preference for a ...

Research paper thumbnail of MtSSPdb: The Medicago truncatula Small Secreted Peptide Database

Plant Physiology, 2020

One-sentence summary: MtSSPdb hosts a compendium of small secreted peptide sequences with annotat... more One-sentence summary: MtSSPdb hosts a compendium of small secreted peptide sequences with annotations and an RNA-seq-based gene expression atlas for Medicago truncatula, a plant small secreted peptide prediction tool, and phenotyping data from synthetic peptide screens in planta.

Research paper thumbnail of A microRNA biogenesis-like pathway for producing phased small interfering RNA from a long non-coding RNA in rice

Journal of Experimental Botany, 2019

Phased small interfering RNAs (phasiRNAs) are a class of non-coding RNAs that perform essential f... more Phased small interfering RNAs (phasiRNAs) are a class of non-coding RNAs that perform essential functions in plants. Unlike microRNA biogenesis from a hairpin structure, the production of phasiRNAs usually requires a phase initiator and an RNA-dependent RNA polymerase (RDR) to form double-strand RNAs. By using full-length rice cDNA (KL-cDNA) to identify phasiRNA loci, we found that a putative non-coding sequence with a long hairpin structure generates the phasiRNAs, which we name Long Hairpin-structure containing non-coding RNA (LHR). The biogenesis of LHR-derived phasiRNAs was dependent on rice DCL4, but not on RDR2/6, DCL1, or DCL3. Since all of the LHR-phasiRNAs (-5p from the forward strand and-3p from the reverse strand of the dsRNAs) are mapped to the forward strand of LHR, LHR-phasiRNAs should be derived from its hairpin structure, similar to a microRNA precursor. A degradome-based validation suggested that several thylakoid-related genes were targeted by LHR-phasiRNAs. In addition, the production of LHR-phasiRNAs was completely abolished in the lhr mutant, which also exhibited decreased plant height, leaf size, and grain weight, probably through the regulation of photosynthesis. Based on our results, we propose a micro-RNA biogenesis-like pathway for producing phased siRNAs that expands our understanding of the current model of phased siRNA biogenesis in plants.

Research paper thumbnail of Genome-Wide Identification of Medicago Peptides Involved in Macronutrient Responses and Nodulation

Plant Physiology, 2017

Growing evidence indicates that small, secreted peptides (SSPs) play critical roles in legume gro... more Growing evidence indicates that small, secreted peptides (SSPs) play critical roles in legume growth and development, yet the annotation of SSP-coding genes is far from complete. Systematic reannotation of the Medicago truncatula genome identified 1,970 homologs of established SSP gene families and an additional 2,455 genes that are potentially novel SSPs, previously unreported in the literature. The expression patterns of known and putative SSP genes based on 144 RNA sequencing data sets covering various stages of macronutrient deficiencies and symbiotic interactions with rhizobia and mycorrhiza were investigated. Focusing on those known or suspected to act via receptor-mediated signaling, 240 nutrient-responsive and 365 nodulation-responsive Signaling-SSPs were identified, greatly expanding the number of SSP gene families potentially involved in acclimation to nutrient deficiencies and nodulation. Synthetic peptide applications were shown to alter root growth and nodulation phenotypes, revealing additional regulators of legume nutrient acquisition. Our results constitute a powerful resource enabling further investigations of specific SSP functions via peptide treatment and reverse genetics.

Research paper thumbnail of iTAK: A Program for Genome-wide Prediction and Classification of Plant Transcription Factors, Transcriptional Regulators, and Protein Kinases

Research paper thumbnail of LegumeIP 2.0-a platform for the study of gene function and genome evolution in legumes

Nucleic acids research, Jan 17, 2015

The LegumeIP 2.0 database hosts large-scale genomics and transcriptomics data and provides integr... more The LegumeIP 2.0 database hosts large-scale genomics and transcriptomics data and provides integrative bioinformatics tools for the study of gene function and evolution in legumes. Our recent updates in LegumeIP 2.0 include gene and protein sequences, gene models and annotations, syntenic regions, protein families and phylogenetic trees for six legume species: Medicago truncatula, Glycine max (soybean), Lotus japonicus, Phaseolus vulgaris (common bean), Cicer arietinum (chickpea) and Cajanus cajan (pigeon pea) and two outgroup reference species: Arabidopsis thaliana and Poplar trichocarpa. Moreover, the LegumeIP 2.0 features the following new data resources and bioinformatics tools: (i) an integrative gene expression atlas for four model legumes that include 550 array hybridizations from M. truncatula, 962 gene expression profiles of G. max, 276 array hybridizations from L. japonicas and 56 RNA-Seq-based gene expression profiles for C. arietinum. These datasets were manually curated...

Research paper thumbnail of The Medicago sativa gene index 1.2: a web-accessible gene expression atlas for investigating expression differences between Medicago sativa subspecies

BMC genomics, Jan 7, 2015

Alfalfa (Medicago sativa L.) is the primary forage legume crop species in the United States and p... more Alfalfa (Medicago sativa L.) is the primary forage legume crop species in the United States and plays essential economic and ecological roles in agricultural systems across the country. Modern alfalfa is the result of hybridization between tetraploid M. sativa ssp. sativa and M. sativa ssp. falcata. Due to its large and complex genome, there are few genomic resources available for alfalfa improvement. A de novo transcriptome assembly from two alfalfa subspecies, M. sativa ssp. sativa (B47) and M. sativa ssp. falcata (F56) was developed using Illumina RNA-seq technology. Transcripts from roots, nitrogen-fixing root nodules, leaves, flowers, elongating stem internodes, and post-elongation stem internodes were assembled into the Medicago sativa Gene Index 1.2 (MSGI 1.2) representing 112,626 unique transcript sequences. Nodule-specific and transcripts involved in cell wall biosynthesis were identified. Statistical analyses identified 20,447 transcripts differentially expressed between t...

Research paper thumbnail of PLAN: a web platform for automating high-throughput BLAST searches and for managing and mining results

BMC bioinformatics, Jan 9, 2007

BLAST searches are widely used for sequence alignment. The search results are commonly adopted fo... more BLAST searches are widely used for sequence alignment. The search results are commonly adopted for various functional and comparative genomics tasks such as annotating unknown sequences, investigating gene models and comparing two sequence sets. Advances in sequencing technologies pose challenges for high-throughput analysis of large-scale sequence data. A number of programs and hardware solutions exist for efficient BLAST searching, but there is a lack of generic software solutions for mining and personalized management of the results. Systematically reviewing the results and identifying information of interest remains tedious and time-consuming. Personal BLAST Navigator (PLAN) is a versatile web platform that helps users to carry out various personalized pre- and post-BLAST tasks, including: (1) query and target sequence database management, (2) automated high-throughput BLAST searching, (3) indexing and searching of results, (4) filtering results online, (5) managing results of p...

Research paper thumbnail of Comprehensive analysis of small RNA-seq data reveals that combination of miRNA with its isomiRs increase the accuracy of target prediction in Arabidopsis

RNA biology, Jan 28, 2015

Along with the canonical miRNA, distinct miRNA-like sequences called sibling miRNAs (sib-miRs) ar... more Along with the canonical miRNA, distinct miRNA-like sequences called sibling miRNAs (sib-miRs) are generated from the same pre-miRNA. Among them, isomeric sequences featuring slight variations at the terminals, relative to the canonical miRNA, constitute a pool of isomeric sibling miRNAs (isomiRs). Despite the high prevalence of isomiRs in eukaryotes, their features and relevance remain elusive. In this study, we performed a comprehensive analysis of mature precursor miRNA (pre-miRNA) sequences from Arabidopsis to understand their features and regulatory targets. The influence of isomiR terminal heterogeneity in target binding was examined comprehensively. Our comprehensive analyses suggested a novel computational strategy that utilizes miRNA and its isomiRs to enhance the accuracy of their regulatory target prediction in Arabidopsis. A few targets are shared by several members of isomiRs; however, this phenomenon was not typical. Gene Ontology (GO) enrichment analysis showed that c...

Research paper thumbnail of Terpene Biosynthesis in Glandular Trichomes of Hop    

Plant Physiology, 2008

Hop (Humulus lupulus L. Cannabaceae) is an economically important crop for the brewing industry, ... more Hop (Humulus lupulus L. Cannabaceae) is an economically important crop for the brewing industry, where it is used to impart flavor and aroma to beer, and has also drawn attention in recent years due to its potential pharmaceutical applications. Essential oils (mono- and sesquiterpenes), bitter acids (prenylated polyketides), and prenylflavonoids are the primary phytochemical components that account for these traits, and all accumulate at high concentrations in glandular trichomes of hop cones. To understand the molecular basis for terpene accumulation in hop trichomes, a trichome cDNA library was constructed and 9,816 cleansed expressed sequence tag (EST) sequences were obtained from random sequencing of 16,152 cDNA clones. The ESTs were assembled into 3,619 unigenes (1,101 contigs and 2,518 singletons). Putative functions were assigned to the unigenes based on their homology to annotated sequences in the GenBank database. Two mono- and two sesquiterpene synthases identified from th...

Research paper thumbnail of TrichOME: A Comparative Omics Database for Plant Trichomes      

Plant Physiology, 2009

Plant secretory trichomes have a unique capacity for chemical synthesis and secretion and have be... more Plant secretory trichomes have a unique capacity for chemical synthesis and secretion and have been described as biofactories for the production of natural products. However, until recently, most trichome-specific metabolic pathways and genes involved in various trichome developmental stages have remained unknown. Furthermore, only a very limited amount of plant trichome genomics information is available in scattered databases. We present an integrated “omics” database, TrichOME, to facilitate the study of plant trichomes. The database hosts a large volume of functional omics data, including expressed sequence tag/unigene sequences, microarray hybridizations from both trichome and control tissues, mass spectrometry-based trichome metabolite profiles, and trichome-related genes curated from published literature. The expressed sequence tag/unigene sequences have been annotated based upon sequence similarity with popular databases (e.g. Gene Ontology, Kyoto Encyclopedia of Genes and Ge...

Research paper thumbnail of The Medicago genome provides insight into the evolution of rhizobial symbioses

Research paper thumbnail of Regulation of a Virus-Induced Lethal Disease in Tomato Revealed by LongSAGE Analysis

Molecular Plant-Microbe Interactions®, 2007

Infection of Cucumber mosaic virus (CMV) and D satellite RNA (satRNA) in tomato plants induces ra... more Infection of Cucumber mosaic virus (CMV) and D satellite RNA (satRNA) in tomato plants induces rapid plant death, which has caused catastrophic crop losses. We conducted long serial analysis of gene expression (LongSAGE) in control and virus-infected plants to identify the genes that may be involved in the development of this lethal tomato disease. The transcriptomes were compared between mock-inoculated plants and plants infected with CMV, CMV/D satRNA, or CMV/Dm satRNA (a nonnecrogenic mutant of D satRNA with three mutated nucleotides). The analysis revealed both general and specific changes in the tomato transcriptome after infection with these viruses. A massive transcriptional difference of approximately 400 genes was found between the transcriptomes of CMV/D and CMV/Dm satRNA-infected plants. Particularly, the Long-SAGE data indicated the activation of ethylene synthesis and signaling by CMV/D satRNA infection. Results from inoculation tests with an ethylene-insensitive mutant...

Research paper thumbnail of Differential mRNA Translation in Medicago truncatula Accessions with Contrasting Responses to Ozone-Induced Oxidative Stress

Molecular Plant, 2012

Acute ozone is a model abiotic elicitor of oxidative stress and a useful tool for understanding b... more Acute ozone is a model abiotic elicitor of oxidative stress and a useful tool for understanding biochemical and molecular events during oxidative signaling. Two Medicago truncatula accessions with contrasting responses to ozone were used to examine translational regulation during ozone stress. In ozone-resistant JE154, significant reduction in ribosome loading was observed within one hour of ozone treatment, suggesting energy homeostasis as a vital factor for oxidative stress management. Polysomal RNA-based expression profiling with Affymetrix arrays revealed extensive changes in the translatomes of both accessions. Messenger RNAs with low GC content in their 5' and 3'-UTRs were preferentially associated with polysomes during oxidative stress. Genebins analysis revealed extensive changes in various gene ontologies in both accessions. Extensive changes in nicotinate and nicotinamide metabolism genes were corroborated with increased levels of NAD 1 and NADH in JE154. The significantly lower NAD 1 :NADH redox status in JE154, in conjunction with higher ATP amounts, provided a cellular milieu conducive for overcoming oxidative stress. Low levels of ATP, NADH, and suppression of antioxidant defense responses, abet build-up of ozone-derived ROS and ultimately lead to oxidative cell death in Jemalong.

Research paper thumbnail of A legume specific protein database (LegProt) improves the number of identified peptides, confidence scores and overall protein identification success rates for legume proteomics

Phytochemistry, 2011

A legume specific protein database (LegProt) has been created containing sequences from seven leg... more A legume specific protein database (LegProt) has been created containing sequences from seven legume species, i.e., Glycine max, Lotus japonicus, Medicago sativa, Medicago truncatula, Lupinusalbus, Phaseolus vulgaris, and Pisum sativum. The database consists of amino acid sequences translated from predicted gene models and 6-frame translations of tentative consensus (TC) sequences assembled from expressed sequence tags (ESTs) and singleton ESTs. This database was queried using mass spectral data for protein identification and identification success rates were compared to the NCBI nr database. Specifically, Mascot MS/MS ion searches of tandem nano-LC Q-TOFMS/MS mass spectral data showed that relative to the NCBI nr protein database, the LegProt database yielded a 54% increase in the average protein score (i.e., from NCBI nr 480 to LegProt 739) and a 50% increase in the average number of matched peptides (i.e., from NCBI nr 8 to LegProt 12). The overall identification success rate als...

Research paper thumbnail of The CLE53–SUNN genetic pathway negatively regulates arbuscular mycorrhiza root colonization in Medicago truncatula

Journal of Experimental Botany, Apr 20, 2020

Plants and arbuscular mycorrhizal fungi (AMF) engage in mutually beneficial symbioses based on a ... more Plants and arbuscular mycorrhizal fungi (AMF) engage in mutually beneficial symbioses based on a reciprocal exchange of nutrients. The beneficial character of the symbiosis is maintained through a mechanism called autoregulation of mycorrhization (AOM). AOM includes root-to-shoot-to-root signaling; however, the molecular details of AOM are poorly understood. AOM shares many features of autoregulation of nodulation (AON) where several genes are known, including the receptor-like kinase SUPER NUMERIC NODULES (SUNN), root-to-shoot mobile CLAVATA3/ENDOSPERM SURROUNDING REGION (ESR)-RELATED (CLE) peptides, and the hydroxyproline O-arabinosyltransferase ROOT DETERMINED NODULATION1 (RDN1) required for post-translational peptide modification. In this work, CLE53 was identified to negatively regulate AMF symbiosis in a SUNN-and RDN1-dependent manner. CLE53 expression was repressed at low phosphorus, while it was induced by AMF colonization and high phosphorus. CLE53 overexpression reduced AMF colonization in a SUNN-and RDN1 dependent manner, while cle53, rdn1, and sunn mutants were more colonized than the wild type. RNA-sequencing identified 700 genes with SUNN-dependent regulation in AMFcolonized plants, providing a resource for future identification of additional AOM genes. Disruption of AOM genes in crops potentially constitutes a novel route for improving AMF-derived phosphorus uptake in agricultural systems with high phosphorus levels.

Research paper thumbnail of pssRNAMiner: a plant short small RNA regulatory cascade analysis server

Nucleic Acids Research, 2008

In plants, short RNAs including » 21-nt microRNA (miRNA) and 21-nt transacting siRNA (ta-siRNA) c... more In plants, short RNAs including » 21-nt microRNA (miRNA) and 21-nt transacting siRNA (ta-siRNA) compose a 'miRNA ! ta-siRNA ! target gene' cascade pathway that regulates gene expression at the posttranscriptional level. In this cascade, biogenesis of ta-siRNA clusters requires 21-nt intervals (i.e. phasing) and miRNA (phase-initiator) cleavage sites on its TAS transcript. Here, we report a novel web server, pssRNAMiner, which is developed to identify both the clusters of phased small RNAs as well as the potential phase-initiator. To detect phased small RNA clusters, the pssRNAMiner maps input small RNAs against user-specified transcript/ genomic sequences, and then identifies phased small RNA clusters by evaluating P-values of hypergeometric distribution. To identify potential phaseinitiators, pssRNAMiner aligns input phase-initiators with transcripts of TAS candidates using the Smith-Waterman algorithm. Potential cleavage sites on TAS candidates are further identified from complementary regions by weighting the alignment expectation and its distance to detected phased small RNA clusters. The pssRNAMiner web server is freely available at http://bioinfo3.noble.org/pssRNAMiner/.

Research paper thumbnail of Genome-wide analysis of phenylpropanoid defence pathways

Molecular Plant Pathology, Jul 30, 2010

Phenylpropanoids can function as preformed and inducible antimicrobial compounds, as well as sign... more Phenylpropanoids can function as preformed and inducible antimicrobial compounds, as well as signal molecules, in plantmicrobe interactions. Since we last reviewed the field 8 years ago, there has been a huge increase in our understanding of the genes of phenylpropanoid biosynthesis and their regulation, brought about largely by advances in genome technology, from whole-genome sequencing to massively parallel gene expression profiling. Here, we present an overview of the biosynthesis and roles of phenylpropanoids in plant defence, together with an analysis of confirmed and predicted phenylpropanoid pathway genes in the sequenced genomes of 11 plant species. Examples are provided of phylogenetic and expression clustering analyses, and the large body of underlying genomic data is provided through a website accessible from the article.m pp_648 829..846 FUNCTIONS OF PHENYLPROPANOID COMPOUNDS IN PLANT DEFENCE Lignin and lignans Lignin and lignans share monolignols as common precursors (Figs 1 and 2). Lignification makes the cell wall more resistant to mechanical pressure applied during fungal penetration (Bechinger et al., 1999). A lignified cell wall is water resistant and thus less accessible to cell wall-degrading enzymes from plant pathogens. Natural lignin contains three major units: p-hydroxyphenyl (H), guaiacyl (G) and syringyl (S). The structural analysis of lignins induced by elicitor from Rhizosphaera kalkhoffii in spruce revealed that the defence lignins are significantly different from

Research paper thumbnail of TrichOME: A Comparative Omics Database for Plant Trichomes 1(C)(W)(OA)

Plant secretory trichomes have a unique capacity for chemical synthesis and secretion and have be... more Plant secretory trichomes have a unique capacity for chemical synthesis and secretion and have been described as biofactories for the production of natural products. However, until recently, most trichome-specific metabolic pathways and genes involved in various trichome developmental stages have remained unknown. Furthermore, only a very limited amount of plant trichome genomics information is available in scattered databases. We present an integrated "omics" database, TrichOME, to facilitate the study of plant trichomes. The database hosts a large volume of functional omics data, including expressed sequence tag/unigene sequences, microarray hybridizations from both trichome and control tissues, mass spectrometry-based trichome metabolite profiles, and trichome-related genes curated from published literature. The expressed sequence tag/ unigene sequences have been annotated based upon sequence similarity with popular databases (e.g. Gene Ontology, Kyoto Encyclopedia of Genes and Genomes, and Transporter Classification Database). The unigenes, metabolites, curated genes, and probe sets have been mapped against each other to enable comparative analysis. The database also integrates bioinformatics tools with a focus on the mining of trichome-specific genes in unigenes and microarray-based gene expression profiles. TrichOME is a valuable and unique resource for plant trichome research, since the genes and metabolites expressed in trichomes are often underrepresented in regular non-tissue-targeted cDNA libraries. TrichOME is freely available at http://www.planttrichome.org/.

Research paper thumbnail of Comparative Genome Analyses of Plant Rust Pathogen Genomes Reveal a Confluence of Pathogenicity Factors to Quell Host Plant Defense Responses

Plants

Switchgrass rust caused by Puccinia novopanici (P. novopanici) has the ability to significantly a... more Switchgrass rust caused by Puccinia novopanici (P. novopanici) has the ability to significantly affect the biomass yield of switchgrass, an important biofuel crop in the United States. A comparative genome analysis of P. novopanici with rust pathogen genomes infecting monocot cereal crops wheat, barley, oats, maize and sorghum revealed the presence of larger structural variations contributing to their genome sizes. A comparative alignment of the rust pathogen genomes resulted in the identification of collinear and syntenic relationships between P. novopanici and P. sorghi; P. graminis tritici 21–0 (Pgt 21) and P. graminis tritici Ug99 (Pgt Ug99) and between Pgt 21 and P. triticina (Pt). Repeat element analysis indicated a strong presence of retro elements among different Puccinia genomes, contributing to the genome size variation between ~1 and 3%. A comparative look at the enriched protein families of Puccinia spp. revealed a predominant role of restriction of telomere capping prot...

Research paper thumbnail of Insertional mutagenesis of Brachypodium distachyon using the Tnt1 retrotransposable element

The Plant Journal, 2020

SUMMARYBrachypodium distachyon is an annual C3 grass used as a monocot model system in functional... more SUMMARYBrachypodium distachyon is an annual C3 grass used as a monocot model system in functional genomics research. Insertional mutagenesis is a powerful tool for both forward and reverse genetics studies. In this study, we explored the possibility of using the tobacco retrotransposon Tnt1 to create a transposon‐based insertion mutant population in B. distachyon. We developed transgenic B. distachyon plants expressing Tnt1 (R0) and in the subsequent regenerants (R1) we observed that Tnt1 actively transposed during somatic embryogenesis, generating an average of 6.37 insertions per line in a population of 19 independent R1 regenerant plants analyzed. In seed‐derived progeny of R1 plants, Tnt1 segregated in a Mendelian ratio of 3:1 and no new Tnt1 transposition was observed. A total of 126 flanking sequence tags (FSTs) were recovered from the analyzed R0 and R1 lines. Analysis of the FSTs showed a uniform pattern of insertion in all the chromosomes (1–5) without any preference for a ...

Research paper thumbnail of MtSSPdb: The Medicago truncatula Small Secreted Peptide Database

Plant Physiology, 2020

One-sentence summary: MtSSPdb hosts a compendium of small secreted peptide sequences with annotat... more One-sentence summary: MtSSPdb hosts a compendium of small secreted peptide sequences with annotations and an RNA-seq-based gene expression atlas for Medicago truncatula, a plant small secreted peptide prediction tool, and phenotyping data from synthetic peptide screens in planta.

Research paper thumbnail of A microRNA biogenesis-like pathway for producing phased small interfering RNA from a long non-coding RNA in rice

Journal of Experimental Botany, 2019

Phased small interfering RNAs (phasiRNAs) are a class of non-coding RNAs that perform essential f... more Phased small interfering RNAs (phasiRNAs) are a class of non-coding RNAs that perform essential functions in plants. Unlike microRNA biogenesis from a hairpin structure, the production of phasiRNAs usually requires a phase initiator and an RNA-dependent RNA polymerase (RDR) to form double-strand RNAs. By using full-length rice cDNA (KL-cDNA) to identify phasiRNA loci, we found that a putative non-coding sequence with a long hairpin structure generates the phasiRNAs, which we name Long Hairpin-structure containing non-coding RNA (LHR). The biogenesis of LHR-derived phasiRNAs was dependent on rice DCL4, but not on RDR2/6, DCL1, or DCL3. Since all of the LHR-phasiRNAs (-5p from the forward strand and-3p from the reverse strand of the dsRNAs) are mapped to the forward strand of LHR, LHR-phasiRNAs should be derived from its hairpin structure, similar to a microRNA precursor. A degradome-based validation suggested that several thylakoid-related genes were targeted by LHR-phasiRNAs. In addition, the production of LHR-phasiRNAs was completely abolished in the lhr mutant, which also exhibited decreased plant height, leaf size, and grain weight, probably through the regulation of photosynthesis. Based on our results, we propose a micro-RNA biogenesis-like pathway for producing phased siRNAs that expands our understanding of the current model of phased siRNA biogenesis in plants.

Research paper thumbnail of Genome-Wide Identification of Medicago Peptides Involved in Macronutrient Responses and Nodulation

Plant Physiology, 2017

Growing evidence indicates that small, secreted peptides (SSPs) play critical roles in legume gro... more Growing evidence indicates that small, secreted peptides (SSPs) play critical roles in legume growth and development, yet the annotation of SSP-coding genes is far from complete. Systematic reannotation of the Medicago truncatula genome identified 1,970 homologs of established SSP gene families and an additional 2,455 genes that are potentially novel SSPs, previously unreported in the literature. The expression patterns of known and putative SSP genes based on 144 RNA sequencing data sets covering various stages of macronutrient deficiencies and symbiotic interactions with rhizobia and mycorrhiza were investigated. Focusing on those known or suspected to act via receptor-mediated signaling, 240 nutrient-responsive and 365 nodulation-responsive Signaling-SSPs were identified, greatly expanding the number of SSP gene families potentially involved in acclimation to nutrient deficiencies and nodulation. Synthetic peptide applications were shown to alter root growth and nodulation phenotypes, revealing additional regulators of legume nutrient acquisition. Our results constitute a powerful resource enabling further investigations of specific SSP functions via peptide treatment and reverse genetics.

Research paper thumbnail of iTAK: A Program for Genome-wide Prediction and Classification of Plant Transcription Factors, Transcriptional Regulators, and Protein Kinases

Research paper thumbnail of LegumeIP 2.0-a platform for the study of gene function and genome evolution in legumes

Nucleic acids research, Jan 17, 2015

The LegumeIP 2.0 database hosts large-scale genomics and transcriptomics data and provides integr... more The LegumeIP 2.0 database hosts large-scale genomics and transcriptomics data and provides integrative bioinformatics tools for the study of gene function and evolution in legumes. Our recent updates in LegumeIP 2.0 include gene and protein sequences, gene models and annotations, syntenic regions, protein families and phylogenetic trees for six legume species: Medicago truncatula, Glycine max (soybean), Lotus japonicus, Phaseolus vulgaris (common bean), Cicer arietinum (chickpea) and Cajanus cajan (pigeon pea) and two outgroup reference species: Arabidopsis thaliana and Poplar trichocarpa. Moreover, the LegumeIP 2.0 features the following new data resources and bioinformatics tools: (i) an integrative gene expression atlas for four model legumes that include 550 array hybridizations from M. truncatula, 962 gene expression profiles of G. max, 276 array hybridizations from L. japonicas and 56 RNA-Seq-based gene expression profiles for C. arietinum. These datasets were manually curated...

Research paper thumbnail of The Medicago sativa gene index 1.2: a web-accessible gene expression atlas for investigating expression differences between Medicago sativa subspecies

BMC genomics, Jan 7, 2015

Alfalfa (Medicago sativa L.) is the primary forage legume crop species in the United States and p... more Alfalfa (Medicago sativa L.) is the primary forage legume crop species in the United States and plays essential economic and ecological roles in agricultural systems across the country. Modern alfalfa is the result of hybridization between tetraploid M. sativa ssp. sativa and M. sativa ssp. falcata. Due to its large and complex genome, there are few genomic resources available for alfalfa improvement. A de novo transcriptome assembly from two alfalfa subspecies, M. sativa ssp. sativa (B47) and M. sativa ssp. falcata (F56) was developed using Illumina RNA-seq technology. Transcripts from roots, nitrogen-fixing root nodules, leaves, flowers, elongating stem internodes, and post-elongation stem internodes were assembled into the Medicago sativa Gene Index 1.2 (MSGI 1.2) representing 112,626 unique transcript sequences. Nodule-specific and transcripts involved in cell wall biosynthesis were identified. Statistical analyses identified 20,447 transcripts differentially expressed between t...

Research paper thumbnail of PLAN: a web platform for automating high-throughput BLAST searches and for managing and mining results

BMC bioinformatics, Jan 9, 2007

BLAST searches are widely used for sequence alignment. The search results are commonly adopted fo... more BLAST searches are widely used for sequence alignment. The search results are commonly adopted for various functional and comparative genomics tasks such as annotating unknown sequences, investigating gene models and comparing two sequence sets. Advances in sequencing technologies pose challenges for high-throughput analysis of large-scale sequence data. A number of programs and hardware solutions exist for efficient BLAST searching, but there is a lack of generic software solutions for mining and personalized management of the results. Systematically reviewing the results and identifying information of interest remains tedious and time-consuming. Personal BLAST Navigator (PLAN) is a versatile web platform that helps users to carry out various personalized pre- and post-BLAST tasks, including: (1) query and target sequence database management, (2) automated high-throughput BLAST searching, (3) indexing and searching of results, (4) filtering results online, (5) managing results of p...

Research paper thumbnail of Comprehensive analysis of small RNA-seq data reveals that combination of miRNA with its isomiRs increase the accuracy of target prediction in Arabidopsis

RNA biology, Jan 28, 2015

Along with the canonical miRNA, distinct miRNA-like sequences called sibling miRNAs (sib-miRs) ar... more Along with the canonical miRNA, distinct miRNA-like sequences called sibling miRNAs (sib-miRs) are generated from the same pre-miRNA. Among them, isomeric sequences featuring slight variations at the terminals, relative to the canonical miRNA, constitute a pool of isomeric sibling miRNAs (isomiRs). Despite the high prevalence of isomiRs in eukaryotes, their features and relevance remain elusive. In this study, we performed a comprehensive analysis of mature precursor miRNA (pre-miRNA) sequences from Arabidopsis to understand their features and regulatory targets. The influence of isomiR terminal heterogeneity in target binding was examined comprehensively. Our comprehensive analyses suggested a novel computational strategy that utilizes miRNA and its isomiRs to enhance the accuracy of their regulatory target prediction in Arabidopsis. A few targets are shared by several members of isomiRs; however, this phenomenon was not typical. Gene Ontology (GO) enrichment analysis showed that c...

Research paper thumbnail of Terpene Biosynthesis in Glandular Trichomes of Hop    

Plant Physiology, 2008

Hop (Humulus lupulus L. Cannabaceae) is an economically important crop for the brewing industry, ... more Hop (Humulus lupulus L. Cannabaceae) is an economically important crop for the brewing industry, where it is used to impart flavor and aroma to beer, and has also drawn attention in recent years due to its potential pharmaceutical applications. Essential oils (mono- and sesquiterpenes), bitter acids (prenylated polyketides), and prenylflavonoids are the primary phytochemical components that account for these traits, and all accumulate at high concentrations in glandular trichomes of hop cones. To understand the molecular basis for terpene accumulation in hop trichomes, a trichome cDNA library was constructed and 9,816 cleansed expressed sequence tag (EST) sequences were obtained from random sequencing of 16,152 cDNA clones. The ESTs were assembled into 3,619 unigenes (1,101 contigs and 2,518 singletons). Putative functions were assigned to the unigenes based on their homology to annotated sequences in the GenBank database. Two mono- and two sesquiterpene synthases identified from th...

Research paper thumbnail of TrichOME: A Comparative Omics Database for Plant Trichomes      

Plant Physiology, 2009

Plant secretory trichomes have a unique capacity for chemical synthesis and secretion and have be... more Plant secretory trichomes have a unique capacity for chemical synthesis and secretion and have been described as biofactories for the production of natural products. However, until recently, most trichome-specific metabolic pathways and genes involved in various trichome developmental stages have remained unknown. Furthermore, only a very limited amount of plant trichome genomics information is available in scattered databases. We present an integrated “omics” database, TrichOME, to facilitate the study of plant trichomes. The database hosts a large volume of functional omics data, including expressed sequence tag/unigene sequences, microarray hybridizations from both trichome and control tissues, mass spectrometry-based trichome metabolite profiles, and trichome-related genes curated from published literature. The expressed sequence tag/unigene sequences have been annotated based upon sequence similarity with popular databases (e.g. Gene Ontology, Kyoto Encyclopedia of Genes and Ge...

Research paper thumbnail of The Medicago genome provides insight into the evolution of rhizobial symbioses

Research paper thumbnail of Regulation of a Virus-Induced Lethal Disease in Tomato Revealed by LongSAGE Analysis

Molecular Plant-Microbe Interactions®, 2007

Infection of Cucumber mosaic virus (CMV) and D satellite RNA (satRNA) in tomato plants induces ra... more Infection of Cucumber mosaic virus (CMV) and D satellite RNA (satRNA) in tomato plants induces rapid plant death, which has caused catastrophic crop losses. We conducted long serial analysis of gene expression (LongSAGE) in control and virus-infected plants to identify the genes that may be involved in the development of this lethal tomato disease. The transcriptomes were compared between mock-inoculated plants and plants infected with CMV, CMV/D satRNA, or CMV/Dm satRNA (a nonnecrogenic mutant of D satRNA with three mutated nucleotides). The analysis revealed both general and specific changes in the tomato transcriptome after infection with these viruses. A massive transcriptional difference of approximately 400 genes was found between the transcriptomes of CMV/D and CMV/Dm satRNA-infected plants. Particularly, the Long-SAGE data indicated the activation of ethylene synthesis and signaling by CMV/D satRNA infection. Results from inoculation tests with an ethylene-insensitive mutant...

Research paper thumbnail of Differential mRNA Translation in Medicago truncatula Accessions with Contrasting Responses to Ozone-Induced Oxidative Stress

Molecular Plant, 2012

Acute ozone is a model abiotic elicitor of oxidative stress and a useful tool for understanding b... more Acute ozone is a model abiotic elicitor of oxidative stress and a useful tool for understanding biochemical and molecular events during oxidative signaling. Two Medicago truncatula accessions with contrasting responses to ozone were used to examine translational regulation during ozone stress. In ozone-resistant JE154, significant reduction in ribosome loading was observed within one hour of ozone treatment, suggesting energy homeostasis as a vital factor for oxidative stress management. Polysomal RNA-based expression profiling with Affymetrix arrays revealed extensive changes in the translatomes of both accessions. Messenger RNAs with low GC content in their 5' and 3'-UTRs were preferentially associated with polysomes during oxidative stress. Genebins analysis revealed extensive changes in various gene ontologies in both accessions. Extensive changes in nicotinate and nicotinamide metabolism genes were corroborated with increased levels of NAD 1 and NADH in JE154. The significantly lower NAD 1 :NADH redox status in JE154, in conjunction with higher ATP amounts, provided a cellular milieu conducive for overcoming oxidative stress. Low levels of ATP, NADH, and suppression of antioxidant defense responses, abet build-up of ozone-derived ROS and ultimately lead to oxidative cell death in Jemalong.

Research paper thumbnail of A legume specific protein database (LegProt) improves the number of identified peptides, confidence scores and overall protein identification success rates for legume proteomics

Phytochemistry, 2011

A legume specific protein database (LegProt) has been created containing sequences from seven leg... more A legume specific protein database (LegProt) has been created containing sequences from seven legume species, i.e., Glycine max, Lotus japonicus, Medicago sativa, Medicago truncatula, Lupinusalbus, Phaseolus vulgaris, and Pisum sativum. The database consists of amino acid sequences translated from predicted gene models and 6-frame translations of tentative consensus (TC) sequences assembled from expressed sequence tags (ESTs) and singleton ESTs. This database was queried using mass spectral data for protein identification and identification success rates were compared to the NCBI nr database. Specifically, Mascot MS/MS ion searches of tandem nano-LC Q-TOFMS/MS mass spectral data showed that relative to the NCBI nr protein database, the LegProt database yielded a 54% increase in the average protein score (i.e., from NCBI nr 480 to LegProt 739) and a 50% increase in the average number of matched peptides (i.e., from NCBI nr 8 to LegProt 12). The overall identification success rate als...