Søren Vang - Academia.edu
Papers by Søren Vang
Gut, 2015
To develop an affordable and robust pipeline for selection of patient-specific somatic structural variants (SSVs) being informative about radicality of the primary resection, response to adjuvant therapy, incipient recurrence and response to treatment performed in relation to diagnosis of recurrence. We have established efficient procedures for identification of SSVs by next-generation sequencing and subsequent quantification of 3-6 SSVs in plasma. The consequence of intratumour heterogeneity on our approach was assessed. The level of circulating tumour DNA (ctDNA) was quantified in 151 serial plasma samples from six relapsing and five non-relapsing colorectal cancer (CRC) patients by droplet digital PCR, and correlated to clinical findings. Up to six personalised assays were designed for each patient. Our approach enabled efficient temporal assessment of disease status, response to surgical and oncological intervention, and early detection of incipient recurrence. Our approach provided 2-15 (mean 10) months' lead time on detection of metastatic recurrence compared to conventional follow-up. The sensitivity and specificity of the SSVs in terms of detecting postsurgery relapse were 100%. We show that assessment of ctDNA is a non-invasive, exquisitely specific and highly sensitive approach for monitoring disease load, which has the potential to provide clinically relevant lead times compared with conventional methods. Furthermore, we provide a low-coverage protocol optimised for identifying SSVs with excellent correlation between SSVs identified in tumours and matched metastases. Application of ctDNA analysis has the potential to change clinical practice in the management of CRC.
... Christina B Pedersen1, Email: cbak@ki.au.dk. ... Zhang J, Li X, Mueller M, Wang Y, Zong C, Deng N, Vondriska TM, Liem DA, Yang JI, Korge P, Honda H, Weiss JN, Apweiler R, Ping P. Systematic characterization of the murine mitochondrial proteome using functionally validated cardiac ...
Methods in Molecular Biology, 2010
The ambition to measure all or at least a significant fraction of relevant molecules in a cell culture or tissue sample has reached possible realization with the development of the so-called OMICS technologies. We will here briefly review current technologies and give examples of their applications in investigations related to protein misfolding diseases. We will primarily cover the classical OMICS categories GENOMICS, TRANSCRIPTOMICS, METABOLOMICS, and with some more detail PROTEOMICS. These techniques are in most cases performed by dedicated core facilities or commercial services. We will give an assessment of uses as well as limitations of these technologies supported by examples of their application in research related to protein misfolding. We will further briefly discuss genome-wide RNA interference and finally touch on bioinformatics, because the huge amounts of data typically collected with OMICS techniques require the application of specific software to handle and stratify the data sets. Today, most biologists using OMICS techniques must, at least in part, be able to analyze their own data using user-friendly web-based tools.
Protein misfolding is a common event in living cells. In young and healthy cells, the misfolded protein load is disposed of by protein quality control (PQC) systems. In aging cells and in cells from certain individuals with genetic diseases, the load may overwhelm the PQC capacity, resulting in accumulation of misfolded proteins. Depending on the properties of the protein and the efficiency of the PQC systems, the accumulated protein may be degraded or assembled into toxic oligomers and aggregates. To illustrate this concept, we discuss a number of very different protein misfolding diseases including phenylketonuria, Parkinson's disease, α-1-antitrypsin deficiency, familial neurohypophyseal diabetes insipidus, and short-chain acyl-CoA dehydrogenase deficiency. Despite the differences, an emerging paradigm suggests that the cellular effects of protein misfolding provide a common framework that may contribute to the elucidation of the cell pathology and guide intervention and treatment strategies of many genetic and age-dependent diseases.
PLoS ONE, 2014
Formalin-fixed, paraffin-embedded (FFPE) tissues are an invaluable resource for clinical research. However, nucleic acids extracted from FFPE tissues are fragmented and chemically modified, making them challenging to use in molecular studies. We analysed 23 fresh-frozen (FF), 35 FFPE and 38 paired FF/FFPE specimens, representing six different human tissue types (bladder, prostate and colon carcinoma; liver and colon normal tissue; reactive tonsil) in order to examine the potential use of FFPE samples in next-generation sequencing (NGS) based retrospective and prospective clinical studies. Two methods for DNA and three methods for RNA extraction from FFPE tissues were compared and were found to affect nucleic acid quantity and quality. DNA and RNA from selected FFPE and paired FF/FFPE specimens were used for exome and transcriptome analysis. Preparation of DNA Exome-Seq libraries was more challenging (29.5% success) than that of RNA-Seq libraries, presumably because of modifications to FFPE tissue-derived DNA. Libraries could still be prepared from RNA isolated from two-decade-old FFPE tissues. Data were analysed using the CLC Bio Genomics Workbench and revealed systematic differences between FF and FFPE tissue-derived nucleic acid libraries. In spite of this, pairwise analysis of DNA Exome-Seq data showed concordance for 70-80% of variants in FF and FFPE samples stored for fewer than three years. RNA-Seq data showed high correlation of expression profiles in FF/FFPE pairs (Pearson correlations of 0.90 ± 0.05), irrespective of storage time (up to 244 months) and tissue type. A common set of 1,494 genes was identified with expression profiles that were significantly different between paired FF and FFPE samples irrespective of tissue type.
Our results are promising and suggest that NGS can be used to study FFPE specimens in both prospective and retrospective archive-based studies in which FF specimens are not available. Citation: Hedegaard J, Thorsen K, Lund MK, Hein A-MK, Hamilton-Dutoit SJ, et al. (2014) Next-Generation Sequencing of RNA and DNA Isolated from Paired Fresh-Frozen and Formalin-Fixed Paraffin-Embedded Samples of Human Cancer and Normal Tissue. PLoS ONE 9(5): e98187.
Cell Reports, 2014
Bladder cancer (or urothelial cell carcinoma [UCC]) is characterized by field disease (malignant alterations in surrounding mucosa) and frequent recurrences. Whole-genome, exome, and transcriptome sequencing of 38 tumors, including four metachronous tumor pairs and 20 superficial tumors, identified an APOBEC mutational signature in one-third. This was biased toward the sense strand, correlated with mean expression level, and clustered near breakpoints. A > G mutations were up to eight times more frequent on the sense strand (p < 0.002) in [ACG]AT contexts. The patient-specific APOBEC signature was negatively correlated to repair-gene expression and was not related to clinicopathological parameters. Mutations in gene families and single genes were related to tumor stage, and expression of chromatin modifiers correlated with survival. Evolutionary and subclonal analyses of early/late tumor pairs showed a unitary origin, and discrete tumor clones contained mutated cancer genes. The ancestral clones contained Pik3ca/Kdm6a mutations and may reflect the field-disease mutations shared among later tumors.
THE PLANT CELL ONLINE, 2006
Retroposition is widely found to play essential roles in origination of new mammalian and other animal genes. However, the scarcity of retrogenes in plants has led to the assumption that plant genomes rarely evolve new gene duplicates by retroposition, despite abundant retrotransposons in plants and a reported long terminal repeat (LTR) retrotransposon-mediated mechanism of retroposing cellular genes in maize (Zea mays). We show extensive retropositions in the rice (Oryza sativa) genome, with 1235 identified primary retrogenes. We identified 27 of these primary retrogenes within LTR retrotransposons, confirming a previously observed role of retroelements in generating plant retrogenes. Substitution analyses revealed that the vast majority are subject to negative selection, suggesting, along with expression data and evidence of age, that they are likely functional retrogenes. In addition, 42% of these retrosequences have recruited new exons from flanking regions, generating a large number of chimerical genes. We also identified young chimerical genes, suggesting that gene origination through retroposition is ongoing, with a rate an order of magnitude higher than the rate in primates. Finally, we observed that retropositions have followed an unexpected spatial pattern in which functional retrogenes avoid centromeric regions, while retropseudogenes are randomly distributed. These observations suggest that retroposition is an important mechanism that governs gene evolution in rice and other grass species.
Proteome Science, 2009
Background: Mitochondrial proteins are central to various metabolic activities and are key regulators of apoptosis. Disturbance of mitochondrial proteins is therefore often associated with disease. Large-scale protein data are required to capture the mitochondrial protein levels, and mass spectrometry-based proteomics is suitable for generating such data. To study the relative quantities of mitochondrial proteins in cells from cultivated human skin fibroblasts, we applied a proteomic method based on nanoLC-MS/MS analysis of iTRAQ-labeled peptides.
PLoS ONE, 2011
Celastrol, a natural substance isolated from plant extracts used in traditional Chinese medicine, has been extensively investigated as a possible drug for treatment of cancer, autoimmune diseases, and protein misfolding disorders. Although studies focusing on celastrol's effects in specific cellular pathways have revealed a considerable number of targets in a diverse array of in vitro models, there is an essential need for investigations that can provide a global view of its effects. To assess cellular effects of celastrol and to identify target proteins as biomarkers for monitoring treatment regimes, we performed large-scale quantitative proteomics in cultured human lymphoblastoid cells, a cell type that can be readily prepared from human blood samples. Celastrol substantially modified the proteome composition, and 158 of the close to 1800 proteins with robust quantitation showed at least a 1.5-fold change in protein levels. Up-regulated proteins play key roles in cytoprotection, with a prominent group involved in quality control and processing of proteins traversing the endoplasmic reticulum. Increased levels of proteins essential for the cellular protection against oxidative stress, including heme oxygenase 1, several peroxiredoxins and thioredoxins, as well as proteins involved in the control of iron homeostasis, were also observed. Specific analysis of the mitochondrial proteome strongly indicated that the mitochondrial association of certain antioxidant defense and apoptosis-regulating proteins increased in cells exposed to celastrol. Analysis of selected mRNA transcripts showed that celastrol activated several different stress response pathways, and dose-response studies furthermore showed that continuous exposure to sub-micromolar concentrations of celastrol is associated with reduced cellular viability and proliferation.
The extensive catalog of regulated proteins presented here identifies numerous cellular effects of celastrol and constitutes a valuable biomarker tool for the development and monitoring of disease treatment strategies.
Nucleic Acids Research, 2007
Gene duplication is an important process in evolution. The availability of genome sequences of a number of organisms has made it possible to conduct comprehensive searches for duplicated genes enabling informative studies of their evolution. We have established the FGF (Fishing Gene Family) program to efficiently search for and identify gene families. The FGF output displays the results as visual phylogenetic trees including information on gene structure, chromosome position, duplication fate and selective pressure. It is particularly useful to identify pseudogenes and detect changes in gene structure. FGF is freely available on a web server at
Nucleic Acids Research, 2007
TreeFam (http://www.treefam.org) was developed to provide curated phylogenetic trees for all animal gene families, as well as orthologue and paralogue assignments. Release 4.0 of TreeFam contains curated trees for 1314 families and automatically generated trees for another 14 351 families. We have expanded TreeFam to include 25 fully sequenced animal genomes, as well as four genomes from plant and fungal outgroup species. We have also introduced more accurate approaches for automatically grouping genes into families, for building phylogenetic trees, and for inferring orthologues and paralogues. The user interface for viewing phylogenetic trees and family information has been improved. Furthermore, a new Perl API lets users easily extract data from the TreeFam MySQL database.
Nucleic Acids Research, 2007
Platform) is a server designed to comprehensively analyze single genes and relationships between genes based on SNPs in the human genome. The aim of the platform is to facilitate the study of SNP finding and analysis within the framework of medical research. Using a user-friendly web interface, genes can be searched by name, description, position, SNP ID or clone name. Several public databases are integrated, including gene information from Ensembl, protein features from UniProt/SWISS-PROT, Pfam and DAS-CBS. Gene relationships are fetched from BIND, MINT, KEGG and are integrated with ortholog data from TreeFam to extend the current interaction networks. Integrated tools for primer-design and mis-splicing analysis have been developed to facilitate experimental analysis of individual genes with focus on their variation. Snap is available at