Konstantin Popadin | Ecole Polytechnique Federale de Lausanne (original) (raw)

efficiency of purifying selection by Konstantin Popadin

Research paper thumbnail of Slightly deleterious genomic variants and transcriptome perturbations in Down syndrome embryonic selection

J a n u a r y 2 0 1 8 V o l u m e 2 8 N u m b e r 1 C o l d S p r i n g H a r b o r L a b o r a t... more J a n u a r y 2 0 1 8 V o l u m e 2 8 N u m b e r 1 C o l d S p r i n g H a r b o r L a b o r a t o r y P r e s s R esearch G e n o m e From the cover:

Research paper thumbnail of A genetic component of extinction risk in mammals

Genetic factors may play an important role in species extinction but their actual effect remains ... more Genetic factors may play an important role in species extinction but their actual effect remains poorly understood, particularly because of a strong and potentially masking effect expected from ecological traits. We investigated the role of genetics in mammal extinction taking both ecological and genetic factors into account. As a proxy for the role of genetics we used the ratio of the rates of nonsynonymous (amino acid changing) to synonymous (leaving the amino acid unchanged) nucleotide substitutions, K a / K s. Because most nonsynonymous substitutions are likely to be slightly deleterious and thus selected against, this ratio is a measure of the inefficiency of selection: if large (but less than 1), it implies a low efficiency of selection against nonsynonymous mutations. As a result, nonsynonymous mutations may accumulate and thus contribute to extinction. As a proxy for the role of ecology we used body mass W, with which most extinction-related ecological traits strongly correlate. As a measure of extinction risk we used species' affiliation with the five levels of extinction threat according to the IUCN Red List of Threatened Species. We calculated K a / K s for mitochondrial protein-coding genes of 211 mammalian species, each of which was characterized by body mass and the level of threat. Using logistic regression analysis, we then constructed a set of logistic regression models of extinction risk on ln(K a / K s) and lnW. We found that K a / K s and body mass are responsible for a 38% and a 62% increase in extinction risk, respectively. Given that the standard error of these values is 13%, the contribution of genetic factors to extinction risk in mammals is estimated to be one-quarter to one-half of the total of ecological and genetic effects. We conclude that the effect of genetics on extinction is significant, though it is almost certainly smaller than the effect of ecological traits.

Research paper thumbnail of Life-history traits drive the evolutionary rates of mammalian coding and noncoding genomic elements

Proceedings of the National Academy of Sciences, 2007

A comprehensive phylogenetic framework is indispensable for investigating the evolution of genomi... more A comprehensive phylogenetic framework is indispensable for investigating the evolution of genomic features in mammals as a whole, and particularly in humans. Using the ENCODE sequence data, we estimated mammalian neutral evolutionary rates and selective pressures acting on conserved coding and noncoding elements. We show that neutral evolutionary rates can be explained by the generation time (GT) hypothesis. Accordingly, primates (especially humans), having longer GTs than other mammals, display slower rates of neutral evolution. The evolution of constrained elements, particularly of nonsynonymous sites, is in agreement with the expectations of the nearly neutral theory of molecular evolution. We show that rates of nonsynonymous substitutions (dN) depend on the population size of a species. The results are robust to the exclusion of hypermutable CpG prone sites. The average rate of evolution in conserved noncoding sequences (CNCs) is 1.7 times higher than in nonsynonymous sites. Despite this, CNCs evolve at similar or even lower rates than nonsynonymous sites in the majority of basal branches of the eutherian tree. This observation could be the result of an overall gradual or, alternatively, lineage-specific relaxation of CNCs. The latter hypothesis was supported by the finding that 3 of the 20 longest CNCs displayed significant relaxation of individual branches. This observation may explain why the evolution of CNCs fits the expectations of the nearly neutral theory less well than the evolution of nonsynonymous sites.

Research paper thumbnail of Purifying Selection in Mammalian Mitochondrial Protein-Coding Genes Is Highly Effective and Congruent with Evolution of Nuclear Genes

Molecular Biology and Evolution, 2013

Cover Art Caption: Two mammalian phylogenetic trees (mitochondrial on the left; nuclear on the ri... more Cover Art Caption: Two mammalian phylogenetic trees (mitochondrial on the left; nuclear on the right). The mitochondrial tree is characterized by longer (higher rate of molecular evolution) and thinner (lower effective population size) branches as compared to the nuclear tree. The animals on the mitochondrial tree are single, refl ecting the asexual nature of the mitochondrial genome, and are more fi t, indicating highly effective elimination of slightly-deleterious mutations. Hand drawing by Leila Mamirova; artwork provided by

Research paper thumbnail of Accumulation of slightly deleterious mutations in mitochondrial protein-coding genes of large versus small mammals

Proceedings of the National Academy of Sciences of the United States of America, 2007

After the effective size of a population, N(e), declines, some slightly deleterious amino acid re... more After the effective size of a population, N(e), declines, some slightly deleterious amino acid replacements which were initially suppressed by purifying selection become effectively neutral and can reach fixation. Here we investigate this phenomenon for a set of all 13 mitochondrial protein-coding genes from 110 mammalian species. By using body mass as a proxy for N(e), we show that large mammals (i.e., those with low N(e)) as compared with small ones (in our sample these are, on average, 369.5 kg and 275 g, respectively) have a 43% higher rate of accumulation of nonsynonymous nucleotide substitutions relative to synonymous substitutions, and an 8-40% higher rate of accumulation of radical amino acid substitutions relative to conservative substitutions, depending on the type of amino acid classification. These higher rates result in a 6% greater amino acid dissimilarity between modern species and their most recent reconstructed ancestors in large versus small mammals. Because nonsyn...

Research paper thumbnail of A manually curated database of tetrapod mitochondrially encoded tRNA sequences and secondary structures

BMC Bioinformatics, 2007

Background: Mitochondrial tRNAs have been the subject of study for structural biologists interest... more Background: Mitochondrial tRNAs have been the subject of study for structural biologists interested in their secondary structure characteristics, evolutionary biologists have researched patterns of compensatory and structural evolution and medical studies have been directed towards understanding the basis of human disease. However, an up to date, manually curated database of mitochondrially encoded tRNAs from higher animals is currently not available.

Research paper thumbnail of Purifying selection in mitochondria, free-living and obligate intracellular proteobacteria

BMC evolutionary biology, 2007

The effectiveness of elimination of slightly deleterious mutations depends mainly on drift and re... more The effectiveness of elimination of slightly deleterious mutations depends mainly on drift and recombination frequency. Here we analyze the influence of these two factors on the strength of the purifying selection in mitochondrial and proteobacterial orthologous genes taking into account the differences in the organism lifestyles. (I) We found that the probability of fixation of nonsynonymous substitutions (Kn/Ks) in mitochondria is significantly lower compared to obligate intracellular bacteria and even marginally significantly lower compared to free-living bacteria. The comparison of bacteria of different lifestyles demonstrates more effective elimination of slightly deleterious mutations in (II) free-living bacteria as compared to obligate intracellular species and in (III) obligate intracellular parasites as compared to obligate intracellular symbionts. (IV) Finally, we observed that the level of the purifying selection (i.e. 1-Kn/Ks) increases with the density of mobile element...

mitochondrial genome in aging and cancer by Konstantin Popadin

Research paper thumbnail of Cells

Mitochondrial DNA (mtDNA) encodes core subunits of oxidative phosphorylation complexes and, as a ... more Mitochondrial DNA (mtDNA) encodes core subunits of oxidative phosphorylation complexes and, as a result of intricate regulatory crosstalk between nuclear and mitochondrial genomes, the total number of mtDNA copies fits the requirements of each cell type. Deviations from the physiological number of mtDNA copies are expected to be deleterious and might cause some inherited diseases and normal ageing. We studied 46 obese patients with type 2 diabetes (T2DM) one year after a laparoscopic sleeve gastrectomy (LSG) and Roux-en-Y gastric bypass (RYGB). The results were compared with normal-weight patients without T2DM (control group 1) (body mass index (BMI) = 22.5 ± 3.01 kg/m2) and patients with obesity without T2DM (control group 2) (BMI = 36 ± 3.45 kg/m2). We detected an increase of mtDNA copy number in the cells of the buffy coat obtained from peripheral blood, sampled one year after bariatric surgery. We also found that average mtDNA copy number as well as its dynamics (before and after the surgery) are gender-specific. To the best of our knowledge, this is the first evidence for the restoration of mtDNA copy number in obese patients after LSG and RYGB.

Research paper thumbnail of Deleterious in late life mitochondrial alleles and aging: secrets of Japanese centenarians

biorxiv, 2018

Aging is associated with accumulation of somatic mutations. This process is especially pronounced... more Aging is associated with accumulation of somatic mutations. This process is especially pronounced in
mitochondrial genomes of postmitotic cells, which accumulate large-scale somatic mitochondrial deletions with time,
leading to neurodegeneration, muscular dystrophy and aging. Slowing down the rate of origin of these somatic deletions
may benefit human lifespan and healthy aging. The main factors determining breakpoints of somatic mitochondrial
deletions are direct nucleotide repeats, which might be considered as Deleterious In Late Life (DILL) alleles.
Correspondingly, the decreased amount of these DILL alleles might lead to low production of somatic deletions and
increased lifespan. Intriguingly, in the Japanese D4a haplogroup, which is famous for an excess of centenarians and
supercentenarians, we found that the longest direct repeat (“common repeat”) in the human mitochondrial genome has
been disrupted by a point synonymous mutation. Thus we hypothesize that the disruption of the common repeat annuls
common deletion (which is the most frequent among all somatic deletions) and at least partially may contribute to the
extreme longevity of the D4a Japanese haplogroup. Here, to better understand the mitochondrial components of
longevity and potential causative links between repeats, deletions and longevity we discuss molecular, population and
evolutionary factors affecting dynamics of mitochondrial direct repeats.

Research paper thumbnail of Mitochondrial mutational spectrum provides an universal marker of cellular and organismal longevity

It has been shown recently that mitochondrial (mtDNA) somatic variants are numerous enough to tra... more It has been shown recently that mitochondrial (mtDNA) somatic variants are numerous enough to trace cellular lineages in our body. Here we extend this statement and demonstrate that mtDNA variants can be interpreted not only as neutral markers of cell divisions but the relative frequency of different mtDNA substitutions (i.e. mtDNA mutational spectrum) can inform us about important biological properties such as cell longevity. Analysing 7611 somatic mtDNA mutations from 37 types of human cancers and more than 2000 somatic mtDNA mutations from 25 healthy human tissues we observed that mtDNA mutational spectrum is associated with cell turnover rate: the ratio of T>C to G>A is increasing with cell longevity. To extend this logic we considered that, if universal, the discovered mutation bias may drive the differences in mtDNA mutational spectrum between mammalian species with short- (‘mice’) and long- (‘elephants’) lived oocytes. Based on presumably neutral polymorphisms in MT-CYB we reconstructed mutational spectra for 424 mammalian species and obtained that the fraction of T>C positively correlated with the species-specific generation length, which is a good proxy for oocyte longevity. Next, comparing complete mitochondrial genomes of 650 mammalian species we confirmed that exactly the same process shapes the nucleotide content of the most neutral sites in the whole mitochondrial genomes of short- (high T, low C) versus long- (low T, high C) lived mammals. Altogether analysing mtDNA mutations in time interval from dozens of years (somatic mutations) through the hundreds of thousands of years (within species polymorphisms) to millions of years (between species substitutions) we demonstrated that T>C/G>A positively correlates with cellular and organismal longevity. We hypothesize that the discovered mtDNA signature presents a chemical damage which is associated with the level of oxidative metabolism which, in turn, correlates with cellular and organismal longevity. The described properties of mtDNA mutational spectrum shed light on mtDNA replication, mtDNA evolution of mammals and can be used as a marker of cell longevity in single-cell analyses of heterogeneous samples.

Research paper thumbnail of Mitochondrial DNA, direct repeats, deletions, and centenarians: An unfinished puzzle

Biochimica Et Biophysica Acta-bioenergetics, 2010

Research paper thumbnail of Repeats longevity and the sources of mt DNA deletions

The repetitive DNA elements called CRISPRs and their associated genes: evidence of horizontal tra... more The repetitive DNA elements called CRISPRs and their associated genes: evidence of horizontal transfer among prokaryotes. J. Mol. Evol. 62, 718-729 0168-9525/$ -see front matter ?

Research paper thumbnail of Nucleotide repeats in mitochondrial genome determine human lifespan

Direct nucleotide repeats can facilitate deletions of segments of mitochondrial genome 1 , leadin... more Direct nucleotide repeats can facilitate deletions of segments of mitochondrial genome 1 , leading to a wide range of neuromuscular disorders 1,2 as well as aging 2,3 in humans. We hypothesized that the number of the direct perfect repeats in human mitochondrial genomes influences longevity through the formation of harmful mtDNA deletions in the somatic cells. The analysis of the complete mitochondrial genomes of 762 unrelated Japanese individuals 4-6 reveals a negative correlation between the abundance of the direct perfect repeats and the expected longevity. This association is largely due to the disruption of the common repeat (8470,13447) by a point mutation 8473C which occurred at the origin of the D4a haplogroup characterized by extreme longevity in Japan 7 . Our results provide the first evidence for correlation between the number of nucleotide repeats and the lifespan on intraspecific level.

Research paper thumbnail of Mitochondrial DNA, direct repeats, deletions, and centenarians: An unfinished puzzle

Biochimica et Biophysica Acta (BBA) - Bioenergetics, 2010

Research paper thumbnail of When man got his mtDNA deletions

Research paper thumbnail of Mitochondrial DNA Mutations and Cancer

The American Journal of Pathology, 2014

Research paper thumbnail of Roles of mitochondrial dynamics under stressful and normal conditions in yeast cells

Oxidative medicine and cellular longevity, 2013

Eukaryotic cells contain dynamic mitochondrial filaments: they fuse and divide. Here we summarize... more Eukaryotic cells contain dynamic mitochondrial filaments: they fuse and divide. Here we summarize data on the protein machinery driving mitochondrial dynamics in yeast and also discuss the factors that affect the fusion-fission balance. Fission is a general stress response of cells, and in the case of yeast this response appears to be prosurvival. At the same time, even under normal conditions yeast mitochondria undergo continuous cycles of fusion and fission. This seems to be a futile cycle and also expensive from the energy point of view. Why does it exist? Benefits might be the same as in the case of sexual reproduction. Indeed, mixing and separating of mitochondrial content allows mitochondrial DNA to segregate and recombine randomly, leading to high variation in the numbers of mutations per individual mitochondrion. This opens a possibility for effective purifying selection-elimination of mitochondria highly contaminated by deleterious mutations. The beneficial action presumes ...

Research paper thumbnail of Mitochondrial DNA Mutations and Cancer: Lessons from the Parathyroid

The American Journal of Pathology, Sep 18, 2014

transcriptome variation and fitness by Konstantin Popadin

Research paper thumbnail of Gene Age Predicts the Strength of Purifying Selection Acting on Gene Expression Variation in Humans

The American Journal of Human Genetics, 2014

Gene expression levels can be subject to selection. We hypothesized that the age of gene origin i... more Gene expression levels can be subject to selection. We hypothesized that the age of gene origin is associated with expression constraints, given that it affects the level of gene integration into the functional cellular environment. By studying the genetic variation affecting gene expression levels (cis expression quantitative trait loci [cis-eQTLs]) and protein levels (cis protein QTLs [cis-pQTLs]), we determined that young, primate-specific genes are enriched in cis-eQTLs and cis-pQTLs. Compared to cis-eQTLs of old genes originating before the zebrafish divergence, cis-eQTLs of young genes have a higher effect size, are located closer to the transcription start site, are more significant, and tend to influence genes in multiple tissues and populations. These results suggest that the expression constraint of each gene increases throughout its lifespan. We also detected a positive correlation between expression constraints (approximated by cis-eQTL properties) and coding constraints (approximated by Ka/Ks) and observed that this correlation might be driven by gene age. To uncover factors associated with the increase in gene-age-related expression constraints, we demonstrated that gene connectivity, gene involvement in complex regulatory networks, gene haploinsufficiency, and the strength of posttranscriptional regulation increase with gene age. We also observed an increase in heritability of gene expression levels with age, implying a reduction of the environmental component. In summary, we show that gene age shapes key gene properties during evolution and is therefore an important component of genome function.

Research paper thumbnail of Genetic and Epigenetic Regulation of Human lincRNA Gene Expression

The American Journal of Human Genetics, 2013

Large intergenic noncoding RNAs (lincRNAs) are still poorly functionally characterized. We analyz... more Large intergenic noncoding RNAs (lincRNAs) are still poorly functionally characterized. We analyzed the genetic and epigenetic regulation of human lincRNA expression in the GenCord collection by using three cell types from 195 unrelated European individuals. We detected a considerable number of cis expression quantitative trait loci (cis-eQTLs) and demonstrated that the genetic regulation of lincRNA expression is independent of the regulation of neighboring protein-coding genes. lincRNAs have relatively more cis-eQTLs than do equally expressed protein-coding genes with the same exon number. lincRNA cis-eQTLs are located closer to transcription start sites (TSSs) and their effect sizes are higher than cis-eQTLs found for protein-coding genes, suggesting that lincRNA expression levels are less constrained than that of protein-coding genes. Additionally, lincRNA cis-eQTLs can influence the expression level of nearby protein-coding genes and thus could be considered as QTLs for enhancer activity. Enrichment of expressed lincRNA promoters in enhancer marks provides an additional argument for the involvement of lincRNAs in the regulation of transcription in cis. By investigating the epigenetic regulation of lincRNAs, we observed both positive and negative correlations between DNA methylation and gene expression (expression quantitative trait methylation [eQTMs]), as expected, and found that the landscapes of passive and active roles of DNA methylation in gene regulation are similar to protein-coding genes. However, lincRNA eQTMs are located closer to TSSs than are protein-coding gene eQTMs. These similarities and differences in genetic and epigenetic regulation between lincRNAs and protein-coding genes contribute to the elucidation of potential functions of lincRNAs.

Research paper thumbnail of Slightly deleterious genomic variants and transcriptome perturbations in Down syndrome embryonic selection

J a n u a r y 2 0 1 8 V o l u m e 2 8 N u m b e r 1 C o l d S p r i n g H a r b o r L a b o r a t... more J a n u a r y 2 0 1 8 V o l u m e 2 8 N u m b e r 1 C o l d S p r i n g H a r b o r L a b o r a t o r y P r e s s R esearch G e n o m e From the cover:

Research paper thumbnail of A genetic component of extinction risk in mammals

Genetic factors may play an important role in species extinction but their actual effect remains ... more Genetic factors may play an important role in species extinction but their actual effect remains poorly understood, particularly because of a strong and potentially masking effect expected from ecological traits. We investigated the role of genetics in mammal extinction taking both ecological and genetic factors into account. As a proxy for the role of genetics we used the ratio of the rates of nonsynonymous (amino acid changing) to synonymous (leaving the amino acid unchanged) nucleotide substitutions, K a / K s. Because most nonsynonymous substitutions are likely to be slightly deleterious and thus selected against, this ratio is a measure of the inefficiency of selection: if large (but less than 1), it implies a low efficiency of selection against nonsynonymous mutations. As a result, nonsynonymous mutations may accumulate and thus contribute to extinction. As a proxy for the role of ecology we used body mass W, with which most extinction-related ecological traits strongly correlate. As a measure of extinction risk we used species' affiliation with the five levels of extinction threat according to the IUCN Red List of Threatened Species. We calculated K a / K s for mitochondrial protein-coding genes of 211 mammalian species, each of which was characterized by body mass and the level of threat. Using logistic regression analysis, we then constructed a set of logistic regression models of extinction risk on ln(K a / K s) and lnW. We found that K a / K s and body mass are responsible for a 38% and a 62% increase in extinction risk, respectively. Given that the standard error of these values is 13%, the contribution of genetic factors to extinction risk in mammals is estimated to be one-quarter to one-half of the total of ecological and genetic effects. We conclude that the effect of genetics on extinction is significant, though it is almost certainly smaller than the effect of ecological traits.

Research paper thumbnail of Life-history traits drive the evolutionary rates of mammalian coding and noncoding genomic elements

Proceedings of the National Academy of Sciences, 2007

A comprehensive phylogenetic framework is indispensable for investigating the evolution of genomi... more A comprehensive phylogenetic framework is indispensable for investigating the evolution of genomic features in mammals as a whole, and particularly in humans. Using the ENCODE sequence data, we estimated mammalian neutral evolutionary rates and selective pressures acting on conserved coding and noncoding elements. We show that neutral evolutionary rates can be explained by the generation time (GT) hypothesis. Accordingly, primates (especially humans), having longer GTs than other mammals, display slower rates of neutral evolution. The evolution of constrained elements, particularly of nonsynonymous sites, is in agreement with the expectations of the nearly neutral theory of molecular evolution. We show that rates of nonsynonymous substitutions (dN) depend on the population size of a species. The results are robust to the exclusion of hypermutable CpG prone sites. The average rate of evolution in conserved noncoding sequences (CNCs) is 1.7 times higher than in nonsynonymous sites. Despite this, CNCs evolve at similar or even lower rates than nonsynonymous sites in the majority of basal branches of the eutherian tree. This observation could be the result of an overall gradual or, alternatively, lineage-specific relaxation of CNCs. The latter hypothesis was supported by the finding that 3 of the 20 longest CNCs displayed significant relaxation of individual branches. This observation may explain why the evolution of CNCs fits the expectations of the nearly neutral theory less well than the evolution of nonsynonymous sites.

Research paper thumbnail of Purifying Selection in Mammalian Mitochondrial Protein-Coding Genes Is Highly Effective and Congruent with Evolution of Nuclear Genes

Molecular Biology and Evolution, 2013

Cover Art Caption: Two mammalian phylogenetic trees (mitochondrial on the left; nuclear on the ri... more Cover Art Caption: Two mammalian phylogenetic trees (mitochondrial on the left; nuclear on the right). The mitochondrial tree is characterized by longer (higher rate of molecular evolution) and thinner (lower effective population size) branches as compared to the nuclear tree. The animals on the mitochondrial tree are single, refl ecting the asexual nature of the mitochondrial genome, and are more fi t, indicating highly effective elimination of slightly-deleterious mutations. Hand drawing by Leila Mamirova; artwork provided by

Research paper thumbnail of Accumulation of slightly deleterious mutations in mitochondrial protein-coding genes of large versus small mammals

Proceedings of the National Academy of Sciences of the United States of America, 2007

After the effective size of a population, N(e), declines, some slightly deleterious amino acid re... more After the effective size of a population, N(e), declines, some slightly deleterious amino acid replacements which were initially suppressed by purifying selection become effectively neutral and can reach fixation. Here we investigate this phenomenon for a set of all 13 mitochondrial protein-coding genes from 110 mammalian species. By using body mass as a proxy for N(e), we show that large mammals (i.e., those with low N(e)) as compared with small ones (in our sample these are, on average, 369.5 kg and 275 g, respectively) have a 43% higher rate of accumulation of nonsynonymous nucleotide substitutions relative to synonymous substitutions, and an 8-40% higher rate of accumulation of radical amino acid substitutions relative to conservative substitutions, depending on the type of amino acid classification. These higher rates result in a 6% greater amino acid dissimilarity between modern species and their most recent reconstructed ancestors in large versus small mammals. Because nonsyn...

Research paper thumbnail of A manually curated database of tetrapod mitochondrially encoded tRNA sequences and secondary structures

BMC Bioinformatics, 2007

Background: Mitochondrial tRNAs have been the subject of study for structural biologists interest... more Background: Mitochondrial tRNAs have been the subject of study for structural biologists interested in their secondary structure characteristics, evolutionary biologists have researched patterns of compensatory and structural evolution and medical studies have been directed towards understanding the basis of human disease. However, an up to date, manually curated database of mitochondrially encoded tRNAs from higher animals is currently not available.

Research paper thumbnail of Purifying selection in mitochondria, free-living and obligate intracellular proteobacteria

BMC evolutionary biology, 2007

The effectiveness of elimination of slightly deleterious mutations depends mainly on drift and re... more The effectiveness of elimination of slightly deleterious mutations depends mainly on drift and recombination frequency. Here we analyze the influence of these two factors on the strength of the purifying selection in mitochondrial and proteobacterial orthologous genes taking into account the differences in the organism lifestyles. (I) We found that the probability of fixation of nonsynonymous substitutions (Kn/Ks) in mitochondria is significantly lower compared to obligate intracellular bacteria and even marginally significantly lower compared to free-living bacteria. The comparison of bacteria of different lifestyles demonstrates more effective elimination of slightly deleterious mutations in (II) free-living bacteria as compared to obligate intracellular species and in (III) obligate intracellular parasites as compared to obligate intracellular symbionts. (IV) Finally, we observed that the level of the purifying selection (i.e. 1-Kn/Ks) increases with the density of mobile element...

Research paper thumbnail of Cells

Mitochondrial DNA (mtDNA) encodes core subunits of oxidative phosphorylation complexes and, as a ... more Mitochondrial DNA (mtDNA) encodes core subunits of oxidative phosphorylation complexes and, as a result of intricate regulatory crosstalk between nuclear and mitochondrial genomes, the total number of mtDNA copies fits the requirements of each cell type. Deviations from the physiological number of mtDNA copies are expected to be deleterious and might cause some inherited diseases and normal ageing. We studied 46 obese patients with type 2 diabetes (T2DM) one year after a laparoscopic sleeve gastrectomy (LSG) and Roux-en-Y gastric bypass (RYGB). The results were compared with normal-weight patients without T2DM (control group 1) (body mass index (BMI) = 22.5 ± 3.01 kg/m2) and patients with obesity without T2DM (control group 2) (BMI = 36 ± 3.45 kg/m2). We detected an increase of mtDNA copy number in the cells of the buffy coat obtained from peripheral blood, sampled one year after bariatric surgery. We also found that average mtDNA copy number as well as its dynamics (before and after the surgery) are gender-specific. To the best of our knowledge, this is the first evidence for the restoration of mtDNA copy number in obese patients after LSG and RYGB.

Research paper thumbnail of Deleterious in late life mitochondrial alleles and aging: secrets of Japanese centenarians

biorxiv, 2018

Aging is associated with accumulation of somatic mutations. This process is especially pronounced... more Aging is associated with accumulation of somatic mutations. This process is especially pronounced in
mitochondrial genomes of postmitotic cells, which accumulate large-scale somatic mitochondrial deletions with time,
leading to neurodegeneration, muscular dystrophy and aging. Slowing down the rate of origin of these somatic deletions
may benefit human lifespan and healthy aging. The main factors determining breakpoints of somatic mitochondrial
deletions are direct nucleotide repeats, which might be considered as Deleterious In Late Life (DILL) alleles.
Correspondingly, the decreased amount of these DILL alleles might lead to low production of somatic deletions and
increased lifespan. Intriguingly, in the Japanese D4a haplogroup, which is famous for an excess of centenarians and
supercentenarians, we found that the longest direct repeat (“common repeat”) in the human mitochondrial genome has
been disrupted by a point synonymous mutation. Thus we hypothesize that the disruption of the common repeat annuls
common deletion (which is the most frequent among all somatic deletions) and at least partially may contribute to the
extreme longevity of the D4a Japanese haplogroup. Here, to better understand the mitochondrial components of
longevity and potential causative links between repeats, deletions and longevity we discuss molecular, population and
evolutionary factors affecting dynamics of mitochondrial direct repeats.

Research paper thumbnail of Mitochondrial mutational spectrum provides an universal marker of cellular and organismal longevity

It has been shown recently that mitochondrial (mtDNA) somatic variants are numerous enough to tra... more It has been shown recently that mitochondrial (mtDNA) somatic variants are numerous enough to trace cellular lineages in our body. Here we extend this statement and demonstrate that mtDNA variants can be interpreted not only as neutral markers of cell divisions but the relative frequency of different mtDNA substitutions (i.e. mtDNA mutational spectrum) can inform us about important biological properties such as cell longevity. Analysing 7611 somatic mtDNA mutations from 37 types of human cancers and more than 2000 somatic mtDNA mutations from 25 healthy human tissues we observed that mtDNA mutational spectrum is associated with cell turnover rate: the ratio of T>C to G>A is increasing with cell longevity. To extend this logic we considered that, if universal, the discovered mutation bias may drive the differences in mtDNA mutational spectrum between mammalian species with short- (‘mice’) and long- (‘elephants’) lived oocytes. Based on presumably neutral polymorphisms in MT-CYB we reconstructed mutational spectra for 424 mammalian species and obtained that the fraction of T>C positively correlated with the species-specific generation length, which is a good proxy for oocyte longevity. Next, comparing complete mitochondrial genomes of 650 mammalian species we confirmed that exactly the same process shapes the nucleotide content of the most neutral sites in the whole mitochondrial genomes of short- (high T, low C) versus long- (low T, high C) lived mammals. Altogether analysing mtDNA mutations in time interval from dozens of years (somatic mutations) through the hundreds of thousands of years (within species polymorphisms) to millions of years (between species substitutions) we demonstrated that T>C/G>A positively correlates with cellular and organismal longevity. We hypothesize that the discovered mtDNA signature presents a chemical damage which is associated with the level of oxidative metabolism which, in turn, correlates with cellular and organismal longevity. The described properties of mtDNA mutational spectrum shed light on mtDNA replication, mtDNA evolution of mammals and can be used as a marker of cell longevity in single-cell analyses of heterogeneous samples.

Research paper thumbnail of Mitochondrial DNA, direct repeats, deletions, and centenarians: An unfinished puzzle

Biochimica Et Biophysica Acta-bioenergetics, 2010

Research paper thumbnail of Repeats longevity and the sources of mt DNA deletions

The repetitive DNA elements called CRISPRs and their associated genes: evidence of horizontal tra... more The repetitive DNA elements called CRISPRs and their associated genes: evidence of horizontal transfer among prokaryotes. J. Mol. Evol. 62, 718-729 0168-9525/$ -see front matter ?

Research paper thumbnail of Nucleotide repeats in mitochondrial genome determine human lifespan

Direct nucleotide repeats can facilitate deletions of segments of mitochondrial genome 1 , leadin... more Direct nucleotide repeats can facilitate deletions of segments of mitochondrial genome 1 , leading to a wide range of neuromuscular disorders 1,2 as well as aging 2,3 in humans. We hypothesized that the number of the direct perfect repeats in human mitochondrial genomes influences longevity through the formation of harmful mtDNA deletions in the somatic cells. The analysis of the complete mitochondrial genomes of 762 unrelated Japanese individuals 4-6 reveals a negative correlation between the abundance of the direct perfect repeats and the expected longevity. This association is largely due to the disruption of the common repeat (8470,13447) by a point mutation 8473C which occurred at the origin of the D4a haplogroup characterized by extreme longevity in Japan 7 . Our results provide the first evidence for correlation between the number of nucleotide repeats and the lifespan on intraspecific level.

Research paper thumbnail of Mitochondrial DNA, direct repeats, deletions, and centenarians: An unfinished puzzle

Biochimica et Biophysica Acta (BBA) - Bioenergetics, 2010

Research paper thumbnail of When man got his mtDNA deletions

Research paper thumbnail of Mitochondrial DNA Mutations and Cancer

The American Journal of Pathology, 2014

Research paper thumbnail of Roles of mitochondrial dynamics under stressful and normal conditions in yeast cells

Oxidative medicine and cellular longevity, 2013

Eukaryotic cells contain dynamic mitochondrial filaments: they fuse and divide. Here we summarize... more Eukaryotic cells contain dynamic mitochondrial filaments: they fuse and divide. Here we summarize data on the protein machinery driving mitochondrial dynamics in yeast and also discuss the factors that affect the fusion-fission balance. Fission is a general stress response of cells, and in the case of yeast this response appears to be prosurvival. At the same time, even under normal conditions yeast mitochondria undergo continuous cycles of fusion and fission. This seems to be a futile cycle and also expensive from the energy point of view. Why does it exist? Benefits might be the same as in the case of sexual reproduction. Indeed, mixing and separating of mitochondrial content allows mitochondrial DNA to segregate and recombine randomly, leading to high variation in the numbers of mutations per individual mitochondrion. This opens a possibility for effective purifying selection-elimination of mitochondria highly contaminated by deleterious mutations. The beneficial action presumes ...

Research paper thumbnail of Mitochondrial DNA Mutations and Cancer: Lessons from the Parathyroid

The American Journal of Pathology, Sep 18, 2014

Research paper thumbnail of Gene Age Predicts the Strength of Purifying Selection Acting on Gene Expression Variation in Humans

The American Journal of Human Genetics, 2014

Gene expression levels can be subject to selection. We hypothesized that the age of gene origin i... more Gene expression levels can be subject to selection. We hypothesized that the age of gene origin is associated with expression constraints, given that it affects the level of gene integration into the functional cellular environment. By studying the genetic variation affecting gene expression levels (cis expression quantitative trait loci [cis-eQTLs]) and protein levels (cis protein QTLs [cis-pQTLs]), we determined that young, primate-specific genes are enriched in cis-eQTLs and cis-pQTLs. Compared to cis-eQTLs of old genes originating before the zebrafish divergence, cis-eQTLs of young genes have a higher effect size, are located closer to the transcription start site, are more significant, and tend to influence genes in multiple tissues and populations. These results suggest that the expression constraint of each gene increases throughout its lifespan. We also detected a positive correlation between expression constraints (approximated by cis-eQTL properties) and coding constraints (approximated by Ka/Ks) and observed that this correlation might be driven by gene age. To uncover factors associated with the increase in gene-age-related expression constraints, we demonstrated that gene connectivity, gene involvement in complex regulatory networks, gene haploinsufficiency, and the strength of posttranscriptional regulation increase with gene age. We also observed an increase in heritability of gene expression levels with age, implying a reduction of the environmental component. In summary, we show that gene age shapes key gene properties during evolution and is therefore an important component of genome function.

Research paper thumbnail of Genetic and Epigenetic Regulation of Human lincRNA Gene Expression

The American Journal of Human Genetics, 2013

Large intergenic noncoding RNAs (lincRNAs) are still poorly functionally characterized. We analyz... more Large intergenic noncoding RNAs (lincRNAs) are still poorly functionally characterized. We analyzed the genetic and epigenetic regulation of human lincRNA expression in the GenCord collection by using three cell types from 195 unrelated European individuals. We detected a considerable number of cis expression quantitative trait loci (cis-eQTLs) and demonstrated that the genetic regulation of lincRNA expression is independent of the regulation of neighboring protein-coding genes. lincRNAs have relatively more cis-eQTLs than do equally expressed protein-coding genes with the same exon number. lincRNA cis-eQTLs are located closer to transcription start sites (TSSs) and their effect sizes are higher than cis-eQTLs found for protein-coding genes, suggesting that lincRNA expression levels are less constrained than that of protein-coding genes. Additionally, lincRNA cis-eQTLs can influence the expression level of nearby protein-coding genes and thus could be considered as QTLs for enhancer activity. Enrichment of expressed lincRNA promoters in enhancer marks provides an additional argument for the involvement of lincRNAs in the regulation of transcription in cis. By investigating the epigenetic regulation of lincRNAs, we observed both positive and negative correlations between DNA methylation and gene expression (expression quantitative trait methylation [eQTMs]), as expected, and found that the landscapes of passive and active roles of DNA methylation in gene regulation are similar to protein-coding genes. However, lincRNA eQTMs are located closer to TSSs than are protein-coding gene eQTMs. These similarities and differences in genetic and epigenetic regulation between lincRNAs and protein-coding genes contribute to the elucidation of potential functions of lincRNAs.

Research paper thumbnail of Biased allelic expression in human primary fibroblast single cells

American journal of human genetics, Jan 8, 2015

The study of gene expression in mammalian single cells via genomic technologies now provides the ... more The study of gene expression in mammalian single cells via genomic technologies now provides the possibility to investigate the patterns of allelic gene expression. We used single-cell RNA sequencing to detect the allele-specific mRNA level in 203 single human primary fibroblasts over 133,633 unique heterozygous single-nucleotide variants (hetSNVs). We observed that at the snapshot of analyses, each cell contained mostly transcripts from one allele from the majority of genes; indeed, 76.4% of the hetSNVs displayed stochastic monoallelic expression in single cells. Remarkably, adjacent hetSNVs exhibited a haplotype-consistent allelic ratio; in contrast, distant sites located in two different genes were independent of the haplotype structure. Moreover, the allele-specific expression in single cells correlated with the abundance of the cellular transcript. We observed that genes expressing both alleles in the majority of the single cells at a given time point were rare and enriched wit...

Research paper thumbnail of Transcriptome-wide decreased variation in gene expression of Down syndrome fibroblasts: Selection or canalization?

Down syndrome is characterized on one hand by extensive phenotypic variability, and on the other ... more Down syndrome is characterized on one hand by extensive phenotypic variability, and on the other by a recognizable dysmorphic syndrome. Here we compare whole transcriptomes by RNA-Seq of primary fibroblasts from 8 Down Syndrome (DS) and 8 Normal (N) unrelated individuals age and sex matched in order to study variation in gene expression. Gene by gene comparison of the whole transcriptomes revealed a remarkable decreased variance in gene expression level in DS versus N (median of the distribution of ratios of variances in DS to N is 0.64, which is less than the expected 1; P value < 1*10-16, Wilcoxon signed-rank test). The decreased variance in DS could not be explained by differences in mean expression levels between DS and N. This effect is present for genes on each individual chromosome (except chromosome 21) and for different sets of genes studied. The effect is also robust to the number of passages of fibroblasts. The effect is stronger in young (<2 years) versus old (>...

Research paper thumbnail of Domains of genome-wide gene expression dysregulation in Down’s syndrome

Nature, 2014

Trisomy 21 is the most frequent genetic cause of cognitive impairment. To assess the perturbation... more Trisomy 21 is the most frequent genetic cause of cognitive impairment. To assess the perturbations of gene expression in trisomy 21, and to eliminate the noise of genomic variability, we studied the transcriptome of fetal fibroblasts from a pair of monozygotic twins discordant for trisomy 21. Here we show that the differential expression between the twins is organized in domains along all chromosomes that are either upregulated or downregulated. These gene expression dysregulation domains (GEDDs) can be defined by the expression level of their gene content, and are well conserved in induced pluripotent stem cells derived from the twins' fibroblasts. Comparison of the transcriptome of the Ts65Dn mouse model of Down's syndrome and normal littermate mouse fibroblasts = also showed GEDDs along the mouse chromosomes that were syntenic in human. The GEDDs correlate with the lamina-associated (LADs) and replication domains of mammalian cells. The overall position of LADs was not altered in trisomic cells; however, the H3K4me3 profile of the trisomic fibroblasts was modified and accurately followed the GEDD pattern. These results indicate that the nuclear compartments of trisomic cells undergo modifications of the chromatin environment influencing the overall transcriptome, and that GEDDs may therefore contribute to some trisomy 21 phenotypes.

Research paper thumbnail of The complex SNP and CNV genetic architecture of the increased risk of congenital heart defects in Down syndrome

Genome Research, 2013

Congenital heart defect (CHD) occurs in 40% of Down syndrome (DS) cases. While carrying three cop... more Congenital heart defect (CHD) occurs in 40% of Down syndrome (DS) cases. While carrying three copies of chromosome 21 increases the risk for CHD, trisomy 21 itself is not sufficient to cause CHD. Thus, additional genetic variation and/or environmental factors could contribute to the CHD risk. Here we report genomic variations that in concert with trisomy 21, determine the risk for CHD in DS. This case-control GWAS includes 187 DS with CHD (AVSD = 69, ASD = 53, VSD = 65) as cases, and 151 DS without CHD as controls. Chromosome 21-specific association studies revealed rs2832616 and rs1943950 as CHD risk alleles (adjusted genotypic P-values &amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;0.05). These signals were confirmed in a replication cohort of 92 DS-CHD cases and 80 DS-without CHD (nominal P-value 0.0022). Furthermore, CNV analyses using a customized chromosome 21 aCGH of 135K probes in 55 DS-AVSD and 53 DS-without CHD revealed three CNV regions associated with AVSD risk (FDR ≤ 0.05). Two of these regions that are located within the previously identified CHD region on chromosome 21 were further confirmed in a replication study of 49 DS-AVSD and 45 DS- without CHD (FDR ≤ 0.05). One of these CNVs maps near the RIPK4 gene, and the second includes the ZBTB21 (previously ZNF295) gene, highlighting the potential role of these genes in the pathogenesis of CHD in DS. We propose that the genetic architecture of the CHD risk of DS is complex and includes trisomy 21, and SNP and CNV variations in chromosome 21. In addition, a yet-unidentified genetic variation in the rest of the genome may contribute to this complex genetic architecture.

Research paper thumbnail of DNA-Methylation Patterns in Trisomy 21 Using Cells from Monozygotic Twins

PLOS ONE, 2015

DNA methylation is essential in mammalian development. We have hypothesized that methylation diff... more DNA methylation is essential in mammalian development. We have hypothesized that methylation differences induced by trisomy 21 (T21) contribute to the phenotypic characteristics and heterogeneity in Down syndrome (DS). In order to determine the methylation differences in T21 without interference of the interindividual genomic variation, we have used fetal skin fibroblasts from monozygotic (MZ) twins discordant for T21. We also used skin fibroblasts from MZ twins concordant for T21, normal MZ twins without T21, and unrelated normal and T21 individuals. Reduced Representation Bisulfite Sequencing (RRBS) revealed 35 differentially methylated promoter regions (DMRs) (Absolute methylation differences = 25%, FDR &amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt; 0.001) in MZ twins discordant for T21 that have also been observed in comparison between unrelated normal and T21 individuals. The identified DMRs are enriched for genes involved in embryonic organ morphogenesis (FDR = 1.60 e -03) and include genes of the HOXB and HOXD clusters. These DMRs are maintained in iPS cells generated from this twin pair and are correlated with the gene expression changes. We have also observed an increase in DNA methylation level in the T21 methylome compared to the normal euploid methylome. This observation is concordant with the up regulation of DNA methyltransferase enzymes (DNMT3B and DNMT3L) and down regulation of DNA demethylation enzymes (TET2 and TET3) observed in the iPSC of the T21 versus normal twin. Altogether, the results of this study highlight the epigenetic effects of the extra chromosome 21 in T21 on loci outside of this chromosome that are relevant to DS associated phenotypes.

Research paper thumbnail of APOBEC-induced mutations in human cancers are strongly enriched on the lagging DNA strand during replication

Genome research, Jan 11, 2016

APOBEC3A and APOBEC3B, cytidine deaminases of the APOBEC family, are among the main factors causi... more APOBEC3A and APOBEC3B, cytidine deaminases of the APOBEC family, are among the main factors causing mutations in human cancers. APOBEC deaminates cytosines in single-stranded DNA (ssDNA). A fraction of the APOBEC-induced mutations occur as clusters ("kataegis") in single-stranded DNA produced during repair of double-stranded breaks (DSBs). However, the properties of the remaining 87% of nonclustered APOBEC-induced mutations, the source and the genomic distribution of the ssDNA where they occur, are largely unknown. By analyzing genomic and exomic cancer databases, we show that >33% of dispersed APOBEC-induced mutations occur on the lagging strand during DNA replication, thus unraveling the major source of ssDNA targeted by APOBEC in cancer. Although methylated cytosine is generally more mutation-prone than nonmethylated cytosine, we report that methylation reduces the rate of APOBEC-induced mutations by a factor of roughly two. Finally, we show that in cancers with exte...

Research paper thumbnail of Genomic analysis identifies new drivers and progression pathways in skin basal cell carcinoma

Nature genetics, Jan 7, 2016

Basal cell carcinoma (BCC) of the skin is the most common malignant neoplasm in humans. BCC is pr... more Basal cell carcinoma (BCC) of the skin is the most common malignant neoplasm in humans. BCC is primarily driven by the Sonic Hedgehog (Hh) pathway. However, its phenotypic variation remains unexplained. Our genetic profiling of 293 BCCs found the highest mutation rate in cancer (65 mutations/Mb). Eighty-five percent of the BCCs harbored mutations in Hh pathway genes (PTCH1, 73% or SMO, 20% (P = 6.6 × 10(-8)) and SUFU, 8%) and in TP53 (61%). However, 85% of the BCCs also harbored additional driver mutations in other cancer-related genes. We observed recurrent mutations in MYCN (30%), PPP6C (15%), STK19 (10%), LATS1 (8%), ERBB2 (4%), PIK3CA (2%), and NRAS, KRAS or HRAS (2%), and loss-of-function and deleterious missense mutations were present in PTPN14 (23%), RB1 (8%) and FBXW7 (5%). Consistent with the mutational profiles, N-Myc and Hippo-YAP pathway target genes were upregulated. Functional analysis of the mutations in MYCN, PTPN14 and LATS1 suggested their potential relevance in BC...

Research paper thumbnail of The 'Stem' and the 'Workers' of the mtDNA population of the cell. Evidence from mutational analysis

bioRxiv (Cold Spring Harbor Laboratory), Apr 15, 2023

Research paper thumbnail of Direction mutation pressure of SARS-CoV-2 helps to understand the past and predict the future evolution: C>U and G>U biased mutagenesis forces the majority of amino-acid substitutions to be from CG-rich losers to U-rich gainers

Evolution is a function of mutagenesis and selection. To analyse the role of mutagenesis on the s... more Evolution is a function of mutagenesis and selection. To analyse the role of mutagenesis on the structure of the SARS-CoV-2 genome, we reconstructed the mutational spectrum, which was highly C>U and G>U biased. This bias forces the SARS-CoV-2 genome to become increasingly U-rich unless selection cancels it. We analysed the consequences of this bias on the composition of the most neutral (four-fold degenerate synonymous substitutions) and the least neutral positions (nonsynonymous substitutions). The neutral nucleotide composition is already highly saturated by U and, according to our model, it is at equilibrium, suggesting that in the future, we don’t expect any more increase in U. However, nonsynonymous changes continue slowly evolve towards equilibrium substituting CG-rich amino-acids (“losers”) with U-rich ones (“gainers”). This process is universal for all genes of SARS-CoV-2 as well as for other coronaviridae species. In line with the direction mutation pressure hypothesi...

Research paper thumbnail of Secondary structure of the human mitochondrial genome affects formation of deletions

BMC Biology

Background Aging in postmitotic tissues is associated with clonal expansion of somatic mitochondr... more Background Aging in postmitotic tissues is associated with clonal expansion of somatic mitochondrial deletions, the origin of which is not well understood. Such deletions are often flanked by direct nucleotide repeats, but this alone does not fully explain their distribution. Here, we hypothesized that the close proximity of direct repeats on single-stranded mitochondrial DNA (mtDNA) might play a role in the formation of deletions. Results By analyzing human mtDNA deletions in the major arc of mtDNA, which is single-stranded during replication and is characterized by a high number of deletions, we found a non-uniform distribution with a “hot spot” where one deletion breakpoint occurred within the region of 6–9 kb and another within 13–16 kb of the mtDNA. This distribution was not explained by the presence of direct repeats, suggesting that other factors, such as the spatial proximity of these two regions, can be the cause. In silico analyses revealed that the single-stranded major a...

Research paper thumbnail of Improving genome quality through artificial truncating purifying selection using heat shock: case of carps

The process of domestication is associated with decrease in effective population size, which in t... more The process of domestication is associated with decrease in effective population size, which in turn leads to accumulation of slightly-deleterious mutations due to genetic drift. To maintain genome quality at a high level, we propose to use a stress-induced strong purifying selection, which based on negative epistasis, can effectively eliminate organisms with an excess of deleterious variants. Here, to identify stress factors, which interact with the effect of deleterious mutations we performed a proof-of-principle experiment with several regimes of a heat shock. We observed that fitness of mutated versus wild-type carp lines drops stronger after heat shock, which is a signature of a negative epistasis. Although the observed trend is promising, the effect of the epistasis is weak and unstable from family to family. Thus, more deep tuning of heat shock regimes is needed to uncover the most efficient combination of factors (absolute temperature, duration, stage of the embryo developme...

Research paper thumbnail of Buffering role of HSP shapes the molecular evolution of mammalian and human genomes at short and long-term scales

Cold Spring Harbor Laboratory - bioRxiv, Nov 14, 2022

Heat shock proteins in parallel with their main and originally discovered functionmaintenance of ... more Heat shock proteins in parallel with their main and originally discovered functionmaintenance of folded proteins under stressful conditions, can play also background buffering role-by folding proteins with an excess of slightly-deleterious nonsynonymous variants (SDNV). Here we tested several scenarios of this buffering role. On the comparative species scale, we demonstrated that low-Ne species are characterized by a higher expression level of hsp90 which can be explained by the excess of SDNV. On the comparative tissue level, we showed that long-lived tissues have also a higher hsp90 expression level, which can be advantageous to maintain the functionality of proteins. On the comparative gene level, we demonstrated that purifying selection of hsp90 in low-Ne-species did not relax as strongly as it happens for control genes, similar to hsp90. Additionally, we demonstrated that hsp clients versus non-clients are characterised by decreased level of selective constraints; demonstrate stronger relaxation of purifying selection in low-Ne species; have an excess of slightly-deleterious variants associated with complex disease phenotypes in humans; have an excess of pathological variants associated with clinical phenotypes in humans, suggesting that clients, being buffered by hsp90 can degenerate a bit more as compared to non-clients. Altogether, our results show that the secondary role of hsp, buffering of SDNV, is widespread and universal affecting properties of species, tissues and genes. A deep understanding of the buffering role of hsp90 will help to predict the deleterious effect of each variant in the human genome more precisely as well as will extend the application of the effectively-neutral theory of molecular evolution.

Research paper thumbnail of Molecular design of hypothalamus development

Nature, 2020

A wealth of specialized neuroendocrine command systems intercalated within the hypothalamus contr... more A wealth of specialized neuroendocrine command systems intercalated within the hypothalamus control the most fundamental physiological needs 1,2. Nevertheless, a developmental blueprint integrating molecular determinants of neuronal and glial diversity along temporal and spatial scales of hypothalamus development remains unresolved 3. Here, we combine single-cell RNA-seq on 51,199 cells of ectodermal origin, gene regulatory network (GRN) screens in conjunction with GWAS-based disease phenotyping and genetic lineage reconstruction to show that 9 glial and 33 neuronal subtypes are generated by mid-gestation under the control of distinct GRNs. Combinatorial molecular codes arising from neurotransmitters, neuropeptides and transcription Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:

Research paper thumbnail of A mitochondria-specific mutational signature of aging: increased rate of A>G substitutions on a heavy chain

The mutational spectrum of the mitochondrial DNA (mtDNA) does not resemble any of the known mutat... more The mutational spectrum of the mitochondrial DNA (mtDNA) does not resemble any of the known mutational signatures of the nuclear genome and variation in mtDNA mutational spectra between different organisms is still incomprehensible. Since mitochondria is tightly involved in aerobic energy production, it is expected that mtDNA mutational spectra is affected by the oxidative damage. Assuming that oxidative damage increases with age, we analyze mtDNA mutagenesis of different species. Analysing (i) dozens thousands of somatic mtDNA mutations in samples of different age (ii) 70053 polymorphic synonymous mtDNA substitutions, reconstructed in 424 mammalian species with different generation length and (iii) synonymous nucleotide content of 650 complete mitochondrial genomes of mammalian species we observed that the frequency of AH>GH substitutions (H - heavy chain notation) is twice higher in species with high versus low generation length making their mtDNA more AH poor and GH rich. Cons...

Research paper thumbnail of ImtRDB: a database and software for mitochondrial imperfect interspersed repeats annotation

Research paper thumbnail of Body size of Holopedium gibberum under the influence of fish predation in small subarctic lakes

SIL Proceedings, 1922-2010, 2002

Research paper thumbnail of Design logic of hypothalamus development mapped by single-cell RNA-seq

Supplementary files and Code for the article (Molecular design of hypothalamus development).<b... more Supplementary files and Code for the article (Molecular design of hypothalamus development).<br>Raw sequencing data and experimental design description were deposited in Gene Expression Omnibus (accession number: GSE132730). <br>Legends to Supplementary Files mentioned in the article<br>S1. Differentially expressed genes identified by MAST for clusters presented in Figure 1a and for the branching nodes in the dendrogram of Extended data 1a:"Data_and_Code\EDA_Integration_Annotation\DGE_and_Dend.xlsx".<br>S2.A list of genes that are specifically expressed in hypothalamic sub-regions (according to the Allen brain atlas database; www.brain-map.org) and used as positional marks:"Data_and_Code\EDA_Integration_Annotation\nuclei.xlsx".<br>S3. AUCell matrix showing the distribution of the activity of all regulons:"Data_and_Code\Developmental Regulons\auc_cell.tsv".<br>S4. A list of genes used for the manual annotation and expl...

Research paper thumbnail of COMMENTARY When man got his mtDNA deletions?

Somatic mtDNA mutations and deletions in particular are known to clonally expand within cells, ev... more Somatic mtDNA mutations and deletions in particular are known to clonally expand within cells, eventually reaching detrimental intracellular concentrations. The possibility that clonal expansion is a slow process taking a lifetime had prompted an idea that founder mutations of mutant clones that cause mitochondrial dysfunction in the aged tissue might have originated early in life. If, conversely, expansion was fast, founder mutations should predominantly originate later in life. This distinction is important: indeed, from which mutations should we protect ourselves – those of early development/childhood or those happening at old age? Recently, high-resolution data describing the distribution of mtDNA deletions have been obtained using a novel, highly efficient method (Taylor et al., 2014). These data have been interpreted as supporting predominantly early origin of founder mutations. Re-analysis of the data implies that the data actually

Research paper thumbnail of Bi-phasic dynamics of the mitochondrial DNA mutation m.3243A>G in blood: An unbiased, mutation level-dependent model implies positive selection in the germline

The A-to-G point mutation at position 3243 in the human mitochondrial genome (m.3243A>G) is th... more The A-to-G point mutation at position 3243 in the human mitochondrial genome (m.3243A>G) is the most common pathogenic mtDNA variant responsible for disease in humans. It is widely accepted that m.3243A>G levels decrease in blood with age, and correction representing ~2% annual decline is often applied to account for this change in mutation level. Here we report that recent data indicate the dynamics of m.3243A>G are far more complex and depend on the blood mutation level in a bi-phasic way. As a consequence, the traditional 2% correction, which is adequate ‘on average’, creates opposite predictive biases at high and low mutation levels. Thus, overall accuracy of traditional correction depends on the proportion of individuals with high and low mutant levels in the dataset. Unbiased age correction is needed to circumvent these drawbacks of the standard model. We propose to abolish both biases by using an approach where correction depends on mutation level in biphasic way, to...

Research paper thumbnail of Supplementary Material 1

Research paper thumbnail of Deleterious in late life mitochondrial alleles and aging: secrets of Japanese centenarians

Aging is associated with accumulation of somatic mutations. This process is especially pronounced... more Aging is associated with accumulation of somatic mutations. This process is especially pronounced in mitochondrial genomes of postmitotic cells, which accumulate large-scale somatic mitochondrial deletions with time, leading to neurodegeneration, muscular dystrophy and aging. Slowing down the rate of origin of these somatic deletions may benefit human lifespan and healthy aging. The main factors determining breakpoints of somatic mitochondrial deletions are direct nucleotide repeats, which might be considered as Deleterious In Late Life (DILL) alleles. Correspondingly, the decreased amount of these DILL alleles might lead to low production of somatic deletions and increased lifespan. Intriguingly, in the Japanese D4a haplogroup, which is famous for an excess of centenarians and supercentenarians, we found that the longest direct repeat ("common repeat") in the human mitochondrial genome has been disrupted by a point synonymous mutation. Thus we hypothesize that the disrupt...

Research paper thumbnail of Mammalian mitochondrial mutational spectrum as a hallmark of cellular and organismal aging

Mutational spectrum of the mitochondrial genome (mtDNA) does not resemble signatures of any known... more Mutational spectrum of the mitochondrial genome (mtDNA) does not resemble signatures of any known mutagens and variation in mtDNA mutational spectra between different tissues and organisms is still incomprehensible. Since mitochondria is tightly involved in aerobic energy production, it is expected that mtDNA mutational spectra may be affected by the oxidative damage which is increasing with cellular and organismal aging. However, the well-documented mutational signature of the oxidative damage, G>T substitutions, is typical only for the nuclear genome while it is extremely rare and age-independent in mtDNA. Thus it is still unclear if there is a mitochondria - specific mutational signature of the oxidative damage. Here, reconstructing mtDNA mutational spectra for human cancers originated from 21 tissues with various cell turnover rate, human oocytes fertilized at different ages, and 424 mammalian species with variable generation length which is a proxy for oocyte age, we observe...

Research paper thumbnail of Slightly deleterious genomic variants and transcriptome perturbations in Down syndrome embryonic selection

Genome research, Jan 13, 2017

The majority of aneuploid fetuses are spontaneously miscarried. Nevertheless, some aneuploid indi... more The majority of aneuploid fetuses are spontaneously miscarried. Nevertheless, some aneuploid individuals survive despite the strong genetic insult. Here, we investigate if the survival probability of aneuploid fetuses is affected by the genome-wide burden of slightly deleterious variants. We analyzed two cohorts of live-born Down syndrome individuals (388 genotyped samples and 16 fibroblast transcriptomes) and observed a deficit of slightly deleterious variants on Chromosome 21 and decreased transcriptome-wide variation in the expression level of highly constrained genes. We interpret these results as signatures of embryonic selection, and propose a genetic handicap model whereby an individual bearing an extremely severe deleterious variant (such as aneuploidy) could escape embryonic lethality if the genome-wide burden of slightly deleterious variants is sufficiently low. This approach can be used to study the composition and effect of the numerous slightly deleterious variants in h...

Research paper thumbnail of Repeats, longevity and the sources of mtDNA deletions: evidence from ‘deletional spectra’

Research paper thumbnail of When man got his mtDNA deletions?

Aging cell, 2014

Somatic mtDNA mutations and deletions in particular are known to clonally expand within cells, ev... more Somatic mtDNA mutations and deletions in particular are known to clonally expand within cells, eventually reaching detrimental intracellular concentrations. The possibility that clonal expansion is a slow process taking a lifetime had prompted an idea that founder mutations of mutant clones that cause mitochondrial dysfunction in the aged tissue might have originated early in life. If, conversely, expansion was fast, founder mutations should predominantly originate later in life. This distinction is important: indeed, from which mutations should we protect ourselves - those of early development/childhood or those happening at old age? Recently, high-resolution data describing the distribution of mtDNA deletions have been obtained using a novel, highly efficient method (Taylor et al., ). These data have been interpreted as supporting predominantly early origin of founder mutations. Re-analysis of the data implies that the data actually better fit mostly late origin of founders, altho...

Research paper thumbnail of Increased prevalence of clonal hematopoiesis of indeterminate potential amongst people living with HIV

Scientific Reports

People living with human immunodeficiency virus (PLWH) have significantly increased risk for card... more People living with human immunodeficiency virus (PLWH) have significantly increased risk for cardiovascular disease in part due to inflammation and immune dysregulation. Clonal hematopoiesis of indeterminate potential (CHIP), the age-related acquisition and expansion of hematopoietic stem cells due to leukemogenic driver mutations, increases risk for both hematologic malignancy and coronary artery disease (CAD). Since increased inflammation is hypothesized to be both a cause and consequence of CHIP, we hypothesized that PLWH have a greater prevalence of CHIP. We searched for CHIP in multi-ethnic cases from the Swiss HIV Cohort Study (SHCS, n = 600) and controls from the Atherosclerosis Risk in the Communities study (ARIC, n = 8111) from blood DNA-derived exome sequences. We observed that HIV is associated with a twofold increase in CHIP prevalence, both in the whole study population and in a subset of 230 cases and 1002 matched controls selected by propensity matching to control for...

Research paper thumbnail of Author response