Marcelo Boareto - Academia.edu (original) (raw)

Uploads

Papers by Marcelo Boareto

Research paper thumbnail of Jagged mediates differences in normal and tumor angiogenesis by affecting tip-stalk fate decision

Angiogenesis is critical during development, wound repair, and cancer progression. During angioge... more Angiogenesis is critical during development, wound repair, and cancer progression. During angiogenesis, some endothelial cells adopt a tip phenotype to lead the formation of new branching vessels; the trailing stalk cells proliferate to develop the vessel. Notch and VEGF signaling mediate the selection of these tip endothelial cells. However, how Jagged, a Notch ligand that is overexpressed in cancer, affects angiogenesis remains elusive. Here, by developing a theoretical framework for Notch-Delta-Jagged-VEGF signaling, we found that higher production levels of Jagged destabilizes the tip and stalk cell fates and can give rise to a hybrid tip/stalk phenotype that leads to poorly perfused and chaotic angiogenesis, which is a hallmark of cancer. Consistently, the signaling interactions that restrict Notch-Jagged signaling, such as Fringe, cis-inhibition, and increased production of Delta, stabilize tip and stalk fates and limit the existence of hybrid tip/stalk phenotype. Our results underline how overexpression of Jagged can transform physiological angiogenesis into pathological one.

Research paper thumbnail of Jagged–Delta asymmetry in Notch signaling can give rise to a Sender/Receiver hybrid phenotype

Notch signaling pathway mediates cell-fate determination during embryonic development, wound heal... more Notch signaling pathway mediates cell-fate determination during embryonic development, wound healing, and tumorigenesis. This pathway is activated when the ligand Delta or the ligand Jagged of one cell interacts with the Notch receptor of its neighboring cell, releasing the Notch Intracellular Domain (NICD) that activates many downstream target genes. NICD affects ligand production asymmetrically––it represses Delta, but activates Jagged. Although the dynamical role of Notch–Jagged signaling remains elusive, it is widely recognized that Notch–Delta signaling behaves as an intercellular toggle switch, giving rise to two distinct fates that neighboring cells adopt––Sender (high ligand, low receptor) and Receiver (low ligand, high receptor). Here, we devise a specific theoretical framework that incorporates both Delta and Jagged in Notch signaling circuit to explore the functional role of Jagged in cell-fate determination. We find that the asymmetric effect of NICD renders the circuit to behave as a three-way switch, giving rise to an additional state––a hybrid Sender/Receiver (medium ligand, medium receptor). This phenotype allows neighboring cells to both send and receive signals, thereby attaining similar fates. We also show that due to the asymmetric effect of the glycosyltransferase Fringe, different outcomes are generated depending on which ligand is dominant: Delta-mediated signaling drives neighboring cells to have an opposite fate; Jagged-mediated signaling drives the cell to maintain a similar fate to that of its neighbor. We elucidate the role of Jagged in cell-fate determination and discuss its possible implications in understanding tumor–stroma cross-talk, which frequently entails Notch–Jagged communication.

Research paper thumbnail of Supervised Variational Relevance Learning, an analytic geometric feature selection with applications to omic data sets

We introduce Supervised Variational Relevance Learning (Suvrel), a variational method to determin... more We introduce Supervised Variational Relevance Learning (Suvrel), a variational method to determine metric tensors to define distance based similarity in pattern classification, inspired in relevance learning. The variational method is applied to a cost function that penalizes large intraclass distances and favors small interclass distances. We find analytically the metric tensor that minimizes the cost function. Preprocessing the patterns by doing linear transformations using the metric tensor yields a dataset which can be more efficiently classified. We test our methods using publicly available datasets, for some standard classifiers. Among these datasets, two were tested by the MAQC-II project and, even without the use of further preprocessing, our results improve on their performance.

Research paper thumbnail of t-Test at the Probe Level: An Alternative Method to Identify Statistically Significant Genes for Microarray Data

Microarray data analysis typically consists in identifying a list of differentially 1 expressed g... more Microarray data analysis typically consists in identifying a list of differentially 1 expressed genes (DEG), i.e., the genes that are differentially expressed between two 2 experimental conditions. Variance shrinkage methods have been considered a better choice 3 than the standard t-test for selecting the DEG because they correct the dependence of the 4 error with the expression level. This dependence is mainly caused by errors in background 5 correction which affects more severely genes with low expression values. Here, we propose 6 a new method for identifying the DEG that overcome this issue and does not require 7 background correction or variance shrinkage. Unlike current methods, our methodology is 8 easy to understand and to implement. It consists of applying the standard t-test directly on the 9 normalized intensity data, which is possible because the probe intensity is linear dependent 10 with the gene expression level and because the t-test is scale-and location-invariant. This 11 methodology considerably improves the sensitivity and robustness of the list of DEG when 12 compared to the t-test applied to preprocessed data and to the most widely used shrinkage 13 methods, SAM and LIMMA. Our approach is useful especially when the genes of interest 14 have small differences in expression and therefore get ignored by standard variance shrinkage 15 methods.

Research paper thumbnail of Coupling the modules of EMT and stemness: A tunable ‘stemness window’ model

Metastasis of carcinoma involves migration of tumor cells to distant organs and initiate secondar... more Metastasis of carcinoma involves migration of tumor cells to distant organs and initiate secondary tumors. Migration requires a complete or partial Epithelial-to-Mesenchymal Transition (EMT), and tumor-initiation requires cells possessing stemness. Epithelial cells (E) undergoing a complete EMT to become mesenchymal (M) have been suggested to be more likely to possess stemness. However, recent studies suggest that stemness can also be associated with cells undergoing a partial EMT (hybrid E/M phenotype). Therefore, the correlation between EMT and stemness remains elusive. Here, using a theoretical framework that couples the core EMT and stemness modules (miR-200/ZEB and LIN28/let-7), we demonstrate that the positioning of ‘stemness window’ on the ‘EMT axis’ need not be universal; rather it can be fine-tuned. Particularly, we present OVOL as an example of a modulating factor that, due to its coupling with miR-200/ZEB/LIN28/let-7 circuit, fine-tunes the EMT-stemness interplay. Coupling OVOL can inhibit the stemness likelihood of M and elevate that of the hybrid E/M (partial EMT) phenotype, thereby pulling the ‘stemness window’ away from the M end of ‘EMT axis’. Our results unify various apparently contradictory experimental findings regarding the interconnection between EMT and stemness, corroborate the emerging notion that partial EMT associates with stemness, and offer new testable predictions.

Research paper thumbnail of Implications of the Hybrid Epithelial/Mesenchymal Phenotype in Metastasis

Transitions between epithelial and mesenchymal phenotypes - the epithelial to -mesenchymal transi... more Transitions between epithelial and mesenchymal phenotypes - the epithelial to -mesenchymal transition (EMT) and its reverse the mesenchymal to epithelial transition (MET) - are hallmarks of cancer metastasis. While transitioning between the epithelial and mesenchymal phenotypes, cells can also attain a hybrid epithelial/mesenchymal (E/M) (i.e., partial or intermediate EMT) phenotype. Cells in this phenotype have mixed epithelial (e.g., adhesion) and mesenchymal (e.g., migration) properties, thereby allowing them to move collectively as clusters. If these clusters reach the bloodstream intact, they can give rise to clusters of circulating tumor cells (CTCs), as have often been seen experimentally. Here, we review the operating principles of the core regulatory network for EMT/MET that acts as a "three-way" switch giving rise to three distinct phenotypes - E, M and hybrid E/M - and present a theoretical framework that can elucidate the role of many other players in regulating epithelial plasticity. Furthermore, we highlight recent studies on partial EMT and its association with drug resistance and tumor-initiating potential; and discuss how cell-cell communication between cells in a partial EMT phenotype can enable the formation of clusters of CTCs. These clusters can be more apoptosis-resistant and have more tumor-initiating potential than singly moving CTCs with a wholly mesenchymal (complete EMT) phenotype. Also, more such clusters can be formed under inflammatory conditions that are often generated by various therapies. Finally, we discuss the multiple advantages that the partial EMT or hybrid E/M phenotype have as compared to a complete EMT phenotype and argue that these collectively migrating cells are the primary "bad actors" of metastasis.

Research paper thumbnail of OVOL guides the epithelial-hybrid-mesenchymal transition

Metastasis involves multiple cycles of Epithelial-to-Mesenchymal Transition (EMT) and its reverse... more Metastasis involves multiple cycles of Epithelial-to-Mesenchymal Transition (EMT) and its reverse-MET. Cells can also undergo partial transitions to attain a hybrid
epithelial/mesenchymal (E/M) phenotype that has maximum cellular plasticity and allows migration of Circulating Tumor Cells (CTCs) as a cluster. Hence, deciphering the molecular players helping to maintain the hybrid E/M phenotype may inform antimetastasis strategies. Here, we devised a mechanism-based mathematical model to couple the transcription factor OVOL with the core EMT regulatory network miR-200/ZEB that acts as a three-way switch between the E, E/M and M phenotypes. We show that OVOL can modulate cellular plasticity in multiple ways - restricting EMT, driving MET, expanding the existence of the hybrid E/M phenotype and turning both EMT and MET into two-step processes. Our theoretical framework explains the differences between the observed effects of OVOL in breast and prostate cancer, and provides a platform for investigating additional signals during metastasis.

Research paper thumbnail of Operating principles of Notch–Delta–Jagged module of cell–cell communication

Notch pathway is an evolutionarily conserved cell–cell communication mechanism governing cell-fat... more Notch pathway is an evolutionarily conserved cell–cell communication mechanism governing cell-fate during development and tumor progression. It is activated when Notch receptor of one cell binds to either of its ligand—Delta or Jagged—of another cell. Notch–Delta (ND) signaling forms a two-way switch, and two cells interacting viaNDsignaling adopt different fates—Sender (high ligand, low receptor) and Receiver (low ligand, high receptor). Notch–Delta–Jagged signaling (NDJ) behaves as a three-way switch and enables an additional fate—hybrid Sender/Receiver (S/R) (medium ligand, medium receptor). Here, by extending our framework of NDJ signaling for a two-cell system, we show that higher production rate of Jagged, but not that of Delta, expands the range of parameters for which both cells attain the hybrid S/R state. Conversely, glycosyltransferase Fringe and cis-inhibition reduces
this range of conditions, and reduces the relative stability of the hybrid S/R state, thereby promoting cell-fate divergence and consequently lateral inhibition-based patterns. Lastly, soluble Jagged drives the cells to attain the hybrid S/R state, and soluble Delta drives them to be Receivers. We also discuss the critical role of hybrid S/R state in promoting cancer metastasis by enabling collective cell migration and expanding cancer stem cell (CSC) population.

Research paper thumbnail of Are background correction and variance shrinkage necessary steps in microarray analysis?

Several types of background correction and variance shrinkage methods have been proposed. Shrinka... more Several types of background correction and variance shrinkage methods have been proposed. Shrinkage methods have been considered useful since they correct dependence of the error with the expression level. This dependence is mainly caused by errors in background correction which affects more severely genes with low expression values. We propose a simple and effective procedure where background correction and variance shrinkage are not necessary. The proposed method uses the normalized intensity data directly, which is possible due to the scale and location invariance of the t-variable. For the case of the standard t-test, we show that it improves the sensitivity and robustness of the Predictive Gene Lists (PGL) when compared to the most widely used shrinkage methods, SAM and LIMMA.

Research paper thumbnail of Multistability in Notch-Jagged-Delta signaling pathway

Research paper thumbnail of Relationship between global structural parameters and Enzyme Commission hierarchy: implications for function prediction

In protein databases there is a substantial number of proteins structurally determined but withou... more In protein databases there is a substantial number of proteins structurally determined but without function annotation. Understanding the relationship between function and structure can be useful to predict function on a large scale. We have analyzed the similarities in global physicochemical parameters for a set of enzymes which were classified according to the four Enzyme Commission (EC) hierarchical levels. Using relevance theory we introduced a distance between proteins in the space of physicochemical characteristics. This was done by minimizing a cost function of the metric tensor built to reflect the EC classification system. Using an unsupervised clustering method on a set of 1025 enzymes, we obtained no relevant clustering formation compatible with EC classification. The distance distributions between enzymes from the same EC group and from different EC groups were compared by histograms. Such analysis was also performed using sequence alignment similarity as a distance. Our results suggest that global structure parameters are not sufficient to segregate enzymes according to EC hierarchy. This indicates that features essential for function are rather local than global. Consequently, methods for predicting function based on global attributes should not obtain high accuracy in main EC classes prediction without relying on similarities between enzymes from training and validation datasets. Furthermore, these results are consistent with a substantial number of studies suggesting that function evolves fundamentally by recruitment, i.e., a same protein motif or fold can be used to perform different enzymatic functions and a few number of specific amino acids are actually responsible for enzyme activity. These essential amino acids should belong to active sites and an effective method for predicting function should be able to recognize them.

Research paper thumbnail of Jagged mediates differences in normal and tumor angiogenesis by affecting tip-stalk fate decision

Angiogenesis is critical during development, wound repair, and cancer progression. During angioge... more Angiogenesis is critical during development, wound repair, and cancer progression. During angiogenesis, some endothelial cells adopt a tip phenotype to lead the formation of new branching vessels; the trailing stalk cells proliferate to develop the vessel. Notch and VEGF signaling mediate the selection of these tip endothelial cells. However, how Jagged, a Notch ligand that is overexpressed in cancer, affects angiogenesis remains elusive. Here, by developing a theoretical framework for Notch-Delta-Jagged-VEGF signaling, we found that higher production levels of Jagged destabilizes the tip and stalk cell fates and can give rise to a hybrid tip/stalk phenotype that leads to poorly perfused and chaotic angiogenesis, which is a hallmark of cancer. Consistently, the signaling interactions that restrict Notch-Jagged signaling, such as Fringe, cis-inhibition, and increased production of Delta, stabilize tip and stalk fates and limit the existence of hybrid tip/stalk phenotype. Our results underline how overexpression of Jagged can transform physiological angiogenesis into pathological one.

Research paper thumbnail of Jagged–Delta asymmetry in Notch signaling can give rise to a Sender/Receiver hybrid phenotype

Notch signaling pathway mediates cell-fate determination during embryonic development, wound heal... more Notch signaling pathway mediates cell-fate determination during embryonic development, wound healing, and tumorigenesis. This pathway is activated when the ligand Delta or the ligand Jagged of one cell interacts with the Notch receptor of its neighboring cell, releasing the Notch Intracellular Domain (NICD) that activates many downstream target genes. NICD affects ligand production asymmetrically––it represses Delta, but activates Jagged. Although the dynamical role of Notch–Jagged signaling remains elusive, it is widely recognized that Notch–Delta signaling behaves as an intercellular toggle switch, giving rise to two distinct fates that neighboring cells adopt––Sender (high ligand, low receptor) and Receiver (low ligand, high receptor). Here, we devise a specific theoretical framework that incorporates both Delta and Jagged in Notch signaling circuit to explore the functional role of Jagged in cell-fate determination. We find that the asymmetric effect of NICD renders the circuit to behave as a three-way switch, giving rise to an additional state––a hybrid Sender/Receiver (medium ligand, medium receptor). This phenotype allows neighboring cells to both send and receive signals, thereby attaining similar fates. We also show that due to the asymmetric effect of the glycosyltransferase Fringe, different outcomes are generated depending on which ligand is dominant: Delta-mediated signaling drives neighboring cells to have an opposite fate; Jagged-mediated signaling drives the cell to maintain a similar fate to that of its neighbor. We elucidate the role of Jagged in cell-fate determination and discuss its possible implications in understanding tumor–stroma cross-talk, which frequently entails Notch–Jagged communication.

Research paper thumbnail of Supervised Variational Relevance Learning, an analytic geometric feature selection with applications to omic data sets

We introduce Supervised Variational Relevance Learning (Suvrel), a variational method to determin... more We introduce Supervised Variational Relevance Learning (Suvrel), a variational method to determine metric tensors to define distance based similarity in pattern classification, inspired in relevance learning. The variational method is applied to a cost function that penalizes large intraclass distances and favors small interclass distances. We find analytically the metric tensor that minimizes the cost function. Preprocessing the patterns by doing linear transformations using the metric tensor yields a dataset which can be more efficiently classified. We test our methods using publicly available datasets, for some standard classifiers. Among these datasets, two were tested by the MAQC-II project and, even without the use of further preprocessing, our results improve on their performance.

Research paper thumbnail of t-Test at the Probe Level: An Alternative Method to Identify Statistically Significant Genes for Microarray Data

Microarray data analysis typically consists in identifying a list of differentially 1 expressed g... more Microarray data analysis typically consists in identifying a list of differentially 1 expressed genes (DEG), i.e., the genes that are differentially expressed between two 2 experimental conditions. Variance shrinkage methods have been considered a better choice 3 than the standard t-test for selecting the DEG because they correct the dependence of the 4 error with the expression level. This dependence is mainly caused by errors in background 5 correction which affects more severely genes with low expression values. Here, we propose 6 a new method for identifying the DEG that overcome this issue and does not require 7 background correction or variance shrinkage. Unlike current methods, our methodology is 8 easy to understand and to implement. It consists of applying the standard t-test directly on the 9 normalized intensity data, which is possible because the probe intensity is linear dependent 10 with the gene expression level and because the t-test is scale-and location-invariant. This 11 methodology considerably improves the sensitivity and robustness of the list of DEG when 12 compared to the t-test applied to preprocessed data and to the most widely used shrinkage 13 methods, SAM and LIMMA. Our approach is useful especially when the genes of interest 14 have small differences in expression and therefore get ignored by standard variance shrinkage 15 methods.

Research paper thumbnail of Coupling the modules of EMT and stemness: A tunable ‘stemness window’ model

Metastasis of carcinoma involves migration of tumor cells to distant organs and initiate secondar... more Metastasis of carcinoma involves migration of tumor cells to distant organs and initiate secondary tumors. Migration requires a complete or partial Epithelial-to-Mesenchymal Transition (EMT), and tumor-initiation requires cells possessing stemness. Epithelial cells (E) undergoing a complete EMT to become mesenchymal (M) have been suggested to be more likely to possess stemness. However, recent studies suggest that stemness can also be associated with cells undergoing a partial EMT (hybrid E/M phenotype). Therefore, the correlation between EMT and stemness remains elusive. Here, using a theoretical framework that couples the core EMT and stemness modules (miR-200/ZEB and LIN28/let-7), we demonstrate that the positioning of ‘stemness window’ on the ‘EMT axis’ need not be universal; rather it can be fine-tuned. Particularly, we present OVOL as an example of a modulating factor that, due to its coupling with miR-200/ZEB/LIN28/let-7 circuit, fine-tunes the EMT-stemness interplay. Coupling OVOL can inhibit the stemness likelihood of M and elevate that of the hybrid E/M (partial EMT) phenotype, thereby pulling the ‘stemness window’ away from the M end of ‘EMT axis’. Our results unify various apparently contradictory experimental findings regarding the interconnection between EMT and stemness, corroborate the emerging notion that partial EMT associates with stemness, and offer new testable predictions.

Research paper thumbnail of Implications of the Hybrid Epithelial/Mesenchymal Phenotype in Metastasis

Transitions between epithelial and mesenchymal phenotypes - the epithelial to -mesenchymal transi... more Transitions between epithelial and mesenchymal phenotypes - the epithelial to -mesenchymal transition (EMT) and its reverse the mesenchymal to epithelial transition (MET) - are hallmarks of cancer metastasis. While transitioning between the epithelial and mesenchymal phenotypes, cells can also attain a hybrid epithelial/mesenchymal (E/M) (i.e., partial or intermediate EMT) phenotype. Cells in this phenotype have mixed epithelial (e.g., adhesion) and mesenchymal (e.g., migration) properties, thereby allowing them to move collectively as clusters. If these clusters reach the bloodstream intact, they can give rise to clusters of circulating tumor cells (CTCs), as have often been seen experimentally. Here, we review the operating principles of the core regulatory network for EMT/MET that acts as a "three-way" switch giving rise to three distinct phenotypes - E, M and hybrid E/M - and present a theoretical framework that can elucidate the role of many other players in regulating epithelial plasticity. Furthermore, we highlight recent studies on partial EMT and its association with drug resistance and tumor-initiating potential; and discuss how cell-cell communication between cells in a partial EMT phenotype can enable the formation of clusters of CTCs. These clusters can be more apoptosis-resistant and have more tumor-initiating potential than singly moving CTCs with a wholly mesenchymal (complete EMT) phenotype. Also, more such clusters can be formed under inflammatory conditions that are often generated by various therapies. Finally, we discuss the multiple advantages that the partial EMT or hybrid E/M phenotype have as compared to a complete EMT phenotype and argue that these collectively migrating cells are the primary "bad actors" of metastasis.

Research paper thumbnail of OVOL guides the epithelial-hybrid-mesenchymal transition

Metastasis involves multiple cycles of Epithelial-to-Mesenchymal Transition (EMT) and its reverse... more Metastasis involves multiple cycles of Epithelial-to-Mesenchymal Transition (EMT) and its reverse-MET. Cells can also undergo partial transitions to attain a hybrid
epithelial/mesenchymal (E/M) phenotype that has maximum cellular plasticity and allows migration of Circulating Tumor Cells (CTCs) as a cluster. Hence, deciphering the molecular players helping to maintain the hybrid E/M phenotype may inform antimetastasis strategies. Here, we devised a mechanism-based mathematical model to couple the transcription factor OVOL with the core EMT regulatory network miR-200/ZEB that acts as a three-way switch between the E, E/M and M phenotypes. We show that OVOL can modulate cellular plasticity in multiple ways - restricting EMT, driving MET, expanding the existence of the hybrid E/M phenotype and turning both EMT and MET into two-step processes. Our theoretical framework explains the differences between the observed effects of OVOL in breast and prostate cancer, and provides a platform for investigating additional signals during metastasis.

Research paper thumbnail of Operating principles of Notch–Delta–Jagged module of cell–cell communication

Notch pathway is an evolutionarily conserved cell–cell communication mechanism governing cell-fat... more Notch pathway is an evolutionarily conserved cell–cell communication mechanism governing cell-fate during development and tumor progression. It is activated when Notch receptor of one cell binds to either of its ligand—Delta or Jagged—of another cell. Notch–Delta (ND) signaling forms a two-way switch, and two cells interacting viaNDsignaling adopt different fates—Sender (high ligand, low receptor) and Receiver (low ligand, high receptor). Notch–Delta–Jagged signaling (NDJ) behaves as a three-way switch and enables an additional fate—hybrid Sender/Receiver (S/R) (medium ligand, medium receptor). Here, by extending our framework of NDJ signaling for a two-cell system, we show that higher production rate of Jagged, but not that of Delta, expands the range of parameters for which both cells attain the hybrid S/R state. Conversely, glycosyltransferase Fringe and cis-inhibition reduces
this range of conditions, and reduces the relative stability of the hybrid S/R state, thereby promoting cell-fate divergence and consequently lateral inhibition-based patterns. Lastly, soluble Jagged drives the cells to attain the hybrid S/R state, and soluble Delta drives them to be Receivers. We also discuss the critical role of hybrid S/R state in promoting cancer metastasis by enabling collective cell migration and expanding cancer stem cell (CSC) population.

Research paper thumbnail of Are background correction and variance shrinkage necessary steps in microarray analysis?

Several types of background correction and variance shrinkage methods have been proposed. Shrinka... more Several types of background correction and variance shrinkage methods have been proposed. Shrinkage methods have been considered useful since they correct dependence of the error with the expression level. This dependence is mainly caused by errors in background correction which affects more severely genes with low expression values. We propose a simple and effective procedure where background correction and variance shrinkage are not necessary. The proposed method uses the normalized intensity data directly, which is possible due to the scale and location invariance of the t-variable. For the case of the standard t-test, we show that it improves the sensitivity and robustness of the Predictive Gene Lists (PGL) when compared to the most widely used shrinkage methods, SAM and LIMMA.

Research paper thumbnail of Multistability in Notch-Jagged-Delta signaling pathway

Research paper thumbnail of Relationship between global structural parameters and Enzyme Commission hierarchy: implications for function prediction

In protein databases there is a substantial number of proteins structurally determined but withou... more In protein databases there is a substantial number of proteins structurally determined but without function annotation. Understanding the relationship between function and structure can be useful to predict function on a large scale. We have analyzed the similarities in global physicochemical parameters for a set of enzymes which were classified according to the four Enzyme Commission (EC) hierarchical levels. Using relevance theory we introduced a distance between proteins in the space of physicochemical characteristics. This was done by minimizing a cost function of the metric tensor built to reflect the EC classification system. Using an unsupervised clustering method on a set of 1025 enzymes, we obtained no relevant clustering formation compatible with EC classification. The distance distributions between enzymes from the same EC group and from different EC groups were compared by histograms. Such analysis was also performed using sequence alignment similarity as a distance. Our results suggest that global structure parameters are not sufficient to segregate enzymes according to EC hierarchy. This indicates that features essential for function are rather local than global. Consequently, methods for predicting function based on global attributes should not obtain high accuracy in main EC classes prediction without relying on similarities between enzymes from training and validation datasets. Furthermore, these results are consistent with a substantial number of studies suggesting that function evolves fundamentally by recruitment, i.e., a same protein motif or fold can be used to perform different enzymatic functions and a few number of specific amino acids are actually responsible for enzyme activity. These essential amino acids should belong to active sites and an effective method for predicting function should be able to recognize them.