Roberto Todeschini | Università degli Studi di Milano-Bicocca (original) (raw)
Uploads
Papers by Roberto Todeschini
Fresenius' journal of analytical chemistry, 1994
The role of multivariate analysis methods in evaluating, rationalizing, and working out complex e... more The role of multivariate analysis methods in evaluating, rationalizing, and working out complex environmental problems is discussed. The discussion is organized in two sections; a literature analysis of the application of chemometric methods to PCDD/PCDF data interpretation and source correlation and a review of the role of chemometric methods in analysing the results obtained by the Authors studying PCDD/PCDF formation and destruction mechanisms in MSW combustion processes.
Journal of Chemometrics, 2018
Toxicological & Environmental Chemistry, 2017
Frontiers in Chemistry, 2017
This work describes a novel approach based on advanced molecular similarity to predict the sweetn... more This work describes a novel approach based on advanced molecular similarity to predict the sweetness of chemicals. The proposed Quantitative Structure-Taste Relationship (QSTR) model is an expert system developed keeping in mind the five principles defined by the Organization for Economic Cooperation and Development (OECD) for the validation of (Q)SARs. The 649 sweet and non-sweet molecules were described by both conformation-independent extended-connectivity fingerprints (ECFPs) and molecular descriptors. In particular, the molecular similarity in the ECFPs space showed a clear association with molecular taste and it was exploited for model development. Molecules laying in the subspaces where the taste assignation was more difficult were modeled trough a consensus between linear and local approaches (Partial Least Squares-Discriminant Analysis and N-nearest-neighbor classifier). The expert system, which was thoroughly validated through a Monte Carlo procedure and an external set, gave satisfactory results in comparison with the state-of-the-art models. Moreover, the QSTR model can be leveraged into a greater understanding of the relationship between molecular structure and sweetness, and into the design of novel sweeteners.
This paper reports an analysis and comparison of the use of 51 different similarity coefficients ... more This paper reports an analysis and comparison of the use of 51 different similarity coefficients for computing the similarities between binary fingerprints for both simulated and real chemical data sets. Five pairs and a triplet of coefficients were found to yield identical similarity values, leading to the elimination of seven of the coefficients. The remaining 44 coefficients were then compared in two ways: by their theoretical characteristics using simple descriptive statistics, correlation analysis, multidimensional scaling, Hasse diagrams, and the recently described atemporal target diffusion model; and by their effectiveness for similarity-based virtual screening using MDDR, WOMBAT, and MUV data. The comparisons demonstrate the general utility of the well-known Tanimoto method but also suggest other coefficients that may be worthy of further attention.
In this paper, a new concept of similarity is introduced with the aim of detecting higher-order s... more In this paper, a new concept of similarity is introduced with the aim of detecting higher-order similarities among objects, and meta-distances and meta-similarities are derived from it. A total of 100 meta-distances were obtained from a set of ten classical distances and were compared, in terms of classification performances, against classical distance measures. Classification methods based on local similarity analysis and several benchmark datasets were used. In several cases, the non-error rate (NER) of classifiers based on the new meta-distances significantly increased with respect to that of the classical Euclidean distance.
properly validated through cross-validation and external test sets. The applicability domain of m... more properly validated through cross-validation and external test sets. The applicability domain of models was investigated, and the interpretation of the role of the molecular descriptors in classifying sweet and non-sweet tastes was evaluated. The classification and the performance of the models presented in this paper are simple but accurate. They are based on a relatively small number of descriptors and a straightforward classification approach. The results presented here indicate that the proposed models can be used to accurately select new compounds as potential sweetener candidates.
Organohalogen Compounds, 1999
Continental Shelf Research, 2003
Freshwater and marine ecosystems are exposed to various multi-component mixtures of pollutants. N... more Freshwater and marine ecosystems are exposed to various multi-component mixtures of pollutants. Nevertheless, most ecotoxicological research and chemicals regulation focus on hazard and exposure assessment of individual substances only, the problem of chemical mixtures in the environment is ignored to a large extent. In contrast, the assessment of combination effects has a long tradition in pharmacology, where mixtures of chemicals are specifically designed to develop new products, e.g. human and veterinary drugs or agricultural and non-agricultural pesticides. In this area, two concepts are frequently used and are thought to describe fundamental relationships between single substance and mixture effects: Independent Action (Response Addition) and Concentration Addition. The question, to what extent these concepts may also be applied in an ecotoxicological and regulatory context may be considered a research topic of major importance, as the concepts would allow to make use of already existing single substance toxicity data for the predictive assessment of mixture toxicities. Two critical knowledge gaps are identified: (a) There is a lack of environmental realism, as a huge part of our current knowledge about the applicability of the concepts is restricted to artificial situations with respect to mixture composition or biological effect assessment. (b) The knowledge on what exactly is needed for using the concepts as tools for the predictive mixture toxicity assessment is insufficient. Both gaps seriously hamper the necessary, scientifically sound consideration of mixture toxicities in a regulatory context.
Two novel classification methods, called N3 (N-nearest neighbors) and BNN (binned nearest neighbo... more Two novel classification methods, called N3 (N-nearest neighbors) and BNN (binned nearest neighbors), are proposed. Both methods are inspired by the principles of the K-nearest neighbors (KNN) method, being both based on object pairwise similarities. Their performance was evaluated in comparison with nine well-known classification methods. In order to obtain reliable statistics, several comparisons were performed using 32 different literature data sets, which differ for number of objects, variables and classes. Results highlighted that N3 on average behaves as the most efficient classification method with similar performance to support vector machine based on radial basis function kernel (SVM/RBF). The method BNN showed on average higher performance than the classical K-nearest neighbors method.
Zeitschrift für Naturforschung A, 2008
In 1971 McClelland obtained lower and upper bounds for the total π-electron energy. We now formul... more In 1971 McClelland obtained lower and upper bounds for the total π-electron energy. We now formulate the generalized version of these bounds, applicable to the energy-like expression
This paper reports an analysis and comparison of the use of 51 different similarity coefficients ... more This paper reports an analysis and comparison of the use of 51 different similarity coefficients for computing the similarities between binary fingerprints for both simulated and real chemical data sets. Five pairs and a triplet of coefficients were found to yield identical similarity values, leading to the elimination of seven of the coefficients. The remaining 44 coefficients were then compared in two ways: by their theoretical characteristics using simple descriptive statistics, correlation analysis, multidimensional scaling, Hasse diagrams, and the recently described atemporal target diffusion model; and by their effectiveness for similarity-based virtual screening using MDDR, WOMBAT, and MUV data. The comparisons demonstrate the general utility of the well-known Tanimoto method but also suggest other coefficients that may be worthy of further attention.
Fresenius' journal of analytical chemistry, 1994
The role of multivariate analysis methods in evaluating, rationalizing, and working out complex e... more The role of multivariate analysis methods in evaluating, rationalizing, and working out complex environmental problems is discussed. The discussion is organized in two sections; a literature analysis of the application of chemometric methods to PCDD/PCDF data interpretation and source correlation and a review of the role of chemometric methods in analysing the results obtained by the Authors studying PCDD/PCDF formation and destruction mechanisms in MSW combustion processes.
Journal of Chemometrics, 2018
Toxicological & Environmental Chemistry, 2017
Frontiers in Chemistry, 2017
This work describes a novel approach based on advanced molecular similarity to predict the sweetn... more This work describes a novel approach based on advanced molecular similarity to predict the sweetness of chemicals. The proposed Quantitative Structure-Taste Relationship (QSTR) model is an expert system developed keeping in mind the five principles defined by the Organization for Economic Cooperation and Development (OECD) for the validation of (Q)SARs. The 649 sweet and non-sweet molecules were described by both conformation-independent extended-connectivity fingerprints (ECFPs) and molecular descriptors. In particular, the molecular similarity in the ECFPs space showed a clear association with molecular taste and it was exploited for model development. Molecules laying in the subspaces where the taste assignation was more difficult were modeled trough a consensus between linear and local approaches (Partial Least Squares-Discriminant Analysis and N-nearest-neighbor classifier). The expert system, which was thoroughly validated through a Monte Carlo procedure and an external set, gave satisfactory results in comparison with the state-of-the-art models. Moreover, the QSTR model can be leveraged into a greater understanding of the relationship between molecular structure and sweetness, and into the design of novel sweeteners.
This paper reports an analysis and comparison of the use of 51 different similarity coefficients ... more This paper reports an analysis and comparison of the use of 51 different similarity coefficients for computing the similarities between binary fingerprints for both simulated and real chemical data sets. Five pairs and a triplet of coefficients were found to yield identical similarity values, leading to the elimination of seven of the coefficients. The remaining 44 coefficients were then compared in two ways: by their theoretical characteristics using simple descriptive statistics, correlation analysis, multidimensional scaling, Hasse diagrams, and the recently described atemporal target diffusion model; and by their effectiveness for similarity-based virtual screening using MDDR, WOMBAT, and MUV data. The comparisons demonstrate the general utility of the well-known Tanimoto method but also suggest other coefficients that may be worthy of further attention.
In this paper, a new concept of similarity is introduced with the aim of detecting higher-order s... more In this paper, a new concept of similarity is introduced with the aim of detecting higher-order similarities among objects, and meta-distances and meta-similarities are derived from it. A total of 100 meta-distances were obtained from a set of ten classical distances and were compared, in terms of classification performances, against classical distance measures. Classification methods based on local similarity analysis and several benchmark datasets were used. In several cases, the non-error rate (NER) of classifiers based on the new meta-distances significantly increased with respect to that of the classical Euclidean distance.
properly validated through cross-validation and external test sets. The applicability domain of m... more properly validated through cross-validation and external test sets. The applicability domain of models was investigated, and the interpretation of the role of the molecular descriptors in classifying sweet and non-sweet tastes was evaluated. The classification and the performance of the models presented in this paper are simple but accurate. They are based on a relatively small number of descriptors and a straightforward classification approach. The results presented here indicate that the proposed models can be used to accurately select new compounds as potential sweetener candidates.
Organohalogen Compounds, 1999
Continental Shelf Research, 2003
Freshwater and marine ecosystems are exposed to various multi-component mixtures of pollutants. N... more Freshwater and marine ecosystems are exposed to various multi-component mixtures of pollutants. Nevertheless, most ecotoxicological research and chemicals regulation focus on hazard and exposure assessment of individual substances only, the problem of chemical mixtures in the environment is ignored to a large extent. In contrast, the assessment of combination effects has a long tradition in pharmacology, where mixtures of chemicals are specifically designed to develop new products, e.g. human and veterinary drugs or agricultural and non-agricultural pesticides. In this area, two concepts are frequently used and are thought to describe fundamental relationships between single substance and mixture effects: Independent Action (Response Addition) and Concentration Addition. The question, to what extent these concepts may also be applied in an ecotoxicological and regulatory context may be considered a research topic of major importance, as the concepts would allow to make use of already existing single substance toxicity data for the predictive assessment of mixture toxicities. Two critical knowledge gaps are identified: (a) There is a lack of environmental realism, as a huge part of our current knowledge about the applicability of the concepts is restricted to artificial situations with respect to mixture composition or biological effect assessment. (b) The knowledge on what exactly is needed for using the concepts as tools for the predictive mixture toxicity assessment is insufficient. Both gaps seriously hamper the necessary, scientifically sound consideration of mixture toxicities in a regulatory context.
Two novel classification methods, called N3 (N-nearest neighbors) and BNN (binned nearest neighbo... more Two novel classification methods, called N3 (N-nearest neighbors) and BNN (binned nearest neighbors), are proposed. Both methods are inspired by the principles of the K-nearest neighbors (KNN) method, being both based on object pairwise similarities. Their performance was evaluated in comparison with nine well-known classification methods. In order to obtain reliable statistics, several comparisons were performed using 32 different literature data sets, which differ for number of objects, variables and classes. Results highlighted that N3 on average behaves as the most efficient classification method with similar performance to support vector machine based on radial basis function kernel (SVM/RBF). The method BNN showed on average higher performance than the classical K-nearest neighbors method.
Zeitschrift für Naturforschung A, 2008
In 1971 McClelland obtained lower and upper bounds for the total π-electron energy. We now formul... more In 1971 McClelland obtained lower and upper bounds for the total π-electron energy. We now formulate the generalized version of these bounds, applicable to the energy-like expression
This paper reports an analysis and comparison of the use of 51 different similarity coefficients ... more This paper reports an analysis and comparison of the use of 51 different similarity coefficients for computing the similarities between binary fingerprints for both simulated and real chemical data sets. Five pairs and a triplet of coefficients were found to yield identical similarity values, leading to the elimination of seven of the coefficients. The remaining 44 coefficients were then compared in two ways: by their theoretical characteristics using simple descriptive statistics, correlation analysis, multidimensional scaling, Hasse diagrams, and the recently described atemporal target diffusion model; and by their effectiveness for similarity-based virtual screening using MDDR, WOMBAT, and MUV data. The comparisons demonstrate the general utility of the well-known Tanimoto method but also suggest other coefficients that may be worthy of further attention.
Comprehensive Chemometrics, 2020
in Infrared Spectroscopy for Food Quality Analysis and Control, Da-Wen Sun (ED), Elsevier, 2008
in Novel Molecular Structure Descriptors - Theory and Applications II, I. Gutman, B. Furtula (Eds.), University of Kragujevac and Faculty of Science Kragujevac, 2010
in Scientific Data Ranking Methods: Theory and Applications, R. Todeschini, M. Pavan (EDs), Elsevier, 2008
in Novel Molecular Structure Descriptors - Theory and Applications I, I. Gutman, B. Furtula (Eds.), University of Kragujevac and Faculty of Science Kragujevac, 2010