Applicability domain

Property Value
dbo:abstract The applicability domain (AD) of a QSAR model, a concept used in both chemistry and machine learning, is the physico-chemical, structural or biological space, knowledge or information on which the training set of the model has been developed, and within which the model can be applied to make predictions for new compounds. The purpose of the AD is to state whether the model's assumptions are met and for which chemicals the model can be reliably applied. In general, this holds for interpolation rather than for extrapolation. There is no single generally accepted algorithm for determining the AD; a comprehensive survey can be found in the Report and Recommendations of ECVAM Workshop 52. One rather systematic approach for defining interpolation regions involves the removal of outliers followed by a probability density distribution method using kernel-weighted sampling. Another widely used approach for the structural AD of regression QSAR models is based on the leverage, calculated from the diagonal values of the hat matrix of the modeling molecular descriptors. A rigorous benchmarking study of several AD algorithms identified the standard deviation of model predictions as the most reliable approach. To investigate the AD of a training set of chemicals, one can analyse the properties of the multivariate descriptor space of the training compounds directly, or more indirectly via distance (or similarity) metrics. When using distance metrics, care should be taken that the vector space is orthogonal and significant. This can be achieved by feature selection followed by principal components analysis. (en)
dbo:wikiPageID 13629713 (xsd:integer)
dbo:wikiPageLength 3353 (xsd:nonNegativeInteger)
dbo:wikiPageRevisionID 1049597531 (xsd:integer)
dbo:wikiPageWikiLink dbr:Multivariate_statistics dbr:Interpolation dbr:QSAR dbc:Cheminformatics dbr:Machine_learning dbr:Training_set dbr:Distance dbc:Drug_discovery dbc:Medicinal_chemistry dbr:Chemistry dbr:Extrapolation dbr:Principal_components_analysis
dbp:wikiPageUsesTemplate dbt:More_citations_needed dbt:Multiple_issues dbt:Portal_bar dbt:Technical dbt:Medicinal-chem-stub
dcterms:subject dbc:Cheminformatics dbc:Drug_discovery dbc:Medicinal_chemistry
gold:hypernym dbr:Space
rdfs:comment The applicability domain (AD) of a QSAR model, a concept used in both chemistry and machine learning, is the physico-chemical, structural or biological space, knowledge or information on which the training set of the model has been developed, and within which the model can be applied to make predictions for new compounds. (en)
rdfs:label Applicability domain (en)
owl:sameAs freebase:Applicability domain wikidata:Applicability domain https://global.dbpedia.org/id/4RceR
prov:wasDerivedFrom wikipedia-en:Applicability_domain?oldid=1049597531&ns=0
foaf:isPrimaryTopicOf wikipedia-en:Applicability_domain
is dbo:wikiPageDisambiguates of dbr:Applicability
is dbo:wikiPageRedirects of dbr:Applicability_Domain
is dbo:wikiPageWikiLink of dbr:Quantitative_structure–activity_relationship dbr:Molecular_descriptor dbr:Applicability dbr:Applicability_Domain
is foaf:primaryTopic of wikipedia-en:Applicability_domain
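
The leverage-based approach mentioned in the abstract can be illustrated with a minimal sketch. It assumes a plain descriptor matrix with one row per compound, a regression model with an intercept, and the commonly quoted warning threshold h* = 3(p + 1)/n; that threshold, the function names and the random data are assumptions for illustration and do not come from the source.

```python
import numpy as np


def leverages(X_train, X_query):
    """Leverage h = x^T (X^T X)^{-1} x of each query row w.r.t. the training set."""
    # Prepend an intercept column, matching a regression with a constant term.
    Xt = np.column_stack([np.ones(len(X_train)), X_train])
    Xq = np.column_stack([np.ones(len(X_query)), X_query])
    # The pseudo-inverse tolerates a singular X^T X (collinear descriptors).
    xtx_inv = np.linalg.pinv(Xt.T @ Xt)
    # Row-wise quadratic form; for training rows this is the hat-matrix diagonal.
    return np.einsum("ij,jk,ik->i", Xq, xtx_inv, Xq)


def in_domain(X_train, X_query):
    """Flag query compounds whose leverage does not exceed h* = 3(p + 1)/n."""
    n, p = X_train.shape
    h_star = 3.0 * (p + 1) / n  # assumed warning threshold, not from the source
    return leverages(X_train, X_query) <= h_star


# Hypothetical data: 50 training compounds described by 3 descriptors.
rng = np.random.default_rng(0)
X_train = rng.normal(size=(50, 3))
X_query = np.array([[0.0, 0.0, 0.0],      # close to the training cloud
                    [10.0, 10.0, 10.0]])  # far outside it
flags = in_domain(X_train, X_query)
```

Here the first query compound lies near the centre of the training descriptors and is flagged as inside the domain, while the second is a clear extrapolation and is flagged as outside. In practice the descriptors would usually be scaled, and possibly reduced by principal components analysis, before computing leverages.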