Bayesian interpretation of kernel regularization

Property	Value
dbo:abstract	Within Bayesian statistics for machine learning, kernel methods arise from the assumption of an inner product space or similarity structure on inputs. For some such methods, such as support vector machines (SVMs), the original formulation and its regularization were not Bayesian in nature, so it is helpful to understand them from a Bayesian perspective. When the kernel is symmetric and positive semidefinite, the underlying structure is a reproducing kernel Hilbert space (RKHS), an inner product space of functions that need not be finite-dimensional. In Bayesian probability, kernel methods are a key component of Gaussian processes, where the kernel function is known as the covariance function. Kernel methods have traditionally been used in supervised learning problems where the input space is usually a space of vectors and the output space is a space of scalars. More recently, these methods have been extended to problems with multiple outputs, such as multi-task learning. A mathematical equivalence between the regularization and the Bayesian points of view is easily proved when the reproducing kernel Hilbert space is finite-dimensional. The infinite-dimensional case raises subtle mathematical issues, so we consider only the finite-dimensional case here. We start with a brief review of the main ideas underlying kernel methods for scalar learning and briefly introduce the concepts of regularization and Gaussian processes. We then show how both points of view arrive at essentially equivalent estimators and describe the connection that ties them together. (en)
dbo:wikiPageID	35867897 (xsd:integer)
dbo:wikiPageLength	17543 (xsd:nonNegativeInteger)
dbo:wikiPageRevisionID	1109518986 (xsd:integer)
dbo:wikiPageWikiLink	dbr:Bayesian_linear_regression dbr:Bayesian_probability dbr:Bayesian_statistics dbr:Kernel_methods dbr:Regularized_least_squares dbr:Reproducing_kernel_Hilbert_space dbr:Estimator dbr:Gaussian_process dbr:Gramian_matrix dbr:Multi-task_learning dbr:Multivariate_normal_distribution dbr:Likelihood_function dbr:Machine_learning dbr:Kernel_methods_for_vector_output dbc:Machine_learning dbr:Positive-definite_function dbr:Posterior_probability dbr:Prior_probability dbr:Regularization_(mathematics) dbr:Hilbert_space dbc:Bayesian_statistics dbr:Support_vector_machine dbr:Symmetry_in_mathematics dbr:Tikhonov_regularization dbr:Supervised_learning dbr:Gaussian_processes
dbp:wikiPageUsesTemplate	dbt:Further dbt:NumBlk dbt:Reflist dbt:Technical dbt:EquationRef dbt:EquationNote
dcterms:subject	dbc:Machine_learning dbc:Bayesian_statistics
rdfs:comment	Within Bayesian statistics for machine learning, kernel methods arise from the assumption of an inner product space or similarity structure on inputs. For some such methods, such as support vector machines (SVMs), the original formulation and its regularization were not Bayesian in nature, so it is helpful to understand them from a Bayesian perspective. When the kernel is symmetric and positive semidefinite, the underlying structure is a reproducing kernel Hilbert space (RKHS), an inner product space of functions that need not be finite-dimensional. In Bayesian probability, kernel methods are a key component of Gaussian processes, where the kernel function is known as the covariance function. Kernel methods have traditionally been used in supervised learning problems where the input space is usually a space of vectors and the output space is a space of scalars. (en)
rdfs:label	Bayesian interpretation of kernel regularization (en)
owl:sameAs	freebase:Bayesian interpretation of kernel regularization wikidata:Bayesian interpretation of kernel regularization https://global.dbpedia.org/id/4XAJr
prov:wasDerivedFrom	wikipedia-en:Bayesian_interpretation_of_kernel_regularization?oldid=1109518986&ns=0
foaf:isPrimaryTopicOf	wikipedia-en:Bayesian_interpretation_of_kernel_regularization
is dbo:wikiPageRedirects of	dbr:Bayesian_interpretation_of_regularization
is dbo:wikiPageWikiLink of	dbr:Bayesian_linear_regression dbr:Bayesian_interpretation_of_regularization dbr:List_of_things_named_after_Thomas_Bayes dbr:Outline_of_machine_learning
is foaf:primaryTopic of	wikipedia-en:Bayesian_interpretation_of_kernel_regularization
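
The equivalence stated in the abstract for the finite-dimensional case can be checked numerically. The following is a minimal sketch, not the article's own derivation. It assumes a hypothetical two-dimensional feature map phi(x) = [1, x], a standard normal prior on the weights, and a regularized least-squares objective of the form sum of squared errors plus lam times the squared weight norm. Under those assumptions, the regularized estimator and the Gaussian process posterior mean with noise variance equal to lam should give the same prediction. All function names and data values are illustrative.

```python
def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]  # augmented copy
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def phi(x):
    """Hypothetical finite-dimensional feature map (an assumption)."""
    return [1.0, x]


def kernel(a, b):
    """Kernel induced by phi: k(a, b) = <phi(a), phi(b)>."""
    return sum(u * v for u, v in zip(phi(a), phi(b)))


def ridge_predict(xs, ys, x_new, lam):
    """Regularization view: w = (Phi^T Phi + lam I)^{-1} Phi^T y."""
    Phi = [phi(x) for x in xs]
    d = len(Phi[0])
    A = [[sum(row[i] * row[j] for row in Phi) + (lam if i == j else 0.0)
          for j in range(d)] for i in range(d)]
    b = [sum(row[i] * y for row, y in zip(Phi, ys)) for i in range(d)]
    w = solve(A, b)
    return sum(wi * pi for wi, pi in zip(w, phi(x_new)))


def gp_posterior_mean(xs, ys, x_new, noise_var):
    """Bayesian view: mean = k(x_new, X) (K + noise_var I)^{-1} y."""
    n = len(xs)
    K = [[kernel(xs[i], xs[j]) + (noise_var if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    alpha = solve(K, ys)
    return sum(kernel(x_new, xi) * ai for xi, ai in zip(xs, alpha))


# Illustrative data (made up for this sketch).
xs = [0.0, 1.0, 2.0, 3.0]
ys = [0.1, 0.9, 2.2, 2.8]
lam = 0.5

ridge_value = ridge_predict(xs, ys, 1.5, lam)
gp_value = gp_posterior_mean(xs, ys, 1.5, lam)
print(abs(ridge_value - gp_value) < 1e-9)
```

The match depends on the chosen scaling. If the squared-error term is averaged over the n training points instead of summed, the corresponding noise variance would be n times lam rather than lam.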