Markov Chain Monte Carlo Methods Research Papers
Naive Bayes is one of the most effective classification algorithms. In many applications, however, a ranking of examples is more desirable than a mere classification. How to extend naive Bayes to improve its ranking performance is an interesting and practically useful question. Weighted naive Bayes is an extension of naive Bayes in which attributes have different weights. This paper investigates how to learn a weighted naive Bayes with accurate ranking from data, or more precisely, how to learn the weights of a weighted naive Bayes that produce accurate ranking. We explore several methods: the gain ratio method, the hill climbing method, the Markov chain Monte Carlo method, the hill climbing method combined with the gain ratio method, and the Markov chain Monte Carlo method combined with the gain ratio method. Our experiments show that a weighted naive Bayes trained to produce accurate ranking outperforms naive Bayes.
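To make the weighting idea concrete, the following is a minimal sketch (not the paper's implementation) of a weighted naive Bayes scorer in which each attribute's log-likelihood contribution is multiplied by a weight, and the weights are tuned by simple hill climbing on AUC as the ranking measure. The Gaussian likelihood, the step size, and the synthetic data are assumptions made purely for illustration.

```python
# A minimal sketch of weighted naive Bayes for binary ranking, with per-attribute
# weights tuned by hill climbing on AUC. Gaussian likelihoods and the step size
# are assumptions, not the paper's choices.
import numpy as np
from sklearn.metrics import roc_auc_score

def wnb_scores(X, weights, priors, means, stds):
    """Class score: log P(c) + sum_i w_i * log P(x_i | c); returns P(class 1)."""
    log_post = []
    for c in (0, 1):
        ll = -0.5 * ((X - means[c]) / stds[c]) ** 2 - np.log(stds[c])
        log_post.append(np.log(priors[c]) + (weights * ll).sum(axis=1))
    log_post = np.column_stack(log_post)
    return 1.0 / (1.0 + np.exp(log_post[:, 0] - log_post[:, 1]))

def fit_gaussian_nb(X, y):
    priors = np.array([np.mean(y == c) for c in (0, 1)])
    means = np.array([X[y == c].mean(axis=0) for c in (0, 1)])
    stds = np.array([X[y == c].std(axis=0) + 1e-6 for c in (0, 1)])
    return priors, means, stds

def hill_climb_weights(X, y, n_iter=200, step=0.1, seed=0):
    rng = np.random.default_rng(seed)
    priors, means, stds = fit_gaussian_nb(X, y)
    w = np.ones(X.shape[1])                      # start from standard naive Bayes
    best_auc = roc_auc_score(y, wnb_scores(X, w, priors, means, stds))
    for _ in range(n_iter):
        cand = w.copy()
        j = rng.integers(X.shape[1])             # perturb one weight at a time
        cand[j] = max(0.0, cand[j] + rng.choice([-step, step]))
        auc = roc_auc_score(y, wnb_scores(X, cand, priors, means, stds))
        if auc > best_auc:                       # greedy: keep only improving moves
            w, best_auc = cand, auc
    return w, best_auc

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.normal(size=(500, 5))
    y = (X[:, 0] + 0.2 * X[:, 1] + rng.normal(size=500) > 0).astype(int)
    w, auc = hill_climb_weights(X, y)
    print("learned weights:", np.round(w, 2), "AUC:", round(auc, 3))
```

Starting from all weights equal to one recovers standard naive Bayes, so any accepted move can only improve the training AUC; the gain ratio and Markov chain Monte Carlo variants discussed in the abstract would replace this greedy update with a different search over the same weight vector.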
Selection of a flood frequency distribution and an associated parameter estimation procedure is an important step in flood frequency analysis. This is, however, a difficult task because the best-fit distribution must be chosen from a large number of candidate distributions and parameter estimation procedures available in the literature. This paper presents a case study with flood data from Tasmania in Australia, which examines four model selection criteria: the Akaike Information Criterion (AIC), the second-order variant of the Akaike Information Criterion (AICc), the Bayesian Information Criterion (BIC) and a modified Anderson–Darling Criterion (ADC). It has been found from Monte Carlo simulation that the ADC is more successful in recognizing the parent distribution correctly than the AIC and BIC when the parent is a three-parameter distribution. On the other hand, the AIC and BIC are better at recognizing the parent distribution correctly when the parent is a two-parameter distribution. From the seven probability distributions examined for Tasmania, it has been found that two-parameter distributions are preferable to three-parameter ones, with the Log Normal appearing to be the best selection. The paper also evaluates the three most widely used parameter estimation procedures for the Log Normal distribution: the method of moments (MOM), the method of maximum likelihood (MLE) and the Bayesian Markov chain Monte Carlo method (BAY). It has been found that the BAY procedure provides better parameter estimates for the Log Normal distribution, resulting in flood quantile estimates with smaller bias and standard error than the MOM and MLE. The findings from this study would be useful for flood frequency analyses in other Australian states and in other countries, in particular when selecting an appropriate probability distribution from a number of alternatives.
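As an illustration of the BAY procedure only (the model selection step with AIC, AICc, BIC and ADC is not shown), the following is a minimal sketch of a random-walk Metropolis sampler for the two parameters of a Log Normal fitted to log-flows, with a 100-year flood quantile computed from the posterior draws. The priors, step size and synthetic annual-maximum record are assumptions for illustration and do not reflect the Tasmanian data or the study's actual settings.

```python
# A minimal sketch of a Bayesian MCMC (BAY-style) fit of the two-parameter
# Log Normal: random-walk Metropolis for (mu, sigma) of the log-flows, then a
# 100-year flood (1% AEP) quantile from the posterior draws.
import numpy as np
from scipy.stats import norm

def log_posterior(mu, sigma, logq):
    if sigma <= 0:
        return -np.inf
    log_prior = -0.5 * (sigma / 10.0) ** 2       # weak prior on sigma (assumed), flat on mu
    log_lik = np.sum(norm.logpdf(logq, loc=mu, scale=sigma))
    return log_prior + log_lik

def metropolis_lognormal(q, n_iter=20000, step=0.05, seed=0):
    rng = np.random.default_rng(seed)
    logq = np.log(q)
    mu, sigma = logq.mean(), logq.std()          # start at the sample estimates
    lp = log_posterior(mu, sigma, logq)
    draws = []
    for _ in range(n_iter):
        mu_p = mu + step * rng.normal()
        sigma_p = sigma + step * rng.normal()
        lp_p = log_posterior(mu_p, sigma_p, logq)
        if np.log(rng.uniform()) < lp_p - lp:    # Metropolis accept/reject
            mu, sigma, lp = mu_p, sigma_p, lp_p
        draws.append((mu, sigma))
    return np.array(draws[n_iter // 2:])         # discard burn-in

if __name__ == "__main__":
    rng = np.random.default_rng(42)
    annual_max = np.exp(rng.normal(loc=5.0, scale=0.6, size=40))   # synthetic record
    post = metropolis_lognormal(annual_max)
    z99 = norm.ppf(0.99)                         # standard normal 99th percentile
    q100 = np.exp(post[:, 0] + z99 * post[:, 1]) # Q100 for each posterior draw
    print("posterior median Q100:", round(float(np.median(q100)), 1))
```

Because the quantile is computed per posterior draw, its spread also gives an uncertainty band for the design flood, which is one practical attraction of the Bayesian procedure over single-point MOM or MLE estimates.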
Multivariate adaptive regression spline fitting, or MARS (Friedman 1991), provides a useful methodology for flexible adaptive regression with many predictors. The MARS methodology produces an estimate of the mean response that is a linear combination of adaptively chosen basis functions. Recently, a Bayesian version of MARS has been proposed (Denison, Mallick and Smith 1998a; Holmes and Denison 2002), combining the MARS methodology with the benefits of Bayesian methods for accounting for model uncertainty in order to improve predictive performance. In the implementation of the Bayesian MARS approach, Markov chain Monte Carlo methods are used for computation; at each iteration of the algorithm, it is proposed to change the current model by either (a) adding a basis function (birth step), (b) deleting a basis function (death step), or (c) altering an existing basis function (change step). In the algorithm of Denison, Mallick and Smith (1998a), when a birth step is proposed, the type of basis function is determined by simulation from the prior. This works well in problems with a small number of predictors, is simple to program, and leads to a simple form for the Metropolis-Hastings acceptance probabilities. However, in problems with very large numbers of predictors, many of which are useless, it may be difficult to find interesting interactions with such an approach. The original MARS algorithm of Friedman (1991) uses a heuristic of building up higher-order interactions from lower-order ones, which greatly reduces the complexity of the search for good basis functions to add to the model. While we do not follow the intuition of the original MARS algorithm exactly, we nevertheless suggest a similar idea in which the Metropolis-Hastings proposals of Denison, Mallick and Smith (1998a) are altered to allow dependence on the current model. Our modification allows more rapid identification and exploration of important interactions, especially in problems with very large numbers of predictor variables and many useless predictors. Performance of the algorithms is compared in simulation studies.
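The following is a minimal sketch (not the paper's implementation) of the kind of model-dependent birth proposal described above: when a new basis function is proposed, predictors already used by the current MARS model receive higher proposal probability, so higher-order interactions are built up from lower-order ones. The basis-function representation, the bias parameter `kappa` and the data are assumptions, and the full reversible-jump Metropolis-Hastings acceptance ratio (which must account for these proposal probabilities) is not shown.

```python
# A minimal sketch of a birth proposal for Bayesian MARS that depends on the
# current model: predictors already in the model are proposed more often.
import numpy as np

def hinge(x, knot, sign):
    """A MARS hinge basis: max(0, sign * (x - knot))."""
    return np.maximum(0.0, sign * (x - knot))

def predictors_in_model(model):
    """Set of predictor indices used by any basis function in the current model."""
    return {j for basis in model for (j, _, _) in basis}

def propose_birth(model, X, max_degree=2, kappa=5.0, rng=None):
    """Propose a new basis function (a product of 1..max_degree hinges),
    preferring predictors that already appear in the current model."""
    rng = rng or np.random.default_rng()
    n, p = X.shape
    used = predictors_in_model(model)
    # Unnormalised proposal weights: kappa for used predictors, 1 for the rest.
    w = np.array([kappa if j in used else 1.0 for j in range(p)])
    w /= w.sum()
    degree = rng.integers(1, max_degree + 1)
    basis = []
    for _ in range(degree):
        j = rng.choice(p, p=w)                   # predictor index
        knot = X[rng.integers(n), j]             # knot at an observed data value
        sign = rng.choice([-1.0, 1.0])
        basis.append((int(j), float(knot), float(sign)))
    return basis

def evaluate_basis(basis, X):
    """Design-matrix column for one product-of-hinges basis function."""
    col = np.ones(X.shape[0])
    for j, knot, sign in basis:
        col *= hinge(X[:, j], knot, sign)
    return col

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(100, 50))               # many predictors, most useless
    model = [[(0, 0.1, 1.0)], [(1, -0.3, -1.0)]] # current model uses x0 and x1
    new_basis = propose_birth(model, X, rng=rng)
    print("proposed basis:", new_basis)
    print("design column mean:", evaluate_basis(new_basis, X).mean().round(3))
```

Setting `kappa = 1` recovers a proposal that ignores the current model, analogous to simulating the basis type from the prior; larger values concentrate the search on predictors that have already proved useful.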
The phenomenon of sponsored search advertising - where advertisers pay a fee to Internet search engines to be displayed alongside organic (non-sponsored) web search results - is gaining ground as the largest source of revenues for search engines. Using a unique 6-month panel dataset of several hundred keywords collected from a large nationwide retailer that advertises on Google, we
While statisticians are well accustomed to performing exploratory analysis in the modeling stage of an analysis, the notion of conducting preliminary general-purpose exploratory analysis in the Monte Carlo stage (or, more generally, the model-fitting stage) of an analysis is an area that we feel deserves much further attention. Toward this aim, this article proposes a general-purpose algorithm for automatic density exploration. The proposed exploration algorithm combines and expands upon components from various adaptive Markov chain Monte Carlo methods, with the Wang–Landau algorithm at its heart. Additionally, the algorithm is run on interacting parallel chains, a feature that both decreases computational cost and stabilizes the algorithm, improving its ability to explore the density. Performance of this new parallel adaptive Wang–Landau algorithm is studied in several applications. Through a Bayesian variable selection example, we demonstrate the convergence gains obtained with interacting chains. The ability of the algorithm's adaptive proposal to induce mode-jumping is illustrated through a Bayesian mixture modeling application. Last, through a two-dimensional Ising model, we demonstrate the ability of the algorithm to overcome the high correlations encountered in spatial models. Supplemental materials are available online.
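The following is a minimal sketch (not the authors' parallel adaptive Wang–Landau implementation) of the core Wang–Landau mechanism on a one-dimensional bimodal target, with several interacting chains sharing one set of bias weights: visited bins are penalized so the chains are pushed out of modes they have already explored, and the modification factor is reduced once the bin histogram is roughly flat. The target density, binning, step size and flatness threshold are all assumptions for illustration.

```python
# A minimal sketch of Wang-Landau-style biased sampling with interacting
# parallel chains that share one set of bin weights (log_theta).
import numpy as np

def log_target(x):
    # Bimodal example target: mixture of two well-separated normals.
    return np.logaddexp(-0.5 * (x + 4.0) ** 2, -0.5 * (x - 4.0) ** 2)

def wang_landau_parallel(n_chains=4, n_iter=20000, step=1.0, seed=0):
    rng = np.random.default_rng(seed)
    bins = np.linspace(-8, 8, 17)                # partition of the sample space
    log_theta = np.zeros(len(bins) - 1)          # shared bias (log bin weights)
    hist = np.zeros_like(log_theta)
    gamma = 1.0                                  # Wang-Landau modification factor
    x = rng.uniform(-8, 8, size=n_chains)        # one state per parallel chain
    samples = []

    def bin_of(v):
        return int(np.clip(np.digitize(v, bins) - 1, 0, len(log_theta) - 1))

    for _ in range(n_iter):
        for c in range(n_chains):
            prop = x[c] + step * rng.normal()
            b_cur, b_prop = bin_of(x[c]), bin_of(prop)
            # Biased acceptance: effective target is pi(x) / exp(log_theta[bin(x)])
            log_a = (log_target(prop) - log_theta[b_prop]) - \
                    (log_target(x[c]) - log_theta[b_cur])
            if np.log(rng.uniform()) < log_a:
                x[c] = prop
            b = bin_of(x[c])
            log_theta[b] += gamma                # penalise the visited bin
            hist[b] += 1
            samples.append(x[c])
        # All chains update the same bias; shrink gamma once the histogram is flat.
        if hist.min() > 0 and hist.min() / hist.mean() > 0.8:
            gamma *= 0.5
            hist[:] = 0
    return np.array(samples), log_theta

if __name__ == "__main__":
    samples, log_theta = wang_landau_parallel()
    # Both modes (around -4 and +4) should be visited despite the deep valley.
    print("fraction of visits in right mode:", round(float(np.mean(samples > 0)), 2))
```

Note that the chains sample from the biased, flattened distribution rather than the target itself; draws for the original density would need to be importance-reweighted by the bin weights, and the sketch is intended only to show how the shared bias induces mode-jumping.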