David Hand - Academia.edu (original) (raw)

Papers by David Hand

Research paper thumbnail of Optimising -means clustering results with standard software packages

Computational Statistics & Data Analysis, 2005

ABSTRACT

Research paper thumbnail of Estimating class sizes by adjusting fallible classifier results

Computers & Mathematics with Applications, 1986

Research paper thumbnail of Rejoinder: Classifier Technology and the Illusion of Progress

Statistical Science, 2006

Research paper thumbnail of Local Versus Global Models for Classification Problems

The American Statistician, 2003

Research paper thumbnail of Data Mining: Statistics and More?

The American Statistician, 1998

Research paper thumbnail of On Comparing Two Treatments

The American Statistician, 1992

Abstract Choice between two treatments, A and B, is sometimes based on the probability that A wil... more Abstract Choice between two treatments, A and B, is sometimes based on the probability that A will be more effective (score higher, say) than B. Ideally, to estimate this probability a sample of subjects would receive both A and B and the proportion of (A — B) differences which are positive would be used as the estimate. Often, however, both treatments cannot be given to each subject, and inference is based on a trial using two independent samples. Unfortunately, probability structures exist for which P(A — B > 0) for two independent samples is not equal to P(A — B > 0) for matched samples. The two-independent-sample Wilcoxon test statistic addresses the former probability and hence cannot be used to answer the question, “Is the probability that A will do better than B greater than 1/2?” unless further assumptions are made.

Research paper thumbnail of Projection techniques for nonlinear principal component analysis

Statistics and Computing, 2003

Principal Components Analysis (PCA) is traditionally a linear technique for projecting multidimen... more Principal Components Analysis (PCA) is traditionally a linear technique for projecting multidimensional data onto lower dimensional subspaces with minimal loss of variance. However, there are several applications where the data lie in a lower dimensional subspace that is not linear; in these cases linear PCA is not the optimal method to recover this subspace and thus account for the largest

Research paper thumbnail of New Uses of Statistics in Retail Banking

American Journal of Mathematical and Management Sciences, 2000

ABSTRACT

Research paper thumbnail of Justifying adverse actions with new scorecard technologies

Research paper thumbnail of Kernel quantile-based estimation of expected shortfall

Research paper thumbnail of A recursive partitioning tool for interval prediction

Advances in Data Analysis and Classification, 2007

ABSTRACT

Research paper thumbnail of Temporally adaptive estimation of logistic classifiers on data streams

Advances in Data Analysis and Classification, 2009

Modern technology has allowed real-time data collection in a variety of domains, ranging from env... more Modern technology has allowed real-time data collection in a variety of domains, ranging from environmental monitoring to healthcare. Consequently, there is a growing need for algorithms capable of performing inferential tasks in an online manner, continuously revising their estimates to reflect the current status of the underlying process. In particular, we are interested in constructing online and temporally adaptive classifiers

Research paper thumbnail of Where are the large and difficult datasets?

Advances in Data Analysis and Classification, 2009

A great many comparative performance assessments of classification rules have been undertaken, ra... more A great many comparative performance assessments of classification rules have been undertaken, ranging from small ones involving just one or two methods, to large ones involving many tens of methods. We are undertaking a meta-analytic study of these studies, attempting to distil some overall conclusions. This paper describes just one of our observations. The dataset analysed in this paper contains

Research paper thumbnail of Transaction aggregation as a strategy for credit card fraud detection

Data Mining and Knowledge Discovery, 2008

Research paper thumbnail of Performance criteria for plastic card fraud detection tools

Journal of the …, 2007

... Top of page Acknowledgements. The work of Piotr Juszczak and Dave Weston described here was s... more ... Top of page Acknowledgements. The work of Piotr Juszczak and Dave Weston described here was supported by the EPSRC under grant number EP/C532589/1: ThinkCrime: Statistical and machine learning tools for plastic card and other personal fraud detection. ...

Research paper thumbnail of Averaging over decision trees

Journal of Classification, 1996

Research paper thumbnail of Statistical fraud detection: A review

Statistical Science, 2002

Research paper thumbnail of Methods and models in statistics. In honour of Professor John Nelder, FRS. Written papers of the symposium, London, UK, March 29–30, 2004

Research paper thumbnail of Banking, Statistics in

Research paper thumbnail of Expert Systems in Statistics

The Knowledge Engineering Review, 1984

Statistical expert systems are attracting increasing attention as a possible way to alleviate the... more Statistical expert systems are attracting increasing attention as a possible way to alleviate the shortage of expert consultant statisticians. This paper summarises the requirements of such systems, showing how the demands of data analysis are different from those of other fields, and describes some recent work.

Research paper thumbnail of Optimising -means clustering results with standard software packages

Computational Statistics & Data Analysis, 2005

ABSTRACT

Research paper thumbnail of Estimating class sizes by adjusting fallible classifier results

Computers & Mathematics with Applications, 1986

Research paper thumbnail of Rejoinder: Classifier Technology and the Illusion of Progress

Statistical Science, 2006

Research paper thumbnail of Local Versus Global Models for Classification Problems

The American Statistician, 2003

Research paper thumbnail of Data Mining: Statistics and More?

The American Statistician, 1998

Research paper thumbnail of On Comparing Two Treatments

The American Statistician, 1992

Abstract Choice between two treatments, A and B, is sometimes based on the probability that A wil... more Abstract Choice between two treatments, A and B, is sometimes based on the probability that A will be more effective (score higher, say) than B. Ideally, to estimate this probability a sample of subjects would receive both A and B and the proportion of (A — B) differences which are positive would be used as the estimate. Often, however, both treatments cannot be given to each subject, and inference is based on a trial using two independent samples. Unfortunately, probability structures exist for which P(A — B > 0) for two independent samples is not equal to P(A — B > 0) for matched samples. The two-independent-sample Wilcoxon test statistic addresses the former probability and hence cannot be used to answer the question, “Is the probability that A will do better than B greater than 1/2?” unless further assumptions are made.

Research paper thumbnail of Projection techniques for nonlinear principal component analysis

Statistics and Computing, 2003

Principal Components Analysis (PCA) is traditionally a linear technique for projecting multidimen... more Principal Components Analysis (PCA) is traditionally a linear technique for projecting multidimensional data onto lower dimensional subspaces with minimal loss of variance. However, there are several applications where the data lie in a lower dimensional subspace that is not linear; in these cases linear PCA is not the optimal method to recover this subspace and thus account for the largest

Research paper thumbnail of New Uses of Statistics in Retail Banking

American Journal of Mathematical and Management Sciences, 2000

ABSTRACT

Research paper thumbnail of Justifying adverse actions with new scorecard technologies

Research paper thumbnail of Kernel quantile-based estimation of expected shortfall

Research paper thumbnail of A recursive partitioning tool for interval prediction

Advances in Data Analysis and Classification, 2007

ABSTRACT

Research paper thumbnail of Temporally adaptive estimation of logistic classifiers on data streams

Advances in Data Analysis and Classification, 2009

Modern technology has allowed real-time data collection in a variety of domains, ranging from env... more Modern technology has allowed real-time data collection in a variety of domains, ranging from environmental monitoring to healthcare. Consequently, there is a growing need for algorithms capable of performing inferential tasks in an online manner, continuously revising their estimates to reflect the current status of the underlying process. In particular, we are interested in constructing online and temporally adaptive classifiers

Research paper thumbnail of Where are the large and difficult datasets?

Advances in Data Analysis and Classification, 2009

A great many comparative performance assessments of classification rules have been undertaken, ra... more A great many comparative performance assessments of classification rules have been undertaken, ranging from small ones involving just one or two methods, to large ones involving many tens of methods. We are undertaking a meta-analytic study of these studies, attempting to distil some overall conclusions. This paper describes just one of our observations. The dataset analysed in this paper contains

Research paper thumbnail of Transaction aggregation as a strategy for credit card fraud detection

Data Mining and Knowledge Discovery, 2008

Research paper thumbnail of Performance criteria for plastic card fraud detection tools

Journal of the …, 2007

... Top of page Acknowledgements. The work of Piotr Juszczak and Dave Weston described here was s... more ... Top of page Acknowledgements. The work of Piotr Juszczak and Dave Weston described here was supported by the EPSRC under grant number EP/C532589/1: ThinkCrime: Statistical and machine learning tools for plastic card and other personal fraud detection. ...

Research paper thumbnail of Averaging over decision trees

Journal of Classification, 1996

Research paper thumbnail of Statistical fraud detection: A review

Statistical Science, 2002

Research paper thumbnail of Methods and models in statistics. In honour of Professor John Nelder, FRS. Written papers of the symposium, London, UK, March 29–30, 2004

Research paper thumbnail of Banking, Statistics in

Research paper thumbnail of Expert Systems in Statistics

The Knowledge Engineering Review, 1984

Statistical expert systems are attracting increasing attention as a possible way to alleviate the... more Statistical expert systems are attracting increasing attention as a possible way to alleviate the shortage of expert consultant statisticians. This paper summarises the requirements of such systems, showing how the demands of data analysis are different from those of other fields, and describes some recent work.