Three Steps Towards Robust Regression | Psychometrika | Cambridge Core (original) (raw)

Abstract

The three most commonly used statistics, the arithmetic mean, variance, and the product-moment correlation, are most unfortunate choices when data are not strictly Gaussian. A new measure of correlation and a measure of scale are proposed which are substantially more robust than their least squares counterparts. An illustration shows how increased robustness can be obtained through the use of equal regression weights without severe loss in accuracy. The paper also shows how incorporating knowledge about the theoretical structure of the regression coefficients into their estimation can aid substantially in increasing their robustness.

References

Anderson, N. H. Scales and statistics: Parametric and non-parametric. Psychological Bulletin, 1961, 58, 305–316.10.1037/h0042576CrossRefGoogle Scholar

Andrews, D. F., Bickel, P. J., Hampel, F. R., Huber, P. J., Rogers, W. H., and Tukey, J. W. Robust estimates of location, 1972, Princeton, N. J.: Princeton University Press.Google Scholar

Bock, R. D. Multivariate statistical methods in behavioral research, 1975, New York: McGraw-Hill.Google Scholar

Bock, R. D. and Kolakowski, D. Further evidence of sex-linked major gene influence on human spatial visualizing ability. Americal Journal of Human Genetics, 1973, 25, 1–14.Google ScholarPubMed

Bock, R. D., Wainer, H., Thissen, D., Peterson, A., Murray, J., and Roche, A. F. A parameterization of individual human growth curves. Human Biology, 1973, 45, 63–80.Google ScholarPubMed

Box, G. E. P., and Tiao, G. C. A Bayesian approach to some outlier problems. Biometrika, 1968, 55, 119–129.10.1093/biomet/55.1.119CrossRefGoogle ScholarPubMed

Czuber, E. Theorie der beobachtungsfehler. Leipzig, 1891.Google Scholar

David, H. A. Gini's mean difference rediscovered. Biometrika, 1968, 55, 573–574.Google Scholar

Devlin, S. J., Gnanadesikan, R., and Kettenring, J. R. Robust estimation and outlier detection with correlation coefficients. Biometrika, 1975, in press. (a)10.1093/biomet/62.3.531CrossRefGoogle Scholar

Devlin, S. J., Gnanadesikan, R., and Kettenring, J. R. Robust estimation of correlation and covariance matrices. Paper presented at the spring meeting of the Psychometric Society, Iowa City, April 26, 1975. (b)Google Scholar

Downton, F. Linear estimates with polynomial coefficients. Biometrika, 1966, 53, 129–141.10.1093/biomet/53.1-2.129CrossRefGoogle ScholarPubMed

Gini, C. Variabilita e mutabilita, contributo allo studio delle distribuzione e relazione statistiche. Sudi-Economico-Giuridici della R. Universita di Cagliari, 1912.Google Scholar

Gnanadesikan, R., and Kettenring, J. R. Robust estimates, residuals, and outlier detection with multiresponse data. Biometrics, 1972, 28, 81–124.10.2307/2528963CrossRefGoogle Scholar

Green, B. F. Parameter sensitivity in multivariate methods, 1974, Baltimore: Department of Psychology, Johns Hopkins University.Google Scholar

Helmert, F. R. Die Berechnung des wahrscheinlichen Beobachtungs fehlers aus den ersten Potenzen der Differenzen gleichgenauer directer Beobachtungen. Astronomische Nachrichten, 1876, 88, 257–272.10.1002/asna.18760880802CrossRefGoogle Scholar

Hogg, R. V. Adaptive robust procedures: A partial review and some suggestions for further applications and theory. Journal of the American Statistical Association, 1974, 69, 909–927.10.1080/01621459.1974.10480225CrossRefGoogle Scholar

Hogg, R. V., and Randles, R. Adaptive distribution-free regression methods. Technometrics, 1975, in press.10.1080/00401706.1975.10489366CrossRefGoogle Scholar

Hotelling, H., and Pabst, M. R. Rank correlation and tests of significance involving no assumption of normality. Annals of Mathematical Statistics, 1936, 7, 29–43.10.1214/aoms/1177732543CrossRefGoogle Scholar

Huber, P. J. Robust statistics: A review. Annals of Mathematical Statistics, 1972, 43, 1041–1067.10.1214/aoms/1177692459CrossRefGoogle Scholar

Knuth, D. E. The art of computer programming (Vol. 2). Reading, Mass.: Addison-Wesley. 1969, 1–112.Google Scholar

Mood, A. M. Introduction to the theory of statistics, 1950, New York: McGraw-Hill.Google Scholar

Roche, A. F., Wainer, H., and Thissen, D. Predicting adult stature for individuals, 1975, Basel, Switz.: Karger.Google ScholarPubMed

Samejima, F. Estimation of latent ability using a response pattern of graded scores. Psychometrika Monograph Supplement, 1969, No. 17.Google Scholar

Singleton, R. C. An efficient algorithm for sorting with minimal storage. Communications of the Association for Computing Machinery, 1969, 12, 185–187.10.1145/362875.362901CrossRefGoogle Scholar

Tukey, J. W. Exploratory data analysis (limited preliminary edition, Vol. 3), 1970, Reading, Mass.: Addison-Wesley.Google Scholar

Tukey, J. W., and McLaughlin, D. H. Less vulnerable confidence and significance procedures for location based upon a single sample: Trimming/Winsorization 1. Sankhyā, 1963, A 25, 331–352.Google Scholar

von Andrae, Uber die Bestimmung des wahrscheinlichen Fehlers durch die gegebenen Differenzen vom gleich genauen Beobachtungen einer Unbekannten. Astronomische Nachrichten, 1872, 79, 257–272.10.1002/asna.18720791702CrossRefGoogle Scholar

Wainer, H. Predicting the outcome of the Senate trial of Richard M. Nixon. Behavioral Science, 1974, 19, 404–406.10.1002/bs.3830190607CrossRefGoogle Scholar

Wainer, H. Estimating coefficients in linear models: It don't make no nevermind. Psychological Bulletin, 1975, in press.Google Scholar

Wainer, H., Gruvaeus, G., and Zill, N. Senatorial decision making: I. The determination of structure. Behavioral Science, 1973, 18, 7–19.10.1002/bs.3830180103CrossRefGoogle Scholar

Wainer, H., and Thissen, D. Multivariate semi-metric smoothing in multiple prediction. Journal of the American Statistical Association, 1975, 70. (a)Google Scholar

Wainer, H., and Thissen, D. When jackknifing fails (or does it?). Psychometrika, 1975, 40, 113–114.10.1007/BF02291483CrossRefGoogle Scholar

Wainer, H., Zill, N., and Gruvaeus, G. Senatorial decision making: II. Prediction. Behavioral Science, 1973, 18, 20–26.CrossRefGoogle Scholar

Wright, S. Evolution and the genetics of populations. Vol. 1, Genetic and biometric foundations, 1968, Chicago: University of Chicago Press.Google Scholar