R. W. Oldford (original) (raw)

R. W. Oldford

Research overview

Statistical reasoning, exploratory data analysis, data visualization, and the development of interactive computational environments that support these activities, comprise the broad areas of my research interests.

Related to computational structure and methods for interactive data analysis, but somewhat removed from software implementation, I am also interested in the philosophical structure of statistical reasoning.

And I am always interested in applications of statistical methods in the natural and computational sciences.

Interests

Education

See my YouTube or BitChute channels for more.

Posters

with various students

Software Projects

*

Loon.ggplot

Turn ggplot2 graphic data structures into interactive loon plots

PairViz

Ordering visualizations using Graph Traversal

qqtest

Self Calibrating Quantile-Quantile Plots for Visual Testing

zenplots

Zigzag Expanded Navigation Plots

Loon

Exploratory interactive data visualization.

May 2018 Significance, 15(3)

About "her emails"

Patterns in Secretary Clinton’s emails and a website (select “Code”) that allows anyone to interactively explore the patterns.

May 2018 SIAM Journal on Optimization, 28(1)

Euclidean distance matrix completion and point configurations from the minimal spanning tree

The paper introduces a special case of the Euclidean distance matrix completion problem of interest in statistical data analysis where only the minimal spanning tree distances are given and the matrix completion must preserve the minimal spanning tree. A guided random search algorithm is shown to outperform more standard optimization methods which also force peculiar and generally unwanted geometric structure on the point configurations their completions produce.

October 2017 Electronic Imaging 2018, Computational Imaging, XVI

Illuminant estimation using ensembles of multivariate regression trees

In this paper, we show that a simple and accurate ensemble model can be learned by (i) using multivariate regression trees to take into account that the chromaticity components of the illuminant are correlated and constrained, and (ii) fitting each tree by directly minimizing a loss function of interest—such as recovery angular error or reproduction angular error—rather than indirectly using the squared-error loss function as a surrogate. We show empirically that overall our method leads to improved performance on diverse image sets.

March 2016 The American Statistician, 70(1), pp. 74-90

Self-Calibrating Quantile–Quantile Plots

Quantile–quantile plots, or qqplots, are an important visual tool for many applications but their interpretation requires some care and often more experience. This apparent subjectivity is unnecessary. By drawing on the computational and display facilities now widely available, qqplots are easily enriched to help with their interpretation. An overview of quantile functions and quantile–quantile plots is presented against the backdrop of their early historical development. Strengths and shortcomings of the traditional display are described. A new enhanced qqplot, the self-calibrating qqplot, is introduced and demonstrated on a variety of examples—both synthetic and real. Real examples include normal qqplots, log-normal plots, half-normal plots for factorial experiments, qqplots for the average and standard deviation in process improvement applications, detection of multivariate outliers, and the comparison of empirical distributions. Self-calibration is had by visually incorporating sampling variation in the qqplot display in a variety of ways. The new qqplot is available through the function and R package qqtest.

December 2011 Computational Statistics, 26(4)

Graphs as navigational infrastructure for high dimensional data spaces

We propose using graph theoretic results to develop an infrastructure that tracks movement from a display of one set of variables to another. The illustrative example throughout is the real-time morphing of one scatterplot into another. Hurley and Oldford (J Comput Graph Stat 2010) made extensive use of the graph having variables as nodes and edges indicating a paired relationship between them. The present paper introduces several new graphs derivable from this one whose traversals can be described as particular movements through high dimensional spaces. These are connected to known results in graph theory and the graph theoretic results applied to the problem of visualizing high-dimensional data.

August 2000 Statistical Science, 15(3)

Scientific method, statistical method, and the speed of light

What is “statistical method”? Is it the same as “scientific method”? This paper answers the first question by specifying the elements and procedures common to all statistical investigations and organizing these into a single structure. This structure is illustrated by careful examination of the first scientific study on the speed of light carried out by A. A. Michelson in 1879. Our answer to the second question is negative. To understand this a history on the speed of light up to the time of Michelson’s study is presented. The larger history and the details of a single study allow us to place the method of statistics within the larger context of science.

Recent & Upcoming Talks

Recent Publications

Students

Current Grad Students

Avatar

Avatar

Avatar

Zehao Xu

Ph.D. student in Statistics

Size proportional Venn and Euler diagrams in 2 and 3 dimensions, vennplot(…) in R, data visualization systems

Former PhD students

Avatar

Adam Rahman

Data Scientist

Preserving Measured Structure During Generation and Reduction of Multivariate Point Configurations, scagnostics distributions, data reduction, simulation

Avatar

Adrian Waddell

Statistician

Interactive Visualization and Exploration of High-Dimensional Data, data visualization, loon

Avatar

Greg Anglin

Research Advisor (Statistics)

A Statistical Programming Environment for Modelling Counting Processes, and An object-oriented array manipulation prorocol in a statistical programming environment, Statistical computing environments, event history analysis

Avatar

Ruth Urner

Assistant Professor

Learning with non-Standard Supervision, theoretical machine learning, clustering, strong and weak learners

Avatar

Wu Zhou

Senior Researcher, Data Scientist

A new framework for clustering, ensemble cluster analysis, data mining, and machine learning, also A review and implementation of some approaches to metric clustering

Former Masters students

Avatar

Alex (Xian) Wang

Data Scientist

Interactive Micromaps in R with loon, spatial data visualization and interactive analysis, loon.micromaps

Avatar

Amanda Murdoch

Senior Analyst

Tracking Eye Movement When Observing Statistical Graphics, data analysis, experimental design, statistical modelling, EyeTrackR

Avatar

Avatar

Derek (Daoxiang) Wang

Middle Office and Valuation Senior Analyst

A Visualizing Tool for Conditional Independence, Financial analysis and copula modelling

Avatar

Erin McLeish

Ph.D. student in Computer Science

Visual Empirical Regions of Influence (VERI) Clustering, Assessment and Alternatives, computational geometry and graph-based clustering

Avatar

Glenn Lee

Game Mathematician

Eikosograms and Their Software Implementation, Categorical data visualization and analysis, Gaming probability

Avatar

Greg Anglin

Research Advisor (Statistics)

A Statistical Programming Environment for Modelling Counting Processes, and An object-oriented array manipulation prorocol in a statistical programming environment, Statistical computing environments, event history analysis

Avatar

Avatar

Hanna Kazhamiaka

Data Scientist

An Experiment in Visual Clustering Using Star Glyph Displays, data visualization, statistical modelling and machine learning

Avatar

Hugh Chipman

Professor

The Use of Projection and Sectioning in the Graphic Analysis of Multidimensional Data, Statistics

Avatar

Hudson (Hui) Zhao

?????

Implementing Surfaces in OpenGL … Calls from Macintosh Common Lisp, data visualization, OpenGL

Avatar

(Jack) Jiahua Liu

Masters of Divinity student

Glyphs and pixel-oriented glyphs for data visualization in R

Avatar

Jim Adams

Senior Advisor for Pricing and Contracting

A Study of Alaskan King Crabs, Paralithodes camtschatica (Tilesius), Near Kodiak Island, Alaska, 1960-1986, Statistical Data Analysis, Biostatistics

Avatar

Lijie (Justine) Fu

Lead Software Engineer

Implementation of Three-dimensional Scagnostics, data visualization, geometric graphs, scagnostics3D

Avatar

Michael Lewis

(Deceased)

Constraint-based programming in statistics

Avatar

Nan (Tina) Zhao

?????

A Preliminary Statistical Analysis on Risk Factors for Dementia and CIND from the Canadian Study of Health and Aging

Avatar

Natasha Wiebe

Research Manager

Colour Parameterization in a Multiparametric Image Interface, interactive data visualization, biostatistics

Avatar

Paul Poirier

?????

Visualizing surfaces, common lisp implementation of hidden line 3d rotating surfaces

Avatar

Qing Li

?????

Probability of Carrying a Mutation of Colorectal Cancer Gene hMSH2/hMLH1 Based on Family History

Avatar

Avatar

Tracey (Xin) Chen

Senior Fraud Analyst

Visual Patterns with CCmaps and Magnification Algorithms, data visualization, spatial statistics, local map distortion

Avatar

Weicong (Vivi) Ma

Data Engineer

On the Utility of Adding An Abstract Domain and Attribute Paths to SQL, Data Base Theory and Engineering

Avatar

Wenqing Liu

Data Scientist

TREC, tree reduced ensemble clustering, data science, clustering, machine learning

Avatar

Wu Zhou

Senior Researcher, Data Scientist

A new framework for clustering, ensemble cluster analysis, data mining, and machine learning, also A review and implementation of some approaches to metric clustering

Avatar

Kathy (Xiaomei) Yu

Marketing Campaign Design and Analytics Manager

R package tidytable, a visualization tool for multi-way tables, data visualization, automated table analyses

Avatar

Zehao Xu

Ph.D. student in Statistics

Size proportional Venn and Euler diagrams in 2 and 3 dimensions, vennplot(…) in R, data visualization systems