Declaring and diagnosing research designs (original) (raw)

CRAN status CRAN RStudio mirror downloads Build status Code coverage Replications

DeclareDesign is a system for describing research designs in code and simulating them in order to understand their properties. Because DeclareDesign employs a consistent grammar of designs, you can focus on the intellectually challenging part – designing good research studies – without having to code up simulations from scratch. For more, see declaredesign.org.

Installation

To install the latest stable release of DeclareDesign, please ensure that you are running version 3.5 or later of R and run the following code:

Usage

Designs are declared by adding together design elements. Here’s a minimal example that describes a 100 unit randomized controlled trial with a binary outcome. Half the units are assigned to treatment and the remainder to control. The true value of the average treatment effect is 0.05 and it will be estimated with the difference-in-means estimator. The diagnosis shows that the study is unbiased but underpowered.

Inquiry Estimator Outcome Bias SE(Bias) Power SE(Power) n sims
ATE estimator Y -0.004 0.004 0.076 0.01 500

Companion software

The core DeclareDesign package relies on four companion packages, each of which is useful in its own right.

  1. randomizr: Easy to use tools for common forms of random assignment and sampling.
  2. fabricatr: Imagine your data before you collect it.
  3. estimatr: Fast estimators for social scientists.
  4. DesignLibrary: Templates to quickly adopt and adapt common research designs.

Learning DeclareDesign

  1. To get started, have a look at this vignette on the idea behind DeclareDesign, which covers the main functionality of the software.
  2. For an explanation of the philosophy behind DeclareDesign, examples in code and words of declaring and diagnosing common research designs in the social sciences, as well as examples of how to incorporate DeclareDesign into your own research, see the book Research Design in the Social Sciences (Blair, Coppock, Humphreys, 2023).

Package structure

Each of these declare_*() functions returns a function.

  1. [declare_model()](reference/declare%5Fmodel.html) (describes dimensions and distributions over the variables, including potential outcomes)
  2. [declare_inquiry()](reference/declare%5Finquiry.html) (takes variables in the model and calculates estimand value)
  3. [declare_sampling()](reference/declare%5Fsampling.html) (takes a population and selects a sample)
  4. [declare_assignment()](reference/declare%5Fassignment.html) (takes a population or sample and adds treatment assignments)
  5. [declare_measurement()](reference/declare%5Fmeasurement.html) (takes data and adds measured values)
  6. [declare_estimator()](reference/declare%5Festimator.html) (takes data produced by sampling, assignment, and measurement and returns estimates linked to inquiries)
  7. [declare_test()](reference/declare%5Ftest.html) (takes data produced by sampling, assignment, and measurement and returns the result of a test)

To declare a design, connect the components of your design with the + operator.

Once you have declared your design, there are four core post-design-declaration commands used to modify or diagnose your design:

  1. [diagnose_design()](reference/diagnose%5Fdesign.html) (takes a design and returns simulations and diagnosis)
  2. [draw_data()](reference/draw%5Ffunctions.html) (takes a design and returns a single draw of the data)
  3. [draw_estimates()](reference/draw%5Ffunctions.html) (takes a design and returns a single simulation of estimates)
  4. [draw_estimands()](reference/draw%5Ffunctions.html) (takes a design and returns a single simulation of estimands)

A few other features:

  1. A designer is a function that takes parameters (e.g., N) and returns a design. [expand_design()](reference/expand%5Fdesign.html) is a function of a designer and parameters that return a design.
  2. You can change the diagnosands with [declare_diagnosands()](reference/declare%5Fdiagnosands.html).

This project was generously supported by a grant from the Laura and John Arnold Foundation and seed funding from EGAP.