multiverse: Multiplexing alternative data analyses in R notebooks

Abhraneel Sarma, Alexander Kale, michael jongho moon, Nathan Taback, Fanny Chevalier, Jessica Hullman, Matthew Kay

April 2023

Abstract

There are myriad ways to analyze any given dataset. But which one to trust? In the face of such uncertainty, analysts adopt multiverse analysis: running all reasonable analyses on the dataset. Yet this is cognitively and technically difficult with existing tools (how does one specify and execute all combinations of reasonable analyses of a dataset?) and often requires discarding existing workflows. We present multiverse, a tool for implementing multiverse analyses in R with expressive syntax supporting existing computational notebook workflows. Informed by iterative user testing, multiverse supports building up a multiverse through local changes to a single analysis and optimises execution by pruning redundant computations. We evaluate four existing multiverse tools (including multiverse) using principles of cognitive ergonomics. We identify design tradeoffs (e.g., flexibility in locally defining decisions versus making it easy to understand a multiverse’s global structure), and suggest future directions for multiverse tool design, like more effective debugging support.

Type

Conference paper

Publication

CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems