Enabling collaborative, systematic and reproducible research, experimentation and development with an open publication model in computer engineering

These pages are edited collaboratively with the help of the cTuning foundation. If you would like to add new links and citations or make corrections, please get in touch with Grigori Fursin.

Manifesto / motivation

Rather than writing yet another manifesto on reproducible research and experimentation in computer engineering, we have been working since 2006 on enabling the sharing and reproduction of experimental results and artifacts, originally as a side effect of our MILEPOST and cTuning.org projects. There we attempted to build a practical machine-learning-based self-tuning compiler, combining a plugin-based auto-tuning framework with the public cTuning repository of knowledge and crowdsourced predictive analytics, but we faced numerous problems, including:

  • Lack of common, large and diverse benchmarks and data sets needed to build statistically meaningful predictive models;
  • Lack of a common experimental methodology and of unified ways to preserve, systematize and share our growing optimization knowledge and research material, including benchmarks, data sets, tools, tuning plugins, predictive models and optimization results;
  • A continuously changing, complex and "black box" software and hardware stack with many hardwired and hidden optimization choices and heuristics, poorly suited to auto-tuning and machine learning;
  • Difficulty reproducing performance results submitted by users to the cTuning.org database, due to missing descriptions of the full software and hardware dependencies;
  • Difficulty validating related auto-tuning and machine-learning techniques from existing publications, due to the lack of a culture of sharing research artifacts with full experiment specifications along with publications in computer engineering.

Our events

After evangelizing this approach for 7 years and trying to solve its technical aspects, we started organizing our own events with evaluation of experimental results and related material:

  • ADAPT'14

We collaborate with our colleagues from the AEC, who recently managed to persuade the following conferences to join a similar initiative:

  • OOPSLA'13, PLDI'14


General and technological aspects of reproducibility and the open publication model

Together with the community, we are gradually trying to address the following challenges that we faced during our R&D:

  • capture, preserve, formalize, systematize, exchange and improve knowledge and experimental results, including negative ones
  • describe and catalog whole experimental setups with all related material, including algorithms, benchmarks, codelets, data sets, tools, models and any other artifacts
  • enable validation and verification of experimental results by the community
  • develop common research interfaces for existing or new tools
  • develop common experimental frameworks and repositories
  • share rare hardware and computational resources for experimental validation
  • deal with variability and the rising amount of experimental data using statistical analysis, data mining, predictive modeling and other techniques
  • implement previously published experimental scenarios (auto-tuning, run-time adaptation) on top of common infrastructure, as sketched after this list
  • enable open access to publications and data (particularly discussing intellectual property (IP) and legal issues)
  • enable interactive articles
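
To make the auto-tuning scenario above more concrete, below is a minimal, hypothetical Python sketch of such an experimental pipeline: a random search over a few compiler flag combinations that repeats each measurement and compares medians to cope with run-to-run variability. The compiler, flag list and benchmark file are illustrative assumptions rather than part of any specific framework:

    import random
    import statistics
    import subprocess
    import time

    # Hypothetical optimization space: a few GCC flag combinations.
    # Real auto-tuning spaces are far larger and machine-generated.
    FLAG_CHOICES = [
        ["-O1"], ["-O2"], ["-O3"],
        ["-O3", "-funroll-loops"],
        ["-O3", "-fomit-frame-pointer"],
    ]

    def measure(binary, repetitions=10):
        """Run a binary several times and return the median wall-clock time."""
        times = []
        for _ in range(repetitions):
            start = time.perf_counter()
            subprocess.run([binary], check=True)
            times.append(time.perf_counter() - start)
        return statistics.median(times)

    def autotune(source, trials=5):
        """Random search over flag combinations; returns the best one found."""
        best_flags, best_time = None, float("inf")
        for flags in random.sample(FLAG_CHOICES, min(trials, len(FLAG_CHOICES))):
            subprocess.run(["gcc", source, "-o", "a.out", *flags], check=True)
            t = measure("./a.out")
            print(f"{' '.join(flags):30s} median time: {t:.4f}s")
            if t < best_time:
                best_flags, best_time = flags, t
        return best_flags, best_time

    if __name__ == "__main__":
        # "benchmark.c" is a placeholder for any single-file benchmark.
        flags, t = autotune("benchmark.c")
        print("best:", " ".join(flags), f"({t:.4f}s)")

Repeating measurements and taking a robust statistic such as the median is only the simplest way to handle variability; real setups may also inspect variance or detect multi-modal timing distributions before trusting a result.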

Our expertise and work

  • Set up evaluation of experimental results and all related material for workshops, conferences and journals
  • Improve sharing, description of dependencies, and statistical reproducibility of experimental results and related material
  • Improve public Collective Mind repository of knowledge and collaborative experimentation infrastructure in computer engineering
  • Validate new open publication model

Community-driven reviewing of publications and artifacts

Pool

Packing artifacts for evaluation

  • Links to tools for possible packing (see a minimal sketch below)
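
As a minimal illustration of what such packing could look like, the hypothetical Python sketch below archives an experiment directory together with a JSON manifest recording the platform and the versions of a few tools, so that evaluators can check the environment before attempting to reproduce results. The file names, manifest fields and probed tools are illustrative assumptions, not a prescribed format:

    import json
    import platform
    import subprocess
    import tarfile
    from pathlib import Path

    def tool_version(tool):
        """Best-effort capture of a tool's version (assumes it supports --version)."""
        try:
            out = subprocess.run([tool, "--version"], capture_output=True,
                                 text=True, check=True)
            lines = out.stdout.splitlines()
            return lines[0] if lines else "unknown"
        except (OSError, subprocess.CalledProcessError):
            return "unknown"

    def pack_artifact(experiment_dir, archive_name="artifact.tar.gz"):
        """Pack an experiment directory plus a dependency manifest into a tarball."""
        root = Path(experiment_dir)
        manifest = {
            "platform": platform.platform(),
            "python": platform.python_version(),
            # Probed tools are examples; list whatever the experiment needs.
            "dependencies": {t: tool_version(t) for t in ("gcc", "make")},
            "contents": sorted(str(p.relative_to(root))
                               for p in root.rglob("*") if p.is_file()),
        }
        # Write the manifest inside the directory so it ends up in the archive.
        (root / "MANIFEST.json").write_text(json.dumps(manifest, indent=2))
        with tarfile.open(archive_name, "w:gz") as tar:
            tar.add(root, arcname=root.name)
        return archive_name

    if __name__ == "__main__":
        print("packed:", pack_artifact("my_experiment"))  # placeholder directory

Capturing even coarse dependency information in this way already helps with the reproducibility problems described above; dedicated packing tools can go much further, e.g. by snapshotting whole file systems or virtual machine images.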

Events

Links


(C) 2011-2014 cTuning foundation