What are datakits?

In opendata.studio, a datakit is a structured way to organise and bundle a data analysis in a reusable and reproducible format.

A datakit contains:

the analysis algorithm and its execution environment
saved run states from algorithm executions
input and output data, along with configurable options
visualisations of data, including graph and table specifications
user interface definitions.

These elements are defined by individual components inside each datakit:

resources: store tabular data
algorithms and containers: define the algorithm code and execution environment
views: visualise data (e.g.: graphs)
interfaces: describe user interfaces for the analysis, rendered through web components.

These elements form a complete description of a data analysis that can be executed through our web interface or command line tool. Each execution is tracked, creating a reproducible record of the analysis process. Once an analysis is completed, the results and process can be easily shared or published, ensuring transparency and allowing others to build upon your work.

This tutorial will introduce you to working with datakits. To begin with, let’s create a simple datakit containing an algorithm that adds two numbers together.