# Metadata design

#### What is the metadata?

The metadata is a simple `.csv` file that contains the information needed to (1) find and retrieve the data saved to a remote server and (2) segment and transform the data to compare experimental conditions. Each row of the `.csv` file is one individual from the experiment. I would recommend recording as many experimental variables as possible, just to be safe. At minimum, the file must have the following columns: `machine_name`, `region_id`, and `date`, with the date in YYYY-MM-DD format. Without these columns the data cannot be retrieved.

[![](https://bookstack.lab.gilest.ro/uploads/images/gallery/2022-11/scaled-1680-/image-1669144064645.png)](https://bookstack.lab.gilest.ro/uploads/images/gallery/2022-11/image-1669144064645.png)
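Before trying to retrieve any data, it can help to check that a metadata file has the three mandatory columns filled in and that every date is in YYYY-MM-DD format. The following is a minimal sketch using only the Python standard library, not part of any official tooling. The helper name `check_metadata`, the example machine name, and the extra `sex` and `genotype` columns are hypothetical and only for illustration.

```python
import csv
import io
from datetime import datetime

# Columns the text above lists as mandatory.
REQUIRED_COLUMNS = ("machine_name", "region_id", "date")


def check_metadata(rows):
    """Return a list of problems found in metadata rows.

    `rows` is an iterable of dicts, e.g. from csv.DictReader.
    An empty list means no problems were found.
    """
    problems = []
    for i, row in enumerate(rows, start=1):
        # Every mandatory column must be present and non-empty.
        for col in REQUIRED_COLUMNS:
            if not (row.get(col) or "").strip():
                problems.append(f"row {i}: missing '{col}'")

        # Dates must be strictly YYYY-MM-DD (zero-padded, four-digit year).
        date = (row.get("date") or "").strip()
        if date:
            try:
                parsed = datetime.strptime(date, "%Y-%m-%d")
                if parsed.strftime("%Y-%m-%d") != date:
                    raise ValueError("not zero-padded")
            except ValueError:
                problems.append(f"row {i}: date '{date}' is not YYYY-MM-DD")
    return problems


# Hypothetical example file; in practice, open your own .csv instead.
example_csv = io.StringIO(
    "machine_name,region_id,date,sex,genotype\n"
    "ETHOSCOPE_001,1,2022-11-01,M,wild_type\n"
    "ETHOSCOPE_001,2,2022-11-01,F,wild_type\n"
)
example_problems = check_metadata(csv.DictReader(example_csv))
```

To check a real file, replace the `io.StringIO` object with `open("metadata.csv", newline="")`, where the file name is whatever you called your own metadata.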

##### Top tips.

- Make the metadata exhaustive. Record everything, just in case you need the information in the future.
- Once you've finished all your replicates, put them in the same `.csv` file with a `replicate` column to identify them. It's easier than loading them separately.
- Make your metadata files straight away, as soon as you start your experiment. Time will erode the little details!
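
The tip about combining replicates can be sketched as follows. This is one possible approach using only the Python standard library, assuming each replicate's metadata has already been read into a list of row dictionaries. The helper names `combine_replicates` and `write_metadata`, the replicate labels, and the example values are hypothetical.

```python
import csv
import io


def combine_replicates(replicates):
    """Merge per-replicate metadata rows and tag each row with its replicate.

    `replicates` maps a replicate label to a list of row dicts.
    """
    combined = []
    for label, rows in replicates.items():
        for row in rows:
            # Copy the row and add the identifying 'replicate' column.
            combined.append({**row, "replicate": label})
    return combined


def write_metadata(rows, handle):
    """Write rows to an open file handle as a single .csv table."""
    # Collect column names in first-seen order so no column is dropped
    # if one replicate recorded extra variables.
    fieldnames = []
    for row in rows:
        for key in row:
            if key not in fieldnames:
                fieldnames.append(key)
    writer = csv.DictWriter(handle, fieldnames=fieldnames, restval="")
    writer.writeheader()
    writer.writerows(rows)


# Hypothetical example: two replicates run on different dates.
replicates = {
    "1": [{"machine_name": "ETHOSCOPE_001", "region_id": "1", "date": "2022-11-01"}],
    "2": [{"machine_name": "ETHOSCOPE_001", "region_id": "1", "date": "2022-11-08"}],
}
all_rows = combine_replicates(replicates)

buffer = io.StringIO()  # swap for open("metadata.csv", "w", newline="") to save to disk
write_metadata(all_rows, buffer)
```

Columns that appear in only some replicates are kept and left empty for the rows that lack them, so nothing recorded is lost when the files are merged.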