diff --git a/.nojekyll b/.nojekyll index d38689ec..4f7d2a8c 100644 --- a/.nojekyll +++ b/.nojekyll @@ -1 +1 @@ -c0d3f531 \ No newline at end of file +0af61480 \ No newline at end of file diff --git a/develop/01_RDM_intro.html b/develop/01_RDM_intro.html index d90925e6..a40e2753 100644 --- a/develop/01_RDM_intro.html +++ b/develop/01_RDM_intro.html @@ -295,7 +295,7 @@

1. Introduction to RDM

Modified
-

April 25, 2024

+

April 26, 2024

@@ -343,7 +343,7 @@

FAIR Research Data Management and the Data Lifecycle

-

The definition of Management is “the practice of managing; handling, supervision, or control.”.

+

The definition of Management is “the practice of managing; handling, supervision, or control”.

In accordance with the UCPH Policy for Research Data Management, research data encompasses both physical material and digital information gathered, observed, produced, or formulated during research activities. This broad definition includes various types of data serving as the foundation for the research, such as specimens, notebooks, interviews, texts, literature, digital raw data, recordings, computer code, and meticulous documentation of these materials and data, forming the core of the analysis that underlies the research outcomes.

diff --git a/develop/02_DMP.html b/develop/02_DMP.html index 668f5142..8fea2287 100644 --- a/develop/02_DMP.html +++ b/develop/02_DMP.html @@ -261,7 +261,7 @@

2. Data Management Plan

Modified
-

April 25, 2024

+

April 26, 2024

diff --git a/develop/03_DOD.html b/develop/03_DOD.html index d8d33bbe..247f6f83 100644 --- a/develop/03_DOD.html +++ b/develop/03_DOD.html @@ -311,7 +311,7 @@

3. Data organization and storage

Modified
-

April 25, 2024

+

April 26, 2024

@@ -742,7 +742,7 @@

Quick tutor

Learn how to create your own template here.

-

We offer workshops on practical RDM for NGS data. Keep an eye on the upcoming events on the Sandbox website.

+

We offer workshops on practical RDM for biodata. Keep an eye on the upcoming events on the Sandbox website.

@@ -867,7 +867,7 @@

Manual Download

Naming conventions

-

Consistent naming conventions play a crucial role in scientific research by enhancing organization and data retrieval. By adopting standardized naming conventions, researchers ensure that files, experiments, or datasets are labeled logically, facilitating easy location and comparison of similar data. For instance, in fields like genomics, uniform naming conventions for files associated with particular experiments or samples allow for swift identification and comparison of relevant data, streamlining the research process and contributing to the reproducibility of findings. Overall, promotes efficiency, collaboration, and the integrity of scientific work.

+

Consistent naming conventions play a crucial role in scientific research by enhancing organization and data retrieval. By adopting standardized naming conventions, researchers ensure that files, experiments, or datasets are labeled logically, facilitating easy location and comparison of similar data. This applies across many fields: in genomics or health data science, for example, uniform naming conventions for files associated with particular experiments or samples allow for swift identification and comparison of relevant data, streamlining the research process and contributing to the reproducibility of findings. Overall, this practice promotes efficiency, collaboration, and the integrity of scientific work.

@@ -893,648 +893,33 @@

Naming conventions

  • Not all search tools may work well with spaces (messy to indicate paths)
  • If the length is a concern, use capital letters to delimit words (camelCase).
  • -
  • Sequential numbering: Use a two-‑digit format for single-digit numbers (0–9) to ensure correct numerical sequence order (for example, 01 and not 1)
  • +
  • Sequential numbering: Use a two-digit format for single-digit numbers (0–9) to ensure correct numerical sequence order, for example 01 rather than 1, if your sequence only goes up to 99 (see the short sketch after this list)
  • Version control: Indicate the version (“V”) or revision (“R”) as the last element, using the two-digit format (e.g., v01, v02)
  • Write down your naming convention pattern and document it in the README file
  • -
    +
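To make these tips concrete, here is a minimal sketch (in Python; not part of the lesson itself) of a helper that assembles file names from camelCase fields, an optional zero-padded sequence number, a YYYYMMDD date, and the version as the last element. The function name and arguments are illustrative only.

```python
from datetime import date

def build_filename(*fields, seq=None, version=None, ext="png"):
    """Join camelCase fields with underscores and append sequence, date and version."""
    parts = list(fields)
    if seq is not None:
        parts.append(f"{seq:02d}")                      # 01 instead of 1 (leading zero)
    parts.append(date.today().strftime("%Y%m%d"))       # date in YYYYMMDD format
    if version is not None:
        parts.append(f"v{version:02d}")                  # version as the last element, two digits
    return "_".join(parts) + "." + ext

# Example output: heatmap_sampleCor_<today's date>_v01.png
print(build_filename("heatmap", "sampleCor", version=1))
```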
    -Define your file name conventions +Create your own naming conventions
    -
    +
    -

    Avoid long and complicated names and ensure your file names are both informative and easy to manage:

    -
      -
    1. For saving a new plot, a heatmap representing sample correlations
    2. -
    3. When naming the file for the document containing the Research Data Management Course Objectives (Version 2, 2nd May 2024) from the University of Copenhagen
    4. -
    5. Consider the most common file types you work with, such as visualizations, tables, etc., and create logical and clear file names
    6. -
    -
    - -
    -
    -
    -
    -
      -
    1. heatmap_sampleCor_20240101.png
    2. -
    3. KU_RDM-objectives_20240502_v02.doc or KU_RDMObj_20240502_v02.doc
    4. -
    -
    +

    Consider the most common types of files and folders you will be working with, such as visualizations, results tables, and processed files. Develop a logical and clear naming system for these files based on the tips provided above. Aim for concise and straightforward names to avoid complexity.

    -
    -
    -
    -
    -
    - -Additional file naming conventions - -

    -

    -
    -
    -
| name | description | naming_convention | file format | example |
|---|---|---|---|---|
| .fastq | raw sequencing reads | nan | nan | sampleID_run_read1.fastq |
| .fastqc | quality control from fastqc | nan | nan | sampleID_run_read1.fastqc |
| .bam | aligned reads | nan | nan | sampleID_run_read1.bam |
| GTF | sequence annotation | nan | nan | one of https://www.gencodegenes.org/ |
| GFF | sequence annotation | nan | nan | one of https://www.gencodegenes.org/ |
| .bed | genome locations | nan | nan | nan |
| .bigwig | genome coverage | nan | nan | nan |
| .fasta | sequence data (nucleotide/aminoacid) | nan | nan | one of https://www.gencodegenes.org/ |
| Multiqc report | QC aggregated report | <assayID>_YYYYMMDD.multiqc | multiqc | RNA_20200101.multiqc |
| Count matrix | final count matrix | <assayID>_cm_aligner_YYYYMMDD.tsv | tsv | RNA_cm_salmon_20200101.tsv |
| DEA | differential expression analysis results | DEA_<condition1-condition2>_LFC<absolute_threshold>_p<pvalue decimals>_YYYYMMDD.tsv | tsv | DEA_treat-untreat_LFC1_p01_20200101.tsv |
| DBA | differential binding analysis results | DBA_<condition1-condition2>_LFC<absolute_threshold>_p<pvalue decimals>_YYYYMMDD.tsv | tsv | DBA_treat-untreat_LFC1_p01_20200101.tsv |
| MAplot | MA plot | MAplot_<condition1-condition2>_YYYYMMDD.jpeg | jpeg | MAplot_treat-untreat_20200101.jpeg |
| Heatmap plot | Heatmap plot of anything | heatmap_<type>_YYYYMMDD.jpeg | jpeg | heatmap_sampleCor_20200101.jpeg |
| Volcano plot | Volcano plot | volcano_<condition1-condition2>_YYYYMMDD.jpeg | jpeg | volcano_treat-untreat_20200101.jpeg |
| Venn diagram | Venn diagram | venn_<type>_YYYYMMDD.jpeg | jpeg | venn_consensus_20200101.jpeg |
| Enrichment table | Enrichment results | nan | tsv | nan |
    -
    -
    -
    -

    -
    +

    To learn more about naming conventions for NGS analysis and see additional examples, click here.

    Wrap up

    diff --git a/develop/04_metadata.html b/develop/04_metadata.html index 91075922..c3118951 100644 --- a/develop/04_metadata.html +++ b/develop/04_metadata.html @@ -261,7 +261,7 @@

    On this page

    - -
    +
    @@ -884,7 +925,7 @@

    Catalog browser

    -
    +
    @@ -893,7 +934,7 @@

    Catalog browser

    -
    +
    diff --git a/develop/05_VC.html b/develop/05_VC.html index 2ed0ff9f..8451bc27 100644 --- a/develop/05_VC.html +++ b/develop/05_VC.html @@ -267,7 +267,7 @@

    5. Data Analysis with Version Control

    Modified
    -

    April 25, 2024

    +

    April 26, 2024

    diff --git a/develop/06_pipelines.html b/develop/06_pipelines.html index 871d7283..7720d84c 100644 --- a/develop/06_pipelines.html +++ b/develop/06_pipelines.html @@ -246,7 +246,7 @@

    6. Processing and analyzing biodata

    Modified
    -

    April 25, 2024

    +

    April 26, 2024

    diff --git a/develop/07_repos.html b/develop/07_repos.html index c0f490fe..22f6e43c 100644 --- a/develop/07_repos.html +++ b/develop/07_repos.html @@ -260,7 +260,7 @@

    7. Storing and sharing biodata

    Modified
    -

    April 25, 2024

    +

    April 26, 2024

    diff --git a/develop/contributors.html b/develop/contributors.html index dfe90bf4..56d13c70 100644 --- a/develop/contributors.html +++ b/develop/contributors.html @@ -152,7 +152,7 @@

    Practical material

    Modified
    -

    April 25, 2024

    +

    April 26, 2024

    diff --git a/develop/examples/NGS_OS_FAIR.html b/develop/examples/NGS_OS_FAIR.html index 05c16cce..b8fca30d 100644 --- a/develop/examples/NGS_OS_FAIR.html +++ b/develop/examples/NGS_OS_FAIR.html @@ -177,7 +177,7 @@ + +
  • File naming convention examples +
  • @@ -252,9 +255,8 @@

    On this page

    -
    +
    -

    NGS data strategies

    @@ -272,7 +274,7 @@

    NGS data strategies

    Modified
    -

    April 25, 2024

    +

    April 26, 2024

    @@ -283,8 +285,6 @@

    NGS data strategies

    -
    -

    Effective RDM Practices in NGS Analysis

    @@ -298,12 +298,13 @@

    Effective RDM Practices in NGS Analysis

    Time Estimation: X minutes

    💬 Learning Objectives:

      -
    1. Next Generation Sequencing data types and metadata
    2. -
    3. Best practices for software and code management
    4. -
    5. Pipelines and workflows
    6. +
    7. NGS data strategies
    8. +
    9. File naming conventions examples
    +
    +

    Effective RDM Practices in NGS Analysis

    In the data life cycle for Next Generation Sequencing (NGS) technology data, processing, and analyzing are critical phases that involve transforming raw sequencing data into meaningful biological insights. Researchers apply computational methods and bioinformatics tools to extract valuable information from the vast amount of sequencing data generated in NGS experiments. We’ll first explore the primary data types generated pre- and post-processing and the importance of detailed documentation. We will then focus on good practices used when performing data analysis and software development.

    +
    +
    +
    +
    +

    File naming convention examples

    +
    +
    +
    +
| name | description | naming_convention | file format | example |
|---|---|---|---|---|
| .fastq | raw sequencing reads | nan | nan | sampleID_run_read1.fastq |
| .fastqc | quality control from fastqc | nan | nan | sampleID_run_read1.fastqc |
| .bam | aligned reads | nan | nan | sampleID_run_read1.bam |
| GTF | sequence annotation | nan | nan | one of https://www.gencodegenes.org/ |
| GFF | sequence annotation | nan | nan | one of https://www.gencodegenes.org/ |
| .bed | genome locations | nan | nan | nan |
| .bigwig | genome coverage | nan | nan | nan |
| .fasta | sequence data (nucleotide/aminoacid) | nan | nan | one of https://www.gencodegenes.org/ |
| Multiqc report | QC aggregated report | <assayID>_YYYYMMDD.multiqc | multiqc | RNA_20200101.multiqc |
| Count matrix | final count matrix | <assayID>_cm_aligner_YYYYMMDD.tsv | tsv | RNA_cm_salmon_20200101.tsv |
| DEA | differential expression analysis results | DEA_<condition1-condition2>_LFC<absolute_threshold>_p<pvalue decimals>_YYYYMMDD.tsv | tsv | DEA_treat-untreat_LFC1_p01_20200101.tsv |
| DBA | differential binding analysis results | DBA_<condition1-condition2>_LFC<absolute_threshold>_p<pvalue decimals>_YYYYMMDD.tsv | tsv | DBA_treat-untreat_LFC1_p01_20200101.tsv |
| MAplot | MA plot | MAplot_<condition1-condition2>_YYYYMMDD.jpeg | jpeg | MAplot_treat-untreat_20200101.jpeg |
| Heatmap plot | Heatmap plot of anything | heatmap_<type>_YYYYMMDD.jpeg | jpeg | heatmap_sampleCor_20200101.jpeg |
| Volcano plot | Volcano plot | volcano_<condition1-condition2>_YYYYMMDD.jpeg | jpeg | volcano_treat-untreat_20200101.jpeg |
| Venn diagram | Venn diagram | venn_<type>_YYYYMMDD.jpeg | jpeg | venn_consensus_20200101.jpeg |
| Enrichment table | Enrichment results | nan | tsv | nan |
    +
    +
    +

    Click below to access a list of the most common file formats used when working with NGS data.

    @@ -521,8 +1108,6 @@

    4. Pipelin

    Explore more data types at the UCSC webpage. Check out this tutorial for more detailed explanations.

    -
    -

    Wrap up

In this lesson, we have taken a look at the vast and diverse landscape of bioinformatics data.

    diff --git a/develop/examples/NGS_metadata.html b/develop/examples/NGS_metadata.html index 9a9fdb35..bba83638 100644 --- a/develop/examples/NGS_metadata.html +++ b/develop/examples/NGS_metadata.html @@ -177,7 +177,7 @@
    -
    -

    2. Metadata

    -

    Metadata is the behind-the-scenes information that makes sense of data and gives context and structure. For biodata, metadata includes information such as when and where the data was collected, what it represents, and how it was processed. Let’s check what kind of relevant metadata is available for NGS data and how to capture it in your Assay or Project folders. Both of these folders contain a metadata.yml file and a README.md file. In this section, we will check what kind of information you should collect in each of these files.

    -
    +
    +

    2. Data documentation

    +

    Data documentation involves organizing, describing, and providing context for datasets and projects. While metadata concentrates on the data itself, README files provide a broader perspective on the overall project or resource.

    +
    +

    Metadata

    +
    -Metadata and controlled vocabularies +metadata.yml
    -

    In order for metadata to be most useful, you should try to use controlled vocabularies for all your fields. For example, tissue could be described with the UBERON ontologies, species using the NCBI taxonomy, diseases using the Mondo database, etc. Unfortunately, implementing a systematic way of using these vocabularies is rather complex and outside the scope of this workshop, but you are very welcome to try to implement them on your own!

    -
    -
    -
    -

    README.md file

    -

    The README.md file is a markdown file that allows you to write a long description of the data placed in a folder. Since it is a markdown file, you are able to write in rich text format (bold, italic, include links, etc) what is inside the folder, why it was created/collected, and how and when. If it is an Assay folder, you could include the laboratory protocol used to generate the samples, images explaining the experiment design, a summary of the results of the experiment, and any sort of comments that would help to understand the context of the experiment. On the other hand, a ‘Project’ README file may contain a description of the project, what are its aims, why is it important, what ‘Assays’ is it using, how to interpret the code notebooks, a summary of the results and, again, any sort of comments that would help to understand the project.

    -

    Here is an example of a README file for a Project folder:

    -
    # NGS Analysis Project: Exploring Gene Expression in Human Tissues
    -
    -## Aims
    -
    -This project aims to investigate gene expression patterns across various human tissues using Next Generation Sequencing (NGS) data. By analyzing the transcriptomes of different tissues, we seek to uncover tissue-specific gene expression profiles and identify potential markers associated with specific biological functions or diseases.
    -
    -## Why It's Important
    -
    -Understanding tissue-specific gene expression is crucial for deciphering the molecular basis of health and disease. Identifying genes that are uniquely expressed in certain tissues can provide insights into tissue function, development, and potential therapeutic targets. This project contributes to our broader understanding of human biology and has implications for personalized medicine and disease research.
    -
    -## Datasets
    -
    -We have used internal datasets with IDs: RNA_humanSkin_20201030, RNA_humanBrain_20210102, RNA_humanLung_20220304.
    -
    -In addition, we utilized publicly available NGS datasets from the GTEx (Genotype-Tissue Expression) project, which provides comprehensive RNA-seq data across multiple human tissues. These datasets offer a wealth of information on gene expression levels and isoform variations across diverse tissues, making them ideal for our analysis.
    -
    -## Summary of Results
    -
    -Our analysis revealed distinct gene expression patterns among different human tissues. We identified tissue-specific genes enriched in brain tissues, highlighting their potential roles in neurodevelopment and function. Additionally, we found a set of genes that exhibit consistent expression across a range of tissues, suggesting their fundamental importance in basic cellular processes.
    -
    -Furthermore, our differential expression analysis unveiled significant changes in gene expression between healthy and diseased tissues, shedding light on potential molecular factors underlying various diseases. Overall, this project underscores the power of NGS data in unraveling intricate gene expression networks and their implications for human health.
    -
    ----
    -
    -For more details, refer to our [Jupyter Notebook](link-to-jupyter-notebook.ipynb) for the complete analysis pipeline and code.
    -
    -
    -

    metadata.yml

    -

The metadata file is a YAML (.yml) file: a plain-text document that stores data in a human-readable serialization format.

    -
    -
    -

    -
    yaml file example
    -
    +

Choose the format that best suits the project’s needs. In this workshop, we will focus on YAML, as it is widely used for configuration files (e.g., in conda or pipelines).

    +
    +
    -
    -

    Metadata fields

    -

    There is a ton of information you can collect regarding an NGS assay or a project. Some information fields are very general, such as author or date, while others are specific to the Assay or Project folder. Below, we will take a look at the minimal information you should collect in each of the folders.

    -
    -

    General metadata fields

    -

    Here you can find a list of suggestions for general metadata fields that can be used for both assays and project folders:

    +
    +File formats +
    +
    +
    +
    +
    +
    +
      -
    • Title: A brief yet informative name for the dataset.
    • -
    • Author(s): The individual(s) or organization responsible for creating the dataset. You can use your ORCID
    • -
    • Date Created: The date when the dataset was originally generated or compiled. Use YYYY-MM-DD format!
    • -
    • Description: A short narrative explaining the content, purpose, and context.
    • -
    • Keywords: A set of descriptive terms or phrases that capture the folder’s main topics and attributes.
    • -
    • Version: The version number or identifier for the folder, useful for tracking changes.
    • -
    • License: The type of license or terms of use associated with the dataset/project.
    • +
    • XML (eXtensible Markup Language): uses custom tags to describe data and allows for a hierarchical structure.
    • +
    • JSON (JavaScript Object Notation): lightweight and human-readable format that is easy to parse and generate.
    • +
  • CSV (Comma-Separated Values) or TSV (Tab-Separated Values): simple and widely supported formats for representing tabular data. Easy to manipulate using software or programming languages. Often used for sample metadata.
    • +
    • YAML (YAML Ain’t Markup Language): human-readable data serialization format, commonly used as project configuration files.
    -
    -
    -

    Assay metadata fields

    -

    Here you will find a table with possible metadata fields that you can use to annotate and track your Assay folders:

    -
    -
    -
    -
| Metadata field | Definition | Format | Ontology | Example |
|---|---|---|---|---|
| assay_ID | Identifier for the assay that is at least unique within the project | <Assay-ID>_<keyword>_YYYYMMDD | NA | CHIP_Oct4_20200101 |
| assay_type | The type of experiment performed, eg ATAC-seq or seqFISH | NA | ontology field, e.g. EFO or OBI | ChIPseq |
| assay_subtype | More specific type of assay, like bulk nascent RNAseq or single cell ATACseq | NA | ontology field, e.g. EFO or OBI | bulk ChIPseq |
| owner | Owner of the assay (who made the experiment?) | <First Name> <Last Name> | NA | Jose Romero |
| platform | The type of instrument used to perform the assay, eg Illumina HiSeq 4000 or Fluidigm C1 microfluidics platform | NA | ontology field, e.g. EFO or OBI | Illumina |
| extraction_method | Technique used to extract the nucleic acid from the cell | NA | ontology field, e.g. EFO or OBI | NA |
| library_method | Technique used to amplify a cDNA library | NA | ontology field, e.g. EFO or OBI | NA |
| external_accessions | Accession numbers from external resources to which assay or protocol information was submitted | NA | eg protocols.io, AE, GEO accession number, etc | GSEXXXXX |
| keyword | Keyword for easy identification | word | camelCase | Oct4ChIP |
| date | Date of assay creation | YYYYMMDD | NA | 20200101 |
| nsamples | Number of samples analyzed in this assay | <integer> | NA | 9 |
| is_paired | Paired fastq files or not | <single OR paired> | NA | single |
| pipeline | Pipeline used to process data and version | NA | NA | nf-core/chipseq -r 1.0 |
| strandedness | The strandedness of the cDNA library | <+ OR - OR *> | NA | * |
| processed_by | Who processed the data | <First Name> <Last Name> | NA | Sarah Lundregan |
| organism | Organism origin | <Genus species> | Taxonomy name | Mus musculus |
| origin | Is internal or external (from a public resource) data | <internal OR external> | NA | internal |
| path | Path to files | </path/to/file> | NA | NA |
| short_desc | Short description of the assay | plain text | NA | Oct4 ChIP after pERK activation |
| ELN_ID | ID of the experiment/assay in your Electronic Lab Notebook software, like labguru or benchling | plain text | NA | NA |

    Others such as RDF or HDF5.

    -
    -
    -

    Project metadata fields

    -

    Here you will find a table with possible metadata fields that you can use to annotate and track your Project folders:

    -
    -
    -
    -
| Metadata field | Definition | Format | Ontology | Example |
|---|---|---|---|---|
| project | Project ID | <surname>_et_al_2023 | NA | proks_et_al_2023 |
| author | Owner of the project | <First name> <Surname> | NA | Martin Proks |
| date | Date of creation | YYYYMMDD | NA | 20230101 |
| description | Short description of the project | Plain text | NA | This is a project describing the effect of Oct4 perturbation after pERK activation |
    +

    Link to the file format database.

    +

Metadata in biological datasets refers to the information that describes the data and provides context for how the data was collected, processed, and analyzed. Metadata is crucial for understanding, interpreting, and using biological datasets effectively. It also ensures that datasets are reusable, reproducible, and understandable by other researchers. Some of the components may differ depending on the type of project, but there are general concepts that will always be shared across different projects (a minimal example follows the list below):

    +
      +
    • Sample information and collection details
    • +
  • Biological context (such as experimental conditions, if applicable)
    • +
    • Data description
    • +
    • Data processing steps applied to the raw data
    • +
    • Annotation and Ontology terms
    • +
    • File metadata (file type, file format, etc.)
    • +
    • Ethical and Legal Compliance (ownership, access, provenance)
    • +
    +
    +
    +
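As a rough illustration of how these shared components could translate into a metadata.yml file, here is a minimal sketch; the field names and values are made up for this example, and parsing it in Python assumes the PyYAML package is installed.

```python
import yaml  # PyYAML, assumed to be installed

# Illustrative metadata.yml content; field names are examples, not a fixed standard
example_metadata = """
project: oct4_perturbation_2023
author: Jane Doe
date: "20230101"
description: RNA-seq of Oct4 perturbation after pERK activation
organism: Mus musculus
experimental_condition: treated vs untreated
processing: nf-core/rnaseq
file_format: fastq.gz
license: CC-BY-4.0
"""

# Load the YAML text into a Python dictionary
metadata = yaml.safe_load(example_metadata)
print(metadata["project"], metadata["organism"])
```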
    + +
    +
    +Metadata and controlled vocabularies +
    +
    +
    +

To maximize the usefulness of metadata, aim to use controlled vocabularies across all fields. Read more about data documentation and find examples of ontology services in lesson 4. We encourage you to begin implementing them systematically on your own (under the “sources” section, you will find some helpful links to guide you in putting them into practice).

    +

If you work with NGS data, check out these recommendations and examples of metadata for samples, projects, and datasets.

    +
    - -
    -

    More info

    -

    The information provided in this lesson is not at all exhaustive. There might be many more fields and controlled vocabularies that could be useful for your NGS data. We recommend that you take a look at the following sources for more information!

    +
    +

    README file

    +
    +
    +
    + +
    +
    +README.md +
    +
    +
    +

Choose the format that best suits the project’s needs. In this workshop, we will focus on Markdown, as it is the most widely used format due to its balance of simplicity and expressive formatting options.

    +
    + +
    +
    +
    +
      -
    • Transcriptomics metadata standards and fields
    • -
    • Bionty: Biological ontologies for data scientists.
    • +
  • Markdown (.md): commonly used because it is easy to read and write and is compatible across platforms (e.g., GitHub, GitLab). Supports formatting like headings, lists, links, images, and code blocks.
    • +
  • Plain Text (.txt): simple and straightforward format without any rich formatting, great for basic instructions. Lacks the ability to structure content effectively.
    • +
  • ReStructuredText (.rst): commonly used for Python projects. Supports advanced formatting (tables, links, images, and code blocks).
    -
    -
    +

    Others such as HTML, YAML and Notebooks.

    +
    +
    +
    +
    +
    +

    Link to the file format database

    +
    +
    +

The README.md file is a markdown file that provides a comprehensive description of the data within a folder. Its rich text format (including bold, italic, links, etc.) allows you to explain the contents of the folder, as well as the reasons and methods behind its creation or collection. The content will vary depending on what is described (data or assays, a project, software…).

    +

    Here is an example of a README file for a bioinformatics project:

    +
    +
    -Exercise 2: modify the metadata.yml files in your Cookiecutter templates +README
    -
    -
    -
    -

    We have seen some examples of metadata for NGS data. It is time now to customize your Cookiecutter templates and modify the metadata.yml files so that they fit your needs!

    -
      -
    1. Think about what kind of metadata you would like to include.
    2. -
    3. Modify the cookiecutter.json file so that when you create a new folder template, all the metadata is filled accordingly.
    4. -
    -
    -
    -
    -
    -

    3. Naming conventions

    -

    Using consistent naming conventions is important in scientific research as it helps with the organization and retrieval of data or results. By adopting standardized naming conventions, researchers ensure that files, experiments, or data sets are labeled in a clear, logical manner. This makes it easier to locate and compare similar types of data or results, even when dealing with large datasets or multiple experiments. For instance, in genomics, employing uniform naming conventions for files related to specific experiments or samples allows for swift identification and comparison of relevant data, streamlining the research process and contributing to the reproducibility of findings. This practice promotes efficiency, collaboration, and the integrity of scientific work.

    -
    -

    General tips

    -

    Below you will find a small list of general tips to follow when you name a folder or a file:

    -
      -
    • Use only alphanumeric characters to write a word: a to z and 0 to 9
    • -
    • Avoid special characters: ~!@#$%^&*()`“|
    • -
    • Date format: use YYYYMMDD format. For example: 20230101.
    • -
    • Authors: use initials. For example: JARH
    • -
    • Don’t use spaces! Computers get very confused when you need to point a path to a file and it contains spaces! Instead: +
      + +
      +
      +
      +
      +

      It is time now to customize your Cookiecutter templates and modify the metadata.yml files so that they fit your needs!

      +
        +
    1. Consider changing variables (add/remove) in the metadata.yml file from the cookiecutter template.

      2. +
    3. Modify the cookiecutter.json file. You could add new variables or change the default keys and/or values:

        +
        {
        +"project_name": "myProject",
        +"project_slug": "{{ cookiecutter.project_name.lower().replace(' ', '_').replace('-', '_') }}",
        +"authors": "myName",
        +"start_date": "{% now 'utc', '%Y%m%d' %}",
        +"short_desc": "",
        +"version": "0.1.0"
        +}
        +

        The metadata file will be filled accordingly.

      4. +
      5. Optional: You can customize or remove this prompt message entirely, allowing you to tailor the text to your preferences for a unique experience each time you use the template.

        +
        "__prompts__": {
        +    "project_name": "Project directory name [Example: project_short_description_202X]",
        +    "author": "Author of the project",
        +    "date": "Date of project creation, default is today's date",
        +    "short_description": "Provide a detailed description of the project (context/content)"
        +},
      6. +
      7. Modify the metadata.yml file so that it includes the metadata recorded by the cookiecutter.json file. Hint below:

        +
        project: {{ cookiecutter.project_name }}
        +author: {{ cookiecutter.author }}
        +date: {{ cookiecutter.date }}
        +description: {{ cookiecutter.short_description }}
      8. +
    9. Modify the README.md file so that it includes the short description recorded by the cookiecutter.json file, plus the metadata at the top of the markdown file (between the two lines of dashes, i.e., the YAML front matter).

        +
        ---
        +title: {{ cookiecutter.project_name }}
        +date: "{{ cookiecutter.date }}"
        +author: {{ cookiecutter.author }}
        +version: {{ cookiecutter.version }}
        +---
        +
        +Project description
        +----
        +
        +{{ cookiecutter.short_description }}
      10. +
      11. Commit and push changes when you are done with your modifications

      12. +
        -
      • Separate field sections are separated by underscores _.
      • -
      • Words in each section are written in camelCase. It would look then like this: field1_word1Word2.txt. For example: heatmap_sampleCor_20230101.png. The first field indicates what this file is, i.e., a heatmap. The second field is what is being plotted, i.e., sample correlations; since the field contains two words, they are written in camelCase. The third field is the date when the image was created.
      • -
    • -
    • Use as short fields as possible. You can try to use understandable abbreviations, like LFC for LogFoldChange, Cor for correlations, Dist for distances, etc.
    • -
    • Avoid long names as much as you can, be concise!
    • -
    • Avoid creating many sublevels of folders.
    • -
    • Write down your naming convention pattern and document it in the README file
    • -
    • When using a sequential numbering system, use leading zeros to make sure files are sorted in sequential order. Use 01 instead of just 1 if your sequence only goes up to 99.
    • -
    • Versions should be used as the last element, and use at least two digits with a leading 0 (e.g. v01, v02)
    • +
    • Stage the changes with git add
    • +
  • Commit the changes with a meaningful commit message git commit -m "update cookiecutter template"
    • +
    • Push the changes to your forked repository on Github git push origin main (or the appropriate branch name)
    -
    -
    -

    Suggestions for NGS data

    -

    More info on naming conventions for different types of files and analysis is in development.

    -
    -
    -
    -
| name | description | naming_convention | file format | example |
|---|---|---|---|---|
| .fastq | raw sequencing reads | nan | nan | sampleID_run_read1.fastq |
| .fastqc | quality control from fastqc | nan | nan | sampleID_run_read1.fastqc |
| .bam | aligned reads | nan | nan | sampleID_run_read1.bam |
| GTF | sequence annotation | nan | nan | one of https://www.gencodegenes.org/ |
| GFF | sequence annotation | nan | nan | one of https://www.gencodegenes.org/ |
| .bed | genome locations | nan | nan | nan |
| .bigwig | genome coverage | nan | nan | nan |
| .fasta | sequence data (nucleotide/aminoacid) | nan | nan | one of https://www.gencodegenes.org/ |
| Multiqc report | QC aggregated report | <assayID>_YYYYMMDD.multiqc | multiqc | RNA_20200101.multiqc |
| Count matrix | final count matrix | <assayID>_cm_aligner_YYYYMMDD.tsv | tsv | RNA_cm_salmon_20200101.tsv |
| DEA | differential expression analysis results | DEA_<condition1-condition2>_LFC<absolute_threshold>_p<pvalue decimals>_YYYYMMDD.tsv | tsv | DEA_treat-untreat_LFC1_p01_20200101.tsv |
| DBA | differential binding analysis results | DBA_<condition1-condition2>_LFC<absolute_threshold>_p<pvalue decimals>_YYYYMMDD.tsv | tsv | DBA_treat-untreat_LFC1_p01_20200101.tsv |
| MAplot | MA plot | MAplot_<condition1-condition2>_YYYYMMDD.jpeg | jpeg | MAplot_treat-untreat_20200101.jpeg |
| Heatmap plot | Heatmap plot of anything | heatmap_<type>_YYYYMMDD.jpeg | jpeg | heatmap_sampleCor_20200101.jpeg |
| Volcano plot | Volcano plot | volcano_<condition1-condition2>_YYYYMMDD.jpeg | jpeg | volcano_treat-untreat_20200101.jpeg |
| Venn diagram | Venn diagram | venn_<type>_YYYYMMDD.jpeg | jpeg | venn_consensus_20200101.jpeg |
| Enrichment table | Enrichment results | nan | tsv | nan |
      +
    1. Test your template by using cookiecutter <URL to your GitHub repository "cookicutter-template">

      +

Fill in the variables and verify that the modified information looks as you would expect.

    2. +
    +
    +
    +
    +
    +

    3. Naming conventions

    +

    As discussed in lesson 3, consistent naming conventions are key for interpreting, comparing, and reproducing findings in scientific research. Standardized naming helps organize and retrieve data or results, allowing researchers to locate and compare similar types of data within or across large datasets.

    -
    +
    -Exercise 3: Create your own naming conventions +Exercise 3: Define your file name conventions
    -
    +
    -

    Think about the most common types of files and folders you will be working on, such as visualizations, results tables, processed files, etc. Then come up with a logical and clear way of naming those files using the tips suggested above. Remember to avoid making long and complicated names!

    +

    Avoid long and complicated names and ensure your file names are both informative and easy to manage:

    +
      +
    1. For saving a new plot, a heatmap representing sample correlations
    2. +
    3. When naming the file for the document containing the Research Data Management Course Objectives (Version 2, 2nd May 2024) from the University of Copenhagen
    4. +
    5. Consider the most common file types you work with, such as visualizations, figures, tables, etc., and create logical and clear file names
    6. +
    +
    + +
    +
    +
    +
    +
      +
    1. heatmap_sampleCor_20240101.png
    2. +
    3. KU_RDM-objectives_20240502_v02.doc or KU_RDMObj_20240502_v02.doc
    4. +
    +
    +
    +
    +
    +
    - -
    -

    4. Create a catalog of your assay folder

    +
    +

    4. Create a catalog of your data folder

The next step is to collect all the NGS datasets that you have created in the manner explained above. Since your folders should all contain the metadata.yml file in the same place with the same metadata, it should be very easy to iteratively go through all the folders and merge all the metadata.yml files into a single table. This table can then be browsed easily, for example with Microsoft Excel. If you are interested in making a Shiny app or Python Panel tool to interactively browse the catalog, check out this lesson.

    -
    +
    @@ -2528,45 +945,55 @@

    4. C

    -
    +

    We will make a small script in R (or you can make one with Python) that recursively goes through all the folders inside an input path (like your Assays folder), fetches all the metadata.yml files, and merges them. Finally, it will write a TSV file as an output.

      -
    1. Create a folder called Assays
    2. -
    3. Under that folder, make three new Assay folders from your cookiecutter template
    4. -
    5. Run the script below with R (or create your own with Python). Modify the folder_path variable so it matches the path to the folder Assays. The table will be written under the same folder_path.
    6. -
    7. Visualize your Assays table with Excel
    8. +
    9. Create a folder called dataset and change directory cd dataset
    10. +
    11. Fork this repository: a Cookiecutter template designed for NGS datasets. While you are welcome to create your own template from scratch, we recommend using this one to save time.
    12. +
    13. Run the cookiecutter cc-data-template command at least twice to create multiple datasets or projects. Use different values each time to simulate various scenarios (do this in the dataset directory that you have previously created). Execute the script below using R (or create your own script in Python; a Python sketch is included after the R script below). Adjust the folder_path variable so that it matches the path to the dataset folder. The resulting table will be saved in the same folder_path.
    14. +
    15. Open your database_YYYYMMDD.tsv table in a text editor from the command-line, or view it in Excel for better visualization.
    -
    
    -library(yaml)
    -library(dplyr)
    -library(lubridate)
    -
    -# Function to recursively fetch metadata.yml files
    -get_metadata <- function(folder_path) {
    -    file_list <- list.files(path = folder_path, pattern = "metadata\\.yml$", recursive = TRUE, full.names = TRUE)
    -    metadata_list <- lapply(file_list, yaml::yaml.load_file)
    -    return(metadata_list)
    -    }
    -
    -# Specify the folder path
    -    folder_path <- "/path/to/your/folder"
    -
    -    # Fetch metadata from the specified folder
    -    metadata <- get_metadata(folder_path)
    -
    -    # Convert metadata to a data frame
    -    metadata_df <- data.frame(matrix(unlist(metadata), ncol = length(metadata), byrow = TRUE))
    -    colnames(metadata_df) <- names(metadata[[1]])
    -
    -    # Save the data frame as a TSV file
    -    output_file <- paste0("database_", format(Sys.Date(), "%Y%m%d"), ".tsv")
    -    write.table(metadata_df, file = output_file, sep = "\t", quote = FALSE, row.names = FALSE)
    -
    -    # Print confirmation message
    -    cat("Database saved as", output_file, "\n")
    +
    
    +library(yaml)
    +library(dplyr)
    +library(lubridate)
    +
    +# Function to read a YAML file and transform it into a dataframe format.
    +read_yaml <- function(file_path) {
    +  # Read the YAML file and convert it to a data frame
    +  df <- yaml::yaml.load_file(file_path) %>% as.data.frame(stringsAsFactors = FALSE)
    +  
    +  # Return the data frame
    +  return(df)
    +}
    +
    +# Function to recursively fetch metadata.yml files
    +get_metadata <- function(folder_path) {
    +  file_list <- list.files(path = folder_path, pattern = "metadata\\.yml$", recursive = TRUE, full.names = TRUE)
    +
    +  metadata_list <- lapply(file_list, read_yaml)
    +  
    +  # Combine the list of data frames into a single data frame using dplyr::bind_rows()
    +  combined_metadata <- bind_rows(metadata_list)
    +
    +  return(combined_metadata)
    +}
    +
    +# Specify the folder path
    +folder_path <- "/path/to/your/folder"
    +
    +# Fetch metadata from the specified folder
    +metadata <- get_metadata(folder_path)
    +
    +# Save the data frame as a TSV file
    +output_file <- paste0("database_", format(Sys.Date(), "%Y%m%d"), ".tsv")
    +write.table(metadata, file = output_file, sep = "\t", quote = FALSE, row.names = FALSE)
    +
    +# Print confirmation message
    +cat("Database saved as", output_file, "\n")
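The exercise suggests writing your own version in Python instead of R. Below is a minimal sketch of such a script; it mirrors the logic of the R code above and assumes the pandas and PyYAML packages are installed (the folder path is a placeholder).

```python
from datetime import date
from pathlib import Path

import pandas as pd
import yaml  # PyYAML

# Function to recursively fetch metadata.yml files and combine them into one table
def get_metadata(folder_path):
    records = []
    for yml in Path(folder_path).rglob("metadata.yml"):
        with open(yml) as fh:
            records.append(yaml.safe_load(fh) or {})
    return pd.DataFrame(records)

# Specify the folder path
folder_path = "/path/to/your/folder"

# Fetch metadata from the specified folder
metadata = get_metadata(folder_path)

# Save the data frame as a TSV file
output_file = f"database_{date.today():%Y%m%d}.tsv"
metadata.to_csv(output_file, sep="\t", index=False)

# Print confirmation message
print("Database saved as", output_file)
```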
    @@ -2615,7 +1042,7 @@

    GitHub Pages

Once you have created your repository (and put it on GitHub), you now have the opportunity to add the data analysis reports that you created, whether Jupyter Notebooks, R Markdown documents, or HTML reports, to a GitHub Pages website. Creating a GitHub Page is very simple, and we really recommend that you follow the nice tutorial that GitHub has put together for you. Nonetheless, we will see the main steps in the exercise below.

There are many different ways to create your web pages. We recommend using MkDocs and MkDocs Material as a framework to create a nice webpage with little effort. The folder templates that we used as an example in the previous exercise already contain everything you need to start a webpage. Nonetheless, you will need to understand the basics of MkDocs and MkDocs Material to design a webpage to your liking. MkDocs is a static webpage generator that is very easy to use, while MkDocs Material is an extension of the tool that gives you many more options to customize your website. Check out their web pages to get started!

    -
    +
    @@ -2624,7 +1051,7 @@

    GitHub Pages

    -
    +
    @@ -2683,7 +1110,7 @@

    Zenodo

Zenodo (https://zenodo.org/) is an open-access digital repository designed to facilitate the archiving of scientific research outputs. It operates under the umbrella of the European Organization for Nuclear Research (CERN) and is supported by the European Commission. Zenodo accommodates a broad spectrum of research outputs, including datasets, papers, software, and multimedia files. This versatility makes it an invaluable resource for researchers across a wide array of domains, promoting transparency, collaboration, and the advancement of knowledge on a global scale.

    Operating on a user-friendly web platform, Zenodo allows researchers to easily upload, share, and preserve their research data and related materials. Upon deposit, each item is assigned a unique Digital Object Identifier (DOI), granting it a citable status and ensuring its long-term accessibility. Additionally, Zenodo provides robust metadata capabilities, enabling researchers to enrich their submissions with detailed contextual information. In addition, it allows you to link your GitHub account, providing a streamlined way to archive a specific release of your GitHub repository directly into Zenodo. This integration simplifies the process of preserving a snapshot of your project’s progress for long-term accessibility and citation.

    -
    +
    @@ -2692,7 +1119,7 @@

    Zenodo

    -
    +
    diff --git a/index.html b/index.html index 6746e252..e9c0954d 100644 --- a/index.html +++ b/index.html @@ -164,7 +164,7 @@

    Computational Research Data Management

    Modified
    -

    April 25, 2024

    +

    April 26, 2024

    diff --git a/practical_workflows.html b/practical_workflows.html index 4c13ac2f..1c9788e3 100644 --- a/practical_workflows.html +++ b/practical_workflows.html @@ -190,7 +190,7 @@
    Modified
    -

    April 25, 2024

    +

    April 26, 2024

    diff --git a/search.json b/search.json index 5e797b26..3f7fe3e4 100644 --- a/search.json +++ b/search.json @@ -58,28 +58,28 @@ "href": "develop/practical_workshop.html#organize-and-structure-your-datasets-and-data-analysis", "title": "Practical material", "section": "1. Organize and structure your datasets and data analysis", - "text": "1. Organize and structure your datasets and data analysis\nEstablishing a consistent file structure and naming conventions will help you efficiently manage your data. We will classify your data and data analyses into two distinct types of folders to ensure the data can be used and shared by many lab members while preventing modifications by any individual:\n\nData folders (assay or external databases and resources): They house the raw and processed datasets, alongside the pipeline/workflow used to generate the processed data, the provenance of the raw data, and quality control reports of the data. The data should be locked and set to read-only to prevent unintended modifications. This applies to experimental data generated in your lab as well as external resources. Provide an MD5 checksum file when you download them yourself to verify their integrity.\nProject folders: They contain all the essential files for a specific research project. Projects may use data from various resources or experiments, or build upon previous results from other projects. The data should not be copied or duplicated, instead, it should be linked directly from the source.\n\nData and data analysis are kept separate because a project may utilize one or more datasets to address a scientific question. Data can be reused in multiple projects over time, combined with other datasets for comparison, or used to build larger datasets. Additionally, data may be utilized by different researchers to answer various research questions.\n\n\n\n\n\n\nHint\n\n\n\n\n\n\n\nWhen organizing your data folders, separate assays from external resources and maintain a consistent structure. For example, organize genome references by species and further categorize them by versions. Make sure to include all relevant information, and refer to this lesson for additional tips on data organization.\nThis will help you to keep your data tidied up, especially if you are working in a big lab where assays may be used for different purposes and by different people!\n\n\n\n\n\n\nData folders\nWhether your lab generates its own experimental data, receives it from collaborators, or works with previously published datasets, the data folder should follow a similar structure to the one presented here. Create a separate folder for each dataset, including raw files and processed files alongside the corresponding documentation and pipeline that generated the processed data. Raw files should remain untouched, and you should consider locking modifications to the final results once data preprocessing is complete. This precaution helps prevent unwanted changes to the data. Each subfolder should be named in a way that is distinct, easily readable and clear at a glance. 
Check this lesson for tips on naming conventions.\n\n\n\n\n\n\nHint\n\n\n\n\n\n\n\nUse an acronym (1) that describes the type of NGS assay (RNAseq, ChIPseq, ATACseq) a keyword (2) that represents a unique element to that assay, and the date (3).\n<Assay-ID>_<keyword>_YYYYMMDD\nFor example CHIP_Oct4_20230101 is a ChIPseq assay made on 1st January 2023 with the keyword Oct4, so it is easily identifiable by the eye.\n\n\n\n\n\nLet’s explore a potential folder structure and the types of files you might encounter within it.\n<data_type>_<keyword>_YYYYMMDD/\n├── README.md \n├── CHECKSUMS\n├── pipeline\n ├── pipeline.md\n ├── scripts/\n├── processed\n ├── fastqc/\n ├── multiqc/\n ├── final_fastq/\n└── raw\n ├── .fastq.gz \n └── samplesheet.csv\n\nREADME.md: This file contains a detailed description of the dataset commonly in markdown format. It should include the provenance of the raw data (such as samples, laboratory protocols used, the aim of the project, folder structure, naming conventions, etc.).\nmetadata.yml: This metadata file outlines different keys and essential information, usually presented in YAML format. For more details, refer to this lesson.\npipeline.md: This file provides an overview of the pipeline used to process raw data, as well as the commands to run the pipeline. The pipeline itself and all the required scripts should be collected in the same directory.\nprocessed: This folder contains the results from the preprocessing pipeline. The content vary depending on the specific pipeline used (create additional subdirectories as needed).\nraw: This folder holds the raw data.\n\n.fastq.gz: For example, in NGS assays, there should be ‘fastq’ files.\nsamplesheet.csv: This file holds essential metadata for the samples, including sample identification, experimental variables, batch information, and other metrics crucial for downstream analysis. It is important that this file is complete and current, as it is key to interpreting results. If you are considering running nf-core pipelines, this file will be required.\n\n\n\n\nProject folders\nOn the other hand, we have another type of folder called Projects which refers to data analyses that are specific to particular tasks, such as those involved in preparing a potential article. In this folder, you will create a subfolder for each project that you or your lab is working on. Each Project subfolder should include project-specific information, data analysis pipelines, notebooks, and scripts used for that particular project. Additionally, you should include an environment file with all the required software and dependencies needed for the project, including their versions. This helps ensure that the analyses can be easily replicated and shared with others.\nThe Project folder should be named in a way that is unique, easy to read, distinguishable, and clear at a glance. 
For example, you might name it based on the main author’s initials, the dataset being analyzed, the project name, a unique descriptive element related to the project, or the part of the project you are responsible for, along with the date:\n<project>_<keyword>_YYYYMMDD\n\n\n\n\n\n\nNaming examples\n\n\n\n\n\n\n\n\nRNASeq_Mouse_Brain_20230512: a project RNA sequencing data from a mouse brain experiment, created on May 12, 2023\nEHR_COVID19_Study_20230115: a project around electronic health records data for a COVID-19 study, created on January 15, 2023.\n\n\n\n\n\n\nNow, let’s explore an example of a folder structure and the types of files you might encounter within it.\n<project>_<keyword>_YYYYMMDD\n├── data\n│ └── <ID>_<keyword>_YYYYMMDD <- symbolic link\n├── documents\n│ └── research_project_template.docx\n├── metadata.yml\n├── notebooks\n│ └── 01_data_processing.rmd\n│ └── 02_data_analysis.rmd\n│ └── 03_data_visualization.rmd\n├── README.md\n├── reports\n│ └── 01_data_processing.html\n│ └── 02_data_analysis.html\n│ ├── 03_data_visualization.html\n│ │ └── figures\n│ │ └── tables\n├── requirements.txt // env.yaml\n├── results\n│ ├── figures\n│ │ └── 02_data_analysis/\n│ │ └── heatmap_sampleCor_20230102.png\n│ ├── tables\n│ │ └── 02_data_analysis/\n│ │ └── DEA_treat-control_LFC1_p01.tsv\n│ │ └── SumStats_sampleCor_20230102.tsv\n├── pipeline\n│ ├── rules // processes \n│ │ └── step1_data_processing.smk\n│ └── pipeline.md\n├── scratch\n└── scripts\n\ndata: This folder contains symlinks or shortcuts to the actual data files, ensuring that the original files remain unaltered.\ndocuments: This folder houses Word documents, slides, or PDFs associated with the project, including data and project explanations, research papers, and more. It also includes the Data Management Plan.\n\nresearch_project_template.docx. If you download our template you will find a is a pre-filled Data Management Plan based on the Horizon Europe guidelines named ‘Non-sensitive_NGS_research_project_template.docx’.\n\nmetadata.yml: metadata file describing various keys of the project or experiment (see this lesson).\nnotebooks: This folder stores Jupyter, R Markdown, or Quarto notebooks containing the data analysis. Figures and tables used for the reports are organized under subfolders named after the notebook that created them for provenance purposes.\nREADME.md: A detailed project description in markdown or plain-text format.\nreports: Notebooks rendered as HTML, docx, or PDF files for sharing with colleagues or as formal data analysis reports.\n\nfigures: figures produced upon rendering notebooks. The figures will be saved under a subfolder named after the notebook that created them. This is for provenance purposes so we know which notebook created which figures.\n\nrequirements.txt: This file lists the necessary software, libraries, and their versions required to reproduce the code. If you’re using conda environments, you will also find the env.yaml file here, which outlines the specific environment configuration.\nresults: This folder contains analysis results, such as figures and tables. 
Organizing results by the pipeline, script, or notebook that generated them will make it easier to locate and interpret the data.\npipeline: A folder containing pipeline scripts or workflows for processing and analyzing data.\nscratch: A folder designated for temporary files or workspace for experiments and development.\nscripts: Folder for helper scripts needed to run data analysis or reproduce the work.\n\n\n\nTemplate engine\nCreating a folder template is straightforward with cookiecutter a command-line tool that generates projects from templates (called cookiecutters). For example, it can help you set up a Python package project based on a Python package project template.\n\n\n\n\n\n\nCookiecutter templates\n\n\n\nHere are some template that you can use to get started, adapt and modify them to your own needs:\n\nPython package project\nSandbox test\nData science\nNGS data\n\nCreate your own template from scratch.\n\n\n\nQuick tutorial on cookiecutter\nBuilding a Cookiecutter template from scratch requires defining a folder structure, crafting a cookiecutter.json file, and outlining placeholders (keywords) that will be substituted when generating a new project. Here’s a step-by-step guide on how to proceed:\n\nStep 1: Create a Folder Template\nFirst, begin by creating a folder structure that aligns with your desired template design. For instance, let’s set up a simple Python project template:\nmy_template/\n|-- {{cookiecutter.project_name}}\n| |-- main.py\n|-- tests\n| |-- test_{{cookiecutter.project_name}}.py\n|-- README.md\nIn this example, {cookiecutter.project_name} is a placeholder that will be replaced with the actual project name when the template is used. This directory contains a python script (‘main.py’), a subdirectory (‘tests’) with a second python script named after the project (‘test_{{cookiecutter.project_name}}.py’) and a ‘README.md’ file.\n\n\nStep 2: Create cookiecutter.json\nIn the root of your template folder, create a file named cookiecutter.json. This file will define the variables (keywords) that users will be prompted to fill in. For our Python project template, it might look like this:\n{\n \"project_name\": \"MyProject\",\n \"author_name\": \"Your Name\",\n \"description\": \"A short description of your project\"\n}\nWhen users generate a project based on your template, they will be prompted with these questions. The provided values (“responses”) will be used to substitute the placeholders in your template files.\nBeyond substituting placeholders in file and directory names, Cookiecutter can automatically populate text file contents with information. This feature is useful for offering default configurations or code file templates. Let’s enhance our earlier example by incorporating a placeholder within a text file:\nFirst, modify the my_template/main.py file to include a placeholder inside its contents:\n# main.py\n\ndef hello():\n print(\"Hello, {{cookiecutter.project_name}}!\")\nThe ‘{{cookiecutter.project_name}}’ placeholder is now included within the main.py file. When you execute Cookiecutter, it will automatically replace the placeholders in both file and directory names and within text file contents.\nAfter running Cookiecutter, your generated ‘main.py’ file could appear as follows:\n# main.py\n\ndef hello():\n print(\"Hello, MyProject!\") # Assuming \"MyProject\" was entered as the project_name\n\n\nStep 3: Use Cookiecutter\nOnce your template is prepared, you can utilize Cookiecutter to create a project from it. 
Open a terminal and execute:\ncookiecutter path/to/your/template\nCookiecutter will prompt you to provide values for project_name, author_name, and description. Once you input these values, Cookiecutter will replace the placeholders in your template files with the entered values.\n\n\nStep 4: Review the Generated Project\nAfter the generation process is complete, navigate to the directory where Cookiecutter created the new project. You will find a project structure with the placeholders replaced by the values you provided.\n\n\n\n\n\n\nExercise 1: Create your own template\n\n\n\n\n\n\n\nUse Cookiecutter to create custom templates for your folders. You can do it from scratch (see Exercise 1, part B) or opt for one of our pre-made templates available as a Github repository (recommended for this workshop). Feel free to tailor the template to your specific requirements—you don’t have to follow our examples exactly.\nRequirements We assume you have already gone through the requirements at the beginning of the practical lesson. This includes installing the necessary tools and setting up accounts as needed.\nProject\n\nGo to our Cookicutter template and click on the **Fork*\n\n\nbutton at the top-right corner of the repository page to create a copy of the repository on your own GitHub account or organization. \n\n\nOpen a terminal on your computer, copy the URL of your fork and clone the repository to your local machine (the URL should look something like https://github.com/your_username/cookiecutter-template):\n\ngit clone <your URL to the template>\nIf you have a GitHub Desktop, click Add and select “Clone repository” from the options 3. Open the repository and navigate through the different directories 4. Modify the contents of the repository as needed to fit your project’s requirements. You can change files, add new ones. remove existing one or adjust the folder structure. For inspiration, review the data structure above under ‘Project folder’. For instance, this template is missing the ‘reports’ directory. Consider creating it, along with a subdirectory named ‘figures’. Here’s an example of how to do it:\ncd \\{\\{\\ cookiecutter.project_name\\ \\}\\}/ \nmkdir reports \ntouch requirements.txt\n\nModify the cookiecutter.json file. 
You could add new variables or change the default values:\n\n# open a text editor\n \"author\": \"Alba Refoyo\",\n\nCommit and push changes when you are done with your modifications\n\n\nStage the changes with ‘git add’\nCommit the changes with a meaningful commit message ‘git commit -m “update cookicutter template”’\nPush the changes to your forked repository on Github ‘git push origin main’ (or the appropriate branch name)\n\n\nTest your template by using cookiecutter <URL to your GitHub repository \"cookicutter-template\"> Fill up the variables and verify that the modified template looks like you would expect.\n\nOptional: You can customize or remove this prompt message entirely, allowing you to tailor the text to your preferences for a unique experience each time you use the template.\n\n\"__prompts__\": {\n \"project_name\": \"Project directory name [Example: project_short_description_202X]\",\n \"author\": \"Author of the project\",\n \"date\": \"Date of project creation, default is today's date\",\n \"short_description\": \"Provide a detailed description of the project (context/content)\"\n },\n\n\n\n\n\n\n\n\n\n\n\nOptional Exercise 1, part B\n\n\n\n\n\n\n\nCreate a template from scratch using this tutorial scratch, it can be as basic as this one below or ‘Data folder’:\nmy_template/\n|-- {{cookiecutter.project_name}}\n| |-- main.py\n|-- tests\n| |-- test_{{cookiecutter.project_name}}.py\n|-- README.md\n\nStep 1: Create a directory for the template.\nStep 2: Write a cookiecutter.json file with variables such as project_name and author.\nStep 3: Set up the folder structure by creating subdirectories and files as needed.\nStep 4: Incorporate cookiecutter variables in the names of files.\nStep 5: Use cookiecutter variables within scripts, such as printing a message that includes the project name." + "text": "1. Organize and structure your datasets and data analysis\nEstablishing a consistent file structure and naming conventions will help you efficiently manage your data. We will classify your data and data analyses into two distinct types of folders to ensure the data can be used and shared by many lab members while preventing modifications by any individual:\n\nData folders (assay or external databases and resources): They house the raw and processed datasets, alongside the pipeline/workflow used to generate the processed data, the provenance of the raw data, and quality control reports of the data. The data should be locked and set to read-only to prevent unintended modifications. This applies to experimental data generated in your lab as well as external resources. Provide an MD5 checksum file when you download them yourself to verify their integrity.\nProject folders: They contain all the essential files for a specific research project. Projects may use data from various resources or experiments, or build upon previous results from other projects. The data should not be copied or duplicated, instead, it should be linked directly from the source.\n\nData and data analysis are kept separate because a project may utilize one or more datasets to address a scientific question. Data can be reused in multiple projects over time, combined with other datasets for comparison, or used to build larger datasets. Additionally, data may be utilized by different researchers to answer various research questions.\n\n\n\n\n\n\nHint\n\n\n\n\n\n\n\nWhen organizing your data folders, separate assays from external resources and maintain a consistent structure. 
For example, organize genome references by species and further categorize them by versions. Make sure to include all relevant information, and refer to this lesson for additional tips on data organization.\nThis will help you to keep your data tidied up, especially if you are working in a big lab where assays may be used for different purposes and by different people!\n\n\n\n\n\n\nData folders\nWhether your lab generates its own experimental data, receives it from collaborators, or works with previously published datasets, the data folder should follow a similar structure to the one presented here. Create a separate folder for each dataset, including raw files and processed files alongside the corresponding documentation and pipeline that generated the processed data. Raw files should remain untouched, and you should consider locking modifications to the final results once data preprocessing is complete. This precaution helps prevent unwanted changes to the data. Each subfolder should be named in a way that is distinct, easily readable and clear at a glance. Check this lesson for tips on naming conventions.\n\n\n\n\n\n\nHint\n\n\n\n\n\n\n\nUse an acronym (1) that describes the type of NGS assay (RNAseq, ChIPseq, ATACseq) a keyword (2) that represents a unique element to that assay, and the date (3).\n<Assay-ID>_<keyword>_YYYYMMDD\nFor example CHIP_Oct4_20230101 is a ChIPseq assay made on 1st January 2023 with the keyword Oct4, so it is easily identifiable by the eye.\n\n\n\n\n\nLet’s explore a potential folder structure and the types of files you might encounter within it.\n<data_type>_<keyword>_YYYYMMDD/\n├── README.md \n├── CHECKSUMS\n├── pipeline\n ├── pipeline.md\n ├── scripts/\n├── processed\n ├── fastqc/\n ├── multiqc/\n ├── final_fastq/\n└── raw\n ├── .fastq.gz \n └── samplesheet.csv\n\nREADME.md: This file contains a detailed description of the dataset commonly in markdown format. It should include the provenance of the raw data (such as samples, laboratory protocols used, the aim of the project, folder structure, naming conventions, etc.).\nmetadata.yml: This metadata file outlines different keys and essential information, usually presented in YAML format. For more details, refer to this lesson.\npipeline.md: This file provides an overview of the pipeline used to process raw data, as well as the commands to run the pipeline. The pipeline itself and all the required scripts should be collected in the same directory.\nprocessed: This folder contains the results from the preprocessing pipeline. The content vary depending on the specific pipeline used (create additional subdirectories as needed).\nraw: This folder holds the raw data.\n\n.fastq.gz: For example, in NGS assays, there should be ‘fastq’ files.\nsamplesheet.csv: This file holds essential metadata for the samples, including sample identification, experimental variables, batch information, and other metrics crucial for downstream analysis. It is important that this file is complete and current, as it is key to interpreting results. If you are considering running nf-core pipelines, this file will be required.\n\n\n\n\nProject folders\nOn the other hand, we have another type of folder called Projects which refers to data analyses that are specific to particular tasks, such as those involved in preparing a potential article. In this folder, you will create a subfolder for each project that you or your lab is working on. 
Each Project subfolder should include project-specific information, data analysis pipelines, notebooks, and scripts used for that particular project. Additionally, you should include an environment file with all the required software and dependencies needed for the project, including their versions. This helps ensure that the analyses can be easily replicated and shared with others.\nThe Project folder should be named in a way that is unique, easy to read, distinguishable, and clear at a glance. For example, you might name it based on the main author’s initials, the dataset being analyzed, the project name, a unique descriptive element related to the project, or the part of the project you are responsible for, along with the date:\n<project>_<keyword>_YYYYMMDD\n\n\n\n\n\n\nNaming examples\n\n\n\n\n\n\n\n\nRNASeq_Mouse_Brain_20230512: a project RNA sequencing data from a mouse brain experiment, created on May 12, 2023\nEHR_COVID19_Study_20230115: a project around electronic health records data for a COVID-19 study, created on January 15, 2023.\n\n\n\n\n\n\nNow, let’s explore an example of a folder structure and the types of files you might encounter within it.\n<project>_<keyword>_YYYYMMDD\n├── data\n│ └── <ID>_<keyword>_YYYYMMDD <- symbolic link\n├── documents\n│ └── research_project_template.docx\n├── metadata.yml\n├── notebooks\n│ └── 01_data_processing.rmd\n│ └── 02_data_analysis.rmd\n│ └── 03_data_visualization.rmd\n├── README.md\n├── reports\n│ └── 01_data_processing.html\n│ └── 02_data_analysis.html\n│ ├── 03_data_visualization.html\n│ │ └── figures\n│ │ └── tables\n├── requirements.txt // env.yaml\n├── results\n│ ├── figures\n│ │ └── 02_data_analysis/\n│ │ └── heatmap_sampleCor_20230102.png\n│ ├── tables\n│ │ └── 02_data_analysis/\n│ │ └── DEA_treat-control_LFC1_p01.tsv\n│ │ └── SumStats_sampleCor_20230102.tsv\n├── pipeline\n│ ├── rules // processes \n│ │ └── step1_data_processing.smk\n│ └── pipeline.md\n├── scratch\n└── scripts\n\ndata: This folder contains symlinks or shortcuts to the actual data files, ensuring that the original files remain unaltered.\ndocuments: This folder houses Word documents, slides, or PDFs associated with the project, including data and project explanations, research papers, and more. It also includes the Data Management Plan.\n\nresearch_project_template.docx. If you download our template you will find a is a pre-filled Data Management Plan based on the Horizon Europe guidelines named ‘Non-sensitive_NGS_research_project_template.docx’.\n\nmetadata.yml: metadata file describing various keys of the project or experiment (see this lesson).\nnotebooks: This folder stores Jupyter, R Markdown, or Quarto notebooks containing the data analysis. Figures and tables used for the reports are organized under subfolders named after the notebook that created them for provenance purposes.\nREADME.md: A detailed project description in markdown or plain-text format.\nreports: Notebooks rendered as HTML, docx, or PDF files for sharing with colleagues or as formal data analysis reports.\n\nfigures: figures produced upon rendering notebooks. The figures will be saved under a subfolder named after the notebook that created them. This is for provenance purposes so we know which notebook created which figures.\n\nrequirements.txt: This file lists the necessary software, libraries, and their versions required to reproduce the code. 
If you’re using conda environments, you will also find the env.yaml file here, which outlines the specific environment configuration.\nresults: This folder contains analysis results, such as figures and tables. Organizing results by the pipeline, script, or notebook that generated them will make it easier to locate and interpret the data.\npipeline: A folder containing pipeline scripts or workflows for processing and analyzing data.\nscratch: A folder designated for temporary files or workspace for experiments and development.\nscripts: Folder for helper scripts needed to run data analysis or reproduce the work.\n\n\n\nTemplate engine\nCreating a folder template is straightforward with cookiecutter a command-line tool that generates projects from templates (called cookiecutters). For example, it can help you set up a Python package project based on a Python package project template.\n\n\n\n\n\n\nCookiecutter templates\n\n\n\nHere are some template that you can use to get started, adapt and modify them to your own needs:\n\nPython package project\nSandbox test\nData science\nNGS data\n\nCreate your own template from scratch.\n\n\n\nQuick tutorial on cookiecutter\nBuilding a Cookiecutter template from scratch requires defining a folder structure, crafting a cookiecutter.json file, and outlining placeholders (keywords) that will be substituted when generating a new project. Here’s a step-by-step guide on how to proceed:\n\nStep 1: Create a Folder Template\nFirst, begin by creating a folder structure that aligns with your desired template design. For instance, let’s set up a simple Python project template:\nmy_template/\n|-- {{cookiecutter.project_name}}\n| |-- main.py\n|-- tests\n| |-- test_{{cookiecutter.project_name}}.py\n|-- README.md\nIn this example, {cookiecutter.project_name} is a placeholder that will be replaced with the actual project name when the template is used. This directory contains a python script (‘main.py’), a subdirectory (‘tests’) with a second python script named after the project (‘test_{{cookiecutter.project_name}}.py’) and a ‘README.md’ file.\n\n\nStep 2: Create cookiecutter.json\nIn the root of your template folder, create a file named cookiecutter.json. This file will define the variables (keywords) that users will be prompted to fill in. For our Python project template, it might look like this:\n{\n \"project_name\": \"MyProject\",\n \"author_name\": \"Your Name\",\n \"description\": \"A short description of your project\"\n}\nWhen users generate a project based on your template, they will be prompted with these questions. The provided values (“responses”) will be used to substitute the placeholders in your template files.\nBeyond substituting placeholders in file and directory names, Cookiecutter can automatically populate text file contents with information. This feature is useful for offering default configurations or code file templates. Let’s enhance our earlier example by incorporating a placeholder within a text file:\nFirst, modify the my_template/main.py file to include a placeholder inside its contents:\n# main.py\n\ndef hello():\n print(\"Hello, {{cookiecutter.project_name}}!\")\nThe ‘{{cookiecutter.project_name}}’ placeholder is now included within the main.py file. 
When you execute Cookiecutter, it will automatically replace the placeholders in both file and directory names and within text file contents.\nAfter running Cookiecutter, your generated ‘main.py’ file could appear as follows:\n# main.py\n\ndef hello():\n print(\"Hello, MyProject!\") # Assuming \"MyProject\" was entered as the project_name\n\n\nStep 3: Use Cookiecutter\nOnce your template is prepared, you can utilize Cookiecutter to create a project from it. Open a terminal and execute:\ncookiecutter path/to/your/template\nCookiecutter will prompt you to provide values for project_name, author_name, and description. Once you input these values, Cookiecutter will replace the placeholders in your template files with the entered values.\n\n\nStep 4: Review the Generated Project\nAfter the generation process is complete, navigate to the directory where Cookiecutter created the new project. You will find a project structure with the placeholders replaced by the values you provided.\n\n\n\n\n\n\nExercise 1: Create your own template\n\n\n\n\n\n\n\nUse Cookiecutter to create custom templates for your folders. You can do it from scratch (see Exercise 1, part B) or opt for one of our pre-made templates available as a GitHub repository (recommended for this workshop). Feel free to tailor the template to your specific requirements—you don’t have to follow our examples exactly.\nRequirements\nWe assume you have already gone through the requirements at the beginning of the practical lesson. This includes installing the necessary tools and setting up accounts as needed.\nProject\n\nGo to our Cookiecutter template and click on the Fork button at the top-right corner of the repository page to create a copy of the repository on your own GitHub account or organization. \nOpen a terminal on your computer, copy the URL of your fork and clone the repository to your local machine (the URL should look something like https://github.com/your_username/cookiecutter-template):\ngit clone <your URL to the template>\nIf you have GitHub Desktop, click Add and select “Clone repository” from the options\nOpen the repository and navigate through the different directories\nModify the contents of the repository as needed to fit your project’s requirements. You can change files, add new ones, remove existing ones, or adjust the folder structure. For inspiration, review the data structure above under ‘Project folder’. For instance, this template is missing the ‘reports’ directory and the ‘requirements.txt’ file. Consider creating them, along with a subdirectory named ‘reports/figures’.\n├── reports/\n│ ├── figures/\n├── requirements.txt\nHere’s an example of how to do it:\n# Open your terminal and navigate to your template directory. Then: \ncd \\{\\{\\ cookiecutter.project_name\\ \\}\\}/ \nmkdir -p reports/figures \ntouch requirements.txt\nCommit and push changes when you are done with your modifications\n\n\nStage the changes with git add\nCommit the changes with a meaningful commit message git commit -m \"update cookiecutter template\"\nPush the changes to your forked repository on GitHub git push origin main (or the appropriate branch name)\n\n\nTest your template by using cookiecutter <URL to your GitHub repository \"cookiecutter-template\">\nFill up the variables and verify that the new structure (and folders) looks like you would expect. 
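For instance, a test run might look roughly like the following (the URL, the variables, and the answers are illustrative only, and the exact prompt format depends on your cookiecutter version and on your own cookiecutter.json):\ncookiecutter https://github.com/your_username/cookiecutter-template\nproject_name [myProject]: RNASeq_mouseBrain_20240426\nauthor [myName]: JARH\ndate [20240426]:\nshort_description []: RNAseq analysis of mouse brain samples\n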
Have any new folders been added, or have some been removed?\n\n\n\n\n\n\n\n\n\n\n\n\nOptional Exercise 1, part B\n\n\n\n\n\n\n\nCreate a template from scratch using this tutorial scratch, it can be as basic as this one below or ‘Data folder’:\nmy_template/\n|-- {{cookiecutter.project_name}}\n| |-- main.py\n|-- tests\n| |-- test_{{cookiecutter.project_name}}.py\n|-- README.md\n\nStep 1: Create a directory for the template.\nStep 2: Write a cookiecutter.json file with variables such as project_name and author.\nStep 3: Set up the folder structure by creating subdirectories and files as needed.\nStep 4: Incorporate cookiecutter variables in the names of files.\nStep 5: Use cookiecutter variables within scripts, such as printing a message that includes the project name." }, { - "objectID": "develop/practical_workshop.html#metadata", - "href": "develop/practical_workshop.html#metadata", + "objectID": "develop/practical_workshop.html#data-documentation", + "href": "develop/practical_workshop.html#data-documentation", "title": "Practical material", - "section": "2. Metadata", - "text": "2. Metadata\nMetadata is the behind-the-scenes information that makes sense of data and gives context and structure. For biodata, metadata includes information such as when and where the data was collected, what it represents, and how it was processed. Let’s check what kind of relevant metadata is available for NGS data and how to capture it in your Assay or Project folders. Both of these folders contain a metadata.yml file and a README.md file. In this section, we will check what kind of information you should collect in each of these files.\n\n\n\n\n\n\nMetadata and controlled vocabularies\n\n\n\nIn order for metadata to be most useful, you should try to use controlled vocabularies for all your fields. For example, tissue could be described with the UBERON ontologies, species using the NCBI taxonomy, diseases using the Mondo database, etc. Unfortunately, implementing a systematic way of using these vocabularies is rather complex and outside the scope of this workshop, but you are very welcome to try to implement them on your own!\n\n\n\nREADME.md file\nThe README.md file is a markdown file that allows you to write a long description of the data placed in a folder. Since it is a markdown file, you are able to write in rich text format (bold, italic, include links, etc) what is inside the folder, why it was created/collected, and how and when. If it is an Assay folder, you could include the laboratory protocol used to generate the samples, images explaining the experiment design, a summary of the results of the experiment, and any sort of comments that would help to understand the context of the experiment. On the other hand, a ‘Project’ README file may contain a description of the project, what are its aims, why is it important, what ‘Assays’ is it using, how to interpret the code notebooks, a summary of the results and, again, any sort of comments that would help to understand the project.\nHere is an example of a README file for a Project folder:\n# NGS Analysis Project: Exploring Gene Expression in Human Tissues\n\n## Aims\n\nThis project aims to investigate gene expression patterns across various human tissues using Next Generation Sequencing (NGS) data. 
By analyzing the transcriptomes of different tissues, we seek to uncover tissue-specific gene expression profiles and identify potential markers associated with specific biological functions or diseases.\n\n## Why It's Important\n\nUnderstanding tissue-specific gene expression is crucial for deciphering the molecular basis of health and disease. Identifying genes that are uniquely expressed in certain tissues can provide insights into tissue function, development, and potential therapeutic targets. This project contributes to our broader understanding of human biology and has implications for personalized medicine and disease research.\n\n## Datasets\n\nWe have used internal datasets with IDs: RNA_humanSkin_20201030, RNA_humanBrain_20210102, RNA_humanLung_20220304.\n\nIn addition, we utilized publicly available NGS datasets from the GTEx (Genotype-Tissue Expression) project, which provides comprehensive RNA-seq data across multiple human tissues. These datasets offer a wealth of information on gene expression levels and isoform variations across diverse tissues, making them ideal for our analysis.\n\n## Summary of Results\n\nOur analysis revealed distinct gene expression patterns among different human tissues. We identified tissue-specific genes enriched in brain tissues, highlighting their potential roles in neurodevelopment and function. Additionally, we found a set of genes that exhibit consistent expression across a range of tissues, suggesting their fundamental importance in basic cellular processes.\n\nFurthermore, our differential expression analysis unveiled significant changes in gene expression between healthy and diseased tissues, shedding light on potential molecular factors underlying various diseases. Overall, this project underscores the power of NGS data in unraveling intricate gene expression networks and their implications for human health.\n\n---\n\nFor more details, refer to our [Jupyter Notebook](link-to-jupyter-notebook.ipynb) for the complete analysis pipeline and code.\n\n\nmetadata.yml\nThe metadata file is a yml file, which is a text document that contains data formatted using a human-readable data format for data serialization.\n\n\n\nyaml file example\n\n\n\n\nMetadata fields\nThere is a ton of information you can collect regarding an NGS assay or a project. Some information fields are very general, such as author or date, while others are specific to the Assay or Project folder. Below, we will take a look at the minimal information you should collect in each of the folders.\n\nGeneral metadata fields\nHere you can find a list of suggestions for general metadata fields that can be used for both assays and project folders:\n\nTitle: A brief yet informative name for the dataset.\nAuthor(s): The individual(s) or organization responsible for creating the dataset. You can use your ORCID\nDate Created: The date when the dataset was originally generated or compiled. 
Use YYYY-MM-DD format!\nDescription: A short narrative explaining the content, purpose, and context.\nKeywords: A set of descriptive terms or phrases that capture the folder’s main topics and attributes.\nVersion: The version number or identifier for the folder, useful for tracking changes.\nLicense: The type of license or terms of use associated with the dataset/project.\n\n\n\nAssay metadata fields\nHere you will find a table with possible metadata fields that you can use to annotate and track your Assay folders:\n\n\n\n\n\n\n\n\n\nMetadata field\nDefinition\nFormat\nOntology\nExample\n\n\n\n\nassay_ID\nIdentifier for the assay that is at least unique within the project\n<Assay-ID\\>_<keyword\\>_YYYYMMDD\nNA\nCHIP_Oct4_20200101\n\n\nassay_type\nThe type of experiment performed, eg ATAC-seq or seqFISH\nNA\nontology field- e.g. EFO or OBI\nChIPseq\n\n\nassay_subtype\nMore specific type or assay like bulk nascent RNAseq or single cell ATACseq\nNA\nontology field- e.g. EFO or OBI\nbulk ChIPseq\n\n\nowner\nOwner of the assay (who made the experiment?).\n<First Name\\> <Last Name\\>\nNA\nJose Romero\n\n\nplatform\nThe type of instrument used to perform the assay, eg Illumina HiSeq 4000 or Fluidigm C1 microfluidics platform\nNA\nontology field- e.g. EFO or OBI\nIllumina\n\n\nextraction_method\nTechnique used to extract the nucleic acid from the cell\nNA\nontology field- e.g. EFO or OBI\nNA\n\n\nlibrary_method\nTechnique used to amplify a cDNA library\nNA\nontology field- e.g. EFO or OBI\nNA\n\n\nexternal_accessions\nAccession numbers from external resources to which assay or protocol information was submitted\nNA\neg protocols.io, AE, GEO accession number, etc\nGSEXXXXX\n\n\nkeyword\nKeyword for easy identification\nwordWord\ncamelCase\nOct4ChIP\n\n\ndate\nDate of assay creation\nYYYYMMDD\nNA\n20200101\n\n\nnsamples\nNumber of samples analyzed in this assay\n<integer\\>\nNA\n9\n\n\nis_paired\nPaired fastq files or not\n<single OR paired\\>\nNA\nsingle\n\n\npipeline\nPipeline used to process data and version\nNA\nNA\nnf-core/chipseq -r 1.0\n\n\nstrandedness\nThe strandedness of the cDNA library\n<+ OR - OR *\\>\nNA\n*\n\n\nprocessed_by\nWho processed the data\n<First Name\\> <Last Name\\>\nNA\nSarah Lundregan\n\n\norganism\nOrganism origin\n<Genus species\\>\nTaxonomy name\nMus musculus\n\n\norigin\nIs internal or external (from a public resources) data\n<internal OR external\\>\nNA\ninternal\n\n\npath\nPath to files\n</path/to/file\\>\nNA\nNA\n\n\nshort_desc\nShort description of the assay\nplain text\nNA\nOct4 ChIP after pERK activation\n\n\nELN_ID\nID of the experiment/assay in your Electronic Lab Notebook software, like labguru or benchling\nplain text\nNA\nNA\n\n\n\n\n\n\n\n\n\n\nProject metadata fields\nHere you will find a table with possible metadata fields that you can use to annotate and track your Project folders:\n\n\n\n\n\n\n\n\n\nMetadata field\nDefinition\nFormat\nOntology\nExample\n\n\n\n\nproject\nProject ID\n<surname\\>_et_al_2023\nNA\nproks_et_al_2023\n\n\nauthor\nOwner of the project\n<First name\\> <Surname\\>\nNA\nMartin Proks\n\n\ndate\nDate of creation\nYYYYMMDD\nNA\n20230101\n\n\ndescription\nShort description of the project\nPlain text\nNA\nThis is a project describing the effect of Oct4 perturbation after pERK activation\n\n\n\n\n\n\n\n\n\n\n\nMore info\nThe information provided in this lesson is not at all exhaustive. There might be many more fields and controlled vocabularies that could be useful for your NGS data. 
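As a quick illustration, a filled-in metadata.yml that combines a few of the assay fields above might look like the following (the values are taken from the example column of the table and are illustrative only):\nassay_ID: CHIP_Oct4_20200101\nassay_type: ChIPseq\nowner: Jose Romero\nplatform: Illumina\nkeyword: Oct4ChIP\ndate: 20200101\nnsamples: 9\nis_paired: single\npipeline: nf-core/chipseq -r 1.0\norganism: Mus musculus\norigin: internal\nshort_desc: Oct4 ChIP after pERK activation\n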
We recommend that you take a look at the following sources for more information!\n\nTranscriptomics metadata standards and fields\nBionty: Biological ontologies for data scientists.\n\n\n\n\n\n\n\nExercise 2: modify the metadata.yml files in your Cookiecutter templates\n\n\n\n\n\n\n\nWe have seen some examples of metadata for NGS data. It is time now to customize your Cookiecutter templates and modify the metadata.yml files so that they fit your needs!\n\nThink about what kind of metadata you would like to include.\nModify the cookiecutter.json file so that when you create a new folder template, all the metadata is filled accordingly.\n\n\n\n\n\n\n\nHint\n\n\n\n\n\n\n\n\n\n\ncookiecutter_json_example\n\n\n\n\n\n\n\n\nModify the metadata.yml file so that it includes the metadata recorded by the cookiecutter.json file.\n\n\n\n\n\n\n\nHint\n\n\n\n\n\n\n\n\n\n\nassay_metadata_example\n\n\n\n\n\n\n\n\nModify the README.md file so that it includes the short description recorded by the cookiecutter.json file.\nGit add, commit, and push the changes to your template.\nTest your folders by using the command cookiecutter <URL to your cookiecutter repository in GitHub>"
  },
  {
    "objectID": "develop/practical_workshop.html#data-documentation",
    "href": "develop/practical_workshop.html#data-documentation",
    "title": "Practical material",
    "section": "2. Data documentation",
    "text": "2. Data documentation\nData documentation involves organizing, describing, and providing context for datasets and projects. While metadata concentrates on the data itself, README files provide a broader perspective on the overall project or resource.\n\nMetadata\n\n\n\n\n\n\nmetadata.yml\n\n\n\nChoose the format that best suits the project’s needs. In this workshop, we will focus on YAML as it is widely used for configuration files (e.g., in conda or pipelines).\n\n\n\n\n\n\nFile formats\n\n\n\n\n\n\n\n\nXML (eXtensible Markup Language): uses custom tags to describe data and allows for a hierarchical structure.\nJSON (JavaScript Object Notation): lightweight and human-readable format that is easy to parse and generate.\nCSV (Comma-Separated Values) or TSV (tab-separated values): simple and widely supported for representing tabular formats. Easy to manipulate using software or programming languages. It is often used for sample metadata.\nYAML (YAML Ain’t Markup Language): human-readable data serialization format, commonly used as project configuration files.\n\nOthers such as RDF or HDF5.\n\n\n\n\n\nLink to the file format database.\n\n\nMetadata in biological datasets refers to the information that describes the data and provides context for how the data was collected, processed, and analyzed. Metadata is crucial for understanding, interpreting, and using biological datasets effectively. It also ensures that datasets are reusable, reproducible, and understandable by other researchers. Some of the components may differ depending on the type of project, but there are general concepts that will always be shared across different projects:\n\nSample information and collection details\nBiological context (such as experimental conditions, if applicable)\nData description\nData processing steps applied to the raw data\nAnnotation and Ontology terms\nFile metadata (file type, file format, etc.)\nEthical and Legal Compliance (ownership, access, provenance)\n\n\n\n\n\n\n\nMetadata and controlled vocabularies\n\n\n\nTo maximize the usefulness of metadata, aim to use controlled vocabularies across all fields. Read more about data documentation and find ontology services examples in lesson 4. 
We encourage you to begin implementing them systematically on your own (under the “sources” section, you will find some helpful links to guide you in putting them into practice).\nIf you work with NGS data, check out these recommendations and examples of metadata for samples, projects and datasets.\n\n\n\n\nREADME file\n\n\n\n\n\n\nREADME.md\n\n\n\nChoose the format that best suits the project’s needs. In this workshop, we will focus on Markdown as it is the most widely used format due to its balance of simplicity and expressive formatting options.\n\n\n\n\n\n\nFile formats\n\n\n\n\n\n\n\n\nMarkdown (.md): commonly used because it is easy to read and write and is compatible across platforms (e.g., GitHub, GitLab). Supports formatting like headings, lists, links, images, and code blocks.\nPlain Text (.txt): Simple and straightforward format without any rich formatting, great for basic instructions. It lacks the ability to structure content effectively.\nReStructuredText (.rst): commonly used for Python projects. Supports advanced formatting (tables, links, images, and code blocks).\n\nOthers such as HTML, YAML and Notebooks.\n\n\n\n\n\nLink to the file format database\n\n\nThe README.md file is a markdown file that provides a comprehensive description of the data within a folder. Its rich text format (including bold, italic, links, etc.) allows you to explain the contents of the folder, as well as the reasons and methods behind its creation or collection. The content will vary depending on what is being described (data or assays, a project, software…).\nHere is an example of a README file for a bioinformatics project:\n\n\n\n\n\n\nREADME\n\n\n\n\n\n# TITLE\nClear and descriptive.\n# OVERVIEW\nIntroduction to the project, including its aims and significance. Describe the main purpose and the biological questions being addressed.\n\n\n\n\n\n\nExample text\n\n\n\n\n\n\n\nThis project aims to investigate gene expression patterns across various human tissues using Next Generation Sequencing (NGS) data. By analyzing the transcriptomes of different tissues, we seek to uncover tissue-specific gene expression profiles and identify potential markers associated with specific biological functions or diseases.\nUnderstanding tissue-specific gene expression is crucial for deciphering the molecular basis of health and disease. Identifying genes that are uniquely expressed in certain tissues can provide insights into tissue function, development, and potential therapeutic targets. This project contributes to our broader understanding of human biology and has implications for personalized medicine and disease research.\n\n\n\n\n\n# TABLE OF CONTENTS (optional but helpful for others to navigate to different sections)\n# INSTALLATION AND SETUP\nList all prerequisites, software, dependencies, and system requirements needed for others to reproduce the project. If available, you may link to a Docker image, Conda YAML file, or requirements.txt file.\n# USAGE\nInclude command-line examples for various functionalities or steps and the path for running a pipeline, if applicable.\n# DATASETS\nDescribe the data, including its sources, format, and how to access it. 
If the data has undergone preprocessing, provide a description of the processes applied or the pipeline used.\n\n\n\n\n\n\nExample text\n\n\n\n\n\n\n\nWe have used internal datasets with IDs: RNA_humanSkin_20201030, RNA_humanBrain_20210102, RNA_humanLung_20220304.\nIn addition, we utilized publicly available NGS datasets from the GTEx (Genotype-Tissue Expression) project, which provides comprehensive RNA-seq data across multiple human tissues. These datasets offer a wealth of information on gene expression levels and isoform variations across diverse tissues, making them ideal for our analysis.\n\n\n\n\n\n# RESULTS\nSummarize the results and key findings or outputs.\n\n\n\n\n\n\nExample text\n\n\n\n\n\n\n\nOur analysis revealed distinct gene expression patterns among different human tissues. We identified tissue-specific genes enriched in brain tissues, highlighting their potential roles in neurodevelopment and function. Additionally, we found a set of genes that exhibit consistent expression across a range of tissues, suggesting their fundamental importance in basic cellular processes.\nFurthermore, our differential expression analysis unveiled significant changes in gene expression between healthy and diseased tissues, shedding light on potential molecular factors underlying various diseases. Overall, this project underscores the power of NGS data in unraveling intricate gene expression networks and their implications for human health.\n\n\n\n\n\n# CONTRIBUTIONS AND CONTACT INFO\n# LICENSE\n\n\n\n\n\n\n\n\n\n\nExercise 2: modify the metadata.yml file in your Cookiecutter template\n\n\n\n\n\n\n\nIt is time now to customize your Cookiecutter templates and modify the metadata.yml files so that they fit your needs!\n\nConsider changing variables (add/remove) in the metadata.yml file from the cookicutter template.\nModify the cookiecutter.json file. You could add new variables or change the default key and/or values:\n{\n\"project_name\": \"myProject\",\n\"project_slug\": \"{{ cookiecutter.project_name.lower().replace(' ', '_').replace('-', '_') }}\",\n\"authors\": \"myName\",\n\"start_date\": \"{% now 'utc', '%Y%m%d' %}\",\n\"short_desc\": \"\",\n\"version\": \"0.1.0\"\n}\nThe metadata file will be filled accordingly.\nOptional: You can customize or remove this prompt message entirely, allowing you to tailor the text to your preferences for a unique experience each time you use the template.\n\"__prompts__\": {\n \"project_name\": \"Project directory name [Example: project_short_description_202X]\",\n \"author\": \"Author of the project\",\n \"date\": \"Date of project creation, default is today's date\",\n \"short_description\": \"Provide a detailed description of the project (context/content)\"\n},\nModify the metadata.yml file so that it includes the metadata recorded by the cookiecutter.json file. 
Hint below:\nproject: {{ cookiecutter.project_name }}\nauthor: {{ cookiecutter.author }}\ndate: {{ cookiecutter.date }}\ndescription: {{ cookiecutter.short_description }}\nModify the README.md file so that it includes the short description recorded by the cookiecutter.json file and the metadata at the top of the markdown file (top between lines of dashed).\n---\ntitle: {{ cookiecutter.project_name }}\ndate: \"{{ cookiecutter.date }}\"\nauthor: {{ cookiecutter.author }}\nversion: {{ cookiecutter.version }}\n---\n\nProject description\n----\n\n{{ cookiecutter.short_description }}\nCommit and push changes when you are done with your modifications\n\n\nStage the changes with git add\nCommit the changes with a meaningful commit message git commit -m \"update cookicutter template\"\nPush the changes to your forked repository on Github git push origin main (or the appropriate branch name)\n\n\nTest your template by using cookiecutter <URL to your GitHub repository \"cookicutter-template\">\nFill up the variables and verify that the modified information looks like you would expect." }, { "objectID": "develop/practical_workshop.html#naming-conventions", "href": "develop/practical_workshop.html#naming-conventions", "title": "Practical material", "section": "3. Naming conventions", - "text": "3. Naming conventions\nUsing consistent naming conventions is important in scientific research as it helps with the organization and retrieval of data or results. By adopting standardized naming conventions, researchers ensure that files, experiments, or data sets are labeled in a clear, logical manner. This makes it easier to locate and compare similar types of data or results, even when dealing with large datasets or multiple experiments. For instance, in genomics, employing uniform naming conventions for files related to specific experiments or samples allows for swift identification and comparison of relevant data, streamlining the research process and contributing to the reproducibility of findings. This practice promotes efficiency, collaboration, and the integrity of scientific work.\n\nGeneral tips\nBelow you will find a small list of general tips to follow when you name a folder or a file:\n\nUse only alphanumeric characters to write a word: a to z and 0 to 9\nAvoid special characters: ~!@#$%^&*()`“|\nDate format: use YYYYMMDD format. For example: 20230101.\nAuthors: use initials. For example: JARH\nDon’t use spaces! Computers get very confused when you need to point a path to a file and it contains spaces! Instead:\n\nSeparate field sections are separated by underscores _.\nWords in each section are written in camelCase. It would look then like this: field1_word1Word2.txt. For example: heatmap_sampleCor_20230101.png. The first field indicates what this file is, i.e., a heatmap. The second field is what is being plotted, i.e., sample correlations; since the field contains two words, they are written in camelCase. The third field is the date when the image was created.\n\nUse as short fields as possible. You can try to use understandable abbreviations, like LFC for LogFoldChange, Cor for correlations, Dist for distances, etc.\nAvoid long names as much as you can, be concise!\nAvoid creating many sublevels of folders.\nWrite down your naming convention pattern and document it in the README file\nWhen using a sequential numbering system, use leading zeros to make sure files are sorted in sequential order. 
Use 01 instead of just 1 if your sequence only goes up to 99.\nVersions should be used as the last element, and use at least two digits with a leading 0 (e.g. v01, v02)\n\n\n\nSuggestions for NGS data\nMore info on naming conventions for different types of files and analysis is in development.\n\n\n\n\n\n\n\n\n\nname\ndescription\nnaming_convention\nfile format\nexample\n\n\n\n\n.fastq\nraw sequencing reads\nnan\nnan\nsampleID_run_read1.fastq\n\n\n.fastqc\nquality control from fastqc\nnan\nnan\nsampleID_run_read1.fastqc\n\n\n.bam\naligned reads\nnan\nnan\nsampleID_run_read1.bam\n\n\nGTF\nsequence annotation\nnan\nnan\none of https://www.gencodegenes.org/\n\n\nGFF\nsequence annotation\nnan\nnan\none of https://www.gencodegenes.org/\n\n\n.bed\ngenome locations\nnan\nnan\nnan\n\n\n.bigwig\ngenome coverage\nnan\nnan\nnan\n\n\n.fasta\nsequence data (nucleotide/aminoacid)\nnan\nnan\none of https://www.gencodegenes.org/\n\n\nMultiqc report\nQC aggregated report\n<assayID\\>_YYYYMMDD.multiqc\nmultiqc\nRNA_20200101.multiqc\n\n\nCount matrix\nfinal count matrix\n<assayID\\>_cm_aligner_YYYYMMDD.tsv\ntsv\nRNA_cm_salmon_20200101.tsv\n\n\nDEA\ndifferential expression analysis results\nDEA_<condition1-condition2\\>_LFC<absolute_threshold\\>_p<pvalue decimals\\>_YYYYMMDD.tsv\ntsv\nDEA_treat-untreat_LFC1_p01_20200101.tsv\n\n\nDBA\ndifferential binding analysis results\nDBA_<condition1-condition2\\>_LFC<absolute_threshold\\>_p<pvalue decimals\\>_YYYYMMDD.tsv\ntsv\nDBA_treat-untreat_LFC1_p01_20200101.tsv\n\n\nMAplot\nMA plot\nMAplot_<condition1-condition2\\>_YYYYMMDD.jpeg\njpeg\nMAplot_treat-untreat_20200101.jpeg\n\n\nHeatmap plot\nHeatmap plot of anything\nheatmap_<type\\>_YYYYMMDD.jpeg\njpeg\nheatmap_sampleCor_20200101.jpeg\n\n\nVolcano plot\nVolcano plot\nvolcano_<condition1-condition2\\>_YYYYMMDD.jpeg\njpeg\nvolcano_treat-untreat_20200101.jpeg\n\n\nVenn diagram\nVenn diagram\nvenn_<type\\>_YYYYMMDD.jpeg\njpeg\nvenn_consensus_20200101.jpeg\n\n\nEnrichment table\nEnrichment results\nnan\ntsv\nnan\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nExercise 3: Create your own naming conventions\n\n\n\n\n\n\n\nThink about the most common types of files and folders you will be working on, such as visualizations, results tables, processed files, etc. Then come up with a logical and clear way of naming those files using the tips suggested above. Remember to avoid making long and complicated names!" + "text": "3. Naming conventions\nAs discussed in lesson 3, consistent naming conventions are key for interpreting, comparing, and reproducing findings in scientific research. 
Standardized naming helps organize and retrieve data or results, allowing researchers to locate and compare similar types of data within or across large datasets.\n\n\n\n\n\n\nExercise 3: Define your file name conventions\n\n\n\n\n\n\n\nAvoid long and complicated names and ensure your file names are both informative and easy to manage:\n\nFor saving a new plot, a heatmap representing sample correlations\nWhen naming the file for the document containing the Research Data Management Course Objectives (Version 2, 2nd May 2024) from the University of Copenhagen\nConsider the most common file types you work with, such as visualizations, figures, tables, etc., and create logical and clear file names\n\n\n\n\n\n\n\nHint\n\n\n\n\n\n\n\n\nheatmap_sampleCor_20240101.png\nKU_RDM-objectives_20240502_v02.doc or KU_RDMObj_20240502_v02.doc" }, { - "objectID": "develop/practical_workshop.html#create-a-catalog-of-your-assay-folder", - "href": "develop/practical_workshop.html#create-a-catalog-of-your-assay-folder", + "objectID": "develop/practical_workshop.html#create-a-catalog-of-your-data-folder", + "href": "develop/practical_workshop.html#create-a-catalog-of-your-data-folder", "title": "Practical material", - "section": "4. Create a catalog of your assay folder", - "text": "4. Create a catalog of your assay folder\nThe next step is to collect all the NGS datasets that you have created in the manner explained above. Since your folders all should contain the metadata.yml file in the same place with the same metadata, it should be very easy to iteratively go through all the folders and merge all the metadata.yml files into a one single table. This table can be then browsed easily with Microsoft Excel, for example. If you are interested in making a Shiny app or Python Panel tool to interactively browse the catalog, check out this lesson.\n\n\n\n\n\n\nExercise 4: create a metadata.tsv catalog\n\n\n\n\n\n\n\nWe will make a small script in R (or you can make one with Python) that recursively goes through all the folders inside an input path (like your Assays folder), fetches all the metadata.yml files, and merges them. Finally, it will write a TSV file as an output.\n\nCreate a folder called Assays\nUnder that folder, make three new Assay folders from your cookiecutter template\nRun the script below with R (or create your own with Python). Modify the folder_path variable so it matches the path to the folder Assays. The table will be written under the same folder_path.\nVisualize your Assays table with Excel\n\n\nlibrary(yaml)\nlibrary(dplyr)\nlibrary(lubridate)\n\n# Function to recursively fetch metadata.yml files\nget_metadata <- function(folder_path) {\n file_list <- list.files(path = folder_path, pattern = \"metadata\\\\.yml$\", recursive = TRUE, full.names = TRUE)\n metadata_list <- lapply(file_list, yaml::yaml.load_file)\n return(metadata_list)\n }\n\n# Specify the folder path\n folder_path <- \"/path/to/your/folder\"\n\n # Fetch metadata from the specified folder\n metadata <- get_metadata(folder_path)\n\n # Convert metadata to a data frame\n metadata_df <- data.frame(matrix(unlist(metadata), ncol = length(metadata), byrow = TRUE))\n colnames(metadata_df) <- names(metadata[[1]])\n\n # Save the data frame as a TSV file\n output_file <- paste0(\"database_\", format(Sys.Date(), \"%Y%m%d\"), \".tsv\")\n write.table(metadata_df, file = output_file, sep = \"\\t\", quote = FALSE, row.names = FALSE)\n\n # Print confirmation message\n cat(\"Database saved as\", output_file, \"\\n\")" + "section": "4. 
Create a catalog of your data folder", + "text": "4. Create a catalog of your data folder\nThe next step is to collect all the NGS datasets that you have created in the manner explained above. Since your folders all should contain the metadata.yml file in the same place with the same metadata, it should be very easy to iteratively go through all the folders and merge all the metadata.yml files into a one single table. This table can be then browsed easily with Microsoft Excel, for example. If you are interested in making a Shiny app or Python Panel tool to interactively browse the catalog, check out this lesson.\n\n\n\n\n\n\nExercise 4: create a metadata.tsv catalog\n\n\n\n\n\n\n\nWe will make a small script in R (or you can make one with Python) that recursively goes through all the folders inside an input path (like your Assays folder), fetches all the metadata.yml files, and merges them. Finally, it will write a TSV file as an output.\n\nCreate a folder called dataset and change directory cd dataset\nFork this repository: a Cookiecutter template designed for NGS datasets. While you are welcome to create your own template from scratch, we recommend using this one to save time.\nRun the cookiecutter cc-data-template command at least twice to create multiple datasets or projects. Use different values each time to simulate various scenarios (do this in the dataset directory that you have previously created). Execute the script below using R (or create your own script in Python). Adjust the folder_path variable so that it matches the path to the Assays folder. The resulting table will be saved in the same folder_path.\nOpen your database_YYYYMMDD.tsv table in a text editor from the command-line, or view it in Excel for better visualization.\n\n\nlibrary(yaml)\nlibrary(dplyr)\nlibrary(lubridate)\n\n# Function to read a YAML file and transform it into a dataframe format.\nread_yaml <- function(file_path) {\n # Read the YAML file and convert it to a data frame\n df <- yaml::yaml.load_file(file_path) %>% as.data.frame(stringsAsFactors = FALSE)\n \n # Return the data frame\n return(df)\n}\n\n# Function to recursively fetch metadata.yml files\nget_metadata <- function(folder_path) {\n file_list <- list.files(path = folder_path, pattern = \"metadata\\\\.yml$\", recursive = TRUE, full.names = TRUE)\n\n metadata_list <- lapply(file_list, read_yaml)\n \n # Combine the list of data frames into a single data frame using dplyr::bind_rows()\n combined_metadata <- bind_rows(metadata_list)\n\n return(combined_metadata)\n}\n\n# Specify the folder path\nfolder_path <- \"/path/to/your/folder\"\n\n# Fetch metadata from the specified folder\nmetadata <- get_metadata(folder_path)\n\n# Save the data frame as a TSV file\noutput_file <- paste0(\"database_\", format(Sys.Date(), \"%Y%m%d\"), \".tsv\")\nwrite.table(metadata, file = output_file, sep = \"\\t\", quote = FALSE, row.names = FALSE)\n\n# Print confirmation message\ncat(\"Database saved as\", output_file, \"\\n\")" }, { "objectID": "develop/practical_workshop.html#version-control-of-your-data-analysis-using-git-and-github", @@ -115,11 +115,11 @@ ] }, { - "objectID": "develop/04_metadata.html#documentation-and-metadata", - "href": "develop/04_metadata.html#documentation-and-metadata", + "objectID": "develop/04_metadata.html#data-documentation", + "href": "develop/04_metadata.html#data-documentation", "title": "4. 
Documentation for biodata", - "section": "Documentation and metadata", - "text": "Documentation and metadata\nEssential documentation comes in different forms and flavors, serving various purposes in research. Examples include protocols outlining experimental procedures, detailed lab journals recording experimental conditions and observations, codebooks explaining concepts, variables, and abbreviations used in the analysis, information about the structure and content of a dataset, software installation, and usage manual, code explanation within files or methodological information outlining data processing steps.\n From ontotext.com\nMetadata provides essential context and structure to (primary) data, enabling researchers to understand its significance and facilitate efficient data management. Some common elements found in metadata for bioinformatics data include:\n\nSample information and collection details\nExperimental conditions\nData processing steps applied to the raw data\nAnnotation and Ontology terms\nFile metadata (file type, file format, etc.)\nEthical and Legal Compliance\n\nMetadata serves as a crucial guide in navigating the complex landscape of data, akin to a cheat sheet for piecing together the puzzle of information. Much like identifying puzzle pieces, metadata provides essential details about data origin, structure, and context, such as sample collection details, experimental procedures, and equipment used. Metadata enables data exploration, interpretation, and future accessibility, promoting effective management and facilitating data usability and reuse.\n\n\n\n\n\n\nBenefits of collecting proper metadata\n\n\n\n\nData Context and Interpretation: Aiding in understanding experimental conditions, sample origins, and processing methods, is crucial for accurate results interpretation.\nData Discovery and Access: Metadata enables easy locating and accessing of specific datasets by quickly identifying relevant data through sample identifiers, experimental parameters, and timestamps.\nReproducibility and Collaboration: Metadata facilitates experiment replication and validation by enabling colleagues to reproduce analyses, compare results, and collaborate effectively, enhancing the integrity of scientific findings.\nQuality Control and Validation: Metadata supports data quality assessment by tracking the origin and handling of NGS data, allowing the identification of errors or biases to validate analysis accuracy and reliability.\nLong-Term Data Preservation: metadata ensures preservation over time, facilitating future understanding and utilization of archived datasets for continued scientific impact as research progresses.\n\n\n\n\nStreamlining Metadata Collection\nData and project directories should both include metadata and a README file.\n\n\n\n\n\n\nPractical tips\n\n\n\n\nImplement a logical structure with clear and descriptive file names.\nUse of controlled vocabularies and ontologies to ensure consistency and efficient data management and interpretation.\nUse a repository and a versioning system\nMake it Machine-readable, -actionable, and -interpretable.\nDevelop standards further within your research environment FAIRsharing standards.\nInclude all information for others to comprehend and effectively utilize the data.\n\n\n\n\n\nREADME.md\nThe README.md file, written in markdown format, provides a detailed description of the folder’s content. It includes information such as the purpose of the data, collection methods, and relevant details. 
The content might differ based on the purpose of the data.\n\n\n\n\n\n\nExercise 1: Identify README.md key components.\n\n\n\n\n\n\n\nSelect one of the examples below and reflect on how effectively the README communicates important information about the project. Please note that some of the links lead to README files describing databases, while others pertain to software and tools.\n\n1000 Genomes Project. You will find several readme files here.\n\nHomo Sapiens, fasta GRCh38\nIPD-IMGT/HLA Database\nDocker\nPython pandas\n\n\n\n\n\n\nStructure for bioinformatics projects.\n\nDescription of the project\nObjectives and aims\nDatasets and software requirements\nInstruction for data interpretation\nSummary of results\nContributions\nAdditional comments or notes\n\n\n\nmetadata.yml\nMetadata can be written in many file formats (commonly used: YAML, TXT, JSON, and CSV). We recommend YAML format, which is a text document that contains data formatted using a human-readable data format for data serialization. The content will be specific to the type of project.\nmetadata:\n project: \"Title\"\n author: \"Name\"\n date: \"YYYYMMDD\"\n description: \"Project short description\"\n version: \"1.0\"\n analysis:\n tool: \"software\"\n version: \"1.1.1\"\nSome general metadata fields used across different disciplines:\n\nProject Title: A concise and informative name for the dataset.\nAuthor(s): The individual(s) or organization responsible for creating the dataset. Include ORCID for identification.\nDate Created: The date when the dataset was originally generated or compiled, in YYYY-MM-DD format.\nDate Modified: The date when the dataset was last updated or modified (YYYY-MM-DD).\nObject ID: The project or assay ID for tracking and reference purposes.\nDescription: A short narrative explaining the content, purpose, and context of the project.\nKeywords: Descriptive terms or phrases capturing the main topics and attributes.\nEthical and Legal Considerations: Information about ethical approvals, consent, and any legal restrictions.\nVersion: The version number or identifier, useful for tracking changes.\nRelated Publications: Links or references to scientific publications associated with the folder. Always add the DOI.\nFunding Source: Details about the funding agency or source that supported the research or data generation.\nLicense: The type of license or terms of use associated with the dataset/project.\nContact Information: Contact details for individuals who can provide further information about the dataset/project.\n\n\n\n\n\n\n\nTip\n\n\n\nThere is an exercise in the practical material to streamline the creation of metadata files using Cookiecutter, a template-based scaffolding tool.\n\n\n\n\n\n\nHint\n\n\n\n\n\n\n\nCreate a metadata file with the following description fields: name, date, description, version, authors, keywords, license. Fill it up at the start of the project, when you generate the file structure.", + "section": "Data documentation", + "text": "Data documentation\nEssential documentation comes in different forms and flavors, serving various purposes in research. 
Examples include protocols outlining experimental procedures, detailed lab journals recording experimental conditions and observations, codebooks explaining concepts, variables, and abbreviations used in the analysis, information about the structure and content of a dataset, software installation, and usage manual, code explanation within files or methodological information outlining data processing steps.\n From ontotext.com\nData documentation provides essential context and structure to (primary) data, enabling researchers to understand its significance and facilitate efficient data management. Some common elements found in metadata for bioinformatics data include:\n\nData collection information: source (e.g., organism, tissue or location), date (YYYY-MM-DD format) and time, collection methods employed or experimental conditions.\nData processing information: data content, data format, data cleaning and transformation such as filtering and normalizations techniques, and software and tools used.\nData description: variables and attributes, and data types (e.g., categorical, numerical, or textual).\nBiological context: experimental design, biological purpose and relevance and implications in the broader context.\nData ownership and access: authorship, licensing of the data and details on accessing and sharing.\nProvenance and tracking: version control information over time and citations, such as links to publications or studies that reference the data.\n\nData documentation also serves as a crucial guide in navigating the complex landscape of data, akin to a cheat sheet for piecing together the puzzle of information. Much like identifying puzzle pieces, metadata provides essential details about data origin, structure, and context, such as sample collection details, experimental procedures, and equipment used. Metadata enables data exploration, interpretation, and future accessibility, promoting effective management and facilitating data usability and reuse.\n\n\n\n\n\n\nBenefits of collecting proper documentation\n\n\n\n\nData Context and Interpretation: Aiding in understanding experimental conditions, sample origins, and processing methods, is crucial for accurate results interpretation.\nData Discovery and Access: Documentation enables easy locating and accessing of specific datasets by quickly identifying relevant data through sample identifiers, experimental parameters, and timestamps.\nReproducibility and Collaboration: Documentation facilitates experiment replication and validation by enabling colleagues to reproduce analyses, compare results, and collaborate effectively, enhancing the integrity of scientific findings.\nQuality Control and Validation: Documentation supports data quality assessment by tracking the origin and handling of NGS data, allowing the identification of errors or biases to validate analysis accuracy and reliability.\nLong-Term Data Preservation: Documentation ensures preservation over time, facilitating future understanding and utilization of archived datasets for continued scientific impact as research progresses.\n\n\n\n\nStreamlining Metadata Collection\nData and project directories should both include metadata and a README file. Metadata delivers descriptive information about a dataset or project, offering insights for interpreting, using, and sharing the data effectively. README files offer an overview and purpose of the project or dataset, providing instructions and guidance for setting up, running, and using the data or tools. 
While metadata concentrates on the data itself, README files provide a broader perspective on the overall project or resource.\n\n\n\n\n\n\nPractical tips\n\n\n\n\nImplement a logical structure with clear and descriptive file names.\nUse controlled vocabularies and ontologies to ensure consistency and efficient data management and interpretation.\nUse a repository and a version control system.\nMake it machine-readable, -actionable, and -interpretable.\nDevelop standards further within your research environment: FAIRsharing standards.\nInclude all the information others need to comprehend and effectively utilize the data.\n\n\n\n\n\nREADME.md\n\n\n\n\n\n\nFile formats\n\n\n\nLink to the file format database\n\nMarkdown (.md): commonly used because it is easy to read and write and is compatible across platforms (e.g., GitHub, GitLab). Supports formatting like headings, lists, links, images, and code blocks.\nPlain Text (.txt): Simple and straightforward format without any rich formatting, great for basic instructions. It lacks the ability to structure content effectively.\nReStructuredText (.rst): commonly used for Python projects. Supports advanced formatting (tables, links, images, and code blocks).\n\nOthers such as HTML, YAML and Notebooks.\n\n\nThe README.md file, written in markdown format, provides a detailed description of the folder’s content. It includes information such as the purpose of the data, collection methods, and relevant details. The content might differ based on the purpose of the data.\n\n\n\n\n\n\nExercise 1: Identify README.md key components.\n\n\n\n\n\n\n\nSelect one of the examples below and reflect on how effectively the README communicates important information about the project. Please note that some of the links lead to README files describing databases, while others pertain to software and tools.\n\n1000 Genomes Project. You will find several readme files here.\n\nHomo Sapiens, fasta GRCh38\nIPD-IMGT/HLA Database\nDocker\nPython pandas\n\n\n\n\n\n\nStructure for bioinformatics projects.\n\nDescription and relevance of the project\nObjectives and aims\nDatasets and software requirements\nInstructions for data interpretation\nSummary of results\nContributions\nAdditional comments or notes\n\n\n\nmetadata.yml\n\n\n\n\n\n\nFile formats\n\n\n\n\nXML (eXtensible Markup Language): uses custom tags to describe data and allows for a hierarchical structure.\nJSON (JavaScript Object Notation): lightweight and human-readable format that is easy to parse and generate.\nCSV (Comma-Separated Values) or TSV (Tab-Separated Values): simple and widely supported formats for representing tabular data. Easy to manipulate using software or programming languages. They are often used for sample metadata.\nYAML (YAML Ain’t Markup Language): human-readable data serialization format, commonly used for project configuration files.\n\nOthers such as RDF or HDF5.\n\n\nLink to the file format database.\nMetadata can be written in many file formats (commonly used: YAML, TXT, JSON, and CSV). We recommend the YAML format, a plain-text, human-readable format for data serialization. However, choose the format that best suits the project’s needs. 
The content will be specific to the type of project.\nmetadata:\n project: \"Title\"\n author: \"Name\"\n date: \"YYYYMMDD\"\n description: \"Project short description\"\n version: \"1.0\"\n analysis:\n tool: \"software\"\n version: \"1.1.1\"\nSome general metadata fields used across different disciplines:\n\nProject Title: A concise and informative name for the dataset.\nAuthor(s): The individual(s) or organization responsible for creating the dataset. Include ORCID for identification.\nDate Created: The date when the dataset was originally generated or compiled, in YYYY-MM-DD format.\nDate Modified: The date when the dataset was last updated or modified (YYYY-MM-DD).\nObject ID: The project or assay ID for tracking and reference purposes.\nDescription: A short narrative explaining the content, purpose, and context of the project.\nKeywords: Descriptive terms or phrases capturing the main topics and attributes.\nEthical and Legal Considerations: Information about ethical approvals, consent, and any legal restrictions.\nVersion: The version number or identifier, useful for tracking changes.\nRelated Publications: Links or references to scientific publications associated with the folder. Always add the DOI.\nFunding Source: Details about the funding agency or source that supported the research or data generation.\nLicense: The type of license or terms of use associated with the dataset/project.\nContact Information: Contact details for individuals who can provide further information about the dataset/project.\n\n\n\n\n\n\n\nTip\n\n\n\nThere is an exercise in the practical material to streamline the creation of metadata files using Cookiecutter, a template-based scaffolding tool.\n\n\n\n\n\n\nHint\n\n\n\n\n\n\n\nCreate a metadata file with the following description fields: name, date, description, version, authors, keywords, license. Fill it up at the start of the project, when you generate the file structure.", "crumbs": [ "Course material", "Key practices", @@ -131,7 +131,7 @@ "href": "develop/04_metadata.html#controlled-vocabularies-and-ontologies", "title": "4. Documentation for biodata", "section": "Controlled vocabularies and ontologies", - "text": "Controlled vocabularies and ontologies\nResearchers encountering inconsistent and non-standardized terms (e.g., gene names, disease names, cell types, protein domains, etc.) across datasets may face challenges in data integration. Thus, requiring additional curation time to enable meaningful comparisons. Standardized vocabularies streamline integration, improving consistency and comparability in analysis. Leveraging widely accepted ontologies in the documentation ensures consistent capture of experiment details in metadata fields, aiding data interpretation.\n\n\n\n\n\n\nExamples of ontology services\n\n\n\n\nUberon anatomy ontology\nGene ontology\nEnsembl gene IDs.\nMedical Subject Headings (MeSH)\nChemical Entities of Biological Interest\nMicroarray Gene Expression Society Ontology (MGED)\n\n\n\n\n\n\n\n\n\nOntology definition\n\n\n\n\n\n\n\nAn ontology is a structured framework representing concepts, attributes, and relationships within a specific domain, aiding knowledge organization and integration. Employing standardized vocabularies, it facilitates effective communication and reasoning between humans and computers. 
Ontologies are crucial for knowledge representation, data integration, and semantic interoperability, enhancing understanding and collaboration across complex domains.\n\n\n\n\n\nStandardization improves data discoverability and interoperability, enabling robust analysis, accelerating knowledge sharing, and facilitating cross-study comparisons. Ontologies act as universal translators, fostering harmonious data interpretation and collaboration across scientific disciplines.\nYou can find three examples of metadata tailored for different purposes NGS data examples: sample metadata, project metadata, and experimental metadata. We suggest exploring controlled vocabularies and metadata standards within your field and seeking additional specialized sources. You will find a few sources at the end of the page.", + "text": "Controlled vocabularies and ontologies\nResearchers encountering inconsistent and non-standardized terms (e.g., gene names, disease names, cell types, protein domains, etc.) across datasets may face challenges in data integration. Thus, requiring additional curation time to enable meaningful comparisons. Standardized vocabularies streamline integration, improving consistency and comparability in analysis. Leveraging widely accepted ontologies in the documentation ensures consistent capture of experiment details in metadata fields, aiding data interpretation.\n\n\n\n\n\n\nExamples of ontology services\n\n\n\n\nUberon anatomy ontology\nGene ontology\nEnsembl gene IDs\nMedical Subject Headings (MeSH)\nChemical Entities of Biological Interest\nMicroarray Gene Expression Society Ontology (MGED)\nNCBI taxonomy\nMondo disease database\n\n\n\n\n\n\n\n\n\nOntology definition\n\n\n\n\n\n\n\nAn ontology is a structured framework representing concepts, attributes, and relationships within a specific domain, aiding knowledge organization and integration. Employing standardized vocabularies, it facilitates effective communication and reasoning between humans and computers. Ontologies are crucial for knowledge representation, data integration, and semantic interoperability, enhancing understanding and collaboration across complex domains.\n\n\n\n\n\nStandardization improves data discoverability and interoperability, enabling robust analysis, accelerating knowledge sharing, and facilitating cross-study comparisons. Ontologies act as universal translators, fostering harmonious data interpretation and collaboration across scientific disciplines.\nYou can find three examples of metadata tailored for different purposes NGS data examples: sample metadata, project metadata, and experimental metadata. We suggest exploring controlled vocabularies and metadata standards within your field and seeking additional specialized sources. You will find a few sources at the end of the page.", "crumbs": [ "Course material", "Key practices", @@ -273,37 +273,37 @@ { "objectID": "develop/examples/NGS_management.html", "href": "develop/examples/NGS_management.html", - "title": "NGS data strategies", + "title": "Effective RDM Practices in NGS Analysis", "section": "", - "text": "Section Overview\n\n\n\n⏰ Time Estimation: X minutes\n💬 Learning Objectives:\n\nNext Generation Sequencing data types and metadata\nBest practices for software and code management\nPipelines and workflows\n\n\n\nIn the data life cycle for Next Generation Sequencing (NGS) technology data, processing, and analyzing are critical phases that involve transforming raw sequencing data into meaningful biological insights. 
Researchers apply computational methods and bioinformatics tools to extract valuable information from the vast amount of sequencing data generated in NGS experiments. We’ll first explore the primary data types generated pre- and post-processing and the importance of detailed documentation. We will then focus on good practices used when performing data analysis and software development.\n\n\n\n\n\n\nNext Generation Sequencing\n\n\n\n\n\n\n\nNext Generation Sequencing (NGS), or high-throughput sequencing, has revolutionized genomics research. It encompasses advanced techniques for rapid and cost-effective analysis of DNA or RNA molecules. Unlike traditional methods, NGS can analyze millions of DNA fragments simultaneously, enhancing the speed, efficiency, and scale of sequencing and becoming integral to modern genomics and biomedical studies. As NGS technologies continue to advance and become more accessible, they will remain at the front of cutting-edge genomics research, driving innovations that contribute to our understanding of complex genetic interactions and their implications for human health and biology.\nApplications\nIt is widely utilized in various applications, including genomic sequencing, transcriptome analysis (RNA-Seq), epigenetic profiling (ChIP-Seq), metagenomics, and targeted sequencing. In addition, it plays a crucial role in fields such as oncology, infectious disease research, and personalized medicine.\nData production\nNGS workflows involve key steps, from sample preparation to data analysis. Samples undergo extraction and fragmentation, followed by the addition of unique identifiers, known as library preparation, for multiplexed sequencing. Then, fragments are amplified and sequences in parallel sequencing using state-of-the-art NGS platforms. Subsequent data analysis processes reconstruct the original sequence and identify genetic variations, structural changes, or functional elements. The unique identifiers are specific adapter sequences that allow future identification of individual samples within a multiplexed sequencing run.\n\n\n\n\n\n\n\n\n\n\n\nExercise\n\n\n\n\n\n\n\n\nDo you ensure that all the data you collect or generate is accompanied by metadata? Have you ever encountered missing information when reading a provided file?\nDo you utilize specific databases or repositories for storing and accessing your research data?\nWhat are the typical data formats you encounter during data processing? As outputs of your analysis, what are the common data formats you encounter for visualization or further analysis?\nDo you document and track the workflows you use for data processing and analysis, including the software employed? How do you ensure reproducibility?\n\n\n\n\n\n\n\n\n\n\nThoroughly document your datasets and the experimental setup to ensure reproducibility. Adhering to standards will ensure interoperability. Data types’ examples:\n\nElectronic Laboratory Notebook (ELN): digital description of the experimental design, and measurement devices. ELNs offer features like data entry, text editing, file attachments, collaboration tools, and search capabilities.\nLaboratory protocols: methodologies to prepare and manage samples.\nSamples: refers to the biological material (extraction of DNA, RNA, or proteins). 
Specification of sample identifier, sample type, source organism, etc.\nSequencing: details on the platform (e.g., Illumina, Oxford Nanopore), library preparation method, coverage, quality control metrics (e.g., Phred score)…\nRaw sequencing data: sequences and quality scores (e.g., FASTQ files)\n\n\n\n\n\n\n\nNote\n\n\n\nA metadata file is crucial during data analysis as it contains information about the experimental conditions (such as sequencing details, treatment, sample type, time points, tissue…).\n\n\n\n\n\nExamples of data types generated during processing:\n\nQuality control metrics: to filter out potential artifacts and ensure the reliability of downstream analyses (e.g., bioinformatics tool like FastQC or MultiQC for results’ aggregation)\nData alignments: in genomics to determine the location of the read in the genome and in transcriptomics to identify gene expression levels.\nDNA analysis results: such as variant calling, genome annotation, functional genomics, phylogenetics, metagenomics, etc. Results are usually presented in tabular format.\nRNA Expression analysis results: from differential gene expression, gene ontology (GO) enrichment, alternative splicing, pathway analysis, etc. Results are usually presented in tabular format.\nEpigenetic profiling outputs: to assess gene regulation and chromatin structure (e.g., ChIP-Seq). Usually presented in BED format.\n\nThe interpretation of NGS data relies heavily on the results of data analysis, which are pivotal for understanding the biological significance of the findings and formulating hypotheses for further exploration. Clear and effective visualization methods are crucial for communicating and interpreting the vast amount of information generated by NGS experiments.\n\n\n\n\n\n\nOther types of data: databases and visualizations\n\n\n\n\n\n\n\n\nKnowledge databases\nA knowledge database is a structured repository of biological information that categorizes and annotates genes, proteins, and their functions, facilitating comprehensive understanding and analysis of biological systems. 
Here are five examples of knowledge databases:\n\n\nGene Ontology (GO): A comprehensive resource that classifies gene functions into defined terms, allowing for standardized annotation and comparison of genes across different organisms.\nDisease Ontology: A database that provides structured, standardized terminology for various diseases and their relationships, aiding in the systematic analysis of disease-related data.\nKEGG Pathways: A collection of manually curated pathway maps representing molecular interactions and reaction networks within cells, enabling the interpretation of high-throughput data in the context of biological systems.\nReactome: An open-access database that offers curated descriptions of biological processes, including pathways, reactions, and molecular events, facilitating the interpretation of large-scale biological data.\nUniProt: An extensive protein knowledgebase that provides detailed information about proteins, including their sequences, functions, and related annotations, supporting a wide range of biological research endeavors.\n\n\nVisualizations\n\n\nHeatmaps: frequently used to visualize gene expression patterns, epigenetic modifications, or microbial abundances across samples/conditions.\nVolcano Plots: commonly used in differential gene expression analysis\nGenome Browser Snapshots: display alignments and genomic features in genomic regions (e.g., gene annotations, ChIP-Seq peaks)\nNetwork Visualizations:utilized to explore gene regulatory networks or protein-protein interaction\nGenomic Annotations: to annotate genetic variations (functional impact on genes, genomic regions, or regulatory element)\n\n\n\n\n\n\n\n\n\nBest practices for software and code management (don’t forget to read about FAIR software):\n\nCommenting your code: to enhance readability and comprehension\nMake your source code accessible using a repository (GitHub, GitLab, Bitbucket, SourceForge, etc.) that provides version control (VC) solutions. This step is one of the most important ones as version control systems (Git or SVN) track changes in your code over time and enable collaboration and easy version management. Most Danish institutions provide courses on Git/GitHub, check yours! We also highly recommend reading this paper (Perez-Riverol et al. 2016).\nREADME file: with comprehensive information about the project including installation instructions, usage examples or tutorials, licensing details, citation information, etc.\nRegister your code in a research software registry and include a clear and accessible software usage license: enabling other researchers to discover and reuse software packages (alongside metadata). More recommendations here.\nUse domain-relevant community standards to ensure consistency and interoperability (e.g., CodeMeta).\n\n\n\n\n\n\n\nGit and Github courses and other resources\n\n\n\n\n\n\n\n\nUniversity of Copenhagen\nAarhus University\nAalborg University\nDTU Git guidelines Find more resources on the Berkeley Library website\n\n\n\n\n\n\n\n\n\nYou might use standard workflows or generate new ones during data processing and data analysis steps.\n\nCode notebooks: tools for data documentation (e.g. Jupyter Notebook, Rmarkdown) enabling the combination of code with descriptive text and visualizations.\n\nIntegrated development environments (knitr or MLflow).\nPipeline frameworks or workflow management systems: designed to streamline and automate various steps involved in data analysis (data extraction, transformation, validation, visualization, and more). 
Additionally, they contribute to ensuring interoperability by facilitating seamless integration and interaction between different components or stages. There are two very popular systems, Nextflow and Snakemake.\n\nA great example of community-curated workflows is the nf-core community. Nf-core is a collaborative and open-source initiative comprising bioinformaticians and researchers dedicated to developing and maintaining a collection of curated and reproducible Nextflow-based pipelines for NGS data analysis, ensuring standardized and efficient data processing workflows.\n\n\n\n\n\n\nCourse on pipelines and workflows\n\n\n\nTake our course on Reproducible Research Practices LINK\n\n\nClick below to access a list of the most common file formats used when working with NGS data.\n\n\nData types summary\n\n\nSelect appropriate file formats that balance data accessibility, storage efficiency, and compatibility with downstream analysis tools. Standardized file formats facilitate data sharing and collaboration among researchers in the scientific community.\n\nBAM/SAM: stores the alignment information (binary and text-based respectively)\nFASTA: store nucleotide or amino acid sequence, commonly used for reference sequences or assembled contigs.\nGene Transfer Format (GTF) and General Feature Format (GFF): annotates genomic features such as genes, exons, and transcripts.\nAlignment indexes: data structures for efficient and rapid mapping of sequencing reads to a reference.\nVariant Call Format (VCF): stores genetic variation such as single nucleotide variants (SNVs), insertions, deletions, and structural variants (and their position, quality score, etc.)\nCount matrix: quantifies the abundance of RNA transcripts or genomic features across samples\nBED/BEDGraph: represent genomic intervals or coverage information (e.g., peak calling identifies regions of enriched signal intensity)\nWIG/BigWig: store genome-wide data\n\nGeneral formats\n\nTabular formats: File formats like CSV, TSV, and XLSX are used to store data in rows and columns for easy data analysis and sharing\nImage formats: File formats such as PNG and SVG are used to store graphical visualizations, making them easily viewable and shareable\nBinary formats: File formats like NPZ and H5 are used to store large datasets, ensuring efficient data access and storage\nJSON: A lightweight data-interchange format for storing hierarchical data structures, commonly used in bioinformatics tools\nHTML: A format used to create interactive reports that include both visualizations and textual descriptions of analysis results\nCode notebooks: Interactive documents combining code, visualizations, and explanatory text, aiding in data analysis reproducibility and documentation\nScripts: Text files containing sets of commands or code instructions for automating data processing and analysis tasks\n\nExplore more data types at the UCSC webpage. 
Check out this tutorial for more detailed explanations.\n\n\n\n\n\n\nIn this lesson, we have taken a look a the vast and diverse landscape of bioinformatics data.", + "text": "Section Overview\n\n\n\n⏰ Time Estimation: X minutes\n💬 Learning Objectives:\n\nNGS data strategies\nFile naming conventions examples", "crumbs": [ "Use cases", "NGS data", - "NGS data strategies" + "Effective RDM Practices in NGS Analysis" ] }, { "objectID": "develop/examples/NGS_management.html#practical-tips-for-computational-research", "href": "develop/examples/NGS_management.html#practical-tips-for-computational-research", - "title": "NGS data strategies", - "section": "", - "text": "Thoroughly document your datasets and the experimental setup to ensure reproducibility. Adhering to standards will ensure interoperability. Data types’ examples:\n\nElectronic Laboratory Notebook (ELN): digital description of the experimental design, and measurement devices. ELNs offer features like data entry, text editing, file attachments, collaboration tools, and search capabilities.\nLaboratory protocols: methodologies to prepare and manage samples.\nSamples: refers to the biological material (extraction of DNA, RNA, or proteins). Specification of sample identifier, sample type, source organism, etc.\nSequencing: details on the platform (e.g., Illumina, Oxford Nanopore), library preparation method, coverage, quality control metrics (e.g., Phred score)…\nRaw sequencing data: sequences and quality scores (e.g., FASTQ files)\n\n\n\n\n\n\n\nNote\n\n\n\nA metadata file is crucial during data analysis as it contains information about the experimental conditions (such as sequencing details, treatment, sample type, time points, tissue…).\n\n\n\n\n\nExamples of data types generated during processing:\n\nQuality control metrics: to filter out potential artifacts and ensure the reliability of downstream analyses (e.g., bioinformatics tool like FastQC or MultiQC for results’ aggregation)\nData alignments: in genomics to determine the location of the read in the genome and in transcriptomics to identify gene expression levels.\nDNA analysis results: such as variant calling, genome annotation, functional genomics, phylogenetics, metagenomics, etc. Results are usually presented in tabular format.\nRNA Expression analysis results: from differential gene expression, gene ontology (GO) enrichment, alternative splicing, pathway analysis, etc. Results are usually presented in tabular format.\nEpigenetic profiling outputs: to assess gene regulation and chromatin structure (e.g., ChIP-Seq). Usually presented in BED format.\n\nThe interpretation of NGS data relies heavily on the results of data analysis, which are pivotal for understanding the biological significance of the findings and formulating hypotheses for further exploration. Clear and effective visualization methods are crucial for communicating and interpreting the vast amount of information generated by NGS experiments.\n\n\n\n\n\n\nOther types of data: databases and visualizations\n\n\n\n\n\n\n\n\nKnowledge databases\nA knowledge database is a structured repository of biological information that categorizes and annotates genes, proteins, and their functions, facilitating comprehensive understanding and analysis of biological systems. 
Here are five examples of knowledge databases:\n\n\nGene Ontology (GO): A comprehensive resource that classifies gene functions into defined terms, allowing for standardized annotation and comparison of genes across different organisms.\nDisease Ontology: A database that provides structured, standardized terminology for various diseases and their relationships, aiding in the systematic analysis of disease-related data.\nKEGG Pathways: A collection of manually curated pathway maps representing molecular interactions and reaction networks within cells, enabling the interpretation of high-throughput data in the context of biological systems.\nReactome: An open-access database that offers curated descriptions of biological processes, including pathways, reactions, and molecular events, facilitating the interpretation of large-scale biological data.\nUniProt: An extensive protein knowledgebase that provides detailed information about proteins, including their sequences, functions, and related annotations, supporting a wide range of biological research endeavors.\n\n\nVisualizations\n\n\nHeatmaps: frequently used to visualize gene expression patterns, epigenetic modifications, or microbial abundances across samples/conditions.\nVolcano Plots: commonly used in differential gene expression analysis\nGenome Browser Snapshots: display alignments and genomic features in genomic regions (e.g., gene annotations, ChIP-Seq peaks)\nNetwork Visualizations:utilized to explore gene regulatory networks or protein-protein interaction\nGenomic Annotations: to annotate genetic variations (functional impact on genes, genomic regions, or regulatory element)\n\n\n\n\n\n\n\n\n\nBest practices for software and code management (don’t forget to read about FAIR software):\n\nCommenting your code: to enhance readability and comprehension\nMake your source code accessible using a repository (GitHub, GitLab, Bitbucket, SourceForge, etc.) that provides version control (VC) solutions. This step is one of the most important ones as version control systems (Git or SVN) track changes in your code over time and enable collaboration and easy version management. Most Danish institutions provide courses on Git/GitHub, check yours! We also highly recommend reading this paper (Perez-Riverol et al. 2016).\nREADME file: with comprehensive information about the project including installation instructions, usage examples or tutorials, licensing details, citation information, etc.\nRegister your code in a research software registry and include a clear and accessible software usage license: enabling other researchers to discover and reuse software packages (alongside metadata). More recommendations here.\nUse domain-relevant community standards to ensure consistency and interoperability (e.g., CodeMeta).\n\n\n\n\n\n\n\nGit and Github courses and other resources\n\n\n\n\n\n\n\n\nUniversity of Copenhagen\nAarhus University\nAalborg University\nDTU Git guidelines Find more resources on the Berkeley Library website\n\n\n\n\n\n\n\n\n\nYou might use standard workflows or generate new ones during data processing and data analysis steps.\n\nCode notebooks: tools for data documentation (e.g. Jupyter Notebook, Rmarkdown) enabling the combination of code with descriptive text and visualizations.\n\nIntegrated development environments (knitr or MLflow).\nPipeline frameworks or workflow management systems: designed to streamline and automate various steps involved in data analysis (data extraction, transformation, validation, visualization, and more). 
Additionally, they contribute to ensuring interoperability by facilitating seamless integration and interaction between different components or stages. There are two very popular systems, Nextflow and Snakemake.\n\nA great example of community-curated workflows is the nf-core community. Nf-core is a collaborative and open-source initiative comprising bioinformaticians and researchers dedicated to developing and maintaining a collection of curated and reproducible Nextflow-based pipelines for NGS data analysis, ensuring standardized and efficient data processing workflows.\n\n\n\n\n\n\nCourse on pipelines and workflows\n\n\n\nTake our course on Reproducible Research Practices LINK\n\n\nClick below to access a list of the most common file formats used when working with NGS data.\n\n\nData types summary\n\n\nSelect appropriate file formats that balance data accessibility, storage efficiency, and compatibility with downstream analysis tools. Standardized file formats facilitate data sharing and collaboration among researchers in the scientific community.\n\nBAM/SAM: stores the alignment information (binary and text-based respectively)\nFASTA: store nucleotide or amino acid sequence, commonly used for reference sequences or assembled contigs.\nGene Transfer Format (GTF) and General Feature Format (GFF): annotates genomic features such as genes, exons, and transcripts.\nAlignment indexes: data structures for efficient and rapid mapping of sequencing reads to a reference.\nVariant Call Format (VCF): stores genetic variation such as single nucleotide variants (SNVs), insertions, deletions, and structural variants (and their position, quality score, etc.)\nCount matrix: quantifies the abundance of RNA transcripts or genomic features across samples\nBED/BEDGraph: represent genomic intervals or coverage information (e.g., peak calling identifies regions of enriched signal intensity)\nWIG/BigWig: store genome-wide data\n\nGeneral formats\n\nTabular formats: File formats like CSV, TSV, and XLSX are used to store data in rows and columns for easy data analysis and sharing\nImage formats: File formats such as PNG and SVG are used to store graphical visualizations, making them easily viewable and shareable\nBinary formats: File formats like NPZ and H5 are used to store large datasets, ensuring efficient data access and storage\nJSON: A lightweight data-interchange format for storing hierarchical data structures, commonly used in bioinformatics tools\nHTML: A format used to create interactive reports that include both visualizations and textual descriptions of analysis results\nCode notebooks: Interactive documents combining code, visualizations, and explanatory text, aiding in data analysis reproducibility and documentation\nScripts: Text files containing sets of commands or code instructions for automating data processing and analysis tasks\n\nExplore more data types at the UCSC webpage. Check out this tutorial for more detailed explanations.", + "title": "Effective RDM Practices in NGS Analysis", + "section": "Practical tips for computational research", + "text": "Practical tips for computational research\n\n1. Experiments / raw data\nThoroughly document your datasets and the experimental setup to ensure reproducibility. Adhering to standards will ensure interoperability. Data types’ examples:\n\nElectronic Laboratory Notebook (ELN): digital description of the experimental design, and measurement devices. 
ELNs offer features like data entry, text editing, file attachments, collaboration tools, and search capabilities.\nLaboratory protocols: methodologies to prepare and manage samples.\nSamples: refers to the biological material (extraction of DNA, RNA, or proteins). Specification of sample identifier, sample type, source organism, etc.\nSequencing: details on the platform (e.g., Illumina, Oxford Nanopore), library preparation method, coverage, quality control metrics (e.g., Phred score)…\nRaw sequencing data: sequences and quality scores (e.g., FASTQ files)\n\n\n\n\n\n\n\nNote\n\n\n\nA metadata file is crucial during data analysis as it contains information about the experimental conditions (such as sequencing details, treatment, sample type, time points, tissue…).\n\n\n\n\n2. Input / Pre- and post-processing data\nExamples of data types generated during processing:\n\nQuality control metrics: to filter out potential artifacts and ensure the reliability of downstream analyses (e.g., bioinformatics tool like FastQC or MultiQC for results’ aggregation)\nData alignments: in genomics to determine the location of the read in the genome and in transcriptomics to identify gene expression levels.\nDNA analysis results: such as variant calling, genome annotation, functional genomics, phylogenetics, metagenomics, etc. Results are usually presented in tabular format.\nRNA Expression analysis results: from differential gene expression, gene ontology (GO) enrichment, alternative splicing, pathway analysis, etc. Results are usually presented in tabular format.\nEpigenetic profiling outputs: to assess gene regulation and chromatin structure (e.g., ChIP-Seq). Usually presented in BED format.\n\nThe interpretation of NGS data relies heavily on the results of data analysis, which are pivotal for understanding the biological significance of the findings and formulating hypotheses for further exploration. Clear and effective visualization methods are crucial for communicating and interpreting the vast amount of information generated by NGS experiments.\n\n\n\n\n\n\nOther types of data: databases and visualizations\n\n\n\n\n\n\n\n\nKnowledge databases\nA knowledge database is a structured repository of biological information that categorizes and annotates genes, proteins, and their functions, facilitating comprehensive understanding and analysis of biological systems. 
Here are five examples of knowledge databases:\n\n\nGene Ontology (GO): A comprehensive resource that classifies gene functions into defined terms, allowing for standardized annotation and comparison of genes across different organisms.\nDisease Ontology: A database that provides structured, standardized terminology for various diseases and their relationships, aiding in the systematic analysis of disease-related data.\nKEGG Pathways: A collection of manually curated pathway maps representing molecular interactions and reaction networks within cells, enabling the interpretation of high-throughput data in the context of biological systems.\nReactome: An open-access database that offers curated descriptions of biological processes, including pathways, reactions, and molecular events, facilitating the interpretation of large-scale biological data.\nUniProt: An extensive protein knowledgebase that provides detailed information about proteins, including their sequences, functions, and related annotations, supporting a wide range of biological research endeavors.\n\n\nVisualizations\n\n\nHeatmaps: frequently used to visualize gene expression patterns, epigenetic modifications, or microbial abundances across samples/conditions.\nVolcano Plots: commonly used in differential gene expression analysis\nGenome Browser Snapshots: display alignments and genomic features in genomic regions (e.g., gene annotations, ChIP-Seq peaks)\nNetwork Visualizations: utilized to explore gene regulatory networks or protein-protein interactions\nGenomic Annotations: to annotate genetic variations (functional impact on genes, genomic regions, or regulatory elements)\n\n\n\n\n\n\n\n\n3. Software and code\nBest practices for software and code management (don’t forget to read about FAIR software):\n\nCommenting your code: to enhance readability and comprehension\nMake your source code accessible using a repository (GitHub, GitLab, Bitbucket, SourceForge, etc.) that provides version control (VC) solutions. This step is one of the most important ones, as version control systems (Git or SVN) track changes in your code over time and enable collaboration and easy version management. Most Danish institutions provide courses on Git/GitHub; check yours! We also highly recommend reading this paper (Perez-Riverol et al. 2016).\nREADME file: with comprehensive information about the project, including installation instructions, usage examples or tutorials, licensing details, citation information, etc.\nRegister your code in a research software registry and include a clear and accessible software usage license: enabling other researchers to discover and reuse software packages (alongside metadata). More recommendations here.\nUse domain-relevant community standards to ensure consistency and interoperability (e.g., CodeMeta).\n\n\n\n\n\n\n\nGit and GitHub courses and other resources\n\n\n\n\n\n\n\n\nUniversity of Copenhagen\nAarhus University\nAalborg University\nDTU Git guidelines. Find more resources on the Berkeley Library website\n\n\n\n\n\n\n\n\n4. Pipelines and workflows\nYou might use standard workflows or generate new ones during data processing and data analysis steps.\n\nCode notebooks: tools for data documentation (e.g. 
Jupyter Notebook, Rmarkdown) enabling the combination of code with descriptive text and visualizations.\n\nIntegrated development environments (knitr or MLflow).\nPipeline frameworks or workflow management systems: designed to streamline and automate various steps involved in data analysis (data extraction, transformation, validation, visualization, and more). Additionally, they contribute to ensuring interoperability by facilitating seamless integration and interaction between different components or stages. There are two very popular systems, Nextflow and Snakemake.\n\nA great example of community-curated workflows is the nf-core community. Nf-core is a collaborative and open-source initiative comprising bioinformaticians and researchers dedicated to developing and maintaining a collection of curated and reproducible Nextflow-based pipelines for NGS data analysis, ensuring standardized and efficient data processing workflows.\n\n\n\n\n\n\nCourse on pipelines and workflows\n\n\n\nTake our course on Reproducible Research Practices LINK", "crumbs": [ "Use cases", "NGS data", - "NGS data strategies" + "Effective RDM Practices in NGS Analysis" ] }, { "objectID": "develop/examples/NGS_management.html#wrap-up", "href": "develop/examples/NGS_management.html#wrap-up", - "title": "NGS data strategies", - "section": "", - "text": "In this lesson, we have taken a look a the vast and diverse landscape of bioinformatics data.", + "title": "Effective RDM Practices in NGS Analysis", + "section": "Wrap up", + "text": "Wrap up\nIn this lesson, we have taken a look a the vast and diverse landscape of bioinformatics data.", "crumbs": [ "Use cases", "NGS data", - "NGS data strategies" + "Effective RDM Practices in NGS Analysis" ] }, { @@ -365,7 +365,7 @@ "href": "develop/examples/NGS_metadata.html", "title": "NGS Assay and Project metadata", "section": "", - "text": "Section Overview\n\n\n\n⏰ Time Estimation: X minutes\n💬 Learning Objectives:\n\nDevelop your metadata\n\n\n\nYou should consider revisiting these examples after completing lesson 4 in the course material. Please review these three tables containing pre-filled data fields for metadata, each serving distinct purposes: sample metadata, project metadata, and experimental metadata.\n\nSample metadata fields\nSome details might be specific to your samples. For example, which samples are treated, which are controlled, which tissue they come from, which cell type, the age, etc. Here is a list of possible metadata fields that you can use:\n\n\n\n\n\n\n\n\n\nMetadata field\nDefinition\nFormat\nOntology\nExample\n\n\n\n\nsample\nName of the sample\nNA\nNA\ncontrol_rep1, treat_rep1\n\n\nfastq_1\nPath to fastq file 1\nNA\nNA\nAEG588A1_S1_L002_R1_001.fastq.gz\n\n\nfastq_2\nPath to paired fastq file, if it is a paired experiment\nNA\nNA\nAEG588A1_S1_L002_R2_001.fastq.gz\n\n\nstrandedness\nThe strandedness of the cDNA library\n<unstranded OR forward OR reverse \\>\nNA\nunstranded\n\n\ncondition\nVariable of interest of the experiment, such as \"control\", \"treatment\", etc\nwordWord\ncamelCase\ncontrol, treat1, treat2\n\n\ncell_type\nThe cell type(s) known or selected to be present in the sample\nNA\nontology field- e.g. EFO or OBI\nNA\n\n\ntissue\nThe tissue from which the sample was taken\nNA\nUberon\nNA\n\n\nsex\nThe biological/genetic sex of the sample\nNA\nontology field- e.g. EFO or OBI\nNA\n\n\ncell_line\nCell line of the sample\nNA\nontology field- e.g. 
EFO or OBI\nNA\n\n\norganism\nOrganism origin of the sample\n<Genus species>\nTaxonomy\nMus musculus\n\n\nreplicate\nReplicate number\n<integer\\>\nNA\n1\n\n\nbatch\nBatch information\nwordWord\ncamelCase\n1\n\n\ndisease\nAny diseases that may affect the sample\nNA\nDisease Ontology or MONDO\nNA\n\n\ndevelopmental_stage\nThe developmental stage of the sample\nNA\nNA\nNA\n\n\nsample_type\nThe type of the collected specimen, eg tissue biopsy, blood draw or throat swab\nNA\nNA\nNA\n\n\nstrain\nStrain of the species from which the sample was collected, if applicable\nNA\nontology field - e.g. NCBITaxonomy\nNA\n\n\ngenetic variation\nAny relevant genetic differences from the specimen or sample to the expected genomic information for this species, eg abnormal chromosome counts, major translocations or indels\nNA\nNA\nNA\n\n\n\n\n\n\n\n\n\n\nProject metadata fields\nHere you will find a table with possible metadata fields that you can use to annotate and track your Project folders:\n\n\n\n\n\n\n\n\n\nMetadata field\nDefinition\nFormat\nOntology\nExample\n\n\n\n\nproject\nProject ID\n<surname\\>_et_al_2023\nNA\nproks_et_al_2023\n\n\nauthor\nOwner of the project\n<First name\\> <Surname\\>\nNA\nMartin Proks\n\n\ndate\nDate of creation\nYYYYMMDD\nNA\n20230101\n\n\ndescription\nShort description of the project\nPlain text\nNA\nThis is a project describing the effect of Oct4 perturbation after pERK activation\n\n\n\n\n\n\n\n\n\n\nAssay metadata fields\nHere you will find a table with possible metadata fields that you can use to annotate and track your Assay folders:\n\n\n\n\n\n\n\n\n\nMetadata field\nDefinition\nFormat\nOntology\nExample\n\n\n\n\nassay_ID\nIdentifier for the assay that is at least unique within the project\n<Assay-ID\\>_<keyword\\>_YYYYMMDD\nNA\nCHIP_Oct4_20200101\n\n\nassay_type\nThe type of experiment performed, eg ATAC-seq or seqFISH\nNA\nontology field- e.g. EFO or OBI\nChIPseq\n\n\nassay_subtype\nMore specific type or assay like bulk nascent RNAseq or single cell ATACseq\nNA\nontology field- e.g. EFO or OBI\nbulk ChIPseq\n\n\nowner\nOwner of the assay (who made the experiment?).\n<First Name\\> <Last Name\\>\nNA\nJose Romero\n\n\nplatform\nThe type of instrument used to perform the assay, eg Illumina HiSeq 4000 or Fluidigm C1 microfluidics platform\nNA\nontology field- e.g. EFO or OBI\nIllumina\n\n\nextraction_method\nTechnique used to extract the nucleic acid from the cell\nNA\nontology field- e.g. EFO or OBI\nNA\n\n\nlibrary_method\nTechnique used to amplify a cDNA library\nNA\nontology field- e.g. 
EFO or OBI\nNA\n\n\nexternal_accessions\nAccession numbers from external resources to which assay or protocol information was submitted\nNA\neg protocols.io, AE, GEO accession number, etc\nGSEXXXXX\n\n\nkeyword\nKeyword for easy identification\nwordWord\ncamelCase\nOct4ChIP\n\n\ndate\nDate of assay creation\nYYYYMMDD\nNA\n20200101\n\n\nnsamples\nNumber of samples analyzed in this assay\n<integer\\>\nNA\n9\n\n\nis_paired\nPaired fastq files or not\n<single OR paired\\>\nNA\nsingle\n\n\npipeline\nPipeline used to process data and version\nNA\nNA\nnf-core/chipseq -r 1.0\n\n\nstrandedness\nThe strandedness of the cDNA library\n<+ OR - OR *\\>\nNA\n*\n\n\nprocessed_by\nWho processed the data\n<First Name\\> <Last Name\\>\nNA\nSarah Lundregan\n\n\norganism\nOrganism origin\n<Genus species\\>\nTaxonomy name\nMus musculus\n\n\norigin\nIs internal or external (from a public resources) data\n<internal OR external\\>\nNA\ninternal\n\n\npath\nPath to files\n</path/to/file\\>\nNA\nNA\n\n\nshort_desc\nShort description of the assay\nplain text\nNA\nOct4 ChIP after pERK activation\n\n\nELN_ID\nID of the experiment/assay in your Electronic Lab Notebook software, like labguru or benchling\nplain text\nNA\nNA\n\n\n\n\n\n\n\n\nThe metadata must include key details such as the project’s short description, author information, creation date, experimental protocol, assay ID, assay type, platform utilized, library details, keywords, sample count, paired-end status, processor information, organism studied, sample origin, and file path.\nIf you would create a database from the metadata files, your table should look like this (each row corresponding to one project):\n\n\n\n\n\n\n\n\n\nassay_ID\nassay_type\nassay_subtype\nowner\nplatform\nextraction_method\nlibrary_method\nexternal_accessions\nkeyword\ndate\nnsamples\nis_paired\npipeline\nstrandedness\nprocessed_by\norganism\norigin\npath\nshort_desc\nELN_ID\n\n\n\n\nRNA_oct4_20200101\nRNAseq\nbulk RNAseq\nSarah Lundregan\nNextSeq 2000\nNA\nNA\nNA\noct4\n20200101\n9\npaired\nnf-core/chipseq 2.3.1\n*\nSL\nMus musculus\ninternal\nNA\nBulk RNAseq of Oct4 knockout\n234\n\n\nCHIP_oct4_20200101\nChIPseq\nbulk ChIPseq\nJose Romero\nNextSeq 2000\nNA\nNA\nNA\noct4\n20200101\n9\nsingle\nnf-core/rnaseq 3.12.0\n*\nJARH\nMus musculus\ninternal\nNA\nBulk ChIPseq of Oct4 overexpression\n123\n\n\nCHIP_med1_20190204\nChIPseq\nbulk ChIPseq\nMartin Proks\nNextSeq 2000\nNA\nNA\nNA\nmed1\n20190204\n12\nsingle\nnf-core/rnaseq 3.12.0\n*\nMP\nMus musculus\ninternal\nNA\nBulk ChIPseq of Med1 overexpression\n345\n\n\nSCR_humanSkin_20210302\nRNAseq\nsingle cell RNAseq\nJose Romero\nNextSeq 2000\nNA\nNA\nNA\nhumanSkin\n20210302\n23123\npaired\nnf-core/scrnaseq 1.8.2\n*\nJARH\nHomo sapiens\nexternal\nNA\nscRNAseq analysis of human skin development\nNA\n\n\nSCR_humanBrain_20220610\nRNAseq\nsingle cell RNAseq\nMartin Proks\nNextSeq 2000\nNA\nNA\nNA\nhumanBrain\n20220610\n1234\npaired\ncustom\n*\nMP\nHomo sapiens\nexternal\nNA\nscRNAseq analysis of human brain development\nNA\n\n\n\n\n\n\n\n\n\n\n\n\nCopyrightCC-BY-SA 4.0 license", + "text": "Section Overview\n\n\n\n⏰ Time Estimation: X minutes\n💬 Learning Objectives:\n\nDevelop your metadata\n\n\n\nYou should consider revisiting these examples after completing lesson 4 in the course material. 
Please review these three tables containing pre-filled data fields for metadata, each serving distinct purposes: sample metadata, project metadata, and experimental metadata.\n\nProject metadata fields\nHere you will find a table with possible metadata fields that you can use to annotate and track your Project folders:\n\n\n\n\n\n\n\n\n\nMetadata field\nDefinition\nFormat\nOntology\nExample\n\n\n\n\nproject\nProject ID\n<surname\\>_et_al_2023\nNA\nproks_et_al_2023\n\n\nauthor\nOwner of the project\n<First name\\> <Surname\\>\nNA\nMartin Proks\n\n\ndate\nDate of creation\nYYYYMMDD\nNA\n20230101\n\n\ndescription\nShort description of the project\nPlain text\nNA\nThis is a project describing the effect of Oct4 perturbation after pERK activation\n\n\n\n\n\n\n\n\n\n\nSample metadata fields\nSome details might be specific to your samples. For example, which samples are treated, which are controlled, which tissue they come from, which cell type, the age, etc. Here is a list of possible metadata fields that you can use:\n\n\n\n\n\n\n\n\n\nMetadata field\nDefinition\nFormat\nOntology\nExample\n\n\n\n\nsample\nName of the sample\nNA\nNA\ncontrol_rep1, treat_rep1\n\n\nfastq_1\nPath to fastq file 1\nNA\nNA\nAEG588A1_S1_L002_R1_001.fastq.gz\n\n\nfastq_2\nPath to paired fastq file, if it is a paired experiment\nNA\nNA\nAEG588A1_S1_L002_R2_001.fastq.gz\n\n\nstrandedness\nThe strandedness of the cDNA library\n<unstranded OR forward OR reverse \\>\nNA\nunstranded\n\n\ncondition\nVariable of interest of the experiment, such as \"control\", \"treatment\", etc\nwordWord\ncamelCase\ncontrol, treat1, treat2\n\n\ncell_type\nThe cell type(s) known or selected to be present in the sample\nNA\nontology field- e.g. EFO or OBI\nNA\n\n\ntissue\nThe tissue from which the sample was taken\nNA\nUberon\nNA\n\n\nsex\nThe biological/genetic sex of the sample\nNA\nontology field- e.g. EFO or OBI\nNA\n\n\ncell_line\nCell line of the sample\nNA\nontology field- e.g. EFO or OBI\nNA\n\n\norganism\nOrganism origin of the sample\n<Genus species>\nTaxonomy\nMus musculus\n\n\nreplicate\nReplicate number\n<integer\\>\nNA\n1\n\n\nbatch\nBatch information\nwordWord\ncamelCase\n1\n\n\ndisease\nAny diseases that may affect the sample\nNA\nDisease Ontology or MONDO\nNA\n\n\ndevelopmental_stage\nThe developmental stage of the sample\nNA\nNA\nNA\n\n\nsample_type\nThe type of the collected specimen, eg tissue biopsy, blood draw or throat swab\nNA\nNA\nNA\n\n\nstrain\nStrain of the species from which the sample was collected, if applicable\nNA\nontology field - e.g. NCBITaxonomy\nNA\n\n\ngenetic variation\nAny relevant genetic differences from the specimen or sample to the expected genomic information for this species, eg abnormal chromosome counts, major translocations or indels\nNA\nNA\nNA\n\n\n\n\n\n\n\n\n\n\nAssay metadata fields\nHere you will find a table with possible metadata fields that you can use to annotate and track your Assay folders:\n\n\n\n\n\n\n\n\n\nMetadata field\nDefinition\nFormat\nOntology\nExample\n\n\n\n\nassay_ID\nIdentifier for the assay that is at least unique within the project\n<Assay-ID\\>_<keyword\\>_YYYYMMDD\nNA\nCHIP_Oct4_20200101\n\n\nassay_type\nThe type of experiment performed, eg ATAC-seq or seqFISH\nNA\nontology field- e.g. EFO or OBI\nChIPseq\n\n\nassay_subtype\nMore specific type or assay like bulk nascent RNAseq or single cell ATACseq\nNA\nontology field- e.g. 
EFO or OBI\nbulk ChIPseq\n\n\nowner\nOwner of the assay (who made the experiment?).\n<First Name\\> <Last Name\\>\nNA\nJose Romero\n\n\nplatform\nThe type of instrument used to perform the assay, eg Illumina HiSeq 4000 or Fluidigm C1 microfluidics platform\nNA\nontology field- e.g. EFO or OBI\nIllumina\n\n\nextraction_method\nTechnique used to extract the nucleic acid from the cell\nNA\nontology field- e.g. EFO or OBI\nNA\n\n\nlibrary_method\nTechnique used to amplify a cDNA library\nNA\nontology field- e.g. EFO or OBI\nNA\n\n\nexternal_accessions\nAccession numbers from external resources to which assay or protocol information was submitted\nNA\neg protocols.io, AE, GEO accession number, etc\nGSEXXXXX\n\n\nkeyword\nKeyword for easy identification\nwordWord\ncamelCase\nOct4ChIP\n\n\ndate\nDate of assay creation\nYYYYMMDD\nNA\n20200101\n\n\nnsamples\nNumber of samples analyzed in this assay\n<integer\\>\nNA\n9\n\n\nis_paired\nPaired fastq files or not\n<single OR paired\\>\nNA\nsingle\n\n\npipeline\nPipeline used to process data and version\nNA\nNA\nnf-core/chipseq -r 1.0\n\n\nstrandedness\nThe strandedness of the cDNA library\n<+ OR - OR *\\>\nNA\n*\n\n\nprocessed_by\nWho processed the data\n<First Name\\> <Last Name\\>\nNA\nSarah Lundregan\n\n\norganism\nOrganism origin\n<Genus species\\>\nTaxonomy name\nMus musculus\n\n\norigin\nIs internal or external (from a public resources) data\n<internal OR external\\>\nNA\ninternal\n\n\npath\nPath to files\n</path/to/file\\>\nNA\nNA\n\n\nshort_desc\nShort description of the assay\nplain text\nNA\nOct4 ChIP after pERK activation\n\n\nELN_ID\nID of the experiment/assay in your Electronic Lab Notebook software, like labguru or benchling\nplain text\nNA\nNA\n\n\n\n\n\n\n\n\nThe metadata must include key details such as the project’s short description, author information, creation date, experimental protocol, assay ID, assay type, platform utilized, library details, keywords, sample count, paired-end status, processor information, organism studied, sample origin, and file path.\nIf you would create a database from the metadata files, your table should look like this (each row corresponding to one project):\n\n\n\n\n\n\n\n\n\nassay_ID\nassay_type\nassay_subtype\nowner\nplatform\nextraction_method\nlibrary_method\nexternal_accessions\nkeyword\ndate\nnsamples\nis_paired\npipeline\nstrandedness\nprocessed_by\norganism\norigin\npath\nshort_desc\nELN_ID\n\n\n\n\nRNA_oct4_20200101\nRNAseq\nbulk RNAseq\nSarah Lundregan\nNextSeq 2000\nNA\nNA\nNA\noct4\n20200101\n9\npaired\nnf-core/chipseq 2.3.1\n*\nSL\nMus musculus\ninternal\nNA\nBulk RNAseq of Oct4 knockout\n234\n\n\nCHIP_oct4_20200101\nChIPseq\nbulk ChIPseq\nJose Romero\nNextSeq 2000\nNA\nNA\nNA\noct4\n20200101\n9\nsingle\nnf-core/rnaseq 3.12.0\n*\nJARH\nMus musculus\ninternal\nNA\nBulk ChIPseq of Oct4 overexpression\n123\n\n\nCHIP_med1_20190204\nChIPseq\nbulk ChIPseq\nMartin Proks\nNextSeq 2000\nNA\nNA\nNA\nmed1\n20190204\n12\nsingle\nnf-core/rnaseq 3.12.0\n*\nMP\nMus musculus\ninternal\nNA\nBulk ChIPseq of Med1 overexpression\n345\n\n\nSCR_humanSkin_20210302\nRNAseq\nsingle cell RNAseq\nJose Romero\nNextSeq 2000\nNA\nNA\nNA\nhumanSkin\n20210302\n23123\npaired\nnf-core/scrnaseq 1.8.2\n*\nJARH\nHomo sapiens\nexternal\nNA\nscRNAseq analysis of human skin development\nNA\n\n\nSCR_humanBrain_20220610\nRNAseq\nsingle cell RNAseq\nMartin Proks\nNextSeq 2000\nNA\nNA\nNA\nhumanBrain\n20220610\n1234\npaired\ncustom\n*\nMP\nHomo sapiens\nexternal\nNA\nscRNAseq analysis of human brain 
development\nNA\n\n\n\n\n\n\n\n\n\n\nSources\n\nTranscriptomics metadata standards and fields\nBiological ontologies for data scientists,Bionty\n\n\n\n\n\nCopyrightCC-BY-SA 4.0 license", "crumbs": [ "Use cases", "NGS data", @@ -432,7 +432,7 @@ "href": "develop/03_DOD.html#template-engine", "title": "3. Data organization and storage", "section": "Template engine", - "text": "Template engine\nSetting up folder structures manually for each new project can be time-consuming. Thankfully, tools like Cookiecutter offer a solution by allowing users to create project templates easily. These templates can ensure consistency across projects and save time. Additionally, using cruft alongside Cookiecutter can assist in maintaining older templates when updates are made (by synchronizing them with the latest version).\n\n\n\n\n\n\nCookiecutter templates\n\n\n\n\nCookiecutter template for Data science projects\nBrickmanlab template for NGS data: similar to the folder structures in the examples above. You can download and modify it to suit your needs.\n\n\n\n\nQuick tutorial on cookiecutter\n\n\n\n\n\n\nSandbox Tutorial\n\n\n\nLearn how to create your own template here.\nWe offer workshops on practical RDM for NGS data. Keep an eye on the upcoming events on the Sandbox website.", + "text": "Template engine\nSetting up folder structures manually for each new project can be time-consuming. Thankfully, tools like Cookiecutter offer a solution by allowing users to create project templates easily. These templates can ensure consistency across projects and save time. Additionally, using cruft alongside Cookiecutter can assist in maintaining older templates when updates are made (by synchronizing them with the latest version).\n\n\n\n\n\n\nCookiecutter templates\n\n\n\n\nCookiecutter template for Data science projects\nBrickmanlab template for NGS data: similar to the folder structures in the examples above. You can download and modify it to suit your needs.\n\n\n\n\nQuick tutorial on cookiecutter\n\n\n\n\n\n\nSandbox Tutorial\n\n\n\nLearn how to create your own template here.\nWe offer workshops on practical RDM for biodata. Keep an eye on the upcoming events on the Sandbox website.", "crumbs": [ "Course material", "Key practices", @@ -456,7 +456,7 @@ "href": "develop/03_DOD.html#naming-conventions", "title": "3. Data organization and storage", "section": "Naming conventions", - "text": "Naming conventions\nConsistent naming conventions play a crucial role in scientific research by enhancing organization and data retrieval. By adopting standardized naming conventions, researchers ensure that files, experiments, or datasets are labeled logically, facilitating easy location and comparison of similar data. For instance, in fields like genomics, uniform naming conventions for files associated with particular experiments or samples allow for swift identification and comparison of relevant data, streamlining the research process and contributing to the reproducibility of findings. 
Overall, promotes efficiency, collaboration, and the integrity of scientific work.\n\n\n\n\n\n\nGeneral tips for file and folder naming\n\n\n\nRemember to keep the folder structure simple.\n\nKeep it short and meaningful (use understandable abbreviation only, e.g., Cor for correlations or LFC for Log Fold Change)\nConsider including one of these elements: project name, category, descriptor, content, author…\n\nAuthor-based: use initials\n\nUse alphanumeric characters: letters (A-Z) and numbers (0-9)\nAvoid special characters: ~!@#$%^&*()`“|\nDate-based format: use YYYYMMDD format (year/month/day format helps with sorting and listing files in chronological order)\nUse underscores and hyphens as delimiters and avoid spaces.\n\nNot all search tools may work well with spaces (messy to indicate paths)\nIf the length is a concern, use capital letters to delimit words camelCase.\n\nSequential numbering: Use a two-‑digit format for single-digit numbers (0–9) to ensure correct numerical sequence order (for example, 01 and not 1)\nVersion control: Indicate the version (“V”) or revision (“R”) as the last element, using the two-digit format (e.g., v01, v02)\nWrite down your naming convention pattern and document it in the README file\n\n\n\n\n\n\n\n\n\nDefine your file name conventions\n\n\n\n\n\n\n\nAvoid long and complicated names and ensure your file names are both informative and easy to manage:\n\nFor saving a new plot, a heatmap representing sample correlations\nWhen naming the file for the document containing the Research Data Management Course Objectives (Version 2, 2nd May 2024) from the University of Copenhagen\nConsider the most common file types you work with, such as visualizations, tables, etc., and create logical and clear file names\n\n\n\n\n\n\n\nHint\n\n\n\n\n\n\n\n\nheatmap_sampleCor_20240101.png\nKU_RDM-objectives_20240502_v02.doc or KU_RDMObj_20240502_v02.doc\n\n\n\n\n\n\n\n\n\n\n\n\n\nAdditional file naming conventions\n\n\n\n\n\n\n\n\n\n\n\nname\ndescription\nnaming_convention\nfile format\nexample\n\n\n\n\n.fastq\nraw sequencing reads\nnan\nnan\nsampleID_run_read1.fastq\n\n\n.fastqc\nquality control from fastqc\nnan\nnan\nsampleID_run_read1.fastqc\n\n\n.bam\naligned reads\nnan\nnan\nsampleID_run_read1.bam\n\n\nGTF\nsequence annotation\nnan\nnan\none of https://www.gencodegenes.org/\n\n\nGFF\nsequence annotation\nnan\nnan\none of https://www.gencodegenes.org/\n\n\n.bed\ngenome locations\nnan\nnan\nnan\n\n\n.bigwig\ngenome coverage\nnan\nnan\nnan\n\n\n.fasta\nsequence data (nucleotide/aminoacid)\nnan\nnan\none of https://www.gencodegenes.org/\n\n\nMultiqc report\nQC aggregated report\n<assayID\\>_YYYYMMDD.multiqc\nmultiqc\nRNA_20200101.multiqc\n\n\nCount matrix\nfinal count matrix\n<assayID\\>_cm_aligner_YYYYMMDD.tsv\ntsv\nRNA_cm_salmon_20200101.tsv\n\n\nDEA\ndifferential expression analysis results\nDEA_<condition1-condition2\\>_LFC<absolute_threshold\\>_p<pvalue decimals\\>_YYYYMMDD.tsv\ntsv\nDEA_treat-untreat_LFC1_p01_20200101.tsv\n\n\nDBA\ndifferential binding analysis results\nDBA_<condition1-condition2\\>_LFC<absolute_threshold\\>_p<pvalue decimals\\>_YYYYMMDD.tsv\ntsv\nDBA_treat-untreat_LFC1_p01_20200101.tsv\n\n\nMAplot\nMA plot\nMAplot_<condition1-condition2\\>_YYYYMMDD.jpeg\njpeg\nMAplot_treat-untreat_20200101.jpeg\n\n\nHeatmap plot\nHeatmap plot of anything\nheatmap_<type\\>_YYYYMMDD.jpeg\njpeg\nheatmap_sampleCor_20200101.jpeg\n\n\nVolcano plot\nVolcano plot\nvolcano_<condition1-condition2\\>_YYYYMMDD.jpeg\njpeg\nvolcano_treat-untreat_20200101.jpeg\n\n\nVenn 
diagram\nVenn diagram\nvenn_<type\\>_YYYYMMDD.jpeg\njpeg\nvenn_consensus_20200101.jpeg\n\n\nEnrichment table\nEnrichment results\nnan\ntsv\nnan", + "text": "Naming conventions\nConsistent naming conventions play a crucial role in scientific research by enhancing organization and data retrieval. By adopting standardized naming conventions, researchers ensure that files, experiments, or datasets are labeled logically, facilitating easy location and comparison of similar data. The importance of uniform naming conventions extends to many fields: in genomics or health data science, for example, uniformly named files associated with particular experiments or samples allow for swift identification and comparison of relevant data, streamlining the research process and contributing to the reproducibility of findings. Overall, this practice promotes efficiency, collaboration, and the integrity of scientific work.\n\n\n\n\n\n\nGeneral tips for file and folder naming\n\n\n\nRemember to keep the folder structure simple.\n\nKeep it short and meaningful (use understandable abbreviations only, e.g., Cor for correlations or LFC for Log Fold Change)\nConsider including one of these elements: project name, category, descriptor, content, author…\n\nAuthor-based: use initials\n\nUse alphanumeric characters: letters (A-Z) and numbers (0-9)\nAvoid special characters: ~!@#$%^&*()`“|\nDate-based format: use YYYYMMDD format (year/month/day format helps with sorting and listing files in chronological order)\nUse underscores and hyphens as delimiters and avoid spaces.\n\nNot all search tools may work well with spaces (messy to indicate paths)\nIf the length is a concern, use capital letters to delimit words camelCase.\n\nSequential numbering: Use a two-digit format for single-digit numbers (0–9) to ensure correct numerical sequence order (for example, 01 and not 1, if your sequence only goes up to 99)\nVersion control: Indicate the version (“V”) or revision (“R”) as the last element, using the two-digit format (e.g., v01, v02)\nWrite down your naming convention pattern and document it in the README file\n\n\n\n\n\n\n\n\n\nCreate your own naming conventions\n\n\n\n\n\n\n\nConsider the most common types of files and folders you will be working with, such as visualizations, results tables, and processed files. Develop a logical and clear naming system for these files based on the tips provided above. 
Aim for concise and straightforward names to avoid complexity.\n\n\n\n\n\nTo learn more about naming conventions for NGS analysis and see additional examples, click here.", "crumbs": [ "Course material", "Key practices", diff --git a/site_libs/bootstrap/bootstrap.min.css b/site_libs/bootstrap/bootstrap.min.css index 620fe7b9..990d3a0d 100644 --- a/site_libs/bootstrap/bootstrap.min.css +++ b/site_libs/bootstrap/bootstrap.min.css @@ -1,4 +1,4 @@ -@import"https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.0.0/css/all.min.css";@import"https://fonts.googleapis.com/css2?family=Roboto:wght@300;400;500;700&display=swap";details>summary{list-style:none}details>summary::before{content:"> ";font-size:1.5em;margin:-5px 7px 0 0;color:#000;font-weight:bold}div.callout-exercise{border-left-color:#3eab1f !important}div.callout-exercise .callout-header{background-color:#9df980 !important;height:30px}.callout-exercise>.callout-header::before{font-family:"Font Awesome 5 Free";content:"";margin-right:10px}div.callout-exercise.callout-style-default div.callout-body{padding-bottom:0em !important;margin-bottom:-1.5em !important}.callout-hint>.callout-header::before{font-family:"Font Awesome 5 Free";content:"";margin-right:10px;color:#0c0b0c}div.callout-hint.callout-style-default>.callout-header{background-color:#f3f3f6 !important;height:25px}.callout-definition>.callout-header::before{font-family:"Font Awesome 5 Free";content:"";margin-right:10px;color:#bebcbc}div.callout-definition.callout-style-default>.callout-header{background-color:#fff !important;height:30px}/*! +@import"https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.0.0/css/all.min.css";@import"https://fonts.googleapis.com/css2?family=Roboto:wght@300;400;500;700&display=swap";details>summary{list-style:none}details>summary::before{content:"> ";font-size:1.5em;margin:-5px 7px 0 0;color:#000;font-weight:bold}div.callout-exercise{border-left-color:#3eab1f !important}div.callout-exercise .callout-header{background-color:#9df980 !important;height:30px}.callout-exercise>.callout-header::before{font-family:"Font Awesome 5 Free";content:"";margin-right:10px}div.callout-exercise.callout-style-default div.callout-body{padding-bottom:0em !important;margin-bottom:-1.5em !important}.callout-hint>.callout-header::before{font-family:"Font Awesome 5 Free";content:"";margin-right:10px;color:#0c0b0c}div.callout-hint.callout-style-default>.callout-header{background-color:#f3f3f6 !important;height:25px}.callout-definition>.callout-header::before{font-family:"Font Awesome 5 Free";content:"";margin-right:10px;color:#bebcbc}div.callout-definition.callout-style-default>.callout-header{background-color:#fff !important;height:26px}div.callout-definition{font-size:15px}.callout-readme>.callout-header::before{font-family:"Courier New",Courier,monospace;margin-right:10px;color:#606060}div.callout-readme.callout-style-default>.callout-header{background-color:#e3e3e3 !important;font-family:"Courier New",Courier,monospace;font-size:22px;height:30px}div.callout-readme p{font-family:"Courier New",Courier,monospace;font-size:14px;margin-bottom:.5em !important}/*! 
* Bootstrap v5.3.1 (https://getbootstrap.com/) * Copyright 2011-2023 The Bootstrap Authors * Licensed under MIT (https://github.com/twbs/bootstrap/blob/main/LICENSE) diff --git a/sitemap.xml b/sitemap.xml index 77972346..4b93bad2 100644 --- a/sitemap.xml +++ b/sitemap.xml @@ -2,74 +2,74 @@ https://hds-sandbox.github.io/RDM_NGS_course/use_cases.html - 2024-04-25T07:09:28.032Z + 2024-04-26T14:17:18.988Z https://hds-sandbox.github.io/RDM_NGS_course/develop/06_pipelines.html - 2024-04-25T07:09:27.992Z + 2024-04-26T14:17:18.952Z https://hds-sandbox.github.io/RDM_NGS_course/develop/practical_workshop.html - 2024-04-25T07:09:28.032Z + 2024-04-26T14:17:18.988Z https://hds-sandbox.github.io/RDM_NGS_course/develop/04_metadata.html - 2024-04-25T07:09:27.992Z + 2024-04-26T14:17:18.952Z https://hds-sandbox.github.io/RDM_NGS_course/develop/05_VC.html - 2024-04-25T07:09:27.992Z + 2024-04-26T14:17:18.952Z https://hds-sandbox.github.io/RDM_NGS_course/develop/07_repos.html - 2024-04-25T07:09:27.992Z + 2024-04-26T14:17:18.952Z https://hds-sandbox.github.io/RDM_NGS_course/develop/examples/proteomics_metadata.html - 2024-04-25T07:09:28.008Z + 2024-04-26T14:17:18.968Z https://hds-sandbox.github.io/RDM_NGS_course/develop/examples/NGS_management.html - 2024-04-25T07:09:28.008Z + 2024-04-26T14:17:18.968Z https://hds-sandbox.github.io/RDM_NGS_course/cards/JARomero.html - 2024-04-25T07:09:27.992Z + 2024-04-26T14:17:18.948Z https://hds-sandbox.github.io/RDM_NGS_course/practical_workflows.html - 2024-04-25T07:09:28.032Z + 2024-04-26T14:17:18.988Z https://hds-sandbox.github.io/RDM_NGS_course/cards/AlbaMartinez.html - 2024-04-25T07:09:27.992Z + 2024-04-26T14:17:18.948Z https://hds-sandbox.github.io/RDM_NGS_course/develop/examples/NGS_OS_FAIR.html - 2024-04-25T07:09:28.008Z + 2024-04-26T14:17:18.968Z https://hds-sandbox.github.io/RDM_NGS_course/develop/examples/NGS_metadata.html - 2024-04-25T07:09:28.008Z + 2024-04-26T14:17:18.968Z https://hds-sandbox.github.io/RDM_NGS_course/develop/contributors.html - 2024-04-25T07:09:28.008Z + 2024-04-26T14:17:18.968Z https://hds-sandbox.github.io/RDM_NGS_course/develop/03_DOD.html - 2024-04-25T07:09:27.992Z + 2024-04-26T14:17:18.952Z https://hds-sandbox.github.io/RDM_NGS_course/develop/01_RDM_intro.html - 2024-04-25T07:09:27.992Z + 2024-04-26T14:17:18.952Z https://hds-sandbox.github.io/RDM_NGS_course/develop/02_DMP.html - 2024-04-25T07:09:27.992Z + 2024-04-26T14:17:18.952Z https://hds-sandbox.github.io/RDM_NGS_course/index.html - 2024-04-25T07:09:28.032Z + 2024-04-26T14:17:18.988Z diff --git a/use_cases.html b/use_cases.html index 3c181f2c..799e524a 100644 --- a/use_cases.html +++ b/use_cases.html @@ -177,7 +177,7 @@