From 15f234114e5fa93becb3386a4046f435f9ef76b3 Mon Sep 17 00:00:00 2001
From: zeynepyardimcikaraca <157539242+zeynepyardimcikaraca@users.noreply.github.com>
Date: Sun, 17 Mar 2024 01:30:14 +0100
Subject: [PATCH 1/4] Update _index.md

I have corrected the broken hyperlink of github cheatsheet.
---
 content/docs/modules/week3/_index.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/content/docs/modules/week3/_index.md b/content/docs/modules/week3/_index.md
index b0333b7..5d6ca8d 100644
--- a/content/docs/modules/week3/_index.md
+++ b/content/docs/modules/week3/_index.md
@@ -39,7 +39,7 @@ __Tips & tricks__
 - Curious how to use Git with a graphical user interface?
     - Use Git directly from within R - [find out how!](https://swcarpentry.github.io/git-novice/14-supplemental-rstudio/)
     - Another fantastic Git client is [Sourcetreeapp](https://www.sourcetreeapp.com), which works on Windows, Mac and Linux!
-- [Git & Github cheatsheet](https://github.com/tilburgsciencehub/website/raw/master/content/building-blocks/collaborate-and-share-your-work/use-github/github_cheatsheet_tsh.pdf)
+- [Git & Github cheatsheet](https://github.com/hannesdatta/course-dprep/blob/master/content/docs/modules/week3/tutorial/git_cheatsheet.pdf)
 - Optional activity: [Contributing to an open source web site](activity)
 
 {{< /hint >}}

From 5437992f10874d3eb8b4ffa4c46710a6c4d624b3 Mon Sep 17 00:00:00 2001
From: zeynepyardimcikaraca <157539242+zeynepyardimcikaraca@users.noreply.github.com>
Date: Sun, 17 Mar 2024 01:32:02 +0100
Subject: [PATCH 2/4] Update _index.md

I have updated the broken hyperlink of github cheatsheet
---
 content/docs/modules/week3/_index.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/content/docs/modules/week3/_index.md b/content/docs/modules/week3/_index.md
index 5d6ca8d..5c57445 100644
--- a/content/docs/modules/week3/_index.md
+++ b/content/docs/modules/week3/_index.md
@@ -39,7 +39,7 @@ __Tips & tricks__
 - Curious how to use Git with a graphical user interface?
     - Use Git directly from within R - [find out how!](https://swcarpentry.github.io/git-novice/14-supplemental-rstudio/)
     - Another fantastic Git client is [Sourcetreeapp](https://www.sourcetreeapp.com), which works on Windows, Mac and Linux!
-- [Git & Github cheatsheet](https://github.com/hannesdatta/course-dprep/blob/master/content/docs/modules/week3/tutorial/git_cheatsheet.pdf)
+- [Git & Github cheatsheet](https://tilburgsciencehub.com/topics/automation/version-control/start-git/images/github_cheatsheet_tsh.pdf)
 - Optional activity: [Contributing to an open source web site](activity)
 
 {{< /hint >}}

From 6e67c91ff95927bdb38da9912bea60f2acf5b210 Mon Sep 17 00:00:00 2001
From: Hannes Datta
Date: Mon, 18 Mar 2024 14:33:59 +0100
Subject: [PATCH 3/4] update example questions

---
 content/docs/exam/examplequestions.md | 112 +++++++++-----------------
 1 file changed, 40 insertions(+), 72 deletions(-)

diff --git a/content/docs/exam/examplequestions.md b/content/docs/exam/examplequestions.md
index 00e67c1..75aee5c 100644
--- a/content/docs/exam/examplequestions.md
+++ b/content/docs/exam/examplequestions.md
@@ -8,77 +8,32 @@ draft: false
 
 # Example questions
 
-Questions will be asked along the course's learning goals, and complexity levels (e.g., knowledge, application, evaluation). For details, see [here](../exam#content).
+Questions will be asked along the course's learning goals, and complexity levels (e.g., comprehension, application, synthesis, evaluation). For details, see [here](../exam#content).
 
 Below, you can find a few example questions, which will be discussed with students in the final live stream of this course.
 
-## Theoretical part
 
 {{< hint warning >}}
 
-This part of the exam consists of __personalized open and closed (multiple-choice) questions__, shown in __random order__. You can freely go back and forth between these questions.
+This exam consists of __open and closed (multiple-choice) questions__. You can freely go back and forth between these questions.
 
 {{< /hint >}}
 
 ![](../dprep_part1.png)
 
 *Note: the number of questions depends on the points awarded to each question. The instructions during the final exam may slightly vary, so make sure to still read it accordingly.*
 
-1. Please name a tool that can be used to automate workflows. (*knowledge*)
-
-2. Please name three ways to deploy one's research findings. (*knowledge*)
-
-3. What are the main benefits of exploring data using RMarkdown documents, compared to “point-and-click” interfaces (e.g., SPSS), or manually investigating data by issuing commands in the R terminal? (*comprehension*)
-
-4. What are the benefits from automating pipelines, compared to manually executing source code files? (*comprehension*)
-
-5. Please view the code snippet below.
-
-```
-library(dplyr)
-df <- read.csv('data.csv')
-df <- df %>% filter(age >= 18)
-```
-Please assess the completeness of the script with regard to the ITO components of a source code file. Can you identify any missing piece in the code? (*analysis*)
-
-6. Please assess whether the makefile below will run when you type "make". (*analysis*)
-
-__Directory Structure__
-```
-\readme.md
-\code\makefile
-\code\load.R
-\data\dataset.csv
-```
-__Makefile__
-```
-data/dataset.csv: load.R
-	R --vanilla < load.R
-```
-
-![](../dprep_overview.png)
-
-## Practical part
-
-{{< hint warning >}}
-
-This part of the exam consists of __personalized open questions__, shown in __random order__. You can freely go back and forth between these questions.
-
-{{< /hint >}}
-
-![](../dprep_part2.png)
-
-*Note: the instructions during the final exam may slightly vary, so make sure to still read it accordingly.*
-
-### Question 1
-
-Imagine you have just enrolled as a thesis student, and you receive the following email from your advisor:
+1. Please download the `datasets.RData` workspace file from the exam cover page and open it in RStudio. Please answer the following questions using the objects in this R workspace.
+    1. Please use the dataset stored in `data1`. Using `dplyr`, reshape this dataset from wide to long. Paste the code snippet with the solution below.
+    2. Please use the dataset stored in `data2`. Using `dplyr`, please create an aggregated dataset, taking an average of `variable1` and `variable2` for all users in the data (i.e., you obtain a dataset with the number of rows equal to the number of users in the data).
+    3. Please take a look at `data3`. Please propose which data preparation steps are necessary to clean this data.
+2. Imagine you have just enrolled as a thesis student, and you receive the following email from your advisor. Submit your PDF document, and provide a conclusion on the suitability of the explored data for the research question.
 
 {{< hint >}}
 
 Dear (name of student),
 
-I really look forward to working with you on this exciting dataset, capturing the consumption of music on Spotify. I scraped it from spotifycharts.com a while ago.
+I really look forward to working with you on this exciting dataset, capturing the consumption of music on Spotify. I scraped it from spotifycharts.com a while ago. Please download this `data.zip`, which contains a stripped-down version of an RMarkdown file and the data.
 
 As a starting point, please explore the data set using RMarkdown. I’d love to learn more about the data myself (haven’t looked into it yet) - maybe you can figure out a way to shed some light on how the start of the global pandemic (let’s assume that was March 2020) affected music consumption?
 
@@ -86,11 +41,7 @@ Please render your RMarkdown as a PDF document. Please keep any code that you’
 
 {{< /hint >}}
 
-Submit your PDF document for question 1, and provide a conclusion on the suitability of the explored data for the research question? (*analysis*)
-
-### Question 2
-
-Imagine you are a research assistant at Tilburg University, and you receive the following email from your project supervisor:
+3. Please download the `github_repository.zip` file from the exam cover page and unzip it to a folder on this computer. Open this folder using Git Bash. Imagine you are a research assistant at Tilburg University, and you receive the following email from your project supervisor. Please submit your Git repository, by zipping the folder and uploading it here.
 
 {{< hint >}}
 
@@ -105,21 +56,38 @@ Starting from `run.R`, can you apply your learnings from dPrep, and submit a lin
 - Have a proper readme at the repository (in an `.md` file),
 - Ignore files that should not be versioned using .gitignore, and
 - remove `run.R` and replace it by a proper makefile for this project.
+- throughout, make use of frequent commits and commit messages.
 
-I really look forward seeing your work. Your deliverable is just a link to a (private!) GitHub repository, provided in the answer box below.
+I really look forward seeing your work. Your deliverable is the zipped Git repository, which you can upload in the answer box below.
 
 {{< /hint >}}
 
-a) Please submit your GitHub link with your end-to-end GitHub workflow using make (*application*)
-
-b) How could you determine whether the GitHub workflow runs well, beyond merely executing it yourself? (*evaluation*)
-
-
-
+4. Other example questions.
+    1. Please name three ways to deploy one's research findings. (*knowledge*)
+    2. What are the main benefits of exploring data using RMarkdown documents, compared to “point-and-click” interfaces (e.g., SPSS), or manually investigating data by issuing commands in the R terminal? (*comprehension*)
+    3. What are the benefits from automating pipelines, compared to manually executing source code files? (*comprehension*)
+    4. Please view the code snippet below and assess the completeness of the script with regard to the ITO components of a source code file. Can you identify any missing piece in the code? (*analysis*)
+
+    ```
+    library(dplyr)
+    df <- read.csv('data.csv')
+    df <- df %>% filter(age >= 18)
+    ```
+
+    5. Please assess whether the makefile below will run when you type "make". (*analysis*)
+
+    {{< hint >}}
+    Directory Structure:
+
+        \readme.md
+        \code\makefile
+        \code\load.R
+        \data\dataset.csv
+
+    Makefile:
+
+        data/dataset.csv: load.R
+            R --vanilla < load.R
+
+    {{< /hint >}}
+    
\ No newline at end of file

From 1a37ce635fd7ac4cc77306fd1152ad7c4647db27 Mon Sep 17 00:00:00 2001
From: Hannes Datta
Date: Mon, 18 Mar 2024 14:34:38 +0100
Subject: [PATCH 4/4] update slidedeck

---
 content/docs/modules/week7/slides.Rpres | 84 +++++++++++++---------
 content/docs/modules/week7/slides.html  | 95 ++++++++++++++++---------
 2 files changed, 111 insertions(+), 68 deletions(-)

diff --git a/content/docs/modules/week7/slides.Rpres b/content/docs/modules/week7/slides.Rpres
index 5bbaac4..fd23b8e 100644
--- a/content/docs/modules/week7/slides.Rpres
+++ b/content/docs/modules/week7/slides.Rpres
@@ -22,7 +22,9 @@ Welcome to the final lecture in dPrep!
 
 If you haven't done so, please **explore the exam page & example questions** at [https://dprep.hannesdatta.com/docs/exam]().
 
+