Skip to content

Latest commit

 

History

History
112 lines (84 loc) · 3.5 KB

README.org

File metadata and controls

112 lines (84 loc) · 3.5 KB

Template Name

Optional text description

Metadata

nameTemplate data
science referencePaper citation
science DOIdoi:
data DOIdoi:
data URLhttps://
keywordsfoo, bar

Create dataset

  • Remove the following text, or replace with explanation of how datalad dataset is created
  • Replace the bash block with python or other preferred language
  • Keep the #+NAME: datalad-create name so that these blocks can be called via script
# datalad create -f -d . -c text2git -D "Template"
datalad create -f -d . -D "Cryo data template"

git remote add origin [email protected]:cryo-data/template.git

git add meta.json README.org
git commit -m "README & metadata"
git branch -M main
git push -u origin main

How to use this template

dest="Mankoff_2020"
git clone https://github.com/cryo-data/template.git ${dest}
cd ${dest}
git remote rm origin
git remote add origin [email protected]:cryo-data/${dest}.git
git branch -M main
git push -u origin main
  • If the dataset already contains its own README.org, then keep that and rename this one to datalad.org
git mv README.org datalad.org
git commit datalad.org -m 'Rename README.org to datalad.org because dataset has README.org'
  • Populate the datalad dataset
    NOTE
    The code below should go into the create data section above.

The code below downloads each file using datalad download-url. This is one of many ways to populate a dalatad dataset.

  • TODO: Show examples for adding data that is accessible via an archive (Zenodo?) or data already in git.
  • TODO: Provide small shell or Python script in template folder for traversing and downloading remote directory structure (e.g. wget -r ...).
export SERVER=https://dataverse01.geus.dk
export DOI=10.22008/promice/data/ice_discharge/d/v02

curl ${SERVER}/api/datasets/:persistentId?persistentId=doi:${DOI} > dv.json
cat dv.json | tr ',' '\n' | grep -E '"persistentId"' | cut -d'"' -f4 > urls.txt
while read -r PID; do
  datalad download-url $SERVER/api/access/datafile/:persistentId?persistentId=${PID}
done < urls.txt
rm dv.json urls.txt # cleanup
  • Push changes
git push