Template Name

Optional text description

Metadata

name	Template data
science reference	Paper citation
science DOI	doi:
data DOI	doi:
data URL	https://
keywords	foo, bar

Create dataset

Remove the following text, or replace with explanation of how datalad dataset is created
Replace the bash block with python or other preferred language
Keep the #+NAME: datalad-create name so that these blocks can be called via script

# datalad create -f -d . -c text2git -D "Template"
datalad create -f -d . -D "Cryo data template"

git remote add origin [email protected]:cryo-data/template.git

git add meta.json README.org
git commit -m "README & metadata"
git branch -M main
git push -u origin main

How to use this template

Pick a name for the new dataset
- Folder name should be “Author_YYYY” if possible, or some other common name for the dataset (e.g. ArcticDEM or MEaSUREs.0471)
- In this example, we’ll use Mankoff_2020, and convert https://dataverse01.geus.dk/dataset.xhtml?persistentId=doi:10.22008/promice/data/ice_discharge/d/v02 to a datalad dataset.
Copy template to clean directory using the new dataset name

dest="Mankoff_2020"
git clone https://github.com/cryo-data/template.git ${dest}
cd ${dest}

Set up a new repository, ideally under the cryo-data portal on GitHub, but can be hosted anywhere.
- Set up by visiting https://github.com/organizations/cryo-data/repositories/new and do **not** initialize with any files, then:

git remote rm origin
git remote add origin [email protected]:cryo-data/${dest}.git
git branch -M main
git push -u origin main

If the dataset already contains its own README.org, then keep that and rename this one to datalad.org

git mv README.org datalad.org
git commit datalad.org -m 'Rename README.org to datalad.org because dataset has README.org'

Populate the datalad dataset

NOTE
The code below should go into the create data section above.

The code below downloads each file using datalad download-url. This is one of many ways to populate a dalatad dataset.

TODO: Show examples for adding data that is accessible via an archive (Zenodo?) or data already in git.
TODO: Provide small shell or Python script in template folder for traversing and downloading remote directory structure (e.g. wget -r ...).

export SERVER=https://dataverse01.geus.dk
export DOI=10.22008/promice/data/ice_discharge/d/v02

curl ${SERVER}/api/datasets/:persistentId?persistentId=doi:${DOI} > dv.json
cat dv.json | tr ',' '\n' | grep -E '"persistentId"' | cut -d'"' -f4 > urls.txt
while read -r PID; do
  datalad download-url $SERVER/api/access/datafile/:persistentId?persistentId=${PID}
done < urls.txt
rm dv.json urls.txt # cleanup

Push changes

git push

Remove this section from the ./README.org file.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.datalad		.datalad
.gitattributes		.gitattributes
README.org		README.org
meta.json		meta.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Template Name

Metadata

Create dataset

How to use this template

About

Releases

Packages

cryo-data/template

Folders and files

Latest commit

History

Repository files navigation

Template Name

Metadata

Create dataset

How to use this template

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages