What the project does

Takes data from the PHAC Data Catalogue and presents them as a dynamic table for the public to explore.

Why the project is useful

Gives the public a way to explore and understand all the data holdings at PHAC

How it works

Put a copy of the data catalogue as a CSV in the root folder, saved as "data-catalogue.csv"
run extractCatalogue.py using Python, which will create output.json
rename the output.json to data.json when you're ready to overwrite the previous data.

extractCatalogue.py

This python script reads data-catalogue.csv and extracts the following columns of data (exact names of the headers in the catalogue):

Database/Dataset/System Name (English)
Acronym (English)
Description (English)
Keywords
Objective(s) (English)
Geographical Coverage
Data Quality Checks or Assessments
Frequency of Data Collection
Data sources
Open government status
Programming Language
Years/Cycle Available
Availability of Indigenous Variables/Data
Availability of Sex and Gender-based Analysis Plus (SGBA+) Data
Access Requirement
Data is Accessible to
Intended Audience of Data Knowledge Translation Products and Publications
When was the Open Government Portal last updated?
Hyperlinks

The headers are then changed to plain language:

Dataset
Acronym
Description
Keywords
Objectives
Coverage
Quality Checks
Frequency
Sources
Open Status
Programming Language
Years Available
Indigenous Data
SGBA+ Data
Access
Accessible To
Audience
Category
Last Updated
Hyperlinks

The script excludes hyperlinks that don't have "http", which should exclude any internal links.

The script then converts these data to a JSON file and exports it to output.json.

display the data with datatables

When you're ready to update the data catalogue viewer, you can rename output.json to data.json, overwriting the previous catalogue. And that's all you need to do. Whenever someone goes to index.html in their browser it will use DataTables to build a dynamic and searchable table of the data catalogue.

Who maintains and contributes to the project

This project is run by the Data Transparency team at PHAC

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
data		data
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
approved-datasets.txt		approved-datasets.txt
approved-fields.csv		approved-fields.csv
custom.css		custom.css
extractCatalogue.py		extractCatalogue.py
favicon.ico		favicon.ico
index.html		index.html
scripts.js		scripts.js
search-highlight.js		search-highlight.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What the project does

Why the project is useful

How it works

extractCatalogue.py

display the data with datatables

Who maintains and contributes to the project

About

Releases

Packages

Languages

PHACDataHub/data-catalogue-prototype

Folders and files

Latest commit

History

Repository files navigation

What the project does

Why the project is useful

How it works

extractCatalogue.py

display the data with datatables

Who maintains and contributes to the project

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages