Skip to content

Tasks: Data maintenance

James McKinney edited this page Nov 6, 2018 · 31 revisions

Tasks

If you encounter errors, refer to the Troubleshooting page. If the maintenance tasks return output, refer to the expected output below.

Regularly

Quarterly

Run the maintenance tasks of:

Check this master spreadsheet for manual requests. If a cell in the Next boundary column is highlighted in yellow, then its shapefile is out-of-date and must be requested. If the cell is highlighted in beige, then it's unknown whether its shapefile is out-of-date (and its shapefile is not online).

Expected output

Some of the maintenance tasks return output that is expected and requires no action.

represent-canada-data

invoke definitions

Richmond, NS districts                                       Expected slug to be Richmond districts not Richmond, NS districts
Paroisse de Plessisville districts                           Expected slug to be Plessisville districts not Paroisse de Plessisville districts

We need to disambiguate Richmond, NS from Richmond, QC and Paroisse de Plessisville, QC from Ville de Plessisville, QC.

Montréal boroughs and districts                              Expected slug to match \AMontréal (boroughs|districts)\Z not Montréal boroughs and districts
Montréal boroughs and districts                              Expected authority to be Ville de Montréal not Directeur général des élections du Québec

The boundary set is exceptional, since the Conseil municipal has representatives of the borough and district levels.

Newfoundland and Labrador electoral districts (2006)         Expected LICENSE.txt to match "all rights reserved" template
Newfoundland and Labrador electoral districts                Expected LICENSE.txt to match "all rights reserved" template
Quebec electoral districts (2017)                            Expected authority to be Her Majesty the Queen in Right of Quebec not Directeur général des élections du Québec
Quebec electoral districts                                   Expected authority to be Her Majesty the Queen in Right of Quebec not Directeur général des élections du Québec

Newfoundland and Labrador and Quebec have non-standard licensing arrangements.

Bonnyville No. 87 wards                                      Expected slug to be Bonnyville No. 87 divisions not Bonnyville No. 87 wards

Bonnyville No. 87 (represent-canada-private-data) has non-standard boundary names.

invoke urls

301 https://www.geosask.ca/Portal/jsp/terms_popup.jsp
307 http://chathamkentopendata-chatham-kent.opendata.arcgis.com/datasets/1f4428dd5d764320b4246d190cfb70cb_0
403 https://www.arcgis.com/sharing/rest/content/items/2ffb1ce148804fe4ade2414e6ef10d21/data

GeoSask and Chatham Kent's open data catalogs no longer exist. Saskatchewan electoral districts are now available from Elections Saskatchewan but without a license, so we continue to attribute GeoSask.

invoke spreadsheet

Census divisions          no extra
Census subdivisions       no extra
ocd-division/country:ca/cd:1205 no record
ocd-division/country:ca/cd:1207 no record
ocd-division/country:ca/cd:1210 no record
ocd-division/country:ca/cd:1211 no record
ocd-division/country:ca/cd:1212 no record
ocd-division/country:ca/cd:1214 no record
ocd-division/country:ca/cd:1215 no record
ocd-division/country:ca/cd:1216 no record
ocd-division/country:ca/cd:1218 no record

The spreadsheet only tracks federal, provincial, territorial and Census subdivision boundaries.

scrapers-ca

invoke tidy

County of Grande Prairie No. 1 Council                       Expected Grande Prairie County No. 1 Municipal district Council
Strathcona County Council                                    Expected Strathcona County Municipal Council
Langley Township Council                                     Expected Langley District Council
Haldimand County Council                                     Expected Haldimand County City Council

The task uses a jurisdiction's Census data to guess its legislature's proper name. However, Census data may not correspond with how the jurisdiction refers to its legislature.

Conseil municipal de Dollard-Des-Ormeaux                     Expected Conseil municipal de Dollard-Des Ormeaux

The task uses the English name of a jurisdiction. For Quebec jurisdictions, we use the French name.

ca_on_waterloo_region                                        Expected ca_on_waterloo

We use a different module name for the Region of Waterloo to avoid a collision with the City of Waterloo.

ca_mb                                                        Check: http://www.gov.mb.ca/legislature/
ca_nb                                                        Check: http://www.gnb.ca/legis/index.asp

If the jurisdiction's url contains a path, the task reminds you to double-check that the path is correct.

invoke sources_and_assertions

Expected 2 sources after 4 requests ca/people.py
Expected 2 sources after 3 requests ca_ab/people.py
Expected 2 sources after 3 requests ca_sk/people.py
  • ca: We don't add a source URL for the Twitter usernames. A request validates the photo.
  • ca_ab: A request retrieves the temporary CSV file.
  • ca_sk: A request caches the photo.

scrapers_ca_app

manage.py mappings

This command no longer exists, but we retain this documentation in case we restore it.

>1 subtypes for ocd-division/country:ca/csd:2466023: ['borough', 'district']

We haven't yet written code to handle these cases. The corresponding boundary sets will be excluding from the mappings.

No division ocd-division/country:ca/csd:2425213/borough:1 for https://represent.opennorth.ca/boundary-sets/desjardins-districts/
No division ocd-division/country:ca/csd:2425213/borough:2 for https://represent.opennorth.ca/boundary-sets/les-chutes-de-la-chaudiere-est-districts/
No division ocd-division/country:ca/csd:2425213/borough:3 for https://represent.opennorth.ca/boundary-sets/les-chutes-de-la-chaudiere-ouest-districts/
No subtypes for ocd-division/country:ca/cd:1212
No subtypes for ocd-division/country:ca/cd:1214
No subtypes for ocd-division/country:ca/csd:2432045
No subtypes for ocd-division/country:ca/csd:2470035

The identifiers don't yet exist in ocd-division-ids.

manage.py checkmappings

This command no longer exists, but we retain this documentation in case we restore it.

No match for ... from https://represent.opennorth.ca/boundaries/new-brunswick-electoral-districts-2006/?limit=0
No match for ... from https://represent.opennorth.ca/boundaries/newfoundland-and-labrador-electoral-districts-2006/?limit=0

You can ignore errors for historical boundaries.