Skip to content

GoC-Spending/data-corporations

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Detailed Canadian Companies

Find companies that can supply your organization with the goods, services and technology it needs.

Install

NodeJS is required.

$ npm install

Download Data

$ npm run download

Login | retrieve new CookieJar
Get details | offset: 34475
Parsing links | total: 51709
Found links: 25
Get corporations: 24
Saving HTML: 293324750000
Saving HTML: 306180290000
Saving HTML: 234567150009
Saving HTML: 234567151960
Saving HTML: 234567067124

Process Data

  • Process HTML data to JSON
  • Convert JSON to CSV
$ npm run start

> [email protected] start /Users/mac/Github/data-corporations
> npm run metadata && npm run json2csv


> [email protected] metadata /Users/mac/Github/data-corporations
> node metadata.js

  processing [====================] 100% 51709/51709

> [email protected] json2csv /Users/mac/Github/data-corporations
> json2csv -i data/corporations.json > data/corporations.csv

Datasets

[
{
  "operatingName": "PACE (Providing Assistance Counselling and Education)",
  "telephone": "(780) 539-6692",
  "email": "[email protected]",
  "yearEstablished": 1981,
  "exporting": "No",
  "mailingAddress": "10031 103rd Ave GRANDE PRAIRIE, Alberta T8V 1B9 ",
  "locationAddress": "10031 103rd Ave GRANDE PRAIRIE, Alberta T8V 1B9 ",
  "primaryIndustry": "624190 - Other Individual and Family Services",
  "primaryIndustryNAICS": 624190,
  "primaryBusinessActivity": "Services",
  "filename": "234567004510.html"
}
]