Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adapt scraper for site layout change #5

Open
wants to merge 5 commits into
base: master
Choose a base branch
from

Conversation

ondenman
Copy link
Contributor

@ondenman ondenman commented May 12, 2017

The site layout has changed and the scraper no longer pulls in data. This PR updates CSS selectors in MembersPage and MemberDiv and also adds MemberPage.

Note:
The scraper was capturing contact form urls for members (e.g. http://www.tucamarapr.org/dnncamara/web/contacto.aspx?rep=35). The forms are still live but I cannot find a link to them in either the member rows or member pages. As such, I have dropped this field.

In addition to voice phone and fax, member pages also list a TTY number. (I've opened a PR in csv_to_popolo to handle TTY numbers: tmtmtmtm/csv_to_popolo#115)

@ondenman ondenman force-pushed the new-site-layout-changes branch 4 times, most recently from 45d8baa to bfd2014 Compare May 17, 2017 10:16
@ondenman ondenman force-pushed the new-site-layout-changes branch 2 times, most recently from d05e708 to 399a148 Compare May 19, 2017 10:27
@ondenman ondenman changed the title WIP: New site layout changes Adapt scraper for new site layout May 19, 2017
@ondenman ondenman force-pushed the new-site-layout-changes branch 2 times, most recently from 8329aad to a3d24b5 Compare May 19, 2017 13:24
@ondenman ondenman changed the title Adapt scraper for new site layout Adapt scraper for site layout change May 23, 2017
@ondenman ondenman force-pushed the new-site-layout-changes branch 4 times, most recently from ec98989 to 1eb4137 Compare May 23, 2017 10:03
@ondenman ondenman force-pushed the new-site-layout-changes branch 7 times, most recently from 7173a3c to c335ede Compare June 13, 2017 11:34
Oliver Denman added 5 commits July 7, 2017 15:37
Phone, fax, party and contact information are no longer listed in
individual member rows. Contact form urls are no longer listed in either
member rows or on member pages.

Phone, fax and party info will be captured in MemberPage which will be added
in a forthcoming commit.
@ondenman ondenman force-pushed the new-site-layout-changes branch 2 times, most recently from e4e1462 to 1274945 Compare July 7, 2017 14:38
@ondenman ondenman mentioned this pull request Jul 7, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant