Skip to content

A system to scrape words off a production website and spellcheck them all. Because your site is already in production, and we don't really spellcheck pages when we write them in a code editor, do we?

Notifications You must be signed in to change notification settings

TheRealEdDawson/scrape_n_spell

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 

Repository files navigation

scrape_n_spell

A script to scrape words off a production website and spellcheck them all. Because your site is already in production, and we don't really spellcheck pages when we write them in a code editor, do we?

Usage

python scrape_n_spell.py http://www.example.com/page-to-spellcheck OUTPUTFILENAME.TXT

Where "http://www.example.com/page-to-spellcheck" is the web page you want to spell check, and "OUTPUTFILENAME.TXT" is the name of the text file where you want the results to be recorded.

Setup (for Linux command-line environment)

1. Install Python 3 (if not already pre-installed)

https://docs.python.org/3/using/unix.html#getting-and-installing-the-latest-version-of-python

2. Install pyenchant

pip install pyenchant 

3. Install Lynx (if not already pre-installed)

apt-get install lynx

(Ubuntu / Debian)

OR

yum install lynx

(Red Hat / CENTOS)

OR

brew install lynx 

(macOS)

OR

(use the package manager available to you)

About

A system to scrape words off a production website and spellcheck them all. Because your site is already in production, and we don't really spellcheck pages when we write them in a code editor, do we?

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages