-
Notifications
You must be signed in to change notification settings - Fork 9
Home
The state of the source code, especially in regard to the assets that the WAIL user interface utilizes, is disarray. Because of this, the active development/reorganization is being done on the osagnostic
branch. The reorganization hierarchy (for OS X) will be as-follows with the hope of squashing some long outstanding platform variances:
/Application/ - WAIL.app (executable) -- Contents/ (the core WAIL code/functionality) -- build/ (pyinstaller and icons) -- bundledApps/ --- heritrix-3.2.0/ ---- bin/ (the Heritrix executables are in here) ---- jobs/ (poll this directory for current/past jobs --- html/ (a remnant from warc-proxy) --- phantomjs/ (support for executing Heritrix jobs using the web interface) --- tomcat/ ---- webapps/ ----- ROOT/ (wayback src dir) ----- bin/ (wayback support) ----- lib/ (wayback support) ----- ...etc... (wayback support) --- warc-proxy/ -- support
This will allow a single icon
for WAIL in OS X but something similar should be done with Windows; however, Windows does not have this folder-is-an-app concept (via bundles), so something creative and aesthetic should be done.
There is also a plan to fetch the WAIL assets (e.g., Wayback, Heritrix) on first-run to provide the user the most up-to-date version. A binary will probably be needed, as we do not want to require the user to have development tools (e.g., Python, setup-tools, gcc) installed on their system. This should be facilitate by the above restructuring, as everything that needs to be fetch should reside in /bundledapps/.
As for the method to accomplish this, git
is too technical and likely not supported unless a library is included in WAIL. In that same regard, anything that is fetched will need to be placed in a very specific, portable location (relative to WAIL), and will need to have its configuration files setup by the WAIL script, so it might be best to keep the fetch procedure in WAIL, whether by http(s), git, curl, or some other method.
The user interface must be intuitive for casual PC users. Some UI guidelines should probably be consulted.
Tabs:
- Basic tab -- URL text field -- Archive Now! button -- Check Archived Status button -- View Archive button
- Advanced tab -- General Sub-tab --- Service Status, State, Version text fields and a Fix button for each: ---- Heritrix ---- Wayback ---- Tomcat -- Wayback Sub-tab --- View Wayback In Browser button --- Edit Wayback Configuration button --- Reset Wayback Configuration button -- Heritrix Sub-tab --- Listbox with all job ids previously run ---- When one is selected, metadata about the crawl is displayed to the right via text --- Setup New Crawl button --- Launch WebUI button --- Relaunch Process button -- Miscellaneous --- Launch WARC-Proxy button
WAIL is a project of the Web Science / Digital Libraries Research Group at Old Dominion University.
If you wish to find out more about the group you can
This work is supported by the National Endowment for the Humanities (NEH), through Digital Humanities grants HD-51670-13 and HK-50181-14.
Intro
Getting Started
Using WAIL
- Navigating WAIL
- Collections
- Adding To A Collection
- Heritrix
- Services
- Miscellaneous
- Twitter Archiving
- Twitter Authorization
- FAQ
Development