The following roadmap incorporates most of the issues in GitHub and the projected developments in our grant proposals. As a rough timetable, it seems like we could get to v3.5 by midsummer 2017.
1.Scrubber stop word radio button settings were made persistent (#260).
2.Labeling of duplicate original sources in Manage fixed (#356).
3.MUFI display tables were proofread (#458) and the UI for reading the tables was enhanced with Ajax loading and sort, search, and paging capabilities.
4.Possible errors caused by empty documents generated by Cutter were prevented by trimming documents of leading and ending white space and milestones (#471).
5.In Cutter, the “Cut documents on this term” field is now hidden on page load (#486).
6. Scalar’s internal links in “In the Margins” are now converted to full urls opening in a new window in Lexos (#487).
7.Multicloud improvements: (a) form submission converted to Ajax; (b) warning dialog added to tell user about browser freezes; (c) wordCounts array generated server-side for better performance; (d) coerce MALLET data to UTF-8 (#498).
8.A download link was added to the Manage tool.
9.Scrubber and Cutter previews now display the name of the document label instead of the source file.
10. Improvements made to the Delete Selected
, Merge Selected Documents
, and Apply Class to Selected Documents
functions in Manage.
11. Fixed bug causing tokenization settings to fail in visualization tools (#519)
1.Finish the improved Windows installer (#490) (almost done).
- Migrate to python3 (#434). Done
- Follow PEP8 standards (#509) Done
- Scrubber hyphen and consolidation enhancements (#438). Priority
- Handle digits more robustly in Scrubber. (#460)
- Migrate all remaining form submissions to Ajax (#494). Priority
- Enable the user to select how to replace markup elements in Scrubber. (#457) Priority
- Adopt continuous integration for the workflow (#497). DONE
- Move dev docs and possibly installation materials to the Wiki or another repo (#489). Done
- Add document label truncation where necessary (#501). Priority
- File structure refactor (#541). Done
- Introduce error handling Ajax responses and other server errors (#495). Done (at least, the issue is closed)
- Refactor Javascript (#496).
- Re-implement TF-IDF in Normalize options (#444).
- Re-implement commented out silhouette score settings (#478).
- Add more averages to statistics table (#255).
- Investigate why d3.js v40 doesn’t work (#464).
- Develop a user setting to display MUFI fonts (#365).
- Decide what totals and averages will be displayed in the Tokenizer table (#462).
- Fix Document label ordering issue in Tokenizer (#470).
- Make some constants user-configurable (#472, #454).
- Begin indexing strategy (indirectly influences issues #429 and #476). Priority
- Investigate upload security issues for web scraper (see also issue #451, #500). Priority
- Investigate whether the Multicloud algorithm can be changed to prevent browser freezes (#484).
- How to handle number of leaves display in cluster (#503).
- Adopt modularization with Flask Blueprints (#508). DONE
- Implement webpack, or, more likely, a custom method of minifying and bundling static assets (#485).
- External tokenization library integration.
- Interactive dendrograms.
- Bootstrap consensus trees. Priority
- Web scraper UI.
- Ribbon diagram.
- Starting with commit 06bbde9cc9157c54ce2f3d00a2a831238e128c18 (21 January 2017), commits to master are not in Cheng’s Python 3 conversion. We should check whether they have been merged as of 7 July 2017.