Run the notebooks in the exact order below; the output files from each notebook are the inputs for the next.
- Place an `input.csv` file in the root folder. It must have two required columns: `prompt` and `response`.
- Run `spacy_classifiers.ipynb`. This splits the responses into clauses and writes them to `voice_classified.csv`.
- Run `abstraction_scores.ipynb`. This scores the clauses for abstraction and writes `abstraction_scored.csv`.
- Run `readability_scorer.ipynb`. This scores the clauses for readability and writes `readability_scored.csv`.
- Finally, run `final_output_nb.ipynb`. This produces two files, `output.csv` and `debug.csv`. `output.csv` is the minimal output: the split clauses, the final score, and the final voice chosen by the maximum internal score. `debug.csv` contains more granular details, including the score of each internal term.
Steps to set up the internal tool:
- Clone this project and move into the project folder.
- Run `pip install -r requirements.txt`.
- Run `python -c "import nltk; nltk.download('wordnet'); nltk.download('stopwords'); nltk.download('punkt')"`.
- Run `python -m spacy download <model-type>`. The model type can be `en_core_web_lg`, `en_core_web_md`, or `en_core_web_sm`.
- Run `python voice_identifier.py --help` to get started.
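Since any of the three model types works, a script can fall back gracefully to whichever one is installed. The helper below is a sketch, not part of this repo; it assumes models were installed with `python -m spacy download <model-type>` as above:

```python
import importlib.util

# Largest (most accurate) model first, smallest last.
PREFERRED = ["en_core_web_lg", "en_core_web_md", "en_core_web_sm"]

def pick_installed_model(candidates=PREFERRED):
    """Return the first candidate that is importable, or None if none are."""
    for name in candidates:
        if importlib.util.find_spec(name) is not None:
            return name
    return None
```

The result could then be passed to `spacy.load(...)` so the pipeline uses the best model available on the machine.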