DPhate-double-paraphrasing-hate-speech

Bachelor's thesis on removing hate speech from online comments through paraphrasing, using the DPhate algorithm.

Usage

To recreate the data generated in the research paper (also available here), where the input is the set of hateful sentences from the HateXplain dataset, run:

python3 DPhate.py

To test the algorithm on your own examples, use the following Python code:

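from DPhate import DPhate

dphate = DPhate()
phrase = "I fucking love your mother."
# Score the phrase with the model exposed as dphate.modelD.
toxicity = dphate.modelD.predict(phrase)['toxicity']
# Bucket the score into 0.125-wide toxicity categories starting at 0.5.
toxCategory = int((toxicity - 0.5) // 0.125)
# Run DPhate on the phrase for that toxicity category.
dphate.predict(phrase, toxCategory)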
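
The category arithmetic in the snippet above can be checked without loading any model. The following is a minimal sketch that only reproduces that one expression; the helper name `toxicity_category` is hypothetical and is not part of DPhate.py.

```python
def toxicity_category(toxicity: float) -> int:
    """Bucket a toxicity score with the same arithmetic as the usage snippet.

    Hypothetical helper for illustration only; not defined in DPhate.py.
    """
    # Scores from 0.5 upward fall into 0.125-wide buckets numbered from 0.
    return int((toxicity - 0.5) // 0.125)


if __name__ == "__main__":
    # Print the category for a few sample scores.
    for score in (0.3, 0.5, 0.7, 0.75, 0.9, 1.0):
        print(score, toxicity_category(score))
```

With this formula, scores in [0.5, 1.0) map to categories 0 to 3, a score of exactly 1.0 maps to 4, and scores below 0.5 give negative values. Which category values `dphate.predict` accepts is not stated here, so it may be worth checking the result before passing it on.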

The fine-tuned T5 models are too big for GitHub and can be downloaded here. The download is a 2.3 GB zip file containing three different T5 models.


DrejcPesjak/DPhate-double-paraphrasing-hate-speech