phishing-detection-plugin/backend/dataset at master · kishore0549/phishing-detection-plugin

History

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
dataset.arff		dataset.arff
preprocess.ipynb		preprocess.ipynb
preprocess.py		preprocess.py

README.md

Preprocess Dataset

This a public phishing site dataset taken from UCI repository.

Download the dataset and save as dataset.arff. The preprocess.py loads the arff file and converts it to numpy array. Then dataset metadata is printed and then dataset is splited into training and testing set with 30% for testing.
Change working directory to /backend/dataset and Run the preprocessor with

python3 preprocess.py

Training and testing data *.npy files are created in the working directory.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dataset

dataset

README.md

Preprocess Dataset

Files

dataset

Directory actions

More options

Directory actions

More options

Latest commit

History

dataset

Folders and files

parent directory

README.md

Preprocess Dataset