A Malware Prediction model that predicted if the PE format file is malicious or legitimate.
A python based Machine learning model with classifier is used for detecting if the file (PE format) is malicious or not. Also confusion matrix is used to evaluate the model. Signature based analysis (Used MD5 hash).
Project is created with:
- Vscode
- Python: 3.8.3
- Random Forest
- Regression
- Decision Tree
- Gradient Boosting
- Ada Boost
- GNB
- k-NN
- pandas
- numpy
- scipy
- scikit
- pefile
- pickle
To run this project, install it locally using conda or pip (or use Anaconda Navigator to install the libraries packages and notebook).
$ conda install python=3.8.3
$ conda install pandas numpy matplotlib notebook
$ conda install pip
Use [pip](https://pypi.python.org/pypi/pip) to install any missing dependencies
Use [python] version 3
Dependencies
============
* pandas ```pip install pandas```
* numpy ```pip install numpy```
* pickle ```pip install pickle``` or ```pip install pickle-mixin```
* scipy ```pip install scipy```
* scikit ```pip install -U scikit-learn```
* pefile ```pip install pefile```
* seaborn ```pip install seaborn```
To run the program
===================
* ```python learn.py```
* ```python check.py PEfilename```