Machine-Learning-approach-for-Malware-Detection

A Machine Learning approach for classifying a file as Malicious or Legitimate.

This approach tries out 6 different classification algorithms before deciding which one to use for prediction by comparing their results. Different Machine Learning models tried are, Linear Regression, RandomForest, DecisionTree, Adaboost, Gaussian, Gradient Boosting.

In order to test the model on an unseen file, it's required to extract the characteristics of the given file. Python's pefile.PE library is used to construct and build the feature vector and a ML model is used to predict the class for the given file based on the already trained model.

Dependencies

pandas pip install pandas
numpy pip install numpy
pickle pip install pickle
scipy pip install scipy
scikit pip install -U scikit-learn

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
data.zip		data.zip
malware-classification.ipynb		malware-classification.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine-Learning-approach-for-Malware-Detection

Dependencies

About

Releases

Packages

Languages

surajr/Machine-Learning-approach-for-Malware-Detection

Folders and files

Latest commit

History

Repository files navigation

Machine-Learning-approach-for-Malware-Detection

Dependencies

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages