https://www.kaggle.com/clmentbisaillon/fake-and-real-news-dataset
The dataset was merged: we concatenated the data and shuffled the rows so that their order does not introduce bias.
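A minimal sketch of the merge-and-shuffle step, assuming pandas. The two small frames are hypothetical stand-ins for the fake and real parts of the data. The `label` column, its 0/1 convention, and the seed are assumptions for illustration and do not come from these notes.

```python
import pandas as pd

# Hypothetical stand-ins for the fake and real article tables.
fake = pd.DataFrame({"text": ["fake story A", "fake story B"]})
real = pd.DataFrame({"text": ["real story A", "real story B"]})

# Assumed labeling convention: 0 = fake, 1 = real.
fake["label"] = 0
real["label"] = 1

# Concatenate both parts into one frame with a fresh 0..n-1 index.
merged = pd.concat([fake, real], ignore_index=True)

# Shuffle every row (frac=1) so the two classes are interleaved,
# then reset the index so it no longer reflects the original order.
shuffled = merged.sample(frac=1, random_state=42).reset_index(drop=True)
```

Fixing `random_state` keeps the shuffle reproducible between runs.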
decision tree
- https://anderfernandez.com/en/blog/code-decision-tree-python-from-scratch/
- http://www.odbms.org/wp-content/uploads/2014/07/DecisionTrees.pdf
- https://towardsdatascience.com/classification-decision-trees-easily-explained-f1064dde175e
- https://www.researchgate.net/publication/350287290_Fake_News_Classification_Using_Random_Forest_and_Decision_Tree_J48
splitting data
- https://stackabuse.com/scikit-learns-traintestsplit-training-testing-and-validation-sets/
- https://machinelearningmastery.com/train-test-split-for-evaluating-machine-learning-algorithms/
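A rough sketch of the split described in the links above, using scikit-learn's `train_test_split`. The toy data, the 80/20 ratio, the seed, and the use of `stratify` are assumptions for illustration. These notes do not state which settings were actually used.

```python
from sklearn.model_selection import train_test_split

# Hypothetical toy data: 10 documents with alternating labels.
texts = [f"article {i}" for i in range(10)]
labels = [i % 2 for i in range(10)]

# Hold out 20% of the rows for testing. stratify=labels keeps the
# class ratio the same in the training and test parts.
X_train, X_test, y_train, y_test = train_test_split(
    texts, labels, test_size=0.2, random_state=42, stratify=labels
)
```

A validation set can be carved out the same way by calling `train_test_split` a second time on the training part.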
opening files
- CSV to Feather