CS4243-Project

This repository contains different ML approaches to the Where is Waldo problem.

Cascade Classifier

Content under folder cascadeclassifier

Detection

Execute the command

python detector.py <classifier> <image> <3_digit_image_idx> <output_path>

Note: Classifier can be found in classifier/.

Training

Download the executables from here and extract them into cascadeclassifier\

Create bg.txt (negative examples)

python generate_bg.py <folder_with_neg_egs>

Note: bg.txt can be extended by running generate_bg.py on several folders of negatives.
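The background file read by opencv_traincascade is a plain text list with one negative image path per line. generate_bg.py itself is not reproduced here, so the following is only a minimal sketch of what such a script might do. The function name `append_backgrounds` and the list of image extensions are assumptions made for illustration.

```python
import os

# Assumed image extensions; adjust to match the actual negatives.
IMAGE_EXTENSIONS = (".jpg", ".jpeg", ".png", ".bmp")


def append_backgrounds(folder, bg_path="bg.txt"):
    """Append the path of every image in `folder` to `bg_path`, one per line.

    Hypothetical helper, not the repository's generate_bg.py.
    Returns the number of paths appended.
    """
    names = sorted(
        name for name in os.listdir(folder)
        if name.lower().endswith(IMAGE_EXTENSIONS)
    )
    # Mode "a" keeps earlier entries, so several folders can be combined.
    with open(bg_path, "a") as bg_file:
        for name in names:
            bg_file.write(os.path.join(folder, name) + "\n")
    return len(names)
```

Calling the function once per negatives folder mirrors running the script several times: each call adds lines without discarding the previous ones.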

Create waldo.vec (positive examples)

python generate_info.py <folder_with_pos_egs>  # for additional training images

python generate_info_xml.py  # for XML annotations provided by the course

opencv_createsamples -info info.dat -num <#_pos_egs> -w 50 -h 50 -vec waldo.vec
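The file passed to -info is a description file in which each line holds an image path, the number of annotated objects, and then x, y, width and height for each object. The generate_info scripts are not shown here, so this is only a sketch of how one such line could be formatted. The function name `format_info_line` and the example path are hypothetical.

```python
def format_info_line(image_path, boxes):
    """Format one line of an opencv_createsamples description file.

    `boxes` is a list of (x, y, width, height) tuples in pixels.
    Hypothetical helper, not taken from the repository.
    """
    parts = [image_path, str(len(boxes))]
    for x, y, w, h in boxes:
        parts.extend(str(int(v)) for v in (x, y, w, h))
    return " ".join(parts)


if __name__ == "__main__":
    # One Waldo bounding box at (10, 20) with size 50x50 (made-up values).
    print(format_info_line("pos/001.jpg", [(10, 20, 50, 50)]))
```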

Train cascade

opencv_traincascade -data classifier -vec waldo.vec -bg bg.txt -numPos <#_pos_egs> -numNeg <#_neg_egs> -numStages 50 -w 50 -h 50
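When the training call is scripted, it can help to assemble the command as an argument list. The sketch below uses only the flags shown in the command above; the function name `build_traincascade_command` is hypothetical, and the sample counts are placeholders to be filled in by the caller.

```python
def build_traincascade_command(num_pos, num_neg, num_stages=50, size=50,
                               data_dir="classifier", vec_file="waldo.vec",
                               bg_file="bg.txt"):
    """Return the opencv_traincascade call above as an argument list.

    Hypothetical helper; it only assembles arguments and runs nothing.
    """
    return [
        "opencv_traincascade",
        "-data", data_dir,
        "-vec", vec_file,
        "-bg", bg_file,
        "-numPos", str(num_pos),
        "-numNeg", str(num_neg),
        "-numStages", str(num_stages),
        "-w", str(size),
        "-h", str(size),
    ]
```

The returned list can be passed to subprocess.run(command, check=True) once the executables are on the PATH. The -w and -h values should match those given to opencv_createsamples.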

Code adapted from (https://github.com/CrzyDataScience/WhereIsWally)

HOG SVM Detector

Content under folder hog_svm

Detection

For sklearn version, execute the command

python test_HOG_SVM.py

For OpenCV version, execute the command

python OpenCV_test_HOG_SVM.py

Training

For sklearn version, execute the command

python train_HOG_SVM.py

For OpenCV version, execute the command

python OpenCV_train_HOG_SVM.py

Code adapted from (https://github.com/SamPlvs/Object-detection-via-HOG-SVM)

Template Matching

Pure template matching

python pure_tm.py -t <template folder> -i <3_digit_image_idx>

The output image is written to the template_matching/results folder.
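pure_tm.py is not reproduced here and presumably relies on OpenCV. To illustrate the underlying idea without any dependencies, the sketch below slides a template over a grayscale image stored as nested lists and keeps the window with the smallest sum of squared differences. The choice of that measure and the name `match_template_ssd` are assumptions for illustration, not the script's actual method.

```python
def match_template_ssd(image, template):
    """Return ((row, col), score) of the best-matching window.

    `image` and `template` are 2-D lists of grayscale values.
    The score is the sum of squared differences (lower is better).
    Illustrative only; far slower than an OpenCV-based implementation.
    """
    img_h, img_w = len(image), len(image[0])
    tpl_h, tpl_w = len(template), len(template[0])
    best_pos, best_score = None, float("inf")
    for r in range(img_h - tpl_h + 1):
        for c in range(img_w - tpl_w + 1):
            score = sum(
                (image[r + i][c + j] - template[i][j]) ** 2
                for i in range(tpl_h)
                for j in range(tpl_w)
            )
            if score < best_score:
                best_pos, best_score = (r, c), score
    return best_pos, best_score
```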

Scoring the result from cascade classifier

python base_line_scoring.py -t <template folder> -i <3_digit_image_idx>

This script takes template_matching/baseline.txt as input and writes a CSV file to template_matching/baseline.

Inside baseline folder:

python baseline2result.py -b <baseline_csv_file>

This script converts the baseline CSV file into the actual baseline.txt file.

Official Training for Cascade Classifier

Note: k = 2 for cross-validation.
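With k = 2, each image serves once for training and once for validation. How data_generator.py assigns images to folds is not stated, so the round-robin split below is an assumption, and `split_folds` is a hypothetical name.

```python
def split_folds(image_ids, k=2):
    """Return a list of (train_ids, val_ids) pairs, one per fold index.

    Images are assigned to folds round-robin; this assignment rule is
    an assumption, not necessarily what the repository does.
    """
    folds = [image_ids[i::k] for i in range(k)]
    splits = []
    for k_idx in range(k):
        val_ids = folds[k_idx]
        train_ids = [
            image_id
            for other_idx, fold in enumerate(folds)
            if other_idx != k_idx
            for image_id in fold
        ]
        splits.append((train_ids, val_ids))
    return splits
```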

Generate data required for training

python data_generator.py

No arguments are needed, and the script only has to be run once. It creates a data folder containing all the files and data required for training.

Folder structure: data -> k_idx -> character -> body_part -> bg/info files

Training

python trainer.py -w <width> -bt <booster> -minHitRate <minHitRate> -maxFalseAlarmRate <maxFalseAlarmRate> -mode <mode>

A single run trains models for all characters and body parts, with numStages = 17. A trained_models folder is created to hold all the trained models. numPos is the number of positive examples used for stage 0; subsequent stages consume additional positive examples. numPos and numNeg are calculated automatically from the available examples.
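Because later stages draw extra positives, numPos has to be smaller than the number of samples in the .vec file. A rule of thumb often cited in the OpenCV community is vec_count >= numPos + (numStages - 1) * (1 - minHitRate) * numPos + S, where S is the number of samples skipped as background. Whether trainer.py uses this exact formula is an assumption; the sketch below only solves the inequality for numPos, and `max_num_pos` is a hypothetical name.

```python
import math


def max_num_pos(vec_count, num_stages=17, min_hit_rate=0.995, skipped=0):
    """Largest numPos satisfying the rule of thumb quoted above.

    `skipped` stands for S, the samples expected to be rejected as
    background. The default min_hit_rate is only an example value.
    """
    usable = vec_count - skipped
    extra_per_pos = (num_stages - 1) * (1 - min_hit_rate)
    return math.floor(usable / (1 + extra_per_pos))
```

For example, with 1000 samples in the .vec file, 17 stages and a minimum hit rate of 0.995, the bound is 1000 / 1.08, so numPos should not exceed 925.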

Folder structure: trained_models -> parameters -> k_idx -> character -> body_part -> cascade.xml

Note: The parameters folder name has the format w_bt_minHitRate_maxFalseAlarmRate_mode.
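The folder layout and naming scheme above can be turned into a path with a few lines of code. This is a sketch under assumptions: `model_path` is a hypothetical helper, the parameter values are formatted with str(), and the body part name used in the example is made up.

```python
import os


def model_path(w, bt, min_hit_rate, max_false_alarm_rate, mode,
               k_idx, character, body_part, root="trained_models"):
    """Build the path to cascade.xml following the layout described above."""
    parameters = "_".join(
        str(value)
        for value in (w, bt, min_hit_rate, max_false_alarm_rate, mode)
    )
    return os.path.join(
        root, parameters, str(k_idx), character, body_part, "cascade.xml"
    )


if __name__ == "__main__":
    # "face" is a hypothetical body part name used only for this example.
    print(model_path(50, "GAB", 0.995, 0.5, "ALL", 0, "waldo", "face"))
```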

Validation

python validator.py

The validator generates and evaluates baselines for every model in the trained_models folder, for numStages = 10 to 17. A baseline folder is created to hold all the baselines, and an eval.txt file is generated containing the average mAP of the models (aggregated by training parameters).
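Aggregating by training parameters amounts to grouping the per-model mAP values by their parameters folder name and averaging each group. The format of eval.txt is not specified, so the sketch below works on in-memory pairs only; `average_map_by_parameters` and the input shape are assumptions.

```python
from collections import defaultdict


def average_map_by_parameters(results):
    """Average mAP values that share the same training parameters.

    `results` is an iterable of (parameters, map_value) pairs, for
    example one pair per fold. Returns {parameters: mean mAP}.
    """
    grouped = defaultdict(list)
    for parameters, map_value in results:
        grouped[parameters].append(map_value)
    return {
        parameters: sum(values) / len(values)
        for parameters, values in grouped.items()
    }
```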

Folder structure: baseline -> parameters -> k_idx -> waldo.txt + wenda.txt + wizard.txt

Ensemble Detector

python ensemble_detector.py -ii <image input text file>
python ensemble_detector_visualizer.py -ip <single input image path>

The ensemble detector runs the two-stage detection and writes the results to the corresponding baseline files. Baselines are stored in the baseline folder, templates in the template folder, and cascade classifier models in the classifier folder.

Format of each line in the image input text file: path\to\image img_idx
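Each line of the image input text file pairs an image path with its index. A minimal sketch of a parser for that format follows. `parse_image_list` is a hypothetical name, and splitting on the last run of whitespace (so that paths containing spaces still work) is an assumption about how the file should be read.

```python
def parse_image_list(lines):
    """Parse lines of the form 'path\\to\\image img_idx'.

    Returns a list of (path, img_idx) tuples. The index is kept as a
    string so that leading zeros are preserved. Blank lines are skipped.
    """
    entries = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        # Split once from the right: the last token is the image index.
        path, img_idx = line.rsplit(None, 1)
        entries.append((path, img_idx))
    return entries
```

A file can be parsed by passing the open file object directly, since iterating over it yields one line at a time.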

References

Training examples derived from (https://github.com/vc1492a/Hey-Waldo)
OpenCV's tutorial
