YOLOv4 TensorFlow SavedModel

Forked from the repo by hunglc007.

This fork generates an easy-to-use YOLOv4 TensorFlow SavedModel that accepts images of any size, works in both batched and non-batched mode, and returns person detections in a convenient format (a tf.RaggedTensor in batched mode).

Generate the SavedModel as follows:

python save_model.py --weights $weight_dir/yolov4.weights --output $output_path --input_size 416 --model yolov4

API Reference

Load the SavedModel as follows:

import tensorflow as tf

model = tf.saved_model.load('path_to_model')

Methods

model.predict_single_image

Performs person bounding box detection on an RGB image.

model.predict_single_image(
    image, threshold=0.1, nms_iou_threshold=0.65, flip_aug=False, bothflip_aug=False)

Arguments:

image: a uint8 Tensor of shape [H, W, 3] containing an RGB image.
threshold: a float32 value for thresholding detection scores; detections with a lower score are discarded.
nms_iou_threshold: a float value used in intersection-over-union (IoU) based non-max suppression (NMS). Values that are too low may cause false negatives when people are close to each other in the image, while values that are too high may produce duplicates (the same person detected multiple times).
flip_aug: a boolean specifying whether to also run the horizontally flipped image through the detector and aggregate the results (before the detector's NMS step).
bothflip_aug: a boolean specifying whether to also run the image through the detector with horizontal and vertical flipping (so 3 augmentations) and aggregate the results (before the detector's NMS step).
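
To build intuition for nms_iou_threshold, it can help to compute the IoU of two boxes by hand. The following is a minimal sketch, not code from this repository: the helper name iou is hypothetical, and it assumes boxes in the [left, top, width, height, ...] layout used by the return values of this API.

```python
def iou(box_a, box_b):
    """Intersection-over-union of two boxes given as [left, top, width, height, ...].

    Hypothetical helper for illustration; extra entries such as the
    confidence score are ignored.
    """
    a_left, a_top, a_width, a_height = box_a[:4]
    b_left, b_top, b_width, b_height = box_b[:4]

    # Width and height of the overlapping rectangle (zero if the boxes are disjoint).
    inter_w = max(0.0, min(a_left + a_width, b_left + b_width) - max(a_left, b_left))
    inter_h = max(0.0, min(a_top + a_height, b_top + b_height) - max(a_top, b_top))
    intersection = inter_w * inter_h

    union = a_width * a_height + b_width * b_height - intersection
    return intersection / union if union > 0 else 0.0


# Two made-up 10x10 boxes that overlap by half their width.
box_a = [0.0, 0.0, 10.0, 10.0]
box_b = [5.0, 0.0, 10.0, 10.0]
print(iou(box_a, box_b))
```

These two boxes have an IoU of 1/3, which is below the default nms_iou_threshold of 0.65, so IoU-based NMS with that threshold would typically treat them as two separate detections rather than duplicates.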

Return value:

boxes: [left, top, width, height, confidence] for each detection box. Shape is [num_detections, 5].
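
Drawing or cropping code often expects corner coordinates rather than [left, top, width, height]. The following is a minimal sketch of that conversion. It assumes the detections have first been converted to nested lists or a NumPy array (for example with boxes.numpy()), and the helper name boxes_to_corners is hypothetical, not part of this repository.

```python
def boxes_to_corners(boxes):
    """Convert rows of [left, top, width, height, confidence]
    to rows of [x1, y1, x2, y2, confidence].

    Hypothetical helper for illustration only.
    """
    corners = []
    for left, top, width, height, confidence in boxes:
        corners.append([left, top, left + width, top + height, confidence])
    return corners


# Made-up detections, not real model output.
example = [[10.0, 20.0, 30.0, 40.0, 0.9], [0.0, 0.0, 5.0, 5.0, 0.5]]
print(boxes_to_corners(example))
```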

model.predict_multi_image

The batched (multiple input images) equivalent of predict_single_image. Performs person detection on a batch of RGB images.

model.predict_multi_image(
    images, threshold=0.1, nms_iou_threshold=0.65, flip_aug=False, bothflip_aug=False)

Only the first argument is mandatory.

images: a batch of RGB images as a uint8 Tensor of shape [N, H, W, 3].
The remaining arguments have the same type and meaning as in predict_single_image (see above).

Return value:

boxes: [left, top, width, height, confidence] for each detection box. The result is a tf.RaggedTensor of shape [N, None, 5], where None stands for the ragged dimension (the image-specific number of detections).
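
Because each image can have a different number of detections, one simple way to consume the result is to convert the tf.RaggedTensor to nested Python lists with its to_list() method and iterate per image. The following is a minimal sketch under that assumption. The helper name summarize_detections is hypothetical, and the example data is made up rather than real model output.

```python
def summarize_detections(batch_boxes):
    """Summarize per-image detections.

    batch_boxes: one list per image, each holding rows of
    [left, top, width, height, confidence], e.g. the result of
    calling to_list() on the returned tf.RaggedTensor.
    Hypothetical helper for illustration only.
    """
    summary = []
    for image_index, boxes in enumerate(batch_boxes):
        # An image without detections yields an empty list.
        best = max((box[4] for box in boxes), default=None)
        summary.append(
            {"image": image_index, "count": len(boxes), "best_confidence": best}
        )
    return summary


# Made-up batch of two images: two detections in the first, none in the second.
example_batch = [
    [[10.0, 20.0, 30.0, 40.0, 0.9], [50.0, 60.0, 20.0, 30.0, 0.4]],
    [],
]
for row in summarize_detections(example_batch):
    print(row)
```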

