Merge pull request #59 from TangmereCottage/master

Add real time object detection and tracking
abizovnuralem · Jul 24, 2024 · a233a24 · a233a24
2 parents 7d23f0f + e4715db
commit a233a24
Show file tree

Hide file tree

Showing 10 changed files with 348 additions and 3 deletions.
diff --git a/.gitignore b/.gitignore
@@ -3,4 +3,5 @@ log
 install
 .vscode
 __pycache__
-.env
+.env
+.DS_Store
diff --git a/README.md b/README.md
@@ -30,7 +30,7 @@ This repo will empower your Unitree GO2 AIR/PRO/EDU robots with ROS2 capabilitie
 14. Creating a PointCloud map and store it :white_check_mark:
 15. SLAM (slam_toolbox) (in the current version is not working, need to fix params)
 16. Navigation (nav2) (in the current version is not working, need to fix params)
-17. Object detection
+17. Object detection :white_check_mark:
 18. AutoPilot
 
 ## Your feedback and support mean the world to us. 
@@ -80,12 +80,16 @@ cd ros2_ws/src
 git clone --recurse-submodules https://github.com/abizovnuralem/go2_ros2_sdk.git
 cp -a go2_ros2_sdk/. .
 rm -r -f go2_ros2_sdk
+sudo apt install ros-humble-image-tools
+sudo apt install ros-humble-vision-msgs
 sudo apt install python3-pip clang
 pip install -r requirements.txt
 cd ..
 ```
 
-NOTE: check for any error messages, and do not disregard them. If `pip install` does not complete cleanly, various features will not work. For example, `open3d` does not yet support `python3.12` and therefroe you will need to set up a 3.11 `venv` first etc.
+NOTE 1: check for any error messages, and do not disregard them. If `pip install` does not complete cleanly, various features will not work. For example, `open3d` does not yet support `python3.12` and therefore you will need to set up a 3.11 `venv` first etc.
+
+NOTE 2: for real time object detection and tracking, please install [PyTorch](https://pytorch.org/).
 
 Install `rust` language support in your system: [instructions](https://www.rust-lang.org/tools/install) 
 
@@ -111,6 +115,41 @@ export CONN_TYPE="webrtc"
 ros2 launch go2_robot_sdk robot.launch.py
 ```
 
+## Real time image detection and tracking
+
+<p align="center">
+<img width="300" src="https://github.com/abizovnuralem/go2_ros2_sdk/doc_images/go2_air_giraffe.png" alt='Giraffe Detection and Tracking'>
+</p>
+
+This capability is directly based on [J. Francis's work](https://github.com/jfrancis71/ros2_coco_detector). Once you have launched the sdk, the color image data will be available at `go2_camera/color/image`. In another terminal enter:
+
+```bash
+source install/setup.bash
+ros2 run coco_detector coco_detector_node
+```
+
+There will be a short delay the first time the node is run for PyTorch TorchVision to download the neural network. You should see a downloading progress bar. This network is then cached for subsequent runs.
+
+On another terminal, to view the detection messages:
+```shell
+source install/setup.bash
+ros2 topic echo /detected_objects
+```
+The detection messages contain the detected object (`class_id`) and the `score`, a number from 0 to 1. For example: `detections:results:hypothesis:class_id: giraffe` and `detections:results:hypothesis:score: 0.9989`. The `bbox:center:x` and `bbox:center:y` contain the centroid of the object in pixels. These data can be used to implement real-time object following for animals and people. People are detected as `detections:results:hypothesis:class_id: person`.
+
+To view the image stream annotated with the labels and bounding boxes:
+```shell
+source install/setup.bash
+ros2 run image_tools showimage --ros-args -r /image:=/annotated_image
+```
+
+Example Use:
+
+```shell
+ros2 run coco_detector coco_detector_node --ros-args -p publish_annotated_image:=False -p device:=cuda -p detection_threshold:=0.7
+```
+
+This will run the coco detector without publishing the annotated image (it is True by default) using the default CUDA device (device=cpu by default). It sets the detection_threshold to 0.7 (it is 0.9 by default). The detection_threshold should be between 0.0 and 1.0; the higher this number the more detections will be rejected. If you have too many false detections try increasing this number. Thus only Detection2DArray messages are published on topic /detected_objects.
 
 ## 3D map generation
 To save the map, you need to:

diff --git a/coco_detector/README.md b/coco_detector/README.md
@@ -0,0 +1,134 @@
+# ros2_coco_detector
+Integrate PyTorch Torchvision MobileNet for Microsoft COCO object detection into the ROS2 environment
+
+## YouTube Demonstration
+<a href="https://www.youtube.com/watch?v=emUs0nwwde8">
+ <img src="https://img.youtube.com/vi/emUs0nwwde8/0.jpg" height=320>
+</a>
+
+## Summary
+
+This package performs object detection in the ROS2 environment. There are certainly more sophisticated object detection frameworks out there, eg.
+[NVIDIA Isaac](https://github.com/NVIDIA-ISAAC-ROS/isaac_ros_object_detection).
+
+The chief virtue of this package is the simplicity of the codebase and use of standardised ROS2 messages, with the goal of being simple to understand and to use.
+
+## Packages
+
+coco_detector: package containing coco_detector_node for listening on ROS2 topic /image and publishing ROS2 Detection2DArray message on topic /detected_objects. Also (by default) publishes Image (with labels and bounding boxes) message on topic /annotated_image. The object detection is performed by PyTorch using MobileNet.
+
+### Tested Hardware
+
+Dell Precision Tower 2210, NVIDIA RTX2070 (GPU is optional)
+
+### Tested Software
+
+Ubuntu 22.04, ROS2 Humble (RoboStack), PyTorch 2.1.2, CUDA 12.2 (CUDA is only needed if you require GPU)
+
+## Installation - historical reference only
+
+NOTE - outdated - not relevant - for historial reference only.
+
+Follow the [RoboStack](https://robostack.github.io/GettingStarted.html) installation instructions to install ROS2
+
+(Ensure you have also followed the step Installation tools for local development in the above instructions)
+
+Follow the [PyTorch](https://pytorch.org/) installation instructions to install PyTorch (selecting the conda option).
+
+```
+mamba activate ros2 # (use the name here you decided to call this conda environment)
+mamba install ros-humble-image-tools
+mamba install ros-humble-vision-msgs
+cd ~
+mkdir -p ros2_ws/src
+cd ros2_ws
+git -C src clone https://github.com/jfrancis71/ros2_coco_detector.git
+colcon build --symlink-install
+```
+You may receive a warning on the colcon build step: "SetuptoolsDeprecationWarning: setup.py install is deprecated", this can be ignored.
+
+The above steps assume a RoboStack mamba/conda ROS2 install. If using other installation process, replace the RoboStack image-tools and vision-msgs packages install steps with whichever command is appropriate for your environment. The image-tools package is not required for coco_detector, it is just used in the steps below for convenient demonstration. However vision-msgs is required (this is where the ROS2 DetectionArray2D message is defined)
+
+## Activate Environment
+
+```
+mamba activate ros2 # (use the name here you decided to call this conda environment)
+cd ~/ros2_ws
+source ./install/setup.bash
+```
+
+## Verify Install
+
+Launch a camera stream:
+```
+ros2 run image_tools cam2image
+```
+
+On another terminal enter:
+```
+ros2 run coco_detector coco_detector_node
+```
+There will be a short delay the first time the node is run for PyTorch TorchVision to download the neural network. You should see a downloading progress bar. This network is then cached for subsequent runs.
+
+On another terminal to view the detection messages:
+```
+ros2 topic echo /detected_objects
+```
+To view the image stream annotated with the labels and bounding boxes:
+```
+ros2 run image_tools showimage --ros-args -r /image:=/annotated_image
+```
+
+Example Use:
+
+```
+ros2 run coco_detector coco_detector_node --ros-args -p publish_annotated_image:=False -p device:=cuda -p detection_threshold:=0.7
+```
+
+This will run the coco detector without publishing the annotated image (it is True by default) using the default CUDA device (device=cpu by default). It sets the detection_threshold to 0.7 (it is 0.9 by default). The detection_threshold should be between 0.0 and 1.0; the higher this number the more detections will be rejected. If you have too many false detections try increasing this number. Thus only Detection2DArray messages are published on topic /detected_objects.
+
+
+## Suggested Setup For Mobile Robotics
+
+These suggestions are for a Raspberry Pi 3 Model B+ running ROS2.
+
+As of 16/02/2024, the PyTorch Conda install does not appear to be working for Raspberry Pi 3 Model B+.
+There may be other installation options, but I have not explored that.
+
+As an alternative if you have a ROS2 workstation connected to the same network, I suggest publishing the compressed image on the Raspberry Pi and running the COCO detector on the workstation.
+
+The below setup involves the ROS2 compression transport on both the Raspberry Pi and workstation. If using RoboStack ROS2 Humble you can install on each with:
+
+```mamba install ros-humble-compressed-image-transport```
+
+Raspberry Pi (run each command in seperate terminals):
+
+```ros2 run image_tools cam2image --ros-args -r /image:=/charlie/image```
+
+```ros2 run image_transport republish raw compressed --ros-args -r in:=/charlie/image -r out/compressed:=/charlie/compressed```
+
+Workstation (run each command in seperate terminals):
+
+```ros2 run image_transport republish compressed raw --ros-args -r /in/compressed:=/charlie/compressed -r /out:=/server/image```
+
+```ros2 run coco_detector coco_detector_node --ros-args -r /image:=/server/image```
+
+I have relabelled topic names for clarity and keeping the image topics on the different machines seperate. Compression is not necessary, but I have poor performance on my network without compression.
+
+Note you could use launch files (for convenience) to run the above nodes. I do not cover that here as it will be specific to your setup.
+
+## Notes
+
+The ROS2 documentation suggests that the ObjectHypotheses.class_id should be an identifier that the client should then look up in a database. This seems more complex than I have a need for. So this implementation just places the class label here directly, eg. class_id = "dog". See the ROS2 Vision Msgs Github link in the external links section below for more details.
+
+## External Links
+
+[COCO Dataset Homepage](https://cocodataset.org/#home)
+
+[Microsoft COCO: Common Objects in Context](http://arxiv.org/abs/1405.0312)
+
+[PyTorch MobileNet](https://pytorch.org/vision/stable/models/generated/torchvision.models.detection.fasterrcnn_mobilenet_v3_large_320_fpn.html)
+
+[MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications](https://arxiv.org/abs/1704.04861)
+
+[ROS2 Vision Msgs Github repo](https://github.com/ros-perception/vision_msgs)
diff --git a/coco_detector/coco_detector/__init__.py b/coco_detector/coco_detector/__init__.py
diff --git a/coco_detector/coco_detector/coco_detector_node.py b/coco_detector/coco_detector/coco_detector_node.py
@@ -0,0 +1,120 @@
+"""Detects COCO objects in image and publishes in ROS2.
+
+Subscribes to /image and publishes Detection2DArray message on topic /detected_objects.
+Also publishes (by default) annotated image with bounding boxes on /annotated_image.
+Uses PyTorch and FasterRCNN_MobileNet model from torchvision.
+Bounding Boxes use image convention, ie center.y = 0 means top of image.
+"""
+
+import collections
+import numpy as np
+import rclpy
+from rclpy.node import Node
+from sensor_msgs.msg import Image
+from vision_msgs.msg import BoundingBox2D, ObjectHypothesis, ObjectHypothesisWithPose
+from vision_msgs.msg import Detection2D, Detection2DArray
+from cv_bridge import CvBridge
+import torch
+from torchvision.models import detection as detection_model
+from torchvision.utils import draw_bounding_boxes
+
+Detection = collections.namedtuple("Detection", "label, bbox, score")
+
+class CocoDetectorNode(Node):
+ """Detects COCO objects in image and publishes on ROS2.
+
+ Subscribes to /image and publishes Detection2DArray on /detected_objects.
+ Also publishes augmented image with bounding boxes on /annotated_image.
+ """
+
+ # pylint: disable=R0902 disable too many instance variables warning for this class
+ def __init__(self):
+ super().__init__("coco_detector_node")
+ self.declare_parameter('device', 'cpu')
+ self.declare_parameter('detection_threshold', 0.9)
+ self.declare_parameter('publish_annotated_image', True)
+ self.device = self.get_parameter('device').get_parameter_value().string_value
+ self.detection_threshold = \
+ self.get_parameter('detection_threshold').get_parameter_value().double_value
+ self.subscription = self.create_subscription(
+ Image,
+ "/go2_camera/color/image",
+ self.listener_callback,
+ 10)
+ self.detected_objects_publisher = \
+ self.create_publisher(Detection2DArray, "detected_objects", 10)
+ if self.get_parameter('publish_annotated_image').get_parameter_value().bool_value:
+ self.annotated_image_publisher = \
+ self.create_publisher(Image, "annotated_image", 10)
+ else:
+ self.annotated_image_publisher = None
+ self.bridge = CvBridge()
+ self.model = detection_model.fasterrcnn_mobilenet_v3_large_320_fpn(
+ weights="FasterRCNN_MobileNet_V3_Large_320_FPN_Weights.COCO_V1",
+ progress=True,
+ weights_backbone="MobileNet_V3_Large_Weights.IMAGENET1K_V1").to(self.device)
+ self.class_labels = \
+ detection_model.FasterRCNN_MobileNet_V3_Large_320_FPN_Weights.DEFAULT.meta["categories"]
+ self.model.eval()
+ self.get_logger().info("Node has started.")
+
+ def mobilenet_to_ros2(self, detection, header):
+ """Converts a Detection tuple(label, bbox, score) to a ROS2 Detection2D message."""
+
+ detection2d = Detection2D()
+ detection2d.header = header
+ object_hypothesis_with_pose = ObjectHypothesisWithPose()
+ object_hypothesis = ObjectHypothesis()
+ object_hypothesis.class_id = self.class_labels[detection.label]
+ object_hypothesis.score = detection.score.detach().item()
+ object_hypothesis_with_pose.hypothesis = object_hypothesis
+ detection2d.results.append(object_hypothesis_with_pose)
+ bounding_box = BoundingBox2D()
+ bounding_box.center.position.x = float((detection.bbox[0] + detection.bbox[2]) / 2)
+ bounding_box.center.position.y = float((detection.bbox[1] + detection.bbox[3]) / 2)
+ bounding_box.center.theta = 0.0
+ bounding_box.size_x = float(2 * (bounding_box.center.position.x - detection.bbox[0]))
+ bounding_box.size_y = float(2 * (bounding_box.center.position.y - detection.bbox[1]))
+ detection2d.bbox = bounding_box
+ return detection2d
+
+ def publish_annotated_image(self, filtered_detections, header, image):
+ """Draws the bounding boxes on the image and publishes to /annotated_image"""
+
+ if len(filtered_detections) > 0:
+ pred_boxes = torch.stack([detection.bbox for detection in filtered_detections])
+ pred_labels = [self.class_labels[detection.label] for detection in filtered_detections]
+ annotated_image = draw_bounding_boxes(torch.tensor(image), pred_boxes,
+ pred_labels, colors="yellow")
+ else:
+ annotated_image = torch.tensor(image)
+ ros2_image_msg = self.bridge.cv2_to_imgmsg(annotated_image.numpy().transpose(1, 2, 0),
+ encoding="rgb8")
+ ros2_image_msg.header = header
+ self.annotated_image_publisher.publish(ros2_image_msg)
+
+ def listener_callback(self, msg):
+ """Reads image and publishes on /detected_objects and /annotated_image."""
+ cv_image = self.bridge.imgmsg_to_cv2(msg, desired_encoding="rgb8")
+ image = cv_image.copy().transpose((2, 0, 1))
+ batch_image = np.expand_dims(image, axis=0)
+ tensor_image = torch.tensor(batch_image/255.0, dtype=torch.float, device=self.device)
+ mobilenet_detections = self.model(tensor_image)[0] # pylint: disable=E1102 disable not callable warning
+ filtered_detections = [Detection(label_id, box, score) for label_id, box, score in
+ zip(mobilenet_detections["labels"],
+ mobilenet_detections["boxes"],
+ mobilenet_detections["scores"]) if score >= self.detection_threshold]
+ detection_array = Detection2DArray()
+ detection_array.header = msg.header
+ detection_array.detections = \
+ [self.mobilenet_to_ros2(detection, msg.header) for detection in filtered_detections]
+ self.detected_objects_publisher.publish(detection_array)
+ if self.annotated_image_publisher is not None:
+ self.publish_annotated_image(filtered_detections, msg.header, image)
+
+
+rclpy.init()
+coco_detector_node = CocoDetectorNode()
+rclpy.spin(coco_detector_node)
+coco_detector_node.destroy_node()
+rclpy.shutdown()
diff --git a/coco_detector/package.xml b/coco_detector/package.xml
@@ -0,0 +1,18 @@
+<?xml version="1.0"?>
+<?xml-model href="http://download.ros.org/schema/package_format3.xsd" schematypens="http://www.w3.org/2001/XMLSchema"?>
+<package format="3">
+ <name>coco_detector</name>
+ <version>0.0.0</version>
+ <description>COCO detector</description>
+ <maintainer email="[email protected]">julian</maintainer>
+ <license>TODO: License declaration</license>
+
+ <test_depend>ament_copyright</test_depend>
+ <test_depend>ament_flake8</test_depend>
+ <test_depend>ament_pep257</test_depend>
+ <test_depend>python3-pytest</test_depend>
+
+ <export>
+ <build_type>ament_python</build_type>
+ </export>
+</package>
diff --git a/coco_detector/resource/coco_detector b/coco_detector/resource/coco_detector
diff --git a/coco_detector/setup.cfg b/coco_detector/setup.cfg
@@ -0,0 +1,4 @@
+[develop]
+script_dir=$base/lib/coco_detector
+[install]
+install_scripts=$base/lib/coco_detector
diff --git a/coco_detector/setup.py b/coco_detector/setup.py
@@ -0,0 +1,29 @@
+import os
+from glob import glob
+from setuptools import setup
+from setuptools import find_packages
+
+package_name = 'coco_detector'
+
+setup(
+ name=package_name,
+ version='0.0.0',
+ packages=find_packages(exclude=['test']),
+ data_files=[
+ ('share/ament_index/resource_index/packages',
+ ['resource/' + package_name]),
+ ('share/' + package_name, ['package.xml']),
+ ],
+ install_requires=['setuptools'],
+ zip_safe=True,
+ maintainer='julian',
+ maintainer_email='[email protected]',
+ description='COCO detector',
+ license='TODO: License declaration',
+ tests_require=['pytest'],
+ entry_points={
+ 'console_scripts': [
+ "coco_detector_node = coco_detector.coco_detector_node"
+ ],
+ },
+)
diff --git a/doc_images/go2_air_giraffe.png b/doc_images/go2_air_giraffe.png