A Real-Time Visual Tracking Solution for Object Detection with Depth Camera via HHA images and Fast-RCNN Descriptor
The problem Object Detection Using Depth and Color Images solves
Can be used for better representation of crowded images in real-life situations like crowd controlling and traffic control
Better use of depth information for simultateous image detection in noisy images
Better side view prediction probablity
Can be used eventually to create state-of-the-art 3D real world object detection by providing faster and precise accuracy.
Check Video on GitHub for visualisation.
Challenges we ran into
YOLO fails to extract the relevant information from depth images. So we have to convert it into HHA image format.
It is difficult to combine RGB and depth images to train on single neural network. So we have to train two different neural networks and then combine their results to get more accuracy.