In simplest worlds, It describes the world to the blind. It takes video as input and describes the world to the blind.  

describe what can people use it for, or how it makes existing tasks easier/safer e.t.c (markdown supported)

The problem it solves

lack of gpu for efficient training of model.
lack of good quality of data for efficient training.
lot's of bugs encountered but all fixed.