In this demo, we present a real-time surveillance video parsing (RSVP) system to parse surveillance videos. Surveillance video pars- ing, which aims to segment the video frames into several labels, e.g., face, pants, left-legs, has wide applications[7], especially in security filed. However, it is very tedious and time-consuming to annotate all the frames in a video. We design RSVP system to parse the surveillance videos in real-time. The system, namely Single frame Video Parsing (SVP), requires only one labeled frame in training stage. SVP jointly considers the segmentation of preceding frames when parsing one particular frame within the video. The RSVP system is proved to be effective and efficient in real applications.

Deep Architecture