Alright now you know how to train a deep learning model, what’s next. You wanna deploy it and check it how its performing. If its performing bad, how to proceed further. If its performing good, how to scale it.

In this blog we are gonna deal with all these stuff about deployment and model retraining for a high accuracy.

Place where you wanted to deploy your model usally.

Deploying the model in Deepstream.

Deepstream

The RTSP is a steam of video frames that are given to the video management software and AI models for inferences. Once the inference is done on the image, the database is stored and the inferences are pushed to custom analytical dashboard. Usually we have mutliple source of cameras and we build a pipeline for each of them. This can work for a small scale but when we have a large scale of cameras, this is not a good approach. We need better apporaches to handle this. If we can combine output from multiple cameras we can even get better results for applications like traffic management, crowd management etc.

Current Methods.

Features in Upgraded Deepstream

Now we have a warehouse where a lot of people keep moving in it. One of the core application is to track and detect the person all the time.

A detector module helps to draw a bouding boxes around a person and the tracker traces the path of the person. We find these modules often in any system. The tracker is used to check if a person enters an unauthorized area.

Now image a person moves from one room to another, since there is a switch of a person in cam 1 to 2, a new tracker is allocated to same person, which is not what we wanted. In such situations we need to have a multi camera tracking system.

Deploying the model in Deepstream.

Challenges in Multiple Tracking System

Challenges in Multiple Tracking System.

Generating CG Dataset and Training( Omniverse )

CG dataset Creation + Model Training.

Bulding the CG Dataset

One the challenges in training a model is to get a good datasets. We usually do have the real datasets which is blurry. The other problem is, when it comes to things like tracking the ground truth is not available. Annotating such tasks is very difficult. These are the reasons why we need to approach for CG dataset. Omniverse gives you a set of predefined enviroments to deal with. We can use these environments to generate the dataset.

Nvidia Omniverse, place to built virtual env.

Building a digital twin of the warehouse.

Building a Virtual Humans in warehouse.

Output of CCTV photages generated from CG environment.

Training the neural nets.

In further training, along with the real data we can use the CG dataset for training the neural network. This will help the model to learn the features of the object in a better way. Also there will be any mistakes in the dataset given by the CG.

New training pipeline.

RE - ID is a process where the model is used to detect the same person in multiple cameras. This is done by using the features of the person. The features are extracted from the person and stored in a database. When a person is detected in a new camera, the features are extracted and compared with the database. If the features are matched, then the person is the same.

Re-ID for person tracking.

Performance on multiple machines.

Building Multicamera perception system

Multiple camera perception system.

We need to upgrade the system from single camera to multiple camera. This is done by using the multiple camera perception system. This system is used to detect the objects in multiple cameras.

Multiple camera modules.

The multi tracking system have:

Pixel to Physical mapping: This is used to map the pixel to the physical location of the object. This is done by using the camera calibration.
Behavioural analysis: This is used to analyse the behaviour of the object. This is done by using the object detection and tracking.
Matching Process: This is used to match the object in multiple cameras. This is done by using the Re-ID.