SeeU Track (November 2024 ~ May 2025)

In this porject, We develop AI model for AI CCTV system which is called SeeU track. For this project, the system is mainly focus on the re-identification task for vehicle and person. For Vehicle re-identification, we employed object detection model for detecting vehicle and its license plate, classification model to recognize the vehicle's brand, model, and color, and text detection and text recognition model (OCR) to read the license plate. Besides, for person re-identification, we applied object detection model to detect person and person attribute recognition model to recognize the person's attribute in terms of gender, age, type and color of cloth, etc.

1. Person, Vehicle, License plate detection with Object Detection Model

We exploit Object Detection model to detect person, vehicle, and its license plate to detect the target accurately. To select the most decent model in the desired environment, we searched and tested the open source model, and decide to use YOLOv11 model.

2. Vehicle recognition with Object Classification Model

We use Object Classification model to extract the MMC (maker, model, color) Information of the vehicle. For this task, many open source models are tested and YOLOv11 model is chosen since it has a higher accuracy than others.

3. License Plate Recognition

OCR model is used for this task. OCR model is consist of text detection and recognition. We tested and evaluated models and dbnet and svtr models are chosen.

4. Person Attribute Recognition

Person Attribute Recongition is the task to extract the information of the person including gender, age, type and color of cloth, accessory, etc. To develop the model, we tried different architectures to extract the feature information such as Resnet, vision transformers (ex. swin transformer), and etc. After many test, we decide to employ vision language model for extraction and transformer for classification task.

(PS) During the model development, we tested many scheme such as changing the architecutre, modifing number of layers, activation function, loss function, and etc to improve the performance.