Skip to content

Latest commit

 

History

History
113 lines (82 loc) · 17.3 KB

README_en.md

File metadata and controls

113 lines (82 loc) · 17.3 KB

简体中文 | English

Real Time Pedestrian Analysis Tool PP-Human

PP-Human is the industry's first open-sourced real-time pedestrian analysis tool based on PaddlePaddle deep learning framework. It has three major features: rich functions, wide application, and efficient deployment.

PP-Human supports various inputs such as images, single-camera, and multi-camera videos. It covers multi-object tracking, attributes recognition, behavior analysis, visitor traffic statistics, and trace records. PP-Human can be applied to fields including Smart Transportation, Smart Community, and industrial inspections. It can also be deployed on server sides and TensorRT accelerator. On the T4 server, it could achieve real-time analysis.

📣 Updates

  • 🔥 2022.7.13:PP-Human v2 launched with a full upgrade of four industrial features: behavior analysis, attributes recognition, visitor traffic statistics and ReID. It provides a strong core algorithm for pedestrian detection, tracking and attribute analysis with a simple and detailed development process and model optimization strategy.

  • 2022.4.18: Add PP-Human practical tutorials, including training, deployment, and action expansion. Details for AIStudio project please see Link

  • 2022.4.10: Add PP-Human examples; empower refined management of intelligent community management. A quick start for AIStudio Link

  • 2022.4.5: Launch the real-time pedestrian analysis tool PP-Human. It supports pedestrian tracking, visitor traffic statistics, attributes recognition, and falling detection. Due to its specific optimization of real-scene data, it can accurately recognize various falling gestures, and adapt to different environmental backgrounds, light and camera angles.

🔮 Features and demonstration

⭐ Feature 💟 Advantages 💡Example
ReID Extraordinary performance: special optimization for technical challenges such as target occlusion, uncompleted and blurry objects to achieve mAP 98.8, 1.5ms/person
Attribute analysis Compatible with a variety of data formats: support for images, video input

High performance: Integrated open-sourced datasets with real enterprise data for training, achieved mAP 94.86, 2ms/person

Support 26 attributes: gender, age, glasses, tops, shoes, hats, backpacks and other 26 high-frequency attributes
Behaviour detection Rich function: support five high-frequency anomaly behavior detection of falling, fighting, smoking, telephoning, and intrusion

Robust: unlimited by different environmental backgrounds, light, and camera angles.

High performance: Compared with video recognition technology, it takes significantly smaller computation resources; support localization and service-oriented rapid deployment

Fast training: only takes 15 minutes to produce high precision behavior detection models
Visitor traffic statistics
Trace record
Simple and easy to use: single parameter to initiate functions of visitor traffic statistics and trace record

🗳 Model Zoo

Single model results (click to expand)
Task Application Accuracy Inference speed(ms) Model size Inference deployment model
Object detection (high precision) Image input mAP: 57.8 25.1ms 182M Link
Object detection (Lightweight) Image input mAP: 53.2 16.2ms 27M Link
Object tracking (high precision) Video input MOTA: 82.2 31.8ms 182M Link
Object tracking (high precision) Video input MOTA: 73.9 21.0ms 27M Link
Attribute recognition (high precision) Image/Video input Attribute recognition mA: 95.4 Single person 4.2ms 86M Link
Attribute recognition (Lightweight) Image/Video input Attribute recognition mA: 94.5 Single person 2.9ms 7.2M Link
Keypoint detection Video input Attribute recognition AP: 87.1 Single person 5.7ms 101M Link
Classification based on key point sequences Video input Attribute recognition Accuracy: 96.43 Single person 0.07ms 21.8M Link
Detection based on Human ID Video input Attribute recognition Accuracy: 86.85 Single person 1.8ms 45M Link
Detection based on Human ID Video input Attribute recognition AP50: 79.5 Single person 10.9ms 27M Link
Video classification Video input Attribute recognition Accuracy: 89.0 19.7ms/1s Video 90M Link
ReID Video input ReID mAP: 98.8 Single person 0.23ms 85M Link
End-to-end model results (click to expand)
Task End-to-End Speed(ms) Model Size
Pedestrian detection (high precision) 25.1ms Multi-object tracking 182M
Pedestrian detection (lightweight) 16.2ms Multi-object tracking 27M
Pedestrian tracking (high precision) 31.8ms Multi-object tracking 182M
Pedestrian tracking (lightweight) 21.0ms Multi-object tracking 27M
Attribute recognition (high precision) Single person8.5ms Object detection
Attribute recognition
Object detection:182M
Attribute recognition:86M
Attribute recognition (lightweight) Single person 7.1ms Object detection
Attribute recognition
Object detection:182M
Attribute recognition:86M
Falling detection Single person 10ms Multi-object tracking
Keypoint detection
Behavior detection based on key points
Multi-object tracking:182M
Keypoint detection:101M
Behavior detection based on key points: 21.8M
Intrusion detection 31.8ms Multi-object tracking 182M
Fighting detection 19.7ms Video classification 90M
Smoking detection Single person 15.1ms Object detection
Object detection based on Human Id
Object detection:182M
Object detection based on Human ID: 27M
Phoning detection Single person ms Object detection
Image classification based on Human ID
Object detection:182M
Image classification based on Human ID:45M

Click to download the model, then unzip and save it in the . /output_inference.

📚 Doc Tutorials

Pedestrian attribute/feature recognition

Behavior detection

ReID

Pedestrian tracking, visitor traffic statistics, trace records