DatasheetΒΆ

This chapter describes the performance measurements of the Edge AI Inference demos.

Performance data of the demos can be auto generated by running following command on target:

root@j7-evm:/opt/edge_ai_apps/tests# ./gen_data_sheet.sh

The performence mesurments includes the following

  1. FPS : Effective framerate at which the application runs

  2. Total time : Average time taken to process each frame, which includes pre-processing, inference and post-processing time

  3. Inference time : Average time taken to infer each frame

  4. CPU loading : Loading on different CPU cores present

  5. DDR BW : DDR read and write BW used

  6. HWA Loading : Loading on different Hardware accelerators present

Following are the latest performance numbers of the C++ demos:

Source : USB Camera Capture Framerate : 30 fps Resolution : 720p format : JPEG

Model

FPS

Total time (ms)

Inference time (ms)

A72 Load (%)

DDR Read BW (MB/s)

DDR Write BW (MB/s)

DDR Total BW (MB/s)

C71 Load (%)

C66_1 Load (%)

C66_2 Load (%)

MCU2_0 Load (%)

MCU2_1 Load (%)

MSC_0 (%)

MSC_1 (%)

VISS (%)

NF (%)

LDC (%)

SDE (%)

DOF (%)

ONR-CL-6150-mobileNetV2-1p4-qat

30.48

33.06

3.02

32.30

2146

1069

3215

8.16

32.86

35.28

1.0

0.0

7.15

0

0

0

0

0

0

TFL-CL-0000-mobileNetV1-mlperf

30.46

33.05

1.51

32.92

2047

1069

3116

4.32

31.8

32.92

1.0

0.0

7.4

0

0

0

0

0

0

TFL-OD-2020-ssdLite-mobDet-DSP-coco-320x320

30.61

33.05

5.35

30.71

140

1129

1269

14.17

36.90

33.48

2.0

0.0

7.12

0

0

0

0

0

0

TVM-CL-3410-gluoncv-mxnet-mobv2

30.38

33.05

2.15

30.26

2084

1066

3150

6.0

34.21

32.93

2.0

0.0

7.14

0

0

0

0

0

0

Source : Video Video Framerate : 25 fps Resolution : 720p Encoding : h264

Model

FPS

Total time (ms)

Inference time (ms)

A72 Load (%)

DDR Read BW (MB/s)

DDR Write BW (MB/s)

DDR Total BW (MB/s)

C71 Load (%)

C66_1 Load (%)

C66_2 Load (%)

MCU2_0 Load (%)

MCU2_1 Load (%)

MSC_0 (%)

MSC_1 (%)

VISS (%)

NF (%)

LDC (%)

SDE (%)

DOF (%)

ONR-CL-6150-mobileNetV2-1p4-qat

25.47

39.37

3.02

27.4

2101

897

2998

6.48

10.98

29.83

2.0

0.0

6.8

0

0

0

0

0

0

TFL-CL-0000-mobileNetV1-mlperf

25.55

39.35

1.66

32.16

1999

885

2884

3.75

10.9

30.78

1.0

0.0

5.95

0

0

0

0

0

0

TFL-OD-2020-ssdLite-mobDet-DSP-coco-320x320

25.43

39.36

5.07

34.25

53

944

997

12.21

13.10

29.33

1.0

0.0

6.10

0

0

0

0

0

0

TVM-CL-3410-gluoncv-mxnet-mobv2

25.53

39.37

2.01

27.52

2041

893

2934

5.24

10.99

30.17

1.0

0.0

6.1

0

0

0

0

0

0

Source : CSI Camera (ov5640) Capture Framerate : 30 fps Resolution : 720p format : YUYV

Model

FPS

Total time (ms)

Inference time (ms)

A72 Load (%)

DDR Read BW (MB/s)

DDR Write BW (MB/s)

DDR Total BW (MB/s)

C71 Load (%)

C66_1 Load (%)

C66_2 Load (%)

MCU2_0 Load (%)

MCU2_1 Load (%)

MSC_0 (%)

MSC_1 (%)

VISS (%)

NF (%)

LDC (%)

SDE (%)

DOF (%)

ONR-CL-6150-mobileNetV2-1p4-qat

29.41

34.01

2.04

21.25

16

1096

1112

7.16

43.52

30.44

1.0

0.0

6.89

0

0

0

0

0

0

TFL-CL-0000-mobileNetV1-mlperf

29.37

34.08

1.01

21.50

2069

1095

3164

4.13

41.44

30.80

1.0

0.0

6.85

0

0

0

0

0

0

TFL-OD-2020-ssdLite-mobDet-DSP-coco-320x320

29.40

34.02

5.03

19.14

147

1151

1298

13.45

47.61

31.29

1.0

0.0

6.91

0

0

0

0

0

0

TVM-CL-3410-gluoncv-mxnet-mobv2

29.45

34.03

2.00

21.94

2104

1095

3199

5.74

42.91

30.73

1.0

0.0

6.85

0

0

0

0

0

0