1. J784S4 Datasheet¶
1.1. Introduction¶
This section provides the performance numbers of device drivers supported in PDK
1.1.1. Setup Details¶
SOC Details | Values |
---|---|
Core | R5F |
Core Operating Speed | 1GHz |
DDR Speed | 4266 MTs |
VPAC Frequency | 650 MHz |
DMPAC Frequency | 520 MHz |
Cache status | Enabled |
Optimization Details | Values |
---|---|
Profile | Release |
Compile Options for R5F | -g -ms -DMAKEFILE_BUILD -c -qq -pdsw225 –endian=little -mv7R5 –abi=eabi -eo.oer5f -ea.ser5f –symdebug:dwarf –embed_inline_assembly –float_support=vfpv3d16 –emit_warnings_as_errors |
Linker Options for R5F | –emit_warnings_as_errors -w -q -u _c_int00 -c -mv7R5 –diag_suppress=10063 -x –zero_init=on |
Code Placement | DDR |
Data Placement | DDR |
1.1.2. Software Performance Numbers¶
1.1.2.1. UDMA¶
1.1.2.1.1. DMA Parameters¶
- Ring Order ID: 0
- Channel Order ID: 0
- Channel DMA Priority: 1
- Channel Bus Priority: 4
- Channel BUS QOS: 4
- Channel TX FIFO depth: 128
- Channel Fetch Word Size: 16
- Channel Burst Size: 64 bytes for normal channel, 128 bytes for HC and UHC channels
1.1.2.1.2. Test Parameters¶
- Type: TR15 Block copy
- TR: one TR per TRPD in PBR mode
- TR Memory: Same as buffer memory (DDR, MSMC or OCMC depends on the test performed)
- Transfer Size: 1 MB read and 1MB write
- 1MB means 1000x1000 bytes and 1KB means 1000 bytes
Note: Throughput numbers mentioned is the combined memory throughput of both read and write operations
1.1.2.1.3. DRU Blockcopy¶
DRU channel performance with TR submitted through ring
1.1.2.2. OSPI¶
1.1.2.2.1. OSPI Memory Non Cached Test Set-up¶
- Platform: J784S4 EVM.
- OS Type: Baremetal/FreeRTOS.
- Core : R5F_0 at 1 GHz, A72_0 at 2 GHz.
- Software/Application Used: OSPI_Flash_TestApp/OSPI_Flash_Dma_TestApp/OSPI_Baremetal_Flash_TestApp/OSPI_Baremetal_Flash_Dma_TestApp
- System Configuration: Cache OFF, Read/Write Buffer in DDR. DMA Enabled/Disabled, Interrupts ON.
1.1.2.2.2. OSPI Read/Write Performance (DDR Octal Mode)¶
OSPI RCLK | OS | CPU | Mode | Write Tput (MB/s) | Write CPU Load | Read Tput (MB/s) | Read CPU Load |
---|---|---|---|---|---|---|---|
133 MHz | Baremetal | R5F_0 | DAC | 7.368 | |||
DAC DMA | 264.458 | ||||||
INDAC | 0.560 | 23.467 | |||||
A72_0 | DAC | 5.508 | |||||
DAC DMA | 265.529 | ||||||
INDAC | 0.561 | 12.430 | |||||
RTOS | R5F_0 | DAC | 7.300 | 51% | |||
DAC DMA | 263.196 | 1% | |||||
INDAC | 0.576 | 100% | 23.866 | 51% | |||
166 MHz | Baremetal | R5F_0 | DAC | 8.570 | |||
DAC DMA | 329.430 | ||||||
INDAC | 0.560 | 23.862 | |||||
A72_0 | DAC | 6.163 | |||||
DAC DMA | 330.885 | ||||||
INDAC | 0.561 | 12.426 | |||||
RTOS | R5F_0 | DAC | 8.570 | 51% | |||
DAC DMA | 327.270 | 2% | |||||
INDAC | 0.576 | 100% | 23.871 | 51% |
1.1.2.2.3. OSPI Memory Cached Test Set-up¶
- Platform: J784S4 EVM.
- OS Type: Baremetal/FreeRTOS.
- Core : R5F_0 at 1 GHz, A72_0 at 2 GHz.
- Software/Application Used: OSPI_Flash_Cache_TestApp/OSPI_Flash_Dma_Cache_TestApp/OSPI_Baremetal_Flash_Cache_TestApp/OSPI_Baremetal_Flash_Dma_Cache_TestApp
- System Configuration: Cache ON, Read/Write Buffer in DDR. DMA Enabled/Disabled, Interrupts ON.
1.1.2.2.4. OSPI Read/Write Performance (DDR Octal Mode)¶
OSPI RCLK | OS | CPU | Mode | Write Tput (MB/s) | Write CPU Load | Read Tput (MB/s) | Read CPU Load |
---|---|---|---|---|---|---|---|
133 MHz | Baremetal | R5F_0 | DAC | 81.601 | |||
DAC DMA | 264.458 | ||||||
INDAC | 0.558 | 23.880 | |||||
RTOS | R5F_0 | DAC | 81.594 | 51% | |||
DAC DMA | 263.196 | 1% | |||||
INDAC | 0.568 | 100% | 23.837 | 51% | |||
166 MHz | Baremetal | R5F_0 | DAC | 93.099 | |||
DAC DMA | 329.223 | ||||||
INDAC | 0.560 | 23.884 | |||||
RTOS | R5F_0 | DAC | 92.843 | 51% | |||
DAC DMA | 327.373 | 1% | |||||
INDAC | 0.563 | 100% | 23.869 | 51% |
1.1.2.3. CPSW_9G¶
1.1.2.3.1. Test Setup¶
Hardware Configuration | Value |
---|---|
Processing Core | Main R5F0 Core 0 |
Core Frequency | 1 GHz |
Ethernet Interface Type | QSGMII at 1Gbps |
Packet buffer memory | DDR |
Hardware checksum offload | Yes |
Scatter-gather TX | Yes |
Scatter-gather RX | No |
Software Configuration | Value |
---|---|
RTOS | FreeRTOS |
RTOS application | Enet LLD lwIP example |
TCP/IP stack | lwIP 2.1.2 |
Host PC tool version | iperf v2.0.10 |
1.1.2.3.2. TCP Performance¶
Test | Bandwidth (Mbps) | CPU Load (%) |
---|---|---|
TCP RX | 125 | 39 |
TCP TX | 187 | 66 |
TCP Bidirectional | RX=132 TX=170 | 100 |
Host PC commands:
iperf -c <evm_ip> -r
iperf -c <evm_ip> -d
1.1.2.3.3. UDP Performance¶
Test | Datagram Length = 64B | Datagram Length = 256B | Datagram Length = 512B | Datagram Length = 1470B | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Bandwidth
(Mbps)
|
CPU
Load
(%)
|
Packet
Loss
(%)
|
Bandwidth
(Mbps)
|
CPU
Load
(%)
|
Packet
Loss
(%)
|
Bandwidth
(Mbps)
|
CPU
Load
(%)
|
Packet
Loss
(%)
|
Bandwidth
(Mbps)
|
CPU
Load
(%)
|
Packet
Loss
(%)
|
|
UDP RX | 5.24 | 22 | 0.0 | 26.2 | 38 | 0.0 | 26.2 | 25 | 0.0 | 26.2 | 17 | 0.0 |
10.5 | 36 | 0.0026 | 52.4 | 69 | 0.17 | 52.4 | 42 | 0.001 | 52.4 | 25 | 0.0 | |
15.7 | 49 | 0.061 | 105 | 105 | 76 | 0.11 | 105 | 40 | 0.0 | |||
UDP RX (Max) | 22 | 66 | 0.14 | 52.4 | 69 | 0.14 | 110 | 79 | 0.31 | 294 | 100 | 0.12 |
UDP TX (Max) | 45.7 | 100 | 0.0 | 112 | 100 | 0.0136 | 223 | 100 | 0.0498 | 622 | 100 | 0.0 |
Host PC commands:
Test with datagram length of 64B:
iperf -c <evm_ip> -u -l64 -b<bw> -r where <bw> is 5M, 10M, 15M, etc
Test with datagram length of 256B:
iperf -c <evm_ip> -u -l256 -b<bw> -r where <bw> is 25M, 50M, 100M, etc
Test with datagram length of 512B:
iperf -c <evm_ip> -u -l512 -b<bw> -r where <bw> is 25M, 50M, 100M, etc
Test with datagram length of 1470B (max):
iperf -c <evm_ip> -u -b<bw> -r where <bw> is 25M, 50M, 100M, etc
1.1.2.4. SBL OSPI Boot Performance App¶
1.1.2.4.1. Test Set-up¶
- Platform: J721S2 EVM.
- OS Type: Baremetal
- Core : R5F_0 at 1 GHz
- Software/Application Used: sbl_cust_img (with custom flags) and sbl_boot_perf_test appimage
1.1.2.4.2. GP EVM Performance¶
SBL Boot Time Breakdown | Time (ms) |
MCU_PORZ_OUT to MCU_RESETSTATz | 0.63 |
ROM : init + SBL load from OSPI | 8.768 |
SBL : SBL_SciClientInit: ReadSysfwImage | 0.053 |
Load/Start SYSFW | 4.120 |
Sciclient_init | 3.153 |
Board Config | 1.967 |
PM Config | 0.133 |
Security Config | 0.337 |
RM Config | 0.421 |
SBL: SoC Late-Init | 0.00 |
SBL : Board_init (pinmux) | 0.646 |
SBL : Board_init (PLL) | 1.142 |
SBL: Board_init (CLOCKS) | 1.445 |
SBL: OSPI init | 0.144 |
SBL: App copy to MCU SRAM & Jump to App | 2.672 |
Misc | 0.038 |
MCUSW: CAN response | 1.00 |
TOTAL time | 26.669 |
1.1.2.5. CSI-Rx¶
Capture Type | Configuration | CPU Load |
---|---|---|
CSI2Rx Inst 0 | 4CH 1080P30 IMX390 Sensor Raw12 | 1% (MCU2_0) |