2.2. Performance Guide¶
2.2.1. Kernel Performance Guide¶
Read This First
All performance numbers provided in this document were gathered using the following Evaluation Modules unless otherwise specified.
Name | Description |
---|---|
AM335x | AM335x Evaluation Module rev 1.5B with ARM running at 1000MHz, DDR3-400 (400MHz/800 MT/S), TMDXEVM3358 |
AM437x-gpevm | AM437x-gpevm Evaluation Module rev 1.5A with ARM running at 1000MHz, DDR3-400 (400MHz/800 MT/S), TMDSEVM437X |
AM572x EVM | AM57xx Evaluation Module rev A2 with ARM running at 1500MHz, DDR3L-533 (533 MHz/1066 MT/S), TMDSEVM572x |
K2HK EVM | K2 Hawkings Evaluation Module rev 40 with ARM running at 1200MHz, DDR3-1600 (800 MHz/1600 MT/S), EVMK2H |
K2G EVM | K2 Galileo Evaluation Module rev C, DDR3-1333 (666 MHz/1333 MT/S), EVMK2G |
AM65x EVM | AM65x Evaluation Module rev 1.0 with ARM running at 800MHz, DDR4-2400 (1333 MT/S), TMDX654GPEVM |
Table: Evaluation Modules
About This Manual
This document provides performance data for each of the device drivers that are part of the Processor SDK Linux package. Use it in conjunction with the release notes and user guides provided with the Processor SDK Linux package, which describe specific issues present in the drivers included in a particular release.
If You Need Assistance
For further information or to report any problems, contact http://community.ti.com/ or http://support.ti.com/
2.2.1.1. System Benchmarks¶
2.2.1.1.1. LMBench¶
LMBench is a collection of microbenchmarks, of which the memory bandwidth and latency ones are typically used to estimate processor memory system performance.
Latency: lat_mem_rd-stride128-szN, where N is equal to or smaller than the cache size at a given level, measures the cache miss penalty. When N is at least double the size of the last-level cache, it measures the latency to external memory.
Bandwidth: bw_mem_bcopy-N, where N is equal to or smaller than the cache size at a given level, measures the achievable memory bandwidth from software doing a memcpy()-type operation. Its typical use is to calculate external memory bandwidth. Each byte that is read and then written counts as 1 byte, so the result should be roughly half of the STREAM copy result.
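The factor of two between the LMBench copy figure and the STREAM copy figure can be checked against the numbers in this guide. The following is a minimal sketch: the helper name `stream_equivalent` is chosen for illustration, and the two values are copied from the am335x-evm columns of the LMBench and STREAM tables.

```python
def stream_equivalent(bw_mem_bcopy_mb_s: float) -> float:
    """Convert an LMBench bcopy figure to STREAM-style accounting.

    LMBench counts a byte that is read and then written as 1 byte.
    STREAM counts the read and the write separately, so the same
    transfer scores roughly twice as high.
    """
    return 2.0 * bw_mem_bcopy_mb_s


# Values taken from the am335x-evm column of this guide.
lmbench_bcopy_16mb = 211.31  # bw_mem-bcopy-16mb (MB/s)
stream_copy = 441.80         # STREAM copy (MB/s)

estimate = stream_equivalent(lmbench_bcopy_16mb)
relative_gap = abs(estimate - stream_copy) / stream_copy

print(f"estimated STREAM copy: {estimate:.2f} MB/s")
print(f"measured STREAM copy:  {stream_copy:.2f} MB/s")
print(f"relative gap:          {relative_gap:.1%}")
```

The two benchmarks use different copy loops, so the estimate is only expected to land in the same neighborhood as the measured STREAM score, not to match it exactly.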
Benchmarks | am335x-evm: perf | am57xx-evm: perf | am654x-evm: perf | k2g-evm: perf | omapl138-lcdk: perf |
---|---|---|---|---|---|
af_unix_sock_stream_latency (microsec) | 43.11 | 28.31 | 42.81 | 47.32 | 678.71 |
af_unix_socket_stream_bandwidth (MBs) | 173.98 | 2075.27 | 1175.86 | 656.56 | 30.64 |
bw_file_rd-io-1mb (MB/s) | 179.95 | 1503.19 | 1190.71 | 713.14 | 43.99 |
bw_file_rd-o2c-1mb (MB/s) | 154.06 | 1066.29 | 790.51 | 543.38 | 39.71 |
bw_mem-bcopy-16mb (MB/s) | 211.31 | 1950.27 | 816.62 | 1244.85 | 39.71 |
bw_mem-bcopy-1mb (MB/s) | 199.04 | 4934.52 | 1542.32 | 1350.57 | 96.69 |
bw_mem-bcopy-2mb (MB/s) | 200.68 | 2764.07 | 921.66 | 1253.92 | 96.92 |
bw_mem-bcopy-4mb (MB/s) | 207.23 | 2033.55 | 826.28 | 1247.86 | 98.63 |
bw_mem-bcopy-8mb (MB/s) | 218.14 | 1963.67 | 818.16 | 1246.88 | 96.51 |
bw_mem-bzero-16mb (MB/s) | 995.77 | 4986.75 | 1959.34 | 3401.00 | 192.97 |
bw_mem-bzero-1mb (MB/s) | 591.49 (min 199.04, max 983.93) | 5322.83 (min 4934.52, max 5711.13) | 2927.55 (min 1542.32, max 4312.77) | 2910.03 (min 1350.57, max 4469.49) | 143.77 (min 96.69, max 190.84) |
bw_mem-bzero-2mb (MB/s) | 598.10 (min 200.68, max 995.52) | 4128.79 (min 2764.07, max 5493.50) | 2571.75 (min 921.66, max 4221.84) | 2588.13 (min 1253.92, max 3922.34) | 144.65 (min 96.92, max 192.38) |
bw_mem-bzero-4mb (MB/s) | 595.68 (min 207.23, max 984.13) | 3667.10 (min 2033.55, max 5300.65) | 1619.78 (min 826.28, max 2413.27) | 2446.75 (min 1247.86, max 3645.64) | 145.78 (min 98.63, max 192.92) |
bw_mem-bzero-8mb (MB/s) | 603.14 (min 218.14, max 988.14) | 3541.02 (min 1963.67, max 5118.36) | 1407.96 (min 818.16, max 1997.75) | 2370.17 (min 1246.88, max 3493.45) | 144.83 (min 96.51, max 193.15) |
bw_mem-cp-16mb (MB/s) | 197.05 | 1044.66 | 482.38 | 605.26 | 189.47 |
bw_mem-cp-1mb (MB/s) | 593.40 (min 194.40, max 992.39) | 4957.56 (min 4215.54, max 5699.58) | 2638.47 (min 839.35, max 4437.58) | 2620.03 (min 607.63, max 4632.43) | 147.53 (min 105.59, max 189.47) |
bw_mem-cp-2mb (MB/s) | 592.35 (min 192.47, max 992.23) | 3418.66 (min 1219.33, max 5617.98) | 2399.73 (min 546.60, max 4252.85) | 2280.00 (min 604.59, max 3955.41) | 149.47 (min 106.72, max 192.22) |
bw_mem-cp-4mb (MB/s) | 600.71 (min 199.53, max 1001.88) | 3173.29 (min 1041.53, max 5305.04) | 1453.98 (min 491.40, max 2416.55) | 2125.09 (min 601.87, max 3648.30) | 149.78 (min 106.20, max 193.36) |
bw_mem-cp-8mb (MB/s) | 603.92 (min 202.80, max 1005.03) | 3053.62 (min 1018.98, max 5088.25) | 1238.98 (min 481.70, max 1996.26) | 2037.28 (min 610.36, max 3464.20) | 148.30 (min 103.48, max 193.12) |
bw_mem-fcp-16mb (MB/s) | 298.12 | 1077.08 | 808.24 | 610.31 | 190.84 |
bw_mem-fcp-1mb (MB/s) | 638.55 (min 293.17, max 983.93) | 4317.30 (min 2923.47, max 5711.13) | 2908.97 (min 1505.17, max 4312.77) | 2537.59 (min 605.69, max 4469.49) | 133.43 (min 76.02, max 190.84) |
bw_mem-fcp-2mb (MB/s) | 645.02 (min 294.51, max 995.52) | 3350.98 (min 1208.46, max 5493.50) | 2568.52 (min 915.19, max 4221.84) | 2264.80 (min 607.26, max 3922.34) | 134.84 (min 77.29, max 192.38) |
bw_mem-fcp-4mb (MB/s) | 639.20 (min 294.27, max 984.13) | 3189.92 (min 1079.19, max 5300.65) | 1617.78 (min 822.28, max 2413.27) | 2123.08 (min 600.51, max 3645.64) | 134.92 (min 76.92, max 192.92) |
bw_mem-fcp-8mb (MB/s) | 643.89 (min 299.63, max 988.14) | 3096.17 (min 1073.97, max 5118.36) | 1404.76 (min 811.77, max 1997.75) | 2052.80 (min 612.14, max 3493.45) | 134.11 (min 75.07, max 193.15) |
bw_mem-frd-16mb (MB/s) | 249.26 | 1054.23 | 1317.20 | 825.38 | 134.30 |
bw_mem-frd-1mb (MB/s) | 280.67 (min 268.17, max 293.17) | 3008.32 (min 2923.47, max 3093.16) | 1508.89 (min 1505.17, max 1512.61) | 770.66 (min 605.69, max 935.63) | 104.51 (min 76.02, max 133.00) |
bw_mem-frd-2mb (MB/s) | 273.47 (min 252.43, max 294.51) | 1743.18 (min 1208.46, max 2277.90) | 1231.14 (min 915.19, max 1547.09) | 720.76 (min 607.26, max 834.26) | 105.47 (min 77.29, max 133.65) |
bw_mem-frd-4mb (MB/s) | 271.78 (min 249.28, max 294.27) | 1144.19 (min 1079.19, max 1209.19) | 1117.48 (min 822.28, max 1412.68) | 713.69 (min 600.51, max 826.87) | 105.72 (min 76.92, max 134.52) |
bw_mem-frd-8mb (MB/s) | 274.42 (min 249.21, max 299.63) | 1070.39 (min 1066.81, max 1073.97) | 1066.39 (min 811.77, max 1321.00) | 719.08 (min 612.14, max 826.02) | 104.83 (min 75.07, max 134.58) |
bw_mem-fwr-16mb (MB/s) | 995.21 | 4972.80 | 1956.23 | 3376.24 | 193.23 |
bw_mem-fwr-1mb (MB/s) | 630.28 (min 268.17, max 992.39) | 4396.37 (min 3093.16, max 5699.58) | 2975.10 (min 1512.61, max 4437.58) | 2784.03 (min 935.63, max 4632.43) | 161.24 (min 133.00, max 189.47) |
bw_mem-fwr-2mb (MB/s) | 622.33 (min 252.43, max 992.23) | 3947.94 (min 2277.90, max 5617.98) | 2899.97 (min 1547.09, max 4252.85) | 2394.84 (min 834.26, max 3955.41) | 162.94 (min 133.65, max 192.22) |
bw_mem-fwr-4mb (MB/s) | 625.58 (min 249.28, max 1001.88) | 3257.12 (min 1209.19, max 5305.04) | 1914.62 (min 1412.68, max 2416.55) | 2237.59 (min 826.87, max 3648.30) | 163.94 (min 134.52, max 193.36) |
bw_mem-fwr-8mb (MB/s) | 627.12 (min 249.21, max 1005.03) | 3077.53 (min 1066.81, max 5088.25) | 1658.63 (min 1321.00, max 1996.26) | 2145.11 (min 826.02, max 3464.20) | 163.85 (min 134.58, max 193.12) |
bw_mem-rd-16mb (MB/s) | 252.03 | 3041.83 | 1290.22 | 2437.91 | 143.96 |
bw_mem-rd-1mb (MB/s) | 628.47 (min 273.00, max 983.93) | 12180.05 (min 10906.36, max 13453.74) | 2087.09 (min 2037.01, max 2137.17) | 1784.65 (min 869.72, max 2699.58) | 438.39 (min 142.57, max 734.21) |
bw_mem-rd-2mb (MB/s) | 619.92 (min 255.59, max 984.25) | 8883.72 (min 8807.20, max 8960.24) | 1794.40 (min 1748.86, max 1839.93) | 1584.61 (min 702.25, max 2466.96) | 440.33 (min 144.28, max 736.38) |
bw_mem-rd-4mb (MB/s) | 629.79 (min 251.76, max 1007.81) | 2480.41 (min 1610.74, max 3350.08) | 1231.95 (min 1057.92, max 1405.98) | 1572.43 (min 706.21, max 2438.65) | 438.37 (min 143.87, max 732.87) |
bw_mem-rd-8mb (MB/s) | 618.04 (min 251.59, max 984.49) | 2189.89 (min 1304.42, max 3075.35) | 1113.83 (min 932.94, max 1294.71) | 1564.93 (min 700.83, max 2429.03) | 441.79 (min 144.27, max 739.30) |
bw_mem-rdwr-16mb (MB/s) | 203.03 | 1186.68 | 886.52 | 668.59 | 117.15 |
bw_mem-rdwr-1mb (MB/s) | 200.55 (min 194.40, max 206.70) | 5900.68 (min 4215.54, max 7585.81) | 1365.93 (min 839.35, max 1892.51) | 708.35 (min 607.63, max 809.06) | 111.33 (min 105.59, max 117.07) |
bw_mem-rdwr-2mb (MB/s) | 197.81 (min 192.47, max 203.15) | 2686.15 (min 1219.33, max 4152.97) | 1122.34 (min 546.60, max 1698.08) | 638.54 (min 604.59, max 672.49) | 111.99 (min 106.72, max 117.26) |
bw_mem-rdwr-4mb (MB/s) | 201.34 (min 199.53, max 203.14) | 1208.05 (min 1041.53, max 1374.57) | 764.44 (min 491.40, max 1037.48) | 636.17 (min 601.87, max 670.47) | 112.10 (min 106.20, max 118.00) |
bw_mem-rdwr-8mb (MB/s) | 202.78 (min 202.75, max 202.80) | 1113.72 (min 1018.98, max 1208.46) | 690.49 (min 481.70, max 899.28) | 640.95 (min 610.36, max 671.54) | 110.44 (min 103.48, max 117.40) |
bw_mem-wr-16mb (MB/s) | 996.14 | 1278.67 | 921.29 | 698.60 | 740.19 |
bw_mem-wr-1mb (MB/s) | 595.32 (min 206.70, max 983.93) | 10519.78 (min 7585.81, max 13453.74) | 1964.76 (min 1892.51, max 2037.01) | 839.39 (min 809.06, max 869.72) | 425.64 (min 117.07, max 734.21) |
bw_mem-wr-2mb (MB/s) | 593.70 (min 203.15, max 984.25) | 6556.61 (min 4152.97, max 8960.24) | 1723.47 (min 1698.08, max 1748.86) | 687.37 (min 672.49, max 702.25) | 426.82 (min 117.26, max 736.38) |
bw_mem-wr-4mb (MB/s) | 605.48 (min 203.14, max 1007.81) | 1492.66 (min 1374.57, max 1610.74) | 1047.70 (min 1037.48, max 1057.92) | 688.34 (min 670.47, max 706.21) | 425.44 (min 118.00, max 732.87) |
bw_mem-wr-8mb (MB/s) | 593.62 (min 202.75, max 984.49) | 1256.44 (min 1208.46, max 1304.42) | 916.11 (min 899.28, max 932.94) | 686.19 (min 671.54, max 700.83) | 428.35 (min 117.40, max 739.30) |
bw_mmap_rd-mo-1mb (MB/s) | 263.05 | 4126.30 | 2080.08 | 1558.44 | 133.05 |
bw_mmap_rd-o2c-1mb (MB/s) | 174.28 | 1318.52 | 962.46 | 618.52 | 85.01 |
bw_pipe (MB/s) | 288.71 | 579.06 | 1012.37 | 417.20 | 28.06 |
bw_unix (MB/s) | 173.98 | 2075.27 | 1175.86 | 656.56 | 30.64 |
lat_connect (us) | 78.87 | 56.58 | 66.19 | 88.75 | 1079.50 |
lat_ctx-2-128k (us) | 27.34 | 3.37 | 9.19 | 6.12 | 178.77 |
lat_ctx-2-256k (us) | 4.00 | 4.00 | 10.87 | 4.00 | 155.50 |
lat_ctx-4-128k (us) | 57.19 | 5.59 | 12.29 | 3.49 | 199.85 |
lat_ctx-4-256k (us) | 0.00 | 0.00 | 17.86 | 0.00 | 155.58 |
lat_fs-0k (num_files) | 215.00 | 340.00 | 259.00 | 194.00 | 18.00 |
lat_fs-10k (num_files) | 75.00 | 138.00 | 76.00 | 88.00 | 8.00 |
lat_fs-1k (num_files) | 117.00 | 199.00 | 84.00 | 132.00 | 11.00 |
lat_fs-4k (num_files) | 117.00 | 200.00 | 76.00 | 119.00 | 11.00 |
lat_mem_rd-stride128-sz1000k (ns) | 222.64 | 12.82 | 29.73 | 124.53 | 231.42 |
lat_mem_rd-stride128-sz125k (ns) | 11.76 | 12.68 | 9.48 | 20.05 | 220.64 |
lat_mem_rd-stride128-sz250k (ns) | 49.41 | 12.82 | 10.14 | 20.16 | 219.59 |
lat_mem_rd-stride128-sz31k (ns) | 3.01 | 6.44 | 3.82 | 12.05 | 178.08 |
lat_mem_rd-stride128-sz50 (ns) | 3.01 | 2.67 | 3.77 | 4.01 | 5.07 |
lat_mem_rd-stride128-sz500k (ns) | 185.70 | 12.82 | 10.47 | 37.20 | 231.51 |
lat_mem_rd-stride128-sz62k (ns) | 9.14 | 12.68 | 8.03 | 20.06 | 221.60 |
lat_mmap-1m (us) | 63.00 | 52.00 | 23.00 | 77.00 | 686.00 |
lat_ops-double-add (ns) | 2.38 | 0.73 | 0.91 | 1.09 | 35.21 |
lat_ops-double-mul (ns) | 11.08 | 3.35 | 5.01 | 5.01 | 100.11 |
lat_ops-float-add (ns) | 2.28 | 0.73 | 0.91 | 1.09 | 21.51 |
lat_ops-float-mul (ns) | 10.06 | 3.34 | 5.02 | 5.01 | 66.18 |
lat_ops-int-add (ns) | 1.01 | 0.67 | 1.26 | 1.00 | 1.68 |
lat_ops-int-bit (ns) | 0.67 | 0.45 | 0.84 | 0.67 | 2.54 |
lat_ops-int-div (ns) | 58.24 | 58.50 | 7.53 | 87.73 | 190.70 |
lat_ops-int-mod (ns) | 23.50 | 10.27 | 7.94 | 15.40 | 88.39 |
lat_ops-int-mul (ns) | 6.07 | 2.10 | 3.80 | 3.14 | 6.60 |
lat_ops-int64-add (ns) | 1.20 | 0.74 | 1.25 | 1.10 | 5.29 |
lat_ops-int64-bit (ns) | 1.04 | 0.68 | 0.84 | 1.02 | 2.58 |
lat_ops-int64-div (ns) | 245.07 | 126.32 | 11.93 | 190.00 | 835.74 |
lat_ops-int64-mod (ns) | 71.19 | 22.71 | 9.19 | 34.12 | 253.38 |
lat_pagefault (us) | 1.69 | 1.01 | 3.19 | 1.73 | 8.51 |
lat_pipe (us) | 34.29 | 24.37 | 24.40 | 34.30 | 494.95 |
lat_proc-exec (us) | 2045.67 | 979.17 | 1379.75 | 1218.20 | 7480.00 |
lat_proc-fork (us) | 1644.50 | 918.83 | 1272.40 | 1176.80 | 6863.00 |
lat_proc-proccall (us) | 0.02 | 0.01 | 0.01 | 0.01 | 0.07 |
lat_select (us) | 45.65 | 28.83 | 59.07 | 49.01 | 237.82 |
lat_sem (us) | 4.87 | 2.48 | 4.72 | 9.25 | 144.85 |
lat_sig-catch (us) | 6.44 | 3.53 | 7.34 | 5.78 | 38.82 |
lat_sig-install (us) | 1.39 | 0.63 | 0.90 | 0.97 | 6.22 |
lat_sig-prot (us) | 0.46 | 0.39 | 0.65 | 0.24 | 0.74 |
lat_syscall-fstat (us) | 3.22 | 2.40 | 3.53 | 3.52 | 9.18 |
lat_syscall-null (us) | 0.56 | 0.31 | 0.55 | 0.46 | 2.12 |
lat_syscall-open (us) | 290.10 | 151.26 | 200.77 | 206.44 | 1544.75 |
lat_syscall-read (us) | 1.10 | 0.47 | 1.24 | 0.68 | 4.23 |
lat_syscall-stat (us) | 7.90 | 5.21 | 7.57 | 7.88 | 48.72 |
lat_syscall-write (us) | 0.74 | 0.38 | 0.83 | 0.57 | 3.92 |
lat_tcp (us) | 1.04 | 0.59 | 1.04 | 0.87 | 2.81 |
lat_unix (us) | 43.11 | 28.31 | 42.81 | 47.32 | 678.71 |
latency_for_0.50_mb_block_size (nanosec) | 185.70 | 12.82 | 10.47 | 37.20 | 231.51 |
latency_for_1.00_mb_block_size (nanosec) | 111.32 (min 0.00, max 222.64) | 6.41 (min 0.00, max 12.82) | 14.86 (min 0.00, max 29.73) | 62.26 (min 0.00, max 124.53) | 115.71 (min 0.00, max 231.42) |
pipe_bandwidth (MBs) | 288.71 | 579.06 | 1012.37 | 417.20 | 28.06 |
pipe_latency (microsec) | 34.29 | 24.37 | 24.40 | 34.30 | 494.95 |
procedure_call (microsec) | 0.02 | 0.01 | 0.01 | 0.01 | 0.07 |
select_on_200_tcp_fds (microsec) | 45.65 | 28.83 | 59.07 | 49.01 | 237.82 |
semaphore_latency (microsec) | 4.87 | 2.48 | 4.72 | 9.25 | 144.85 |
signal_handler_latency (microsec) | 1.39 | 0.63 | 0.90 | 0.97 | 6.22 |
signal_handler_overhead (microsec) | 6.44 | 3.53 | 7.34 | 5.78 | 38.82 |
tcp_ip_connection_cost_to_localhost (microsec) | 78.87 | 56.58 | 66.19 | 88.75 | 1079.50 |
tcp_latency_using_localhost (microsec) | 1.04 | 0.59 | 1.04 | 0.87 | 2.81 |
Table: LM Bench Metrics
2.2.1.1.2. Dhrystone¶
Dhrystone is a core-only benchmark that runs from warm L1 caches on all modern processors. It scales linearly with clock speed. For standard ARM cores, the DMIPS/MHz score will be identical given the same compiler and flags.
Benchmarks | am335x-evm: perf | am57xx-evm: perf | am654x-evm: perf | k2g-evm: perf | omapl138-lcdk: perf |
---|---|---|---|---|---|
cpu_clock (MHz) | 1000.00 | 1500.00 | 400.00 | 50.00 | 230.00 |
dhrystone_per_mhz (DMIPS/MHz) | 2.00 | 3.30 | 5.90 | 67.00 | 1.80 |
dhrystone_per_second (DhrystoneP) | 3571428.50 | 8695652.00 | 4166666.80 | 5882353.00 | 735294.10 |
Table: Dhrystone Benchmark
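The DMIPS/MHz row can be derived from the other two rows. By convention, 1 DMIPS corresponds to 1757 Dhrystones per second (the score of the VAX 11/780 reference machine). The sketch below applies that conversion to the am335x-evm and am57xx-evm columns only; the function name is illustrative, and it assumes the reported cpu_clock is the frequency the core actually ran at.

```python
# Dhrystones per second achieved by the VAX 11/780, defined as 1 DMIPS.
VAX_11_780_DHRYSTONES_PER_SEC = 1757


def dmips_per_mhz(dhrystones_per_sec: float, cpu_mhz: float) -> float:
    """Normalize a raw Dhrystone score to DMIPS per MHz of core clock."""
    dmips = dhrystones_per_sec / VAX_11_780_DHRYSTONES_PER_SEC
    return dmips / cpu_mhz


# (dhrystone_per_second, cpu_clock in MHz) from the table above.
am335x = dmips_per_mhz(3571428.50, 1000.00)
am57xx = dmips_per_mhz(8695652.00, 1500.00)

print(f"am335x-evm: {am335x:.2f} DMIPS/MHz")
print(f"am57xx-evm: {am57xx:.2f} DMIPS/MHz")
```

Rounded to one decimal place, both results agree with the dhrystone_per_mhz row for those two boards.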
2.2.1.1.3. Whetstone¶
Benchmarks | am335x-evm: perf | am57xx-evm: perf | am654x-evm: perf | k2g-evm: perf | omapl138-lcdk: perf |
---|---|---|---|---|---|
whetstone (MIPS) | 1000.00 | 3333.30 | 3333.30 | 2500.00 | 31.60 |
Table: Whetstone Benchmark
2.2.1.1.4. Linpack¶
Linpack measures peak double-precision (64-bit) floating-point performance in solving a dense linear system.
Benchmarks | am335x-evm: perf | am57xx-evm: perf | am654x-evm: perf | k2g-evm: perf | omapl138-lcdk: perf |
---|---|---|---|---|---|
linpack (Kflops) | 52763.00 | 948697.00 | 336535.00 | 626121.00 | 7465.00 |
Table: Linpack Benchmark
2.2.1.1.5. NBench¶
Benchmarks | am335x-evm: perf | am57xx-evm: perf | am654x-evm: perf | k2g-evm: perf | omapl138-lcdk: perf |
---|---|---|---|---|---|
assignment (Iterations) | 8.07 | 19.89 | 7.71 | 13.23 | 0.99 |
fourier (Iterations) | 2372.50 | 18969.00 | 8257.80 | 12539.00 | 121.72 |
fp_emulation (Iterations) | 51.80 | 92.60 | 61.53 | 61.65 | 19.80 |
huffman (Iterations) | 706.28 | 1355.30 | 673.88 | 903.17 | 220.32 |
idea (Iterations) | 1561.60 | 4297.20 | 1920.50 | 2862.80 | 451.90 |
lu_decomposition (Iterations) | 75.84 | 822.57 | 323.40 | 550.96 | 5.85 |
neural_net (Iterations) | 2.07 | 23.07 | 4.22 | 15.53 | 0.19 |
numeric_sort (Iterations) | 443.49 | 758.50 | 299.10 | 510.06 | 103.72 |
string_sort (Iterations) | 66.75 | 140.98 | 94.71 | 93.94 | 7.97 |
Table: NBench Benchmarks
2.2.1.1.6. Stream¶
STREAM is a microbenchmark for measuring data memory system performance without any data reuse. It is designed to miss in the caches and to exercise the data prefetcher and speculative accesses. It uses double-precision floating point (64-bit), but in most modern processors the memory access will be the bottleneck. The four individual scores are copy, scale (multiply by a constant), add (add two numbers), and triad (multiply-accumulate). For bandwidth, a byte read counts as one and a byte written counts as one, resulting in a score that is double the bandwidth LMBench will show.
Benchmarks | am335x-evm: perf | am57xx-evm: perf | am654x-evm: perf | k2g-evm: perf |
---|---|---|---|---|
add (MB/s) | 396.30 | 3815.80 | 1759.00 | 2431.40 |
copy (MB/s) | 441.80 | 3933.10 | 1944.20 | 2517.60 |
scale (MB/s) | 640.70 | 4371.20 | 1988.20 | 2412.60 |
triad (MB/s) | 425.40 | 3875.10 | 1652.10 | 2404.80 |
Table: Stream
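The byte accounting behind the four STREAM scores can be sketched as follows. Copy and scale each read one array and write one array, while add and triad read two arrays and write one; every element is an 8-byte double. The array length and elapsed time in the example are hypothetical values chosen for illustration, not measurements from this guide.

```python
BYTES_PER_DOUBLE = 8

# Number of arrays each kernel touches (reads plus writes).
ARRAYS_TOUCHED = {"copy": 2, "scale": 2, "add": 3, "triad": 3}


def stream_mb_per_s(kernel: str, n_elements: int, seconds: float) -> float:
    """Return bandwidth in MB/s (10**6 bytes/s) for one kernel pass.

    Every byte read and every byte written is counted once.
    """
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_DOUBLE * n_elements
    return bytes_moved / seconds / 1e6


# Hypothetical run: 10 million doubles per array, 0.04 s per pass.
n = 10_000_000
elapsed = 0.04
for kernel in ("copy", "scale", "add", "triad"):
    print(f"{kernel:5s} {stream_mb_per_s(kernel, n, elapsed):8.1f} MB/s")
```

For the same elapsed time, add and triad report 1.5 times the bandwidth of copy and scale because they move three arrays instead of two.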
2.2.1.1.7. CoreMarkPro¶
CoreMark®-Pro is a comprehensive, advanced processor benchmark that works with and enhances the market-proven industry-standard EEMBC CoreMark® benchmark. While CoreMark stresses the CPU pipeline, CoreMark-Pro tests the entire processor, adding comprehensive support for multicore technology, a combination of integer and floating-point workloads, and data sets for utilizing larger memory subsystems.
Table: CoreMarkPro
2.2.1.1.8. MultiBench¶
MultiBench is a suite of benchmarks that allows processor and system designers to analyze, test, and improve multicore processors. It uses three forms of concurrency:
- Data decomposition: multiple threads cooperating on achieving a unified goal and demonstrating a processor's support for fine-grain parallelism.
- Processing multiple data streams: uses common code running over multiple threads and demonstrating how well a processor scales over scalable data inputs.
- Multiple workload processing: shows the scalability of general-purpose processing, demonstrating concurrency over both code and data.
MultiBench combines a wide variety of application-specific workloads with the EEMBC Multi-Instance-Test Harness (MITH), which is compatible and portable with most multicore processors and operating systems. MITH uses a thread-based API (POSIX-compliant) to establish a common programming model that communicates with the benchmark through an abstraction layer and provides a flexible interface to allow a wide variety of thread-enabled workloads to be tested.
Table: Multibench
2.2.1.1.9. Spec2K6¶
CPU2006 is a set of benchmarks designed to test the CPU performance of a modern server computer system. It is split into two components: CINT2006 (SPECint) for integer testing and CFP2006 (SPECfp) for floating-point testing.
SPEC defines a base runtime for each of the 12 benchmark programs. For SPECint2006, that number ranges from 1000 to 3000 seconds. The timed test is run on the system under test, its run time is compared to the reference time, and a ratio is computed. That ratio becomes the SPECint score for that test. (This differs from the rating in SPECint2000, which multiplies the ratio by 100.)
As an example for SPECint2006, consider a processor which can run 400.perlbench in 2000 seconds. The time it takes the reference machine to run the benchmark is 9770 seconds. Thus the ratio is 4.885. Each ratio is computed, and then the geometric mean of those ratios is computed to produce an overall value.
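The arithmetic in this example can be written out as a short sketch. The 400.perlbench times are the ones quoted above; the other two ratios in the list are hypothetical placeholders used only to show how the geometric mean combines several per-benchmark ratios.

```python
import math


def spec_ratio(reference_seconds: float, measured_seconds: float) -> float:
    """Per-benchmark score: reference machine time over test system time."""
    return reference_seconds / measured_seconds


def overall_score(ratios: list[float]) -> float:
    """Geometric mean of the per-benchmark ratios."""
    return math.exp(sum(math.log(r) for r in ratios) / len(ratios))


# 400.perlbench: 9770 s on the reference machine, 2000 s on the test system.
perlbench = spec_ratio(9770, 2000)
print(f"400.perlbench ratio: {perlbench:.3f}")

# Hypothetical ratios for two other benchmarks, for illustration only.
ratios = [perlbench, 5.0, 4.0]
print(f"overall (geometric mean): {overall_score(ratios):.3f}")
```

A geometric mean is used rather than an arithmetic mean so that no single benchmark with a large ratio dominates the overall value.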
Table: Spec2K6
2.2.1.2. Boot-time Measurement¶
2.2.1.2.1. Boot media: MMCSD¶
Boot Configuration | am335x-evm: boot time (sec) | am57xx-evm: boot time (sec) | am654x-evm: boot time (sec) | k2g-evm: boot time (sec) | omapl138-lcdk: boot time (sec) |
---|---|---|---|---|---|
Kernel boot time test when bootloader, kernel and sdk-rootfs are in mmc-sd | 45.61 (min 45.41, max 45.76) | 26.74 (min 26.19, max 27.94) | 15.08 (min 14.38, max 15.84) | 34.52 (min 34.38, max 34.89) | 125.64 (min 125.15, max 126.21) |
Kernel boot time test when init is /bin/sh and bootloader, kernel and sdk-rootfs are in mmc-sd | 5.60 (min 5.57, max 5.62) | 6.90 (min 6.88, max 6.92) | 7.53 (min 7.49, max 7.54) | 9.18 (min 8.84, max 10.43) |
Table: Boot time MMC/SD
2.2.1.2.2. Boot media: NAND¶
Table: Boot time NAND
2.2.1.3. ALSA SoC Audio Driver¶
- Access type - RW_INTERLEAVED
- Channels - 2
- Format - S16_LE
- Period size - 64
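With these settings, the nominal stream rate is the sampling rate multiplied by the channel count and the bits per sample, which gives a quick sanity check on the throughput columns. A minimal sketch follows; the helper name and the format-to-width mapping are illustrative, and the measured values are copied from the am335x-evm capture column.

```python
# Bits per sample for the format used in these tests.
SAMPLE_BITS = {"S16_LE": 16}


def nominal_bits_per_sec(rate_hz: int, channels: int = 2, fmt: str = "S16_LE") -> int:
    """Nominal PCM data rate for an interleaved stream."""
    return rate_hz * channels * SAMPLE_BITS[fmt]


# Sampling rate (Hz) -> measured capture throughput (bits/sec), am335x-evm.
measured = {8000: 255998.00, 44100: 1411182.00, 96000: 3071942.00}

for rate, got in measured.items():
    expected = nominal_bits_per_sec(rate)
    deviation = abs(got - expected) / expected
    print(f"{rate:6d} Hz: nominal {expected} bits/sec, "
          f"measured {got:.0f}, deviation {deviation:.4%}")
```

A measured throughput that sits well away from the nominal rate suggests the stream actually ran at a different sampling rate than the one requested.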
Sampling Rate (Hz) | am335x-evm: Throughput (bits/sec) | am335x-evm: CPU Load (%) | am57xx-evm: Throughput (bits/sec) | am57xx-evm: CPU Load (%) | am654x-evm: Throughput (bits/sec) | am654x-evm: CPU Load (%) | k2g-evm: Throughput (bits/sec) | k2g-evm: CPU Load (%) | omapl138-lcdk: Throughput (bits/sec) | omapl138-lcdk: CPU Load (%) |
---|---|---|---|---|---|---|---|---|---|---|
8000 | 255998.00 | 3.00 | 255982.00 | 0.13 | 255995.00 | 0.26 | 352798.00 | 0.45 | 256004.00 | 2.54 |
11025 | 352797.00 | 3.66 | 352778.00 | 0.17 | 352793.00 | 0.28 | 352798.00 | 0.43 | 352808.00 | 3.25 |
16000 | 511995.00 | 8.65 | 511963.00 | 0.22 | 511990.00 | 0.41 | 352797.00 | 0.41 | 512017.00 | 5.12 |
22050 | 705593.00 | 7.74 | 705556.00 | 0.27 | 705586.00 | 0.43 | 705595.00 | 0.81 | 705623.00 | 5.76 |
24000 | 705593.00 | 7.72 | 705556.00 | 0.30 | 705587.00 | 0.41 | 705595.00 | 0.72 | 705632.00 | 5.84 |
32000 | 1023988.00 | 0.97 | 1023926.00 | 0.40 | 1023980.00 | 0.72 | 705595.00 | 0.75 | 1024067.00 | 8.33 |
44100 | 1411182.00 | 14.49 | 1411111.00 | 0.46 | 1411172.00 | 0.66 | 1411189.00 | 1.38 | 1411327.00 | 11.63 |
48000 | 1535979.00 | 4.93 | 1535888.00 | 0.59 | 1535969.00 | 1.02 | 1411189.00 | 1.33 | 1536154.00 | 12.88 |
88200 | 2822349.00 | 28.55 | 2822220.00 | 0.95 | 2822339.00 | 1.12 | 2822375.00 | 2.80 | 2729819.00 | 26.42 |
96000 | 3071942.00 | 31.13 | 3071774.00 | 1.05 | 3071934.00 | 1.12 | 2822375.00 | 2.56 | 2962574.00 | 27.33 |
Table: Audio Capture
Sampling Rate (Hz) | am335x-evm: Throughput (bits/sec) | am335x-evm: CPU Load (%) | am57xx-evm: Throughput (bits/sec) | am57xx-evm: CPU Load (%) | k2g-evm: Throughput (bits/sec) | k2g-evm: CPU Load (%) | omapl138-lcdk: Throughput (bits/sec) | omapl138-lcdk: CPU Load (%) |
---|---|---|---|---|---|---|---|---|
8000 | 256100.00 | 2.97 | 256081.00 | 0.10 | 352941.00 | 0.37 | 256109.00 | 2.73 |
11025 | 352938.00 | 3.73 | 352915.00 | 0.14 | 352941.00 | 0.45 | 352952.00 | 3.21 |
16000 | 512199.00 | 5.30 | 512162.00 | 0.17 | 352941.00 | 0.37 | 512225.00 | 6.06 |
22050 | 705874.00 | 7.52 | 705830.00 | 0.18 | 705882.00 | 0.61 | 705919.00 | 5.80 |
24000 | 705875.00 | 7.45 | 705830.00 | 0.22 | 705882.00 | 0.64 | 705918.00 | 5.81 |
32000 | 1024397.00 | 0.98 | 1024324.00 | 0.23 | 705882.00 | 0.75 | 1024481.00 | 8.42 |
44100 | 1411745.00 | 14.07 | 1411660.00 | 0.41 | 1411764.00 | 1.30 | 1411895.00 | 11.57 |
48000 | 1536593.00 | 12.59 | 1536486.00 | 0.40 | 1411764.00 | 1.20 | 1536769.00 | 12.05 |
88200 | 2823476.00 | 27.83 | 2823317.00 | 0.83 | 2823524.00 | 2.66 | 2731891.00 | 25.40 |
96000 | 3073192.00 | 4.38 | 3072969.00 | 0.96 | 2823525.00 | 2.66 | 2964839.00 | 27.60 |
Table: Audio Playback
2.2.1.4. Sensor Capture¶
Capture video frames (MMAP buffers) with v4l2-ctl and record the reported fps.
Resolution | Format | am57xx-evm: Fps | am57xx-evm: Sensor |
---|---|---|---|
1280x800 | nv12 | 30.03 | ov10635 |
1280x800 | rgb4 | 30.03 | ov10635 |
160x128 | nv12 | 40.81 (min 40.60, max 40.91) | mt9t111 |
160x128 | rgb4 | 40.81 (min 40.60, max 40.91) | mt9t111 |
2048x1536 | nv12 | 6.64 (min 6.10, max 6.70) | mt9t111 |
2048x1536 | rgb4 | 6.64 (min 6.10, max 6.70) | mt9t111 |
320x240 | nv12 | 30.03 | ov10635 |
320x240 | rgb4 | 30.03 | ov10635 |
Table: Sensor Capture
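The exact invocation used to produce this table is not given here. As a rough sketch, a capture run of this kind could be assembled from standard v4l2-ctl options as shown below. The device node `/dev/video1`, the frame count, and the helper name are assumptions made for illustration.

```python
import shlex


def v4l2_capture_cmd(device: str, width: int, height: int,
                     pixelformat: str, frames: int) -> list[str]:
    """Build a v4l2-ctl command line that streams frames via MMAP buffers."""
    return [
        "v4l2-ctl",
        f"--device={device}",
        f"--set-fmt-video=width={width},height={height},pixelformat={pixelformat}",
        "--stream-mmap",              # use memory-mapped capture buffers
        f"--stream-count={frames}",   # stop after this many frames
    ]


# Hypothetical device node and frame count; resolution and format from the table.
cmd = v4l2_capture_cmd("/dev/video1", 1280, 800, "NV12", 300)
print(shlex.join(cmd))
```

On the target, the list can be passed to `subprocess.run(cmd, check=True)`; v4l2-ctl prints the frame rate while streaming.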
2.2.1.5. Display Driver¶
Mode | am335x-evm: Fps | am57xx-evm: Fps | am654x-evm: Fps | k2g-evm: Fps |
---|---|---|---|---|
1280x800@60 | 59.99 (min 59.98, max 60.01) | |||
480x272@60 | 59.94 (min 59.93, max 59.95) | |||
800x480@60 | 59.52 (min 59.50, max 59.54) | |||
800x480@62 | 61.89 (min 61.89, max 61.90) |
Table: Display performance (LCD)
Table: Display performance (DVI)
Mode | am335x-evm: Fps | am57xx-evm: Fps | am654x-evm: Fps | k2g-evm: Fps |
---|---|---|---|---|
1024x576@60 | 59.97 (min 59.96, max 59.98) | 59.97 (min 59.95, max 59.99) | 60.01 (min 60.00, max 60.01) | |
1024x768@60 | 60.00 (min 59.99, max 60.02) | |||
1024x768@70 | 70.07 (min 70.04, max 70.11) | |||
1024x768@75 | 75.03 (min 74.97, max 75.08) | |||
1152x864@75 | 75.00 (min 74.97, max 75.02) | |||
1280x1024@60 | 60.02 (min 60.00, max 60.06) | |||
1280x1024@75 | 75.03 (min 75.00, max 75.05) | |||
1280x720@60 | 60.00 (min 59.97, max 60.03) | 60.00 (min 59.98, max 60.02) | 60.00 | |
1280x768@60 | 59.87 (min 59.85, max 59.90) | |||
1280x768@75 | 74.89 (min 74.80, max 74.99) | |||
1280x800@60 | 59.81 (min 59.79, max 59.83) | |||
1280x800@75 | 74.93 (min 74.91, max 74.99) | |||
1280x960@60 | 60.00 (min 59.98, max 60.02) | |||
1360x768@60 | 59.95 (min 59.93, max 59.98) | |||
1400x1050@60 | 59.98 (min 59.96, max 60.01) | |||
1400x1050@75 | 74.87 (min 74.75, max 74.96) | |||
1440x900@60 | 59.89 (min 59.86, max 59.91) | |||
1440x900@75 | 74.98 (min 74.96, max 75.01) | |||
1600x1200@60 | 60.00 (min 59.97, max 60.03) | |||
1600x1200@65 | 65.00 (min 64.98, max 65.02) | |||
1600x1200@70 | 70.00 (min 69.97, max 70.02) | |||
1600x900@60 | 60.00 (min 59.98, max 60.02) | |||
1680x1050@60 | 59.96 (min 59.93, max 59.98) | |||
1680x1050@75 | 74.89 (min 74.85, max 74.94) | |||
1680x945@60 | 60.02 (min 59.93, max 60.09) | |||
1920x1080@60 | 60.00 (min 59.97, max 60.03) | |||
2048x1152@60 | 60.00 (min 59.98, max 60.02) | |||
640x480@60 | 60.00 (min 59.99, max 60.01) | 60.00 (min 59.98, max 60.02) | ||
640x480@73 | 72.81 (min 72.79, max 72.82) | 72.81 (min 72.67, max 72.94) | ||
640x480@75 | 75.00 (min 74.99, max 75.01) | 75.00 (min 74.81, max 75.19) | ||
720x400@70 | 70.08 (min 70.07, max 70.09) | 70.08 (min 70.05, max 70.11) | ||
800x600@56 | 56.25 (min 56.24, max 56.26) | 56.25 (min 56.22, max 56.27) | ||
800x600@60 | 60.32 (min 60.31, max 60.32) | 60.32 (min 60.30, max 60.34) | ||
800x600@72 | 72.19 (min 72.17, max 72.20) | 72.19 (min 72.16, max 72.21) | 72.19 (min 72.18, max 72.19) | |
800x600@75 | 75.00 (min 74.99, max 75.01) | 75.00 (min 74.97, max 75.02) | 75.00 (min 74.99, max 75.01) | |
832x624@75 | 74.55 (min 74.51, max 74.59) | 74.55 (min 74.53, max 74.59) | 74.57 (min 74.56, max 74.58) | |
848x480@60 | 60.00 (min 59.99, max 60.01) | 60.00 (min 59.98, max 60.02) |
Table: Display performance (HDMI)
2.2.1.6. Graphics SGX/RGX Driver¶
2.2.1.6.1. GLBenchmark¶
Run GLBenchmark and capture the reported performance: display rate (fps), fill rate, vertex throughput, etc. All display outputs (HDMI, DisplayPort and/or LCD) are connected when running these tests.
2.2.1.6.1.1. Performance (Fps)¶
Benchmark | am335x-evm: Test Number | am335x-evm: Fps | am57xx-evm: Test Number | am57xx-evm: Fps |
---|---|---|---|---|
GLB25_EgyptTestC24Z16FixedTime test | 2500005.00 | 5.15 (min 2.43, max 13.93) | 2500005.00 | 37.99 (min 20.66, max 59.39) |
GLB25_EgyptTestC24Z16_ETC1 test | 2501001.00 | 6.27 (min 2.33, max 14.53) | 2501001.00 | 45.12 (min 20.66, max 60.12) |
GLB25_EgyptTestC24Z16_ETC1to565 test | 2501401.00 | 6.26 (min 2.33, max 14.36) | 2501401.00 | 45.14 (min 20.65, max 60.47) |
GLB25_EgyptTestC24Z16_PVRTC4 test | 2501101.00 | 6.09 (min 2.18, max 14.06) | 2501101.00 | 44.98 (min 20.61, max 60.34) |
GLB25_EgyptTestC24Z24MS4 test | 2500003.00 | 5.00 (min 0.55, max 11.79) | 2500003.00 | 43.68 (min 19.42, max 60.38) |
GLB25_EgyptTestStandard_inherited test | 2000000.00 | 23.77 (min 15.13, max 35.86) | 2000000.00 | 59.52 (min 59.26, max 59.80) |
Benchmark | am57xx-evm: Test Number | am57xx-evm: Fps |
---|---|---|
GLB25_EgyptTestC24Z16_ETC1_Offscreen test | 2501011.00 | 29.00 |
GLB25_EgyptTestStandardOffscreen_inherited test | 2000010.00 | 98.00 |
Table: GLBenchmark 2.5 Performance
2.2.1.6.1.2. Vertex Throughput¶
Benchmark | am335x-evm: Test Number | am335x-evm: Rate (triangles/sec) | am57xx-evm: Test Number | am57xx-evm: Rate (triangles/sec) | am654x-evm: Test Number | am654x-evm: Rate (triangles/sec) |
---|---|---|---|---|---|---|
GLB25_TriangleTexFragmentLitTestC24Z16 test | 2500511.00 | 2205212.00 | 2500511.00 | 24883830.00 | ||
GLB25_TriangleTexTestC24Z16 test | 2500301.00 | 11232073.00 | 2500301.00 | 105217912.00 | 2500301.00 | 16715124.00 |
GLB25_TriangleTexVertexLitTestC24Z16 test | 2500411.00 | 3745968.50 | 2500411.00 | 39240892.00 |
Table: GLBenchmark 2.5 Vertex Throughput
2.2.1.6.1.3. Pixel Throughput¶
Benchmark | am335x-evm: Test Number | am335x-evm: Rate (texel/sec) | am335x-evm: Fps | am57xx-evm: Test Number | am57xx-evm: Rate (texel/sec) | am57xx-evm: Fps | am654x-evm: Test Number | am654x-evm: Rate (texel/sec) | am654x-evm: Fps |
---|---|---|---|---|---|---|---|---|---|
GLB25_FillTestC24Z16 test | 2500101.00 | 105899912.00 | 4.32 (min 4.18, max 5.02) | 2500101.00 | 1440882048.00 | 58.54 (min 55.56, max 59.52) | 2500101.00 | 543738496.00 | 22.12 (min 21.29, max 22.62) |
Table: GLBenchmark 2.5 Pixel Throughput
2.2.1.6.2. GFXBench¶
Run GFXBench and capture the reported performance (score and display rate in fps). All display outputs (HDMI, DisplayPort and/or LCD) are connected when running these tests.
Table: GFXBench
2.2.1.6.3. Glmark2¶
Run Glmark2 and capture the reported performance (score). All display outputs (HDMI, DisplayPort and/or LCD) are connected when running these tests.
Table: Glmark2
2.2.1.7. Multimedia (Decode)¶
Run the following GStreamer pipeline and calculate performance based on the reported execution time:
gst-launch-1.0 playbin uri=file://<Path to stream> video-sink="kmssink sync=false connector=<connector id>" audio-sink=fakesink
All display outputs (HDMI and LCD) were connected when running these tests, but playout was forced to LCD via the connector=<connector id> option.
2.2.1.7.1. H264¶
Resolution | am57xx-evm: Fps | am57xx-evm: IVA Freq (MHz) | am57xx-evm: IPU Freq (MHz) |
---|---|---|---|
1080i | 30300.00 | 532.00 | |
1080p | 60.00 | 532.00 | |
720p | 59940.00 | 532.00 | |
720x480 | 24.17 | 532.00 | |
800x480 | 30.00 | 532.00 | |
CIF | 90000.00 | 532.00 |
Table: GStreamer H264 in AVI Container Decode Performance
2.2.1.7.2. MPEG4¶
Resolution | am57xx-evm: Fps | am57xx-evm: IVA Freq (MHz) | am57xx-evm: IPU Freq (MHz) |
---|---|---|---|
CIF | 30.00 | 532.00 | |
QVGA | 30.00 | 532.00 | |
VGA | 532.00 |
Table: GStreamer MPEG4 in 3GP Container Decode Performance
2.2.1.7.3. MPEG2¶
Resolution | am57xx-evm: Fps | am57xx-evm: IVA Freq (MHz) | am57xx-evm: IPU Freq (MHz) |
---|---|---|---|
1080p | 60.00 | 532.00 | |
720p | 29.97 | 532.00 |
Table: GStreamer MPEG2 in MP4 Container Decode Performance
2.2.1.8. Ethernet Driver¶
2.2.1.8.1. TCP Throughput¶
TCP Window Size (KBytes) | am335x-evm: Throughput (Mbits/sec) | am335x-evm: CPU Load | am57xx-evm: Throughput (Mbits/sec) | am57xx-evm: CPU Load | am654x-evm: Throughput (Mbits/sec) | am654x-evm: CPU Load | k2g-evm: Throughput (Mbits/sec) | k2g-evm: CPU Load | omapl138-lcdk: Throughput (Mbits/sec) | omapl138-lcdk: CPU Load |
---|---|---|---|---|---|---|---|---|---|---|
8 | 256.96 | 629.12 | 620.00 | 514.40 | 33.84 | |||||
16 | 289.60 | 772.00 | 839.20 | 641.60 | 43.76 | |||||
32 | 321.60 | 1024.00 | 1099.20 | 792.80 | 55.20 | |||||
64 | 366.00 | 1185.60 | 1573.60 | 812.00 | 62.80 | |||||
128 | 395.44 | 1136.00 | 1608.00 | 928.00 | 64.40 | |||||
256 | 388.32 | 1124.80 | 1569.60 | 920.00 | 63.60 |
Table: TCP Throughput
TCP Throughput Interrupt Pacing
TCP Window Size (KBytes) | am335x-evm: Throughput (Mbits/sec) | am335x-evm: CPU Load | am57xx-evm: Throughput (Mbits/sec) | am57xx-evm: CPU Load | omapl138-lcdk: Throughput (Mbits/sec) | omapl138-lcdk: CPU Load |
---|---|---|---|---|---|---|
8 | 246.32 | | 583.76 | | 33.60 | |
16 | 267.20 | | 632.00 | | 43.36 | |
32 | 319.20 | | 868.80 | | 54.64 | |
64 | 385.04 | | 1168.00 | | 62.40 | |
128 | 391.52 | | 1120.00 | | 68.16 | |
256 | 373.76 | | 1110.40 | | 67.68 | |
Table: TCP Throughput Interrupt Pacing
2.2.1.8.2. UDP Throughput¶
UDP Throughput Egress
UDP Packet Size (bytes) | am335x-evm: Throughput (Mbits/sec) | am335x-evm: CPU Load | am335x-evm: Packets Per Second (kpps) | am57xx-evm: Throughput (Mbits/sec) | am57xx-evm: CPU Load | am57xx-evm: Packets Per Second (kpps) | am654x-evm: Throughput (Mbits/sec) | am654x-evm: CPU Load | am654x-evm: Packets Per Second (kpps) | k2g-evm: Throughput (Mbits/sec) | k2g-evm: CPU Load | k2g-evm: Packets Per Second (kpps) | omapl138-lcdk: Throughput (Mbits/sec) | omapl138-lcdk: CPU Load | omapl138-lcdk: Packets Per Second (kpps) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
64 | 20.90 | 99.90 | 39.00 | 48.10 | 82.60 | 93.00 | 23.80 | 100.00 | 44.00 | 1.60 | 98.30 | 1.00 | |||
128 | 40.40 | 99.90 | 39.00 | 92.80 | 80.60 | 89.00 | 46.70 | 100.00 | 44.00 | 3.20 | 98.00 | 2.00 | |||
256 | 79.80 | 99.80 | 38.00 | 183.50 | 80.50 | 89.00 | 92.50 | 100.00 | 44.00 | 6.40 | 97.90 | 2.00 | |||
512 | 157.60 | 99.90 | 38.00 | 362.00 | 81.40 | 88.00 | 181.60 | 100.00 | 44.00 | 12.40 | 98.30 | 2.00 | |||
1024 | 305.00 | 99.90 | 37.00 | 678.00 | 80.50 | 82.00 | 477.00 | 100.00 | 58.00 | 358.00 | 100.00 | 43.00 | 24.50 | 98.20 | 2.00 |
1470 | 428.00 | 99.90 | 36.00 | 891.00 | 76.90 | 75.00 | 954.00 | 62.90 | 81.00 | 503.00 | 100.00 | 42.00 | 34.30 | 98.50 | 2.00 |
1500 | 316.00 | 100.00 | 26.00 | 580.00 | 75.70 | 48.00 | 780.00 | 66.50 | 65.00 | 388.00 | 100.00 | 32.00 | 27.60 | 98.20 | 2.00 |
4000 | 934.00 | 73.30 | 29.00 | 956.00 | 49.20 | 29.00 | |||||||||
8000 | 941.00 | 68.20 | 14.00 | 955.00 | 45.70 | 14.00 | 66.50 | 97.90 | 1.00 |
Table: UDP Throughput Egress
UDP Throughput Ingress
UDP Packet Size (bytes) | am335x-evm: Throughput (Mbits/sec) | am335x-evm: CPU Load | am335x-evm: Packets Per Second (kpps) | am57xx-evm: Throughput (Mbits/sec) | am57xx-evm: CPU Load | am57xx-evm: Packets Per Second (kpps) | am654x-evm: Throughput (Mbits/sec) | am654x-evm: CPU Load | am654x-evm: Packets Per Second (kpps) | k2g-evm: Throughput (Mbits/sec) | k2g-evm: CPU Load | k2g-evm: Packets Per Second (kpps) | omapl138-lcdk: Throughput (Mbits/sec) | omapl138-lcdk: CPU Load | omapl138-lcdk: Packets Per Second (kpps) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
64 | 21.60 | 98.60 | 41.00 | 57.00 | 88.20 | 111.00 | 51.60 | 46.60 | 99.00 | 27.00 | 100.00 | 52.00 | |||
128 | 30.00 | 99.60 | 29.00 | 116.50 | 90.10 | 113.00 | 51.10 | 35.00 | 49.00 | 70.70 | 99.80 | 68.00 | 8.60 | 80.90 | 7.00 |
256 | 54.90 | 96.80 | 26.00 | 72.20 | 73.90 | 35.00 | 94.80 | 39.80 | 45.00 | 141.40 | 99.80 | 68.00 | 11.60 | 80.50 | 5.00 |
512 | 113.80 | 99.40 | 27.00 | 464.00 | 91.00 | 113.00 | 213.00 | 38.80 | 52.00 | 308.00 | 97.30 | 75.00 | 13.10 | 81.10 | 3.00 |
1024 | 230.00 | 99.80 | 28.00 | 917.00 | 94.50 | 111.00 | 438.00 | 37.00 | 53.00 | 402.00 | 100.00 | 49.00 | 7.80 | 84.10 | 0.00 |
1470 | 340.00 | 100.00 | 28.00 | 785.00 | 69.60 | 66.00 | 957.00 | 42.60 | 81.00 | 459.00 | 97.80 | 39.00 | 46.80 | 82.50 | 3.00 |
1500 | 222.00 | 100.00 | 18.00 | 484.00 | 83.60 | 40.00 | 374.00 | 38.50 | 31.00 | 274.00 | 100.00 | 22.00 | 31.50 | 93.90 | 2.00 |
4000 | 954.00 | 82.80 | 29.00 | 746.00 | 37.60 | 23.00 | 63.20 | 91.90 | 1.00 | ||||||
8000 | 843.00 | 68.10 | 13.00 | 937.00 | 35.10 | 14.00 | 77.50 | 92.90 | 1.00 |
Table: UDP Throughput Ingress
- iperf version 2.0.5
- For receive performance, on DUT, invoke iperf in server mode.
iperf -s -u
- For transmit performance, on DUT, invoke iperf in client mode.
iperf -c <server ip> -b <bandwidth limit> -f M -t 60
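As a rough cross-check of the UDP tables above, the packets-per-second column should follow from the throughput and the packet size. The sketch below assumes that 1 Mbit is 1,000,000 bits and that the listed packet size is the number of bytes counted per packet; neither is stated in this guide, and the helper name `kpps_from_throughput` is hypothetical.

```python
def kpps_from_throughput(throughput_mbits: float, packet_size_bytes: int) -> float:
    """Estimate packets per second (in thousands) from throughput and packet size.

    Assumes 1 Mbit = 1,000,000 bits and that every packet carries exactly
    packet_size_bytes bytes (assumptions, not taken from the guide).
    """
    bits_per_packet = packet_size_bytes * 8
    packets_per_second = throughput_mbits * 1_000_000 / bits_per_packet
    return packets_per_second / 1000.0


# am335x-evm egress row for 1470-byte packets: 428.00 Mbits/sec
print(round(kpps_from_throughput(428.00, 1470), 2))
```

For that row the estimate comes out at roughly 36 kpps, in line with the 36.00 kpps listed in the egress table. Small differences on other rows are expected, since the exact accounting iperf uses is not described here.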
2.2.1.9. PCIe Driver¶
2.2.1.9.1. PCIe-ETH¶
TCP Window Size (KBytes) | am654x-evm: Bandwidth (Mbits/sec) | k2g-evm: Bandwidth (Mbits/sec) |
---|---|---|
8 | 638.32 | |
16 | 733.60 | |
32 | 971.20 | |
64 | 1193.60 | |
128 | 1016.80 | 1309.60 |
256 | 1120.80 | 1354.40 |
Table: PCI Ethernet
2.2.1.10. NAND Driver¶
2.2.1.10.1. AM335X-EVM¶
Buffer size (bytes) | am335x-evm: Write UBIFS Throughput (Mbytes/sec) | am335x-evm: Write UBIFS CPU Load (%) | am335x-evm: Read UBIFS Throughput (Mbytes/sec) | am335x-evm: Read UBIFS CPU Load (%) |
---|---|---|---|---|
102400 | 4.33 (min 4.30, max 4.38) | 100.00 | 7.33 | 100.00 |
262144 | 4.36 (min 4.33, max 4.38) | 100.00 | 7.37 | 100.00 |
524288 | 4.39 (min 4.35, max 4.42) | 99.99 (min 99.96, max 100.00) | 7.39 | 100.00 |
1048576 | 4.40 (min 4.38, max 4.42) | 100.00 | 7.41 | 100.00 |
5242880 | 4.43 (min 4.42, max 4.44) | 100.00 | 7.38 | 100.00 |
2.2.1.10.2. OMAPL138-LCDK¶
Buffer size (bytes) | omapl138-lcdk: Write UBIFS Throughput (Mbytes/sec) | omapl138-lcdk: Write UBIFS CPU Load (%) | omapl138-lcdk: Read UBIFS Throughput (Mbytes/sec) | omapl138-lcdk: Read UBIFS CPU Load (%) |
---|---|---|---|---|
102400 | 1.41 (min 1.40, max 1.42) | 100.00 | 2.01 | 100.00 |
262144 | 1.43 (min 1.42, max 1.47) | 100.00 | 2.01 | 100.00 |
524288 | 1.43 (min 1.42, max 1.47) | 100.00 | 2.01 | 100.00 |
1048576 | 1.43 (min 1.41, max 1.47) | 100.00 | 2.02 | 100.00 |
5242880 | 1.44 (min 1.42, max 1.46) | 100.00 | 2.00 | 100.00 |
2.2.1.11. QSPI Flash Driver¶
2.2.1.11.1. AM654x-EVM¶
2.2.1.11.1.1. UBIFS¶
Buffer size (bytes) | am654x-evm: Write UBIFS Throughput (Mbytes/sec) | am654x-evm: Write UBIFS CPU Load (%) | am654x-evm: Read UBIFS Throughput (Mbytes/sec) | am654x-evm: Read UBIFS CPU Load (%) |
---|---|---|---|---|
102400 | 0.64 (min 0.50, max 1.15) | 44.69 (min 26.95, max 50.51) | 45.56 | 16.67 |
262144 | 0.48 (min 0.36, max 0.54) | 42.43 (min 31.49, max 49.52) | 43.77 | 21.05 |
524288 | 0.48 (min 0.36, max 0.54) | 41.94 (min 41.17, max 43.42) | 45.17 | 25.00 |
1048576 | 0.48 (min 0.36, max 0.54) | 37.40 (min 30.22, max 49.39) | 40.14 | 27.27 |
2.2.1.11.1.2. RAW¶
File size (Mbytes) | am654x-evm: Raw Read Throughput (Mbytes/sec) |
---|---|
50 | 208.33 |
2.2.1.11.2. K2G-EVM¶
Buffer size (bytes) | k2g-evm: Write UBIFS Throughput (Mbytes/sec) | k2g-evm: Write UBIFS CPU Load (%) | k2g-evm: Read UBIFS Throughput (Mbytes/sec) | k2g-evm: Read UBIFS CPU Load (%) |
---|---|---|---|---|
102400 | 0.48 (min 0.32, max 0.83) | 100.00 | 13.60 | 7.69 |
262144 | 0.40 (min 0.28, max 0.58) | 100.00 | 13.50 | 0.00 |
524288 | 0.42 (min 0.26, max 0.54) | 100.00 | 13.46 | 14.29 |
1048576 | 0.40 (min 0.28, max 0.59) | 100.00 | 13.38 | 20.00 |
2.2.1.12. SPI Flash Driver¶
2.2.1.12.1. K2G-EVM¶
Buffer size (bytes) | k2g-evm: Write UBIFS Throughput (Mbytes/sec) | k2g-evm: Write UBIFS CPU Load (%) | k2g-evm: Read UBIFS Throughput (Mbytes/sec) | k2g-evm: Read UBIFS CPU Load (%) |
---|---|---|---|---|
102400 | 0.09 (min 0.08, max 0.13) | 62.36 (min 42.87, max 69.73) | 0.21 | 4.41 |
262144 | 0.08 (min 0.08, max 0.09) | 67.99 (min 65.36, max 70.17) | 0.21 | 1.78 |
524288 | 0.08 (min 0.08, max 0.09) | 67.79 (min 65.03, max 69.35) | 0.21 | 3.14 |
1048576 | 0.08 (min 0.08, max 0.09) | 68.50 (min 66.09, max 70.87) | 0.20 | 0.80 |
2.2.1.12.2. AM654X-EVM¶
Buffer size (bytes) | am654x-evm: Write UBIFS Throughput (Mbytes/sec) | am654x-evm: Write UBIFS CPU Load (%) | am654x-evm: Read UBIFS Throughput (Mbytes/sec) | am654x-evm: Read UBIFS CPU Load (%) |
---|---|---|---|---|
102400 | 0.33 (min 0.26, max 0.59) | 25.31 (min 21.84, max 26.70) | 2.66 | 3.80 |
262144 | 0.27 (min 0.24, max 0.29) | 26.45 (min 26.22, max 26.82) | 2.67 | 2.88 |
524288 | 0.27 (min 0.25, max 0.30) | 25.46 (min 24.79, max 25.89) | 2.67 | 3.49 |
1048576 | 0.28 (min 0.25, max 0.30) | 25.35 (min 24.15, max 26.22) | 2.67 | 2.56 |
2.2.1.13. EMMC Driver¶
Warning
IMPORTANT: The performance numbers can be severely affected if the media is mounted in sync mode. Hot-plug scripts in the filesystem mount removable media in sync mode to ensure data integrity. For performance-sensitive applications, unmount (umount) the auto-mounted filesystem and remount it in async mode.
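Whether a partition is currently mounted in sync mode can be checked from the mount table before running a benchmark. The sketch below assumes the Linux /proc/mounts layout (device, mount point, filesystem type, comma-separated options); the function name `sync_mounts` and the sample device paths are hypothetical.

```python
from typing import List


def sync_mounts(mounts_text: str) -> List[str]:
    """Return the mount points whose option list contains the 'sync' option."""
    found = []
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        options = fields[3].split(",")
        # Exact match, so that 'dirsync' or 'async' are not reported.
        if "sync" in options:
            found.append(fields[1])
    return found


# Hypothetical sample in /proc/mounts format.
sample = (
    "/dev/mmcblk0p1 /run/media/mmcblk0p1 vfat rw,sync,noatime 0 0\n"
    "/dev/root / ext4 rw,relatime 0 0\n"
)
print(sync_mounts(sample))
```

On a target, the contents of /proc/mounts can be passed to the function. Any mount point it reports can then be remounted, for example with `mount -o remount,async <mount point>`, assuming the filesystem accepts a remount with that option.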
2.2.1.13.1. AM57XX-EVM¶
Buffer size (bytes) | am57xx-evm: Write VFAT Throughput (Mbytes/sec) | am57xx-evm: Write VFAT CPU Load (%) | am57xx-evm: Read VFAT Throughput (Mbytes/sec) | am57xx-evm: Read VFAT CPU Load (%) |
---|---|---|---|---|
102400 | 12.31 (min 11.80, max 12.63) | 2.23 (min 1.58, max 4.57) | 65.53 | 9.35 |
262144 | 12.30 (min 11.18, max 12.63) | 2.15 (min 1.52, max 4.18) | 67.76 | 7.24 |
524288 | 12.28 (min 11.75, max 12.60) | 2.17 (min 1.43, max 4.66) | 72.83 | 8.04 |
1048576 | 12.32 (min 11.81, max 12.64) | 2.18 (min 1.38, max 4.47) | 74.82 | 8.24 |
5242880 | 12.29 (min 11.81, max 12.61) | 2.29 (min 1.52, max 4.57) | 75.06 | 8.30 |
Buffer size (bytes) | am57xx-evm: Write EXT2 Throughput (Mbytes/sec) | am57xx-evm: Write EXT2 CPU Load (%) | am57xx-evm: Read EXT2 Throughput (Mbytes/sec) | am57xx-evm: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 12.59 (min 12.31, max 12.68) | 1.89 (min 1.39, max 3.60) | 69.20 | 5.10 |
262144 | 12.56 (min 12.25, max 12.67) | 1.86 (min 1.03, max 3.87) | 72.14 | 8.22 |
524288 | 12.56 (min 12.29, max 12.65) | 1.95 (min 1.21, max 4.22) | 77.01 | 6.32 |
1048576 | 12.54 (min 12.25, max 12.63) | 1.91 (min 1.15, max 3.64) | 78.53 | 0.00 |
5242880 | 12.56 (min 12.25, max 12.67) | 1.88 (min 1.16, max 3.98) | 78.55 | 0.00 |
Buffer size (bytes) | am57xx-evm: Write EXT4 Throughput (Mbytes/sec) | am57xx-evm: Write EXT4 CPU Load (%) | am57xx-evm: Read EXT4 Throughput (Mbytes/sec) | am57xx-evm: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 12.24 (min 11.94, max 12.45) | 1.72 (min 1.26, max 2.86) | 69.99 | 7.38 |
262144 | 12.44 (min 11.70, max 12.68) | 1.90 (min 1.52, max 2.91) | 72.94 | 7.34 |
524288 | 12.54 (min 12.43, max 12.63) | 1.85 (min 1.46, max 2.92) | 78.04 | 6.44 |
1048576 | 12.57 (min 12.39, max 12.65) | 1.79 (min 1.33, max 2.91) | 79.66 | 7.95 |
5242880 | 12.53 (min 12.39, max 12.62) | 1.87 (min 1.51, max 2.80) | 79.91 | 8.33 |
2.2.1.13.2. AM654x-EVM¶
Buffer size (bytes) | am654x-evm: Write VFAT Throughput (Mbytes/sec) | am654x-evm: Write VFAT CPU Load (%) | am654x-evm: Read VFAT Throughput (Mbytes/sec) | am654x-evm: Read VFAT CPU Load (%) |
---|---|---|---|---|
102400 | 20.29 (min 19.40, max 20.65) | 1.95 (min 1.40, max 3.66) | 128.33 | 7.17 |
262144 | 20.29 (min 18.97, max 20.69) | 1.80 (min 1.38, max 3.26) | 134.43 | 6.84 |
524288 | 20.29 (min 19.01, max 20.77) | 1.86 (min 1.29, max 3.27) | 152.49 | 7.04 |
1048576 | 20.28 (min 19.07, max 20.69) | 1.85 (min 1.43, max 3.37) | 151.98 | 7.35 |
5242880 | 20.29 (min 18.89, max 20.74) | 1.77 (min 1.34, max 3.20) | 149.59 | 8.60 |
Buffer size (bytes) | am654x-evm: Write EXT2 Throughput (Mbytes/sec) | am654x-evm: Write EXT2 CPU Load (%) | am654x-evm: Read EXT2 Throughput (Mbytes/sec) | am654x-evm: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 21.07 (min 20.86, max 21.21) | 1.18 (min 0.91, max 1.77) | 130.67 | 5.75 |
262144 | 21.06 (min 20.61, max 21.81) | 1.13 (min 0.95, max 1.62) | 140.22 | 7.12 |
524288 | 21.34 (min 20.96, max 21.86) | 1.16 (min 1.00, max 1.62) | 161.05 | 7.36 |
1048576 | 20.92 (min 20.53, max 21.09) | 1.07 (min 0.91, max 1.47) | 168.52 | 7.69 |
5242880 | 21.14 (min 20.85, max 21.46) | 1.13 (min 0.95, max 1.59) | 168.33 | 8.54 |
Buffer size (bytes) | am654x-evm: Write EXT4 Throughput (Mbytes/sec) | am654x-evm: Write EXT4 CPU Load (%) | am654x-evm: Read EXT4 Throughput (Mbytes/sec) | am654x-evm: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 20.76 (min 20.52, max 21.35) | 1.22 (min 1.03, max 1.54) | 120.01 | 6.32 |
262144 | 20.66 (min 20.09, max 21.40) | 1.17 (min 1.03, max 1.53) | 134.06 | 7.07 |
524288 | 20.89 (min 20.71, max 21.01) | 1.16 (min 0.95, max 1.54) | 166.96 | 6.85 |
1048576 | 21.69 (min 21.40, max 21.79) | 1.26 (min 1.09, max 1.63) | 180.86 | 7.42 |
5242880 | 20.85 (min 20.59, max 20.93) | 1.17 (min 0.95, max 1.47) | 180.90 | 9.48 |
2.2.1.13.3. K2G-EVM¶
Buffer size (bytes) | k2g-evm: Write VFAT Throughput (Mbytes/sec) | k2g-evm: Write VFAT CPU Load (%) | k2g-evm: Read VFAT Throughput (Mbytes/sec) | k2g-evm: Read VFAT CPU Load (%) |
---|---|---|---|---|
102400 | 21.71 (min 19.28, max 22.68) | 12.70 (min 10.06, max 21.11) | 39.65 | 16.35 |
262144 | 21.75 (min 19.11, max 22.49) | 13.30 (min 10.99, max 21.25) | 40.36 | 16.22 |
524288 | 21.48 (min 19.17, max 22.16) | 12.57 (min 9.85, max 21.32) | 41.57 | 13.71 |
1048576 | 21.76 (min 19.16, max 22.65) | 12.86 (min 10.30, max 21.76) | 42.62 | 14.98 |
5242880 | 21.52 (min 19.13, max 22.15) | 13.02 (min 10.19, max 21.21) | 42.56 | 13.22 |
Buffer size (bytes) | k2g-evm: Write EXT2 Throughput (Mbytes/sec) | k2g-evm: Write EXT2 CPU Load (%) | k2g-evm: Read EXT2 Throughput (Mbytes/sec) | k2g-evm: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 22.23 (min 20.70, max 22.74) | 11.05 (min 8.42, max 21.18) | 40.31 | 14.17 |
262144 | 22.27 (min 20.89, max 22.73) | 11.84 (min 9.01, max 21.19) | 41.21 | 13.01 |
524288 | 22.43 (min 20.99, max 22.98) | 11.77 (min 9.01, max 20.68) | 42.69 | 14.86 |
1048576 | 22.36 (min 20.87, max 22.85) | 11.39 (min 8.66, max 21.07) | 43.05 | 14.29 |
5242880 | 22.32 (min 20.99, max 22.70) | 11.47 (min 8.70, max 21.00) | 43.01 | 12.86 |
Buffer size (bytes) | k2g-evm: Write EXT4 Throughput (Mbytes/sec) | k2g-evm: Write EXT4 CPU Load (%) | k2g-evm: Read EXT4 Throughput (Mbytes/sec) | k2g-evm: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 21.98 (min 21.13, max 22.51) | 10.76 (min 8.75, max 16.46) | 40.47 | 14.57 |
262144 | 22.06 (min 20.83, max 22.59) | 11.42 (min 9.94, max 16.60) | 40.11 | 13.57 |
524288 | 21.70 (min 20.77, max 22.11) | 10.87 (min 8.90, max 16.17) | 39.85 | 9.13 |
1048576 | 21.74 (min 20.84, max 22.13) | 10.90 (min 8.81, max 16.73) | 41.59 | 12.35 |
5242880 | 22.10 (min 21.51, max 22.43) | 11.18 (min 9.40, max 16.74) | 43.45 | 13.28 |
2.2.1.14. SATA Driver¶
AM57XX-EVM
Buffer size (bytes) | am57xx-evm: Write EXT2 Throughput (Mbytes/sec) | am57xx-evm: Write EXT2 CPU Load (%) | am57xx-evm: Read EXT2 Throughput (Mbytes/sec) | am57xx-evm: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 120.74 (min 106.80, max 127.84) | 11.13 (min 6.05, max 30.91) | 134.97 | 11.77 |
262144 | 122.70 (min 111.39, max 126.15) | 11.73 (min 5.90, max 33.02) | 133.25 | 11.54 |
524288 | 124.22 (min 120.07, max 126.26) | 11.40 (min 5.89, max 31.44) | 133.11 | 11.25 |
1048576 | 123.35 (min 118.07, max 125.98) | 11.29 (min 6.06, max 31.30) | 133.11 | 10.59 |
5242880 | 124.45 (min 118.82, max 127.27) | 11.49 (min 6.30, max 31.19) | 132.83 | 9.72 |
Buffer size (bytes) | am57xx-evm: Write EXT4 Throughput (Mbytes/sec) | am57xx-evm: Write EXT4 CPU Load (%) | am57xx-evm: Read EXT4 Throughput (Mbytes/sec) | am57xx-evm: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 123.56 (min 121.15, max 127.36) | 10.97 (min 7.01, max 25.50) | 130.66 | 11.39 |
262144 | 119.87 (min 117.97, max 122.95) | 10.80 (min 7.32, max 23.44) | 132.73 | 10.36 |
524288 | 121.84 (min 112.38, max 126.66) | 10.66 (min 6.42, max 25.36) | 132.10 | 10.85 |
1048576 | 124.30 (min 120.66, max 127.52) | 10.76 (min 6.86, max 24.81) | 133.37 | 11.32 |
5242880 | 119.94 (min 118.63, max 120.83) | 10.76 (min 7.01, max 24.90) | 129.19 | 10.41 |
- File size used: 1G
- SATA II hard disk used: Seagate ST3500514NS 500G
2.2.1.14.1. mSATA Driver¶
AM57XX-EVM
Buffer size (bytes) | am57xx-evm: Write EXT2 Throughput (Mbytes/sec) | am57xx-evm: Write EXT2 CPU Load (%) | am57xx-evm: Read EXT2 Throughput (Mbytes/sec) | am57xx-evm: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 64.43 (min 63.67, max 65.01) | 4.62 (min 2.68, max 11.78) | 225.91 | 19.34 |
262144 | 64.27 (min 63.24, max 64.77) | 4.54 (min 2.78, max 10.79) | 234.83 | 20.12 |
524288 | 64.24 (min 63.88, max 64.58) | 3.53 (min 2.69, max 6.02) | 248.93 | 20.05 |
1048576 | 64.44 (min 63.93, max 65.00) | 3.42 (min 2.69, max 5.50) | 257.56 | 20.28 |
5242880 | 64.27 (min 63.93, max 64.56) | 3.43 (min 2.84, max 5.39) | 258.07 | 18.37 |
Buffer size (bytes) | am57xx-evm: Write EXT4 Throughput (Mbytes/sec) | am57xx-evm: Write EXT4 CPU Load (%) | am57xx-evm: Read EXT4 Throughput (Mbytes/sec) | am57xx-evm: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 64.47 (min 64.18, max 64.82) | 4.05 (min 2.94, max 6.38) | 230.92 | 20.97 |
262144 | 64.25 (min 63.78, max 64.52) | 4.25 (min 3.44, max 6.95) | 237.18 | 19.01 |
524288 | 64.54 (min 63.91, max 65.36) | 4.38 (min 3.40, max 7.29) | 254.33 | 20.05 |
1048576 | 64.16 (min 63.43, max 64.59) | 4.14 (min 3.31, max 6.78) | 264.97 | 23.74 |
5242880 | 65.36 (min 63.87, max 68.40) | 4.49 (min 3.45, max 7.39) | 271.22 | 21.48 |
- File size used: 1G
- mSATA drive used: SMS200S3/30G Kingston mSATA SSD
2.2.1.15. MMC/SD Driver¶
Warning
IMPORTANT: The performance numbers can be severely affected if the media is mounted in sync mode. Hot-plug scripts in the filesystem mount removable media in sync mode to ensure data integrity. For performance-sensitive applications, unmount (umount) the auto-mounted filesystem and remount it in async mode.
2.2.1.15.1. AM335x-EVM¶
Buffer size (bytes) | am335x-evm: Write VFAT Throughput (Mbytes/sec) | am335x-evm: Write VFAT CPU Load (%) | am335x-evm: Read VFAT Throughput (Mbytes/sec) | am335x-evm: Read VFAT CPU Load (%) |
---|---|---|---|---|
102400 | 9.33 (min 8.78, max 9.69) | 9.83 (min 8.40, max 14.06) | 20.39 | 23.83 |
262144 | 9.25 (min 8.64, max 9.76) | 9.57 (min 7.87, max 13.92) | 20.63 | 21.82 |
524288 | 8.48 (min 7.25, max 9.66) | 8.82 (min 6.77, max 12.02) | 21.13 | 20.61 |
1048576 | 8.67 (min 6.66, max 9.78) | 8.91 (min 7.50, max 10.90) | 21.24 | 18.48 |
5242880 | 8.74 (min 7.67, max 9.87) | 8.95 (min 7.40, max 12.07) | 21.31 | 18.89 |
Buffer size (bytes) | am335x-evm: Write EXT2 Throughput (Mbytes/sec) | am335x-evm: Write EXT2 CPU Load (%) | am335x-evm: Read EXT2 Throughput (Mbytes/sec) | am335x-evm: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 8.60 (min 3.82, max 10.07) | 7.18 (min 5.88, max 8.06) | 21.16 | 23.27 |
262144 | 9.55 (min 7.84, max 10.16) | 8.30 (min 7.32, max 10.93) | 21.57 | 23.42 |
524288 | 9.66 (min 8.24, max 10.22) | 8.60 (min 7.49, max 12.12) | 22.18 | 17.57 |
1048576 | 10.11 (min 9.62, max 10.28) | 9.03 (min 7.17, max 14.38) | 22.44 | 18.26 |
5242880 | 10.15 (min 9.78, max 10.29) | 8.76 (min 7.05, max 13.56) | 22.53 | 14.98 |
Buffer size (bytes) | am335x-evm: Write EXT4 Throughput (Mbytes/sec) | am335x-evm: Write EXT4 CPU Load (%) | am335x-evm: Read EXT4 Throughput (Mbytes/sec) | am335x-evm: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 9.71 (min 9.26, max 10.59) | 8.69 (min 7.49, max 12.70) | 17.93 | 19.96 |
262144 | 9.80 (min 9.11, max 10.30) | 8.52 (min 7.60, max 11.09) | 21.70 | 23.08 |
524288 | 9.55 (min 9.10, max 9.76) | 8.54 (min 7.53, max 11.23) | 22.33 | 17.65 |
1048576 | 9.84 (min 9.62, max 9.91) | 8.69 (min 7.73, max 11.78) | 22.65 | 16.48 |
5242880 | 9.71 (min 9.05, max 9.90) | 8.13 (min 7.35, max 10.43) | 22.80 | 16.63 |
The performance numbers were captured using the following:
- SanDisk 8GB MicroSDHC Class 10 Memory Card
- Partition was mounted with async option
2.2.1.15.2. AM57XX-EVM¶
Buffer size (bytes) | am57xx-evm: Write VFAT Throughput (Mbytes/sec) | am57xx-evm: Write VFAT CPU Load (%) | am57xx-evm: Read VFAT Throughput (Mbytes/sec) | am57xx-evm: Read VFAT CPU Load (%) |
---|---|---|---|---|
102400 | 7.37 (min 6.11, max 8.71) | 1.51 (min 0.94, max 3.15) | 21.26 | 2.74 |
262144 | 6.08 (min 5.70, max 6.53) | 1.19 (min 0.82, max 2.40) | 21.48 | 1.87 |
524288 | 6.02 (min 5.40, max 6.82) | 1.08 (min 0.76, max 2.09) | 22.12 | 2.74 |
1048576 | 8.05 (min 7.42, max 8.74) | 1.45 (min 0.89, max 2.92) | 22.11 | 2.22 |
5242880 | 8.05 (min 7.08, max 8.62) | 1.52 (min 1.12, max 2.81) | 22.10 | 2.43 |
Buffer size (bytes) | am57xx-evm: Write EXT2 Throughput (Mbytes/sec) | am57xx-evm: Write EXT2 CPU Load (%) | am57xx-evm: Read EXT2 Throughput (Mbytes/sec) | am57xx-evm: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 9.24 (min 7.75, max 10.05) | 1.33 (min 0.84, max 2.48) | 21.61 | 1.87 |
262144 | 10.45 (min 10.24, max 10.61) | 1.68 (min 1.05, max 3.23) | 21.99 | 1.69 |
524288 | 10.42 (min 9.91, max 10.58) | 1.59 (min 1.22, max 2.90) | 22.55 | 2.05 |
1048576 | 10.50 (min 10.15, max 10.62) | 1.66 (min 1.06, max 3.11) | 22.74 | 1.96 |
5242880 | 10.50 (min 10.26, max 10.60) | 1.63 (min 1.11, max 3.19) | 22.74 | 2.70 |
Buffer size (bytes) | am57xx-evm: Write EXT4 Throughput (Mbytes/sec) | am57xx-evm: Write EXT4 CPU Load (%) | am57xx-evm: Read EXT4 Throughput (Mbytes/sec) | am57xx-evm: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 9.68 (min 8.49, max 10.52) | 1.48 (min 1.06, max 2.32) | 21.69 | 1.57 |
262144 | 9.88 (min 9.65, max 10.25) | 1.48 (min 1.07, max 2.57) | 22.04 | 2.01 |
524288 | 9.62 (min 9.02, max 9.80) | 1.36 (min 1.13, max 1.95) | 22.59 | 2.16 |
1048576 | 9.63 (min 9.14, max 10.28) | 1.42 (min 1.07, max 2.34) | 22.83 | 2.07 |
5242880 | 8.37 (min 6.58, max 9.57) | 1.33 (min 0.97, max 2.38) | 22.83 | 1.86 |
The performance numbers were captured using the following:
- SanDisk 8GB MicroSDHC Class 10 Memory Card
- Partition was mounted with async option
The performance numbers were captured using the following:
- SanDisk 8GB SDHC UHS Memory Card
- Partition was mounted with async option
2.2.1.15.3. AM654x-EVM¶
Buffer size (bytes) | am654x-evm: Write VFAT Throughput (Mbytes/sec) | am654x-evm: Write VFAT CPU Load (%) | am654x-evm: Read VFAT Throughput (Mbytes/sec) | am654x-evm: Read VFAT CPU Load (%) |
---|---|---|---|---|
102400 | 16.56 (min 15.29, max 17.07) | 1.69 (min 1.20, max 3.36) | 22.15 | 1.43 |
262144 | 16.57 (min 15.47, max 16.94) | 1.57 (min 1.12, max 3.10) | 22.35 | 1.44 |
524288 | 15.94 (min 13.81, max 16.85) | 1.50 (min 0.96, max 2.98) | 22.73 | 1.14 |
1048576 | 15.85 (min 13.78, max 16.97) | 1.47 (min 0.92, max 2.89) | 22.61 | 1.08 |
5242880 | 15.84 (min 13.59, max 17.14) | 1.51 (min 0.94, max 3.01) | 22.57 | 1.24 |
Buffer size (bytes) | am654x-evm: Write EXT2 Throughput (Mbytes/sec) | am654x-evm: Write EXT2 CPU Load (%) | am654x-evm: Read EXT2 Throughput (Mbytes/sec) | am654x-evm: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 16.78 (min 16.60, max 16.87) | 0.98 (min 0.80, max 1.46) | 22.58 | 1.35 |
262144 | 16.17 (min 13.83, max 16.94) | 0.91 (min 0.69, max 1.37) | 22.96 | 1.26 |
524288 | 15.48 (min 13.78, max 16.85) | 0.87 (min 0.66, max 1.28) | 23.44 | 0.95 |
1048576 | 16.61 (min 16.12, max 16.80) | 0.88 (min 0.72, max 1.35) | 23.57 | 1.24 |
5242880 | 16.69 (min 16.29, max 16.87) | 0.94 (min 0.80, max 1.36) | 23.15 | 1.22 |
Buffer size (bytes) | am654x-evm: Write EXT4 Throughput (Mbytes/sec) | am654x-evm: Write EXT4 CPU Load (%) | am654x-evm: Read EXT4 Throughput (Mbytes/sec) | am654x-evm: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 16.84 (min 16.66, max 17.08) | 1.03 (min 0.93, max 1.35) | 22.68 | 1.14 |
262144 | 16.34 (min 13.85, max 17.12) | 0.96 (min 0.79, max 1.24) | 23.06 | 1.27 |
524288 | 17.08 (min 16.85, max 17.41) | 1.00 (min 0.90, max 1.28) | 23.58 | 1.40 |
1048576 | 17.00 (min 16.80, max 17.10) | 0.99 (min 0.90, max 1.20) | 23.79 | 0.11 |
5242880 | 17.01 (min 16.88, max 17.14) | 0.97 (min 0.85, max 1.25) | 23.77 | 1.08 |
The performance numbers were captured using the following:
- SanDisk 8GB SDHC UHS Memory Card
- Partition was mounted with async option
The performance numbers were captured using the following:
- SanDisk 8GB MicroSDHC Class 10 Memory Card
- Partition was mounted with async option
2.2.1.15.4. K2G-EVM¶
Buffer size (bytes) | k2g-evm: Write VFAT Throughput (Mbytes/sec) | k2g-evm: Write VFAT CPU Load (%) | k2g-evm: Read VFAT Throughput (Mbytes/sec) | k2g-evm: Read VFAT CPU Load (%) |
---|---|---|---|---|
102400 | 9.29 (min 8.24, max 10.33) | 5.68 (min 3.64, max 10.58) | 21.04 | 8.37 |
262144 | 9.57 (min 8.93, max 9.79) | 6.08 (min 4.23, max 11.22) | 21.47 | 9.24 |
524288 | 9.66 (min 8.67, max 9.93) | 6.20 (min 4.75, max 10.31) | 22.08 | 6.87 |
1048576 | 9.64 (min 8.70, max 9.93) | 6.25 (min 4.89, max 10.57) | 22.36 | 8.14 |
5242880 | 9.42 (min 7.95, max 9.96) | 5.90 (min 4.29, max 9.52) | 22.28 | 7.51 |
Buffer size (bytes) | k2g-evm: Write EXT2 Throughput (Mbytes/sec) | k2g-evm: Write EXT2 CPU Load (%) | k2g-evm: Read EXT2 Throughput (Mbytes/sec) | k2g-evm: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 8.89 (min 8.10, max 9.24) | 4.70 (min 3.43, max 8.26) | 21.69 | 8.51 |
262144 | 9.95 (min 9.59, max 10.16) | 4.93 (min 3.42, max 9.77) | 22.02 | 8.82 |
524288 | 9.61 (min 9.12, max 9.93) | 4.91 (min 3.59, max 9.38) | 22.55 | 7.13 |
1048576 | 7.79 (min 6.79, max 8.84) | 3.81 (min 2.57, max 8.23) | 22.50 | 6.91 |
5242880 | 9.51 (min 9.17, max 10.06) | 4.88 (min 3.61, max 8.94) | 22.72 | 7.39 |
Buffer size (bytes) | k2g-evm: Write EXT4 Throughput (Mbytes/sec) | k2g-evm: Write EXT4 CPU Load (%) | k2g-evm: Read EXT4 Throughput (Mbytes/sec) | k2g-evm: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 9.06 (min 7.91, max 10.38) | 4.55 (min 3.45, max 7.97) | 21.76 | 7.56 |
262144 | 9.62 (min 9.40, max 10.03) | 4.88 (min 4.07, max 7.39) | 22.09 | 6.85 |
524288 | 9.58 (min 8.98, max 10.06) | 4.92 (min 3.96, max 7.13) | 22.64 | 6.74 |
1048576 | 9.47 (min 8.95, max 9.62) | 4.88 (min 4.21, max 7.18) | 22.83 | 7.17 |
5242880 | 8.26 (min 7.41, max 8.95) | 4.38 (min 3.26, max 7.03) | 22.80 | 6.77 |
The performance numbers were captured using the following:
- SanDisk 8GB MicroSDHC Class 10 Memory Card
- Partition was mounted with async option
2.2.1.15.5. OMAPL138-LCDK¶
Buffer size (bytes) | omapl138-lcdk: Write VFAT Throughput (Mbytes/sec) | omapl138-lcdk: Write VFAT CPU Load (%) | omapl138-lcdk: Read VFAT Throughput (Mbytes/sec) | omapl138-lcdk: Read VFAT CPU Load (%) |
---|---|---|---|---|
102400 | 6.89 (min 4.34, max 7.86) | 87.64 (min 60.78, max 97.36) | 14.79 | 69.44 |
262144 | 7.50 (min 4.59, max 8.48) | 80.22 (min 52.84, max 89.80) | 14.67 | 64.08 |
524288 | 7.46 (min 4.62, max 8.42) | 80.12 (min 52.40, max 90.86) | 15.00 | 61.58 |
1048576 | 7.20 (min 4.45, max 8.32) | 79.24 (min 63.67, max 89.67) | 15.17 | 62.39 |
5242880 | 7.34 (min 4.65, max 8.47) | 78.78 (min 54.62, max 88.22) | 13.06 | 57.47 |
Buffer size (bytes) | omapl138-lcdk: Write EXT2 Throughput (Mbytes/sec) | omapl138-lcdk: Write EXT2 CPU Load (%) | omapl138-lcdk: Read EXT2 Throughput (Mbytes/sec) | omapl138-lcdk: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 6.10 (min 3.55, max 7.34) | 60.52 (min 51.91, max 66.96) | 15.90 | 61.53 |
262144 | 7.82 (min 7.47, max 8.04) | 59.76 (min 54.70, max 75.64) | 15.68 | 59.47 |
524288 | 7.63 (min 6.60, max 8.19) | 61.84 (min 51.12, max 73.80) | 15.99 | 60.19 |
1048576 | 7.65 (min 6.98, max 8.16) | 66.09 (min 60.16, max 79.19) | 16.28 | 58.19 |
5242880 | 7.92 (min 7.54, max 8.35) | 60.72 (min 52.82, max 70.53) | 15.75 | 60.03 |
Buffer size (bytes) | omapl138-lcdk: Write EXT4 Throughput (Mbytes/sec) | omapl138-lcdk: Write EXT4 CPU Load (%) | omapl138-lcdk: Read EXT4 Throughput (Mbytes/sec) | omapl138-lcdk: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 6.70 (min 5.69, max 7.08) | 83.88 (min 80.33, max 90.82) | 15.85 | 60.60 |
262144 | 8.30 (min 7.56, max 8.62) | 80.31 (min 75.08, max 90.97) | 15.83 | 58.45 |
524288 | 8.28 (min 7.02, max 8.71) | 83.11 (min 81.10, max 86.37) | 16.24 | 57.58 |
1048576 | 8.06 (min 7.16, max 8.69) | 79.22 (min 71.51, max 87.91) | 16.32 | 57.23 |
5242880 | 7.83 (min 7.06, max 8.74) | 80.06 (min 67.43, max 93.41) | 16.20 | 58.43 |
The performance numbers were captured using the following:
- SanDisk 8GB MicroSDHC Class 10 Memory Card
- Partition was mounted with async option
2.2.1.20. USB Driver¶
2.2.1.20.1. MUSB/XHCI Host controller¶
Warning
IMPORTANT: For mass-storage applications, the performance numbers can be severely affected if the media is mounted in sync mode. Hot-plug scripts in the filesystem mount removable media in sync mode to ensure data integrity. For performance-sensitive applications, unmount (umount) the auto-mounted filesystem and remount it in async mode.
Setup: An Inateck ASM1153E USB hard disk is connected to the usb0 port. File read/write performance data is captured on the usb0 port.
Table: USB Host VFAT
Buffer size (bytes) | am335x-evm: Write VFAT Throughput (Mbytes/sec) | am335x-evm: Write VFAT CPU Load (%) | am335x-evm: Read VFAT Throughput (Mbytes/sec) | am335x-evm: Read VFAT CPU Load (%) | am57xx-evm: Write VFAT Throughput (Mbytes/sec) | am57xx-evm: Write VFAT CPU Load (%) | am57xx-evm: Read VFAT Throughput (Mbytes/sec) | am57xx-evm: Read VFAT CPU Load (%) | am654x-evm: Write VFAT Throughput (Mbytes/sec) | am654x-evm: Write VFAT CPU Load (%) | am654x-evm: Read VFAT Throughput (Mbytes/sec) | am654x-evm: Read VFAT CPU Load (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
102400 | 19.91 (min 18.72, max 20.32) | 27.92 (min 24.90, max 37.32) | 18.19 | 32.75 | 264.27 (min 137.71, max 297.79) | 55.65 (min 50.34, max 58.21) | 341.30 | 37.04 | 37.19 (min 32.84, max 38.37) | 4.61 (min 3.77, max 6.93) | 38.64 | 4.07 |
262144 | 19.34 (min 17.97, max 19.85) | 28.14 (min 25.64, max 35.75) | 18.21 | 32.46 | 263.30 (min 136.60, max 295.77) | 54.46 (min 48.67, max 56.52) | 325.27 | 32.76 | 37.59 (min 33.66, max 38.60) | 4.24 (min 3.52, max 5.93) | 38.83 | 2.84 |
Table: USB Host EXT2
Buffer size (bytes) | am335x-evm: Write EXT2 Throughput (Mbytes/sec) | am335x-evm: Write EXT2 CPU Load (%) | am335x-evm: Read EXT2 Throughput (Mbytes/sec) | am335x-evm: Read EXT2 CPU Load (%) | am57xx-evm: Write EXT2 Throughput (Mbytes/sec) | am57xx-evm: Write EXT2 CPU Load (%) | am57xx-evm: Read EXT2 Throughput (Mbytes/sec) | am57xx-evm: Read EXT2 CPU Load (%) | am654x-evm: Write EXT2 Throughput (Mbytes/sec) | am654x-evm: Write EXT2 CPU Load (%) | am654x-evm: Read EXT2 Throughput (Mbytes/sec) | am654x-evm: Read EXT2 CPU Load (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
102400 | 20.25 (min 18.85, max 20.64) | 25.64 (min 22.93, max 34.08) | 18.18 | 32.46 | 284.87 (min 153.04, max 318.47) | 51.47 (min 50.00, max 53.85) | 337.42 | 32.08 | 38.85 (min 37.31, max 39.26) | 3.32 (min 2.84, max 4.63) | 38.33 | 4.41 |
1048576 | 19.81 (min 18.37, max 20.30) | 25.41 (min 23.33, max 32.97) | 18.17 | 34.74 | 285.20 (min 151.71, max 319.04) | 50.71 (min 49.18, max 53.23) | 368.98 | 32.00 | 38.95 (min 37.43, max 39.42) | 3.34 (min 2.84, max 4.05) | 38.19 | 3.86 |
5242880 | 19.84 (min 18.34, max 20.35) | 25.40 (min 22.11, max 33.88) | 18.29 | 32.26 | 284.92 (min 152.66, max 319.09) | 51.58 (min 45.61, max 56.06) | 363.15 | 37.04 | 39.00 (min 37.35, max 39.54) | 3.30 (min 2.94, max 4.30) | 38.24 | 5.12 |
Table: USB Device NCM iperf TCP Throughput
Window Size (kbytes) | am335x-evm: TX Throughput (Mbits/sec) | am335x-evm: RX Throughput (Mbits/sec) | am57xx-evm: TX Throughput (Mbits/sec) | am57xx-evm: RX Throughput (Mbits/sec) | am654x-evm: TX Throughput (Mbits/sec) | am654x-evm: RX Throughput (Mbits/sec) |
---|---|---|---|---|---|---|
8 | 99.12 | 12.83 | 180.10 | 32.60 | 216.60 | 44.00 |
16 | 89.60 | 21.01 | 209.30 | 55.40 | 218.00 | 78.70 |
32 | 80.40 | 59.60 | 261.40 | 188.70 | 285.10 | 200.80 |
64 | 83.00 | 72.60 | 278.30 | 257.00 | 356.70 | 345.00 |
128 | 83.30 | 73.70 | 278.80 | 260.00 | 357.70 | 355.00 |
2.2.1.21. CRYPTO Driver¶
2.2.1.21.1. OpenSSL Performance¶
Algorithm | Buffer Size (bytes) | am335x-evm: throughput (KBytes/sec) | am57xx-evm: throughput (KBytes/sec) | am654x-evm: throughput (KBytes/sec) |
---|---|---|---|---|
aes-128-cbc | 1024 | 10707.97 | 14142.81 | 21044.22 |
aes-128-cbc | 16 | 2038.86 | 3963.51 | 356.28 |
aes-128-cbc | 256 | 3951.62 | 5149.10 | 5730.56 |
aes-128-cbc | 64 | 6522.79 | 13341.33 | 1434.92 |
aes-128-cbc | 8192 | 19802.79 | 31099.56 | 106181.97 |
aes-192-cbc | 1024 | 10326.36 | 14168.75 | 20696.75 |
aes-192-cbc | 16 | 1993.16 | 3831.26 | 359.53 |
aes-192-cbc | 256 | 3996.42 | 5107.11 | 5647.53 |
aes-192-cbc | 64 | 6216.17 | 12637.14 | 1425.26 |
aes-192-cbc | 8192 | 19761.83 | 31074.99 | 100941.82 |
aes-256-cbc | 1024 | 10310.31 | 14148.95 | 20603.90 |
aes-256-cbc | 16 | 1999.13 | 3957.61 | 384.85 |
aes-256-cbc | 256 | 3471.27 | 5114.71 | 5640.70 |
aes-256-cbc | 64 | 6079.68 | 12663.32 | 1431.91 |
aes-256-cbc | 8192 | 18085.21 | 31176.02 | 96266.92 |
des-cbc | 1024 | 11829.93 | 9155.93 | 14641.49 |
des-cbc | 16 | 2053.66 | 312.67 | 3281.91 |
des-cbc | 256 | 9820.33 | 3907.33 | 12553.39 |
des-cbc | 64 | 5579.50 | 1186.24 | 8016.38 |
des-cbc | 8192 | 12836.86 | 15029.59 | 15387.31 |
des3 | 1024 | 5160.28 | 9041.24 | 20194.99 |
des3 | 16 | 1643.03 | 314.46 | 521.75 |
des3 | 256 | 4683.61 | 3923.11 | 5811.88 |
des3 | 64 | 3328.68 | 1190.19 | 1697.88 |
des3 | 8192 | 5163.69 | 14731.95 | 69080.41 |
md5 | 1024 | 8355.84 | 13727.40 | 30480.73 |
md5 | 16 | 403.34 | 926.36 | 672.86 |
md5 | 256 | 3364.27 | 4427.01 | 9729.96 |
md5 | 64 | 1589.03 | 3626.33 | 2612.84 |
md5 | 8192 | 36227.75 | 56011.43 | 80442.71 |
sha1 | 1024 | 8345.60 | 13240.66 | 37263.70 |
sha1 | 16 | 362.45 | 831.46 | 653.37 |
sha1 | 256 | 3155.11 | 4829.10 | 10138.97 |
sha1 | 64 | 1424.68 | 3245.29 | 2594.97 |
sha1 | 8192 | 34947.07 | 55170.39 | 168102.57 |
Algorithm | am335x-evm: CPU Load | am57xx-evm: CPU Load | am654x-evm: CPU Load |
---|---|---|---|
aes-128-cbc | 36.00 | 52.00 | 45.00 |
aes-192-cbc | 51.00 | 51.00 | 45.00 |
aes-256-cbc | 52.00 | 52.00 | 45.00 |
des-cbc | 98.00 | 21.00 | 99.00 |
des3 | 97.00 | 22.00 | 46.00 |
md5 | 81.00 | 73.00 | 99.00 |
sha1 | 59.00 | 75.00 | 99.00 |
time -v openssl speed -elapsed -evp aes-128-cbc
2.2.1.21.2. IPSec Performance¶
Note: queue_len is set to 300 and the software fallback threshold is set to 9 to enable software support for optimal performance.
Algorithm | am335x-evm: Throughput | am335x-evm: Packets/Sec | am335x-evm: CPU Load | am57xx-evm: Throughput | am57xx-evm: Packets/Sec | am57xx-evm: CPU Load | am654x-evm: Throughput | am654x-evm: Packets/Sec | am654x-evm: CPU Load |
---|---|---|---|---|---|---|---|---|---|
3des | 17.90 | 1.00 | 75.50 | ||||||
aes128 | 4.40 | 0.00 | 54.00 | 104.00 | 8.00 | 53.30 | 186.60 | 15.00 | 32.50 |
aes192 | 25.10 | 2.00 | 91.10 | 144.30 | 12.00 | 60.70 | |||
aes256 | 29.60 | 2.00 | 78.50 | 144.30 | 12.00 | 60.10 |
2.2.2. RT Kernel Performance Guide¶
Read This First
All performance numbers provided in this document are gathered using the following Evaluation Modules unless otherwise specified.
Name | Description |
---|---|
AM335x | AM335x Evaluation Module rev 1.5B with ARM running at 1000MHz, DDR3-400 (400MHz/800 MT/S), TMDXEVM3358 |
AM437x-gpevm | AM437x-gpevm Evaluation Module rev 1.5A with ARM running at 1000MHz, DDR3-400 (400MHz/800 MT/S), TMDSEVM437X |
AM572x IDK | AM57xx Evaluation Module rev A2 with ARM running at 1500MHz, DDR3L-533 (533 MHz/1066 MT/S), TMDXIDK5278 |
AM574x IDK | AM57xx Industrial Development Kit rev 1.0A with ARM running at 1500MHz, DDR3L-533 (533 MHz/1066 MT/S), TMDXIDK5278 |
K2HK EVM | K2 Hawkings Evaluation Module rev 40 with ARM running at 1200MHz, DDR3-1600 (800 MHz/1600 MT/S), EVMK2H |
K2G EVM | K2 Galileo Evaluation Module rev C, DDR3-1333 (666 MHz/1333 MT/S), EVMK2G |
AM65x EVM | AM65x Evaluation Module rev 1.0 with ARM running at 800MHz, DDR4-2400 (1333 MT/S), TMDX654GPEVM |
Table: Evaluation Modules
About This Manual
This document provides performance data for each of the device drivers that are part of the Processor SDK Linux package. It should be used in conjunction with the release notes and user guides provided with the Processor SDK Linux package for information on specific issues present with drivers included in a particular release.
If You Need Assistance
For further information or to report any problems, contact http://community.ti.com/ or http://support.ti.com/
2.2.2.1. System Benchmarks¶
2.2.2.1.1. LMBench¶
LMBench is a collection of microbenchmarks of which the memory bandwidth and latency related ones are typically used to estimate processor memory system performance.
Latency: lat_mem_rd-stride128-szN measures the cache miss penalty when N is equal to or smaller than the cache size at a given level. When N is at least double the size of the last-level cache, the result is the latency to external memory.
Bandwidth: bw_mem_bcopy-N measures the achievable memory bandwidth from software doing a memcpy()-type operation when N is equal to or smaller than the cache size at a given level. It is typically used to calculate external memory bandwidth. Each byte that is read and then written counts once, so the result should be roughly half of the STREAM copy result.
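The relationship between the two accounting schemes can be checked with a short calculation. The sketch below is illustrative only: the function name is hypothetical, and the two input values are the am574x-idk bw_mem-bcopy-16mb and STREAM copy figures reported elsewhere in this section.

```python
def stream_equivalent(bcopy_mb_s: float) -> float:
    """Convert an LMBench bcopy figure to STREAM-style accounting.

    LMBench counts a byte that is read and then written once;
    STREAM counts the read and the write separately, hence the factor of 2.
    """
    return 2.0 * bcopy_mb_s


# Values taken from the am574x-idk tables in this section.
bcopy_16mb = 1561.43   # bw_mem-bcopy-16mb (MB/s)
stream_copy = 3126.80  # STREAM copy (MB/s)

estimate = stream_equivalent(bcopy_16mb)
print(f"STREAM-style estimate from bcopy: {estimate:.2f} MB/s")
print(f"ratio to reported STREAM copy:    {estimate / stream_copy:.3f}")
```

For these two measurements the doubled bcopy figure lands within about one percent of the reported STREAM copy result, which is consistent with the rule of thumb above.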
Benchmarks | am574x-idk: perf |
---|---|
af_unix_sock_stream_latency (microsec) | 72.29 |
af_unix_socket_stream_bandwidth (MBs) | 868.00 |
bw_file_rd-io-1mb (MB/s) | 677.51 |
bw_file_rd-o2c-1mb (MB/s) | 524.75 |
bw_mem-bcopy-16mb (MB/s) | 1561.43 |
bw_mem-bcopy-1mb (MB/s) | 3548.75 |
bw_mem-bcopy-2mb (MB/s) | 2547.31 |
bw_mem-bcopy-4mb (MB/s) | 1691.57 |
bw_mem-bcopy-8mb (MB/s) | 1570.78 |
bw_mem-bzero-16mb (MB/s) | 3954.03 |
bw_mem-bzero-1mb (MB/s) | 3998.97 (min 3548.75, max 4449.19) |
bw_mem-bzero-2mb (MB/s) | 3453.04 (min 2547.31, max 4358.76) |
bw_mem-bzero-4mb (MB/s) | 2938.20 (min 1691.57, max 4184.83) |
bw_mem-bzero-8mb (MB/s) | 2789.07 (min 1570.78, max 4007.35) |
bw_mem-cp-16mb (MB/s) | 847.10 |
bw_mem-cp-1mb (MB/s) | 3681.64 (min 3043.63, max 4319.65) |
bw_mem-cp-2mb (MB/s) | 2603.96 (min 944.29, max 4263.63) |
bw_mem-cp-4mb (MB/s) | 2492.25 (min 858.65, max 4125.84) |
bw_mem-cp-8mb (MB/s) | 2429.62 (min 851.88, max 4007.35) |
bw_mem-fcp-16mb (MB/s) | 927.54 |
bw_mem-fcp-1mb (MB/s) | 3336.66 (min 2224.12, max 4449.19) |
bw_mem-fcp-2mb (MB/s) | 2710.45 (min 1062.13, max 4358.76) |
bw_mem-fcp-4mb (MB/s) | 2514.09 (min 843.35, max 4184.83) |
bw_mem-fcp-8mb (MB/s) | 2468.79 (min 930.23, max 4007.35) |
bw_mem-frd-16mb (MB/s) | 910.75 |
bw_mem-frd-1mb (MB/s) | 2137.80 (min 2051.47, max 2224.12) |
bw_mem-frd-2mb (MB/s) | 1351.28 (min 1062.13, max 1640.42) |
bw_mem-frd-4mb (MB/s) | 927.24 (min 843.35, max 1011.12) |
bw_mem-frd-8mb (MB/s) | 925.47 (min 920.70, max 930.23) |
bw_mem-fwr-16mb (MB/s) | 3953.06 |
bw_mem-fwr-1mb (MB/s) | 3185.56 (min 2051.47, max 4319.65) |
bw_mem-fwr-2mb (MB/s) | 2952.03 (min 1640.42, max 4263.63) |
bw_mem-fwr-4mb (MB/s) | 2568.48 (min 1011.12, max 4125.84) |
bw_mem-fwr-8mb (MB/s) | 2464.03 (min 920.70, max 4007.35) |
bw_mem-rd-16mb (MB/s) | 2641.57 |
bw_mem-rd-1mb (MB/s) | 8072.03 (min 7213.88, max 8930.17) |
bw_mem-rd-2mb (MB/s) | 6510.48 (min 5636.98, max 7383.97) |
bw_mem-rd-4mb (MB/s) | 2072.58 (min 1209.37, max 2935.78) |
bw_mem-rd-8mb (MB/s) | 1808.11 (min 948.65, max 2667.56) |
bw_mem-rdwr-16mb (MB/s) | 897.67 |
bw_mem-rdwr-1mb (MB/s) | 4024.71 (min 3043.63, max 5005.78) |
bw_mem-rdwr-2mb (MB/s) | 2003.25 (min 944.29, max 3062.20) |
bw_mem-rdwr-4mb (MB/s) | 966.96 (min 858.65, max 1075.27) |
bw_mem-rdwr-8mb (MB/s) | 884.13 (min 851.88, max 916.38) |
bw_mem-wr-16mb (MB/s) | 923.63 |
bw_mem-wr-1mb (MB/s) | 6967.98 (min 5005.78, max 8930.17) |
bw_mem-wr-2mb (MB/s) | 5223.09 (min 3062.20, max 7383.97) |
bw_mem-wr-4mb (MB/s) | 1142.32 (min 1075.27, max 1209.37) |
bw_mem-wr-8mb (MB/s) | 932.52 (min 916.38, max 948.65) |
bw_mmap_rd-mo-1mb (MB/s) | 2736.51 |
bw_mmap_rd-o2c-1mb (MB/s) | 790.89 |
bw_pipe (MB/s) | 213.41 |
bw_unix (MB/s) | 868.00 |
lat_connect (us) | 147.82 |
lat_ctx-2-128k (us) | 1.86 |
lat_ctx-2-256k (us) | 4.00 |
lat_ctx-4-128k (us) | 8.12 |
lat_ctx-4-256k (us) | 0.00 |
lat_fs-0k (num_files) | 100.00 |
lat_fs-10k (num_files) | 45.00 |
lat_fs-1k (num_files) | 65.00 |
lat_fs-4k (num_files) | 68.00 |
lat_mem_rd-stride128-sz1000k (ns) | 19.28 |
lat_mem_rd-stride128-sz125k (ns) | 19.06 |
lat_mem_rd-stride128-sz250k (ns) | 19.28 |
lat_mem_rd-stride128-sz31k (ns) | 9.74 |
lat_mem_rd-stride128-sz50 (ns) | 4.01 |
lat_mem_rd-stride128-sz500k (ns) | 19.28 |
lat_mem_rd-stride128-sz62k (ns) | 19.06 |
lat_mmap-1m (us) | 163.00 |
lat_ops-double-add (ns) | 1.10 |
lat_ops-double-mul (ns) | 5.05 |
lat_ops-float-add (ns) | 1.10 |
lat_ops-float-mul (ns) | 5.05 |
lat_ops-int-add (ns) | 1.01 |
lat_ops-int-bit (ns) | 0.67 |
lat_ops-int-div (ns) | 88.24 |
lat_ops-int-mod (ns) | 15.50 |
lat_ops-int-mul (ns) | 3.16 |
lat_ops-int64-add (ns) | 1.11 |
lat_ops-int64-bit (ns) | 1.04 |
lat_ops-int64-div (ns) | 183.98 |
lat_ops-int64-mod (ns) | 34.31 |
lat_pagefault (us) | 1.83 |
lat_pipe (us) | 43.74 |
lat_proc-exec (us) | 2050.00 |
lat_proc-fork (us) | 2002.67 |
lat_proc-proccall (us) | 0.01 |
lat_select (us) | 41.47 |
lat_sem (us) | 3.95 |
lat_sig-catch (us) | 11.88 |
lat_sig-install (us) | 2.29 |
lat_sig-prot (us) | 0.45 |
lat_syscall-fstat (us) | 10.29 |
lat_syscall-null (us) | 0.46 |
lat_syscall-open (us) | 363.56 |
lat_syscall-read (us) | 0.69 |
lat_syscall-stat (us) | 17.84 |
lat_syscall-write (us) | 0.57 |
lat_tcp (us) | 0.87 |
lat_unix (us) | 72.29 |
latency_for_0.50_mb_block_size (nanosec) | 19.28 |
latency_for_1.00_mb_block_size (nanosec) | 9.64 (min 0.00, max 19.28) |
pipe_bandwidth (MBs) | 213.41 |
pipe_latency (microsec) | 43.74 |
procedure_call (microsec) | 0.01 |
select_on_200_tcp_fds (microsec) | 41.47 |
semaphore_latency (microsec) | 3.95 |
signal_handler_latency (microsec) | 2.29 |
signal_handler_overhead (microsec) | 11.88 |
tcp_ip_connection_cost_to_localhost (microsec) | 147.82 |
tcp_latency_using_localhost (microsec) | 0.87 |
Table: LM Bench Metrics
2.2.2.1.2. Dhrystone¶
Dhrystone is a core only benchmark that runs from warm L1 caches in all modern processors. It scales linearly with clock speed. For standard ARM cores the DMIPS/MHz score will be identical with the same compiler and flags.
Benchmarks | am574x-idk: perf | am654x-idk: perf |
---|---|---|
cpu_clock (MHz) | 10.00 | 400.00 |
dhrystone_per_mhz (DMIPS/MHz) | 334.80 | 6.10 |
dhrystone_per_second (DhrystoneP) | 5882353.00 | 4255319.00 |
Table: Dhrystone Benchmark
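The DMIPS/MHz column appears to follow the conventional definition, in which the Dhrystones-per-second score is divided by 1757 (the score of the VAX 11/780 reference machine, defined as 1 DMIPS) and then by the clock frequency. The sketch below is a minimal check under that assumption; the function names are hypothetical and the inputs are copied from the table above.

```python
# Conventional reference: a VAX 11/780 scores 1757 Dhrystones/s, defined as 1 DMIPS.
VAX_11_780_DHRYSTONES_PER_SECOND = 1757


def dmips(dhrystones_per_second: float) -> float:
    """Normalise a raw Dhrystone score to DMIPS."""
    return dhrystones_per_second / VAX_11_780_DHRYSTONES_PER_SECOND


def dmips_per_mhz(dhrystones_per_second: float, cpu_clock_mhz: float) -> float:
    """Normalise DMIPS by the clock frequency reported for the run."""
    return dmips(dhrystones_per_second) / cpu_clock_mhz


# (board, dhrystone_per_second, cpu_clock in MHz) as listed in the table.
rows = [
    ("am574x-idk", 5882353.0, 10.0),
    ("am654x-idk", 4255319.0, 400.0),
]
for board, dps, mhz in rows:
    print(f"{board}: {dmips_per_mhz(dps, mhz):.2f} DMIPS/MHz")
```

With these inputs the calculation gives roughly 334.8 and 6.05, close to the tabulated values. The result depends directly on the cpu_clock reported for the run, so a row whose cpu_clock does not match the clock listed for the board in the Evaluation Modules table (1500MHz for the AM574x IDK) yields a DMIPS/MHz figure that should not be compared across devices.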
2.2.2.1.3. Whetstone¶
Benchmarks | am574x-idk: perf | am654x-idk: perf |
---|---|---|
whetstone (MIPS) | 3333.30 | 3333.30 |
Table: Whetstone Benchmark
2.2.2.1.4. Linpack¶
Linpack measures peak double-precision (64-bit) floating-point performance in solving a dense linear system.
Benchmarks | am574x-idk: perf | am654x-idk: perf |
---|---|---|
linpack (Kflops) | 648472.00 | 330144.00 |
Table: Linpack Benchmark
2.2.2.1.5. NBench¶
Benchmarks | am574x-idk: perf | am654x-idk: perf |
---|---|---|
assignment (Iterations) | 13.35 | 7.68 |
fourier (Iterations) | 12555.00 | 8196.30 |
fp_emulation (Iterations) | 61.65 | 61.36 |
huffman (Iterations) | 902.59 | 671.87 |
idea (Iterations) | 2861.70 | 1915.40 |
lu_decomposition (Iterations) | 550.55 | 311.81 |
neural_net (Iterations) | 15.32 | 4.21 |
numeric_sort (Iterations) | 511.63 | 298.26 |
string_sort (Iterations) | 93.90 | 94.43 |
Table: NBench Benchmarks
2.2.2.1.6. Stream¶
STREAM is a microbenchmark for measuring data memory system performance without any data reuse. It is designed to miss in the caches and exercise the data prefetcher and speculative accesses. It uses double-precision floating point (64-bit), but in most modern processors the memory access will be the bottleneck. The four individual scores are copy, scale (multiply by a constant), add (add two numbers), and triad (multiply-accumulate). For bandwidth, a byte read counts as one and a byte written counts as one, resulting in a score that is double the bandwidth LMBench will show.
Benchmarks | am574x-idk: perf | am654x-idk: perf |
---|---|---|
add (MB/s) | 2937.40 | 1589.40 |
copy (MB/s) | 3126.80 | 1832.70 |
scale (MB/s) | 3118.00 | 1824.50 |
triad (MB/s) | 2900.20 | 1494.40 |
Table: Stream
CoreMarkPro
CoreMark®-Pro is a comprehensive, advanced processor benchmark that works with and enhances the market-proven industry-standard EEMBC CoreMark® benchmark. While CoreMark stresses the CPU pipeline, CoreMark-Pro tests the entire processor, adding comprehensive support for multicore technology, a combination of integer and floating-point workloads, and data sets for utilizing larger memory subsystems.
Table: CoreMarkPro
2.2.2.1.7. MultiBench¶
MultiBench is a suite of benchmarks that allows processor and system designers to analyze, test, and improve multicore processors. It uses three forms of concurrency:
- Data decomposition: multiple threads cooperating on achieving a unified goal and demonstrating a processor's support for fine-grain parallelism.
- Processing multiple data streams: uses common code running over multiple threads and demonstrating how well a processor scales over scalable data inputs.
- Multiple workload processing: shows the scalability of general-purpose processing, demonstrating concurrency over both code and data.
MultiBench combines a wide variety of application-specific workloads with the EEMBC Multi-Instance-Test Harness (MITH), which is compatible and portable with almost any multicore processor and operating system. MITH uses a thread-based API (POSIX-compliant) to establish a common programming model that communicates with the benchmark through an abstraction layer and provides a flexible interface to allow a wide variety of thread-enabled workloads to be tested.
Table: Multibench
2.2.2.1.8. Spec2K6¶
CPU2006 is a set of benchmarks designed to test the CPU performance of a modern server computer system. It is split into two components: CINT2006 (SPECint) for integer testing and CFP2006 (SPECfp) for floating-point testing.
SPEC defines a base runtime for each of the 12 benchmark programs. For SPECint2006, that number ranges from 1000 to 3000 seconds. The timed test is run on the system, and the time of the test system is compared to the reference time, and a ratio is computed. That ratio becomes the SPECint score for that test. (This differs from the rating in SPECINT2000, which multiplies the ratio by 100.)
As an example for SPECint2006, consider a processor which can run 400.perlbench in 2000 seconds. The time it takes the reference machine to run the benchmark is 9770 seconds. Thus the ratio is 4.885. Each ratio is computed, and then the geometric mean of those ratios is computed to produce an overall value.
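The scoring arithmetic described above can be sketched as follows. This is a minimal illustration, not SPEC tooling: the function names are hypothetical, the 400.perlbench numbers come from the example in the text, and the other two ratios in the list are made-up placeholders used only to show the geometric mean.

```python
import math


def spec_ratio(reference_seconds: float, measured_seconds: float) -> float:
    """Ratio of the reference machine's runtime to the measured runtime."""
    return reference_seconds / measured_seconds


def geometric_mean(ratios: list[float]) -> float:
    """Geometric mean, computed via logarithms to avoid overflow on long lists."""
    return math.exp(sum(math.log(r) for r in ratios) / len(ratios))


# 400.perlbench example from the text: reference 9770 s, measured 2000 s.
perlbench = spec_ratio(9770, 2000)
print(f"400.perlbench ratio: {perlbench:.3f}")

# The remaining ratios are hypothetical placeholders, not measured data.
example_ratios = [perlbench, 5.2, 3.9]
print(f"overall score (geometric mean): {geometric_mean(example_ratios):.3f}")
```

The geometric mean is used rather than the arithmetic mean so that no single benchmark with an unusually large ratio dominates the overall value.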
Table: Spec2K6
2.2.2.2. Maximum Latency under different use cases¶
2.2.2.2.1. Shield (dedicated core) Case¶
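# Shield the RT workload: move every existing task to the "nonrt" cgroup
# and place the current shell in the "rt" cgroup.
shield_shell()
{
    # create_cgroup is a helper that is not shown here; its arguments
    # appear to be the cgroup name and the CPU assigned to it.
    create_cgroup nonrt 0
    create_cgroup rt 1
    for pid in $(cat /sys/fs/cgroup/tasks); do /bin/echo $pid > /sys/fs/cgroup/nonrt/tasks; done
    # $$ is the PID of the current shell
    /bin/echo $$ > /sys/fs/cgroup/rt/tasks
}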
Use Case | am574x-idk: Latency |
---|---|
L_PERF_SHIELD_SMP_1080ENC | 28.00 |
L_PERF_SHIELD_SMP_GRAPHICS | 29.00 |
L_PERF_SHIELD_SMP_HACKBENCH | 29.00 |
L_PERF_SHIELD_SMP_MEM | 29.00 |
L_PERF_SHIELD_SMP_NET | 30.00 |
L_PERF_SHIELD_SMP_NO_LOAD | 29.00 |
L_PERF_SHIELD_SMP_STRESS_LOAD | 37.00 |
L_PERF_SHIELD_SMP_UART | 31.00 |
L_PERF_SHIELD_SMP_USB | 31.00 |
2.2.2.3. Boot-time Measurement¶
2.2.2.3.1. Boot media: MMCSD¶
Boot Configuration | am574x-idk: boot time (sec) | am654x-idk: boot time (sec) |
---|---|---|
Kernel boot time test when bootloader, kernel and sdk-rootfs are in mmc-sd | 38.17 (min 37.82, max 38.73) | 15.13 (min 14.82, max 15.47) |
Kernel boot time test when init is /bin/sh and bootloader, kernel and sdk-rootfs are in mmc-sd | 8.03 (min 7.99, max 8.04) | 5.80 (min 5.78, max 5.84) |
Table: Boot time MMC/SD
2.2.2.3.2. Boot media: NAND¶
Table: Boot time NAND
2.2.2.4. ALSA SoC Audio Driver¶
- Access type - RW_INTERLEAVED
- Channels - 2
- Format - S16_LE
- Period size - 64
Table: Audio Capture
Sampling Rate (Hz) | am574x-idk: Throughput (bits/sec) | am574x-idk: CPU Load (%) |
---|---|---|
8000 | 1024418.00 | 0.10 |
11025 | 1024418.00 | 0.04 |
16000 | 1024418.00 | 0.09 |
22050 | 1024418.00 | 0.07 |
24000 | 1024418.00 | 0.06 |
32000 | 1024418.00 | 0.09 |
44100 | 1411775.00 | 1.66 |
48000 | 1536626.00 | 0.13 |
88200 | 2823548.00 | 3.10 |
96000 | 3073247.00 | 0.04 |
Table: Audio Playback
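For the stream parameters listed above (2 channels, S16_LE, i.e. 16 bits per sample), the nominal PCM bit rate is the sampling rate multiplied by the channel count and the sample width. The sketch below compares that nominal rate with some of the measured throughput values from the table; the function name is hypothetical.

```python
def pcm_bit_rate(sample_rate_hz: int, channels: int = 2, bits_per_sample: int = 16) -> int:
    """Nominal bit rate of an uncompressed interleaved PCM stream."""
    return sample_rate_hz * channels * bits_per_sample


# (sampling rate in Hz, measured throughput in bits/sec) from the table above.
measured = [
    (44100, 1411775.0),
    (48000, 1536626.0),
    (88200, 2823548.0),
    (96000, 3073247.0),
]
for rate, throughput in measured:
    nominal = pcm_bit_rate(rate)
    print(f"{rate} Hz: nominal {nominal} bits/sec, measured/nominal = {throughput / nominal:.4f}")
```

The rows from 44100 Hz upward track the nominal rate closely. The rows at 32000 Hz and below all report the same throughput in the table, which this simple formula does not reproduce.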
2.2.2.5. Sensor Capture¶
Capture video frames (MMAP buffers) with v4l2-ctl and record the reported fps.
Table: Sensor Capture
2.2.2.6. Display Driver¶
Table: Display performance (LCD)
Table: Display performance (DVI)
Mode | am574x-idk: Fps |
---|---|
1024x576@60 | 59.97 (min 59.88, max 60.09) |
1024x768@60 | 60.00 (min 59.88, max 60.14) |
1024x768@70 | 70.07 (min 69.84, max 70.23) |
1024x768@75 | 75.03 (min 74.84, max 75.23) |
1152x864@75 | 75.00 (min 74.81, max 75.22) |
1280x1024@60 | 60.02 (min 59.89, max 60.15) |
1280x1024@75 | 75.02 (min 74.86, max 75.25) |
1280x720@60 | 60.00 (min 59.90, max 60.11) |
1280x768@60 | 59.87 (min 59.70, max 60.07) |
1280x768@75 | 74.89 (min 74.72, max 75.12) |
1280x800@60 | 59.81 (min 59.71, max 59.99) |
1280x800@75 | 74.93 (min 74.74, max 75.12) |
1280x960@60 | 60.00 (min 59.85, max 60.12) |
1360x768@60 | 59.95 (min 59.79, max 60.11) |
1400x1050@60 | 59.98 (min 59.77, max 60.23) |
1400x1050@75 | 74.85 (min 73.64, max 75.39) |
1440x900@60 | 59.89 (min 59.52, max 60.22) |
1440x900@75 | 74.98 (min 74.70, max 75.21) |
1600x1200@60 | 60.00 (min 59.87, max 60.15) |
1600x1200@65 | 65.00 (min 64.87, max 65.18) |
1600x1200@70 | 70.00 (min 69.72, max 70.21) |
1600x900@60 | 60.00 (min 59.66, max 60.38) |
1680x1050@60 | 59.95 (min 59.82, max 60.09) |
1680x1050@75 | 74.89 (min 74.69, max 75.14) |
1680x945@60 | 60.02 (min 59.75, max 60.16) |
1920x1080@60 | 60.00 (min 59.88, max 60.16) |
2048x1152@60 | 60.00 (min 59.87, max 60.15) |
640x480@60 | 59.98 (min 58.98, max 60.11) |
640x480@73 | 72.81 (min 72.70, max 72.97) |
640x480@75 | 75.00 (min 74.83, max 75.16) |
720x400@70 | 70.08 (min 69.92, max 70.24) |
800x600@56 | 56.25 (min 56.13, max 56.37) |
800x600@60 | 60.32 (min 60.24, max 60.44) |
800x600@72 | 72.19 (min 71.99, max 72.36) |
800x600@75 | 75.00 (min 74.76, max 75.27) |
832x624@75 | 74.55 (min 73.98, max 75.19) |
848x480@60 | 60.00 (min 59.90, max 60.12) |
Table: Display performance (HDMI)
2.2.2.7. Graphics SGX Driver¶
Run GLBenchmark and capture the reported performance: display rate (Fps), fill rate, vertex throughput, etc. All display outputs (HDMI and LCD) are connected when running these tests.
2.2.2.7.1. Performance (Fps)¶
Benchmark | am574x-idk: Test Number | am574x-idk: Fps |
---|---|---|
GLB25_EgyptTestC24Z16FixedTime test | 2500005.00 | 30.16 (min 17.44, max 58.07) |
GLB25_EgyptTestC24Z16_ETC1 test | 2501001.00 | 36.36 (min 16.59, max 60.06) |
GLB25_EgyptTestC24Z16_ETC1to565 test | 2501401.00 | 36.39 (min 16.67, max 60.02) |
GLB25_EgyptTestC24Z16_PVRTC4 test | 2501101.00 | 36.10 (min 16.57, max 60.05) |
GLB25_EgyptTestC24Z24MS4 test | 2500003.00 | 32.43 (min 15.52, max 60.00) |
Table: GLBenchmark 2.5 Performance
2.2.2.7.2. Vertex Throughput¶
Benchmark | am574x-idk: Test Number | am574x-idk: Rate (triangles/sec) |
---|---|---|
GLB25_TriangleTexFragmentLitTestC24Z16 test | 2500511.00 | 17604802.00 |
GLB25_TriangleTexTestC24Z16 test | 2500301.00 | 85685896.00 |
GLB25_TriangleTexVertexLitTestC24Z16 test | 2500411.00 | 20564996.00 |
Table: GLBenchmark 2.5 Vertex Throughput
2.2.2.7.3. Pixel Throughput¶
Benchmark | am574x-idk: Test Number | am574x-idk: Rate (texel/sec) | am574x-idk: Fps |
---|---|---|---|
GLB25_FillTestC24Z16 test | 2500101.00 | 1050842688.00 | 42.73 (min 40.33, max 44.00) |
Table: GLBenchmark 2.5 Pixel Throughput
2.2.2.8. Multimedia (Decode)¶
Run the GStreamer pipeline "gst-launch-1.0 playbin uri=file://<Path to stream> video-sink="kmssink sync=false connector=<connector id>" audio-sink=fakesink" and calculate performance based on the execution time reported. All display outputs (HDMI and LCD) were connected when running these tests, but playout was forced to LCD via the connector=<connector id> option.
2.2.2.8.1. H264¶
Resolution | am574x-idk: Fps | am574x-idk: IVA Freq (MHz) | am574x-idk: IPU Freq (MHz) |
---|---|---|---|
1080p | 59.00 | ||
720p | 59940.00 | ||
CIF | 90000.00 |
Table: Gstreamer H264 in AVI Container Decode Performance
2.2.2.8.2. MPEG4¶
Resolution | am574x-idk: Fps | am574x-idk: IVA Freq (MHz) | am574x-idk: IPU Freq (MHz) |
---|---|---|---|
CIF | 30.00 | ||
QVGA | 30.00 |
Table: GStreamer MPEG4 in 3GP Container Decode Performance
2.2.2.9. Ethernet Driver¶
2.2.2.9.1. TCP Throughput¶
TCP Window Size (KBytes) | am574x-idk: Throughput (Mbits/sec) | am574x-idk: CPU Load | am654x-idk: Throughput (Mbits/sec) | am654x-idk: CPU Load |
---|---|---|---|---|
8 | 537.36 | 567.20 | ||
16 | 569.60 | 782.40 | ||
32 | 514.40 | |||
64 | 700.00 | 1249.60 | ||
128 | 571.20 | 1268.80 | ||
256 | 433.60 | 1311.20 |
Table: TCP Throughput No Interrupt Pacing
TCP Window Size (KBytes) | am574x-idk: Throughput (Mbits/sec) | am574x-idk: CPU Load | am654x-idk: Throughput (Mbits/sec) | am654x-idk: CPU Load |
---|---|---|---|---|
8 | 528.64 | 0.00 | ||
16 | 574.40 | 0.00 | ||
32 | 692.00 | 0.00 | ||
64 | 757.60 | 0.00 | ||
128 | 549.60 | 0.00 | ||
256 | 462.40 | 0.00 |
Table: TCP Throughput Interrupt Pacing
2.2.2.9.2. UDP Throughput¶
UDP Packet Size(bytes) | am574x-idk: Throughput (Mbits/sec) | am574x-idk: CPU Load | am574x-idk: Packets Per Second (kpps) | am654x-idk: Throughput (Mbits/sec) | am654x-idk: CPU Load | am654x-idk: Packets Per Second (kpps) |
---|---|---|---|---|---|---|
64 | 15.40 | 86.70 | 29.00 | |||
128 | 32.50 | 90.00 | 31.00 | |||
256 | 66.00 | 87.90 | 32.00 | |||
512 | 123.30 | 83.20 | 30.00 | |||
1024 | 243.00 | 82.90 | 29.00 | 473.00 | 100.00 | 57.00 |
1470 | 349.00 | 83.00 | 29.00 | 92.00 | 25.80 | 7.00 |
1500 | 211.00 | 83.00 | 17.00 | 446.00 | 100.00 | 37.00 |
8000 | 952.00 | 53.10 | 14.00 |
Table: UDP Throughput Egress
UDP Packet Size(bytes) | am574x-idk: Throughput (Mbits/sec) | am574x-idk: CPU Load | am574x-idk: Packets Per Second (kpps) | am654x-idk: Throughput (Mbits/sec) | am654x-idk: CPU Load | am654x-idk: Packets Per Second (kpps) |
---|---|---|---|---|---|---|
64 | 38.80 | 97.50 | 74.00 | 39.70 | 36.40 | 76.00 |
128 | 80.10 | 98.20 | 78.00 | 76.10 | 47.40 | 74.00 |
256 | 156.10 | 98.60 | 76.00 | 156.60 | 47.70 | 76.00 |
512 | 328.00 | 98.90 | 80.00 | 326.00 | 47.70 | 79.00 |
1024 | 646.00 | 98.80 | 78.00 | 618.00 | 48.20 | 75.00 |
1470 | 898.00 | 98.70 | 76.00 | 459.00 | 40.80 | 39.00 |
1500 | 295.00 | 84.40 | 24.00 | 240.00 | 35.30 | 20.00 |
8000 | 752.00 | 76.00 | 11.00 |
Table: UDP Throughput Ingress
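The throughput and packets-per-second columns in the UDP tables are related through the packet size. The sketch below is a rough cross-check only, under two assumptions that the source does not state: that 1 Mbit is 10^6 bits, and that the listed packet size is the number of bytes counted per packet. The function name is hypothetical.

```python
def packets_per_second_k(throughput_mbits: float, packet_size_bytes: int) -> float:
    """Estimate kilo-packets per second from throughput and packet size.

    Assumes 1 Mbit = 1e6 bits and that every packet carries packet_size_bytes.
    """
    bits_per_packet = packet_size_bytes * 8
    return throughput_mbits * 1e6 / bits_per_packet / 1e3


# am574x-idk ingress rows: (packet size in bytes, Mbits/sec, reported kpps).
rows = [(64, 38.80, 74.00), (1024, 646.00, 78.00)]
for size, mbits, reported in rows:
    estimate = packets_per_second_k(mbits, size)
    print(f"{size} bytes: estimated {estimate:.1f} kpps, reported {reported:.1f} kpps")
```

The estimates fall within a few percent of the reported values for these rows; exact agreement is not expected because the precise accounting used for the table is not documented here.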
- iperf version 2.0.5
- For receive performance, on DUT, invoke iperf in server mode.
iperf -s -u
- For transmit performance, on DUT, invoke iperf in client mode.
iperf -c <server ip> -b <bandwidth limit> -f M -t 60
2.2.2.10. PCIe Driver¶
2.2.2.10.1. PCIe-ETH¶
TCP Window Size(Kbytes) | am654x-idk: Bandwidth (Mbits/sec) |
---|---|
128 | 0.00 |
256 | 0.00 |
Table: PCI Ethernet
2.2.2.11. NAND Driver¶
2.2.2.12. QSPI Flash Driver¶
2.2.2.12.1. AM574x-IDK¶
Buffer size (bytes) | am574x-idk: Write UBIFS Throughput (Mbytes/sec) | am574x-idk: Write UBIFS CPU Load (%) | am574x-idk: Read UBIFS Throughput (Mbytes/sec) | am574x-idk: Read UBIFS CPU Load (%) |
---|---|---|---|---|
102400 | 0.30 (min 0.26, max 0.46) | 66.75 (min 65.75, max 68.16) | 15.17 | 31.03 |
262144 | 0.26 (min 0.24, max 0.28) | 66.00 (min 65.22, max 66.43) | 15.72 | 20.83 |
524288 | 0.26 (min 0.24, max 0.27) | 65.14 (min 63.15, max 66.35) | 15.57 | 29.63 |
1048576 | 0.26 (min 0.24, max 0.28) | 66.61 (min 64.84, max 67.53) | 15.32 | 20.83 |
2.2.2.12.2. AM654X-IDK¶
Buffer size (bytes) | am654x-idk: Write UBIFS Throughput (Mbytes/sec) | am654x-idk: Write UBIFS CPU Load (%) | am654x-idk: Read UBIFS Throughput (Mbytes/sec) | am654x-idk: Read UBIFS CPU Load (%) |
---|---|---|---|---|
102400 | 0.61 (min 0.48, max 1.11) | 25.11 (min 25.03, max 25.19) | 33.73 | 26.92 |
262144 | 0.43 (min 0.31, max 0.50) | 25.12 (min 25.06, max 25.19) | 33.38 | 20.00 |
524288 | 0.43 (min 0.30, max 0.51) | 25.14 (min 25.07, max 25.18) | 29.39 | 24.14 |
1048576 | 0.46 (min 0.34, max 0.52) | 25.15 (min 25.10, max 25.17) | 29.01 | 24.14 |
2.2.2.13. SPI Flash Driver¶
2.2.2.14. EMMC Driver¶
Warning
IMPORTANT: The performance numbers can be severely affected if the media is mounted in sync mode. Hot plug scripts in the filesystem mount removable media in sync mode to ensure data integrity. For performance sensitive applications, umount the auto-mounted filesystem and re-mount in async mode.
2.2.2.14.1. AM574x-IDK¶
Buffer size (bytes) | am574x-idk: Write EXT2 Throughput (Mbytes/sec) | am574x-idk: Write EXT2 CPU Load (%) | am574x-idk: Read EXT2 Throughput (Mbytes/sec) | am574x-idk: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 32.49 (min 27.56, max 33.99) | 13.56 (min 9.56, max 26.42) | 63.76 | 23.49 |
262144 | 32.17 (min 27.12, max 33.83) | 13.81 (min 10.25, max 26.76) | 67.55 | 23.96 |
524288 | 32.09 (min 26.92, max 33.50) | 13.82 (min 10.34, max 26.17) | 73.58 | 21.71 |
1048576 | 32.20 (min 27.05, max 34.03) | 13.79 (min 9.90, max 27.31) | 75.53 | 22.18 |
5242880 | 32.32 (min 27.07, max 34.00) | 14.16 (min 10.63, max 26.61) | 75.43 | 23.38 |
Buffer size (bytes) | am574x-idk: Write EXT4 Throughput (Mbytes/sec) | am574x-idk: Write EXT4 CPU Load (%) | am574x-idk: Read EXT4 Throughput (Mbytes/sec) | am574x-idk: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 32.13 (min 28.88, max 33.15) | 15.32 (min 11.36, max 24.03) | 64.78 | 21.38 |
262144 | 32.21 (min 28.76, max 33.32) | 16.30 (min 13.65, max 23.86) | 68.75 | 22.19 |
524288 | 32.26 (min 28.92, max 33.17) | 15.59 (min 13.04, max 23.42) | 75.13 | 22.94 |
1048576 | 32.40 (min 28.79, max 33.57) | 15.77 (min 13.00, max 23.89) | 77.53 | 24.45 |
5242880 | 32.34 (min 28.73, max 33.58) | 16.12 (min 13.73, max 24.21) | 77.66 | 24.16 |
2.2.2.14.2. AM654x-IDK¶
Buffer size (bytes) | am654x-idk: Write VFAT Throughput (Mbytes/sec) | am654x-idk: Write VFAT CPU Load (%) | am654x-idk: Read VFAT Throughput (Mbytes/sec) | am654x-idk: Read VFAT CPU Load (%) |
---|---|---|---|---|
102400 | 33.69 (min 25.57, max 35.78) | 6.80 (min 5.48, max 11.15) | 131.45 | 12.37 |
262144 | 33.75 (min 25.67, max 35.82) | 6.70 (min 5.31, max 11.14) | 135.10 | 16.89 |
524288 | 33.81 (min 25.61, max 35.95) | 6.78 (min 5.58, max 11.28) | 143.23 | 18.90 |
1048576 | 33.77 (min 25.52, max 35.88) | 6.81 (min 5.34, max 11.51) | 160.20 | 17.13 |
5242880 | 33.78 (min 25.74, max 35.86) | 6.67 (min 5.25, max 11.45) | 158.60 | 19.84 |
Buffer size (bytes) | am654x-idk: Write EXT2 Throughput (Mbytes/sec) | am654x-idk: Write EXT2 CPU Load (%) | am654x-idk: Read EXT2 Throughput (Mbytes/sec) | am654x-idk: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 34.55 (min 30.89, max 35.59) | 4.89 (min 4.09, max 7.89) | 135.55 | 16.12 |
262144 | 34.51 (min 30.59, max 35.64) | 4.87 (min 4.06, max 7.60) | 144.03 | 15.90 |
524288 | 34.47 (min 30.48, max 35.61) | 4.84 (min 3.92, max 7.66) | 157.64 | 16.54 |
1048576 | 34.48 (min 30.62, max 35.57) | 4.79 (min 3.93, max 7.33) | 168.31 | 18.62 |
5242880 | 34.59 (min 30.49, max 35.69) | 4.81 (min 3.85, max 7.64) | 166.93 | 17.62 |
Buffer size (bytes) | am654x-idk: Write EXT4 Throughput (Mbytes/sec) | am654x-idk: Write EXT4 CPU Load (%) | am654x-idk: Read EXT4 Throughput (Mbytes/sec) | am654x-idk: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 34.25 (min 29.59, max 35.45) | 5.87 (min 4.58, max 8.79) | 127.66 | 15.17 |
262144 | 34.79 (min 29.83, max 36.10) | 6.23 (min 5.36, max 8.86) | 145.12 | 16.67 |
524288 | 34.61 (min 29.75, max 35.97) | 6.28 (min 5.42, max 8.84) | 163.36 | 17.53 |
1048576 | 34.56 (min 29.63, max 36.10) | 6.06 (min 5.19, max 9.08) | 177.20 | 18.45 |
5242880 | 34.74 (min 29.87, max 36.17) | 6.13 (min 5.43, max 8.80) | 176.53 | 19.66 |
2.2.2.15. SATA Driver¶
- Filesize used is : 1G
- SATA II Harddisk used is: Seagate ST3500514NS 500G
2.2.2.15.1. mSATA Driver¶
- Filesize used is : 1G
- MSATA Harddisk used is: SMS200S3/30G Kingston mSATA SSD drive
2.2.2.16. MMC/SD Driver¶
Warning
IMPORTANT: The performance numbers can be severely affected if the media is mounted in sync mode. Hot plug scripts in the filesystem mount removable media in sync mode to ensure data integrity. For performance sensitive applications, umount the auto-mounted filesystem and re-mount in async mode.
2.2.2.16.1. AM574x-IDK¶
Buffer size (bytes) | am574x-idk: Write EXT4 Throughput (Mbytes/sec) | am574x-idk: Write EXT4 CPU Load (%) | am574x-idk: Read EXT4 Throughput (Mbytes/sec) | am574x-idk: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 17.19 (min 16.54, max 17.43) | 7.88 (min 5.88, max 13.64) | 18.41 | 6.60 |
262144 | 18.13 (min 16.88, max 18.89) | 8.96 (min 7.16, max 13.82) | 22.15 | 6.62 |
524288 | 18.29 (min 17.38, max 18.94) | 8.28 (min 6.37, max 13.63) | 22.68 | 6.74 |
1048576 | 18.36 (min 17.01, max 18.84) | 9.44 (min 7.53, max 15.56) | 22.80 | 7.17 |
5242880 | 18.15 (min 17.45, max 18.91) | 9.02 (min 7.00, max 14.52) | 22.83 | 7.10 |
2.2.2.16.2. AM654x-IDK¶
Buffer size (bytes) | am654x-idk: Write VFAT Throughput (Mbytes/sec) | am654x-idk: Write VFAT CPU Load (%) | am654x-idk: Read VFAT Throughput (Mbytes/sec) | am654x-idk: Read VFAT CPU Load (%) |
---|---|---|---|---|
102400 | 15.47 (min 13.18, max 16.72) | 3.34 (min 2.47, max 6.27) | 22.15 | 3.29 |
262144 | 15.88 (min 13.49, max 16.71) | 3.30 (min 2.44, max 6.18) | 22.16 | 3.44 |
524288 | 14.98 (min 11.49, max 16.66) | 3.10 (min 2.15, max 5.39) | 22.61 | 3.19 |
1048576 | 15.49 (min 13.54, max 16.63) | 3.28 (min 2.26, max 6.47) | 23.09 | 3.14 |
5242880 | 15.36 (min 13.45, max 16.62) | 3.27 (min 1.99, max 6.43) | 23.03 | 3.20 |
Buffer size (bytes) | am654x-idk: Write EXT2 Throughput (Mbytes/sec) | am654x-idk: Write EXT2 CPU Load (%) | am654x-idk: Read EXT2 Throughput (Mbytes/sec) | am654x-idk: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 16.43 (min 15.17, max 17.07) | 2.43 (min 1.96, max 3.77) | 22.54 | 2.49 |
262144 | 16.21 (min 15.01, max 17.08) | 2.35 (min 1.76, max 3.99) | 22.83 | 2.74 |
524288 | 16.13 (min 14.02, max 17.11) | 2.30 (min 1.64, max 3.84) | 23.41 | 2.36 |
1048576 | 16.09 (min 13.01, max 17.01) | 2.30 (min 1.97, max 3.32) | 23.56 | 2.54 |
5242880 | 16.61 (min 15.61, max 17.12) | 2.23 (min 1.75, max 3.58) | 23.57 | 2.60 |
Buffer size (bytes) | am654x-idk: Write EXT4 Throughput (Mbytes/sec) | am654x-idk: Write EXT4 CPU Load (%) | am654x-idk: Read EXT4 Throughput (Mbytes/sec) | am654x-idk: Read EXT4 CPU Load (%) |
---|---|---|---|---|
102400 | 17.27 (min 16.07, max 17.99) | 2.98 (min 2.40, max 4.80) | 22.67 | 2.71 |
262144 | 17.44 (min 15.76, max 18.00) | 3.17 (min 2.71, max 4.71) | 23.01 | 2.81 |
524288 | 17.51 (min 16.17, max 17.87) | 3.25 (min 2.61, max 5.10) | 23.51 | 2.42 |
1048576 | 17.42 (min 16.26, max 17.97) | 3.15 (min 2.62, max 4.90) | 23.76 | 2.50 |
5242880 | 16.94 (min 14.38, max 17.86) | 3.05 (min 2.49, max 4.60) | 23.69 | 2.33 |
The performance numbers were captured using the following:
- SanDisk 8GB MicroSDHC Class 10 Memory Card
- Partition was mounted with async option
2.2.2.21. USB Driver¶
2.2.2.21.1. MUSB/XHCI Host controller¶
Warning
IMPORTANT: For Mass-storage applications, the performance numbers can be severely affected if the media is mounted in sync mode. Hot plug scripts in the filesystem mount removable media in sync mode to ensure data integrity. For performance sensitive applications, umount the auto-mounted filesystem and re-mount in async mode.
Setup : Inateck ASM1153E USB hard disk is connected to usb0 port. File read/write performance data on usb0 port is captured.
Table: USB Host VFAT
Buffer size (bytes) | am654x-idk: Write VFAT Throughput (Mbytes/sec) | am654x-idk: Write VFAT CPU Load (%) | am654x-idk: Read VFAT Throughput (Mbytes/sec) | am654x-idk: Read VFAT CPU Load (%) |
---|---|---|---|---|
102400 | 33.41 (min 25.18, max 35.71) | 8.62 (min 6.95, max 12.45) | 38.22 | 7.43 |
262144 | 33.59 (min 25.37, max 36.11) | 9.17 (min 6.68, max 12.75) | 37.28 | 5.35 |
Table: USB Host EXT2
Buffer size (bytes) | am654x-idk: Write EXT2 Throughput (Mbytes/sec) | am654x-idk: Write EXT2 CPU Load (%) | am654x-idk: Read EXT2 Throughput (Mbytes/sec) | am654x-idk: Read EXT2 CPU Load (%) |
---|---|---|---|---|
102400 | 35.81 (min 31.04, max 37.36) | 6.69 (min 5.81, max 8.86) | 37.55 | 7.89 |
1048576 | 35.68 (min 31.06, max 37.19) | 6.37 (min 5.40, max 9.21) | 37.92 | 6.65 |
5242880 | 35.67 (min 31.00, max 37.06) | 6.98 (min 5.77, max 9.27) | 37.46 | 5.90 |
2.2.2.22. CRYPTO Driver
2.2.2.22.1. OpenSSL Performance
Algorithm | Buffer Size (bytes) | am574x-idk: Throughput (KBytes/sec) | am654x-idk: Throughput (KBytes/sec) |
---|---|---|---|
aes-128-cbc | 1024 | 6656.68 | 12538.88 |
aes-128-cbc | 16 | 1548.14 | 261.21 |
aes-128-cbc | 256 | 1926.57 | 4059.05 |
aes-128-cbc | 64 | 5513.94 | 1074.71 |
aes-128-cbc | 8192 | 23016.79 | 66617.34 |
aes-192-cbc | 1024 | 7769.77 | 12994.22 |
aes-192-cbc | 16 | 1478.41 | 225.37 |
aes-192-cbc | 256 | 2297.77 | 3276.97 |
aes-192-cbc | 64 | 5196.07 | 818.94 |
aes-192-cbc | 8192 | 24526.85 | 77376.17 |
aes-256-cbc | 1024 | 7786.84 | 12377.09 |
aes-256-cbc | 16 | 1509.84 | 212.30 |
aes-256-cbc | 256 | 2307.93 | 3261.61 |
aes-256-cbc | 64 | 5339.97 | 818.01 |
aes-256-cbc | 8192 | 24578.73 | 63070.21 |
des-cbc | 1024 | 5108.39 | 14333.61 |
des-cbc | 16 | 115.01 | 2599.46 |
des-cbc | 256 | 1663.32 | 11803.39 |
des-cbc | 64 | 449.45 | 6904.58 |
des-cbc | 8192 | 13170.01 | 15198.89 |
des3 | 1024 | 5677.06 | 15437.82 |
des3 | 16 | 135.50 | 204.84 |
des3 | 256 | 1930.33 | 4310.70 |
des3 | 64 | 530.26 | 1058.03 |
des3 | 8192 | 12907.86 | 56729.60 |
md5 | 1024 | 6046.72 | 27976.02 |
md5 | 16 | 414.81 | 599.95 |
md5 | 256 | 1946.45 | 8748.54 |
md5 | 64 | 1640.96 | 2329.37 |
md5 | 8192 | 31932.42 | 73717.08 |
sha1 | 1024 | 6013.95 | 26284.03 |
sha1 | 16 | 403.90 | 581.86 |
sha1 | 256 | 1954.56 | 8417.28 |
sha1 | 64 | 1591.34 | 2262.38 |
sha1 | 8192 | 31812.27 | 65533.27 |
Algorithm | am574x-idk: CPU Load (%) | am654x-idk: CPU Load (%) |
---|---|---|
aes-128-cbc | 52.00 | 47.00 |
aes-192-cbc | 50.00 | 47.00 |
aes-256-cbc | 50.00 | 48.00 |
des-cbc | 20.00 | 99.00 |
des3 | 16.00 | 43.00 |
md5 | 72.00 | 99.00 |
sha1 | 72.00 | 99.00 |
Command used to run the benchmark (shown for aes-128-cbc):
time -v openssl speed -elapsed -evp aes-128-cbc
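To compare runs against the throughput table, the summary that `openssl speed` prints at the end of a run can be turned into a lookup keyed by buffer size. The sketch below assumes the common summary layout: a header row starting with `type` that lists the block sizes, followed by one row per algorithm with values in 1000s of bytes per second and a `k` suffix. The exact layout can vary between OpenSSL versions. The sample text is hand-written to mimic that layout using the am574x-idk aes-128-cbc values from the table above; it is not a captured log, and `parse_speed_summary` is a hypothetical helper name.

```python
def parse_speed_summary(text):
    """Map algorithm name -> {block size in bytes: throughput in 1000s of bytes/sec}."""
    sizes, results = [], {}
    for line in text.splitlines():
        fields = line.split()
        if line.startswith("type"):
            # Header row, e.g. "type  16 bytes  64 bytes ...": keep the numbers only.
            sizes = [int(f) for f in fields if f.isdigit()]
        elif sizes:
            rates = fields[1:]
            # A result row has one 'k'-suffixed value per block size.
            if len(rates) == len(sizes) and all(r.endswith("k") for r in rates):
                results[fields[0]] = {
                    size: float(rate[:-1]) for size, rate in zip(sizes, rates)
                }
    return results


# Hand-written sample in the assumed layout (values taken from the table above).
sample = """\
The 'numbers' are in 1000s of bytes per second processed.
type             16 bytes     64 bytes    256 bytes   1024 bytes   8192 bytes
aes-128-cbc       1548.14k     5513.94k     1926.57k     6656.68k    23016.79k
"""
summary = parse_speed_summary(sample)
print(summary["aes-128-cbc"][8192])
```

Keying the result by numeric buffer size also makes it easy to list rows in ascending size order, rather than the text-sorted order (1024, 16, 256, 64, 8192) used in the table.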
2.2.2.22.2. IPSec Performance
Note: queue_len is set to 300 and the software fallback threshold is set to 9 to enable software support for optimal performance.
Algorithm | am654x-idk: Throughput | am654x-idk: Packets/Sec | am654x-idk: CPU Load |
---|---|---|---|
aes128 | 88.40 | 7.00 | 27.20 |
2.2.2.22.3. PRU Ethernet
Table: PRU UDP Throughput Egress
UDP Packet Size (bytes) | am654x-idk: Throughput (Mbits/sec) | am654x-idk: CPU Load (%) | am654x-idk: Packets Per Second (kpps) |
---|---|---|---|
64 | 28.20 | 44.30 | 54.00 |
1024 | 89.70 | 16.30 | 10.00 |
1470 | 93.70 | 12.20 | 7.00 |
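The throughput and packet-rate columns in the egress table can be cross-checked against each other with simple arithmetic. The sketch below assumes that the throughput counts UDP payload bits only and that 1 Mbit is 10^6 bits; neither assumption is stated in this document. The function name is hypothetical.

```python
def packets_per_second_kpps(throughput_mbps, packet_bytes):
    """Packet rate in kpps implied by a throughput in Mbits/sec and a packet size in bytes."""
    bits_per_packet = packet_bytes * 8
    return throughput_mbps * 1e6 / bits_per_packet / 1e3


# Rows from the egress table: (packet size in bytes, throughput in Mbits/sec).
for size, mbps in [(64, 28.20), (1024, 89.70), (1470, 93.70)]:
    print(size, round(packets_per_second_kpps(mbps, size), 2))
```

Under these assumptions the estimates come out slightly higher than the kpps values listed in the table, which appear to be reported as whole numbers. Whether the gap is due to rounding or to how the packet rate was measured cannot be determined from the table alone.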
Table: PRU UDP Throughput Ingress