Lines Matching +full:gpu +full:- +full:id

9 * NVLink-C2C0
10 * NVLink-C2C1
15 ----------
19 PMUs are managed by a common driver "arm-cs-arch-pmu". This driver describes
22 the driver provides "cpumask" sysfs attribute to show the CPU id used to handle
29 -------
32 strongly-ordered (SO) PCIE write traffic to local/remote memory. Please see
37 see /sys/bus/event_source/devices/nvidia_scf_pmu_<socket-id>.
41 * Count event id 0x0 in socket 0::
43 perf stat -a -e nvidia_scf_pmu_0/event=0x0/
45 * Count event id 0x0 in socket 1::
47 perf stat -a -e nvidia_scf_pmu_1/event=0x0/
49 NVLink-C2C0 PMU
50 --------------------
52 The NVLink-C2C0 PMU monitors incoming traffic from a GPU/CPU connected with
53 NVLink-C2C (Chip-2-Chip) interconnect. The type of traffic captured by this PMU
56 * NVIDIA Grace Hopper Superchip: Hopper GPU is connected with Grace SoC.
58 In this config, the PMU captures GPU ATS translated or EGM traffic from the GPU.
69 see /sys/bus/event_source/devices/nvidia_nvlink_c2c0_pmu_<socket-id>.
73 * Count event id 0x0 from the GPU/CPU connected with socket 0::
75 perf stat -a -e nvidia_nvlink_c2c0_pmu_0/event=0x0/
77 * Count event id 0x0 from the GPU/CPU connected with socket 1::
79 perf stat -a -e nvidia_nvlink_c2c0_pmu_1/event=0x0/
81 * Count event id 0x0 from the GPU/CPU connected with socket 2::
83 perf stat -a -e nvidia_nvlink_c2c0_pmu_2/event=0x0/
85 * Count event id 0x0 from the GPU/CPU connected with socket 3::
87 perf stat -a -e nvidia_nvlink_c2c0_pmu_3/event=0x0/
89 The NVLink-C2C has two ports that can be connected to one GPU (occupying both
90 ports) or to two GPUs (one GPU per port). The user can use "port" bitmap
97 * Count event id 0x0 from the GPU connected with socket 0 on port 0::
99 perf stat -a -e nvidia_nvlink_c2c0_pmu_0/event=0x0,port=0x1/
101 * Count event id 0x0 from the GPUs connected with socket 0 on port 0 and port 1::
103 perf stat -a -e nvidia_nvlink_c2c0_pmu_0/event=0x0,port=0x3/
105 NVLink-C2C1 PMU
106 -------------------
108 The NVLink-C2C1 PMU monitors incoming traffic from a GPU connected with
109 NVLink-C2C (Chip-2-Chip) interconnect. This PMU captures untranslated GPU
110 traffic, in contrast with NvLink-C2C0 PMU that captures ATS translated traffic.
115 see /sys/bus/event_source/devices/nvidia_nvlink_c2c1_pmu_<socket-id>.
119 * Count event id 0x0 from the GPU connected with socket 0::
121 perf stat -a -e nvidia_nvlink_c2c1_pmu_0/event=0x0/
123 * Count event id 0x0 from the GPU connected with socket 1::
125 perf stat -a -e nvidia_nvlink_c2c1_pmu_1/event=0x0/
127 * Count event id 0x0 from the GPU connected with socket 2::
129 perf stat -a -e nvidia_nvlink_c2c1_pmu_2/event=0x0/
131 * Count event id 0x0 from the GPU connected with socket 3::
133 perf stat -a -e nvidia_nvlink_c2c1_pmu_3/event=0x0/
135 The NVLink-C2C has two ports that can be connected to one GPU (occupying both
136 ports) or to two GPUs (one GPU per port). The user can use "port" bitmap
143 * Count event id 0x0 from the GPU connected with socket 0 on port 0::
145 perf stat -a -e nvidia_nvlink_c2c1_pmu_0/event=0x0,port=0x1/
147 * Count event id 0x0 from the GPUs connected with socket 0 on port 0 and port 1::
149 perf stat -a -e nvidia_nvlink_c2c1_pmu_0/event=0x0,port=0x3/
152 ---------------
154 The CNVLink PMU monitors traffic from GPU and PCIE device on remote sockets
160 see /sys/bus/event_source/devices/nvidia_cnvlink_pmu_<socket-id>.
167 /sys/bus/event_source/devices/nvidia_cnvlink_pmu_<socket-id>/format/rem_socket
172 traffic from remote GPU and PCIE devices.
176 * Count event id 0x0 for the traffic from remote socket 1, 2, and 3 to socket 0::
178 perf stat -a -e nvidia_cnvlink_pmu_0/event=0x0,rem_socket=0xE/
180 * Count event id 0x0 for the traffic from remote socket 0, 2, and 3 to socket 1::
182 perf stat -a -e nvidia_cnvlink_pmu_1/event=0x0,rem_socket=0xD/
184 * Count event id 0x0 for the traffic from remote socket 0, 1, and 3 to socket 2::
186 perf stat -a -e nvidia_cnvlink_pmu_2/event=0x0,rem_socket=0xB/
188 * Count event id 0x0 for the traffic from remote socket 0, 1, and 2 to socket 3::
190 perf stat -a -e nvidia_cnvlink_pmu_3/event=0x0,rem_socket=0x7/
194 ------------
201 see /sys/bus/event_source/devices/nvidia_pcie_pmu_<socket-id>.
207 /sys/bus/event_source/devices/nvidia_pcie_pmu_<socket-id>/format/root_port
212 * Count event id 0x0 from root port 0 and 1 of socket 0::
214 perf stat -a -e nvidia_pcie_pmu_0/event=0x0,root_port=0x3/
216 * Count event id 0x0 from root port 0 and 1 of socket 1::
218 perf stat -a -e nvidia_pcie_pmu_1/event=0x0,root_port=0x3/
223 ----------------
227 * **NVIDIA Grace Hopper Superchip**: Hopper GPU is connected with Grace SoC.
232 * SOCKET-A * * SOCKET-B *
241 * : GPU :<--NVLink-->: Grace :<---CNVLink--->: Grace :<--NVLink-->: GPU : *
252 GMEM = GPU Memory (e.g. HBM)
256 | Following table contains traffic coverage of Grace SoC PMU in socket-A:
260 +--------------+-------+-----------+-----------+-----+----------+----------+
262 + +-------+-----------+-----------+-----+----------+----------+
263 | Destination | |GPU ATS |GPU Not-ATS| | Socket-B | Socket-B |
264 | |PCI R/W|Translated,|Translated | CPU | CPU/PCIE1| GPU/PCIE2|
267 | Local | PCIE |NVLink-C2C0|NVLink-C2C1| SCF | SCF PMU | CNVLink |
269 +--------------+-------+-----------+-----------+-----+----------+----------+
270 | Local GMEM | PCIE | N/A |NVLink-C2C1| SCF | SCF PMU | CNVLink |
272 +--------------+-------+-----------+-----------+-----+----------+----------+
273 | Remote | PCIE |NVLink-C2C0|NVLink-C2C1| SCF | | |
276 +--------------+-------+-----------+-----------+-----+----------+----------+
277 | Remote GMEM | PCIE |NVLink-C2C0|NVLink-C2C1| SCF | | |
279 +--------------+-------+-----------+-----------+-----+----------+----------+
289 * SOCKET-A * * SOCKET-B *
298 * : Grace :<--------NVLink------->: Grace : *
309 GMEM = GPU Memory (e.g. HBM)
313 | Following table contains traffic coverage of Grace SoC PMU in socket-A:
317 +-----------------+-----------+---------+----------+-------------+
319 + +-----------+---------+----------+-------------+
320 | Destination | | | Socket-B | Socket-B |
324 | Local | PCIE PMU | SCF PMU | SCF PMU | NVLink-C2C0 |
326 +-----------------+-----------+---------+----------+-------------+
329 | over NVLink-C2C | | | | |
330 +-----------------+-----------+---------+----------+-------------+