Searched defs:thread_offset (Results 1 – 6 of 6) sorted by relevance

/aosp_15_r20/external/pytorch/aten/src/ATen/native/cuda/cutlass_extensions/gemm/warp/
mma_tensorop_dequantizer.h
141 const int thread_offset = warp_offset + quad; in MmaTensorOpDequantizer() local
249 const int thread_offset = warp_offset + quad; in MmaTensorOpDequantizer() local
343 const int thread_offset = warp_offset + base_col; in MmaTensorOpDequantizer() local
429 const int thread_offset = warp_offset + base_col; in MmaTensorOpDequantizer() local
/aosp_15_r20/external/eigen/unsupported/Eigen/CXX11/src/Tensor/
TensorScanSycl.h
155 …const Index thread_offset = (ScanParameters<Index>::ScanPerThread * local_id * scanParameters.scan… in operator() local
327 …const Index thread_offset = ScanParameters<Index>::ScanPerThread * local_id * scanParameters.scan_… in operator() local
/aosp_15_r20/external/pytorch/aten/src/ATen/native/cuda/
PersistentSoftmax.cuh
235 int thread_offset = first_batch * stride + local_idx; in softmax_warp_backward() local
/aosp_15_r20/external/pytorch/aten/src/ATen/native/transformers/cuda/mem_eff_attention/iterators/
epilogue_predicated_tile_iterator.h
233 TensorCoord thread_offset = in params_() local
/aosp_15_r20/external/mesa3d/src/broadcom/compiler/
vir_register_allocate.c
441 struct qreg thread_offset = in v3d_setup_spill_base() local
/aosp_15_r20/external/pytorch/aten/src/ATen/native/
TensorShape.cpp
2357 const auto thread_offset = [&]() { in index_select_sparse_cpu() local