1Mesa 23.3.0 Release Notes / 2023-11-29 2====================================== 3 4Mesa 23.3.0 is a new development release. People who are concerned 5with stability and reliability should stick with a previous release or 6wait for Mesa 23.3.1. 7 8Mesa 23.3.0 implements the OpenGL 4.6 API, but the version reported by 9glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / 10glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. 11Some drivers don't support all the features required in OpenGL 4.6. OpenGL 124.6 is **only** available if requested at context creation. 13Compatibility contexts may report a lower version depending on each driver. 14 15Mesa 23.3.0 implements the Vulkan 1.3 API, but the version reported by 16the apiVersion property of the VkPhysicalDeviceProperties struct 17depends on the particular driver being used. 18 19SHA256 checksum 20--------------- 21 22:: 23 24 50f729dd60ed6335b989095baad81ef5edf7cfdd4b4b48b9b955917cb07d69c5 mesa-23.3.0.tar.xz 25 26 27New drivers 28----------- 29- NVK: A Vulkan driver for Nvidia hardware 30 31New features 32------------ 33- VK_EXT_pipeline_robustness on ANV 34- VK_KHR_maintenance5 on RADV 35- OpenGL ES 3.1 on Asahi 36- GL_ARB_compute_shader on Asahi 37- GL_ARB_shader_atomic_counters on Asahi 38- GL_ARB_shader_image_load_store on Asahi 39- GL_ARB_shader_image_size on Asahi 40- GL_ARB_shader_storage_buffer_object on Asahi 41- GL_ARB_sample_shading on Asahi 42- GL_OES_sample_variables on Asahi 43- GL_OES_shader_multisample_interpolation on Asahi 44- GL_OES_gpu_shader5 on Asahi 45- EGL_ANDROID_blob_cache works when disk caching is disabled 46- VK_KHR_cooperative_matrix on RADV/GFX11+ 47 48 49Bug fixes 50--------- 51 52- crash in si_update_tess_io_layout_state during _mesa_ReadPixels (radeonsi_dri, mesa 23.2.1) 53- mesa: vertex attrib regression 54- [RADV] War Thunder has some grass flickering. 55- radv: satisfactory broken shader 56- RADV problem with R7 M440 in some games 57- gpu driver crashes when opening ingame map playing dead space 2023 58- [anv] Valheim water misrendering 59- EGL/v3d: EGL applications under a X compositor doesn't work 60- RADV: trunc_coord breaks ambient occlusion in Dirt Rally and other games 61- radv: Mass Effect Legendary Edition: a line going across the screen is visible in some areas with Ambient Occlusion enabled 62- anv: DIRT5 gfx11_generated_draws_spv_source triggers "assert(!copy_value_is_divergent(src) || copy_value_is_divergent(dest));" 63- panfrost: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12 buffers 64- [RADV][TONGA] - BeamNG.drive (284160) - Artifacts are present when looking at the skybox. 65- LEGO Star Wars: The Skywalker Saga graphical glitches (DXVK) on R9 380 66- [radv] Crypt not rendering properly 67- Leaks of DescriptorSet debug names 68- [Tracing flake] Missing geometry in trace\@freedreno-a630\@freedoom\@freedoom-phase2-gl-high.trace 69- Unreal Engine 5.2 virtual shadow maps have glitchy/lazy tile updates 70- RADV: Visual glitches in Unreal Engine 5.2.1 when using material with anisotropy and light channel 2 71- radv: Regression with UE5 test 72- SIGSEGV with MESA_VK_TRACE=rgp and compute only queue 73- [ANV] Corruptions in Battlefield 4 74- anv regression w/ commit e488773b29d97 ("anv: Fast clear depth/stencil surface in vkCmdClearAttachments") 75- ir3: dEQP-GLES31.functional.synchronization.inter_invocation.image_atomic_read_write crash on a6xx gen4 76- Zink + Venus: driver can't handle INVALID<->LINEAR! 77- Anv: Particles have black square artifacts on Counter Strike 2 on Skylake 78- Lords of the Fallen 2023 Red Eye mode crashing game and desktop 79- [radeonsi] [vulkan] [23.3-rc1 regression] Video output corrupted in QMplay2 with Vulkan renderer 80- [BISECTED] ac/radeon commit somehow breaks nv12 surface from HEVC decode 81- Parsec displays completely green screen with hardware decoder selected while using Mesa 23.3 and Mesa 24 82- H264 to H264 transcode output corruption with gst-vaapi 83- opencl-jpeg-encoder does not work with nouveau/rusticl, works with nouveau/clover 84- [R600] X-plane 11 demo (Linux Native) crashes upon launch on HD5870 and HD6970 85- Ubuntu 23.10 build error with rusticl_opencl_bindings.rs 86- Rusticl fails to build 87- ANV not handling VkMutableDescriptorTypeCreateInfoEXT::pMutableDescriptorTypeLists[i] being out of range 88- tu: Wolfenstein: The New Order misrenders on a740 89- DRI_PRIME fails with ACO only radeonsi 90- nir_to_tgsi: Incorrect handling of indirect array access 91- ANV gen9 32 bit vulkan asserts on many cts tests 92- GPU hang observed while launching 3DMark Wildlife Unlimited on MTL 93- ac/gpu_info: Query maximum submitted IBs from the kernel 94- RADV: regression in 23.2.1 causing GPU hang with RDNA1 in various UE5 games 95- GPU page faults reported while playing Talos Principle 2 (demo) 96- No CCS_E scanout on tgl+ with ANV 97- anv: Modifier tests assert-fail on TGL+ 98- ci: zink-tu jobs no longer included in manual pipelines 99- [ANV][A770] GravityMark segfaults and buffer allocation errors 100- etnaviv: gc2000 gles2 regression 101- ci_run_n_monitor: pipeline finding unreliable 102- nvk: Implement VK_EXT_dynamic_rendering_unused_attachments 103- anv: jsl timeline semaphores flaky 104- anv: OOB access in vkDestroyDevice? 105- nvk: Implement VK_EXT_primitive_topology_list_restart 106- nvk: Implement VK_EXT_image_sliced_view_of_3d 107- nvk: Implement VK_KHR_workgroup_memory_explicit_layout 108- util/macros: BITFIELD64_RANGE raises an error with mesa-clang if we try to set last bit 109- r300/r400 regression; can't compile \`if/then` in shaders 110- iris: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12/P010 buffers 111- nvk: Implement VK_EXT_depth_bias_control 112- ICL/zink: gpu hang on 'piglit.object namespace pollution.framebuffer with gldrawpixels' 113- [R600] Wolfenstein: The New Order text glitch on menu 114- need extension to request image/texture not use data dependent compression 115- rusticl: segfault in clCreateKernel on AMD Instinct MI100 116- !25587 broke xserver 117- GPU Hang in Deep Rock Galactic on DG2 118- intel: Wrong length for 3DSTATE_3D_MODE on gfx125 119- [radeonsi] Wargame: Red Dragon /w OpenGL stopped working with ACO 120- traces job reference images missing again sometimes 121- Vulkan Texture/Polygon Glitches in Games 122- freedreno: dmabuf modify query ignores format 123- virgl: removing PIPE_CAP_CLEAR_TEXTURE completely breaks virglrenderer 124- Turnip build error on termux 125- failiure in amd llvm helper 126- failiure in amd llvm helper 127- radv_amdgpu_cs_submit: Assertion \`chunk_data[request->number_of_ibs - 1].ib_data.ip_type == request->ip_type' failed. 128- hasvk: subgroups regression 129- radeonsi: broken hardware decoding (vaapi/vulkan) on RDNA2 gpu (bisected) 130- aco: SwizzleInvocationsMaskedAMD behavior is not correct for reads from inactive lanes 131- anv: dEQP-VK.ssbo.phys.layout.random.16bit.scalar.13 slow 132- [RDNA3] CS:GO - excessive power consumption and lower performance in Vulkan while MSAA is set to 4x or 8x 133- [ICL] piglit.spec.arb_gl_spirv.execution.ssbo.unsized-array regression 134- radv: Counter Strike 2 has multiple bugs while rendering smoke grenade effect 135- Doom Eternal freezing on NAVI31 with current git 136- iris CTS blend test fail with MSAA config on DG2 137- anv: 32bit mesa asserts 138- RADV: Randomly dissapearing objects in Starfield with RX 5xx and Vega graphics 139- anv: missing barrier handling on video engines 140- radv: Star Wars The Old Republic hang when DCC is enabled 141- radv: Resident Evil 6 hangs 7900XTX GPU when DCC is enabled if in Options go to Display settings 142- radv: Resident Evil 6 Benchmark Tool hangs 7900 XTX GPU when DCC is enabled immediately after splash screen 143- ANV: fp64 shader leaked 144- v3d: noop drm-shim raises some warnings 145- freedreno: crashdec/etc chip_id support 146- intel: compute dispatches with variable workgroup size have ralloc_asprintf CPU overhead 147- ci build issues with builtin types 148- freedreno: running angle perf traces with GALLIUM_THREAD=0 crashes 149- RadeonSI: glClear() causes clear texture for some frames on RX580 150- radeonsi: corruption when seeking video decoded with vaapi in mpv 151- Zink/HasVK regression bisected to "gallium: move vertex stride to CSO" 152- [radv] [Path Of Exile] - one setting in the workaround file breaks shadows/lighting rendering. Other workaround settings seems obsolete. 153- radv: images don't always have extents in RGP 154- shader_test causing a crash in compiler 155- D3D12: Video decoding requirements are too restrictive. ID3D12VideoDevice3 should not be required. 156- Crash in st_ReadPixels 157- [regression] intel build issue on i386 158- [ANV] [DG2/A770] The Spirit and The Mouse, miscellaneous issues with Mesa Git 159- zink on hasvk regression: Assertion \`(dyn)->vi_binding_strides[first_binding + i] == (strides[i])' failed. 160- Penumbra: Overture hangs on new game loading screen 161- [r300, RV516] Some deqp-gles2\@performance\@shader\@control_statement vertex tests cause hard lockup & reboot in mesa 22.3.1 (regression over 22.1.7) on a Radeon X1550 162- v3dv: Add a feature that implicitly copies the linear image to the tiled image prior to sampling from it 163- radv: Regression from 266b2cfe5bf3feda16747c50c1638fb5a0426958 164- h264 encoding picture showed randomly repeated frames. 165- Mesa CI: NAVI10 hangs when running VKCTS on Linux 6.1 166- zink: no uniform buffer objects support for v3dv? 167- v3dv: Request for VkImageDrmFormatModifierExplicitCreateInfoEXT::pPlaneLayouts support 168- [ANV] [DG2/A770] The Spirit and The Mouse, occasional flickering geometry 169- [Google][Rex][anv] GLES dEQP test fails in anv when run via ANGLE-on-Venus on ChromeOS ARCVM. 170- VAAPI on VCN: bad stream may crash whole gfx system 171- Crash after GPU reset 172- Bifrost PanVK should not be in CI 173- [Intel][Vulkan][Gen12] vkCmdCopyImage() generates garbage data when the destination texture is bound to a piece of used device memory 174- mesa: new glcts fails 175- tu: GPL support is broken 176- lavapipe: ycbcr regression 177- aco: Assertion when compiling CP2077 shader 178- anv: flakiness on tgl+ with samplemask handling 179- [RADV] Dead by Daylight memory leak (shader-related?) on 23.1.6 180- r300: optionally convert MULs into output modifier for the following MUL or DOT instructions 181- r300: better 1-x presubtract pattern matching 182- gpu hang on DG2 when running KHR-GLES31.core.texture_cube_map_array.image_op_tess* 183- KHR-GLES31.core.texture_cube_map_array.image_op_tessellation_evaluation_sh fail on GFX12+ 184- wsi: deadlocks when DISPLAY is changed 185- hasvk: Incompatible with minigbm/gralloc4 on Android 186- VAAPI: AMDGPU crash on RX 6900 XT on corrupted video 187- lavapipe/llvmpipe: shader unregister crash 188- [ANV] [DG2/A380] Corruption in Borderlands 3 189- blorp regression on dg2 190- decouple -Dshader-cache= from EGL_ANDROID_blob_cache 191- radv: commit 81641b01555faa4dd1dfc7de2513ad8d63e77ab7 leaded to artifacts in Quake II RTX 192- [radv] Colors are distorted in Cyberpunk 2077 with ray tracing enabled 193- Forza Horizon 5 stuttering since mesa 23.1.4 / 9b008673 revert as a FIX 194- ubsan + gtest build fails 195- glCopyTexSubImage2D is very slow on Intel 196- NVE4 (GeForce 710) fails to get vdpau in mesa git 197- [RADV] red and pink tinted shadows in Overwatch 2 on 7900 XTX 198- nouveau prevents hardware acceleration with Chromium (Wayland) 199- Corrupt text rendering in Blender 200- DRI2 gallium frontend is using bad format type 201- regression - MR 23089 - Hellblade RT crashing 202- Incorrect vlVaCreateBuffer/vlVaMapBuffer behavior for buffer type VAEncCodedBufferType in Gallium 203- Issue with clang-format 204- Follow-up from "Draft: intel: Disable color fast-clears for blorp_copy" 205- nightly VA-API build: new timeout 206- r600: retire the SB optimizer 207- ci: do not download perfetto on-fly in build jobs 208- Shared Memory Leak With Qt OpenGL Applications 209- OpenGL, SIGSEGV when program pipeline objects has separated vertex shader progam and separated fragment shader progam with in/out 210- vaDeriveImage returns VA_STATUS_ERROR_OPERATION_FAILED 211- 975a8ecc881873744d851ab0ef45ad7698eaa0ef "frontends/va: use resources instead of views" cause radeonsi can't play video. 212- zink: reduce pipeline hash size 213- Rusticl,radeonsi: ac_rtld error(2): too much LDS 214- aco, radv Rage 2 menu corruption - bisected 215- radv, aco: World War Z character texture regression on 7900xtx 216- android: De-stage drm_gralloc support from mesa3d 217- Cyberpunk screen goes black at game launch on integrated Gfx 218- lavapipe/llvmpipe: regressions since descriptor rewrite 219- intel: State cache invalidation after BLORP binding table setup ought to be unnecessary on ICL. 220- ci: HW job logs have spam at the end 221- kernel crash seen on AMD Raven device 222- crocus: regression crashing in doubles/ubo tests 223- turnip: object management CTS crashes 224- a618: multiple assertions with different kernel config on u_vector_add 225- [anv] Death Stranding crashes 226- Can no longer build Clover without llvmspirvlib 227- [radeonsi][vaapi] segfault in vl_video_buffer_sampler_view_components() when using vaapisink receiving I420 format 228- Baldurs Gate 3 (DX11) - Graphical corruption on RDNA3 (ACO regression) 229- [AMDGPU] Compiling large Blender Eevee shader node trees is unusably slow 230- Building llvmpipe with LP_USE_TEXTURE_CACHE set fails since 23.2.0-rc1: error C2039: dynamic_state is not member of lp_build_sampler_soa in lp_tex_sample.c 231- r300: calculate some cycles estimate for shader-db 232- intel: Deathloop and other DX12 games fail assert(validated) with invalid SEL instruction 233- GTF-GL46.gtf21.GL.build.CorrectFull_vert regressed on intel platforms 234- error message when encoding via VAAPI AMD 235- gpu hangs on dg2 with mesh shading enabled on vkcts 236- radeonsi: Deadlock when creating a new GL context in parallel with linking a shader on another GL context 237- robustness2 raygen tests intermittently fail in Intel Mesa CI 238- ci/ci_run_n_monitor.py: KeyError: 'clang-format' 239- glthread: huge performance regression 240- DirectX games do not launch on Intel HD Graphics 4000 (IVB GT2) [bisected] 241- rusticl: fails to build for iris + radeonsi 242 243 244Changes 245------- 246 247Adam Jackson (3): 248 249- egl: Implement EGL_EXT_explicit_device 250- mesa: Implement and advertise GL_MESA_sampler_objects 251- docs: Mention 'meson devenv' in the pre-install test instructions 252 253Aditya Swarup (6): 254 255- isl: enable Tile64 for 3D images 256- intel/isl: Unittest for linear to Ytile conversion 257- intel/isl: Convert linear texture to Tile4 format 258- intel/isl: Convert Tile4 texture to linear format 259- intel/isl: Linear to Tile-4 conversion unittest 260- Revert "iris: Disable tiled memcpy for Tile4" 261 262Alba Mendez (1): 263 264- meson: support installation tags 265 266Alejandro Piñeiro (61): 267 268- v3dv: re-enable sync_fd import/export on the simulator 269- broadcom(cle,clif,common,simulator): add 7.1 version on the list of versions to build 270- broadcom/cle: update the packet definitions for new generation v71 271- broadcom/common: add some common v71 helpers 272- broadcom/qpu: add comments on waddr not used on V3D 7.x 273- broadcom/qpu: set V3D 7.x names for some waddr aliasing 274- broadcom/compiler: rename small_imm to small_imm_b 275- broadcom/compiler: add small_imm a/c/d on v3d_qpu_sig 276- broadcom/qpu: add v71 signal map 277- broadcom/qpu: define v3d_qpu_input, use on v3d_qpu_alu_instr 278- broadcom/qpu: add raddr on v3d_qpu_input 279- broadcom/qpu: defining shift/mask for raddr_c/d 280- broadcom/commmon: add has_accumulators field on v3d_device_info 281- broadcom/qpu: add qpu_writes_rf0_implicitly helper 282- broadcom/qpu: add pack/unpack support for v71 283- broadcom/compiler: phys index depends on hw version 284- broadcom/compiler: don't favor/select accum registers for hw not supporting it 285- broadcom/vir: implement is_no_op_mov for v71 286- broadcom/compiler: update vir_to_qpu::set_src for v71 287- broadcom/qpu_schedule: add process_raddr_deps 288- broadcom/qpu: update disasm_raddr for v71 289- broadcom/qpu: return false on qpu_writes_accumulatorXX helpers for v71 290- broadcom/compiler: add support for varyings on nir to vir generation for v71 291- broadcom/compiler: payload_w is loaded on rf3 for v71 292- broadcom/qpu_schedule: update write deps for v71 293- broadcom/compiler: update register classes to not include accumulators on v71 294- broadcom/qpu: implement switch rules for fmin/fmax fadd/faddnf for v71 295- broadcom/compiler: update one TMUWT restriction for v71 296- broadcom/compiler: update ldunif/ldvary comment for v71 297- broadcom/compiler: update payload registers handling when computing live intervals 298- broadcom/qpu: new packing/conversion v71 instructions 299- v3dv/meson: add v71 hw generation 300- v3dv: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71 301- v3dv/cmd_buffer: emit TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1 for v71 302- v3dvx/cmd_buffer: emit CLEAR_RENDER_TARGETS for v71 303- v3dv/cmd_buffer: emit CLIPPER_XY_SCALING for v71 304- v3dv/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71 305- v3dv/cmd_buffer: just don't fill up early-z fields for CFG_BITS for v71 306- v3dv: default vertex attribute values are gen dependant 307- v3dv/pipeline: default vertex attributes values are not needed for v71 308- v3dv/pipeline: handle GL_SHADER_STATE_RECORD changed size on v71 309- v3dv: no specific separate_segments flag for V3D 7.1 310- v3dv: add support for TFU jobs in v71 311- v3d: add v71 hw generation 312- v3d: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71 313- v3d: TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1 314- v3d: emit CLEAR_RENDER_TARGETS for v71 315- v3d: just don't fill up early-z fields for CFG_BITS for v71 316- v3d: emit CLIPPER_XY_SCALING for v71 317- v3d: no specific separate_segments flag for V3D 7.1 318- v3d: default vertex attributes values are not needed for v71 319- v3d/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71 320- v3d: handle new texture state transfer functions in v71 321- v3d: handle new TEXTURE_SHADER_STATE v71 YCbCr fields 322- v3d: setup render pass color clears for any format bpp in v71 323- v3d: GFX-1461 does not affect V3D 7.x 324- v3d: don't convert floating point border colors in v71 325- v3d: handle Z clipping in v71 326- v3d: add support for TFU blit in v71 327- v3dv: implement depthBounds support for v71 328- doc/features: update after last v3d changes 329 330Alex Denes (1): 331 332- virgl: link VA driver with build-id 333 334Alexander Orzechowski (1): 335 336- radeonsi: Set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET for auxiliary contexts 337 338Alyssa Rosenzweig (431): 339 340- zink: Switch to register intrinsics 341- gallium/trace: Collect enums from multiple files 342- gallium,util: Move blend enums to util/ 343- gallium,util: Move util_blend_dst_alpha_to_one 344- util/blend: Add helpers for normalizing inverts 345- vulkan: Add helpers for blend enum translation 346- lvp: Use common blend/logicop translation 347- nir/lower_blend: Use util enums 348- panfrost: Convert to PIPE_BLEND enums internally 349- gallium: Remove pipe->compiler BLEND enum translation 350- compiler: Remove blend enums duplicating util 351- nir/legacy: Fix fneg(load_reg) case 352- nir/legacy: Fix handling of fsat(fabs) 353- ntt: Switch to new-style registers and modifiers 354- ir3: Convert to register intrinsics 355- nir: Add fence_{pbe,mem}_to_tex(_pixel)_agx intrinsics 356- nir: Devendor load_sample_mask 357- nir: Promote tess_coord_r600 to tess_coord_xy 358- nir: Add nir_lower_tess_coord_z pass 359- r600: Use nir_lower_tess_coord_xy 360- ir3: Use nir_lower_tess_coord_z 361- nir: Initialize workgroup_size in builder_init_simple_shader 362- v3dv: Rely on nir_builder setting workgroup size 363- radv: Rely on workgroup_size initialization 364- panfrost: Fix transform feedback on v9 365- r600/sfn: Remove nir_register unit tests 366- panfrost: Lower vertex_id for XFB 367- panfrost: Fix transform feedback on v9 harder 368- asahi: Augment fake drm_asahi_params_global 369- asahi: Use nir_builder_at more 370- asahi: Remove unused #define 371- asahi: Refactor PBE upload routine 372- asahi: Extract shader_initialize helper 373- asahi: Serialize NIR in memory 374- asahi: Identify background/EOT counts 375- asahi,agx: Set coherency bit for clustered targets 376- ail: Page-align layers for writable images 377- asahi: Mark writeable images as such 378- asahi: Reallocate to set the writeable image flag 379- asahi: Add agx_batch_track_image helper 380- asahi: Add texture/image indexing lowering pass 381- asahi: Upload at most the max texture state registers 382- asahi: Upload image descriptors 383- asahi: Make clear the non-sRGBness of EOT images 384- asahi: Don't restrict sampler views 385- asahi: Forbid 2D Linear with images 386- agx: Add try_coalesce_with helper 387- agx: Try to allocate phis compatibly with sources 388- agx: Try to allocate phi sources with phis 389- agx: Try to allocate phi sources with loop phis 390- agx: Vectorize 16-bit parallel copies 391- agx: Reduce un/packs with mem access lowering 392- agx: Fix bogus assert 393- asahi: Augment PBE descriptor for software access 394- asahi: Extend PBE packing for image support 395- asahi: Use nir_lower_robust_access 396- agx: Legalize image LODs to be 16-bit 397- agx: Lower image size to txs 398- agx: Generalize texture/PBE packing 399- agx: Add image write instruction 400- agx: Model texture bindless base 401- agx: Handle bindless properly for txs lowering 402- agx: Pack bindless textures 403- agx: Translate texture bindless handles 404- agx: Translate image_store from NIR 405- agx: Handle frag side effects without render targets 406- agx: Wait for outstanding stores before barriers 407- agx: Implement image barriers 408- agx: Handle early_fragment_tests 409- agx: Add interleave opcode 410- agx: Extract coords_for_buffer_texture helper 411- agx: Extract texture_descriptor_ptr_for_* helpers 412- agx: Lower image atomics 413- agx: Lower buffer images 414- asahi,agx: Fix txf sampler 415- agx: Add image_load opcode 416- agx: Extract texture write mask handling 417- agx: Implement image_load 418- agx: Emit global memory barriers for images 419- agx: Don't emit silly barriers 420- agx: Implement fence_*_to_tex_agx intrinsics 421- agx: Add simple image fencing pass 422- agx: Require tag writes with side effects 423- agx: Plumb in coverage mask 424- asahi: Extract sampler_view_for_surface 425- asahi: Introduce concept of spilled render targets 426- asahi: Add agx_tilebuffer_spills query 427- asahi: Do not support masking with spilled RTs 428- asahi: Ignore spilled render targets in EOT shaders 429- asahi: Ignore spilled render targets with partial renders 430- asahi: Extract some tilebuffer lowering code 431- asahi: Lower tilebuffer access for spilled RTs 432- asahi: Lower multisample image stores 433- asahi: Permit meta shaders to use preambles 434- asahi: Ignore spilled render targets for background load 435- asahi: Offset clear colour uniform by 4 436- asahi: Execute preambles for background programs 437- asahi: Advertise Z16_UNORM 438- ir2: Switch to nir_legacy 439- intel/fs: Don't read reg.base_offset 440- panfrost: Remove unused helpers 441- nir: Remove nir_lower_locals_to_regs 442- nir: Rename lower_locals_to_reg_intrinsics back 443- nir: Remove register arrays 444- asahi: Don't depend on glibc to decode 445- pan/bi: Remove leftover include 446- nir/trivialize: Handle more RaW hazards 447- panfrost: Disable blending for no-op logic ops 448- nir/lower_blend: Fix 32-bit logicops 449- nir/lower_blend: Optimize out PIPE_LOGICOP_NOOP 450- clang-format: Ignore original panfrost commit 451- nir/schedule: Assume no old-style registers 452- gallium/u_simple_shaders: Optimize out ffloors 453- gallium/u_transfer_helper: Remove dead forward decl 454- nir/loop_analyze: Drop unused inverse_comparison 455- nir/passthrough_gs: Drop unused array_size_for_prim 456- panfrost: Add missing static inline annotation 457- pan/decode: Drop unused debug function 458- pan/mdg: Add missing static inline annotation 459- panfrost: Drop unused decode_position for samples 460- panfrost: Only define pan_blitter_get_blend_shaders for midgard 461- panfrost: Add missing inline 462- panfrost: Gate overdraw_alpha on Bifrost+ 463- nir: Rename scoped_barrier -> barrier 464- nir: Remove lower_to_source_mods 465- nir: Remove lower_vec_to_movs 466- nir: Remove reg_intrinsics parameter to convert_from_ssa 467- nir: Remove register load/store builders 468- r600/sfn: Stop referencing legacy functionality 469- r600/sfn: Ignore instruction write masks 470- nouveau/codegen: Drop writemask check 471- vc4,broadcom/compiler: Drop write_mask handling 472- zink: Collapse is_ssa check 473- nir: Add {...} before case 474- nir/from_ssa: Drop legacy reg support 475- nir/schedule: Drop nir_schedule_dest_pressure 476- nir: Drop NIR reg create/destroy 477- nir: Remove nir_index_local_regs and callers 478- nir/schedule: Drop more nir_register handling 479- nir: Remove nir_foreach_register 480- nir: remove nir_{src,dest}_for_reg 481- ntt: Drop nir_register reference 482- nir/print: Assume SSA 483- nir/clone: Assume SSA 484- nir/serialize: Drop legacy NIR 485- nir/validate: Assume SSA 486- nir: Remove impl->{registers,reg_alloc} 487- nir: Remove nir_alu_dest::saturate 488- treewide: Drop is_ssa asserts 489- nir: Collapse some SSA checks 490- treewide: Remove more is_ssa asserts 491- nir: Remove reg-only dest manipulation 492- nir: Remove stale todo 493- nir/print: Drop legacy NIR 494- nir: Drop nir_alu_src::{negate,abs} 495- treewide: sed out more is_ssa 496- pan/mdg: Assume SSA 497- treewide: Drop some is_ssa if's 498- nir: Drop trivial reg handling 499- aco: Remove is_ssa check 500- intel: Collapse is_ssa checks 501- llvmpipe: Assume SSA 502- ir3: Collapse is_ssa checks 503- lima: Collapse is_ssa checks 504- radeonsi: Collapse SSA check 505- nir/gather_ssa_types: Collapse SSA checks 506- nir/worklist: Assume SSA 507- nir/range_analysis: Assume SSA 508- treewide: Collapse more SSA checks 509- nir/instr_set: Assume SSA 510- nir: Collapse more SSA checks 511- nir: Remove def_is_register 512- nir: Do not init dests 513- nir: Initialize source as a NULL SSA def 514- nir: Collapse more SSA checks 515- nir: Remove nir_{src,dest}::is_ssa 516- nir: Drop nir_register 517- nir/from_ssa: Remove pointless union 518- ir3: Drop write_mask handling 519- rogue: Stop reading write masks 520- etnaviv: Don't use alu->dest.write_mask 521- etnaviv: What if we just didn't have a compiler? 522- intel/vec4: Don't use legacy write mask 523- ntt: Evaluate write_mask check 524- nir: Remove nir_alu_dest::write_mask 525- nir: Remove nir_foreach_def 526- lima: Clean up after deleting asserts 527- nir: Remove no-op remove_def_cb 528- nir: Drop no-op all_srcs_are_ssa 529- nir: Simplify alu_instr_is_copy 530- nir: Add load_coefficients_agx intrinsic 531- agx: Implement nir_intrinsic_load_coefficients_agx 532- agx: Allow more varying slots 533- agx: Set lower_fisnormal 534- agx: Forcibly vectorize pointcoord coeffs 535- agx: Add interpolateAtOffset lowering pass 536- agx: Lower flat shading in NIR 537- asahi: Stub num_dies 538- asahi: Move a bunch of helpers to common 539- agx: Lower 8-bit ALU 540- agx: Handle 8-bit vecs 541- asahi,agx: Respect no16 even for I/O 542- agx: Don't lower load_local_invocation_index 543- agx/dce: Use the helper 544- agx: Fix atomics with no destination 545- agx: Fix shader info with sample mask writes 546- agx: Do not move bindless handles 547- agx: Put else instructions in the right block 548- agx: Use unconditional else instruction 549- agx: Optimize out pointless else instructions 550- agx: Fix length bit confusion 551- agx: Require an immediate for \`nest` 552- agx: Use compressed fadd/fmul encodings 553- agx: Optimize swaps of 2x16 channels 554- agx: Optimize logical_end removal 555- agx: Fix AGX_MESA_DEBUG=demand 556- agx: Maintain ctx->max_reg while assigning regs 557- agx: Allow 64-bit memory regs 558- agx: Fix accounting for phis 559- agx: Set phi sources in predecessors 560- agx: Stop setting registers after the shader 561- agx: Use agx_replace_src 562- agx: Assert invariant stated in the comment 563- agx: Don't use ssa_to_reg across blocks 564- agx: Don't reuse ssa_to_reg across blocks 565- agx: Remove unused allocation 566- agx: Stop setting forwarding bit 567- agx: Handle blocks with no predecessors 568- agx: Lower f2u8/f2i8 569- agx: Handle conversions to 8-bit 570- agx: Fix uadd_sat packing 571- agx: Fix 64-bit immediate moves 572- agx: Lower f2f16_rtz 573- agx: Handle f2f16_rtne like f2f16 574- agx: Handle <32-bit local memory access 575- agx: Do not allow creating vec8 576- asahi: Legalize compression before blitting 577- nir: Drop "SSA" from NIR language 578- agx: Stop passing nir_dest around 579- agx: Remove agx_nir_ssa_index 580- pan/mdg: Don't reference nir_dest 581- pan/bi: Don't reference nir_dest 582- asahi: Do not reference nir_dest 583- panfrost: Do not reference nir_dest 584- zink: Do not reference nir_dest 585- ir3: Do not reference nir_dest 586- dxil: Do not reference nir_dest 587- nir: Drop nir_dest_init 588- panfrost: Pack stride at CSO create time on v9 589- lvp,nir/lower_input_attachments: Use nir_trim_vector 590- broadcom/compiler: Use nir_trim_vector explicitly 591- nir: Assert that nir_ssa_for_src components matches 592- nir: Add nir_shader_intrinsics_pass 593- nir: Lower fquantize2f16 594- agx: Lower fquantize2f16 595- nir/lower_helper_writes: Consider bindless images 596- nir/passthrough_gs: Correctly set vertices_in 597- nir/passthrough_gs: Fix array size 598- nir/print: Print access qualifiers for intrinsics 599- nir/lower_gs_intrinsics: Remove end primitive for points 600- panfrost/ci: Disable T720 601- nir: Add load_sysval_agx intrinsic 602- agx: Fix extraneous bits with b2b32 603- agx: Use more barriers 604- asahi: Copy CSO stride 605- agx: Assert vertex_id, instance_id are VS-only 606- asahi: Keep drawoverhead from OOMing itself 607- agx: Don't blow up when lowering textures twice 608- agx/lower_vbo: Handle nonzero component 609- agx: Allow loop headers without later preds 610- agx: Handle b2i8 611- agx: Convert 8-bit comparisons 612- agx: Implement imul_high 613- asahi: Advertise OpenGL ES 3.1! 614- asahi/decode: Turn assert into error 615- asahi: Report local_size from compiler 616- asahi: Use local_size from compiler directly 617- asahi: Pass layer stride in pixels, not elements 618- agx: Clear sample count after lowering MSAA 619- agx: Clear image_array after lowering 620- asahi: Preserve atomic ops when rewriting image to bindless 621- agx: Use 16-bit reg for pixel_coord 622- asahi: Generalize query logic 623- asahi: Simplify occlusion query batch tracking 624- asahi: Refactor agx_get_query_result 625- asahi: Only touch batch->occlusion_queries for occlusion 626- asahi: Sync when beginning a query 627- asahi: Add non-occlusion query tracking 628- asahi: Add get_query_address helper 629- agx/fence_images: Use intrinsics_pass 630- agx: Do not fence write-only images 631- asahi: Add missing LOD source for agx_meta's txfs 632- agx: Do some texture lowering early 633- agx: Add helper returning if a descriptor crawl is needed 634- nir,asahi: Remove texture_base_agx 635- asahi: Move UBO lowering into GL driver 636- asahi: Add sysval tables for each shader stage 637- asahi: Split out per-stage sysvals 638- asahi: Collapse grid_info 639- asahi: Extract agx_upload_textures 640- asahi: Upload a single draw_uniforms per draw 641- asahi: Add real per-stage dirty flags 642- asahi: Extract sampler upload 643- asahi: Put unuploaded uniforms on the batch 644- asahi: Decouple sysval lowering from uniform assignment 645- asahi: Use finer dirty tracking for blend constant 646- asahi: Use proper dirty tracking for VBOs 647- asahi: Dirty track VBOs + blend const separately 648- asahi: Dirty the shader stage when the shader changes 649- asahi: Fix shader stage dirtying 650- treewide: Use nir_shader_intrinsic_pass sometimes 651- treewide: Also handle struct nir_builder form 652- nir/lower_shader_calls: Fix warning with clang 653- nir: Add nir_before/after_impl cursors 654- treewide: Use nir_before/after_impl in easy cases 655- treewide: Use nir_before/after_impl for more elaborate cases 656- radv: Use before/after_cf_list for entrypoints 657- ci: Disable known broken Bifrost Vulkan job 658- ci: Disable WHL jobs 659- nir/opt_if: Simplify if's with general conditions 660- asahi: Fixes for clang-warnings 661- agx: Fix jmp_exec_none encoding 662- agx/validate: Print to stderr 663- agx: Annotate opcodes with a scheduling class 664- agx: Add schedule-specialized get_sr variants 665- agx: Include schedule class in the opcode info 666- agx: Schedule for register pressure 667- agx: Lower pack_32_4x8_split 668- asahi: Force translucency for ignored render targets 669- agx: Remove logical_end instructions 670- agx: Lower pseudo-ops later 671- agx: Expand nest 672- agx: Lower nest later 673- agx: Split nest instruction into begin_cf + break 674- agx: Add break_if_*cmp instructions 675- agx: Add agx_first/last_instr helpers 676- agx: Use agx_first_instr 677- agx: Detect conditional breaks 678- agx: Omit push_exec at top level 679- agx: Omit while_icmp without continue 680- agx: Add helper to determine if a NIR loop uses continue 681- agx: Only use nest by 1 for loops w/o continue 682- agx: Add pseudo-instructions for icmp/fcmp 683- agx: Generate unfused comparison pseudo ops 684- agx: Fuse conditions into if's 685- agx: Fuse compares into selects 686- agx: Add unit test for if_cmp fusing 687- agx: Add unit test for cmp+sel fusing 688- asahi: Translate cube array dimension 689- ail: Force page-alignment for layered attachments 690- agx: Handle cube arrays when clamping arrays 691- agx: Lower coordinates for cube map array images 692- agx: Run opt_idiv_const after lowering texture 693- asahi: Forbid linear 1D Array images 694- asahi: Handle linear 1D Arrays 695- asahi: Conditionally expose cube arrays 696- gallium,mesa/st: Add PIPE_CONTEXT_NO_LOD_BIAS flag 697- asahi: Skip LOD bias lowering for GLES 698- nir: Add nir_function_instructions_pass helper 699- nir: Add NIR_OP_IS_DERIVATIVE property 700- nir: Hoist nir_op_is_derivative 701- nir/opt_preamble: Use nir_op_is_derivative 702- nir/opt_gcm: Use nir_op_is_derivative more 703- nir/gather_info: Use nir_op_is_derivative 704- nir/opt_sink: Sink load_constant_agx 705- nir/opt_sink: Sink load_local_pixel_agx 706- nir/opt_sink: Sink frag coord instructions 707- nir/opt_sink: Do not move derivatives 708- nir/opt_sink: Move ALU with constant sources 709- nir/opt_sink: Also consider load_preamble as const 710- agx: Enable sinking ALU 711- treewide: Drop nir_ssa_for_src users 712- treewide: Remove remaining nir_ssa_for_src 713- nir: Remove nir_ssa_for_src 714- asahi: Clamp index buffer extent to what's read 715- agx: Align the reg file for 256-bit vectors 716- agx: Hoist sample_mask/zs_emit 717- agx: Set PIPE_SHADER_CAP_CONT_SUPPORTED 718- agx: Augment if/else/while_cmp with a target 719- agx: Add jumps to block ends 720- agx: Add agx_prev_block helper 721- agx: Insert jmp_exec_none instructions 722- nir: Add layer_id_written_agx sysval 723- nir: Support arrays in block_image_store_agx 724- agx/nir_lower_texture: Allow disabling layer clamping 725- agx: Pack block image store dim correctly 726- agx: Handle layered block image stores 727- agx: Add pass to lower layer ID writes 728- asahi: Add helper to get layer id in internal program 729- asahi,agx: Select layered rendering outputs 730- agx: Support packed layered rendering writes 731- agx/tilebuffer: Support layered layouts 732- agx/lower_tilebuffer: Support spilled layered RTs 733- asahi: Use layered layouts 734- asahi: Expose VS_LAYER_VIEWPORT behind a flag 735- asahi: Account for layering for attachment views 736- asahi: Assume LAYER is flat-shaded 737- asahi: Add pass to predicate layer ID reads 738- asahi: Predicate layer ID reads 739- asahi: Write to cubes/etc attachments as 2D array 740- asahi: Use a 2D Array texture for array render targets 741- asahi: Generate layered EOT programs 742- asahi: Handle layered background programs 743- lima/pp: Do not use union undefined behaviour 744- nir: Add trivial nir_src_* getters 745- nir: Use set_parent_instr internally 746- nir: Use getters for nir_src::parent_* 747- nir: Assert the nir_src union is used safely 748- nir: Use a tagged pointer for nir_src parents 749- nir: Add ACCESS_CAN_SPECULATE 750- ir3: Set CAN_SPECULATE before opt_preamble 751- ir3: Model cost of phi nodes for opt_preamble 752- nir/opt_preamble: Walk cf_list manually 753- nir/opt_preamble: Preserve IR when replacing phis 754- nir/opt_preamble: Unify foreach_use logic 755- nir/opt_preamble: Move phis for movable if's 756- nir/opt_preamble: Respect ACCESS_CAN_SPECULATE 757- freedreno/ci: Minetest 758- r600/sfn: Handle load_global_constant 759- nir/opt_phi_precision: Work with libraries 760- nir/legalize_16bit_sampler_srcs: Use instr_pass 761- nir/print: Handle KERNEL 762- nir/lower_io: Use load_global_constant for OpenCL 763- nir/opt_algebraic: Reduce int64 764- nir/opt_algebraic: Optimize LLVM booleans 765- nir/trivialize_registers: Handle obscure load hazard 766- hasvk: Support builiding on non-Intel 767- crocus: Support building on non-Intel 768- meson: Add vulkan-drivers=all option 769- meson: Add gallium-drivers=all option 770- agx: Fix fragment side effects scheduling 771 772Amber (7): 773 774- ir3: make wave_granularity configurable 775- turnip: Add support for devices not supporting double thread size. 776- turnip: make sampler_minmax support configurable. 777- freedreno, turnip: set correct reg_size_vec4 for a6xx_gen1_low 778- ir3: handle non-uniform case for atomic image/ssbo intrinsics 779- freedreno: Add support for devices not supporting double thread size. 780- turnip: Add debug option to allow non-conforming features. 781 782Andrew Randrianasulu (1): 783 784- nv50/ir: Remove few nvc0 specific defines from nv50-specific header. 785 786Antonio Gomes (9): 787 788- rusticl/kernel: Removing unnecessary clone in kernel launch 789- rusticl/kernel: Add CsoWrapper 790- rusticl/compiler: Add NirPrintfInfo 791- rusticl: Move Cso to Program 792- rusticl/compiler: Remove unnecessary functions 793- rusticl: Move NirKernelBuild to ProgramDevBuild 794- rusticl/program: New helper functions to NirKernelBuild 795- rusticl/core: Delete KernelDevState and KernelDevStateInner 796- rusticl/core: Make convert_spirv_to_nir output pair (KernelInfo, NirShader) 797 798Asahi Lina (29): 799 800- docs/tgsi: Specify that depth texture fetches are replicated 801- asahi: Add synctvb debug flag 802- asahi: Add smalltile debug option 803- asahi: Add nomsaa debug flag 804- asahi: decode: Add a params argument to pass through 805- asahi: Add extra CDM header block for G14X 806- asahi: wrap: Handle freeing shmems 807- asahi: decode: Refactor to always copy GPU mem to local buffers 808- asahi: decode: Add a function to construct decode_params from a chip_id 809- asahi: Add a shared library interface for decode 810- asahi: Add a noshadow debug flag 811- asahi: Do not overallocate BOs by more than 2x 812- asahi: Fix race in BO stats accounting 813- asahi: Always use resource size, not BO size 814- asahi: Print info about shadowed resources 815- asahi: Impose limits on resource shadowing 816- asahi: Force linear for SHARED buffers with no/implicit modifier 817- asahi: Enable explicit coherency for G14D (multi-die) 818- asahi: Handle non-written RTs correctly 819- asahi: Fix incorrect BO bitmap reallocations 820- asahi: Allocate staging resources as staging 821- asahi: cmdbuf: Identify call/ret bits 822- asahi: decode: Implement VDM call/ret 823- asahi: decode: Do not assert on buffer overruns 824- asahi: Fix VDM pipeline field width 825- asahi: Add scaffolding for supporting driconf options 826- asahi: Add and support the no_fp16 driconf flag 827- driconf: Disable fp16 for browsers 828- asahi: Allow no16 flag for disk cache 829 830Bas Nieuwenhuizen (16): 831 832- aco: fix nir_op_vec8/16 with 16-bit elements. 833- aco: Fix some constant patterns in 16-bit vec4 construction with s_pack. 834- nir: Fix 16-component nir_replicate. 835- radv: Expose VK_EXT_external_memory_acquire_unmodified. 836- util/perf: Add gpuvis integration. 837- egl,venus,vulkan,turnip,freedreno: Update CPU trace init to init more than perfetto. 838- vulkan: Add CPU tracing for vkWaitForFences. 839- docs: Add documentation for gpuvis. 840- vulkan: Add trace points for more Vulkan waiting functions. 841- radv: Use a double jump to limit nops in DGC for dynamic sequence count. 842- nir: Add AMD cooperative matrix intrinsics. 843- aco: Add WMMA instructions. 844- aco: Make RA understand WMMA instructions. 845- radv: Don't transparently use wave32 with cooperative matrices. 846- radv: Add cooperative matrix lowering. 847- radv: Expose VK_KHR_cooperative_matrix. 848 849Benjamin Cheng (10): 850 851- radv/video: use app provided hevc scaling list order 852- radv/video: copy from correct H264 scaling lists 853- anv/video: copy from correct H264 scaling lists 854- vulkan/video: add helper to derive H264 scaling lists 855- radv/video: use vk_video_derive_h264_scaling_list 856- anv/video: use vk_video_derive_h264_scaling_list 857- util/vl: extract gallium vl scanning data to shared code 858- radv/video: send h264 scaling list in raster order 859- anv/video: send h264 scaling list in raster order 860- radv/video: find SPS with pps_seq_parameter_set_id 861 862Benjamin Lee (1): 863 864- nvk: Fix segfault when opening DRI device file returns error 865 866Biswapriyo Nath (1): 867 868- radv/video: Match function definitions to declarations 869 870Boris Brezillon (1): 871 872- panfrost: Flag the right shader when updating images 873 874Boyuan Zhang (3): 875 876- virgl: Add vp9 picture desc 877- virgl: Implement vp9 hardware decode 878- radeonsi/vcn: disable tmz ctx buffer for VCN_2_2_0 879 880Caio Oliveira (134): 881 882- nir: Use instructions_pass() for nir_fixup_deref_modes() 883- meson: Ensure that LLVMSPIRVLib is not required for Clover 884- nir: Let nir_fixup_deref_modes() fix deref_casts when possible 885- nir: Add nir_opt_reuse_constants() 886- radv: Use nir_opt_reuse_constants() 887- compiler/types: Use ralloc for the key in array_types 888- compiler/types: Use smaller keys for array_types table 889- compiler/types: Extract get_explicit_matrix_instance() function 890- compiler/types: Use smaller keys for explicit_matrix_types table 891- anv/tests: Refactor state_pool_test_helper to not use macros for parametrization 892- anv/tests: Link a single anv_tests binary using gtest 893- anv/tests: Propagate failures to gtest 894- hasvk/tests: Refactor state_pool_test_helper to not use macros for parametrization 895- hasvk/tests: Link a single hasvk_tests binary using gtest 896- hasvk/tests: Propagate failures to gtest 897- util: Add convenience macros for linear allocator 898- compiler/types: Use right hash for function types 899- compiler/types: Don't duplicate empty string 900- compiler/types: Constify a couple of pointers in glsl_type 901- compiler/types: Remove unused GLSL_TYPE_FUNCTION and related functions 902- compiler/types: Move GLSL specific builtin structs into glsl/ 903- glsl: Add missing glsl_types initialization to test_optpass 904- glsl: Don't create struct type builtins 905- compiler/types: Add extra level of macro to builtin_macros 906- compiler/types: Use designated initializer syntax to specify builtins 907- compiler/types: Move local cache details to implementation file 908- compiler/types: Add a mem_ctx for the glsl_type_cache 909- compiler/types: Use type cache mem_ctx for hash tables 910- compiler/types: Don't store a mem_ctx per type 911- compiler/types: Simplify clearing the glsl_type_cache 912- compiler/types: Move static asserts about glsl_type to a central place 913- compiler/types: Store builtin types directly as data 914- compiler/types: Use a linear (arena) allocator for glsl_types 915- compiler/types: Make struct glsl_type visible to C code 916- compiler/types: Add workaround to use builtin_type_macros.h in C 917- compiler/types: Move builtin type initialization to C 918- glsl: Annotate _mesa_glsl_error() with PRINTFLIKE 919- compiler/types: Fix array name dimension flipping for unsized arrays 920- compiler/types: Use Python to generate code for builtin types 921- compiler/types: Use glsl_get_type_name() to access the type name 922- compiler/types: Change glsl_type::name to be an uintptr_t 923- compiler/types: Use a string table for builtin type names 924- intel/compiler/xe2: Account for reg_unit() in TCS intrinsics 925- intel/compiler/xe2: Account for reg_unit() in TES intrinsics 926- intel/fs/xe2+: Update BS payload setup for Xe2 reg size. 927- intel/fs/xe2+: Update TASK/MESH payload setup for Xe2 reg size. 928- compiler: Use a meson dependency for libcompiler 929- meson: Remove unnecessary inc_compiler mentions 930- rusticl: Ensure NIR generated headers will be available 931- clover: Hide SPIR-V related code behind HAVE_CLOVER_SPIRV 932- clover: Only compile/depend libclspirv and libclnir when using SPIR-V support 933- compiler: Only enable mesaclc helper if we have OpenCL SPIR-V support 934- intel/compiler: Don't allocate memory for SIMD select error handling 935- microsoft/compiler: Fix printf formatting string issues 936- util: Add more PRINTFLIKE and MALLOCLIKE annotations 937- util: Remove ralloc_parent from linear_header 938- util: Use linear parent to (r)allocated extra nodes 939- util: Remove size from linear_parent creation 940- util: Make DECLARE_LINEAR_ALLOC_* macros assume no destructors 941- util: Use an opaque type for linear context 942- util: Remove usages of linear_realloc() 943- util: Remove linear_realloc() 944- util: Remove size information from child allocations 945- util: Remove per-buffer header in linear alloc for release mode 946- util: Add a few basic tests for linear_alloc 947- util: Fix bookkeeping of linear node sizes 948- intel/compiler: Don't store stage name and abbrev 949- intel/compiler/xe2: URB fence uses LSC now 950- intel/compiler/xe2: Fix URB writes in TCS 951- intel/compiler/xe2: Update TCS ICP handle code to support SIMD16 952- compiler/types: Add support for Cooperative Matrix types 953- nir: Add new intrinsics for Cooperative Matrix 954- nir: Handle cooperative matrix in various passes 955- spirv: Expose some memory related functions in vtn_private.h 956- spirv: Let vtn_ssa_value hold references to variables 957- spirv: Implement SPV_KHR_cooperative_matrix 958- compiler/types: Remove private related declarations 959- compiler/types: Remove use of new/delete 960- compiler/types: Remove use of references 961- compiler/types: Remove use of auto 962- compiler/types: Use C compatible cast syntax 963- compiler/types: Spell struct and enum in type names 964- compiler/types: Add void parameter to ensure these are valid C prototypes 965- intel/fs: Tweak default case of fs_inst::size_read() 966- compiler/types: Move the C++ inline functions in glsl_type out of the struct body 967- compiler/types: Move C declarations into glsl_types.h 968- compiler/types: Flip wrapping of base_type checks 969- compiler/types: Flip wrapping of various type identification checks 970- compiler/types: Flip wrapping of convenience accessors for vector types 971- compiler/types: Flip wrapping of basic "get type" functions 972- rusticl: Add Rust bindings for inline glsl_types functions 973- util: Add size to ralloc_header in debug mode 974- util: Add a canary to identify gc_ctx in debug mode 975- util: Add function print information about a ralloc tree 976- util: Avoid waste space when linear alloc'ing large sizes 977- spirv: Expose stage enum conversion in vtn_private.h 978- spirv: Change spirv2nir to use the shorter shader name abbreviations 979- spirv: List entry-points in spirv2nir when unsure what to use 980- spirv: Let spirv2nir find out the shader to use 981- intel/compiler: Don't emit calls to validate() in release build 982- compiler/types: Flip wrapping of "type contains?" predicate functions 983- compiler/types: Flip wrapping of array related functions 984- compiler/types: Flip wrapping of cmat related functions 985- compiler/types: Flip wrapping of CL related functions 986- compiler/types: Flip wrapping of size related functions 987- compiler/types: Flip wrapping of struct related functions 988- compiler/types: Flip wrapping of interface related functions 989- compiler/types: Flip wrapping of layout related functions 990- compiler/types: Flip wrapping of record_compare 991- compiler/types: Flip wrapping of get_instance() 992- compiler/types: Flip wrapping of texture/sampler/image get instance functions 993- compiler/types: Flip wrapping of various get instance functions 994- compiler/types: Flip wrapping of get row/column type helpers 995- compiler/types: Flip wrapping of remaining non-trivial type getters 996- compiler/types: Flip wrapping of remaining small data getters 997- compiler/types: Flip wrapping of numeric type conversion functions 998- compiler/types: Move remaining code from nir_types to glsl_types 999- rusticl: Add bindings for glsl_vector_type() 1000- compiler/types: Add more glsl_contains_*() functions and use them in C++ 1001- compiler/types: Add glsl_get_mul_type() and use it in C++ 1002- compiler/types: Add glsl_type_compare_no_precision() and use it in C++ 1003- compiler/types: Add glsl_type_uniform_locations() and use it in C++ 1004- compiler/types: Add glsl_get_std430_array_stride() and use it in C++ 1005- compiler/types: Add glsl_get_explicit_*() functions and use them in C++ 1006- compiler/types: Implement glsl_type::field_type() in terms of existing functions 1007- compiler/types: Add glsl_simple_explicit_type() and simplify glsl_simple_type() 1008- compiler/types: Add remaining type extraction functions and use them in C++ 1009- compiler/types: Use C instead of C++ constants for builtin types 1010- compiler/types: Remove usages of C++ members in glsl_types.cpp 1011- compiler/types: Annotate extern "C" only once in glsl_types.cpp 1012- compiler/types: Rename glsl_types.cpp to glsl_types.c 1013- compiler/types: Remove warnings about potential fallthrough 1014- compiler/types: Move comments and reorganize declarations 1015- anv: Fix leak when compiling internal kernels 1016 1017Carsten Haitzler (2): 1018 1019- kmsro: Add hdlcd DPU 1020- panfrost: Add GPU variant of G57 to the set of known ids 1021 1022Charles Giessen (1): 1023 1024- panvk: Use 1.0 in ICD Manifest json 1025 1026Charmaine Lee (8): 1027 1028- svga: set clear_texture to NULL for vgpu9 1029- svga: fix stride used in vertex declaration 1030- svga: fix persistent mapped surface update to constant buffer 1031- svga: restrict use of rawbuf for constant buffer access to GL43 device 1032- svga: fix immediates used in rawbuf for constant buffer 1033- svga: use srv raw buffer for accessing readonly shader buffer 1034- svga: sync resource content from backing resource before image upload 1035- svga: ignore sampler view resource if not used by shaders 1036 1037Chia-I Wu (38): 1038 1039- radv: fix separate depth/stencil layouts in fb state 1040- radv: fix separate depth/stencil layouts in resolve meta 1041- radv: refactor depth clear in clear meta 1042- radv: fix separate depth/stencil layouts in clear meta 1043- amd/ci: update radv-stoney-aco-fails.txt for depth/stencil clear 1044- radv: disable tc-compat htile for layered images on gfx8 1045- amd/ci: update radv-stoney-aco-fails.txt for depth/stencil resolve 1046- winsys/amdgpu: fix a race between import and destroy 1047- ac/surface: limit RADEON_SURF_NO_TEXTURE to color surfaces 1048- winsys/radeon: fix a race between bo import and destroy 1049- vulkan/runtime: add a helper for ETC2 emulation 1050- radv: use vk_tecompress_etc2 from the runtime 1051- vulkan/runtime: fix image type check for ETC2 emulation 1052- vulkan/runtime: fix a harmless typo for ETC2 emulation 1053- vulkan/runtime, radv: remove 1D support from ETC2 emulation 1054- radv: add radv_is_format_emulated 1055- radv: simplify view format override for emulated formats 1056- radv: hard code format features for emulated formats 1057- mesa: make astc_decoder.glsl vk-compatible 1058- radv, drirc: rename radv_require_{etc2,astc} 1059- anv: remove unused field from anv_image_view 1060- anv: add anv_image_view_{init,finish} 1061- anv: support image views with surface state stream 1062- anv: add anv_push_descriptor_set_{init,finish} 1063- anv: support alternative push descriptor sets 1064- anv: add anv_descriptor_set_write 1065- anv: add anv_cmd_buffer_{save,restore}_state 1066- anv: add anv_is_format_emulated 1067- anv: add a hidden plane for emulated formats 1068- anv: decompress on upload for emulated formats 1069- anv: fix up image views for emulated formats 1070- anv: fix up blit src for emulated formats 1071- anv: advertise emulated formats 1072- anv: add support for vk_require_astc driconf 1073- util: improve BITFIELD_MASK and BITFIELD64_MASK on clang 1074- anv: prep for gen9 astc workaround 1075- anv: add gen9 astc workaround 1076- radv: fix image view extent override for astc 1077 1078Chris Spencer (9): 1079 1080- radv: initialize result when pipeline cache creation fails 1081- anv/android: Fix importing hardware buffers with planar formats 1082- anv/android: Add support for AHARDWAREBUFFER_FORMAT_YV12 1083- anv: Advertise Vulkan 1.3 on Android 13 1084- anv: Don't reject Android image format if external props not supplied 1085- android: Add explanatory comment to u_gralloc 1086- anv/android: Enable shared presentable image support 1087- anv/video: use correct enum value for max level IDC 1088- radv/video: use correct enum value for max level IDC 1089 1090Christian Gmeiner (41): 1091 1092- nir/print: print instr pass_flags 1093- etnaviv: move nir texture lowerings into one pass 1094- nir: add enta specific intrinsic used for txs lowering 1095- etnaviv: nir: support intrinsic used for txs lowering 1096- etnaviv: nir: lower nir_texop_txs 1097- ci/etnaviv: update ci expectations 1098- etnaviv: make use of BITFIELD_BIT(..) macro 1099- etnaviv: name the enum used for pass_flags 1100- etnaviv: add is_dead_instruction(..) helper 1101- etnaviv: extend etna_pass_flags with source modifiers 1102- etnaviv: do not clear all pass_flags before RA 1103- etnaviv: nir: look at parent instr in lower_alu(..) 1104- etnaviv: nir: add etna_nir_lower_to_source_mods(..) 1105- etnaviv: nir: switch to etna_nir_lower_to_source_mods(..) 1106- etnaviv: nir: convert to new-style NIR registers 1107- freedreno/regs: remove double assignment of self.current_domain 1108- freedreno/regs: remove not used variable 1109- freedreno/regs: remove dead code 1110- freedreno/regs: python does not need ';' 1111- etnaviv: switch to log2f(..) 1112- etnaviv: switch to U_FIXED(..) macro 1113- etnaviv: switch to S_FIXED(..) macro 1114- etnaviv: fix null pointer dereference 1115- etnaviv: switch to float_to_ubyte(..) 1116- ci/etnaviv: update ci expectation 1117- etnaviv: unbreak cmdline compiler 1118- agx/lower_address: Use intrinsics_pass 1119- agx/lower_address: Remove not used has_offset 1120- isaspec: python does not need ';' 1121- docs: Move isaspec out of drivers/freedreno 1122- isaspec: Add support for templates 1123- isaspec: encode: Correct used regex 1124- isaspec: Add method to get all instrustions 1125- isaspec: Add support for custom meta information 1126- isaspec: Add BitSetEnumValue object 1127- spirv: Don't use libclc for rotate 1128- docs: update etnaviv extensions 1129- etnaviv: drm: Be able to mark end of context init 1130- etnaviv: Skip 'empty' cmd streams 1131- ci: Bump PyYAML to 6.0.1 1132- etnaviv: Don't leak disk_cache 1133 1134Collabora's Gfx CI Team (2): 1135 1136- Uprev Piglit to ed58dfbd12be34fa3dab97a7a2987b890e0637f1 1137- Uprev Piglit to f7db20b03de6896d013826c0a731bc4417c1a5a0 1138 1139Cong Liu (2): 1140 1141- r300: Fix out-of-bounds access in ntr_emit_store_output() 1142- virgl:Fix ITEM_CPY macro pointer copy bug 1143 1144Connor Abbott (83): 1145 1146- afuc: Rework and significantly expand README.rst 1147- tu: Fix vk2tu_*_stage flag type 1148- tu: Fix and simplify execution dependency handling 1149- tu, freedreno/a6xx: Remove has_ccu_flush_bug 1150- ir3: Handle GS stream "mixing" with non-point output primitives 1151- tu: Disable transformFeedbackPreservesProvokingVertex 1152- isaspec: Add "displayname" for altering {NAME} when decoding 1153- isaspec: Add support for "absolute" branches 1154- isaspec: Add support for function and entrypoint labels 1155- isaspec: Add "custom" field type 1156- isaspec: Add callback after decoding an instruction 1157- isaspec: Rename isa_decode() to isa_disasm() 1158- isaspec: Add initial decoding support 1159- afuc: Fix xmov lexer typo 1160- afuc: Convert to isaspec 1161- afuc: Add setbit/clrbit 1162- afuc: Fix writing $00 1163- freedreno/afuc: Initial a7xx support 1164- ir3: Parse (eq) flag 1165- ir3, freedreno, tu: Plumb through SP_FS_PREFETCH_CNTL::ENDOFQUAD 1166- tu: Add missing last_baryf statistic 1167- freedreno, tu, ir3: Add last_helper statistic 1168- ir3: Gather pixlod status earlier 1169- ir3: Implement helper invocation optimization 1170- vk/graphic_state, tu: Use dynamic blend count from subpass 1171- freedreno/a7xx: Add CP_RESET_CONTEXT_STATE 1172- vk/graphics_state: Fix copying MS locations pipeline state 1173- tu: Remove MSAA draw state 1174- tu: Merge SAMPLE_LOCATIONS and SAMPLE_LOCATIONS_ENABLE draw states 1175- tu: Merge PC_RASTER_CNTL into RAST draw state 1176- tu: Stop reusing base Vulkan dynamic state enums 1177- tu: Merge depth/stencil draw states 1178- tu: Rename PrimID-related registers 1179- tu, freedreno/a6xx: Don't use VS for PrimID passthru state 1180- tu: Pull entangled shader state into program config 1181- ir3: Add ir3_find_input_loc() helper 1182- tu: Split up tu6_emit_vpc() 1183- freedreno, ir3, tu: Constify various uses of ir3_shader_variant 1184- ir3: Add helper to determine when variant exceeds safe constlen 1185- tu: Split program draw state into per-shader states 1186- tu: Fix per-view viewport state propagation 1187- tu: Fix tu6_emit_*_fdm size call 1188- tu: Fix assert in FDM state emission 1189- tu: Actually emit patchpoint for viewports with FDM 1190- nir/lower_subgroups: Don't do multiple lowerings at once 1191- nir/spirv: Add inverse_ballot intrinsic 1192- amd: Use inverse ballot intrinsic if available 1193- tu: Create singleton "empty" shaders 1194- tu: Start tracking shaders independently of pipeline 1195- tu: Move FS-specific pipeline information to the shader 1196- tu: Use shader directly for VS/TCS output size and patch size 1197- tu: Rewrite tessellation modes handling 1198- tu: Rework passing shared consts 1199- tu: Decouple program state from the pipeline 1200- tu: Use pipeline feedback loop flag indirectly 1201- tu: Rewrite remaining pipeline LRZ handling 1202- tu: Don't reference pipeline for some draw states 1203- tu: Make compute dispatch use the shader 1204- tu: Don't use pipeline for dynamic draw states 1205- tu: Don't use pipeline for bandwidth validity 1206- tu: Don't use pipeline for per_view_viewport 1207- tu: Don't use pipeline for active stages 1208- tu: Remove pipeline from state 1209- zink: Rework color clamping and conversion 1210- freedreno/fdl: Use A8_UNORM HW format for sampling 1211- tu: Support clearing A8_UNORM 1212- freedreno/fdl: Support PIPE_FORMAT_R5G5B5A1_UNORM on a6xx 1213- tu/clear_blit: Fix staging image view layer count 1214- tu/clear_blit: Allow VK_REMAINING_ARRAY_LAYERS as layerCount 1215- tu: Allow VK_WHOLE_SIZE in tu_CmdBindVertexBuffers2EXT pSizes 1216- tu: Implement vkCmdBindIndexBuffer2KHR 1217- tu: Implement vkGetImageSubresourceLayout2KHR and vkGetDeviceImageSubresourceLayoutKHR 1218- tu: Implement vkGetRenderingAreaGranularityKHR 1219- tu: Use new buffer usage flags 1220- tu: Support VkPipelineCreateFlags2CreateInfoKHR 1221- tu: Check for DEVICE_LOST in vkGetEventStatus() 1222- tu: Add maintenance5 properties 1223- freedreno/ci: Skip dEQP-VK.info.device_extensions 1224- tu: Expose VK_KHR_maintenance5 1225- freedreno/ci: Remove minetest trace 1226- v3d/ci: Remove minetest trace 1227- ir3/ra: Don't swap killed sources for early-clobber destination 1228- tu: Fix re-emitting VS param state after it is re-enabled 1229 1230Corentin Noël (16): 1231 1232- ci: Add locked flag to bindgen-cli installation 1233- virgl: Do not expose EXT_texture_mirror_clamp when using a GLES host 1234- ci: disable Collabora's LAVA lab for maintenance 1235- llvmpipe: make sure to initialize the lp_setup_context slots with the default values 1236- virgl: Cover all the formats defined in the virgl definition 1237- mesa: Ensure that the baselevel will never exceed the maximal supported number 1238- ci: Uprev virglrenderer 1239- freedreno/drm/virtio: Use MESA_TRACE_SCOPE instead of _BEGIN/_END 1240- tu: Use MESA_TRACE_SCOPE instead of _BEGIN/_END 1241- aux/tc: Use MESA_TRACE_SCOPE instead of _BEGIN/_END 1242- venus: Change the only occurrence of VN_TRACE_BEGIN/END to VN_TRACE_SCOPE 1243- util: Avoid the use of MESA_TRACE_BEGIN/END 1244- util/perf: Remove the tracing categories 1245- util: Remove MESA_TRACE_BEGIN/END 1246- mesa/bufferobj: ensure that very large width+offset are always rejected 1247- frontends/va: Remove wrong use of ProfileToPipe 1248 1249Daniel Schürmann (9): 1250 1251- nir/opt_move: fix handling of if-condition 1252- aco: append p_logical_end after monolithic RT shaders 1253- aco/insert_exec_mask: set Exact mode after p_discard_if when necessary 1254- aco: don't optimize cross-lane instructions across p_wqm 1255- aco: make p_wqm a marker instruction without Operands/Definitions 1256- aco: don't insert a copy when emitting p_wqm 1257- aco: insert a single p_end_wqm after the last derivative calculation 1258- aco/insert_exec_mask: Simplify WQM handling (1/2) 1259- aco/insert_exec_mask: Simplify WQM handling (2/2) 1260 1261Daniel Stone (23): 1262 1263- dri: Support 1555/4444 formats 1264- egl/dri2: Don't look up image extension twice 1265- egl/wayland: Always initialise fd_display_gpu 1266- egl/wayland: Add image loader extension for swrast 1267- egl/wayland: Never use DRI2_LOADER extension 1268- egl/wayland: Assume modern DRI interface versions 1269- egl/drm: Use IMAGE_DRIVER instead of DRI2_LOADER 1270- egl/drm: Assume modern DRI interface versions 1271- ci: Disable nouveau CI 1272- panfrost/vk: Use correct sampler dimensions for MSAA 1273- ci: Declare stages before jobs 1274- ci/radeonsi: Add new flake 1275- ci/d3d12: Add new flake 1276- ci/intel: Add new skqp flake 1277- ci/zink: Add new zink-lvp flakes 1278- ci/radeonsi: Skip more really slow tests 1279- ci/zink: Add another conversion fail on a618 1280- ci: Move farm-disable rules before anything else 1281- ci: Always set user container jobs to manual 1282- ci: Use container rules for containers 1283- ci: Only look at file changes for MRs 1284- ci: Fix pre-merge pipelines with no code changes 1285- ci: Try really hard to print final result string 1286 1287Daniel van Vugt (1): 1288 1289- glx: Increment dpy->request before issuing an error that had no request 1290 1291Danylo Piliaiev (71): 1292 1293- freedreno/cffdec: Decode CP_DRAW_AUTO 1294- freedreno, turnip: Clarify some RB_CCU_CNTL fields 1295- freedreno,turnip: Make number of VSC pipes configurable 1296- freedreno,turnip: Make CS shared memory size configurable 1297- freedreno,turnip: Make VS input attr/binding count configurable 1298- freedreno: Add A605, A608, A610, A612 GPUs definition 1299- turnip: Make multiview support configurable per generation 1300- ir3: Make FS tex prefetch optimization optional 1301- ir3: Use NIR info to enable per sample shading 1302- freedreno/regs: Rename SP_FS_CTRL_REG0.DIFF_FINE into LODPIXMASK 1303- ir3: Fix FS quad ops returning wrong values from helper invocations 1304- tu,freedreno: Forbid blit event for R8G8_SRGB due to gpu faults 1305- radv: fix unused non-xfb shader outputs not being removed 1306- vulkan/nir: Add common helper to check if output is XFB 1307- radv: Use common nir_vk_is_not_xfb_output 1308- turnip: Use common nir_vk_is_not_xfb_output 1309- freedreno/regs: Define unknown SP_FS_PREFETCH_CNTL fields 1310- freedreno/registers: Refactor gen_header.py to allow more options 1311- freedreno/registers: Generate python files with reg offsets 1312- freedreno: Add a list of raw magic regs 1313- freedreno: Fully define a730 and a740 device properties 1314- ir3/tests: Use fd_dev_info to infer GPU generation 1315- freedreno/computerator: Fix remaining issues with A7XX 1316- isaspec: Make possible to obtain gpu_id in <expr> blocks 1317- ir3/a7xx: cat5 mode1 has swapped tex/samp ids 1318- ir3/a7xx: Don't multiply global mem instruction's offset by 4 1319- ir3/a7xx: insert lock/unlock at the end of every compute shader 1320- ir3/a7xx: Add ccinv instruction 1321- ir3/a7xx: Use ccinv for data synchronization 1322- ir3/a7xx: Disable shared consts for a7xx 1323- tu/common: Generalize TU_GENX macro 1324- tu: Basic a7xx support 1325- freedreno/fdl: Set LOSSLESSCOMPEN for image when ubwc is enabled on a7xx 1326- tu/a7xx: Fix geometry shaders 1327- tu/a7xx: Fix tesselation shaders 1328- tu/a7xx: Fix multiview 1329- tu/a7xx: Fix flat shading 1330- tu/a7xx: Fix occlusion query 1331- tu/a7xx: Fix 3d blits after multiview usage 1332- tu/a7xx: Fix CmdDrawIndirectByteCountEXT 1333- tu/a7xx: Disable LRZ 1334- ir3/lower_tex_prefetch: Fix crash with lowered load_barycentric_at_offset 1335- tu: Exclude SP_UNKNOWN_AE73 from reg stomping 1336- tu: Call tu_cs_dbg_stomp_regs with appropriate GPU gen 1337- freedreno/replay: Add limited support for KGSL 1338- freedreno/rddecompiler: Update to handle a7xx 1339- freedreno/replay: Add "print" instr to ir3 asm to be used in replay 1340- freedreno/replay: Add "gpu_print" function for command streams 1341- tu/perfetto: Remove now unnecessary tu_perfetto_util 1342- tu/perfetto: Allow gpu time to be passed into tu_perfetto_submit 1343- tu/kgsl: Fix memory leak of tmp allocations during submissions 1344- tu/kgsl: Support u_trace and perfetto 1345- tu/a7xx: Correctly record timestamps for u_trace 1346- tu/virtio: Fix incorrect call to tu_perfetto_submit 1347- ci: Compile Turnip's virtio kmd in debian-arm64 1348- freedreno/registers: Refine a7xx push consts registers 1349- ir3,tu: Refactor push consts info plumbing 1350- freedreno: Make possible to specify A7XX feature flags 1351- turnip,ir3: Implement A7XX push consts load via preamble 1352- tu: Add push_consts_per_stage debug option 1353- tu: Fix VK_FORMAT_A8_UNORM_KHR using UBWC when !has_8bpp_ubwc 1354- tu/kgsl: Fix field order in kgsl_command_object init 1355- tu: Fix stale tu_render_pass_attachment::store_stencil with dyn rendering 1356- tu: Zero init tu_render_pass and tu_subpass for dynamic rendering 1357- tu: Disable preamble push consts when they are not used 1358- ir3: Fix values of #wrmask not being compatible with ir3 parser 1359- tu: Count a whole push consts range in constlen for PREAMBLE push consts 1360- freedreno/rddecompiler: Use fd_dev_gen to pass gpu_id to ir3 disasm 1361- freedreno/rddecompiler: Decompile repeated IBs 1362- freedreno: Fix field size of A6XX_TEX_CONST[3].ARRAY_PITCH 1363- tu: Fix reading of stale (V)PC_PRIMITIVE_CNTL_0 1364 1365Dave Airlie (163): 1366 1367- ci: remove binding model from the asan skips for lavapipe. 1368- gallivm: fix atomic global temporary storage. 1369- llvmpipe: fix fragdata/lastfragdata heuristic a bit more. 1370- nvk: add missing finish calls 1371- nvk: add some initial wsi framework. 1372- nvk: fix header guards to be less generic. 1373- nvk: add bind buffer memory 1374- nvk: Add initial queue 1375- nvk: add cmd buffer framework 1376- nvk: Reset pushbufs on command buffer reset 1377- nvk: reindent descriptor sets to mesa std. 1378- nvk: add initial descriptor pool framework. 1379- nvk: some boilerplate for descriptor sets 1380- nvk: add descriptor set bo allocation. 1381- nvk: implement buffer address. 1382- nvk: descriptor set freeing fix 1383- nvk: move to new command stream generator. 1384- nvk: port the blit and copy code to new command submission. 1385- nouveau/ws: drop the old push generators. 1386- nvk: link in codegen without gallium bits. 1387- nvk: Initial wiring in of the compiler 1388- nvk: Basic descriptor binding 1389- nouveau/vk: add support for compute classes to generator. 1390- nvk: retrieve gpc/mp counts from kernel. 1391- nvk: add support for preamble and tls allocation. 1392- nvk: add record result to cmd_buffer. 1393- nvk: add command stream upload buffer. 1394- nouveau/winsys: Add m2mf/compute objects 1395- nvk: add some basic format wrapping framework 1396- nvk: add some compute limits 1397- nvk: add basic nve4+ compute support. 1398- nvk: fix empty cmd submission. 1399- nouveau/ws: add a push reset just for references. 1400- nouveau/classes: add 906f header support. 1401- nvk: add initial 8/16 byte clears. 1402- nvk: fix pipeline pushbuf sizing 1403- nvk: increase graphics cpu push buffer 1404- nvk: fix depth emission ordering. 1405- nvk: add some limits/features from binary driver. 1406- nvk: add indexed draw support. 1407- nvk: assign vertex locations according to input attrib index 1408- nvk: lower io to temps to avoid output reads in vertex shaders 1409- nvk: handle NULL to destroy descriptor pool 1410- nvk: add basic primitive restart 1411- nvk: fix copy lower address extraction 1412- nvk: fix multiple pipelines failure allocation case. 1413- nvk: init dev->physical_device earlier. 1414- nvk/winsys: store device ptr into bo instead of ptr 1415- nvk: set the device fd 1416- nil: Fix image align and size constraints 1417- nvk: Report image alignments from NIL 1418- nouveau/winsys: allocate unique object handles across channels. 1419- nvk/nil: don't ask for compressed image kind 1420- nvk/barrier: handle host bit. 1421- nvk: add compute support for ampere 1422- nvk: add min_lod to spirv caps. 1423- nvk: fix r32_sint format support 1424- nvk: expose EXT_sampler_filter_minmax 1425- nvk: fix transform feedback crash when optimiser removes things. 1426- nvk: merge tess info between tcs/tes. 1427- nvk: introduce an optimisation loop. 1428- nvk: add support for D32_SFLOAT_S8_UINT 1429- nvk/query: fix push buffer size for copy pool results. 1430- nvk: init image fields for requirements 1431- nvk: handle alignments in device memory 1432- nvk/tess: don't emit patch control points in pipeline 1433- nvk: align geometry clip setting with nvc0 1434- nvk: fix independent color write masks. 1435- nvk: enable rgb32 texel buffer support 1436- nvk: enable EXT_depth_clip_control 1437- nvk: enable EXT_depth_clip_enable 1438- nvk: always sync internal cmd bufs for vma lifetimes. 1439- nouveau/winsys: add support for the vma bind interfaces 1440- nvk: Add support for sparse buffers 1441- nvk: Add support for sparse images 1442- nvk/queue: add support for syncobjs and sparse binds 1443- nvk: Handle pre-turing indirect buffers with sparse 1444- nvk: enable sparse features 1445- nvk: enable a bunch of external fence/semaphore bits 1446- nvk: enable sparse residency buffer on maxwell+ 1447- nvk: add new internal bo allocation flag. 1448- docs: add two nvk exts to features.txt 1449- zink: use fprintf instead of printf to align the requirements warnings 1450- nvk: align sampler allocation counts with nvidia. 1451- zink: turn off threaded cpu access if not visible. 1452- nvk: add gart forced cmd pool side buffer. 1453- nvk: add cond render upload buffer. 1454- nvk: enable KHR_shader_clock. 1455- nvk: NOUVEAU_WS_BO_LOCAL is a trap. 1456- gallivm: drop unused info parameter 1457- llvmpipe/fs: drop cbuf 0 since it's lowered now. 1458- gallivm/nir: avoid using params->info 1459- llvmpipe/fs: move some tgsi checks in nir path to nir code. 1460- llvmpipe/cs: convert to using tgsi->nir 1461- llvmpipe/cs: drop tgsi for compute/mesh/task shader internals. 1462- lavapipe: use vk_buffer common code. 1463- lavapipe: use vk_buffer_range common code. 1464- llvmpipe/fs: switch to using tgsi->nir instead of handling tgsi 1465- llvmpipe/analyse: drop TGSI path. 1466- llvmpipe/fs: start using nir info in some places. 1467- llvmpipe/fs: drop the simple shader logic 1468- llvmpipe/fs: rewrite output finding using nir. 1469- nvk: add build_id linker argument. 1470- nir/gather: add support for fbfetch and bindless image loads. 1471- llvmpipe/cs: further cleanups after tgsi removal. 1472- llvmpipe: move to nir lowering for fquantize2f16 1473- rusticl: don't store ptrs to nir_variables across opt passes. 1474- llvmpipe: enable f16 paths on aarch64. 1475- clover/llvm: move to modern pass manager. 1476- nir: use a _clone so users calling their variable clone don't get a warning 1477- nir: rename nir_inline_functions.c to nir_functions.c 1478- nir: use nir_function_instructions_pass in the inliner. 1479- nir: move the libclc lowering over to functions file. 1480- nir/functions: use helper to get function for a name. 1481- nir/functions: put link state into a struct 1482- nir/functions: move linker pass to new helper 1483- nir: add nir function clone 1484- nir: don't inline linked functions 1485- gallivm/nir: split prepasses out to make per-function work easier. 1486- gallivm: rework translator to allow per-impl work. 1487- spirv/nir: parse function control and store in nir. 1488- nir: add driver_functions option to avoid inlining. 1489- nir: add a function usage tracker 1490- rusticl: use cleanup funcs 1491- gallivm: add support for function calling 1492- llvmpipe/cs: add support for function calls. 1493- llvmpipe: enable driver functions. 1494- radv: don't emit event code on video queues. 1495- spirv: use a pointer sized int type for opencl event_t 1496- clover: fix parameter arguments since recent translator changes. 1497- radv/video: take db alignment into account when allocating images. 1498- ac,radeonsi: move vcn enc structs to common 1499- ac,radeonsi: move vcn enc av1 default cdf file to common 1500- nir: add a deref slot counter that handles compact 1501- llvmpipe/linear: drop tgsi path. 1502- gallivm: drop tgsi aos paths. 1503- llvmpipe/nir: call gather info to update inputs read properly 1504- llvmpipe/fs: start converting interp/input paths to nir. 1505- llvmpipe/fs: start converting dervied state to nir based. 1506- llvmpipe/linear: convert to using nir for output. 1507- llvmpipe/linear: move to nir inputs 1508- draw/mesh: reset some user state values on mesh draws. 1509- llvmpipe/fs: fix regression in sample mask handling from tgsi removal. 1510- llvmpipe: reset viewport_index_slot in fb bind 1511- llvmpipe/cs: migrate to generic jit texture from pipe code. 1512- llvmpipe/cs: migrate cs image handle to common jit code. 1513- lavapipe: fix some whitespace in advance of other changes. 1514- lavapipe: fix subresource layers asserts 1515- lavapipe: support host image copying on compressed texture formats 1516- llvmpipe: don't create texture functions for planar textures. 1517- lavapipe: don't emit blit src/dst for subsampled formats. 1518- llvmpipe: don't support planar formats for buffers. 1519- lavapipe: convert sampler to use vk base class. 1520- lavapipe: cleanup copy code to use a local region variable. 1521- lavapipe: start introducing planes structure. 1522- lavapipe: allocate image and image view planes. 1523- lavapipe: handle planes in copies 1524- lavapipe: handle planes in get image sub resource 1525- lavapipe: add descriptor sets bindings for planar images 1526- lavapipe: handle planes in texture lowering. 1527- lavapipe: expose planar ycbcr formats and new ycbcr features 1528- lavapipe + docs: update ycbcr extension enables. 1529- intel-clc: avoid using spirv-linker. 1530 1531David Heidelberg (82): 1532 1533- ci/freedreno: update a530 flakes 1534- ci: build kernel in gfx-ci/linux and just use binaries in Mesa3D CI 1535- ci: update kernel to 6.3.13 1536- ci/freedreno: add fails introduced by upreving to 6.3.13 1537- Revert "lima/ci: temporarily disable deqp-egl tests due to timeouts" 1538- ci/radeonsi: stoney arb_timer_query got fixed between kernel 6.3.1..13 1539- ci/lima: EGL testing was disabled when fp16 fail was removed 1540- ci/freedreno: fix unexpectedpass flake on a630 1541- ci/freedreno: add another a530 flakes 1542- ci: add quirk for GitLab assuming changes is always true for scheduled runs 1543- ci/microsoft: when re-enabling Windows Farm, always run the container 1544- ci/freedreno: add a530 flakes, remove one fail which recently started passing 1545- ci/panfrost: introduce OpenGL testing with Mali-G57 MP5 on Asurada chromebook 1546- ci/freedreno: cover all texture gather flakes 1547- ci/freedreno: add a530 flake vs-lessthanequal-uvec4-uvec4 1548- ci/farms: always compare the code against main repository 1549- Revert "ci/farms: always compare the code against main repository" 1550- ci/kernel: add amd patch to prevent crashes when starting X 1551- ci/kdl: remove extra-verbose ls command 1552- ci/nouveau: add 20 minutes timeout to gk20a and align gm20b 1553- ci/freedreno: document another mapbuffer flake on a530 1554- ci/amd: fix timeouting radeonsi-raven-va-full job 1555- docs/ci: default to port 80 for the caching proxy 1556- docs/ci: update to systemd and used version of the trace for testing 1557- docs/ci: remove default nginx config, which we don't need for proxy 1558- bin/ci: handle errors more gracefully in update_traces_checksum script 1559- ci/freedreno: document another flakes on Adreno 530 1560- ci: add perfetto into mesa git-cache 1561- ci/panfrost: re-enable t760 and t860 traces as a nightly job 1562- CI: Re-enable G52 Vulkan testing 1563- ci/panfrost: t760-gles is nightly job, test also GLES 3 and 3.1 1564- ci/zink: Add flake seen in the wild 1565- ci/build: limit debian-build-testing to 30 minutes 1566- ci/amd: add glx\@glx-visuals-depth flake to raven 1567- ci/freedreno: document vs-nested-return-sibling-loop2 flake on Adreno 530 1568- ci/farms: enabled Microsoft job only when conditions are met 1569- ci/deqp: really remove the uncompressed results.csv file 1570- ci/baremetal: do not install curl, it's already there 1571- ci/baremetal: shorten BM_KERNEL to filename and BM_DTB to name only 1572- ci/freedreno: document another a530 flake batch 1573- ci: remove LAVA prefix from variables which can be used also elsewhere 1574- ci/zink: drop a630, which we currently have very low amount available 1575- ci/freedreno: the tag belongs to the apq8016 only 1576- ci/freedreno: switch references, the farm-rules takes care about this 1577- ci/freedreno: handle disabling farm properly for each FD/Collabora farm 1578- ci/freedreno: another batch of Adreno 530 flakes 1579- gtest: backport ansi color fix 1580- ci: disable Material Testers.x86_64_2020.04.08_13.38_frame799.rdc trace 1581- panfrost/ci: revert Disable T720 1582- ci/piglit: add extra space on top to prevent single quote getting into URL 1583- ci/freedreno: There is only one King of Town. 1584- ci: switch to 6.4 kernel, improving Adreno 660 reliability 1585- ci/iris: add GL46.arrays_of_arrays_gl.SizedDeclarationsPrimitive timeout 1586- ci/panfrost: add G52 flakes 1587- ci/panfrost: we have enough device, parallelize Vulkan tests 1588- ci/virgl: flakes in functional.draw_buffers_indexed group 1589- ci/freedreno: add another a530 flake 1590- ci/panfrost: add G52 simple_tests.partial_image_pot_same_format_noclear flake 1591- panvk: architecture isn't invalid, just unsupported 1592- panvk: catch unsupported arch in the panvk_physical_device_init 1593- Revert "ci: disable a660 jobs" 1594- docs: add LAVA farm informations 1595- ci: disable Google Freedreno farm, currently timeouting on all jobs 1596- Revert "ci: disable Google Freedreno farm, currently timeouting on all jobs" 1597- ci/farms: no need to check RUNNER_TAG for Collabora farm 1598- ci/traces: extend no-output timeout by 5 minutes 1599- ci/venus: add fragment.32B_in_memory_with_vec4_s32 flake 1600- iris: do not mention specifically clover for OpenCL support 1601- ci/freedreno: disable broke cheza (Adreno 630) runners 1602- ci/bare-metal: correct workaround for R8152 issue while retrieving TFTP data 1603- ci/bare-metal: drop unused imports, sort, use SPDX license 1604- ci/lima: farm is down, disable for now 1605- ci: do not report failed job when flakes reporting fails 1606- ci/freedreno: re-enable Cheza (Adreno 630) runners 1607- ci/traces: upload only missing trace images 1608- ci/traces: keep images for every job except the performance testing 1609- ci/traces: rename upload function to reflect it works with S3 1610- ci/traces: always export piglit EXTRA_ARGS 1611- ci: ci_marge_queue.py 1612- ci/freedreno: fix copy paste causing a618_gl being run only in manual pipeline 1613- ci/freedreno: disable Adreno 660 Vulkan pre-merge 1614- ci/traces: drop the freedoom-phase2-gl-high.trace 1615 1616David Rosca (70): 1617 1618- radeonsi: Use DIV_ROUND_UP instead of ALIGN_POT 1619- frontends/va: Skip processing buffers already converted with EFC 1620- frontends/va: Don't use EFC with scaling or filtering enabled 1621- radeonsi/vcn: Don't use chroma in AV1 encode with RGB input 1622- frontends/va: Parse H264 SPS for video signal parameters 1623- frontends/va: Parse HEVC SPS for video signal parameters 1624- frontends/va: Add postproc support for converting to full range 1625- radeonsi/vcn: Set H264 video signal parameters in bitstream 1626- radeonsi/vcn: Set HEVC video signal parameters in bitstream 1627- radeonsi/vcn: Enable full/limited range support for H264/HEVC/AV1 1628- radeonsi/vcn: Fix setting color range in AV1 bitstream 1629- gallium/auxiliary/vl: Fix RGB->YCbCr full range matrix 1630- gallium/auxiliary/vl: Handle UV subsampling in compute_shader_yuv 1631- gallium/auxiliary/vl: Fix blurry output of compute_shader_yuv 1632- frontends/va: Add YUV420 to NV12 postproc conversion 1633- gallium/auxiliary/vl: Fix chroma and blurry output of cs video_buffer 1634- gallium/auxiliary/vl: Fix chroma offset of compute_shader_weave 1635- frontends/va: Also map VAImageBufferType for reading 1636- frontends/va: Alloc interlaced surface for interlaced pics 1637- frontends/vdpau: Alloc interlaced surface for interlaced pics 1638- radeonsi: Don't prefer interlaced for video decode 1639- ci/amd: Skip VAAPI CreateSurfacesWithConfigAttribs/1121 test 1640- frontends/va: Don't allow multi-plane derive without driver support 1641- frontends/va: Init view_resources array in vlVaPut/GetImage 1642- radeonsi: Copy all planes with multi-plane staging textures 1643- radeonsi: Enable PIPE_VIDEO_CAP_SUPPORTS_CONTIGUOUS_PLANES_MAP 1644- ci/amd: Skip all VAAPI tests that creates too many huge surfaces 1645- radeonsi/vcn: Update rate control when framerate changes with HEVC 1646- frontends/va: Ignore requested size when creating VAEncCodedBufferType 1647- gallium/auxiliary/vl: Set correct csc matrix in set_buffer_layer 1648- radeonsi/vcn: Fix leaking fences in decode 1649- gallium/auxiliary/vl: Add BT.709 full csc matrix 1650- frontends/va: Set csc matrix in postproc 1651- gallium/auxiliary/vl: Don't set csc matrix in video_buffer/rgb_to_yuv_layer 1652- frontends/va: Add BT.709 as supported postproc color standard 1653- Revert "radeonsi/vcn: add an exception of field case for h264 decoding" 1654- gallium/auxiliary/vl: Set vertex element src_stride in vl_deint_filter 1655- gallium/auxiliary: Fix util_compute_blit half texel offset with scaling 1656- gallium/auxiliary/vl: Map range when updating constants 1657- gallium/auxiliary/vl: Clamp coordinates in compute shaders 1658- gallium/auxiliary/vl: Support chroma sample location in compute shaders 1659- frontends/va: Support chroma sample location in postproc 1660- frontends/va: Flush after unmapping VAImageBufferType 1661- frontends/va: Parse chroma sample location in H264/HEVC SPS 1662- radeonsi/vcn: Set H264/HEVC chroma sample location in bitstream 1663- radeonsi/vcn: Don't hang GPU when using DCC surface as encoder input 1664- frontends/va: Track surfaces in context 1665- frontends/va: Destroy fences when destroying surface or context 1666- radeonsi/vcn: Implement destroy_fence vfunc 1667- frontends/va: Process VAEncSequenceParameterBufferType first in vaRenderPicture 1668- frontends/va: Set default rate control values once when creating encoder 1669- gallium/auxiliary/vl: Add RGB to YUV compute shader 1670- gallium/auxiliary/vl: Use chroma offset in YUV to RGB weave compute shader 1671- gallium/auxiliary/vl: Fix YUV to RGB bob compute shader deinterlacing 1672- gallium/auxiliary/vl: Only map the shader constants buffer in render 1673- frontends/va: Add High Quality preset mode 1674- radeonsi/vcn: Add High Quality encoding preset for AV1 1675- radeonsi: Fix plane size in si_copy_multi_plane_texture 1676- frontends/va: Implement vaMapBuffer2 1677- frontends/va: Fix locking in vlVaBeginPicture 1678- frontends/va: Parse H264 SPS for max_num_reorder_frames 1679- util/vl: Fix vl_rbsp parser with bitstreams without emulation bytes 1680- frontends/va: Fix parsing packed headers without emulation bytes 1681- radeonsi/vcn: Add encode support for H264 B-frames 1682- frontends/va: Map decoder and postproc surfaces for reading 1683- radeonsi: Fix offset for linear surfaces on GFX < 9 1684- gallium/auxiliary/vl: Fix coordinates clamp in compute shaders 1685- gallium/auxiliary: Fix coordinates clamp in util_compute_blit 1686- gallium/auxiliary/vl: Scale dst_rect x0/y0 when rendering chroma plane 1687- util/rbsp: Fill bits twice if reading more than 16 bits 1688 1689Derek Foreman (2): 1690 1691- vulkan/wsi: Allow binding presentation_timing when software rendering 1692- vulkan/wsi: warn about unset present_mode in PresentModeCompatibilityExt 1693 1694Dmitry Baryshkov (3): 1695 1696- gallium: move kmsro definition to the bottom of the file 1697- gallium: unbreak kmsro/freedreno case 1698- tu: Pass real size of prime buffers to allocator 1699 1700Dmitry Osipenko (3): 1701 1702- util/cache_test: Re-add test for disabled cache 1703- util/cache_test: Fix disabled cache test using SHADER_CACHE_DISABLE_BY_DEFAULT 1704- util/cache_test: Add test for get/put() with disabled cache 1705 1706Dor Askayo (1): 1707 1708- nouveau: add exported GEM handles to the global list 1709 1710Dr. David Alan Gilbert (6): 1711 1712- rusticl/core: Add profiling time storage (queued) to event 1713- rusticl: Wire the 'queued' profiling time up 1714- rusticl: Wire the 'submit' profiling time up 1715- rusticl: Wrap pipe queries 1716- rusticl: Wrap pipe query reads 1717- rusticl: Wire the 'start' and 'end' profilng times up 1718 1719Dylan Baker (4): 1720 1721- VERSION: bump to 23.3.0-devel 1722- docs: Update release calendar for 23.2.0-rc1 1723- docs: truncate feature list for 23.3-devel 1724- meson: use a single dependency call for lua 1725 1726Echo J (5): 1727 1728- nvk: Fix some cast defines 1729- nvk: Add A8B8G8R8_*_PACK32 format support 1730- nvk: Add bufferImageGranularity limit 1731- nvk: Reset offset value in ResetDescriptorPool 1732- nil: Add A4B4G4R4_UNORM format support 1733 1734Emma Anholt (111): 1735 1736- ci/radv: Clarify when the ANGLE GS failures started happening. 1737- ci: Uprev ANGLE to 0518a3ff4d4e ("Android: Simplify power metrics collection") 1738- ci/tgl: Improve the info for ANGLE's MSAA regression on TGL. 1739- ci/tu: Add more crash cases for the multithreading bugs caught on a630. 1740- ci/tu: Mark descriptor_buffer.basic.limits as failing in gmem too. 1741- ci/tu: Drop some xfails for !24086 1742- tu: Fix data race in userspace VMA management. 1743- ci/a5xx: Add another GPU hanging piglit test to the skips. 1744- Revert "ci: Disable nouveau CI" 1745- nvk: Avoid strict aliasing warning in the pushbuffer encoding. 1746- nvk: Fix uninitialized result usage in NVK_DEBUG_ZERO_MEMORY. 1747- nvk: Fix unused result warnings in pushbuf resets. 1748- nvk: Remove duplicate (disabled) point sprite setup. 1749- nvk: Fix missing init of the stages to sync against. 1750- nvk: Use depth_clamp_enable to select PIXEL_*_Z_CLAMP. 1751- nouveau/winsys: Fix an undefined use in the error path. 1752- nvk: Quiet a compiler warning. 1753- nvk: Clean up redundant vendor checking for physical device creation. 1754- nvk: Add support for probing as a platform device. 1755- nvk: Disable shaderStorageImageReadWithoutFormat pre-Maxwell. 1756- freedreno/a5xx: Fix border color structure size. 1757- freedreno/a5xx: Skip emitting unused texture descriptors for images. 1758- freedreno/ir3: Move pvtmem per-fiber size alignment to the compiler. 1759- ci/freedreno: Drop a bunch of stale a530 xfails. 1760- ci/freedreno: Sort another a530 xfail with its friends. 1761- ci/freedreno: Update comments for some a530 xfails. 1762- ci/freedreno: Add some more db820c xfails. 1763- freedreno/devices: Move fibers_per_sp to the common info struct. 1764- freedreno/devices: Set num_sp_cores explicitly for pre-gen6. 1765- freedreno/a6xx: Move pvtmem allocation to ir3_gallium. 1766- freedreno/a3xx: Add the shift for MEMSIZEPERITEM according to db410c docs. 1767- freedreno/a5xx: Refactor SHADER_OBJ emit to a helper function. 1768- freedreno/a5xx: Set num_sp_cores and set PC/VFD_POWER_CNTL accordingly. 1769- freedreno/a5xx: Add private mem support. 1770- freedreno/cffdec: Fix decode on pixel 2 blob's COMPUTE_CHECKPOINT 1771- ci/freedreno: Add a regression test for decoding a540 blob's compute shaders. 1772- freedreno: Fix crashdec pre-a6xx. 1773- freedreno/a5xx: Skip SSBO emit when none are enabled. 1774- vulkan/util: Make multialloc succeed with 0 allocations. 1775- turnip: Track the first/last subpass an attachment is used in. 1776- turnip: Skip emitting empty CP_COND_REG_EXEC. 1777- turnip: Save the renderpass's clear values in the cmdbuf state. 1778- turnip: Move gmem clears and loads to the first subpass that uses them. 1779- turnip: Move sysmem clears to the first subpass that uses them. 1780- ci/freedreno: Skip some tests on a5xx that destabilize other tests. 1781- freedreno/a3-5xx: Don't try to emit ISAM for SSBO loads. 1782- ci/turnip: Add a660 VK coverage. 1783- disk_cache: Disable the "List" test for RO disk cache. 1784- blorp: Disable unaligned partial HIZ fast clears for HIZ_CCS too. 1785- intel/fs: Move defin/defout setup to the start of the loop. 1786- intel/fs: Move the defin[]/defout[] screening up to livein[]/liveout[] setup. 1787- intel/fs: Simplify compute_start_end(). 1788- ci/freedreno: Add another excessive-constlen UBO skip. 1789- ci/anv: Drop DEQP_VER:vk setting. 1790- ci/anv: Drop "-vk" from the job name. 1791- ci/anv: Add a manual full VK run for TGL. 1792- ci/anv: Add testing on JSL. 1793- freedreno: Build drm subdir before perfcntrs, which uses it. 1794- ci/intel: Add various updates from our nightly runs. 1795- ci/virgl: Disable virgl-iris-traces. 1796- ci/zink: Add a few updates for anv/tgl from the nightly runs. 1797- ci/fastboot: Use a case insensitive match for a fastboot line. 1798- ci/etnaviv: Skip some tests that hang the GPU and knock out other tests. 1799- ci/etnaviv: Drop some gc2k flakes that I think are resolved. 1800- ci/anv: Drop incorrect xfail addition for TGL 1801- ci/anv: Drop the 16bit.scalar.13 skip. 1802- ci/etnaviv: Minor xfail/flake polishing. 1803- ci/etnaviv: Skip a GLES2 test that times out the asan job. 1804- ci/zink: Skip more doubles tests on anv that flake at 3 minute timeouts. 1805- ci/docker: Clear the results file before starting a new deqp test run. 1806- ci/crocus: Add a related flake to a known one. 1807- ci/etnaviv: return gl-1.4-tex1d-2dborder as a known flake 1808- ci/crocus: Add known piglit flakes 1809- ci/hasvk: Add a bunch of new CTS border color fails. 1810- i915: Re-clang-format and enforce it in CI. 1811- i915: Print the relevant counts vs limits when throwing errors. 1812- i915: Don't log I915_DEBUG=fs output for blit shaders. 1813- i915: Save fragment program compile error messages in the fragment shader. 1814- i915: Do a test compile at glLinkShader() time. 1815- i915: Make exceeding tex indirect count fatal. 1816- i915: Use nir_group_loads() to reduce texture indirection phases. 1817- ci/crocus: Generalize the drawarrays-vertex-count flakes. 1818- ci/zink: Skip 3-minute-long glx-visuals timeouts. 1819- ci/zink: Skip dmat[34] op tests in general, as well 1820- ci/crocus: Disable flaky unvanquished-ultra trace 1821- nir/print: Decode system values in the variable declarations. 1822- ci/zink: Add a TGL flake that's showed up in nightlies recently. 1823- ci/radeonsi: Drop an xfail for vangogh. 1824- i915: Make I915_DEBUG=fs log shaders that fail to link due to CF. 1825- nir: Flatten ifs with discards in nir_opt_peephole_select for HW without CF. 1826- glsl: Remove lower_discard(). 1827- ci/zink: Only test half of piglit pre-merge on anv. 1828- ci: Stop doing internal retries in bare-metal. 1829- ci/bare-metal: Drop the 2 vs 1 exit code from poe_run. 1830- ci/bare-metal: Default our boards to a 20-minute timeout for the whole job. 1831- ci/iris: Drop parallel on kbl piglit to 2. 1832- ci/freedreno: Fold a630_egl into a630_gl. 1833- ci/freedreno: Move skqp testing to a618. 1834- ci/zink: Cut zink-lvp coverage in half. 1835- ci/freedreno: Generalize the implicit_unmap timeouts. 1836- ci_run_n_monitor: Poll mesa/mesa and user/mesa for pipelines at the same time. 1837- glx: Delete support for GLX_OML_swap_method. 1838- ci: drop skip for glx-swap-copy. 1839- dri: Drop a duplicate mesa vs pipe format table. 1840- docs/ci: Drop old instructions for farm disabling 1841- docs/ci: Add some links in the CI docs to how to track job flakes 1842- glsl: Remove int64 div/mod lowering. 1843- llvmpipe: Set nir_lower_dround_even. 1844- nir: Add nir_lower_dsign as 64-bit fsign lowering. 1845- glsl: Retire dround lowering. 1846- ci_run_n_monitor: Always resolve --rev arguments for looking up pipelines. 1847 1848Eric Engestrom (194): 1849 1850- ci: avoid running hardware jobs if lint fails - now on LAVA too! 1851- ci: avoid running hardware jobs if lint fails - now on Windows too! 1852- ci: replace copy of nouveau rules with reference 1853- ci: drop leftover kernel configs 1854- ci: use !reference for scheduled_pipeline retry rule 1855- ci: add .llvmpipe-manual-rules and use it 1856- ci: add .gallium-core-rules and use it instead of gallium_core_file_list anchor 1857- ci: replace llvmpipe_file_list anchor with reference 1858- ci: replace softpipe_file_list anchor with reference 1859- ci: replace lavapipe_file_list anchor with reference 1860- ci: replace iris_file_list anchor with reference 1861- ci: replace radv_file_list anchor with reference 1862- ci: replace radeonsi_file_list anchor with reference 1863- ci: replace virgl_file_list anchor with reference 1864- ci: move etnaviv files rules to src/etnaviv/ci/gitlab-ci.yml 1865- ci: move freedreno files rules to src/freedreno/ci/gitlab-ci.yml 1866- ci: move nouveau files rules to src/gallium/drivers/nouveau/ci/gitlab-ci.yml 1867- ci: move panfrost files rules to src/panfrost/ci/gitlab-ci.yml 1868- ci: move broadcom files rules to src/broadcom/ci/gitlab-ci.yml 1869- ci: move lima files rules to src/gallium/drivers/lima/ci/gitlab-ci.yml 1870- ci: move amd files rules to src/amd/ci/gitlab-ci.yml 1871- ci: move microsoft files rules to src/microsoft/ci/gitlab-ci.yml 1872- ci: move zink files rules to src/gallium/drivers/zink/ci/gitlab-ci.yml 1873- ci: move virtio files rules to src/virtio/ci/gitlab-ci.yml 1874- ci: move intel files rules to src/intel/ci/gitlab-ci.yml 1875- ci: move virgl files rules to src/gallium/drivers/virgl/ci/gitlab-ci.yml 1876- ci: move llvmpipe files rules to src/gallium/drivers/llvmpipe/ci/gitlab-ci.yml 1877- ci: move softpipe files rules to src/gallium/drivers/softpipe/ci/gitlab-ci.yml 1878- ci: move lavapipe files rules to src/gallium/drivers/lavapipe/ci/gitlab-ci.yml 1879- ci: delete install.tar after extracting it to avoid re-uploading it 1880- docs: add release notes for 23.1.4 1881- docs: add sha256sum for 23.1.4 1882- docs: update calendar for 23.1.4 1883- asahi: drop unused include paths 1884- ci/lint: deduplicate formatting check jobs 1885- ci/lint: also print a diff for rust format issues 1886- ci: allow hw jobs even if lint jobs fail for non-Marge pipelines 1887- ci: print rustfmt's version 1888- ci: print clang-format's version 1889- bin/ci_run_n_monitor: get git sha from pipeline if specified, instead of requiring --rev to match 1890- lavapipe/ci: use tighter changes: rules 1891- ci: add a 10min job timeout to formatting checks 1892- ci: reduce bare-metal retries of poe_run to only 3 attempts 1893- broadcom/ci: reduce vc4-rpi3-gl timeout to 30min (instead of 1h) 1894- broadcom/ci: reduce v3d-rpi4-gl timeout to 30min (instead of 1h) 1895- broadcom/ci: reduce v3d-rpi4-traces timeout to 30min (instead of 1h) 1896- broadcom/ci: reduce v3dv-rpi4-vk timeout to 30min (instead of 1h) 1897- ci: add .core-rules to .gallium-core-rules 1898- ci: drop rule for non-existent src/include/ 1899- docs: add release notes for 23.1.5 1900- docs: add sha256sum for 23.1.5 1901- docs: update calendar for 23.1.5 1902- ci: include some timing information in the git cache download script 1903- docs/ci: stop trying to enumerate drivers that are tested using VK-GL-CTS 1904- docs/ci: in paragraph about the CI being overwhelmed, mention our tool to help with that 1905- docs/ci: drop mention of build systems variants in the CI 1906- docs/ci: expand the description of test suites 1907- bin: add wrapper to run scripts in a python venv 1908- bin/ci/ci_run_n_monitor: use venv wrapper 1909- bin/ci/gitlab_gql: use venv wrapper 1910- bin/ci/update_traces_checksum: use venv wrapper 1911- bin/pick-ui: use venv wrapper 1912- ci: include mold in x86_64_test-base & rootfs images 1913- ci: use mold to build deqp 1914- zink/ci: set the default timeout for zink jobs to 30min instead of 1h 1915- egl: make _eglFilterConfigArray static 1916- egl: fixup _eglFilterConfigArray() params and drop _eglFallbackMatch() wrapper 1917- ci: build nvk 1918- ci: document max image tag length 1919- docs/radv: mark VK_EXT_tooling_info as implemented 1920- docs/radv: mark VK_INTEL_shader_integer_functions2 as implemented 1921- git-blame-ignore-revs: repeat instruction on how to enable to avoid having to look for it 1922- git-blame-ignore-revs: add radv formatting commit 1923- git-blame-ignore-revs: add pvr formatting commit 1924- meson: fix indentation 1925- docs/v3dv: mark direct display extensions as implemented 1926- ci: reorder vk drivers alphabetically in debian-vulkan job 1927- ci: build hasvk in debian-vulkan job 1928- ci/zink+radv: set a timeout of 2x the normal runtime 1929- amd/ci: drop duplicate test expectations 1930- panfrost: upcast uint8/uint16 before shifting them beyond their range 1931- ci/a530: document piglit flake 1932- docs: add release notes for 23.1.6 1933- docs: add sha256sum for 23.1.6 1934- docs: update calendar for 23.1.6 1935- docs: add one more 23.1.x release 1936- ci: rename \*.log to \*.txt to work around gitlab bug 1937- ci/freedreno: reuse freedreno_gl_file_list instead of re-definining it 1938- egl: bump extension string length 1939- vc4: drop duplicate .lower_ldexp 1940- zink: fix format in zink_make_{image,texture}_handle_resident() 1941- v3dv: fix VK_PIPELINE_ROBUSTNESS_{BUFFER,IMAGE}_BEHAVIOR_DEVICE_DEFAULT_EXT copy/paste typo 1942- v3dv: fix copy/pasted type of \`sample` 1943- v3dv: fix shader stage name in error message 1944- v3d/qpu: fix type of function argument 1945- ci/deqp: backport fix for dEQP-EGL.functional.wide_color.*_888_colorspace_* 1946- ci/farm-rules: fix missing valve-infra jobs in scheduled pipelines 1947- bin/ci_run_n_monitor: error out if both --project and --pipeline-url are passed 1948- ci: document farm rules 1949- ci/b2c: skip install.tar extraction if the tarball is not present 1950- ci/b2c: don't allow failures in test script preparation 1951- ci/b2c: assert that install folder is present whether or not the tarball was extracted 1952- ci/amd: split the polaris10 rules into one for each farm 1953- ci: skip containers & build jobs when disabling a farm 1954- docs: add release notes for 23.1.7 1955- docs: add sha256sum for 23.1.7 1956- docs: update calendar for 23.1.7 1957- docs: add one more 23.1.x release 1958- ci: taking igalia farm offline 1959- ci/b2c: drop logic to remove install.tar 1960- ci: drop clover leftover 1961- Revert "ci: taking igalia farm offline" 1962- bin/ci_run_n_monitor: print in which repo we're looking for the pipeline 1963- bin/ci_run_n_monitor: automatically pick MR pipelines when they exist 1964- ci: remove duplicate fork pipeline in MRs 1965- ci_run_n_monitor: add comment to explain "MR > fork" logic 1966- ci: don't run everything just because a farm gets re-enabled 1967- ci/windows: centralize definition of windows runners tags 1968- ci/windows: add windows docker runner tags to .windows-docker-vs2019 1969- ci/windows: drop build rules from test jobs 1970- ci: document which image tags need to be bumped when updating piglit 1971- ci: document which image tags need to be bumped when updating {alpine,debian,fedora}/x86_64 1972- ci/farm-rules: rename .disable-farm-mr-rules to make it clear it's only about MRs 1973- ci/farm-rules: re-add "run every container and build job when a farm gets re-enabled" 1974- ci/zink: drop redundant \`MESA_LOADER_DRIVER_OVERRIDE: zink` 1975- docs: add release notes for 23.1.8 1976- docs: add sha256sum for 23.1.8 1977- docs: update calendar for 23.1.8 1978- docs: add another 23.1.x 1979- ci: limit build jobs to 30min so that they can retry when they go wrong 1980- docs: drop outdated and redundant note about the minimum meson version 1981- ci/zink+radv: specify that zink-radv-navi10-valve should run in the mupuf farm 1982- ci/zink+radv: bump the timeout of zink-radv-navi10-valve by 10 minutes 1983- docs: add calendar for 23.3 1984- ci: unify container and build jobs rules 1985- docs/meson: drop mention that our meson is ready 1986- ci/docs: drop extra overwritten rules 1987- ci/zink+radv: document flake 1988- docs: document the merging process and what is allowed or not 1989- ci: drop unused shader-db clone + build from alpine image 1990- ci: drop unused shader-db clone + build from fedora image 1991- ci: move shader-db clone/build into its own script 1992- ci/deqp-runner: fix indentation 1993- ci/deqp-runner: restore exit-on-error after getting deqp-runner's exit code 1994- ci: fix shebang in build-deqp-runner.sh 1995- docs: add release notes for 23.1.9 1996- docs: add sha256sum for 23.1.9 1997- docs: update calendar for 23.1.9 1998- ci: drop unused ephemeral packages in alpine image 1999- docs/ci: rewrite the "farm maintenance ^ other change" rule to mean what we actually meant 2000- ci: skip dEQP-VK.api.driver_properties.conformance_version for everyone 2001- pick-ui: use assignment expressions 2002- pick-ui: use more expressive variable names 2003- pick-ui: add \`Backport-to: XX.Y` nomination 2004- v3d/ci: move traces job to wayland 2005- ci: print deqp version in the job log 2006- ci/b2c: move to the shiny new \`gfx-ci/ci-tron` repo 2007- ci/b2c: use latest mesa-trigger image 2008- include/dri_interface.h: restore define mistakenly removed in !25587 2009- ci_run_n_monitor: dependency jobs must always be started 2010- util/xmlconfig: drop driInjectDataDir() now that DRIRC_CONFIGDIR is always supported 2011- util/xmlconfig: inline datadir 2012- ci/b2c: change artifacts path to match baremetal and LAVA 2013- VERSION: bump for rc1 2014- .pick_status.json: Update to e64a97694ac9dc97f65e1a8e91a5c9789109fd2c 2015- .pick_status.json: Update to 4cdd094ae1e97d857a6b9dbc291d7bbe6ea266ac 2016- .pick_status.json: Update to e4a1bc70dd739ca8addddc940af08312b038e288 2017- .pick_status.json: Update to faed5d647f2416bb0ce3a9d33a3955169c70dc52 2018- VERSION: bump for 23.3.0-rc2 2019- .pick_status.json: Update to 1f1ec1c6bcc2a32a3c1df8c2cc7a2f4e7139b7ec 2020- .pick_status.json: Mark 8dda860f83ac30d042dc6beb4438cc925d1fd130 as denominated 2021- .pick_status.json: Update to 7d6f9ccfbeab050c26775d5e03578a01526cbfcb 2022- .pick_status.json: Update to aa33ca0a52591961f8ae01dc253354462ed17c18 2023- .pick_status.json: Update to a77ea9555aa00cc12f3d1c440252e940ff552500 2024- .pick_status.json: Mark 227300345ed38377190b0eaf08694d5c42ee7e60 as denominated 2025- VERSION: bump for 23.3.0-rc3 2026- .pick_status.json: Update to 56451ce773c11094a8c08fdc6b500bb8bdcf37e1 2027- .pick_status.json: Mark fa7ec4226bdf48bf63438e303af83ecd58ec95f2 as denominated 2028- .pick_status.json: Update to 08f851f4361cfbdb211dc70d03cf3ebff331c3ee 2029- .pick_status.json: Update to 03a7cb261828b350dd9b56bd74850197ca9eba33 2030- .pick_status.json: Mark fcfa68a632e5711cc657b103c9a0384928e9bf49 as denominated 2031- VERSION: bump for 23.3.0-rc4 2032- .pick_status.json: Update to f05688aa3299a27430119b27e45181a6f415bff8 2033- egl/dri2: increase NUM_ATTRIBS to fit all the attributes 2034- .pick_status.json: Update to f39ed0063b4cd3e5a71efad2d43ce31f574c698d 2035- .pick_status.json: Update to b07a58157d0b110dbc09a42cffe7046c3200dd3b 2036- VERSION: bump for 23.3.0-rc5 2037- .pick_status.json: Update to f843b14c171299e1696ca6d971ccaa496f60c3ab 2038- intel/perf: fix regex escaping 2039- intel/ci: fix .hasvk-manual-rules 2040- VERSION: bump for 23.3.0 2041- Revert "VERSION: bump for 23.3.0" 2042- docs: add release notes for 23.3.0 2043- Revert "docs: add release notes for 23.3.0" 2044 2045Erico Nunes (10): 2046 2047- lima/ppir: don't optimize loads with different block successors 2048- lima/ppir: convert to nir_legacy 2049- lima/gpir: switch to register intrinsics 2050- egl/drm: fix EGL_EXT_buffer_age with gbm contexts 2051- lima: fix plbu block stride calculation 2052- ci: disable lima LAVA lab for maintance 2053- Revert "ci: disable lima LAVA lab for maintance" 2054- v3dv: allow headless device without display device 2055- Revert "ci/lima: farm is down, disable for now" 2056- v3dv: Rework to remove drm authentication for wsi 2057 2058Erik Faye-Lund (30): 2059 2060- meson: report with_glvnd in summary 2061- docs: upgrade bootstrap to 5.3.1 2062- docs: expand mobile-menu without js 2063- panfrost: delete stale editorconfig file 2064- docs/panfrost: link to lima 2065- docs/panfrost: use code-blocks with wrapping for long blocks 2066- docs/panfrost: use math-role to denote powers of two 2067- docs: fix linkcheck 2068- docs: update a few links to https 2069- docs: update anchor for link 2070- docs: update link to git-wiki 2071- docs: link to upstream etnaviv 2072- docs: apply some trivial redirects 2073- docs: use doc-role when linking to lists article 2074- docs: keep up with intels ever-moving documentation 2075- docs: mark some redirects as allowed 2076- docs: only link to old docs from html 2077- docs: use html_static_path for static files 2078- ci/etnaviv: update ci expectation 2079- ci/etnaviv: allow failure on failing test 2080- zink: fix wording of warning 2081- ci/etnaviv: move failure to flake 2082- meson: add wayland-protocols from meson wrapdb 2083- util/xmlconfig: add an env-var for overriding drirc search dir 2084- meson: add src/util to the drirc search path 2085- docs/relnotes: remove cruft from end of lines 2086- docs/ci: escape at-symbols 2087- docs/relnotes: escape some at-symbols 2088- bin/gen_release_notes: escape at-symbols 2089- panfrost: use perf_debug instead of open-coding 2090 2091Faith Ekstrand (809): 2092 2093- nv50/ir: Convert to new-style NIR registers 2094- nv50/ir: Support vector movs 2095- intel/fs: Add support for new-style registers 2096- intel/vec4: Assume get_nir_dest() provides a sane write-mask 2097- intel/vec4: Add support for new-style registers 2098- intel: Switch to intrinsic-based registers 2099- intel/fs: Drop support for nir_register 2100- intel/vec4: Drop support for nir_register 2101- anv,hasvk,iris: sampler_prog_key::swizzles is only used on crocus 2102- nir: Properly handle divergence for load_reg 2103- nir/trivialize: Maintain divergence information 2104- nir/trivialize: Trivialize cross-block loads 2105- vc4: Convert to new-style NIR registers 2106- nir/schedule: Support load/store_reg 2107- broadcom/compiler: Convert to new-style NIR registers 2108- intel/fs: Use write masks from store_reg intrinsics 2109- intel/fs: Rework the overlapping mov/vec case 2110- intel/fs: Assume NIR is in SSA form 2111- nir: Add a backend_flags field to nir_tex_instr 2112- intel/fs: Add a parameter to speed up register spilling 2113- nir/builder: Allow tex helpers on image types 2114- nir/builder: Add a nir_txs_deref() helper 2115- vulkan: Add a core vk_buffer_view struct 2116- vulkan: Add a more direct way to use a NIR shader 2117- vulkan: Add a vk_query_pool base object 2118- vulkan: Add common vkCmdBegin/EndQuery wrappers 2119- vulkan/format: Add the remaining 1-plane YCbCr formats 2120- vulkan: Add a core vk_sampler struct 2121- nv50/nir: Lower to scratch AFTER optimization 2122- nouveau: Allow GLSL_SAMPLER_DIM_SUBPASS* 2123- nouveau/nir: Implement support for compact arrays 2124- nouveau/codegen: Handle/indirect goes before sample index 2125- nouveau/codegen: Use a NULL format for PIPE_FORMAT_NONE for images 2126- nouveau/codegen: Don't convertSurfaceFormat for unknown formats 2127- nv50/ir: Run nir_divergence_analysis before out-of-SSA 2128- anv: Use vk_sampler 2129- anv: Use vk_buffer_view 2130- vulkan: Add init/finish helpers for vk_query_pool 2131- anv: Use vk_query_pool 2132- anv: Use the common versions of vkBegin/EndQuery() 2133- nir/builder: Don't assume we have compiler options 2134- Revert "mesa, compiler: Move gl_texture_index to glsl_types.h" 2135- Revert "compiler: Combine duplicated implementation of is_gl_identifier into glsl_types.h" 2136- vulkan: Use VkBufferUsageFlags2 in vk_buffer 2137- clang-format: Set ColumnLimit to 78 2138- nvk: Implement EnumerateInstanceVersion 2139- nvk: Add stub implementations of VkImage and VkImageView 2140- nvk: Add stub implementation of VkSampler 2141- nvk: Add a stub implementation of VkBuffer 2142- nvk: Implement VkDescriptorSetLayout 2143- nvk: Implement VkPipelineLayout 2144- nvk: Add initial descriptor set lowering 2145- nvk: Implement vkUpdateDescriptorSets 2146- nvk: Expose nvk_descriptor_stride_align_for_type 2147- nvk: Re-format descriptor set layouts 2148- nvk: Re-format pipeline layouts 2149- nvk: Re-format descriptor sets some more 2150- nvk/buffer: Take an offset in nvk_buffer_address 2151- nvk/buffer: Add a push_buffer_ref helper 2152- nvk/copy: Use nvk_buffer_address in CmdCopyBuffer 2153- nvk/image: Add image address helpers 2154- nvk/copy: Use nvk_image_base_address() 2155- nvk: Add an nvk_device_physical helper 2156- nvk: Add a skeleton for pipelines 2157- nvk: Re-arrange nvk_descriptor_set.h a bit 2158- nvk: Reformat nvk_nir_lower_descriptors 2159- nvk: Add a couple descriptor set address helpers 2160- nvk: Move nvk_cmd_pool cast definitions 2161- nvk: Rework whitespace in nvk_cmd_buffer.c 2162- nvk: Add a root descriptor table 2163- nvk: Fetch descriptor set addresses from the root table 2164- nvk: Re-arrange nir_lower_explicit_io a bit 2165- nvk: Lower load_global_constant_offset 2166- nvk: Drop image_view_init 2167- nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable 2168- nvk: Allow R32_UINT 2169- nvk: Mark nvk_push_descriptor_set_ref() inline 2170- nvk: Add a descriptor table data structure 2171- nvk: Copy in the nouveau TIC format table 2172- nvk/image_view: Reformat and fix Create/DestroyImageView 2173- nvk: Add an image descriptor table to the device 2174- nvk: Fill out TIC table entries for image views 2175- nvk: Set b->cursor when lowering image intrinsics 2176- nvk: Unify descriptor loading in lower_descriptors 2177- nvk: Re-format nvk_image_view.h a bit 2178- nvk: Re-format nvk_buffer.c a bit 2179- nvk: Add a stub implementation of buffer views 2180- nvk: Make texture descriptors a bit more acceptable to codegen 2181- nvk: GART os host-cache-coherent 2182- nvk: Reserve a null image descriptor 2183- nvk: Rework descriptor writes 2184- nouveau: Add stubs for an image layout library called NIL 2185- nil: Create images 2186- nil: Add the TIC format table from nouveau 2187- nil: Add a nil_view and code to fill out TIC entries 2188- nvk: Add an nvk_get_format helper 2189- nvk: Use helpers for push_ref 2190- nvk: Align arguments consistently in copy/blit code 2191- nvk: Move Fill/UpdateBuffer to nvk_cmd_copy 2192- Revert "nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable" 2193- nvk: Manually offset for array layers in copy/blit 2194- nvk: Convert to using NIL for image layout 2195- nvk: Re-indent image entrypoints 2196- nvk: Implement VkGetImageSubresourceLyout 2197- nvk: Reset and properly clean up command buffer upload areas 2198- nvk: Rework format features queries 2199- nvk: Add a more competent GetPhysicalDeviceImageFormatProperties 2200- nvk: Support compressed images in copy commands 2201- nvk: Drop vk_sync BO refs after push_submit 2202- nil: Drop miptail support for now 2203- nil: Don't minify image dimensions when setting up TIC 2204- nil: Refactor TIC image extent setup 2205- nil: Fix image array layer alignments 2206- nvk: Teture pool sizes are maximums not sizes 2207- nvk: Re-format nvk_sampler.c 2208- nvk: Implement samplers 2209- nil: Add a helper for filling out buffer TIC entries 2210- nvk: Move is_storage_image_format to nvk_format.c 2211- nvk: Implement buffer views 2212- nvk: Advertise KHR_dedicated_allocation 2213- nvk: Use the correct root descriptor table size for CmdDispatch 2214- nvk: Add support for dynamic buffers 2215- nvk: Better advertise image format features 2216- nvk: Advertise descriptor array indexing 2217- nvk: Advertise non-zero descriptor set limits 2218- nvk: Use a descriptor type instead of a hand-rolled thing 2219- nvk: Handle cube storage images properly 2220- nvk: Load the requested descriptor size 2221- nvk: Implement push constants 2222- nvk: Properly indent a comment 2223- nvk: Fix descriptor offset alignment 2224- nvk: Use a switch for descriptor types in load_descriptor 2225- nvk: Support inline uniform blocks 2226- nvk: Delete the storage TIC in nvk_image_view_destroy 2227- nvk: Assert that we don't double-free descriptors 2228- nvk: Initial vkCmdClearImage support 2229- nvk: Unconditionally zero image format properties 2230- nvk: No-op sparse image format properties 2231- nvk: Advertise minUniformBufferOffsetAlignment 2232- nvk: Rework OOM handling for descriptor pools 2233- nvk: Bind immutable samplers on descriptor set creation 2234- nvk: Padd shader BOs by 4K to avoid I-cache overflow 2235- nvk: Include nvk_private.h in everything 2236- nvk: Make image/buffer address helpers const 2237- nouveau/push: Add a P_INLINE_FLOAT helper 2238- nvk: Init WSI after setting up supported_sync_types 2239- nouveau/parser: Fix an integer overflow and a typo 2240- nouveau/parser: Properly dump most arrays used by 3D 2241- nouveau/parser: Better dump float data 2242- nouveau/parser: Handle arrays properly in P_IMMD() 2243- nouveau/push: Make P_IMMD more versatile 2244- nouveau: Null terminate the debug flag list 2245- nouveau: Generate 3D headers 2246- nvk: Add graphics state to command buffers 2247- nvk: Split pipeline binding into helpers 2248- nvk: Switch to vk_pipeline_shader_stage_to_nir 2249- nvk: Don't free the NIR in nvk_compile_nir 2250- nvk: Add an nvk_shader_address helper 2251- nvk: Free pipeline shader BOs 2252- nvk: Expose pipeline alloc/free functions 2253- nvk: Make shader_upload take an nvk_device 2254- nvk/shader: Assign I/O locations and gather info 2255- nvk/shader: Populate headers for vertex and fragment shaders 2256- nvk: Add a nvk_cmd_buffer_device() helper 2257- nvk: Import 3D context init code from nouveau 2258- nil/format: Add helpers for render formats 2259- nvk: Add boilerplate for Begin/EndRendering 2260- nvk: Misc. additional state setup 2261- nvk: Emit dynamic graphics state 2262- nvk: Implement push constants and descriptors for graphics 2263- nouveau: Add CPU push buffers 2264- nvk: Graphics pipelines 2265- nvk: Implement vkCmdDraw() 2266- nvk: Color attachments clears via image clears 2267- vulkan/meta: Add the start of a meta framework 2268- vulkan/meta: Add an object tracking list 2269- vulkan/meta: Add a concept of rect pipelines 2270- vulkan/meta: Implement attachment clears 2271- vulkan/meta: Implement start-of-rendering clears 2272- vulkan/meta: Add implementations of Clear*Image 2273- nvk: Add an attachment format even for secondaries 2274- nvk: Add an addr field to nvk_buffer 2275- nvk: Expose a bind_vertex_buffer helper 2276- nvk: Use vk_meta for CmdClearAttachments 2277- nvk: Stop using vk_cmd_set_dynamic_graphics_state in meta_end() 2278- nvk: Enable all the dynamic state features 2279- nouveau: Fix pushbuf ref reset for user command buffers 2280- nvk: add linear image creation support. 2281- nvk: Use max alignment for descriptor pool sizes 2282- nil: Switch to using the new headers for TIC entries 2283- nvk: Use meta for CmdClear*Image 2284- nvk: Zero client memory objects 2285- nvk: Bind texture and sampler header pools for 3D 2286- nvk: Use the new headers for samplers 2287- nvk: Implement nir_intrinsic_load_frag_coord 2288- vulkan/meta_clear: Populate VkRenderingInfo::renderArea 2289- nvk: Don't assert when there are no attachments 2290- nvk: Track and reference all device memory objects 2291- vulkan: Allow scissors or viewports to be set without counts 2292- nvk/copy: Mape bpp part of nouveau_copy_buffer 2293- nvk: Implement copies for D24_UNORM_S8_UINT images 2294- nvk: Drop sample locations structs 2295- nvk/meta: Save and restore VI state 2296- nvk: Re-initialize dynamic_graphics_state.vi when recycling 2297- nvk: Move the vertex format table into nvk_format.h 2298- nvk: Advertise vertex buffer format featues 2299- nvk: Clean up try_create_physical_device error handling 2300- nouveau/parser: Dump more fields as float 2301- nvk: Depth bounds need fui() 2302- nouveau: Add class information to nouveau_ws_device 2303- nil: Properly depend on nouveau winsys and nvidia-headers 2304- nil: Use nvidia headers for texture format enums 2305- nil: Use the nvidia headers for render target format enums 2306- nil: Use nvidia headers for ZS format enums 2307- nil: Rename rt to czt in the format info struct 2308- nil: Rename rendering to color_target 2309- nil: Re-introduce the format capabilities 2310- nil: Add more format support helpers 2311- nvk: Advertise more format features 2312- nvk: Clear dynamic state dirty after flushing it all 2313- vulkan/meta: Make stencil reference dynamic for clears 2314- nvk: Depth buffers don't allow Z-tiling 2315- nvk: Disable sparse Z on Maxwell+ 2316- nil: Compute PTE kinds and tile modes for images 2317- nouveau: Add a function to allocate a tiled buffer 2318- nvk: Add internal helpers for device memory allocation 2319- nvk: Do internal dedicated allocations for ZS images 2320- nvk: Fix depth/stencil render pass clears 2321- nvk: Fix viewport Z scale 2322- nvk: Enable two-sided stencil 2323- nvk: Flip the front-face setting 2324- nvk: Advertise depth/stencil support 2325- nvk: Don't destroy NULL descriptor pool BOs 2326- nvk: Call nir_lower_input_attachments 2327- nvk: Set GEOMETRY_SHADER_SELECTS_LAYER properly 2328- nvk: Return OUT_OF_DEVICE_MEMORY if bo_new fails 2329- nil: Add a PTE kind for Z32_FLOAT 2330- nvk: Add nvk_queue_init/finish() helpers 2331- nvk: Align descriptor buffers to NVK_MIN_UBO_ALIGNMENT 2332- nvk: Re-flow a couple function prototypes 2333- nvk: Assert samples == 1 2334- nvk: Allocate descriptors for input attachments 2335- nvk: Wire up early z and post depth coverage 2336- nvk: Save/restore push constants around meta ops 2337- nouveau/parser: Add array and float tags for clear values 2338- nvk: Use hardware clears for attachment clears 2339- nvk: Add image_view_init/finish functions 2340- nvk: Implement vkCmdClear*Image directly 2341- nvk: Use a UINT format to clear non-renderable images 2342- nvk: Don't advertise tiling on non-power-of-two formats 2343- nvk: Fix max anisotropy 2344- nvk: Assert on CmdExecuteCommands 2345- nvk: VkSamplerCreateInfo::mipLodBias is signed 2346- nvk: Fix border color alpha 2347- nil/format: Depth/stencil formats appear as red 2348- nil: Fix max mip level 2349- nil: Fix nonnormalized coordinates 2350- nvk: Set up clip and cull distances 2351- nvk: Fix dynamic buffer descriptor copies 2352- nvk: Inline nouveau_copy_linear 2353- nvk/copy: Rename push to p 2354- nvk/blit: Rename push to p 2355- nvk/dispatch: Rename push to p 2356- nvk: Drop most buffer tracking 2357- nvk: Rework TLS/SLM and image/sampler table handling 2358- nvk: Invalidate texture header and sampler caches each submit 2359- nvk/sampler: Free descriptor table entries 2360- nvk: Rework nvk_descriptor_table_add/remove 2361- nvk: Implement descriptor table growing 2362- nvk: Zero unused descriptors 2363- nvk: Add some asserts for nv50 compiler image restrictions 2364- nvk: Update to the new command buffer infrastructure 2365- nvk: Split nvk_queue into its own file 2366- nvk: Start every command buffer with a nop 2367- nvk: Initialize fixed draw/default state once 2368- nouveau/parser: Convert to mako 2369- nouveau/parser: Use more idiomatic python 2370- nouveau/parser: Put the dump helpers in C files 2371- nvk: Use f for extension features 2372- nvk: Drop a TODO 2373- nvk: Use VK_IMAGE_USAGE_*_ATTACHMENT_BIT for image clears 2374- nvk: Increase the graphics pipeline push space 2375- nil: Don't claim texture support for 2-bit SNORM 2376- nouveau/push: Fix a void pointer arithmetic bug 2377- nouveau/parser: Add more arrays 2378- nouveau/mme: Add basic structures for the Turing+ MME 2379- nouveau/mme: Add isaspec XML for the Turing+ MME 2380- nouveau/mme: Add an assembler and disassembler for the Turring+ MME 2381- nouveau/mme: Add a builder for the Turing+ MME 2382- nouveau/mme: Add a tiny simulator for the Turing+ MME 2383- nouveau/mme: Add an isaspec-based dumper 2384- nouveau/mme: Make the winsys headers C++ safe 2385- nouveau/mme: Add unit tests for the Turing+ MME simulator 2386- nvk: Add MME infrastructure 2387- nvk: Use MME for clears 2388- nouveau/mme: Add helper macros for setting fields 2389- nvk: Use MME for vkCmdDraw[Indexed]() 2390- nvk: Implement vkCmdDraw[Indexed]Indirect() 2391- nvk: Use p for the nouveau_ws_push_buffer in zero_vram 2392- nouveau: Add an nv_push struct 2393- nouveau: Rename the fields of vk_push 2394- nouveau: Move nv_push and helpers to their own header 2395- nouveau/parser: Take a FILE* in DUMP_*_MTHD_DATA 2396- nouveau: Move push validate to nv_push.c 2397- nouveau: Move push dumping to nv_push.c 2398- nvk: Use nv_push directly for graphics pipelines 2399- nouveau: Add a nouveau_ws_bo_new_mapped helper 2400- nvk: Use bo_new_mapped for the zero page 2401- nvk: Always allocate empty_push 2402- nvk: Move queue_sumbit to nvk_queue_drm_nouveau.c 2403- nvk: Submit pushbufs directly 2404- nvk: Use a regular BO for the empty push 2405- nvk: Use a regular BO for the queue state push 2406- nvk: Add an nvk_queue_submit_simple helper 2407- nvk: Initialize the queue later in device setup 2408- nvk: Use submit_simple for draw state init 2409- nvk: Use queue_submit_simple for zero_vram 2410- nvk: Break nvk_cmd_pool into its own file 2411- nvk: Use cmd instead of cmd_buffer 2412- nvk: Add BO recycling to the command pool 2413- nvk: Return VkResult from nvk_cmd_buffer_upload_alloc 2414- nvk: memcpy root descriptors for compute instead of doing a DMA 2415- nvk: Fully populate QMDs before uploading 2416- nvk: Constant buffer alignment is actually 64B 2417- nvk: Rework side-band data upload 2418- nvk: Add an nvk_cmd_buffer_push helper 2419- nvk: Add an nvk_cmd_buffer_ref_bo helper 2420- nvk: Allocate upload buffers from the command pool 2421- nvk: Use nvk_cmd_bo for push bufs 2422- nvk: Implement vkCmdExecuteCommands() 2423- nvk: Remove remaining references to nouveau_push.h 2424- nouveau: Use DRM interfaces directly in MME tests 2425- nouveau: Drop nouveau_ws_push 2426- nvk: Re-indent vk_instance.c 2427- nvk: Use vk_object_zalloc/free for descriptor pools/sets 2428- nvk: Fix up whitespace in nvk_descriptor_set.c 2429- nvk: Implement VK_KHR_push_descriptor 2430- nvk: Reference descriptor set layouts in the sets themselves 2431- nvk: Embed a nv_device_info in nvk_physical_device 2432- nvk: Add an nvk_queue_submit wrapper 2433- nvk: Also store the push BO map in nvk_queue_state 2434- nvk: Bring back push sync and dumping 2435- nvk: drop nvk_nir.h 2436- nvk: Add lowering for load_global_constant_bounded 2437- nvk: Properly implement robustBufferAccess 2438- vulkan/meta: Add key types 2439- vulkan/meta: Add a helper for image view types 2440- vulkan/meta: Add a create_sampler helper 2441- vulkan/meta: Fixes for clear 2442- vulkan/meta: Implement vkCmdBlitImage() 2443- nvk: Support load_layer_id 2444- nvk/meta: Save/restore descriptor set 0 2445- nvk: Use meta for doing blits with the 3D hardware 2446- nvk: WFI in pipeline barriers 2447- util/vma: Allow initializing zero-size heaps 2448- nvk: Rework nvk_queue_submit_simple() 2449- nvk: Add a heap data structure 2450- nvk: Return a VkResult from nvk_shader_upload() 2451- nvk: Add a shader heap to nvk_device 2452- nvk: Allocate shaders from a heap 2453- nvk: Rework whitespace in nvk_device_memory.c 2454- nvk: Style fixes in nvk_physical_device.c 2455- nvk: Reset semaphore syncs on wait 2456- nvk/wsi: Style fixes 2457- nvk/wsi: Use the common present implementation 2458- nouveau/parser: Parse all fields in each method 2459- nvk: Add a query pool object 2460- nvk: Implement timestamp queries 2461- nvk: Implement pipeline statistics and occlusion queries 2462- nouveau/mme: Allow ZERO as the destinatio nof mme_load_to 2463- nouveau/mme: Assert on OOB registers 2464- nouveau/mme: Add support for freeing registers 2465- nouveau/mme: Add a couple helpers for working 64-bit addresses 2466- nouveau/mme: Add a helper for MME_DMA_READ_FIFOED 2467- nvk: Use mme_tu104_read_fifoed() 2468- nvk: Implement vkCmdCopyQueryPoolResults() 2469- nvk: Handle large command buffer uploads better 2470- nvk: Use a normal DMA for CmdUpdateBuffer 2471- nouveau/parser: Handle 6F methods 2472- nvk: Use mme_load_addr64() 2473- nvk: Use poll for BO waits 2474- nvk: Events 2475- nvk: Don't crash if we fail to allocate a push BO 2476- nvk: Stop leaking command pool BOs 2477- nvk: Enable VK_KHR_create_renderpass2 2478- nvk: Advertise VK_KHR_imageless_framebuffer 2479- nvk: Flush the current pushbuf before allocating a new one 2480- nvk: Advertise VK_KHR_separate_depth_stencil_layout 2481- nvk: Tell WSI we don't support legacy scanout 2482- nouveau: Add PCI information to nv_device_info 2483- nvk: Implement VK_EXT_pci_bus_info 2484- nvk: Bind 3D images as 3D for clears 2485- nvk: Support copies between 3D and 2D images 2486- nil: Add a helper for getting 2D views of 3D images 2487- nvk: Support 2D views of 3D images 2488- nvk: Advertise VK_KHR_maintenance1 2489- nvk: Use 2D array views for 3D storage images 2490- nil: Fix include guards in nil_image.h 2491- nvk: Advertise custom border color features 2492- vulkan: Add a helper for swizzling color values 2493- nvk: Implement VK_EXT_border_color_swizzle 2494- nvk: Advertise VK_EXT_extended_dynamic_state3 2495- nvk: Move more states to dynamic 2496- nvk: Advertise VK_KHR_storage_buffer_storage_class 2497- nvk: Add a helper for pushing descriptors 2498- nouveau/headers: Add generated headers to dependencies 2499- nvk: Implement VK_EXT/KHR_buffer_device_address 2500- nvk: Break the guts of CmdDispatch into a helper 2501- nvk: Implement DispatchIndirect 2502- nouveau/mme: Add a mul64 helper 2503- nvk: Implement CS invocations statistics queries 2504- nil: Use ONE for the anixotropic coarse spread function 2505- nil: Properly support MSAA 2506- nil: Add an offset4d struct and some helpers 2507- nouveau/parser: Sort METHOD_ARRAY_SIZES 2508- nouveau/parser: Handle SET_ANTI_ALIAS_SAMPLE_POSITIONS 2509- nvk: Stop asserting on MSAA 2510- nvk: Handle zero color attachments better 2511- nvk: Handle multisampled render targets properly 2512- nvk: Support copies of MSAA images 2513- nvk: Use the right view format for stencil texturing 2514- nvk: Pass through a shader key for fragment shaders and MSAA 2515- nvk: Set correct multisample regs for graphics pipelines 2516- nvk: Stop creating a new upload BO every time 2517- nvk: Fill out sample locations on Maxwell B+ 2518- vulkan/meta: Bind whole LODs of 3D blit destinations 2519- vulkan/meta: Add a helper for building texture ops 2520- vulkan/meta: Break the guts of blit into a helper 2521- vulkan/meta: Support writing stencil as iterative discard 2522- vulkan/meta: Rename vk_meta_blit.c to vk_meta_blit_resolve.c 2523- vulkan/meta: Add support for MSAA resolves 2524- nvk/meta: Fix restore for descriptor set 0 2525- nvk: Use meta for MSAA resolves 2526- nvk: Replace gl_SamplePosition with fract(gl_FragCoord.xy) 2527- nvk: Stop advertising higher framebufferNoAttachmentsSampleCounts 2528- nvk: Advertise MSAA via image format properties 2529- nvk: Advertise VK_KHR_depth_stencil_resolve 2530- nvk: Assert that descriptor buffer access stays in-bounds 2531- nvk: Add a bo size to nvk_descriptor_set 2532- nvk/format: Style fix for VkFormatProperties3KHR 2533- nvk: Support VK_FORMAT_B10G11R11_UFLOAT_PACK32 for vertex buffers 2534- nvk: Add a devenv ICD json file 2535- nvk: Advertise EXT_vertex_attribute_divisor 2536- nvk: Lower image_size to txs 2537- nvk: Fix a comment 2538- nvk: Add an nvk_buffer_addr_range helper 2539- nvk: Use nvk_buffer_addr_range for buffer descriptors 2540- nvk: Re-order Vulkan 1.0 feature bits 2541- nvk: Enable inheritedQueries 2542- nvk: Enable VK_EXT_provoking_vertex 2543- nvk: Advertise samplerMirrorClampToEdge via 1.2 features 2544- nvk: Advertise VK_KHR_bind_memory2 2545- nvk: Enable KHR_dynamic_rendering 2546- nvk: Advertise KHR_uniform_buffer_standard_layout 2547- nvk: Advertise EXT_index_type_uint8 2548- nvk: Advertise VK_EXT_separate_stencil_usage 2549- nvk: Capitalize NVK in user exposed strings 2550- nvk: Rename grid_size to group_count 2551- nvk: Lower load_num_workgroups ourselves 2552- nvk: Drop block_size from the root descriptor table 2553- nvk: Add a helper for loading resource_index-based descriptors 2554- nvk: Set maxMemoryAllocationCount 2555- nouveau/winsys: Take a drmDevicePtr in nouveau_ws_device_new() 2556- nouveau/winsys: Add an info to nouveau_ws_device 2557- nouveau/winsys: Move device type into nv_device_info 2558- nouveau/nil: Take an nv_device_info for image functions 2559- nouveau/nil: Use nv_device_info for format queries 2560- nouveau/mme: Invoke SET_OBJECT in the tests 2561- nouveau/mme: Make alu_op_to_str static 2562- nouveau/mme: Move mme_value into its own header 2563- nouveau/mme: Add a mme_reg_alloc struct 2564- nouveau/mme: Add an intermediate MME_ALU_OP enum 2565- nouveau/mme: Add an intermediate MME_CMP_OP enum 2566- nouveau/mme: Use mme_mov() for temp copies of register IMM32 sources 2567- nouveau/mme: Make helpers less Turing specific 2568- nouveau/mme: Break the Turing builder guts into a separate header 2569- nouveau/mme: Move the guts of mme_merge_to() into mme_tu104_builder.c 2570- nouveau/mme: Move the guts of mme_state_arr_to() into mme_tu104_builder.c 2571- nouveau/mme: Drop the implicit_imm parameter from mme_alu_to() 2572- nouveau/mme: Move the cf_stack struct to mme_builder.h 2573- nouveau/mme: Prepare the builder for multiple GPU generations 2574- nouveau/mme: Take an nv_device_info in mme_builder_init 2575- Support immediates in MERGE 2576- Add add immediate optimizations 2577- nvk: Add support for contiguous heaps to nvk_heap 2578- nvk: Use a contiguous shader heap pre-Volta 2579- nvk: Disable indirect draw/dispatch and query copy MMEs for now 2580- nvk: Free a couple regs in nvk_mme_build_draw_*() 2581- nvk: Properly align root descriptor tables for pre-Pascal 2582- nvk: Compile all NIR before running codegen 2583- vulkan/meta: Insert a geometry shader when needed 2584- nvk: Use a GS for layerered rendering pre-MaxwellB 2585- nvk: Handle zero-size index and vertex buffers pre-Turing 2586- nvk: Cosmetic clean-ups to Create/DestroyDevice 2587- nil: Only choose a PTE kind for tiled images 2588- nouveau/mme: Fix is_int18 for negative numbers 2589- nouveau/mme: Don't swap x and y in mme_fermi_merge_to() 2590- nouveau/mme: Take a const nv_device_info in mme_builder_init 2591- nouveau/mme: Unify some of the test framework 2592- nouveau/mme: Add some generic builder tests 2593- nouveau/mme: Add builder tests for SUB 2594- nouveau/mme: Use a uint32_t for size in mme_fermi_bfe() 2595- nouveau/mme: nouveau/mme: Add builder tests for SLL and SRL 2596- nvk/drm: Take a byte offset/range in push_add_push 2597- nvk: Rework nvk_cmd_push a bit 2598- nvk: Add a helper for pushing indirect data 2599- nvk: Make some MME builder names more consistent 2600- nouveau/mme: Don't allow WaW dependencies in the same Turing instruction 2601- nvk: Reduce register pressure in nvk_mme_build_draw*() 2602- nouveau/push: Add an NV_PUSH_MAX_COUNT #define 2603- nvk: Implement Draw*Indirect on pre-Turing 2604- vulkan/meta: Use the new NIR texture helpers 2605- nvk: Add a build test for MMEs 2606- nvk: Don't over-size push descriptor sets 2607- nvk: Return VK_ERROR_INCOMPATIBLE_DRIVER if the PCI vendor isn't NVIDIA 2608- nvk: Bump init context batch size 2609- nouveau/mme: Fix nested while instructions on Turing+ 2610- nouveau/mme: Add a helper to dump instructions 2611- nvk: Rework extension enables 2612- nvk: Rework features enables 2613- nvk: Advertise shaderImageGatherExtended 2614- nouveau/mme: Add a bfe helper 2615- nouveau/mme: Ensure that zero-initizlied mme_value is ZERO 2616- nvk: De-duplicate MME code for setting draw params 2617- nvk: Clamp viewport clip to max range 2618- nvk: Use the same lock for the submit and the memory objects list 2619- nvk: Advertise ICD/loader interface version 4 2620- nvk: Add instace WSI entrypoints 2621- nouveau/mme: Use ADD for ine with an immediate 2622- nouveau/mme: Fix while loops pre-Turing 2623- nvk: Add begin to mme_scratch 2624- nvk: Use the new load/store_scratch helpers for DRAW_PAD_DW 2625- nouveau/mme: Add a helper for re-allocating registers 2626- nvk: Rework spill helpers and DRAW_COUNT spilling 2627- nvk: Spill DRAW_IDX pre-Turing 2628- nvk: Break the inner MME draw loop into a helper 2629- nvk: Increase the push runout to 512 dwords 2630- nil: Add a nil_image_for_level helper 2631- nil: Add an image_level_as_uncompressed helper 2632- nvk: Implement uncompressed views of compressed images 2633- nvk: Set pointClippingBehavior 2634- nvk: Expose VK_KHR_maintenance2 2635- nvk: Add a separate #define for SSBO alignment 2636- nvk: Set spirv_to_nir_options::min_*_alignment 2637- nvk: Use vk_device_memory 2638- nvk: Implement VK_KHR_map_memory2 2639- nvk: Sort SPIR-V caps 2640- nvk: Advertise EXT_shader_viewport_index_layer on MaxwellB+ 2641- nvk: Only use view_id for layer in multiview 2642- nvk/heap: Set the right pitch for heap resize copies 2643- nvk: Advertise shaderStorageImageReadWithoutFormat 2644- nvk: Fix the NO_PREFETCH assert for CmdDrawIndirect 2645- nvk: Advertise KHR_spirv_1_4 2646- nvk: s/device/dev in nvk_image.c 2647- nvk: Add helpers for binding image planes 2648- nvk: Take an nvk_image_plane in nouveau_copy_rect_image 2649- nvk: Use the max descriptor alignemtn in GetDescriptorSetLayoutSupport 2650- nvk: Use NVIDIA_VENDOR_ID in pdev try_create() 2651- nvk: Use abbreviated names in nvk_device_memory.c 2652- nvk: Add device and driver UUIDs 2653- nvk: Add external memory queries 2654- nvk: Dedicated allocations override internal 2655- nvk: Require dedicated allocations for external images 2656- nouveau/winsys: Add dma-buf import support 2657- nvk: Support dma-buf import 2658- nvk: Support dma-buf export 2659- nvk: Enable external memory extensions 2660- nvk: Reformat nvk_buffer.c 2661- nvk: Add a buffer alignment helper 2662- nvk: Add an addr field to nvk_image_plane 2663- nvk: Use canonical variable names in nvk_physical_device.c 2664- nvk: Use canonical variable names in nvk_shader.c 2665- nvk: Use canonical variable names in nvk_bo_sync.c 2666- nvk: Use canonical variable names in nvk_sampler.c 2667- nvk: Drop nvk_physical_device::instance 2668- nvk: Only advertise EXT_pci_bus_info on discrete GPUs 2669- nouveau: Put PCI info in a pci substruct in nv_device_info 2670- nouveau: Stop using hex for SM numbers 2671- nvk: Set deviceType based on nv_device_info::type 2672- nouveau: Move more stuff into nv_device_info 2673- nouveau: Move gart_size to nv_device_info 2674- nvk: Use nv_device_info for class checks 2675- nvk: Rename nvk_device::ctx to ws_ctx 2676- nvk: Add a ws_dev to nvk_device and use it 2677- nvk: Move the winsys device to nvk_device 2678- nvk: Don't enumerate pre-Kepler GPUs 2679- nvk: Implement VK_EXT_physical_device_drm 2680- nvk: Require an environment variable for poorly tested hardware 2681- nvk: Use the new core vk_sampler struct 2682- Revert "vulkan: Allow scissors or viewports to be set without counts" 2683- vulkan/meta: Add a get_pipeline_layout helper 2684- vulkan/meta: Use vk_meta_get_pipeline_layout in blit/resolve 2685- nvk: Bind 3D depth/stencil images as 2D arrays 2686- nvk: Flush more state on VI_BINDINGS_VALID dirty 2687- nvk: Don't skip zero-size bindings in GetDescriptorSetLayoutSupport 2688- docs: Add a docs page for NVK 2689- docs: Add NVK to features.txt 2690- docs/relnotes: Stick something about NVK in new_features.txt 2691- nouveau: Drop GART size from nv_device_info 2692- nil: Add a nil_image_level_extent_px() helper 2693- nvk: Use the new NIL helper for image level extents for copies 2694- nvk: Improve image format properties and limits 2695- nvk: Rework multi-plane format features a bit 2696- nvk: Use nvk_root_descriptor_offset for drawInfoBase 2697- nvk: Add a root_desc_addr to the root descriptor table 2698- nvk: Add support for variable pointers 2699- nvk: Enable the SPIR-V DeviceGroup capability 2700- nvk: Separate the MME query copy code out a bit 2701- nvk: Implement CopyQueryPoolResults with a compute shader 2702- nvk: Misc. style nits 2703- nvk: Rework memory requirements to handle aspects correctly 2704- nvk: Implement the maintenance5 image layout queries 2705- nvk: Use VkBufferUsageFlags2 2706- nvk: Implement CmdBindIndexBuffer2KHR 2707- nvk: Implement GetRenderingAreaGranularityKHR 2708- nvk: Decorate CmdBegin/EndRendering entrypoints 2709- nouveau: Move shader topology info to nv_device_info 2710- drm-uapi: Import nouveau_drm.h 2711- nouveau/winsys: Use the imported nouveau_drm.h headers 2712- nvk: Use the imported nouveau_drm.h headers 2713- nouveau/shim: Use the imported nouveau_drm.h headers 2714- nouveau/mme: Support the new UAPI 2715- nvk: Use an empty EXEC for the empty submit case 2716- nouveau/winsys: Allow nouveau_ws_device_new() without VM_BIND 2717- nvk: Print an error message if VM_BIND support is missing 2718- nvk: Enable the new UAPI 2719- nvk: Use more consistent device variable names 2720- nvk: Call nir_lower_int64 2721- nir/gl: Move glsl_type::sampler_target() into a helper in its one caller 2722- nvk: Remove plane sources from tex instructions 2723- nvk: Use common physical device properties 2724- nv50/ir: Rework conversions for texture array indices 2725- clang-format: Add nir_foreach_reg_* 2726- clang-format: nir_foreach_src is not a foreach macro 2727- clang-format: Set the default ColumnLimit to 0 2728- nir: Re-align a couple enums and add clang-format comments 2729- nir: Don't clang-format const_value helpers 2730- nir: Don't clang-format a couple typedefs 2731- nir: Don't clang-format debug print setup 2732- nir: More manual formatting 2733- nir: Pretty format type mapping helpers 2734- nir: Wrap pass macros in braces 2735- nir: Add a do to the do/while in nir_const_value_t_array() 2736- nir: Add a .clang-format file 2737- nir: clang-format src/compiler/nir/\*.[ch] 2738- nvk: Don't use nir_ssa_for_src() 2739- nir: Drop most instances of nir_ssa_dest_init() 2740- nir: Drop more instances of nir_ssa_dest_init() 2741- nir/clone: Clone nir_def nor nir_dest 2742- nir/serialize: [De]serialize nir_def nor nir_dest 2743- nir: Drop nir_ssa_dest_init() 2744- nir: Drop nir_ssa_dest_init_for_type() 2745- nir: nir_foreach_ssa_def() -> nir_foreach_def() 2746- st,zink,sfn: Use nir_foreach_def instead of nir_foreach_dest 2747- dxil: Use nir_foreach_def() instead of nir_foreach_dest() 2748- nir/from_ssa: Use nir_foreach_def() instead of nir_foreach_dest() 2749- nir: Drop nir_foreach_dest() 2750- intel/vec4: Stop passing around nir_dest 2751- intel/fs: Stop passing around nir_dest and nir_alu_dest 2752- broadcom: Stop using nir_dest directly 2753- vc4: Stop passing around nir_dest 2754- nir,ntt,a2xx,lima: Stop using nir_dest directly 2755- lima: Stop using nir_dest directly 2756- etnaviv: Stop passing around nir_dest 2757- r600/sfn: Stop passing around nir_dest and nir_alu_dest 2758- nv50/ir: Stop passing around nir_dest and nir_alu_dest 2759- nir/gather_types: Stop passing around nir_dest 2760- nir/dce: Stop passing around nir_dest 2761- nir/propagate_invariant: Stop passing around nir_dest 2762- nir/validate: Replace all dest validation with validate_def 2763- nir/print: Replace all dest printing with print_def 2764- nir: Get rid of nir_dest_bit_size() 2765- nir: Get rid of nir_dest_num_components() 2766- nir: Get rid of nir_dest_is_divergent() 2767- nir: Drop nir_alu_dest 2768- nir: Drop nir_dest 2769- util/format: 8-bit interleaved YUV formats are UNORM 2770- gallivm: Support G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM 2771- blorp: Use R8G8_UINT for YCRCB_* formats with CCS 2772- anv: Disable CCS_E for ISL_FORMAT_YCRCB_* 2773- vulkan/format: Use correct swizzle for 1-plane YCbCr formats 2774- gallivm: Drop the Vulkan YUV format hacks 2775- nir: Rename nir_instr_type_ssa_undef to nir_instr_type_undef 2776- nir s/nir_get_ssa_scalar/nir_get_scalar/ 2777- nir: s/live_ssa_def/live_def/ 2778- nir: s/nir_instr_ssa_def/nir_instr_def/ 2779- nir: Rework nir_scalar_chase_movs a bit 2780- nir: Fix nir_op_mov handling in nir_collect_src_uniforms 2781- nir: Handle nir_op_mov properly in opt_shrink_vectors 2782- nir: Don't handle nir_op_mov in get_undef_mask in opt_undef 2783- nir: Clean up nir_op_is_vec() and its callers 2784- nir/large_constants: Use nir_component_mask_t 2785- nir/large_constants: Add read/write_const_values helpers 2786- nir/opt_large_constants: Add Small constant handling 2787- spirv: Re-emit constants at their uses 2788- nir: Take a nir_def * in nir_tex_instr_add_src() 2789- nir: Take a nir_def * in nir_phi_instr_add_src() 2790- nir/opt_undef: Don't rewrite a bcsel to mov 2791- nir: Add a nir_instr_clear_src() helper and use it 2792- nir: Add and use a nir_instr_init_src() helper 2793- nir: Drop nir_if_rewrite_condition() 2794- nir: Drop most uses of nir_instr_rewrite_src_ssa() 2795- nir: Drop nir_instr_rewrite_src_ssa() 2796- nir: Drop most uses if nir_instr_rewrite_src() 2797- nir: Drop nir_instr_rewrite_src() 2798- nir: Drop nir_push_if_src() 2799- nir: Fix metadata in nir_lower_is_helper_invocation 2800- nir: Use nir_shader_intrinsic_pass() a few places 2801- drm-uapi: Sync nouveau_drm.h 2802- nvk: Plumb no_prefetch through to the DRM back-end 2803- nouveau/mme: Fix a compile warning 2804- intel/isl: Rename ISL_TILING_Yf/s to ISL_TILING_SKL_Yf/s 2805- intel/isl: Add ICL variants of Yf and Ys tiling 2806- intel/isl: Implement correct tile size calculations for Ys/Yf 2807- intel/isl: Use the depth field of phys_level0_sa for GFX4_2D 3D surfaces 2808- intel/isl: Fill out the correct phys_total_extent for Ys/Yf/Tile64 2809- intel/isl: Indent uncompressed surface code 2810- intel/isl: Support Ys, Yf & Tile64 in isl_surf_get_uncompressed_surf 2811- intel/isl: Support Yf/Ys tiling in surf_fill_state 2812- intel/isl: Support Yf/Ys tiling in emit_depth_stencil_hiz 2813- intel/isl: Add initial data-structure support for miptails 2814- intel/isl: Add support for computing offsets with miptails 2815- intel/isl: Support miptails in isl_surf_get_uncompressed_surf 2816- intel/isl: Start using miptails 2817- intel/isl: Disallow CCS on 3D surfaces with miptails 2818- intel/isl: Allow Ys tiling 2819- anv: Align memory VA to support for Ys, Tile64 tiled images 2820- nvk: Clean up includes 2821- nvk: Add include guards to nvk_bo_sync.h 2822- nvk: SPDX everything 2823- nouveau/nil: SPDX everything 2824- nouveau/mme: SPDX everything 2825- nvk: Don't add a dummy attachment when gl_SampleMask is written 2826- nvk: Set the discard bit for Z/S self-deps 2827- nvk: Invalidate the texture cache in PipelineBarrier 2828- nvk: Lower interp_at_sample to interp_at_offset 2829- nvk: Disable statistics around meta ops 2830- nvk: Clean up viewport math 2831- nvk: Fix depth clipping parameters 2832- nvk: Enable dynamic clip/clamp enable 2833- nvk: Set GUARDBAND_Z_SCALE_1 when Z-clipping 2834- r600: Use more auto-generated nir_builder helpers 2835- r600: Use nir_builder helpers for load/store_shared_r600 2836- nvk: Re-order physical device limits 2837- nvk: Advertise maxMemoryAllocationCount = 4096 2838- nvk: Advertise discreteQueuePriorities = 2 2839- nvk: Rip out old UAPI support 2840- nvk/drm: Drop the push_add_push_bo() helper 2841- nvk/drm: Drop the push_add_bo() helper 2842- nvk: Drop command buffer BO tracking 2843- nvk: Drop memory object tracking 2844- nvk: Drop the device-level mutex 2845- nvk: Get rid of the tiled memory allocation helpers 2846- nvk/drm: Restructure nvk_queue_submit_drm_nouveau() 2847- nvk/drm: Split exec as needed for large command buffers 2848- nvk: Don't store the descriptor pool BO in the set 2849- nvk: Store a 20-bit driver_build_sha in nvk_instance 2850- nvk: Hook up the disk cache 2851- nvk: Re-structure early shader compilation a bit 2852- nvk: Add a default pipeline cache 2853- nvk: Cache NIR shaders 2854- nvk: Init pipelineCacheUUID 2855- drm-uapi: Sync nouveau_drm.h 2856- nvk: Take GETPARAM_EXEC_PUSH_MAX into account 2857- nvk: Handle zero-sized sparse buffers 2858- nvk: Use align() and align64() instead of ALIGN_POT 2859- nouveau: Generate headers for Maxwell B compute 2860- nvk: Add a nvk_cmd_buffer_compute_cls() helper 2861- nvk: Invalidate sampler/texture header caches in BeginCommandBuffer() 2862- nvk: Invalidate SKED caches at the top of command buffers 2863- nvk: Advertise more inline uniform block limits 2864- nvk: Emit MME_DMA_SYSMEMBAR before indirect draw/dispatch 2865- nvk: Set max descriptors to 2^20 for most descriptor types 2866- nvk: Reset descriptor pool allocator when all sets are destroyed 2867- nil/format: Use A for alpha blend 2868- nil/format: Advertise R10G10B10A2_UINT texture buffer support 2869- nvk: Disable depth or stencil tests when unbound 2870- nvk: Always emit at least one color attachment 2871- nvk: Improve address space and buffer size limits 2872- nvk: Always set pixel_min/max_Z to CLAMP 2873- nvk: Use nouveau_ws_bo_unmap() instead of munmap() 2874- nvk: Free the disk cache 2875- nvk: Add an nvk_shader_finish() helper 2876- nvk: Handle unbinding images and buffers 2877- nvk: Clean up the disk cache on physical device create fail path 2878- vulkan/wsi: Allow for larger linear images 2879- nvk: Add a nvk_cmd_buffer_dirty_render_pass() helper 2880- nvk: Re-sort device features 2881- nvk: Implement VK_EXT_depth_bias_control 2882- nvk: Advertise VK_KHR_workgroup_memory_explicit_layout 2883- nvk: Implement VK_EXT_image_sliced_view_of_3d 2884- nvk: Advertise VK_EXT_primitive_topology_list_restart 2885- nvk: Advertise VK_EXT_attachment_feedback_loop_layout 2886- features: Mark VK_EXT_attachment_feedback_loop_layout done for NVK 2887- nvk: Re-arrange Vulkan 1.2 features to match the header 2888- nvk: Advertise shaderOutputLayer and shaderOutputViewportIndex 2889- nvk: Enable descriptorIndexing 2890- nvk: Implement VK_EXT_dynamic_rendering_unused_attachments 2891- nir: Add a nir_ssa_def_all_uses_are_fsat() helper 2892- nir: Add convert_alu_types to divergence analysis 2893- nir/lower_tex: Add a lower_txd_clamp option 2894- nir: Add a load_sysval_nv intrinsic 2895- nir: Add NV-specific texture opcodes 2896- nir: Add an load_barycentric_at_offset_nv intrinsic 2897- nir: Add a range to most I/O intrinsics 2898- nir: Add NVIDIA-specific I/O intrinsics 2899- nir/lower_bit_size: Fix subgroup lowering for floats 2900- nir: add deref follower builder for casts. 2901- nir: Handle wildcards with casts in copy_prop_vars 2902 2903Felix DeGrood (12): 2904 2905- anv: save a shader source uint32_t hash in gfx/compute pipelines 2906- anv: Add Source hash field to VkPipelineExecutableStatisticKHR 2907- iris: save shader source sha1 in ish 2908- mesa: propagate shader source sha1 from gl_shader to nir_shader 2909- intel: use shader source hash in INTEL_MEASURE 2910- intel/compiler: use shader source hash in shader dump code 2911- anv: add fake sparse support 2912- anv: enable fake sparse for Elden Ring 2913- anv: debug messaging for sparse texture usage 2914- anv: fix frame count reporting in INTEL_MEASURE 2915- anv: set ComputeMode.PixelAsyncComputeThreadLimit = 4 2916- anv: remove CS_FLUSH from query regression 2917 2918Feng Jiang (9): 2919 2920- virgl: Only PIPE_BUFFER with VIRGL_BIND_CUSTOM flag is considered busy during creation 2921- meson: Export winsys function symbols for target va 2922- frontends/va: Add slice_count to AV1 slice_parameter 2923- virgl/video: Add definition of virgl_av1_picture_desc 2924- virgl/video: Add support for AV1 decoding 2925- virgl/video: Enable AV1 decoding 2926- meson: Rename dri-vdpau.dyn to dri.dyn 2927- CODEOWNERS: Add \@flynnjiang for VirGL video 2928- meson: Move video to separate section in meson configuration summary 2929 2930Filip Gawin (1): 2931 2932- crocus: Avoid fast-clear with incompatible view 2933 2934Flora Cui (1): 2935 2936- radeonsi: limit CP DMA to skip holes in sparse bo 2937 2938Francisco Jerez (29): 2939 2940- intel/fs/ra: Define REG_CLASS_COUNT constant specifying the number of register classes. 2941- intel/vec4/ra: Define REG_CLASS_COUNT constant specifying the number of register classes. 2942- intel/compiler: Make MAX_VGRF_SIZE macro depend on devinfo and update it for Xe2. 2943- intel/fs/ra/xe2: Scale up register allocation granularity by 2x on Xe2+ platforms. 2944- intel/eu/xe2+: Fix encoding of various message descriptors for change in register size. 2945- intel/fs: Fix signedness of payload_node_count argument of calculate_payload_ranges(). 2946- intel/fs/xe2+: Fix payload node live range calculations for change in register size. 2947- intel/fs/xe2+: Fix grf_count in post-RA scheduling for updated register file size. 2948- intel/fs/xe2+: Fixes for increased accumulator register width. 2949- intel/fs/xe2+: Scale MAX_SAMPLER_MESSAGE_SIZE by native register size. 2950- intel/eu/xe2+: Update validation of GRF region size to account for Xe2 reg size 2951- intel/fs/xe2+: Allow increased SIMD width for various get_fpu_lowered_simd_width() restrictions. 2952- intel/compiler/xe2+: Represent dispatch_grf_start_reg in native GRF units. 2953- intel/fs/xe2+: Update encoding of FB write message payload. 2954- intel/fs/xe2+: Round up fs_builder::vgrf() size calculation to HW register unit. 2955- intel/fs/xe2+: Scale BRW_MAX_MSG_LENGTH by native register size. 2956- intel/fs/xe2+: Fix payload layout of sampler messages for Xe2 reg size 2957- intel/fs/xe2+: Update GS payload setup for Xe2 reg size. 2958- intel/fs/xe2+: Update TCS payload setup for Xe2 reg size. 2959- intel/fs/xe2+: Update TES payload setup for Xe2 reg size. 2960- intel/fs: Lower unsupported regioning with non-trivial 2D regions on FIXED_GRFs. 2961- intel/fs/xe2+: Update regioning lowering offset alignment checks for Xe2 regs. 2962- intel/fs/xe2+: Fix execution width of SHADER_OPCODE_GET_BUFFER_SIZE for SIMD16 EU. 2963- intel/fs/xe2+: Fix calculation of spill message width for Xe2 regs. 2964- intel/xe2+: Round up size to reg_unit() in fs_reg_alloc::alloc_spill_reg(). 2965- intel/fs/xe2+: Fix URB writes with 0 data components. 2966- intel/fs: Specify number of data components of logical URB writes via control immediate. 2967- intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB writes. 2968- intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB reads. 2969 2970Frank Binns (10): 2971 2972- pvr: clang-format fixes 2973- pvr: skip setting up SPM consts buffer when no const shared regs are used 2974- pvr: cleanup SPM EOT dynarray after upload 2975- pvr: treat VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT as not supported 2976- pvr: remove pvr_pbe_get_src_pos() 2977- pvr: fix attachments segfault in pvr_is_stencil_store_load_needed() 2978- pvr: fix allocation size of clear colour consts shared regs buffer 2979- pvr: change a few places to use PVR_DW_TO_BYTES() 2980- pvr: fix setup of load op unresolved msaa mask 2981- pvr: emit PPP state when vis_test dirty bit is set 2982 2983Friedrich Vock (19): 2984 2985- radv/ci: Set DRIVER_NAME in LAVA raven vkcts jobs 2986- radv: Handle VK_SUBOPTIMAL_KHR in trace layers 2987- ac/msgpack: make fixstrs a const char 2988- ac/sqtt,radv: Split internal and API hash in PSO correlations 2989- ac/rgp: Write lds_size metadata 2990- ac/rgp: Add metadata for separate-compiled RT stages 2991- radv/sqtt: Move record filling to helper function 2992- radv/sqtt: Unregister records based on hash 2993- radv/sqtt: Write LDS size metadata in code objects 2994- radv/sqtt: Handle separately-compiled RT pipelines 2995- ac/sqtt,radv/sqtt: Add and use marker for separate RT compilation 2996- nir/load_store_vectorize: Handle intrinsics with constant base 2997- radv/rt: Pre-initialize instance address 2998- radv: Initialize shader freelist on allocation 2999- radv: Fix check in insert_block 3000- radv/rt: Reject hits within 10ULP of previous hits in emulated RT 3001- radv/rra: Recognize LPDDR memory 3002- radv/rmv: Recognize LPDDR memory 3003- vulkan: Don't use set_foreach_remove when destroying pipeline caches 3004 3005Ganesh Belgur Ramachandra (5): 3006 3007- radeonsi: stores bottom_edge_rule option in the rasterizer state 3008- radeonsi: sets OPTIMAL_BIN_SELECTION to 0 if using bottom_edge_rule 3009- radeonsi: "clear_render_target" shader in nir 3010- radeonsi: "clear_render_target_1d_array" shader in nir 3011- radeonsi: "clear_12bytes_buffer" shader in nir 3012 3013Georg Lehmann (39): 3014 3015- aco/gfx11: fix get_gfx11_true16_mask with v_cmp_class_f16 3016- aco: improve get_gfx11_true16_mask description 3017- aco: combine a & ~b to bfi(b, 0, a) 3018- aco/gfx11: use v_cmp_class_f16 with opsel for bitnz/bitz 3019- aco: fix non constant 16bit bitnz/bitz 3020- ac/nir: handle more special cases in ac_nir_unpack_arg 3021- aco: use s_bitreplicate_b64_b32 to set exec to 0xffff0000ffff0000 3022- nir/opt_intrinsics: optimize (exclusive_scan(op, a) op a) to inclusive scan 3023- aco: always use rtne for fquantize2f16 3024- nir/opt_if: also rewrite uniform uses for read_invocation 3025- nir: unify lower_bitfield_insert with has_{bfm,bfi,bitfield_select} 3026- nir: unify lower_bitfield_extract with has_bfe 3027- nir: unify lower_find_msb with has_{find_msb_rev,uclz} 3028- aco: fix u2f16 with 32bit input 3029- aco: combine a | ~b to bfi(b, a, -1) 3030- aco: use v_cvt_f32_ubyte for signed casts too 3031- nir: add nir_scalar intrinsic helpers 3032- nir: add nir_scalar_equal 3033- aco: implement some exclusive scans with inclusive scans 3034- aco/gfx11: don't use bfe for local_invocation_id if the others are always 0 3035- nir/opt_algebraic: remove broken fddx/fddy patterns 3036- aco: simplify masked swizzle dpp selection by removing or_mask first 3037- aco: fix p_extract with v1 dst and s1 operand 3038- aco: implement 64bit div find_lsb 3039- nir: scalarize masked_swizzle_amd created from shuffle_xor 3040- aco/optimizer: check if we can use omod before labeling it 3041- aco/optimizer: copy propagate to output modifier instructions 3042- aco: remove -0.0 for 32 bit fsign with mul_legacy/omod when denorms are flushed 3043- nir: make quad intrinsic dst bit size match src0 3044- nir/lower_subgroups: use intrinsic builder more 3045- aco: assume new generations are unsupported by clrx 3046- aco: assume newer generation will use GFX11 wait_imm packing 3047- aco: print final ir instead if printing asm is unsupported 3048- aco/gfx11: optimize dual source export 3049- aco/gfx11: apply clamp/omod to vinterp 3050- aco: support v_fma_f32_dpp as fma_mix 3051- aco/gfx11: support vinterp as fma_mix 3052- aco: add missing scc def for SALU quad broadcast 3053- aco/sched: treat p_dual_src_export_gfx11 like export 3054 3055George Ouzounoudis (38): 3056 3057- nouveau/codegen: Support compact clip distances with arrayed_io 3058- nouveau/codegen: Handle nir op amul 3059- nouveau/codegen: Fix compact patch varyings in case of NIR 3060- nouveau/codegen: Add capability to pre-specify tessellation domain 3061- nvk: Do not increment instance id across draws 3062- nvk: Add a macro for root descriptor table byte offsets 3063- nvk: Set base vertex state in sequential mme draw 3064- nvk: Support base instance in instanced draw calls 3065- nvk: Switch point rasterization to point sprites 3066- nvk: Support large points 3067- nvk: Compile geometry shaders 3068- nouveau/mme: Keep device info in mme_builder 3069- nvk: Simplify mme build function argument 3070- nvk: Support VK_KHR_shader_draw_parameters 3071- nvk: Support for vertex shader transform feedback 3072- nvk: Support transform feedback indirect draws 3073- nvk: Support transform feedback geometry streams 3074- nvk: Support transform feedback queries 3075- nvk: Support vertex shader transform feedback on Fermi 3076- nvk: Disable PRIMITIVE_RESTART_VERTEX_ARRAY by default 3077- nvk: Fix geometry shader active stream mask 3078- nvk: Support geometry shaders 3079- nvk: Basic tessellation shader support 3080- nvk: Assign locations correctly for arrayed IO 3081- nvk: Enable multiview with tessellation shader 3082- nvk: Fix cases where execution mode is specified in the tesc shader. 3083- nvk: Respect tessellation domain origin state 3084- nvk: Lower io to temporaries for tessellation evaluation nir 3085- nvk: Support VkDescriptorSetVariableDescriptorCountLayoutSupport 3086- nvk: Handle cases of descriptor bindings with variable counts 3087- nvk: Add nir non-uniform optimization pass 3088- nvk: Enable descriptor indexing 3089- nvk: Do not keep redundant info for tessellation domain 3090- nouveau/codegen: Do not keep redundant info for tessellation domain 3091- nvk: Enable dynamic line rasterization mode state 3092- nvk: Fix support for VK_EXT_sample_locations 3093- nvk: Support dynamic state for enabling sample locations 3094- nouveau/codegen: Add a 4th optimization level for MemoryOpts 3095 3096Gert Wollny (63): 3097 3098- r600/sfn: Switch to register intrinsics 3099- r600/sfn/tests: add simple copy-prop test with register source 3100- r600/sfn: Allow for larger ALU CF's 3101- r600/sfn: Handle indirect array load/store dependencies better 3102- r600/sfn: Increase LDS fetch schedule priority 3103- r600/sfn: Add peephole optimization to move a dest to the previous op 3104- r600/sfn: reorder the value factory class member declaration a bit 3105- r600/sfn: Add some tests for proper register access 3106- r600/sfn: Print more info if scheduling fails 3107- r600/sfn: remove debug output leftovers 3108- r600/sfn: Fix use of multiple IDX with kcache 3109- r600/sfn: Always check arrays writes before allowing copy propagation 3110- r600/sfn: set block sizes based on chip class 3111- r600/sfn: Fix typo with block type 3112- r600/sfn: override slot count for IfInstr 3113- r600/sfn: Add method to convert to AluGroup directly 3114- r600/sfn: Add flags to check whether a group starts CF and can do that 3115- r600/sfn: make remaining slots a signed value 3116- r600/sfn: on Cayman loading an index register needs only one slot 3117- r600/sfn: Splizt ALU blocks in scheduler to fit into 128 slots 3118- r600/sfn: rework checks for ALU CF emission 3119- r600/sfn: Schedule AR uses befor possible groups 3120- r600: Explicitly force new CF in gs copy shader 3121- r600: Assert when backend wants to create a new ALU CF 3122- r600: don't check possible size of ALU CF 3123- r600: don't use sb disasm to disassamble copy shader 3124- r600: Force CF when emitting a NOP on R600 in gs copy shader 3125- r600/sfn: Don't try to propagate to vec4 with more than one use 3126- r600/sfn: Only switch to other CF if no AR uses are pending 3127- r600/sfn: AR loads should depend on all previous non ALU instructions 3128- r600/sfn: Renumber shader blocks in scheduler 3129- r600/sfn: Track whether a register is ALU clause local 3130- r600/sfn: Use clause local registers in RA 3131- r600/sfn: Take source uses into account when switching channels 3132- r600/sfn: take number of dest values into account 3133- r600: retire SB optimizer 3134- r600/sfn: work around injecting extra CF's to handle hardware bugs 3135- r600: use correct cso pointer for fetch shader 3136- r600/sfn: Make use of four clause local registers 3137- r600/sfn: drop unused ControlFlowInstr type enum 3138- r600/sfn: factor out resource as extra class 3139- r600/sfn: Simplify dependency chain for index loads on EG 3140- r600: print texture resource index mode separately 3141- r600/sfn: Make address split pass obligatory 3142- r600/sfn: rename method resource_base to resource_id 3143- r600/sfn: Add old address to update_indirect_addr 3144- r600/sfn: Sepeate resource and sampler in texture instructions 3145- r600/sfn: get rid of the method to get the index mode 3146- r600/sfn: sort the uniforms of the right shader 3147- r600/sfn: Fix use of scheduled_shader vs shader 3148- virgl: report MIRROR_CLAMP features better 3149- ci: Upref virglrenderer 3150- copyimage: check requested slice early when cube maps are involved 3151- mesa: check numlevels and numlayers when creating a texture view 3152- virgl: Use common clear_texture if host doesn't support the feature 3153- r600/sfn: don't remove texture sources by using the enum value 3154- r600: drop egcm_load_index_reg 3155- r600/sfn: Don't override a chgr pinning during copy propagation 3156- r600/sfn: When simplifying src vec4 pinnings, also check all uses 3157- virgl: Fix logic for reporting PIPE_MIRROR_CLAMP 3158- r600: Add callbacks for get_driver_uuid and get_device_uuid 3159- r600: Link with libgalliumvl, when enabling rusticl this is needed 3160- r600/sfn: Fixup component count only if intrinsic has it 3161 3162Guilherme Gallo (5): 3163 3164- bin/ci: Ensure that all jobs have nodes in DAG 3165- ci/radeonsi: Update flake list 3166- ci/freedreno: Add a new flake 3167- ci/zink: Found some flakes 3168- ci/anv: Catch some flakes 3169 3170Hannes Mann (1): 3171 3172- vulkan/wsi/wayland: Fix detection of tearing control protocol 3173 3174Hans-Kristian Arntzen (2): 3175 3176- wsi/x11: Fix potential deadlock in present ID. 3177- wsi/x11: Don't allow signal_present_id to rewind. 3178 3179Helen Koike (21): 3180 3181- ci: re-add EXTRA_LOCAL_PACKAGES to rootfs 3182- ci: add EXTRA_LOCAL_PACKAGES to apt-get install 3183- docs/ci: Add docs for EXTRA_LOCAL_PACKAGES 3184- ci: disable duplicated pipelines triggered by marge 3185- ci: add --project option to ci_run_n_monitor.py 3186- ci/android: remove strace output from cuttlefish-runner.sh 3187- ci: add locked flag to bindgen-cli on x86_64_build.sh 3188- ci: separate hiden jobs to -inc.yml files 3189- ci/ci_run_n_monitor: add docs for multiple targets 3190- ci/ci_run_n_monitor: print stress test results per job 3191- ci/ci_run_n_monitor: simplify with defaultdict 3192- ci/ci_run_n_monitor: merge print_job_status_change with print_job_status 3193- ci/ci_run_n_monitor: make --target mandatory 3194- ci/ci_run_n_monitor: merge enable_job with retry_job 3195- ci/ci_run_n_monitor: simplify enable/cancel logic in monitor_pipeline() 3196- ci/ci_run_n_monitor: allow <user>/<project> in --project 3197- ci/ci_run_n_monitor: limit repetitions on --stress 3198- ci/marge_queue: add missing python-dateutils to requirements.txt 3199- ci/ci_run_n_monitor: keep monitoring if a job is still running 3200- ci/marge_queue: add pretty_dutation() 3201- ci/ci_run_n_monitor: print job duration time 3202 3203Honglei Huang (7): 3204 3205- virgl/video: Add support for mpeg12 decoding 3206- virgl/video: Add support for vc1 decoding 3207- virgl/video: Add support for jpeg decoding 3208- virgl/video: Add support for hevc10bit decoding. 3209- virgl/video: Add more pipe type in virgl formats convert table 3210- virgl/video: Add jpeg buf start code check 3211- virgl: Enable vp9 hardware decode 3212 3213Hyunjun Ko (3): 3214 3215- anv: use ycbcr_info for P010 format 3216- anv: don't use cmd_buffer after destroyed. 3217- anv: don't flush_llc on gen9 3218 3219Iago Toral Quiroga (100): 3220 3221- nir/trivialize: Move decl_reg to the start of the block 3222- v3dv: stop incrementing UBO indices by one 3223- nir/lower_robustness: drop skip_ubo_0 option 3224- v3dv: fix incorrect key setup 3225- broadcom/compiler: stop asserting on Vulkan environment 3226- broadcom/compiler: use NIR's lowering for dispatch base 3227- broadcom/compiler: move uniform offset lowering from compiler to GL driver 3228- broadcom/compiler: move vulkan's point coord lowering to the driver 3229- v3dv: don't set lower_wpos_pntc for Vulkan 3230- broadcom/compiler: always clamp results from logic ops 3231- broadcom/compiler: drop execution environment from the shader key 3232- v3dv: drop cpu path for buffer to image copies 3233- v3dv: remove unused code 3234- nir/lower_tex: copy backend_flags field when copying a tex instruction 3235- nir/lower_tex: use a callback to check sampler return size packing 3236- squash! v3dv,broadcom/compiler: don't abuse sampler index 3237- v3dv: assert that only tex instructions with sampler state have a sampler src 3238- v3d: fix texture packing lowering 3239- v3d,v3dv: use fquantize2f16 lowering in NIR 3240- v3dv: be more precise in vkGetImageSubresourceLayout 3241- v3dv: handle pPlaneLayouts in VkImageDrmFormatModifierExplicitCreateInfoEXT 3242- v3dv: bump up MAX_UNIFORM_BUFFERS to 16 3243- v3dv: add support for sampling simple 2D linear textures 3244- v3dv: expand sampling from linear image hack to support multi-planar images 3245- v3dv: don't assume that bound descriptors have been written 3246- v3dv: only handle Android Hardware Buffer on Android 3247- v3dv: we can sample from 1D array too 3248- broadcom/compiler: add a couple of shader key helpers 3249- v3d: compute nir sha1 for uncompiled shader state 3250- v3d: use pre-computed shader sha1 for disk cache 3251- v3d: fix RAM shader cache 3252- v3d: get rid of shader_state pointer in v3d_key 3253- broadcom/simulator: reset CFG7 for compute dispatch in v71 3254- broadcom/common: retrieve V3D revision number 3255- broadcom/compiler: update node/temp translation for v71 3256- broadcom/compiler: implement "reads/writes too soon" checks for v71 3257- broadcom/compiler: implement read stall check for v71 3258- broadcom/compiler: add a v3d71_qpu_writes_waddr_explicitly helper 3259- broadcom/compiler: prevent rf2-3 usage in thread end delay slots for v71 3260- broadcom/qpu: add new ADD opcodes for FMOV/MOV in v71 3261- broadcom/qpu: fix packing/unpacking of fmov variants for v71 3262- broadcom/compiler: make vir_write_rX return false on platforms without accums 3263- broadcom/compiler: rename vir_writes_rX to vir_writes_rX_implicitly 3264- broadcom/compiler: only handle accumulator classes if present 3265- broadcom/compiler: don't assign rf0 to temps across implicit rf0 writes 3266- broadcom/compiler: CS payload registers have changed in v71 3267- broadcom/compiler: don't schedule rf0 writes right after ldvary 3268- broadcom/compiler: allow instruction merges in v71 3269- broadcom/qpu: add MOV integer packing/unpacking variants 3270- broadcom/qpu: fail packing on unhandled mul pack/unpack 3271- broadcom/compiler: generalize check for shaders using pixel center W 3272- broadcom/compiler: v71 isn't affected by double-rounding of viewport X,Y coords 3273- broadcom/compiler: update peripheral access restrictions for v71 3274- broadcom/qpu: add packing for fmov on ADD alu 3275- broadcom/compiler: handle rf0 flops storage restriction in v71 3276- broadcom/compiler: enable ldvary pipelining on v71 3277- broadcom/compiler: try to use ldunif(a) instead of ldunif(a)rf in v71 3278- broadcom/compiler: don't assign rf0 to temps that conflict with ldvary 3279- broadcom/compiler: convert mul to add when needed to allow merge 3280- broadcom/compiler: implement small immediates for v71 3281- broadcom/compiler: update thread end restrictions for v7.x 3282- broadcom/compiler: update ldvary thread switch delay slot restriction for v7.x 3283- broadcom/compiler: lift restriction for branch + msfign after setmsf for v7.x 3284- broadcom/compiler: start allocating from RF 4 in V7.x 3285- broadcom/compiler: validate restrictions after TLB Z write 3286- broadcom/compiler: lift restriction on vpmwt in last instruction for V3D 7.x 3287- broadcom/compiler: fix up copy propagation for v71 3288- broadcom/compiler: don't allocate spill base to rf0 in V3D 7.x 3289- broadcom/compiler: improve allocation for final program instructions 3290- broadcom/compiler: don't assign registers to unused nodes/temps 3291- broadcom/compiler: only assign rf0 as last resort in V3D 7.x 3292- v3dv: expose V3D revision number in device name 3293- v3dv/device: handle new rpi5 device (bcm2712) 3294- v3dv: setup render pass color clears for any format bpp in v71 3295- v3dv: setup TLB clear color for meta operations in v71 3296- v3dv: fix up texture shader state for v71 3297- v3dv: handle new texture state transfer functions in v71 3298- v3dv: implement noop job for v71 3299- v3dv: handle render pass global clear for v71 3300- v3dv: GFX-1461 does not affect V3D 7.x 3301- broadcom/compiler: update thread end restrictions validation for v71 3302- v3dv: handle early Z/S clears for v71 3303- v3dv: handle RTs with no color targets in v71 3304- v3dv: don't convert floating point border colors in v71 3305- v3dv: handle Z clipping in v71 3306- v3dv: make v3dv_viewport_compute_xform depend on the V3D version 3307- v3dv: fix depth clipping then Z scale is too small in V3D 7.x 3308- v3d/v3dv: fix texture state array stride packing for V3D 7.1.5 3309- v3d,v3dv: support up to 8 render targets in v7.1+ 3310- v3d,v3dv: don't use max internal bpp for tile sizing in V3D 7.x 3311- v3d,v3dv: propagate NaNs bits in shader state records are reserved in v7.x 3312- v3dv: use new texture shader state rb_swap and reverse fields in v3d 7.x 3313- v3dv: fix color write mask for v3d 7.x 3314- v3d,v3dv: fix depth bias for v3d 7.x 3315- v3d,v3dv: fix compute for V3D 7.1.6+ 3316- v3dv: expose fullDrawIndexUint32 in V3D 7.x 3317- v3dv: expose depthClamp in V3D 7.x 3318- v3dv: expose scalarBlockLayout on V3D 7.x 3319- v3dv: fix confusing nomenclature about DRM nodes 3320- v3d,v3dv: fix MMU error from hardware prefetch after ldunifa 3321 3322Ian Douglas Scott (1): 3323 3324- egl/wayland: Don't segfault if \`create_wl_buffer` returns \`NULL` 3325 3326Ian Romanick (38): 3327 3328- intel/fs: Always do opt_algebraic after opt_copy_propagation makes progress 3329- intel/fs: Constant fold SHL 3330- intel/fs: Constant fold OR and AND 3331- util/rb-tree: Return the actual first node from rb_tree_search 3332- util/rb-tree: Fix typo in comment 3333- nir/builder: Add nir_extract_i8_imm and nir_extract_u8_imm helpers 3334- nir/algebraic: Remove redundant pack / unpack lowering patterns 3335- intel/fs: Completely re-write the combine constants pass 3336- intel/fs: Combine constants for SEL instructions too 3337- intel/fs: Combine constants for integer instructions too 3338- intel/fs: New VGRF packing scheme for constant combining 3339- intel/compiler: Combine control barriers with identical memory semantics 3340- intel/compiler: Don't evict for workgroup-scope fences 3341- glsl/list: Clean up an inappropriate comment 3342- util/rb-tree: Work around C++'s dislike of offsetof 3343- util/rb-tree: Inline rb_tree_init 3344- intel/fs: Don't continue fixed point iteration just because liveout changes 3345- intel/fs: Don't try to copy propagate into a source again after progress is made 3346- intel/fs: Make try_constant_propagate and try_copy_propagate file private 3347- intel/fs: Move src.file checks out of try_constant_propagate and try_copy_propagate 3348- intel/fs: Don't loop in try_constant_propagate 3349- intel/fs: Simplify check in can_propagate_from 3350- intel/fs: Make opt_copy_propagation_local file private 3351- intel/fs: Encapsulate per-block ACP in a structure 3352- intel/fs: Use rb_tree to store ACP entries by source 3353- intel/fs: Use rb_tree to store ACP entries by destination 3354- intel/fs: Use rb_tree for copy prop dataflow 3355- intel/fs: Merge copy prop dataflow loops 3356- intel/compiler/xe2: Update fs_visitor::setup_vs_payload to account for Xe2 reg size 3357- intel/compiler/xe2: Use SIMD16 for nir_intrinsic_image_size 3358- intel/compiler/xe2: TXD is lowered to SIMD16 in SIMD32 mode 3359- nir/rematerialize: Rematerialize ALUs used only by compares with zero 3360- intel/compiler/xe2: Handle new URB read messages 3361- intel/compiler/xe2: Handle new URB write messages 3362- intel/compiler/xe2: Update fs_visitor::emit_urb_writes to not assume SIMD8 3363- spirv: Track when a shader has a cooperative matrix 3364- intel/fs: Add DP4A to get_lowered_simd_width 3365- nir/split_vars: Don't split arrays of cooperative matrix types 3366 3367Igor Torrente (4): 3368 3369- zink: Fix enumerate devices when running compositor 3370- zink: Removes \`disable_xcb_surface` 3371- zink: Fix one addicional case when running a compositor 3372- zink: fix for startup crash of weston running on top of zink + venus 3373 3374Illia Abernikhin (2): 3375 3376- state_tracker: moving initialisation of whandle out from if statement whandle initialization inside if statement but used also outside 3377- i915: change format in dbg string Actually, uintptr_t is of type unsigned long, but the debug line uses the %d format specifier, which expects an int. 3378 3379Illia Polishchuk (7): 3380 3381- iris: remove NULL check for already dereferenced pointer earlier 3382- s/Intel: fix/anv: fix: potentially overflowing expression in genX 3383- glx: fix dead code when gc var cannot be null due to earlier check 3384- state_tracker: fix dereference before null check 3385- anv, drirc: Add workaround to speed up Cyberpunk 2077 reg allocation 3386- zink: move find_sampler_var from zink to nir core 3387- nir: fix invalid sampler search by texture id 3388 3389Italo Nicola (24): 3390 3391- mesa/main: account for RTT samples when updating framebuffer 3392- mesa/main: allow readpix/teximage to read from implicitly multisampled fbos 3393- panfrost/genxml: fix Surface With Stride descriptor alignment 3394- panfrost/genxml: add Multiplanar Surface descriptor 3395- panfrost: refactor (un)packing of surface descriptors 3396- pan/decode: decode Multiplanar Surface descriptors 3397- panfrost: prepare pan_image_view for multiplanar formats 3398- panfrost: prepare the driver to support YUYV and variants 3399- panfrost: advertise support for YUYV and variants 3400- panfrost: mandate proper alignment requirement depending format and arch 3401- panfrost: add PAN_MESA_DEBUG=yuv for debugging yuv sampler 3402- gallium/st: add non-CSC lowering of I420 as PIPE_FORMAT_R8_G8_B8_420 3403- gallium/st: add non-CSC lowering of YV12 as PIPE_FORMAT_R8_B8_G8_420 3404- pan/bi: add support for I420 and YV12 sampling 3405- gallium/st: lower NV21 to R8_B8G8 instead of G8_B8R8 3406- panfrost: fix invalid memory access in get_equation_str() 3407- pan/decode: handle more than one panfrost_device 3408- panfrost/ci: updated CI expectations 3409- egl: reenable partial redraw with a warning when using gallium hud 3410- pan/genxml: add Width/Height fields to v9+ Plane descriptor 3411- panfrost: rename _needs_multiplanar_descriptor to _is_yuv 3412- panfrost: prepare v9+ to support YUV sampling 3413- panfrost: use centered YUV chroma siting 3414- panfrost: advertise YUV formats for valhall 3415 3416Iván Briano (23): 3417 3418- anv: ensure CFE_STATE is emitted for ray tracing pipelines 3419- iris: ensure mesh is disabled on context init 3420- anv: ensure mesh is disabled on context init 3421- anv: implement Wa_14019750404 3422- intel/compiler: call brw_nir_adjust_payload from brw_postprocess_nir 3423- anv,hasvk: respect provoking vertex setting on geometry shaders 3424- anv: fix missing 3DSTATE_SBE_CLIP emission 3425- anv: ensure pipelines have all state 3426- anv: tell blorp to do mesh stuff only if it's enabled 3427- blorp: fix hangs with mesh enabled 3428- anv: use a simpler MUE layout for fast linked libraries 3429- anv: track what kind of pipeline a fragment shader may be used with 3430- intel/fs: read viewport and layer from the FS payload 3431- intel/fs: handle URB setup for fast linked mesh pipelines 3432- anv: enable VK_EXT_mesh_shader where supported 3433- intel/fs: use ffsll so we don't explode on 32 bits 3434- vulkan/runtime: add internal parameter to vk_spirv_to_nir 3435- nir/lower_int64: respect rounding mode when casting to float 3436- intel/compiler: round f2f16 correctly for RTNE case 3437- util: add double_to_float16 helpers 3438- nir: round f2f16{_rtne/_rtz} correctly for constant expressions 3439- anv: advertise VK_KHR_global_priority_queue 3440- anv: use the right vertexOffset on CmdDrawMultiIndexed 3441 3442Jani Nikula (1): 3443 3444- docs/vulkan: fixup some typos 3445 3446Janne Grunau (4): 3447 3448- asahi: toggle more barrier bits after transform feedback 3449- asahi,agx: Fix stack buffer overflow in agx_link_varyings_vs_fs 3450- asahi,agx: Upload constant buffers immediately 3451- asahi: decode: Fix uint64_t format modifiers in agxdecode_stateful() 3452 3453Jesse Natalie (2): 3454 3455- nir_lower_mem_access_bit_sizes: Fix write-mask-constrained 3-byte stores as atomics 3456- d3d12: Fix multidimensional array ordering 3457 3458Jianxun Zhang (1): 3459 3460- intel/common: Only set op mask on instructions in decoder 3461 3462Jonathan Marek (2): 3463 3464- freedreno: move redump.h to common code + cleanup 3465- tu: add a TU_DEBUG=rd option for cmdstream dumping 3466 3467Jordan Justen (73): 3468 3469- isl: Add ISL_SURF_USAGE_STREAM_OUT_BIT 3470- anv,iris,hasvk: Use ISL_SURF_USAGE_STREAM_OUT_BIT for setting stream-out MOCS 3471- genxml/hsw: Add additional MOCS field enumerations 3472- genxml/chv: Add MEMORY_OBJECT_CONTROL_STATE_CHV to document compared to BDW 3473- isl/dev: Add uncached MOCS value 3474- isl: Set MOCS to uncached for MTL stream-out 3475- intel/isl: Use intel_needs_workaround() for MTL CCS WA 3476- intel/compiler: Use nir SUBGROUP_INVOCATION for RT TOPOLOGY_ID 3477- intel/dev: Add LNL platform enum 3478- intel/dev: Support xe2 device init (for intel_device_info_test) 3479- intel/tools: Use 'env bash' to find bash executable 3480- intel/decoder: Fix xml filename when verx10 % 10 is not 0 3481- intel/decoder: Add intel_spec_load_common() 3482- intel/decoder: Make intel_spec_load_filename() have separate dir and name strings 3483- intel/genxml: Align "Texture Coordinate Mode" naming 3484- intel/genxml: Split some genxml sorting code into a intel_genxml module 3485- intel/genxml: Convert gen_bits_header to use ElementTree 3486- intel/genxml: Convert gen_pack_header to use ElementTree 3487- intel/genxml: Add GenXml class into intel_genxml module 3488- intel/genxml: Add filter_engines() to GenXml class 3489- intel/genxml: Move sorting & writing into GenXml class 3490- intel/genxml: Don't rewrite sorted xml if the contents didn't change 3491- intel/genxml: Add final newline to output when saving xml 3492- intel/genxml: Update xml with gen_sort_tags.py output 3493- intel/dev: Use RPL-U name on RPL-U devices 3494- intel/dev: Add more RPL PCI IDs 3495- anvil,hasvk: Rename need_clflush to need_flush 3496- intel/common: Move intel_clflush.h to intel_mem.h/intel_mem.c 3497- anvil,hasvk: Replace intel_clflush_range with intel_flush_range 3498- intel/common: Add intel_flush_range_no_fence 3499- anvil,hasvk: Use intel_flush_range_no_fence to flush command buffers 3500- util/u_cpu_detect: Drop unused has_tsc 3501- util/u_cpu_detect: Detect clflushopt support 3502- meson: Check for the __builtin_ia32_clflushopt function 3503- intel/clflush: Add support for clflushopt instruction 3504- intel/dev/xe: Move placeholder subslice info into XEHP_FEATURES 3505- intel/genxml: Ignore tail leading/trailing whitespace in node_validator() 3506- intel/genxml: Fix comparing xml when node counts differ 3507- intel/dev: Update device string for MTL PCI ID 0x7d55 3508- intel/genxml: Support importing from another genxml file 3509- intel/genxml: Add support for excluding items when importing 3510- intel/genxml: Add all xml files as pack dependencies 3511- intel/genxml: Add GenXml.optimize_xml_import() 3512- intel/genxml: Drop assertion to allow for importing 3513- intel/genxml: Add GenXml.add_xml_imports method 3514- intel/genxml: Add GenXml.flatten_xml() method 3515- intel/genxml: Add genxml_import.py script 3516- intel/decoder: ralloc_steal() values from spec context for fields and enums 3517- intel/decoder: Implement support for importing genxml 3518- intel/genxml: Start Xe2 support 3519- intel/genxml: Auto-import genxml files using genxml_import.py 3520- intel/common: Add sse2_args for 32-bit build when -Dsse2=false was set 3521- intel/compiler/fs: Support Xe2 reg size in assign_curb_setup 3522- intel/compiler: Update opt_split_sends() for Xe2 reg size 3523- intel/compiler: Update emit_rt_lsc_fence() for Xe2 3524- intel/compiler: Update lower_trace_ray_logical_send() for Xe2 3525- intel/compiler: Update ray-tracing intrinsic lowering for Xe2 3526- intel/compiler: Update RT stack_id access for Xe2 3527- intel/fs: Update SSBO & shared uniform block loads for Xe2 3528- intel/genxml: Build with gen20.xml 3529- intel/isl: Build for Xe2 3530- iris: Build for Xe2 3531- anv/blorp: Use anv_genX to set device->blorp.exec 3532- anv: Disable Ray Tracing on xe2 until our compiler supports Xe2 RT 3533- anv: Build for Xe2 3534- anv: Print warning that Xe2 is not supported rather than failing 3535- intel/compiler: Add enum xe2_lsc_cache_store 3536- intel/compiler: Use enum xe2_lsc_cache_store on xe2 3537- intel/compiler: Add enum xe2_lsc_cache_load 3538- intel/compiler: Use enum xe2_lsc_cache_load on xe2 3539- anv/batch: Check if batch already has an error in anv_queue_submit_simple_batch() 3540- anv/batch: Assert that extend_cb is non-NULL if the batch is out of space 3541- intel/dev: Add 0x56ba-0x56bd DG2 PCI IDs 3542 3543Jose Maria Casanova Crespo (2): 3544 3545- vc4: mark buffers as initialized at vc4_texture_subdata 3546- vc4: Fix mask RGBA validation at YUV blit 3547 3548José Expósito (3): 3549 3550- zink: Fix crash on zink_create_screen error path 3551- zink: fix dereference before NULL check 3552- zink: allow software rendering only if selected 3553 3554José Roberto de Souza (51): 3555 3556- anv: Use workaround framework to Wa_14016118574 3557- intel/aux_map: Nuke format_enum 3558- intel/aux_map: Use get_aux_entry() in remove_mapping() 3559- intel/aux_map: Replace magic number by INTEL_AUX_MAP_ENTRY_VALID_BIT 3560- intel/aux_map: Rename some variables to improve readability 3561- intel/aux_map: Mask out bits above index 47 in intel_aux_get_meta_address_mask() 3562- intel/aux_map: Convert l1_entry_addr_out to canonical 3563- intel/aux_map: Drop magic sub table size number 3564- intel/aux_map: Add function and macro to return l2 and l1 table masks 3565- anv: Add gem_create_userptr() to KMD backend 3566- anv: Replace handle by anv_bo in the gem_close() 3567- anv: Add support for userptr in Xe KMD 3568- intel: Sync xe_drm.h 3569- intel/dev/xe: Add support for small-bar setups 3570- anv: Request Xe KMD to place BOs to CPU visible VRAM when required 3571- iris: Request Xe KMD to place BOs to CPU visible VRAM when required 3572- iris/xe: Call iris_lost_context_state() when batch engine is replaced 3573- intel/dev: Port intel_dev_info tool to Xe KMD 3574- iris: Replace I915_EXEC_FENCE_SIGNAL by IRIS_BATCH_FENCE_SIGNAL in common code 3575- intel: Move i915_drm.h specific code from common/intel_gem.h to common/i915/intel_gem.h 3576- intel/common: Move functions inside of C++ ifdef 3577- intel: Rename intel_gem_add_ext() to intel_i915_gem_add_ext() 3578- iris: Move i915_gem_set_domain() call to i915 backend 3579- iris: Move iris_bufmgr_bo_close() to kmd backend 3580- iris: Add gem_create_userptr() to KMD backend 3581- iris: Add support for userptr in Xe KMD 3582- intel/genxml/gen125: Add missing fields in MI_MATH 3583- iris: Set MI_MATH MOCS field 3584- anv: Set MI_MATH MOCS field 3585- intel/tests/mi_builder: Set MI_MATH MOCS field 3586- intel/genxml/gen125: Set MI_MATH MOCS field as non-zero 3587- anv: Nuke unused READ_ONCE() from anv_batch_chain.c 3588- anv: Remove VkAllocationCallbacks parameter from reloc functions 3589- anv: Return earlier in anv_reloc_list functions 3590- intel: Sync xe_drm.h and rename engine to exec_queue 3591- anv: Override vendorID for Hogwarts Legacy 3592- intel/isl: Remove unknown workaround 3593- intel/isl: Remove Wa_22011186057 3594- anv: Update Wa_16014390852 for MTL 3595- intel: Sync xe_drm.h 3596- anv: Move i915 specific gem_set_caching to backend 3597- anv: Move i915 specific code from common anv_gem.c 3598- anv: Move bo_alloc_flags_to_bo_flags() to backend 3599- anv: Move i915 handling of imported bos bo_flags 3600- anv: Remove i915_drm.h include from common code 3601- iris: Lock bufmgr->lock before call vma_free() in error path 3602- iris: Nuke useless flags from iris_fine_fence_new() 3603- intel: Prepare implementation of Wa_18019816803 and Wa_16013994831 for future platforms 3604- intel: Sync xe_drm.h 3605- anv: Switch Xe KMD vm bind to sync 3606- anv: Add missing ANV_BO_ALLOC_EXTERNAL flags when calling anv_device_import_bo() 3607 3608Juan A. Suarez Romero (7): 3609 3610- broadcom/ci: update expected results 3611- vc4/ci: update expected results 3612- v3d/shim: include new ioctl parameters 3613- v3dv/ci: update expected list 3614- broadcom: add performance counters for V3D 7.x 3615- broadcom/simulator: add per-hw version calls 3616- v3d/vc4/ci: add new fails/timeout 3617 3618Julia Tatz (10): 3619 3620- gallium/dri: fix dri2_from_names 3621- aux/trace: skip multi-line comments in enums2names 3622- aux/trace: deduplicate enum dump macro work 3623- aux/trace: move trace_sample_view logic 3624- aux/trace: fix set_hw_atomic_buffers method name 3625- aux/trace: add screen video methods 3626- aux/trace: add context video methods 3627- aux/trace: wrap video_codec & video_buffer 3628- aux/trace: unwrap refrence frames in picture_desc 3629- aux/trace: trace video_buffer method return vals 3630 3631Julia Zhang (1): 3632 3633- radeonsi: modify algorithm of skipping holes of sparse bo 3634 3635Julian Hagemeister (1): 3636 3637- Gallium: Fix shared memory segment leak 3638 3639Juston Li (10): 3640 3641- zink: remove venus from renderpass optimizations 3642- venus: sync protocol for VK_EXT_vertex_input_dynamic_state 3643- venus: implement VK_EXT_vertex_input_dynamic_state 3644- venus: set lvp queries as saturate on overflow 3645- venus: add helper function to get cmd handle 3646- venus: refactor out common cmd feedback functions 3647- venus: support deferred query feedback recording 3648- venus: track/recycle appended query feedback cmds 3649- venus: append query feedback at submission time 3650- venus: switch to unconditionally deferred query feedback 3651 3652Kai Wasserbäch (3): 3653 3654- fix: clover: LLVM 18 renamed/moved CGFT_*, update compat layer 3655- fix: clover: LLVM 18: s/CodeGenOpt::/CodeGenOptLevel::/ 3656- fix: clover: warning: ignoring return value of ‘int posix_memalign(…)’ [-Wunused-result] 3657 3658Karmjit Mahil (29): 3659 3660- pvr: Remove mrt setup from SPM EOT 3661- pvr: Compile SPM EOT shader 3662- pvr: Use the SPM EOT on barrier stores 3663- pvr: Remove some magic numbers and increments from km stream 3664- pvr: Restructure \`rogue_kmd_stream.xml` 3665- pvr: Submit PR commands 3666- pvr: Use the correct size for the unified store allocation 3667- pvr: Allow query stage for barrier sub cmds 3668- pvr: Fix occlusion query unaccounted for user fences 3669- pvr: Fix writing query availability write out 3670- pvr: Fix packing issue with max_{x,y}_clip 3671- pvr: Fix csb relocation status assert on \`pvr_csb_finish()` 3672- pvr: Fix \`for` loop itarator usage 3673- pvr: Fix dynamic desc offset storage 3674- pvr: Fix cubemap layer stride 3675- pvr: Use the render passes' attachments array to setup ISP state 3676- pvr: Adjust EOT PBE state to account for the iview's base array layer 3677- pvr: Fix MRT index in PBE state 3678- pvr: Fix pbe_emit assert 3679- pvr: Fix OOB access of pbe_{cs,reg}_words 3680- pvr: Order tile buffer EOT emits to be last 3681- pvr: Fix subpass sample count on ds attachment only 3682- pvr: Refactor subpass ds and sample count setup 3683- pvr: Fix SPM load shader sample rate 3684- pvr: Fix PPP_SCREEN sizes 3685- vulkan: Add \`vk_subpass_dependency_is_fb_local()` helper 3686- tu: Use common \`vk_subpass_dependency_is_fb_local()` 3687- pvr: Don't merge subpasses on framebuffer-global dependancy 3688- pvr: Only setup the bgobj to load if we have a load_op 3689 3690Karol Herbst (213): 3691 3692- nvc0: initial Ada enablement 3693- rusticl/mesa: make svm_migrate optional 3694- llvmpipe: enable system SVM 3695- nvc0: fix num_gprs for Volta+ 3696- rusticl: fix warnings with newer rustc 3697- gm107/ir: fix SULDP for loads without a known format 3698- nv50/ir/nir: fix txq emission on MS textures 3699- nv50/ir/nir: Fix zero source handling of tex instructions. 3700- rusticl/kernel: only handle function_temp memory before lowering printf 3701- meson,ci: bump meson req for rusticl to 1.2 3702- rusticl/nir: add helper functions we need for a NIR_PASS macro 3703- rusticl/nir: add a nir_pass macro 3704- rusticl/nir: use the new nir_pass macro 3705- rusticl/kernel: rename res to internal_args inside lower_and_optimize_nir_late 3706- rusticl/kernel: merge lower_and_optimize_nir_pre_inputs and lower_and_optimize_nir_late 3707- rusticl/kernel: move things around in lower_and_optimize_nir 3708- rusticl/kernel: get rid of initial function_temp type lowering 3709- rusticl/kernel: mark can_remove_var as unsafe and document it 3710- n50/compute: submit initial compute state in nv50_screen_create 3711- nvk: add vulkan skeleton 3712- nouveau/winsys: add the new winsys implementation 3713- nvk: use winsys lib 3714- nvk: fix nvk_buffer include guards 3715- nouveau/headers: add script to sync in-tree headers with open-gpu-doc 3716- nouveau/headers: initial sync of headers 3717- nvk: implement GetPhysicalDeviceQueueFamilyProperties2 to make the CTS happy 3718- nvk: advertize memory heaps and types 3719- nouveau/ws: reorganize a little 3720- nouveau/ws: dup the fd 3721- nouveau/ws: add a field for the SM version 3722- nvk: set nonCoherentAtomSize as the CTS divides with this value 3723- nouveau/ws: add bo API 3724- nvk: add basic device memory support 3725- nouveau/headers: add nvtypes.h 3726- nouveau/headers: typedef Nv void types 3727- nouveau/headers: add host classes 3728- nouveau/ws: add context support 3729- nouveau/ws: add a cmd buffer 3730- novueau/bo: refcount it 3731- novueau/bo: add nouveau_ws_bo_wait 3732- nvk: allocate a GPU context for each VkDevice 3733- nvk: add nvk_bo_sync 3734- nvk: add nvk_CmdPipelineBarrier2 stub 3735- nvk: impl nvk_CmdCopyBuffer 3736- nouveau/ws: fix setting push bo domains 3737- nouveau/ws: PUSH_IMMD only works with 16 bit values 3738- nouveau/ws: set GPU object class 3739- nouveau/ws: bind 2D class 3740- nvk: use fermi class definitions 3741- nvk: add basic support for images 3742- nvk: simple format table 3743- nvk: add support for blits 3744- nvk: report maxMipLevels as 1 3745- nvk: optimize blit command buffer gen 3746- nvk: implement CmdFillBuffer 3747- nvk: implement CmdUpdateBuffer 3748- nvk: implement CmdCopyBuffer2 3749- nvk: advertise VK_KHR_copy_commands2 3750- nvk: implicitly reset the command buffer 3751- nouveau/ws: handle 0inc inside nvk_push_val as well 3752- nvk: reduce pitch even further in CmdFillBuffer 3753- nvk: support multiple miplevels 3754- nvk: support array blits over multiple layers 3755- nvk: tiling prep work for VK_EXT_image_2d_view_of_3d 3756- nouveau/ws: make sure we don't submit nonsense 3757- nouveau/ws: assert on broken channel 3758- nvk/blit: assert that formats are supported 3759- nouveau/headers: Generate parser functions 3760- nouveau/ws: initial debugging options for command submissions 3761- nouveau/ws: depend on generated class header files 3762- nouveau/ws: get rid of libdrm 3763- nouveau/ws: use new NVIF interface to query oclasses 3764- nvk: set deviceName 3765- nouveau/headers: add path for 3D headers 3766- nouveau/headers: initial 3D headers import 3767- nouveau/ws: allocate 3D subchan 3768- nouveau/ws: allocate copy subchan as well 3769- nouveau/ws: add API to query if the context was killed 3770- nouveau/ws: add a bo unmap helper function 3771- nvk: clean up bo mappings 3772- nouveau/ws: bound check nouveau_ws_push_append 3773- nouveau/ws: rework refing push buffer bos 3774- nouveau/ws: push chaining 3775- nvk: fix OOB read inside nvk_get_va_format 3776- nvk: alloc a zero page and use it for vertex runouts 3777- nvk: fix zero page refing 3778- nvk: support exporting buffers 3779- nvk: fix some class version checks 3780- nvk: properly align shaders pre Turing 3781- nvk: rework QMD handling to support pre Turing 3782- nvk: align desc root table 3783- nvk: Use SET_PIPELINE_PROGRAM pre-Volta 3784- nvk: properly align slm size 3785- nvk: use remaps for image copies 3786- nvk: reduce pitch for FillBuffer 3787- nvk: bind more subchans in init_context_state 3788- nvk: support pre Maxwell Texture Headers 3789- nvk/device: fix order of error handling 3790- nvk: allocate VAB memory area 3791- nvk: wire up M2MF for Fermi 3792- nouveau/mme: add test for BEQ with magic exit offset 3793- nouveau/mme: add a macro exit helper 3794- nvk: Add a macro to set MMIO registers via falcons 3795- nouveau/winsys: fix SM value for Ada 3796- nvk: fix num_gprs for Volta+ 3797- nvk: replace mp with tpc 3798- nvk: properly calculate SLM region by taking per arch limits into account 3799- nouveau: fix max_warps_per_mp_for_sm for builds with asserts disabled 3800- nvk: enable fp helper invocations loads on more gens 3801- nv50/ir: use own info struct for sys vals 3802- nv50/ir: convert system values to gl_system_value 3803- nouveau/mme: fix OOB access inside while_ine builder test 3804- nouveau/mme: fix OOB inside tu104 simulator 3805- clc: use CLANG_RESOURCE_DIR for clang's resource path 3806- nv50: fix code uploads bigger than 0x10000 bytes 3807- nouveau: take glsl_type ref unconditionally 3808- rusticl/kernel: optimize nir between lowering io and explicit types 3809- nv50: limit max code uploads to 0x8000 3810- zink: fix source type in load/store scratch 3811- zink: fix global stores 3812- zink: update some compute caps 3813- rusticl: add debug option to sync every event 3814- rusticl/device: _MAX_CONST_BUFFER0_SIZE is unsigned 3815- ci: disable a660 jobs 3816- nir: make workgroup_id 32 bit only 3817- nir: make num_workgroups 32 bit only 3818- ac: drop 64 bit handling for cl workgroup intrinsics 3819- gallivm/nir: drop 64 bit handling for cl workgroup intrinsics 3820- intel/compiler: drop 64 bit handling for cl workgroup intrinsics 3821- panfrost: drop 64 bit handling for cl workgroup intrinsics 3822- rusticl: reduce global_invocation_id_zero_base to 32 bit 3823- panfrost: drop pan_nir_lower_64bit_intrin 3824- rusticl/disk_cache: fix stack corruption 3825- rusticl/query: fix use-after-free, but also fix incorrect usage of unsafe 3826- rusticl/event: disable profiling for devices without timestamps 3827- rusticl/queue: properly implement clCreateCommandQueueWithProperties 3828- rusticl/memory: do not verify pitch for IMAGE1D_BUFFER 3829- rusticl/memory: only specify PIPE_BIND_SHADER_IMAGE where supported 3830- asahi: fetch available system memory 3831- asahi: lower hadd 3832- asahi: handle kernels 3833- asahi: handle load_workgroup_size 3834- asahi: handle load_global_invocation_id_zero_base 3835- asahi: implement get_compute_state_info 3836- asahi: implement set_global_binding 3837- asahi: implement clear_buffer 3838- asahi: gracefully handle allocating linear images 3839- asahi: handle images in is_format_supported 3840- rusticl/memory: fallback if allocating linear images fails 3841- rusticl: enable asahi 3842- rusticl/mesa: create contexts with PIPE_CONTEXT_NO_LOD_BIAS 3843- docs/features: cl_khr_3d_image_writes needs driver support 3844- rusticl/mesa: fix \`set_constant_buffer` when passing an empty buffer 3845- rusticl/kernel: skip adding global id offsets if not used 3846- meson/rusticl: add sha1_h 3847- rusticl/mesa/context: fix clear_sampler_views 3848- nir: add nir_lower_alu_vec8_16_srcs pass 3849- zink: lower vec8/16 3850- rusticl/mesa: create COMPUTE_ONLY contexts 3851- rusticl: fix clippys bool_to_int_with_if 3852- rusticl/memory: fix potential use-after-free in clEnqueueSVMMemFill 3853- nir/load_libclc: fix libclc memory leak 3854- rusticl/kernel: Fix creation from programs not built for every device 3855- ci: add half-life 2 freedreno flake 3856- zink: implement get_compute_state_info 3857- zink: copy has_variable_shared_mem cs property 3858- zink: pass entire pipe_grid_info into zink_program_update_compute_pipeline_state 3859- zink: refactor spec constant handling 3860- zink: variable shared mem support 3861- zink: support more nir opcodes 3862- zink: make spirv_builder_emit_*op compatible with spec constants 3863- zink: support samplers with unnormalized_coords 3864- zink: implement remaining pack ops via bitcast 3865- zink: fix RA textures 3866- zink: fix load/store scratch offsets 3867- rusticl/mesa/screen,device: add driver_name 3868- rusticl: enable zink 3869- pipe-loader: allow to load multiple zink devices 3870- rusticl: bump rustc version to 1.66 3871- rusticl/mesa/nir: mark more methods as mut 3872- rusticl/mesa/nir: Mark NirShader and NirPrintfInfo as Send and Sync 3873- rusticl/mesa: mark PipeResource as Send and Sync 3874- rusticl/mesa: mark PipeTransfer as Send 3875- rusticl/cl: mark _cl_image_desc as Send and Sync 3876- rusticl/queue: get rid of pointless Option around our worker thread handle 3877- rusticl/queue: make it Sync 3878- rusticl/kernel: get rid of Arcs in KernelDevStateVariant 3879- rusticl/memory: use get_mut instead of lock in drop 3880- zink: implement PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS 3881- rusticl/api: remove cl_closure macro 3882- zink: implement load_global_constant 3883- zink: properly emit PhysicalStorageBufferAddresses cap 3884- nir/lower_mem_access_bit_sizes: fix invalid shift bit_size 3885- rusticl/device: restrict 1Dbuffer images for RGB and RGBx 3886- rusticl/memory: use PIPE_BUFFER for IMAGE1D_BUFFER images 3887- rusticl/format: disable all sRGB formats 3888- asahi: flush denorms on exact fmin/fmax 3889- zink: wrap shared memory blocks in a struct 3890- zink: properly alias shared memory 3891- zink: fix zink_destroy_screen for early screen creation fails 3892- docs/features: remove empty lines confusing mesamatrix 3893- rusticl/device: restrict image_buffer_size 3894- rusticl/device: restrict param_max_size further 3895- rusticl/mem: properly set pipe_image_view::access 3896- zink: lower fisnormal as it requires the Kernel Cap 3897- radv: fix buffers in vkGetDescriptorEXT with size not aligned to 4 3898- rusticl/queue: Only take a weak ref to the last Event 3899- rusticl/mesa: pass PIPE_BIND_LINEAR in resource_create_texture_from_user 3900- zink: deallocate global_bindings array 3901- rusticl/mesa/screen: do not derefence the entire pipe_screen struct 3902- nvc0: implement PIPE_CAP_TIMER_RESOLUTION 3903- rusticl/queue: do not send empty lists of event to worker queue 3904- rusticl/queue: fix implicit flushing of queue dependencies 3905 3906Kenneth Graunke (21): 3907 3908- iris: Re-emit 3DSTATE_DS for each primitive (workaround 14019750404) 3909- intel/compiler: Fix sparse cube map array coordinate lowering 3910- intel/compiler: Respect NIR_DEBUG_PRINT_INTERNAL for DEBUG_OPTIMIZER 3911- intel/fs: Account for payload GRFs when calculating register pressure 3912- intel/compiler: Move SCHEDULE_NONE handling into schedule_instructions() 3913- intel/fs: Index scheduler mode string table by mode enum 3914- intel/fs: Make helpers for saving/restoring instruction order 3915- intel/fs: Pick the lowest register pressure schedule when spilling 3916- intel/fs: Dump IR for pre-RA scheduler modes in DEBUG_OPTIMIZER 3917- iris: Check prog[] instead of uncompiled[] for BLORP state skipping 3918- nir: Fix function parameter indentation in nir_opt_barriers.c 3919- nir: Add an optimization pass to reduce barrier modes 3920- nir: Reduce the scope of shared memory barriers 3921- lavapipe: Don't delete control barriers 3922- virgl, nir_to_tgsi: Add a hack for promoting partial memory barriers 3923- dxil: Set UAV_FENCE_THREAD_GROUP any time global isn't required 3924- glsl: Use nir_opt_barrier_modes() to drop unnecessary barriers 3925- anv: Use nir_opt_barrier_modes() to drop unnecessary barriers 3926- mesa: Fix zeroing of new ParameterValues array entries when growing 3927- intel/fs: Fix Xe2 URB read/lowering with per-slot offsets 3928- anv: Add support for a transfer queue on Alchemist 3929 3930Kevron Rees (1): 3931 3932- Force vk vendor for spider-man remastered 3933 3934Konrad Dybcio (5): 3935 3936- freedreno: Set magic writes per-GPU, using existing data 3937- freedreno: Include speedbin fallback in 740 chipid to fix probing 3938- freedreno: Include speedbin fallback in 730 chipid to fix probing 3939- freedreno: Include speedbin fallback in 690 chipid to fix probing 3940- freedreno: Add Adreno 643 3941 3942Konstantin Seurer (95): 3943 3944- radv: Stop using the misleading round_up_u* functions 3945- radv/meta_buffer: Stop setting RADV_META_SAVE_DESCRIPTORS 3946- radv/meta_buffer: Rename size_minus16 to max_offset 3947- llvmpipe: Fix compiling with LP_USE_TEXTURE_CACHE 3948- nir/tests: Refactor boilerplate into a common header 3949- nir/tests: Use a single binary 3950- draw: Do not restart the primitive_id at 0 3951- gallivm: Fix subsampled format sampling under Vulkan 3952- gallivm: Ignore nir_tex_src_plane 3953- lavapipe: Remove dummy sampler ycbcr conversion 3954- lavapipe: Store immutable_samplers as lvp_sampler array 3955- lavapipe: Fix binding immutable samplers with desc buffers 3956- lavapipe: Implement samplerYcbcrConversion 3957- lavapipe: Advertise samplerYcbcrConversion 3958- llvmpipe: Zero extend vectors in widen_to_simd_width 3959- vulkan: Add a generated vk_properties struct 3960- radv: Use common physical device properties 3961- clang-format: Disable formatting by default 3962- lavapipe: Use common physical device properties 3963- nir/from_ssa: Don't insert store_reg instructions before phis 3964- gallivm: Run nir_convert_to_lcssa before nir_convert_from_ssa 3965- lavapipe/ci: Remove descriptor_indexing fails 3966- radv/rt: Rename shader_pc and next_shader 3967- radv/rt: Rename traversal_shader to traversal_shader_addr 3968- nir/opt_large_constants: Handle small float arrays 3969- bin: Update spirv sources 3970- vulkan: Allow beta extensions for physical device features 3971- vulkan: Allow beta extensions for physical device properties 3972- vulkan Add enqueue entrypoint for CmdDispatchGraphAMDX 3973- nir: Add shader enqueue data structures and handling 3974- spirv: Update headers and grammer JSON 3975- spirv: Implement SPV_AMDX_shader_enqueue 3976- lavapipe: Add lvp_pipeline_type 3977- lavapipe: Implement exec graph pipelines 3978- lavapipe: Implement AMDX_shader_enqueue commands 3979- lavapipe: Advertise AMDX_shader_enqueue 3980- radv: Add internal_nodes_offset to scratch_layout 3981- radv: Remove leaf_args::dst_offset 3982- radv/rt: Remove some dead code 3983- radv/rt: Do not apply stack_ptr for non-recursive stages 3984- radv/rt: Add and use radv_build_traversal 3985- radv/rt: Insert rt_return_amd before lowering shader calls 3986- radv/rt: Split stage initialization and hashing 3987- aco: Do not fixup registers if there are no shader calls 3988- radv: Stop updating the stack_size in insert_rt_case 3989- lavapipe: Lock around CSO destroys 3990- vulkan/wsi/x11: Implement capture hotkey using the keymap 3991- venus: Use the common GetPhysicalDeviceFeatures2 implementation 3992- nir/lower_shader_calls: Limit the remat chain length 3993- lavapipe: Avoid lowering shaders twice 3994- lavapipe: Fix the locking around cso destruction 3995- aco/validate: Handle p_wqm like p_parallelcopy 3996- aco: Use bytes() instead of size() in emit_wqm 3997- aco: Unify demote and demote_if selection 3998- radv: Only generate debug info if required 3999- aco/lower_to_cssa: Fix typo 4000- radv: Don't use the depth image view for depth bias emission 4001- radv/rt: Store NIR shaders separately 4002- radv/rt: Add monolithic raygen lowering 4003- radv/rt: Enable monolithic pipelines 4004- radv/ci: Document new flake 4005- vulkan/properties: Handle unsized arrays properly 4006- radv: Remove dead radix_sort_vk_get_memory_requirements call 4007- radv/radix_sort: Vendor the radix sort dispatch code 4008- radv: Perform multiple sorts in parallel 4009- radv/ci: Improve ray tracing skips 4010- ac/llvm: Fix typed loads with 16bit formats 4011- ac/llvm: Use the correct return type for uadd_carry and usub_borrow 4012- ac/llvm: Use float types for float atomics 4013- radv: Don't advertise features requiring PS epilogs with LLVM 4014- radv: Update navi21 llvm fails 4015- radv/rt: Handle stages without nir properly 4016- radv: Remove ray tracing shader module identifier skips 4017- radv/bvh: Treat instances with mask == 0 as inactive 4018- radv/ray_queries: Skip cull_mask handling if it is FF 4019- radv/rt: Skip cull_mask handling if it is FF 4020- aco/spill: Make sure that offset stays in bounds 4021- nir: Add nir_cf_node_cf_tree_prev 4022- nir: Add nir_foreach_block_in_cf_node_reverse 4023- nir: Add nir_rematerialize_deref_in_use_blocks 4024- nir/lcssa: Fix rematerializing derefs 4025- nir/deref: Layer rematerialization helpers 4026- lavapipe/ci: Fix asan expectations 4027- hasvk: Use the common GetPhysicalDeviceFeatures2 implementation 4028- vulkan: Remove vk_get_physical_device_core_1_*_feature_ext 4029- radv/bvh/ploc: Load child bounds from LDS 4030- radv: Merge the sync_data and header initialization 4031- radv: Do not sync after radv_update_buffer_cp 4032- zink: Initialize primitive types to an invalid value 4033- nir/passthrough_gs: Support edge flags with points 4034- zink: Enable edge flags with points 4035- mesa: Fix glBegin/End when LINE_LOOP is not supported 4036- llvmpipe: Compile a nop texture function for unsupported configurations 4037- radv/rt: Use nir_shader_instructions_pass for lower_rt_instructions 4038- radv/sqtt: Fix tracing acceleration structure commands 4039 4040Lang Yu (5): 4041 4042- amd/common: add AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 property 4043- radeonsi: use AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 to determine wave size 4044- radeonsi: use wave size to determine index stride 4045- amd/common: add missing stuff for gfx11.5 4046- amd/radeonsi: add missing stuff for gfx11.5 4047 4048Leandro Ribeiro (13): 4049 4050- egl: rewrite outdated comment in _eglFindDevice() 4051- egl: remove unused parameter from _eglAddDRMDevice() 4052- egl: simplify _eglAddDRMDevice() 4053- egl: make explicit that we don't support render nodes for software EGLDevice 4054- egl: move is_render_node flag to platform_wayland 4055- loader: rename loader_open_render_node() to loader_open_render_node_platform_device() 4056- loader: add driver list as parameter in loader_open_render_node_platform_device() 4057- pipe-loader: add pipe_loader_get_compatible_render_capable_device_fd() 4058- dri: add queryCompatibleRenderOnlyDeviceFd() to __DRI_MESA extension 4059- kmsro: try to use only compatible render-capable devices 4060- loader: add loader_is_device_render_capable() 4061- egl/drm: get compatible render-only device fd for kms-only device 4062- egl: error out if we can't find an EGLDevice in _eglFindDevice() 4063 4064Leo Liu (4): 4065 4066- radeonsi: add AV1 profile to supported profile 4067- radeonsi/vcn: fix the incorrect dt_size 4068- Revert "frontends/va: Also map VAImageBufferType for reading" 4069- ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG 4070 4071Lina Versace (14): 4072 4073- docs: Add row for VK_KHR_maintenance5 4074- intel/pci_ids: Consistently use lowercase 4075- venus: Sync protocol for VK_EXT_graphics_pipeline_library 4076- venus: Erase pViewports and pScissors in fewer cases 4077- venus: Fix crash when VkGraphicsPipelineCreateInfo::layout is missing 4078- venus: Fix subpass attachments 4079- venus: Drop incorrectly-used always-true pipeline vars 4080- venus: Use VkImageAspectFlags in vn_subpass 4081- venus: Add enum vn_pipeline_type 4082- venus: Renames for VkGraphicsPipelineCreateInfo fixes 4083- venus: Refactor pipeline fixup into two stages 4084- venus: Do pipeline fixes for VK_EXT_graphics_pipeline_library 4085- venus: Enable VK_EXT_graphics_pipeline_library behind debug flag 4086- venus: Fix -Wmaybe-uninitialized 4087 4088LingMan (22): 4089 4090- rusticl/memory: fix potential use-after-free in clEnqueueSVMFree 4091- rusticl: Rename XyzCB aliases to FuncXyzCB 4092- rusticl: add structs to hold the C callbacks 4093- rusticl: use CreateContextCB 4094- rusticl: use DeleteContextCB 4095- rusticl: use EventCB 4096- rusticl: use MemCB 4097- rusticl: use ProgramCB 4098- rusticl: use SVMFreeCb 4099- rusticl: Make EventSig take ownership of its environment 4100- rusticl: add a safe abstraction to execute a DeleteContextCB 4101- rusticl: add a safe abstraction to execute an EventCB 4102- rusticl: add a safe abstraction to execute a MemCB 4103- rusticl: add a safe abstraction to execute an SVMFreeCb 4104- rusticl: add a safe abstraction to execute a CreateContextCB 4105- rusticl: add a safe abstraction to execute a ProgramCB 4106- rusticl/api: drop a few include paths 4107- rusticl: mark the fields of callback structs private 4108- rusticl: drop an \`#[allow(dead_code)]` marker 4109- rusticl/core: don't take a lock while dropping \`Context` 4110- rusticl: Show an error message if the build is attempted with an outdated bindgen version 4111- rusticl: Show an error message if the version of bindgen can't be detected 4112 4113Lionel Landwerlin (169): 4114 4115- anv: hide exec_flags selection inside the i915 backend 4116- isl: add a tool to query surface parameters 4117- intel/fs: fix missing predicate on SEL instruction 4118- intel/compiler: rework input parameters 4119- ci/a530: switch a few tests to flakes to unblock CI 4120- vulkan: bump header register to 1.3.258 4121- intel/fs: don't try to rebuild sequences of non ssa values 4122- intel/vec4: fix log_data pointer 4123- intel/fs: consider UNDEF as non-partial write 4124- intel/fs: add more UNDEFs around SEND messages 4125- isl: add ability to store buffer size in unused RENDER_SURFACE_STATE fields 4126- anv: simplify buffer address+size loads from descriptor buffer 4127- intel/fs: add support for sparse accesses 4128- intel/nir: handle image_sparse_load in storage format lowering 4129- intel/nir: add lower for sparse images & textures 4130- anv: wire image sparse loads 4131- blorp: switch blorp_update_clear_color to early return 4132- blorp: update and move fast clear PIPE_CONTROLs to drivers 4133- anv: fix 3DSTATE_RASTER::APIMode field setting 4134- anv: enable EDS3 ConservativeRasterizationMode 4135- vulkan: skip non required extension structures 4136- vulkan/runtime: add a layered implementation of vkCmdBindIndexBuffer 4137- anv: enable INTEL_DEBUG=nofc 4138- anv: fake non intel vendorID for Death Stranding 4139- hasvk: fix null descriptor handling with A64 messages 4140- anv: remove descriptor array bounds checking 4141- hasvk: remove descriptor array bounds checking 4142- anv/hasvk: track robustness per pipeline stage 4143- anv: implement VK_EXT_pipeline_robustness 4144- intel/fs: track more steps with INTEL_DEBUG=optimizer 4145- intel/fs: add variable for output of debug backend optimizer 4146- intel/decoder: constify some input parameters 4147- blorp: drop programming of 3DSTATE_(MESH|TASK)_SHADER 4148- anv: emit 3DSTATE_GS only once per pipeline 4149- intel/decoder: add options to decode surfaces/samplers 4150- anv: get rid of genX(emit_multisample) 4151- anv: move genX(rasterization_mode) to gfx8_cmd_buffer.c 4152- anv: don't try to access dynamic buffers from surface states 4153- iris: ensure stalling pipe control before fast clear 4154- intel/compiler: disable per-sample interpolation modes with non-per-sample dispatch 4155- intel/compiler: fix dynamic alpha-to-coverage handling 4156- intel/fs: implement dynamic interpolation mode for dynamic persample shaders 4157- intel/fs: move lower of non-uniform at_sample barycentric to NIR 4158- zink+anv: add regression testing with pipeline libraries 4159- anv: implement vkCmdBindIndexBuffer2KHR 4160- anv: handle new VkBufferViewUsageCreateInfoKHR 4161- anv: add vkGetRenderingAreaGranularityKHR() 4162- anv: implement GetDeviceImageSubresourceLayoutKHR/GetImageSubresourceLayout2KHR 4163- anv: add maintenance5 A8_UNORM/A1B5G5R5_UNORM support 4164- anv: deal with new pipeline flags 4165- anv: enable KHR_maintenance5 4166- anv: add missing ISL storage usage 4167- genxml/gfx11: remove Tiled Resource Mode field from HIER_DEPTH_BUFFER 4168- genxml/gfx12: rename Tiled Resource Mode 4169- isl: program 3DSTATE_HIER_DEPTH_BUFFER_BODY::TiledMode as documented 4170- intel/isl: Disallow Yf, Ys and Tile64 for 3D depth/stencil surfaces 4171- isl: disable Yf/Ys/Tile64 tilings for 1D images 4172- isl: add a usage flag to request 2D/3D compatible views 4173- isl: disallow TileYs/Yf on 3D storage images on Gfx9/11 4174- intel/isl: Add a max_miptail_levels field to isl_tile_info 4175- isl: make isl_surf_get_uncompressed_surf robust to argument accesses 4176- isl: add Gfx12/12.5 restriction on 3D surfaces & compression 4177- isl: disallow miptails on planar formats 4178- isl: disable miptails on gfx12 with yuv formats 4179- isl: disable CCS on Ys/Yf 4180- blorp: allow 3D blits/copies on Ys/Yf/Tile64 tiling 4181- intel/aux_map: correctly program tiling mode for Ys 4182- isl: reorder tiling selection 4183- anv: enable standard Y tiles 4184- isl/tilememcpy_test: add multiple tile testing 4185- anv: rename total_batch_size 4186- anv: reuse cmd_buffer::total_batch_size 4187- intel/measure: track batch buffer sizes 4188- intel/nir: rerun lower_tex if it lowers something 4189- intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL 4190- hasvk: add state cache invalidation back before fast clears 4191- blorp: remove unused variable 4192- anv: remove ReorderMode from pipeline 3DSTATE_GS emission 4193- anv: change anv_batch_emit_merge to also do packing 4194- intel/anv: batch stats util 4195- intel/decoder: implement accumulated prints 4196- anv: move all dynamic state emission to cmd_buffer_flush_dynamic_state 4197- anv: rename files to represent their usage 4198- anv: categorize partial/final pipeline instruction 4199- anv: split 3DSTATE_TE packing between static & dynamic parts 4200- anv: split 3DSTATE_VFG emission 4201- anv: add a flag tracking occlusion query count change 4202- anv: split pipeline programming into instructions 4203- vulkan/runtime: add helper to name dirty states 4204- anv: add new low level emission & dirty state tracking 4205- anv: remove unused state emission 4206- anv: split BLEND_STATE packing from BLEND_STATE_POINTERS emit 4207- docs: update Anv documentation about dynamic state emission 4208- anv: create individual logical engines on i915 when possible 4209- anv: Copy/Clear MSAA images over companion RCS while we are on compute 4210- pps-producer: add ability to select device with DRI_PRIME 4211- anv: remove aux checking asserts 4212- anv: bound image usages to the associated queue family 4213- anv: fix 3DSTATE_VFG emission 4214- anv: emit 3DSTATE_URB_ALLOC_(MESH|TASK) only when mesh shaders are enabled 4215- anv: ensure mesh pipeline have all pre-rasterization stages disabled 4216- anv: ensure partially packed instructions are emitted in the pipeline 4217- anv: fix missing 3DSTATE_SBE_MESH emission 4218- anv: fix utrace timestamp buffer copies 4219- anv: add a memcpy compute internal kernel 4220- anv: add simple shader support without a command buffer 4221- anv: move simple shaders code to its own object 4222- anv: move utrace flush out of backends 4223- anv: enable utrace timestamp buffer copies on compute engine 4224- intel: don't assume Linux minor dev node 4225- intel/ds: lock submissions to u_trace_context 4226- util/u_trace: count number of tracepoints 4227- intel/ds: track number of tracepoint timestamp copies 4228- anv/utrace: trace CPU on timestamp buffer readiness 4229- intel/ds: avoid dropping traces when running out of shared memory 4230- anv/iris: widen Wa_14015946265 to Gfx11+ 4231- anv: add missing workaround for 3DSTATE_LINE_STIPPLE 4232- iris: add missing workaround for 3DSTATE_LINE_STIPPLE 4233- intel/fs: handle ishl in surface/sampler rematerialization 4234- intel/fs: handle add3 in surface/sampler rematerialization 4235- intel/fs: switch from SIMD 1 to 8 instructions surface/sampler rematerialization 4236- anv: fix internal compute copy shader build 4237- anv: reduce working temporary memory for BVH builds 4238- anv: move bo_pool allocation flags to init caller 4239- anv: use buffer pools for BVH build buffers 4240- intel/ds: track acceleration RT commands 4241- anv: fix index buffer size programming 4242- anv: implement INTEL_DEBUG=reemit 4243- anv: add missing workaround handling in simple shader 4244- anv: fix a couple of missing input for 3DSTATE_RASTER programming 4245- anv: flag 3DSTATE_RASTER as dirty after simple shader primitive 4246- vulkan: bump headers/registry to 1.3.267 4247- anv: rename primary in container in ExecuteCommands() 4248- anv: add support for VK_EXT_nested_command_buffer 4249- anv: simplify push descriptors 4250- anv: fixup spirv cap for ImageReadWithoutFormat on Gfx12.5 4251- Revert "intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL" 4252- anv: update batch chaining to Gfx9 commands 4253- anv: workaround Gfx11 with optimized state emission 4254- u_trace: generate tracepoint index parameter in perfetto callbacks 4255- u_trace: generate tracepoint name array in perfetto header 4256- intel/ds: provide names for different events of a timeline's row 4257- anv: reuse local variable for gfx state 4258- anv: track render targets & render area changes separately 4259- anv: don't uninitialize bvh_bo_pool is not initialized 4260- anv: uninitialize queues before utrace 4261- anv: move generation shader return instruction to last draw lane 4262- anv: fix generated draws gl_DrawID with more than 8192 indirect draws 4263- anv: extract out draw call generation 4264- anv: identify internal shader in NIR 4265- anv: avoid MI commands to copy draw indirect count 4266- anv: move generation batch fields to a sub-struct 4267- util/glsl2spirv: add ability to pass defines 4268- anv: factor out host/gpu internal shaders interfaces 4269- anv: index indirect data buffer with absolute offset 4270- anv: add ring buffer mode to generated draw optimization 4271- anv: merge gfx9/11 indirect draw generation shaders 4272- anv: document the draw indirect optimization ring mode 4273- anv: fixup 32bit build of internal shaders 4274- anv: fix uninitialized use of compute initialization batch 4275- intel/fs: fix dynamic interpolation mode selection 4276- anv/meson: add missing dependency on the interface header 4277- anv: fix corner case of mutable descriptor pool creation 4278- isl: disable MCS compression on R9G9B9E5 4279- intel/fs: rerun divergence analysis prior to convert_from_ssa 4280- intel/nir/rt: fix reportIntersection() hitT handling 4281- anv: fix CC_VIEWPORT pointer dirty after blorp/simple-shaders 4282- anv: fix dirty state tracking for 3DSTATE_PUSH_CONSTANT_ALLOC 4283- intel/perf: fix querying of configurations 4284 4285Louis-Francis Ratté-Boulianne (15): 4286 4287- panfrost: Fix error in comment 4288- panfrost: Add methods to determine slice and body alignment 4289- panfrost: Add method to get size of AFBC subblocks 4290- panfrost: Precalculate stride and nr of blocks for AFBC layouts 4291- panfrost: Add panfrost_batch_write_bo 4292- panfrost: Make panfrost_resource_create_with_modifier public 4293- panfrost: Split out internal of \`panfrost_launch_grid` 4294- panfrost: Add infrastructure for internal AFBC compute shaders 4295- panfrost: Add method to get size of AFBC superblocks valid data 4296- panfrost: Add support for AFBC packing 4297- panfrost: Legalize resource when attaching to a batch 4298- panfrost: Don't force constant modifier after converting 4299- panfrost: Add debug flag to force packing of AFBC textures on upload 4300- panfrost: Add some debug utility methods for resources 4301- panfrost: Add env variable for max AFBC packing ratio 4302 4303Lucas Stach (33): 4304 4305- ci/etnaviv: update ci expectation 4306- etnaviv: move resource seqnos to level 4307- etnaviv: flush destination before executing blit 4308- etnaviv: optimize resource copies by skipping clean levels 4309- etnaviv: add helper to mark resource level as flushed 4310- etnaviv: add helper to mark resource level as changed 4311- etnaviv: add helper to transfer resource level age to another 4312- etnaviv: add helper to get TS validity 4313- etnaviv: add helper to set TS validity 4314- etnaviv: move TS meta into etna_resource_level 4315- etnaviv: add tile status buffer status into TS metadata 4316- etnaviv: optimize sampler source update 4317- etnaviv: allow sampler TS even if the resource is flushed 4318- etnaviv: keep blit destination tile status valid if possible 4319- etnaviv: optimize render resource update 4320- etnaviv: optimize transfers when whole resource level is discarded 4321- etnaviv: split etna_copy_resource_box levels parameter in src/dst 4322- etnaviv: don't allocate full resource as transfer staging 4323- etnaviv: check for valid TS as condition to create the staging resource 4324- etnaviv: reword comment about staging resource usage 4325- etnaviv: remove huge outdated comment 4326- etnaviv: move buffer range tracking into the PIPE_MAP_WRITE clause 4327- etnaviv: remove superfluous braces 4328- etnaviv: remove always true assert in etna_transfer_unmap 4329- etnaviv: remove bogus comment about replacing resource storage 4330- etnaviv: initialize VIVS_GL_BUG_FIXES 4331- etnaviv: fix read staging buffer leak 4332- Revert "ci/etnaviv: allow failure on failing test" 4333- mesa: enable NV_texture_barrier in GLES2+ (again) 4334- etnaviv: use correct blit box sizes when copying resource 4335- etnaviv: zero shared TS metadata block 4336- Revert "etnaviv: use correct blit box sizes when copying resource" 4337- mesa: add GL_APPLE_sync support 4338 4339Luigi Santivetti (1): 4340 4341- pvr: do not claim support for ASTC texture compression 4342 4343M Henning (31): 4344 4345- nv50/ir: Drop nir_jump_return handling 4346- nv50/ir: Remove ArgumentMovesPass 4347- nv50/ir: Remove Function.stackPtr 4348- nv50/ir: Remove dead loop from assignSlot 4349- nv50/ir: Remove SpillSlot 4350- nvc0: Keep nir directly in nvc0_program 4351- nv50: Keep nir directly in nv50_program 4352- nouveau: Delete nv50_ir_from_tgsi.cpp 4353- nouveau: Drop tgsi support from nv50_ir_prog_info 4354- nouveau: Drop ConverterCommon::Subroutine 4355- nouveau: Drop BuildUtil::DataArray 4356- nouveau: Drop BuildUtil::Location 4357- nouveau: Delete the nouveau_compiler tool 4358- nv/codegen: Call nir_shader_gather_info 4359- nv/codegen: Implement nir_op_fquantize2f16 4360- nvk: Remove reference to genUserClip 4361- nv/codegen: Use nir_lower_clip 4362- nv50_ir_from_nir: Use nir's lower_fpow 4363- nv/codegen: Delete OP_POW 4364- nv/codegen: Fix an uninitialized variable warning 4365- nv/codegen: Delete OP_WRSV 4366- nv/codegen: Delete OP_EXP, OP_LOG 4367- nv/codegen: Remove fragCoord variable. 4368- nv/codegen: Merge from_common into from_nir 4369- nv/codegen: Remove unused clipVertexOutput var 4370- nv50_ir_ra: Delete unused functions 4371- nv/codegen: Delete unused OP_CONSTRAINT 4372- nv/codegen: Delete periodicMask32 4373- nv/codegen: Remove Function::buildDefSets 4374- nv/codegen: Change copy-constructor call to assign 4375- nv/codegen: Delete copy and assign 4376 4377Maaz Mombasawala (2): 4378 4379- svga: Make surfaces shareable at creation. 4380- svga: Unify gmr and mob surface pool managers 4381 4382Marcin Ślusarz (16): 4383 4384- iris: avoid duplicating validation entries 4385- hasvk: remove dead code & comments related to mesh shading 4386- anv: drop support for VK_NV_mesh_shader 4387- intel/compiler: remove NV_mesh_shader support 4388- intel/compiler: remove redundant code 4389- anv: drop unused function 4390- anv: merge cases leading to the same code 4391- intel/compiler/mesh: compactify MUE layout 4392- intel/compiler,anv: put some vertex and primitive data in headers 4393- intel/compiler: load debug mesh compaction options once 4394- intel/compiler/test: fix crashes when TEST_DEBUG is set 4395- intel/compiler: add lsc_msg_desc_wcmask 4396- intel/compiler: add initial support for URB_LOGICAL_SRC_CHANNEL_MASK to lower_urb_write_logical_send_xe2 4397- intel/compiler/mesh: fix position of output URB handle for xe2 4398- intel/compiler/mesh: implement IO for xe2 4399- intel/compiler: mask GS URB handles at thread payload construction 4400 4401Marek Olšák (125): 4402 4403- Revert "ac/nir/ngg: Follow intrinsic sources when analyzing before culling." 4404- glthread: determine global locking once every 64 batches to fix get_time perf 4405- mesa: fix 38% decrease in display list performance of Viewperf2020/NX8_StudioAA 4406- freedreno,lima,zink: update CI fixes and flakes 4407- util/u_queue: fix util_queue_finish deadlock by merging lock and finish_lock 4408- util/u_queue: always enable UTIL_QUEUE_INIT_SCALE_THREADS, remove the flag 4409- radeonsi: fix a CDNA regression breaking compute 4410- glthread: sync for VDPAU sync functions 4411- radeonsi: turn sh_base[PIPE_SHADER_VERTEX] into a constant in emit_draw_packets 4412- radeonsi: restructure the loop for non-indexed multi draws 4413- radeonsi: cosmetic changes to radeon_opt_* macros 4414- radeonsi: handle draw user SGPRs as tracked registers 4415- radeonsi: update obsolete comments about compiler queues 4416- radeonsi: remove si_compute.h, move the contents into si_pipe.h 4417- radeonsi: move si_update/emit_tess_io_layout_state into si_state_shaders.cpp 4418- radeonsi: move si_emit_spi_map into si_state_shaders.cpp 4419- radeonsi: move si_emit_rasterizer_prim_state out of si_emit_all_states 4420- radeonsi: remove splitting IBs that use too much memory 4421- radeonsi: add padding to si_resource to fix Viewperf2020/catiav5test1 perf 4422- radeonsi: remove unused check_mem parameter from si_sampler_view_add_buffer 4423- radeonsi: remove the draw counter with primitive restart from the HUD 4424- radeonsi: always inline si_prefetch_shaders 4425- radeonsi: specialize si_draw_rectangle using a C++ template 4426- radeonsi: add index parameter into si_atom::emit 4427- radeonsi: split direct pm4 emission from si_pm4_emit 4428- radeonsi: move code around si_pm4_emit_state into si_pm4_emit_state 4429- radeonsi: merge pm4 state and atom emit loops into one 4430- radeonsi: add a simple version of si_pm4_emit_state for non-shader states 4431- radeonsi: handle deferred cache flushes as a state (si_atom) 4432- radeonsi: remove render condition logic from si_draw by reordering atoms 4433- radeonsi: abort when failing to upload descriptors instead of skipping draws 4434- radeonsi: rename shader_pointers state -> gfx_shader_pointers 4435- radeonsi: merge si_upload_*_descriptors into si_emit_*_shader_pointers 4436- radeonsi: convert si_gfx_resources_add_all_to_bo_list to a state atom 4437- radeonsi/ci: update gfx11 failures 4438- radeonsi: move GE_CNTL emission from si_draw into si_emit_vgt_pipeline_state 4439- radeonsi: use num_patches_per_workgroup directly in si_get_ia_multi_vgt_param 4440- radeonsi: enable shader culling by default because it helps Viewperf 4441- radeonsi: rewrite how occlusion query precision is determined for performance 4442- radeonsi: set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET on aux_context explicitly 4443- radeon_winsys: move allow_context_lost from cs_create to ctx_create 4444- winsys/amdgpu: rework how SW reset status is generated and reported 4445- radeon_winsys: add a ctx_set_sw_reset_status callback 4446- radeonsi: don't abort for descriptor failures, let the winsys handle it 4447- radeonsi: don't use threadID.yz/blockID.yz for copy_image if those are always 0 4448- radeonsi: don't use threadID.yz/blockID.yz for compute_blit if they're always 0 4449- nir: fix constant evaluation of fddx/fddy sourcing Inf & NaN constant 4450- nir/algebraic: collapse ALU opcodes sourcing NaN 4451- ac/gpu_info: add the /dev/dri/ filename into radeon_info 4452- Revert "ac: don't call ac_query_pci_bus_info from ac_query_gpu_info" 4453- ac: implement AMD_FORCE_FAMILY properly, remove SI_FORCE_FAMILY 4454- ac: document ac_shader_args::gs_vtx_offset 4455- ac: minor updates to packet documentation and definitions 4456- ac: change offsets of DMA_DATA dwords to prevent reg offset conflicts 4457- ac: improve the IB parser 4458- ac: update gfx11 shadowed register tables 4459- ac: add a standalone IB parser program 4460- ac/surface: trivial non-functional changes 4461- ac/surface: add radeon_surf::u::gfx9::uses_custom_pitch 4462- radeonsi: allow setting any index in radeon_set_sh_reg_idx 4463- radeonsi: rename uses_subgroup_info to uses_tg_size 4464- radeonsi: improve the heuristic when to use Wave32 for compute shaders 4465- radeonsi: simplify/merge emit_shader_ngg functions 4466- radeonsi: don't pass gl_Layer to PS for blit shaders 4467- radeonsi/gfx11: pass attribute ring addr via SGPR instead of memory for blits 4468- radeonsi: fix templated si_draw_rectangle callback for Navi14 4469- nir: replace undef only used by ALU opcodes with 0 or NaN 4470- nir: remove nir_op_unpack_64 handling from nir_opt_undef 4471- ac/llvm: don't convert undef to 0 because nir_opt_undef does it now 4472- meson: use llvm-config instead of cmake to fix linking errors with meson 1.2.1 4473- gallivm: fix build with LLVM 18 4474- amd/llvm: fix build with LLVM 18 4475- radeonsi: fix compute-only contexts 4476- ac/llvm: replace removed amdgcn.ldexp for LLVM 18 4477- ac/perfcounter: remove a bogus assert to fix an assertion failure on gfx11 4478- ac/llvm: set !fpmath 3.0 for llvm.sqrt 4479- ac/gpu_info: don't align IBs to the GL2 cache line size 4480- ac/llvm: fix flat PS input corruption 4481- amd: rename GFX110x to NAVI31-33 4482- ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments 4483- ac/gpu_info: pad IBs according to ib_size_alignment 4484- winsys/amdgpu: pad gfx and compute IBs with a single NOP packet 4485- Revert "radeonsi: specialize si_draw_rectangle using a C++ template" 4486- radeonsi/ci: update navi10 results 4487- gallium/util: fix GALLIUM_TESTS=1 by using cso_set_vertex_buffers_and_elements 4488- gallium/util: add more tests for compute-only contexts 4489- radeonsi: add another aux context for uploading shaders 4490- radeonsi: upload shaders via a staging buffer so as not to map VRAM directly 4491- ac/surface: don't require exact pitch for gfx6-8 tiled imports 4492- Revert "ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG" 4493- Revert "radv/amdgpu: fix alignment of command buffers" 4494- Revert "radv: fix alignment of DGC command buffers" 4495- Revert "winsys/amdgpu: pad gfx and compute IBs with a single NOP packet" 4496- Revert "ac/gpu_info: pad IBs according to ib_size_alignment" 4497- Revert "ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments" 4498- nir: sort variables by location in nir_lower_io_passes to work around a bug 4499- nir: recompute IO bases after DCE in nir_lower_io_passes 4500- nir: add dual-slot input information into load_input intrinsics 4501- nir: take dual slot input info into account when computing IO driver locations 4502- nir: gather dual slot input information 4503- nir: expose reusable linking helpers for cloning uniform loads 4504- nir: handle nir_var_mem_ubo in nir_clone_uniform_variable 4505- ac/gpu_info: split ib_alignment as ip[type].ib_alignment 4506- ac/gpu_info: move ib_pad_dw_mask into ip[] 4507- ac/gpu_info: drop the hack unifying all IB alignments 4508- ac/gpu_info: conservatively decrease IB alignment and padding to 256B 4509- ac/gpu_info: set gfx and compute IB padding to only 8 dwords 4510- winsys/amdgpu: properly pad the IB in amdgpu_submit_gfx_nop 4511- winsys/amdgpu: correctly pad noop IBs for RADEON_NOOP=1 4512- winsys/amdgpu: pad gfx and compute IBs with only 1 NOP 4513- ac/gpu_info: don't allow register shadowing with SR-IOV due to bad performance 4514- radeonsi: disable register shadowing without SR-IOV to fix bad performance 4515- winsys/amdgpu: don't send CP_GFX_SHADOW chunk if shadow address is not set 4516- radeonsi/ci: update gfx1100 results 4517- nir: split FLOAT_CONTROLS_SIGNED_ZERO_INF_NAN_PRESERVE_FP* flags 4518- nir/algebraic: use only signed_zero_preserve_* for addition by 0 patterns, etc. 4519- mesa: don't pass Infs to the shader via gl_Fog.scale 4520- radeonsi/ci: update the runner for new build scripts 4521- radeonsi/ci: enable GTF tests in the runner 4522- radeonsi/ci: enable GLES CTS in the runner 4523- radeonsi/ci: update failures and flakes 4524- amd/common: update DCC for gfx11.5 4525- radeonsi: initialize perfetto in the right place 4526- radeonsi/gfx11: don't set OREO_MODE to fix rare corruption 4527- nir: fix gathering TESS_LEVEL_INNER/OUTER usage with lowered IO 4528 4529Marek Vasut (1): 4530 4531- etnaviv: Fully replicate back stencil config 4532 4533Mark Collins (10): 4534 4535- tu/a7xx: Adapt r3d blits for A7xx 4536- freedreno/rnn: Remove %n usage in fprintf 4537- freedreno: Only add drm/computerator when system_has_kms_drm 4538- freedreno/decode: Support building replay for multiple KMDs 4539- freedreno+meson: Add lua+libarchive+libxml from Meson WrapDB 4540- meson: Warn about side-effects from DRM for FD KMDs 4541- meson: Update libarchive to v3.7.2-2 4542- freedreno/common: Add max_sets property to A6xxGPUInfo 4543- tu: Support higher descriptor set count for A7XX 4544- tu,util/driconf: Add option to not reserve descriptor set 4545 4546Mark Janes (1): 4547 4548- intel: allow reduced memory usage for INTEL_MEASURE 4549 4550Martin Roukala (né Peres) (22): 4551 4552- radv/ci: drop the auto-reboot-on-hang for vkcts-navi10 4553- radv/ci: use the default kernel on vkcts-navi10 4554- zink/ci: automatically reboot when hitting a kernel BUG on vangogh 4555- zink/ci: document more flakes seen on vangogh 4556- radv/ci: move vkcts-navi10 testing to KWS 4557- radv/ci: add more tests to the navi10 vkcts flake list 4558- radv/ci: increase the parallelism of the vkcts-navi21 job 4559- radv/ci: add more tests to the navi21 vkcts flake list 4560- radv/ci/vkcts-navi21: catch all the line_stipple_(enable|params) flakes 4561- radv/ci/vkcts-navi21: document more flakes 4562- radv/ci/vkcts-navi10: catch all the line-related flakes 4563- radv/ci: update the vkcts gfx1100 flake/fail lists 4564- radv/ci: add a manual job to run vkcts on navi31 4565- radv/ci: add a manual job for vkd3d-proton on navi31 4566- ci/vkcts-vangogh: mark dEQP-VK.dynamic_rendering.primary_cmd_buff.basic.* as flake 4567- ci/vkcts-navi21: mark more of the RT handles checks as flakes 4568- ci: make B2C_JOB_VOLUME_EXCLUSIONS to all .b2c-test jobs 4569- zink/ci: remove 19 tests from the zink-radv-polaris10-fails list 4570- ci/b2c: switch containers to a back-up ahead of valve-infra renaming 4571- zink/ci: remove 42 tests from the zink-radv-polaris10-fails list 4572- radv/ci: tighten the vkcts-navi21 timeouts 4573- zink/ci: tighten the zink-radv-vangogh timeouts 4574 4575Martin Stransky (1): 4576 4577- llvmpipe: fix UAF in lp_scene_is_resource_referenced. 4578 4579Mary (6): 4580 4581- nouveau/mme: Add initial Fermi definition 4582- nouveau/mme: Add Fermi builder 4583- nouveau/mme: Add Fermi simulator 4584- nouveau/mme: Add Fermi hardware tests 4585- agx: Move nir_lower_fragcolor out of agx_preprocess_nir 4586- agx: Ensure to lower 1D image load/store to 2D 4587 4588Mary Guillemard (4): 4589 4590- nir: Add NVIDIA-specific geometry shader opcodes 4591- venus: skip bind sparse info when checking for feedback query 4592- zink: Check for VK_EXT_extended_dynamic_state3 before setting A2C 4593- venus: Do not submit batch manually when no feedback is required 4594 4595Matt Coster (21): 4596 4597- pvr: Pad rogue_regarray_cache_key union members to avoid UB 4598- pvr: Clean up extension tables 4599- pvr: Refactor pvr_GetPhysicalDeviceProperties2() 4600- docs: Fixup imagination/pvr extension support 4601- pvr: Add VK_KHR_get_display_properties2 4602- pvr: Add VK_KHR_get_memory_requirements2 4603- pvr: Add VK_KHR_get_surface_capabilities2 4604- pvr: Print VkStructureType name on pvr_debug_ignored_stype() 4605- pvr: Add VK_KHR_copy_commands2 4606- pvr: Don't override commands copied to new buffer when extending cs 4607- pvr: Do not require TA_STATE_HEADER.pres_ispctl_dbsc for {db,sc}enable 4608- pvr: Zero tail of cs buffers after linking when dumping cs 4609- pvr: Cleanup comments in pvr_physical_device_get_supported_*() 4610- pvr: Don't rely on GNU void pointer arithmetic 4611- pvr: Force compile error on GNU void pointer arithmetic 4612- pvr: Switch to common pipeline cache implementation 4613- pvr: Use vk_sampler base 4614- pvr: Clean up & fix sampler border color support 4615- pvr: Don't pass pvr_physical_device when only device info is needed 4616- pvr: Minor refactor of pvr_device.c 4617- pvr: Use common physical device properties 4618 4619Matt Turner (10): 4620 4621- Revert "intel/fs: only avoid SIMD32 if strictly inferior in throughput" 4622- intel: Rearrange for next commit 4623- intel: Consider with_intel_clc in with_any_intel 4624- intel: Only build blorp if drivers are enabled 4625- intel: Only build ds if drivers are enabled 4626- intel: Only build perf if drivers or tools are enabled 4627- intel: Allow using intel_clc from the system 4628- intel: Limit Intel Vulkan RT to x86_64 4629- r600: Add missing dep on git_sha1.h 4630- util: Include stdint.h in libdrm.h 4631 4632Mauro Rossi (7): 4633 4634- nouveau/ws: fix building error in nouveau_ws_push_dump() 4635- vulkan/meta: fix gnu-empty-initializer build error 4636- nouveau/mme: fix print inst for case MME_FERMI_OP_MERGE 4637- anv/android: remove numFds check 4638- hasvk/android: remove numFds check 4639- Android.mk: filter out cflags to build with Android 14 bundled clang 4640- Android.mk: disable android-libbacktrace to build with Android 14 4641 4642Mike Blumenkrantz (293): 4643 4644- ci: bump VVL to 1.3.257 4645- zink: set pipeline dynamic state count after all dynamic states are set 4646- zink: set feedback attachments on batch init 4647- zink: be even dumber about buffer refs when replacing storage 4648- zink: emit SpvCapabilitySampleMaskPostDepthCoverage with SpvExecutionModePostDepthCoverage 4649- zink: fix the fix for separate shader program refcounting 4650- kopper: handle pixmap creation failure more gracefully 4651- glxsw: check geometry of drawables on creation 4652- kopper: move pixmap param for drawable creation to info struct 4653- glx/dri3: split out modifier check 4654- glx/sw: check for modifier support in the kopper path 4655- kopper: pass modifier availability to drawable creation 4656- kopper: determine modifier support per-drawable 4657- zink: don't clobber descriptor mode on multiple screen creation 4658- nir: fix slot calculations for compact variables with location_frac 4659- lavapipe: use the component offset directly for xfb 4660- nir: add a helper for calculating variable slots 4661- radv: bump max xfb output to 128 4662- ir3: bump max xfb output to 128 4663- gallium: bump PIPE_MAX_SO_OUTPUTS to 128 4664- zink: add feedback loop exts to optimal profile 4665- glsl: only explicitly check GS components in PSIZ injection with output variables 4666- lavapipe: statically allocate fb attachment array 4667- lavapipe: zero fb attachment array at rp start 4668- lavapipe: don't check geometry for fb attachments 4669- lavapipe: be slightly more permissive for bad apps (and cts) with dynrender 4670- lavapipe: VK_EXT_host_image_copy 4671- zink: better handle separate shader dsl creation when no bindings exist 4672- zink: force image barriers after dmabuf import 4673- ci: bump VVL to 1.3.261 4674- zink: use VK_WHOLE_SIZE when binding null db buffer descriptors 4675- zink: unset line stipple ds3 state flags when stipple not available 4676- nir/lower_io_to_scalar: fix 64bit io splitting 4677- nir/linking_helpers: force type matching in does_varying_match 4678- nir/print: print location names for (some) tess slots 4679- nir/print: always group variables by type when printing 4680- zink: add batch refs for transient images 4681- zink: fix zs resolve attachment indexing 4682- zink: don't add VK_IMAGE_USAGE_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT for transient images 4683- zink: don't append msrtss to dynamic render if not supported 4684- zink: set msrtss depth resolve mode when enabled 4685- zink: hook up VK_KHR_workgroup_memory_explicit_layout 4686- zink: propagate have_workgroup_memory_explicit_layout to ntv 4687- zink: use SPV_KHR_workgroup_memory_explicit_layout when available 4688- zink: add more locking for pipeline cache 4689- zink: add VK_PIPELINE_CACHE_CREATE_EXTERNALLY_SYNCHRONIZED_BIT_EXT 4690- aux/trace: fix winsys handle dumping 4691- zink: generated tcs is on the tes, not the vs 4692- zink: apply ZINK_DEBUG=noopt to linked separate shaders 4693- gallivm: handle A8_UNORM image stores 4694- llvmpipe: enable A8_UNORM for shader images 4695- llvmpipe: export PIPE_CAP_IMAGE_LOAD_FORMATTED 4696- lavapipe: GetRenderingAreaGranularityKHR 4697- llvmpipe: block weird uses of subsampled formats in buffers 4698- llvmpipe: fix early depth + alpha2coverage + occlusion query interaction 4699- lavapipe: fix BindVertexBuffers2 buffer size handling 4700- lavapipe: fix resolves where src image has a layer offset 4701- lavapipe: block yuv formats from getting blit feature flags 4702- lavapipe: BindIndexBuffer2 4703- lavapipe: GetDeviceImageSubresourceLayoutKHR 4704- lavapipe: VK_REMAINING_ARRAY_LAYERS for copy ops 4705- lavapipe: maintenance5 4706- zink: fix xfb buffer array sizing to use buffer limit, not output 4707- zink: move ZINK_DEBUG=nir printing to just before compile 4708- draw: fix so debug offset printing 4709- zink: reindex ssa defs before dumping debug shaders 4710- lavapipe: zero-init pipe_sampler_state 4711- zink: explicitly set non-optimal last_vertex_stage shader key on ctx create 4712- zink: fix big tcs output io 4713- zink: don't try to replace separate shader prog in noopt mode 4714- zink: pre-convert mode in fixup_io_locations 4715- zink: add a special separate shader i/o mode for legacy variables 4716- nir: minor fixes for io_to_scalar 4717- nir/lower_io: add a new doubles-only 64bit lowering option 4718- nir: add a filter cb to lower_io_to_scalar 4719- d3d10umd: use cso_context to set vertex buffers and elements 4720- virgl: move virgl_vertex_elements_state to header 4721- virgl: fix some indentation 4722- nouveau: calloc vertex csos 4723- gallium: move vertex stride to CSO 4724- zink: fix null config screen creation 4725- zink: fix crash in lower_pv_mode_gs_store 4726- u/draw: skip zero-sized indirect draws 4727- lavapipe: handle VkPipelineCreateFlagBits2KHR 4728- lavapipe: handle VkBufferUsageFlags2KHR 4729- zink: ci updates 4730- zink: track start/stop of a couple query types 4731- zink: require EDS1 for CWE usage 4732- zink: unset primgen suspended flag when ending a primgen query 4733- zink: rework rast-discard for primgen queries 4734- zink: rip out some awkward parts of the old non-cwe path 4735- zink: drop CWE requirement for renderpass tracking with primgen queries 4736- nir/zink: fix gs emulation xfb_info sizing 4737- zink: move fragcolor lowering further along the compile process 4738- zink: add a mode param to find_var_with_location_frac 4739- zink: use lowered io (kinda) for i/o vars 4740- zink: stop lowering indirect derefs 4741- ntt: handle interp intrinsics as derefs 4742- zink: delete split_blocks pass 4743- zink: delete lower_64bit_vertex_attribs pass 4744- zink: fix clip/cull dist xfb inlining 4745- zink: delete all the extra gross xfb handling 4746- zink: stop using pipe_stream_output 4747- zink: remove pipe_stream_output from function params 4748- zink: ci updates 4749- aux/trace: print bindless handles as pointers 4750- zink: remove unused param from create_ici 4751- zink: split create_ici to init and eval 4752- zink: add maintenance extensions to profile 4753- zink: use maintenance5 4754- zink: use real A8_UNORM when possible 4755- vk/graphics: fix CWE handling with DS3 4756- Revert "vk/wsi/x11: handle geometry updating more asynchronously" 4757- r600: store the mask of buffers used by a vertex state 4758- r600: better tracking for vertex buffer emission 4759- zink: wait on async fence during ctx program removal 4760- zink: handle patch variable locations for separate shaders better 4761- zink: don't start multiple cache jobs for the same program 4762- zink: use the "set" optimal key for prog last_variant_hash for consistency 4763- zink: sanitize optimal keys 4764- zink: copy some cs shader properties to the program struct 4765- zink: handle global atomic intrinsics 4766- zink: use Aligned with global load/store ops 4767- zink: fix rewrite_read_as_0 filtering 4768- rusticl: fixes for zink shader images 4769- zink: pass KERNEL shaders through successfully 4770- zink: add a618 flake 4771- zink: break out ds3 state resetting 4772- zink: be consistent with ds3 state resetting for blits 4773- zink: fix optimal_keys warning message 4774- zink: force-reset unordered flags for buffer barriers on non-matching batch access 4775- zink: reset unordered flags for image barriers on non-matching batch access 4776- zink: make image barrier init functions void return 4777- zink: simplify some image barrier conditionals 4778- zink: remove sync TODO 4779- zink: add lavapipe flake 4780- ci: disable nouveau shaderdb 4781- egl/dri3: only set driver_name if not already set 4782- egl: call dri3_x11_connect() for zink 4783- egl: bind dri2_set_WL_bind_wayland_display for zink when necessary 4784- zink: be more precise about flagging rp changes around unordered u_blitter 4785- zink: don't block reordering during ref updates in unordered blits 4786- lavapipe: update vbo indices before propagating stride 4787- lavapipe: fix pipeline stride propagation 4788- zink: fix linear modifier dmabuf imports 4789- zink: polaris ci updates 4790- aux/tc: handle stride mismatch during rp-optimized subdata 4791- zink: always add a per-prog ref for gpl libs 4792- zink: use a pointer to simplify submit struct mechanics 4793- zink: make zink_resource_image_barrier2_init public 4794- zink: add a third submitinfo (unused for now) 4795- zink: make submitinfo handling easier to manage with enum 4796- zink: add another submitinfo for fd semaphore waits 4797- zink: add a screen cache for fd semaphores 4798- zink: add a util for getting cached fd semaphores 4799- zink: hook up cached fd semaphore usage for batch signal/waits 4800- zink: handle implicit sync for dmabufs 4801- zink: handle multi-plane implicit sync 4802- zink: ci updates 4803- zink: set is_xfb=false for all i/o variables 4804- zink: reorder bindless io lowering 4805- zink: fix typing on bindless io lowering 4806- zink: delete some bindless io lowering code 4807- zink: use nir_io_semantics::num_slots for indirect var creation 4808- zink: simplify an arrayed io check during variable creation 4809- zink: use explicit stride from types instead of copying old_var stride 4810- zink: use MAX_PATCH_VERTICES directly for arrayed io var sizing 4811- zink: use explicit sizing for builtins when creating variables 4812- zink: create new vars without copying existing ones 4813- zink: add a new linker pass to handle mismatched i/o components 4814- zink: use right function to get src_type in eliminate_io_wrmasks 4815- zink: re-rework i/o variable handling to make having variables entirely optional 4816- ci: bump VVL to 1.3.263 4817- zink: simplify redundant is_buffer check 4818- zink: use VkFormatProperties3 4819- lavapipe: handle VkHostImageCopyDevicePerformanceQueryEXT 4820- lavapipe: don't advertise UNDEFINED layout for HIC 4821- zink: hook up VK_EXT_host_image_copy 4822- zink: move mem type detection up in file 4823- zink: disable HIC without resizable BAR 4824- zink: add a fixup method for extra driver props 4825- zink: fix some off-by-one indentation 4826- zink: use some return codes for check_ici errors 4827- zink: check/use suboptimal HIC during ici init 4828- zink: use HIC for image subdata when possible 4829- zink: slightly refactor psiz deletion during linking 4830- zink: delete all psiz=1.0 stores if maintenance5 is present 4831- nir/inline_uniforms: fix oob access with nir_find_inlinable_uniforms 4832- zink: add ZINK_DEBUG=quiet 4833- zink: imply ZINK_DEBUG=quiet if ZINK_DEBUG=optimal_keys is set on turnip 4834- zink: set optimal_keys for turnip jobs 4835- aux/tc: fix staging buffer sizing for texture_subdata 4836- aux/tc: fix address calc for segmented texture subdata 4837- zink: ci updates 4838- lavapipe: KHR_map_memory2 4839- zink: slightly refactor pipeline compile selection 4840- zink: add a flag for combined pipeline compile for doing FAIL_ON_PIPELINE_COMPILE_REQUIRED 4841- zink: remove an intermediate variable in pipeline compile selection 4842- zink: use FAIL_ON_PIPELINE_COMPILE_REQUIRED for GPL path 4843- zink: pass a stage mask to pipeline create functions 4844- glsl: check for xfb setting xfb info 4845- zink: don't warn about missing scalarBlockLayout on v3dv 4846- aux/tc: fix renderpass tracking fb state clobber scenario 4847- vk/enum2str: add more max enum vendors 4848- aux/tc: fix rp info handling around tc_sync calls 4849- aux/tc: don't use pipe_buffer_create_with_data() for rp-optimized subdata 4850- zink: flag db maps as unsynchronized 4851- lavapipe: clamp cache uuid size 4852- lavapipe: EXT_load_store_op_none 4853- tu: handle unused color attachments without crashing 4854- zink: use much bigger dummy surfaces 4855- zink: propagate rp_tc_info_updated across unordered blits 4856- zink: use null attachments for null attachments with dynamic render 4857- egl/swrast: expose EXT_swap_buffers_with_damage and EXT_present_opaque 4858- egl/wayland: split out wl drm extension init 4859- egl/wayland: use more registry listeners to better handle device init 4860- egl/wayland: enable WL_bind_wayland_display for zink 4861- zink: delete injected pointsize during shader creation 4862- zink: require maintenance5 for shobj 4863- zink: delete a non-maintenance5 workaround for shobj use 4864- lavapipe: set separate_shaders for shader objects 4865- zink: set workgroup_memory_explicit_layout for shader validation 4866- zink: add a ZINK_DEBUG=validation alias 4867- zink: fix semaphore signal ordering 4868- zink: move swapchain fence to swapchain object 4869- zink: avoid UAF on wayland async present with to-be-retired swapchain 4870- zink: always trace_screen_unwrap in acquire 4871- lavapipe: fix variable descriptor count support handling 4872- lavapipe: always set independent blend 4873- lavapipe: more vertex stride fixups 4874- lavapipe: set default viewport and scissor count for cmdbufs 4875- lavapipe: set default min sample shading to 1 4876- glx: XFree visual info 4877- radv: fix external handle type queries for dmabuf/fd 4878- zink: fix crashing in image rebinds 4879- zink: move push descriptor disable to driver workarounds 4880- zink: move v3dv scalarBlockLayout workaround 4881- zink: fix end-of-batch barrier pipeline stages 4882- zink: guarantee egl syncobj lifetime 4883- aux/trace: dump enum names for map usage 4884- gallium: add PIPE_MAP_NONE 4885- Revert "egl/wayland: Add image loader extension for swrast" 4886- egl/wayland: don't block in swrast when updating buffers for zink 4887- egl/wayland: return sooner from swrast_update_buffers() if zink 4888- zink: don't check submit count for unflushed usage 4889- egl: don't set ForceSoftware for all zink loading 4890- zink: error at handle export on missing EXT_image_drm_format_modifier 4891- gbm: delete some zink handling 4892- zink: apply ZINK_DEBUG=quiet to all missing feature warnings 4893- zink: set ZINK_DEBUG=quiet for polaris jobs 4894- lavapipe: don't block begin/end cmdbuf pipeline barriers 4895- ci: add a630 trace flakes 4896- zink: shrink vectors during optimization 4897- zink: always clamp shader stage in descriptor handling 4898- zink: add set_global_binding 4899- zink: eliminate samplers from no-sampler CL texops 4900- zink: add some checks to determine whether queue is init on screen destroy 4901- zink: don't destroy any simple_mtx_t objects during screen destroy 4902- zink: don't destroy uninitialized disk cache thread 4903- zink: reorder glsl_type_singleton_init_or_ref call 4904- zink: use screen destructor for creation fails 4905- zink: fix readback_present locking 4906- zink: add automatic swapchain readback using heuristics 4907- lavapipe: VK_EXT_nested_command_buffer 4908- zink: ignore unacquired swapchain images during end-of-frame flush 4909- nir/lower_fragcolor: preserve location_frac 4910- zink: update pointer for GPL pipeline cache entry formats 4911- zink: fix legacy depth texture rewriting for single component reads 4912- egl: unify dri2_egl_display creation 4913- egl: init dri3 version info during screen creation 4914- egl/glx: don't load non-sw zink without dri3 support 4915- egl: add automatic zink fallback loading between hw and sw drivers 4916- glx: add automatic zink fallback loading between hw and sw drivers 4917- ci: don't set GALLIUM_DRIVER for zink 4918- egl/wayland: only add more registry listeners for hardware devices 4919- zink: only increment image_rebind_counter on image export if binds exist 4920- zink: check for sampler view existence during zink_rebind_all_images() 4921- zink: use weston for anv ci 4922- zink: blow up broken xservers more reliably 4923- zink: delete some dead modifier handling 4924- ci: skip implicit modifier piglits for zink 4925- zink: don't block large vram allocations 4926- zink: add copy box locking 4927- zink: emit SpvCapabilitySampleRateShading with SampleId 4928- zink: always set VK_EXTERNAL_MEMORY_HANDLE_TYPE_HOST_ALLOCATION_BIT_EXT for usermem 4929- zink: clamp resolve extents to src/dst geometry 4930- zink: only emit xfb execution mode for last vertex stage 4931- aux/u_transfer_helper: set rendertarget bind for msaa staging resource 4932- zink: unset explicit_xfb_buffer for non-xfb shaders 4933- mesa/st/texture: match width+height for texture downloads of cube textures 4934- zink: add more locking for compute pipelines 4935- radv: correctly return oom from the device when failing to create a cs 4936- zink: check for cbuf0 writes before setting A2C 4937 4938Mohamed Ahmed (19): 4939 4940- vulkan/util: Support 10-bit and 12-bit color formats in ycbcr_info in vk_format.c 4941- vulkan/util: Support VK_EXT_ycbcr_2plane_444_formats color formats in vk_format.c 4942- vulkan/util: Use ycbcr_info for multiplane helpers in vk_format.c 4943- nvk: implement vkGetDeviceImageMemoryRequirementsKHR() 4944- nvk: add stub for vkGetDeviceImageSparseMemoryRequirementsKHR() 4945- nvk: implement vkGetDeviceBufferMemoryRequirementsKHR() 4946- nvk: advertise VK_KHR_maintenance4 4947- nvk: advertise DemoteToHelperInvocation 4948- nvk: Enable multiplane images and image views 4949- nouveau/nvk: Add YCbCr sampler NIR lowering pass 4950- nouveau/nvk: Support multi-plane descriptors in nvk_nir_lower_descriptors.c 4951- nouveau/nvk: Create helper function for sampler creation 4952- nouveau/nvk: Add multiple sampler planes for CONVERSION_SEPARATE_RECONSTRUCTION_FILTER_BIT 4953- nouveau/nvk: Enable VK_KHR_sampler_ycbcr 4954- util/format: Add G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM formats 4955- vulkan/format: Translate G8B8G8R8_422_UNORM and B8G8R8G8_422_UNORM properly 4956- nvk: Enable SEPARATE_RECONSTRUCTION_FILTER_BIT for multi-planar formats only 4957- nvk: Enable MIDPOINT_CHROMA_SAMPLES_BIT for multi-planar formats only 4958- nil: Add support for G8B8_G8R8_UNORM and B8G8_R8G8_UNORM 4959 4960Nanley Chery (33): 4961 4962- iris: Remap DRM_FORMAT_MOD_INVALID more often during import 4963- anv: Don't support ASTC images with modifiers 4964- intel: Add and use isl_drm_modifier_get_plane_count 4965- anv: Handle explicit surface layout of DG2_RC_CCS 4966- anv: Reduce accesses of isl_mod_info->aux_usage 4967- iris: Reduce accesses of mod_info->aux_usage 4968- crocus: Delete modifier with aux code 4969- hasvk: Delete modifier with aux code 4970- iris: Swap stencil and modifier aux assignment order 4971- intel: Describe modifier compression with booleans 4972- intel/isl: Move the Tile4 modifier score case down 4973- intel/isl: Add a score for DG2_RC_CCS 4974- intel/blorp: Ambiguate after CCS resolves on gfx7-8 4975- iris: Reorder render_aux_usage parameters 4976- iris: Pass the render format to prepare_render 4977- iris: Create BLORP surfaces after resource preparation 4978- iris: Handle clear color compatibility in prepare_render 4979- iris: Sample more texture view fast-clears on gfx11+ 4980- iris: Fix aux usage tracking in prepare_render 4981- iris: Fix iris_copy_region calls involving FCV_CCS_E 4982- iris: Drop get_copy_region_aux_settings 4983- iris: Inline iris_can_sample_mcs_with_clear 4984- anv: Initialize the clear color more often for FCV 4985- intel: Return a bool from intel_aux_map_add_mapping 4986- anv: Move scope of CCS binding determination 4987- anv: Allocate space for aux-map CCS in image bindings 4988- anv: Wrap aux surface image binding queries 4989- anv: Refactor CCS disabling at image bind time 4990- anv: Place images into the aux-map when safe to do so 4991- anv: Loosen anv_bo_allows_aux_map 4992- anv: Meet CCS alignment reqs with dedicated allocs 4993- anv: Delete implicit CCS code 4994- intel/isl: Add scores for GEN12_RC_CCS and MTL_RC_CCS 4995 4996Neal Gompa (1): 4997 4998- asahi: Fix 32-bit x86 build with correct data type for overflow error message 4999 5000Neha Bhende (1): 5001 5002- ntt: lower indirect tesslevels in ntt 5003 5004Paul Gofman (2): 5005 5006- driconf: add a workaround for Captain Lycop: Invasion of the Heters 5007- driconf: add a workaround for Rainbow Six Extraction 5008 5009Paulo Zanoni (15): 5010 5011- anv: rename the vm_bind vfuncs 5012- anv: add a new vm_bind vfunc 5013- anv/xe: make vm_binds async 5014- anv/xe: return failure in case waiting for the vm_bind syncobj fails 5015- anv: remove misleading comment about batch_len 5016- iris: assert bufmgr->bo_deps_lock is held 5017- iris: avoid stack overflow in iris_bo_wait_syncobj() 5018- iris: assert(bo->deps) after realloc() 5019- intel/isl: add ISL_SURF_USAGE_SPARSE_BIT 5020- intel/isl: simplify the check for maximum surface size 5021- anv/sparse: add the initial code for Sparse Resources 5022- anv/sparse: get ready to issue a single vm_bind ioctl per non-opaque bind 5023- anv/sparse: add INTEL_DEBUG=sparse 5024- anv: enable sparse resources by default 5025- vulkan: fix potential memory leak in create_rect_list_pipeline() 5026 5027Pavel Ondračka (44): 5028 5029- r300: update RV370 failures 5030- r300: check for index overflow when translating from TGSI 5031- r300: source register index is always unsigned 5032- r300: bump the RC_MAX_INDEX_BITS 5033- r300: normal instruction can't have presubtract op 5034- r300: add a helper for checking number of temporary sources 5035- r300: cycles estimate for shader-db 5036- r300: fix cycles calculation 5037- r300: don't abort on flow control when using draw for vs 5038- r300: add dEQP baseline for RV370 with forced swtcl 5039- r300: copy ntt to r300 compiler 5040- r300: add lower_sqrt to nir option 5041- r300: remove unused intrinsics in ntr 5042- r300: remove irrelevant opcodes in ntr 5043- r300: remove unused integer support in ntr 5044- r300: remove ntr_tgsi_usage_mask 5045- r300: remove more unused 64-bit pieces from ntr 5046- r300: simplify vectorization rules 5047- r300: remove more ntr unused helpers 5048- r300: remove the unneeded ntr_lower_vec_to_reg callback 5049- r300: remove unneeded 64bit and atomic lowering passes 5050- r300: remove unused ntr default settings 5051- r300: remove ntr default options 5052- r300: simplify ntr_emit_load_ubo 5053- r300: simplify ntr_emit_load_input 5054- r300: remove some virglrenderer specifics from ntr 5055- r300: simplify ntr_setup_uniforms 5056- r300: simplify ntr_output_decl 5057- r300: simplify ntr_try_store_in_tgsi_output 5058- r300: remove some unsupported texture opcodes 5059- r300: remove unused barrier code from ntr 5060- r300: simplify ntr_get_gl_varying_semantic 5061- r300: remove the nrt main optimization loop 5062- r300: reorder for easier presubtract 1-x pattern recognition 5063- r300: exit early in presubtract is not supported 5064- r300: implement bias presubtract 5065- r300: convert x * 2 into x + x for presubtract 5066- r300: move power of two multipliers down 5067- r300: there is no limitation on presubtract source file 5068- r300: use w channel for scalar opcodes if possible 5069- r300: reduce number of iterations for vertex shader loops 5070- r300: enable nir_move_vec_src_uses_to_dest 5071- nir/move_vec_src_uses_to_dest: skip reuse if vec is used only once in store_output 5072- nir/move_vec_src_uses_to_dest: allow to skip reuse of constant sources 5073 5074Philipp Zabel (1): 5075 5076- etnaviv: fix segfault after compile failure 5077 5078Pierre-Eric Pelloux-Prayer (18): 5079 5080- radeonsi/sdma: use multiple commands if required 5081- radv/sdma: use multiple commands if required 5082- radv/sdma: use correct limits for gfx10.3 5083- glx: drop the 'libGL' log prefix 5084- loader: refactor DRI_PRIME handling code 5085- loader: extend DRI_PRIME to support =N 5086- loader: add DRI_PRIME_DEBUG env var 5087- device_select_layer: support DRI_PRIME=n 5088- docs: update DRI_PRIME documentation 5089- device_select: add shortcut for MESA_VK_DEVICE_SELECT_FORCE_DEFAULT_DEVICE 5090- st/mesa: check renderbuffer before using it 5091- radeonsi: emit framebuffer state after allocating cmask 5092- amd/common: update addrlib for gfx11.5 5093- amd/common: add registers for gfx11.5 5094- ac/nir: extract must_wait_attr_ring helper 5095- amd, radeonsi: Add code to enable gfx11.5 5096- mesa: restore call to _mesa_set_varying_vp_inputs from set_vertex_processing_mode 5097- radeonsi: check sctx->tess_rings is valid before using it 5098 5099Piotr Kocia (2): 5100 5101- nir: Remove dead nir_const_value variables 5102- glsl: ir_function_param_visitor::visit_enter always true condition 5103 5104Qiang Yu (77): 5105 5106- aco,radv: replace tess_input_vertices shader info param 5107- radeonsi: aco does not pass LS outputs to HS by arg 5108- radeonsi: extract si_get_prev_stage_nir_shader to be shared with aco 5109- radeonsi: init aco shader info for merged LS/HS 5110- radeonsi: simplify si_build_wrapper_function 5111- radeonsi: move vertex shader vb desc input sgpr args to last 5112- radeonsi: remove param type check in wrapper function 5113- radeonsi: refine si_llvm_ls_build_end 5114- radeonsi: refine si_llvm_es_build_end 5115- radeonsi: aco compile support merged mono shader 5116- radeonsi: calculate lds size for merged shaders 5117- radeonsi: enable aco compile for mono merged LS/HS 5118- radeonsi: enable aco compile for mono merged ES/GS 5119- aco: extract aco_compile_shader_part from aco_compile_ps_epilog 5120- aco: add p_end_with_regs pseudo instruction 5121- aco: move jump to epilog out of ic_merged_wave_info 5122- aco: add tcs end regs for epilog usage 5123- aco: allow tcs with epilog to keep nir store output instruction 5124- aco: add pending_lds_access option for insert waitcnt 5125- aco: add tcs epilog generation for radeonsi 5126- aco: don't emit s_endpgm for tcs with epilog 5127- aco: skip scratch init when no scratch arg provide 5128- aco,radeonsi: save const addr to symbol 5129- ac/nir/tess: move tess factor output out of control flow 5130- aco: use semantic location as io temp index 5131- radeonsi: add exec_size to shader binary 5132- radeonsi: support upload multi part shader binary 5133- radeonsi: share si_get_tcs_out_patch_stride with aco 5134- radeonsi: fill part mode tcs aco shader info 5135- radeonsi: extract si_llvm_build_shader_part 5136- radeonsi: remove separate_prolog arg from prolog/epilog build 5137- radeonsi: add si_get_tcs_epilog_args 5138- radeonsi: change si_fill_aco_options args 5139- radeonsi: add si_aco_build_shader_part 5140- radeonsi: part mode standalone tcs support aco compile 5141- radeonsi: remove unused arg of get_tcs_tes_buffer_address 5142- aco: simplify setup_tcs_info 5143- aco: pass sw_stage when setup_isel_context 5144- aco: prepare fix_ls_vgpr_init_bug to be used by gl vs prolog 5145- aco: add vs prolog instruction selection for radeonsi 5146- aco: add aco compile interface for radeonsi vs prolog 5147- aco: do not fix_exports when program is prolog 5148- radeonsi: fill aco_shader_info->is_monolithic 5149- radeonsi: remove is_monolithic from vs prolog key 5150- radeonsi: extract si_get_vs_prolog_args to be shared with aco 5151- radeonsi: fix aco options has_ls_vgpr_init_bug setup 5152- radeonsi: add vs prolog aco build 5153- radeonsi: set vs has prolog aco shader info 5154- radeonsi: enable aco compile for part mode standalone vs 5155- aco,radv,radeonsi: rename is_monolithic to merged_shader_compiled_separately 5156- ac,radeonsi: move ps arg pos_fixed_pt to ac_shader_args 5157- aco: do not eliminate final exec write when p_end_with_regs block 5158- aco: remove p_end_with_regs from needs_exact() 5159- aco: add ps prolog generation for radeonsi 5160- aco: handle ps outputs from radeonsi 5161- aco: add create_fs_end_for_epilog for radeonsi 5162- aco,radv: remove unused ps epilog info fields 5163- aco,radv: rename ps epilog info inputs to colors 5164- aco: simplify export_fs_mrt_color 5165- aco,radv: add radeonsi spec ps epilog code 5166- aco: compact ps expilog color export for radeonsi 5167- aco,radv,radeonsi: pass spi ps input ena and addr 5168- aco: do not fix_exports when program has epilog 5169- aco: fix assertion fail when program contains empty block 5170- aco: create exit block for p_end_with_regs to branch to 5171- aco: wait memory ops done before go to next shader part 5172- radeonsi: reduce sgpr count for scratch_offset when aco 5173- radeonsi: init spi_ps_input_addr for part mode ps 5174- radeonsi: extract si_prolog_get_internal_binding_slot 5175- radeonsi: extract si_get_ps_prolog_args to be shared with aco 5176- ac,radeonsi: remove unused ps prolog key fields 5177- radeonsi: add ps prolog shader part build 5178- radeonsi: extract si_get_ps_epilog_args to be shared with aco 5179- radeonsi: fill aco shader info for ps part 5180- radeonsi: add ps epilog shader part build 5181- radeonsi: enable aco compile for part mode ps 5182- radeonsi: disable disk cache when use aco 5183 5184Rebecca Mckeever (32): 5185 5186- vulkan/runtime: Add helper functions for VK_EXT_host_image_copy 5187- nouveau/codegen: Support nir_intrinsic_load_workgroup_id_zero_base 5188- nouveau/codegen: Set lower_device_index_to_zero 5189- nvk: Convert system values for gl_PointCoord and PointCoord into inputs 5190- nvk: Add base_group to root descriptor table 5191- nvk: Lower base_workgroup_id 5192- nvk: Implement nvk_CmdDispatchBase and delete nvk_CmdDispatch 5193- nvk: Advertise KHR_device_group 5194- nvk: Add VK_FORMAT_B4G4R4A4_UNORM_PACK16 format to nil_format_info table 5195- nvk: Add A4B4G4R4 formats to nil_format_info table 5196- nvk: Advertise EXT_4444_formats 5197- nvk: Enable shadow sampling 5198- nvk: Implement VK_EXT_non_seamless_cube_map 5199- nouveau/nil: Add macros for ufixed 5200- nvk: Implement VK_EXT_image_view_min_lod 5201- nvk: Update mutable descriptor struct type 5202- nvk: Replace asserts with conditional that sets type_list = NULL 5203- nvk: Implement nvk_GetDescriptorSetLayoutSupport 5204- nvk: Enable VK_KHR_maintenance3 5205- nvk: Advertise VK_EXT_mutable_descriptor_type 5206- nvk: Set image index to zero for NULL nvk_buffer_view 5207- nvk: Advertise VK_EXT_image_robustness 5208- nvk: Advertise VK_EXT_robustness2 5209- nvk: Add view_index to root descriptor table 5210- nvk: Lower nir_intrinsic_load_view_index 5211- nvk: Add draw support for multiview 5212- nvk: Add query support for multiview 5213- nvk: Add input attachments support for multiview 5214- nvk: Advertise VK_KHR_multiview 5215- nvk: Load view_mask to shadow scratch in nvk_CmdBeginRendering 5216- nvk: Combine CLEAR_VIEWS and CLEAR_LAYERS MME macros 5217- nvk: Move code inside view mask loops to a helper function 5218 5219Rhys Perry (89): 5220 5221- ac/llvm: fix AC_TM_CHECK_IR 5222- radv: fix radv_get_ballot_bit_size with CS 5223- ac/llvm: fix wave32 ac_build_mbcnt_add with 64-bit mask 5224- ac/llvm: skip ballot zext for 32-bit dest with wave32-as-wave64 5225- radv: add conformant_trunc_coord to cache UUID 5226- radv: don't unset TRUNC_COORD if conformant_trunc_coord=true 5227- ac/nir: always round cube array layers 5228- nir/unsigned_upper_bound: fix phi(bcsel) 5229- nir/tests: add test for unsigned_upper_bound with loop header phis 5230- nir/opt_dead_cf: remove nodes after a jump earlier 5231- nir/tests: add nir_opt_dead_cf_test.jump_before_constant_if 5232- aco: insert s_nop before VGPR deallocation 5233- nir/lower_shader_calls: vectorize stack access for all shaders 5234- radv: workaround WWZ exporting index=1 through location=1 5235- radv: correctly skip MRT output NaN fixup for meta shaders 5236- radv: don't set vertex_attribute_strides on GFX8+ 5237- radv/ci: skip some mesh shader tests on GFX1100 5238- aco: summarize register demand after handling branches 5239- aco: don't create sendmsg(dealloc_vgprs) if scratch is used 5240- radv: disable 64-bit color attachments 5241- radv: fix 128bpp comp-to-single clears 5242- radv: support 128bpp comp-to-single with all colors 5243- radv/gfx11: re-enable 0001/1110 clear values 5244- nir/lower_shader_calls: fix align_offset 5245- nir/opt_load_store_vectorize: support scratch access 5246- radv: vectorize RT stack access 5247- radv: vectorize scratch access 5248- aco: fix p_bpermute_gfx6 with input at non-zero byte 5249- aco: fix p_bpermute_gfx6's exec save/restore with wave32 5250- aco: clarify bpermute pseudo opcode names 5251- aco: add adjust_bpermute_dst helper 5252- aco/spill: skip p_branch in process_block 5253- aco/spill: add all live-in to merge block spill candidates 5254- nir/lower_system_values change num_workgroups to uint32_t 5255- radv: optimize mesh workgroup ID using ts_mesh_dispatch_dimensions 5256- radv: use shortcut_1d_workgroup_id 5257- aco: remove fast path in insert_exec_mask's process_instructions 5258- aco/optimizer_postRA: check overwritten_subdword in is_overwritten_since() 5259- aco: check logical_phi_info at p_logical_end when eliminating exec writes 5260- aco: remove unused p_logical_end check when optimizing branching sequence 5261- radv: disable mesh dispatch XYZ_DIM when possible 5262- nir/deref: remove rematerialize_deref_in_block cache 5263- aco: reset prefetch in the correct block after removing the exit 5264- aco/waitcnt: replace wait_cnt::\*_cnt with booleans 5265- aco/waitcnt: add print helpers 5266- nir/lower_int64: fix find_lsb(0) 5267- nir/algebraic: optimize u2u32(a >> 32) 5268- aco/optimizer_postRA: don't combine DPP across exec on GFX8/9 5269- aco: don't combine DPP into v_cmpx 5270- aco: disable zero offset optimization for strict WQM coords 5271- nir/constant_folding: remove zero texel offset 5272- aco: remove zero offset optimization 5273- aco: shrink DPP8_instruction 5274- aco: add fetch_inactive field to DPP instructions 5275- nir: add fetch inactive index to quad_swizzle_amd/masked_swizzle_amd 5276- aco: disable FI for quad/masked swizzle 5277- aco: fix LdsDirectVMEMHazard WaW with the wrong waitcnt 5278- aco: only mitigate VcmpxExecWARHazard when necessary 5279- aco: fix s_setreg hazards 5280- aco: consider exec_hi in reads_exec() 5281- aco: resolve all possible hazards at the end of shader parts 5282- aco/tests: test that hazards are resolved at the end of shader parts 5283- radv: skip zero-sized memcpy 5284- ac/nir: fix out-of-bounds access in ac_nir_export_position 5285- radv: fix signed integer overflow 5286- Revert "radv: pre-init surface info" 5287- nir: improve ms_cross_invocation_output_access with local_invocation_id 5288- aco,nir: add export_row_amd intrinsic 5289- ac/nir: add row parameter to helpers 5290- ac/nir: remove dead code 5291- ac/nir: refactor mesh vertex/primitive export 5292- ac/nir: implement mesh shader gs_fast_launch=2 5293- ac/nir: optimize mesh shader local_invocation_index 5294- radv: implement mesh shader gs_fast_launch=2 5295- ac/nir: add emit_ms_outputs helper 5296- ac/nir,radv: pass workgroup size to ac_nir_lower_ngg_ms 5297- ac/nir: implement mesh shader multi-row export 5298- radv: implement mesh shader multi-row export 5299- radv: enable mesh shader gs_fast_launch=2 and multi-row export 5300- nir/serialize: fix signed integer overflow 5301- nir/lower_shader_calls: skip zero-sized qsort 5302- util: skip zero-sized SHA1Update 5303- radv: call lower_array_deref_of_vec before lower_io_arrays_to_elements 5304- radv: skip radv_remove_varyings for mesh shaders 5305- radv: disable gs_fast_launch=2 by default 5306- docs: fix RADV_THREAD_TRACE_CACHE_COUNTERS default 5307- radv: add radv_disable_trunc_coord option 5308- radv: enable radv_disable_trunc_coord for vkd3d-proton/DXVK 5309- ac/nir: fix partial mesh shader output writes on GFX11 5310 5311Rob Clark (60): 5312 5313- freedreno: move virtgpu msm_proto.h to common 5314- freedreno/drm/virtio: Remove unused header 5315- tu/msm: staticify a couple things 5316- tu/knl: Remove some random const'ness 5317- drm-uapi: Update virtgpu header 5318- freedreno: Update virtgpu proto 5319- freedreno/drm/virtio: Use global_faults 5320- tu: close submitqueues before device_finish() 5321- tu/drm: Factor out shared helpers 5322- tu/drm: Add missing error path cleanup 5323- tu/drm: Split out helper for iova alloc 5324- tu: Add virtgpu support 5325- util: Decouple disk cache from EGL_ANDROID_blob_cache 5326- docs: Followup to !24636 5327- tu: Workaround bionic _SC_LEVEL1_DCACHE_LINESIZE 5328- ir3+tu: Simplify ir3_find_sysval_regid callers 5329- freedreno/a6xx: Drop unused screen args 5330- freedreno/a6xx: Re-work fd6_emit_shader 5331- freedreno/a6xx: Re-write the function-of-doom 5332- freedreno: Implement ATI_meminfo 5333- freedreno/a6xx: ARB_post_depth_coverage 5334- freedreno/a6xx: ARB_sample_locations 5335- freedreno/a6xx: ARB_texture_filter_minmax 5336- freedreno/a6xx: EXT_demote_to_helper_invocation 5337- freedreno/a6xx: EXT_shader_image_load_formatted 5338- freedreno/a6xx: EXT_depth_bounds_test 5339- freedreno/a6xx: Use pipe_blit_info::sample0_only 5340- freedreno/a6xx: Handle PIPE_BIND_BLENDABLE 5341- freedreno/a6xx: ARB_shader_viewport_layer_array 5342- tu: Fix heap size 5343- freedreno: Fix crash with debug msgs enabled 5344- freedreno/layout: Handle 565/etc MSAA special case 5345- freedreno/decode: Fix printing chip-id 5346- freedreno/a6xx: Add L8_SRGB 5347- freedreno: Add reformatting commits to .git-blame-ignore-revs 5348- freedreno/fence: Hold a strong ref to batch 5349- freedreno/decode: Lookup device info 5350- freedreno/decode: Use info->chip to decode 5351- freedreno/decode: Remove gpu_id 5352- freedreno: Indentation fix 5353- freedreno: Use explicit QCOM_TILED3 modifier 5354- freedreno/a6xx: Remove dummy packet for globals 5355- freedreno: Fix streamout offset_buf dirtiness 5356- freedreno: Fix user const buffer dirtiness 5357- freedreno/batch: Move query_buf allocation 5358- freedreno: Add private-BO tracking 5359- freedreno: Add missing indirect_draw_count tracking 5360- freedreno: Move/add some attach_bo() 5361- freedreno: Add attach-bo debugging 5362- freedreno: Rework supported-modifiers handling 5363- mesa: Introduce MESA_texture_const_bandwidth 5364- mesa: Implement MESA_texture_const_bandwidth 5365- freedreno: Add PIPE_CAP_HAS_CONST_BW support 5366- panfrost: Add PIPE_CAP_HAS_CONST_BW support 5367- iris: Add PIPE_CAP_HAS_CONST_BW support 5368- radeonsi: Add PIPE_CAP_HAS_CONST_BW support 5369- tu/msm: Fix timeline semaphore support 5370- tu/virtio: Fix timeline semaphore support 5371- freedreno/drm: Fix race in zombie import 5372- freedreno: Always attach bo to submit 5373 5374Robert Foss (9): 5375 5376- egl: Expose access to DeviceList 5377- egl: Rename _eglRefreshDeviceList() to _eglDeviceRefreshList() 5378- egl: Refresh DeviceList during eglInitialize() 5379- egl/surfaceless: Use EGL DeviceList instead of drmGetDevices2() 5380- egl/android: Use EGL DeviceList instead drmGetDevices2() 5381- egl: Rename _eglAddDevice() to _eglFindDevice() 5382- egl: Rename _eglAddDevice() to _eglFindDevice() 5383- egl: Fix attrib_list[0] == EGL_NONE check 5384- egl: Always set _EGLDisplay->Device during eglGetPlatformDisplay() 5385 5386Robert Mader (6): 5387 5388- egl/wayland: wait for compositor to release shm buffers 5389- iris: Support parameter queries for main planes 5390- util: Add new helpers for pipe resources 5391- panfrost: Support parameter queries for main planes 5392- vc4/resource: Support offset query for multi-planar planes 5393- v3d/resource: Support offset query for multi-planar planes 5394 5395Rohan Garg (33): 5396 5397- iris: migrate WA 14013910100 to use the WA framework 5398- iris: migrate WA 14016118574 to use the WA framework 5399- iris: fix iris for WA 16013000631 5400- intel/perf: add perf query support for Intel Raptorlake 5401- intel/genxml: set a default value for "Pixel Position Offset Enable" in genxml 5402- anv: use the WA infrastructure where possible when generating state 5403- anv: use the correct GFX_VERx10 macro for WA 5404- anv,iris: program the maximum number of threads on compute queue init 5405- anv: drop CFE state validation checks 5406- iris: track reset signalling instead of replacing the context 5407- iris: allow for a unsynchronized device reset query 5408- anv: partially revert 2e8b1f6d 5409- anv: emitting 3DSTATE_PRIMITIVE_REPLICATION is required on Gen12+ 5410- anv: use the pre defined _3DPRIMITIVE_DIRECT macro 5411- anv: drop dead ifdef 5412- iris: use the correct WA macros and lineage numbers 5413- anv: use the lineage number for WA 5414- crocus: add a __gen_get_batch_address declaration 5415- crocus: fix GFX_VERx10 macro 5416- blorp: drop undefined macro 5417- iris: migrate preemption streamwout wa to WA infra 5418- intel/genxml: update PIPE_CONTROL instruction for dg2 5419- anv: define clear color localy within can_fast_clear_color_att 5420- intel/compiler: Adjust CS payload registers for new register width on Xe2+ 5421- intel/compiler: Adjust fence message lengths for new register width on Xe2+ 5422- intel/compiler: Adjust barrier emission for Xe2+ 5423- intel/genxml: fix 3DSTATE_3D_MODE length to align with BSpec 5424- anv: ensure that FCV_CCS_E fast clears are properly tracked 5425- anv: enable FCV for Gen12.5 5426- anv: fix debug string for PC flush 5427- anv: cleanup includes 5428- anv: turn off non zero fast clears for CCS_E 5429- anv: selectively enable FCV optimization for DG2 5430 5431Roland Scheidegger (1): 5432 5433- lavapipe: further limit accurate_a0 hack 5434 5435Roman Stratiienko (22): 5436 5437- egl: android: Remove legacy name-based shared buffers support 5438- util: Add NONNULL macro 5439- android: Introduce the Android buffer info abstraction 5440- android: Fix num_planes assignment in u_gralloc_fallback 5441- v3dv/android: Use u_gralloc code 5442- v3dv/android: Enable shared presentable image support 5443- v3dv: Migrate to vk_device_memory 5444- v3dv/android: Skip swapchain binding 5445- v3dv: Rely on the internal tiled flag instead of the common vk structure 5446- v3dv/android: Add a helper function to support explicit layouts 5447- v3dv/android: Rework Android native buffer importing logic 5448- v3dv: Use format stored in vk_image and vk_image_view after init 5449- v3dv: Split v3dv_image_init to use layout setting logic separately 5450- v3dv/android: Add AHardwareBuffer support 5451- v3dv: Enable VK API v1.2 for Android 5452- panvk: Add Android ICD loader entry point 5453- u_gralloc: Remove inline modifiers from the functions 5454- u_gralloc: Remove usage of NONNULL macro 5455- Revert "util: Add NONNULL macro" 5456- u_gralloc: Add a function that returns gralloc type 5457- dri: Remove __driDriverExtensions leftovers 5458- v3d: Don't implicitly clear the content of the imported buffer 5459 5460Ruijing Dong (2): 5461 5462- frontends/va: checking va version for av1enc support 5463- radeonsi/vcn: change max_poc to fixed value for hevc encoder. 5464 5465Ryan Neph (1): 5466 5467- vulkan/android: add missed STACK_ARRAY_FINISH() 5468 5469Sagar Ghuge (34): 5470 5471- intel/compiler: Look at 2 register worth of data instead of 4 5472- isl: Disable MCS compression just on ACM platform 5473- intel: Add env variable to add break point on/before draw 5474- anv: Add GPU breakpoint before/after specific draw call 5475- iris: Add GPU breakpoint before/after draw call 5476- blorp: Implement blorp hooks to emit breakpoint 5477- docs: Add INTEL_DEBUG_BKP_BEFORE/AFTER_DRAW_COUNT 5478- intel/isl: Enable INTEL_DEBUG=noccs/nohiz in ISL helpers 5479- anv,hasvk: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks 5480- iris,crocus: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks 5481- blorp: Drop unnecessary assertions in blorp_can_hiz_clear_depth 5482- anv: Add helper to create companion RCS command buffer 5483- anv: Split out End/Destroy/Reset cmd buffer code into helper 5484- anv: Handle companion RCS in end/destory/reset code path 5485- intel: Add helper to create/destroy i915 VM 5486- intel: Pass virtual memory address space ID while creating context 5487- anv: Create companion RCS engine 5488- anv: Move compute specfic bits under compute queue init 5489- anv: Execute RCS init batch on companion RCS context/engine 5490- anv: Setup companion RCS command buffer submission 5491- anv: Execute an empty batch to sync main and companion RCS batch 5492- anv: Add secondary companion RCS cmd buffer to primary 5493- anv: Skip layout transition on the compute queue 5494- anv: Extract batch print code to anv_print_batch helper 5495- iris: Enable always flush cache with DEBUG_STALL option 5496- intel/genxml: Add STATE_COMPUTE_MODE instruction 5497- anv: Program and emit STATE_COMPUTE_MODE 5498- anv: Enable barrier handling on video engines 5499- isl: Use 16-bit instead of 8-bits for surface format info fields 5500- anv: Handle end of pipe with MI_FLUSH_DW on transfer queue 5501- anv: Enable transfer queue only on ACM+ platforms 5502- blorp: Use the correct miptail start LOD for surfaces 5503- anv: Write timestamp using MI_FLUSH_DW on blitter 5504- anv: Flush data cache while clearing depth using HIZ_CCS_WT 5505 5506Saleemkhan Jamadar (1): 5507 5508- radeonsi/vcn: set jpeg reg version for gfx 1150 5509 5510Samuel Holland (3): 5511 5512- Android.mk: Allow building only Vulkan drivers 5513- Android.mk: Explicitly enable/disable LLVM support 5514- Android.mk: Only link LLVM for radeonsi, not amd_vk 5515 5516Samuel Pitoiset (299): 5517 5518- radv: remove support for VK_INDIRECT_COMMANDS_TOKEN_TYPE_STATE_FLAGS_NV 5519- radv: make radv_get_pa_su_sc_mode_cntl() static 5520- zink/ci: update list of expected failures for NAVI10 5521- radv: stop using a pipeline for emitting VGT_VERTEX_REUSE_BLOCK_CNTL 5522- radv: remove unused param in radv_pipeline_emit_vgt_gs_out() 5523- radv: pass a shaders array for computing ia_multi_vgt_param 5524- radv: bind the pre-compiled PS epilog to the cmdbuf state 5525- radv: stop using an array of binaries when compiling a compute shader 5526- radv: add radv_compile_cs() to compile a compute shader 5527- radv: remove the pipeline dependency for creating a GS copy shader 5528- radv: add a helper to compute the ESGS itemsize 5529- radv: use the number of GS linked inputs to compute the ESGS itemsize 5530- radv: determine ES info for VS/TES with GS earlier 5531- radv: determine as_ls earlier by using the next stage 5532- radv: simplify getting next VS stage for VS prologs 5533- radv: use next_stage for determining the stage to lower NGG 5534- radv/amdgpu: fix dumping CS with the chained IBs path 5535- radv/amdgpu: rename old_ib to ib in radv_amdgpu_winsys_cs_dump() 5536- radv: pass submit info to radv_check_gpu_hangs() 5537- radv: initialize stage/next_stage earlier 5538- radv: set next_stage to MESA_SHADER_NONE if there is no FS 5539- radv: rework considering force VRS without relying on graphics pipeline 5540- radv: stop passing radv_graphics_pipeline to radv_fill_shader_info() 5541- radv: move removing all varyings when the FS is a noop 5542- radv: rename graphics pipeline linking helpers 5543- radv: simplify lowering NGG GS intrinsics 5544- radv: rework determining the NGG stage without a graphics pipeline 5545- radv: cleanup pipeline compute emit helpers 5546- radv: rename radv_pipeline_stage to radv_shader_stage 5547- radv: rename NGG query state to be more generic 5548- radv: declare the shader query user SGPR for emulating GS counters 5549- radv: enable pipelinestat query emulation for legacy GS 5550- radv: simplify the NGG vs legacy pipelinestat query path 5551- radv: rename RADV_SHADER_QUERY_PIPELINE_STAT_OFFSET 5552- radv: implement nir_intrinsic_atomic_add_gs_invocation_count_amd 5553- radv: emulate GEOMETRY_SHADER_INVOCATIONS query on RDNA1-2 5554- radv: track whether inputs/outputs are linked per shader stage 5555- radv: add support for VS/TES as ES without shaders IO linking 5556- radv: use next_stage to determine if the layer should be exported 5557- radv: use next stage to determine if primID/clip dist should be exported 5558- radv: compute the legacy GS info earlier 5559- radv: stop copying some NIR info fields from TES to TCS 5560- radv: stop lowering patch vertices for TES 5561- radv: do not always copy the number of tess patches to TES 5562- radv: initialize tcs.tes_{patch}_inputs_read to a default value 5563- radv: prevent linking TCS<->TES when TES is NULL 5564- radv: use a packed user SGPR for the TES state 5565- radv: stop checking if patch control points is dynamic everywhere 5566- radv: copy the number of TCS vertices out to TES shader info 5567- radv: add support for dynamic TCS vertices out for TES 5568- radv: remove radv_shader_info::tes::num_linked_patch_inputs 5569- amd,radeonsi: move si_shader_io_get_unique_index_patch() to common code 5570- radv: allow to use fixed IO locations for VS<->TCS<->TES without linking 5571- aco: add aco_shader_info::tcs::has_epilog 5572- aco: add infra for compiling TCS epilogs 5573- radv,aco: move has_epilog to radv_shader_info 5574- radv: assume a TCS needs an epilog unless it's linked with a TES 5575- radv: do not write tess factors in main TCS when it has an epilog 5576- radv: track if TES reads tess factors differently 5577- radv: declare new argument for the TCS epilog PC 5578- radv: add radv_tcs_epilog_key 5579- radv: add infra for creating TCS epilogs 5580- radv: add support for a TCS epilogs cache in the device 5581- radv: add support for emitting TCS epilogs in cmdbuf 5582- radv: remove unnecessary check in radv_pipeline_nir_to_asm() 5583- radv: stop passing a graphics pipeline to radv_pipeline_nir_to_asm() 5584- radv: inline radv_pipeline_get_nir() in radv_graphics_pipeline_compile() 5585- radv: add a struct for the retained shaders and GPL 5586- radv: add radv_graphics_shaders_compile() to compile graphics shaders 5587- radv: remove redundant check in radv_cmd_buffer_after_draw() 5588- radv: track if patch control points is dynamic from the cmdbuf state 5589- radv: re-emit binning state if the framebuffer is dirty 5590- radv: track if vertex binding stride is dynamic from the cmdbuf state 5591- vulkan: bump header register to 1.3.261 5592- vulkan/runtime: add common implementation for GetImageSubresourceLayout() 5593- vulkan/format: add VK_FORMAT_{A8_UNORM,A1B5G5R5_UNORM_PACK16}_KHR 5594- radv: use the RT prolog scratch size directly for tracing rays 5595- radv: add a helper to get the maximum number of scratch waves per shader 5596- radv: update the number of scratch waves for RT prolog at bind time 5597- radv: update cmdbuf scratch size info when shaders are bound 5598- vulkan: add init/finish helpers for vk_buffer_view 5599- radv: use vk_buffer_view 5600- radv: use vk_sampler 5601- radv: use common vkCmdBegin/EndQuery wrappers 5602- radv: use vk_query 5603- zink: fix setting VkShaderCreateInfoEXT::nextStage 5604- radv/rt: fix capture/replay support 5605- vulkan/render_pass: add common vkGetRenderingAreaGranularityKHR() 5606- radv: implement vkCmdBindIndexBuffer2KHR() 5607- radv: allow VK_WHOLE_SIZE for pSizes in vkCmdBindVertexBuffers2() 5608- radv/rmv: remove unused pipeline create flags when logging pipelines 5609- radv: store pipeline create flags to radv_pipeline::create_flags 5610- radv: add support for VkPipelineCreateFlags2CreateInfoKHR 5611- radv: add support for VkBufferUsageFlags2CreateInfoKHR 5612- radv: allow VK_REMAINING_ARRAY_LAYERS with VkImageSubresourceLayers 5613- radv: implement radv_Get{Device}ImageSubresourceLayout2KHR() 5614- radv: advertise VK_KHR_maintenance5 5615- radv: remove useless NULL for pipeline layout during shader info pass 5616- radv: introduce radv_shader_layout for per-stage descriptor layout 5617- radv: stop passing redundant stage to radv_shader_stage_init() 5618- radv: re-introduce radv_pipeline_stage_init() 5619- radv: add support for loading the LSHS vertex stride from a SGPR 5620- radv: use the number of VS outputs for computing the tessellation info 5621- vulkan: ignore VkPipelineColorWriteCreateInfoEXT if the state is dynamic 5622- radv: reduce TCS_OFFCHIP_LAYOUT_NUM_PATCHES to 6-bits 5623- radv: add missing comment about TCS_OFFCHIP_LAYOUT_LSHS_VERTEX_STRIDE 5624- radv: fix emitting TCS epilogs for GFX6-9 5625- radv: remove radv_cmd_buffer::cached_vertex_formats 5626- radv: remove unused param from radv_pipeline_init_multisample_state() 5627- radv: simplify declaring VS specific input SGPRs 5628- radv: stop copying if VS or TES uses the InvocationID built-in 5629- Revert "radv/amdgpu: workaround a kernel bug when replacing sparse mappings" 5630- Revert "radv/amdgpu: skip adding per VM BOs for sparse during CS BO list build" 5631- radv/amdgpu: allow to execute external IBs on the compute queue 5632- radv/amdgpu: add support for submitting external IBs with the chained path 5633- zink/ci: update list of expected failures for NAVI10 5634- radv: use the maximum possible workgroup size for TCS epilogs 5635- radv: stop declaring the scratch offset argument for TCS epilogs 5636- radv: declare shader arguments for TCS epilogs 5637- radv: add tcs_out_patch_fits_subgroup to radv_tcs_epilog_key 5638- aco: fix jumping from main TCS to epilog on GFX9+ 5639- aco: adjust TCS epilogs for RADV 5640- aco: allow SGPRs operands with p_jump_to_epilog 5641- aco: implement create_tcs_jump_to_epilog() 5642- radv: track the pipeline bind point for indirect commands layout 5643- radv: prepare radv_get_sequence_size() for DGC compute 5644- radv: prepare radv_prepare_dgc() for DGC compute 5645- radv: implement NV_device_generated_commands_compute 5646- radv: allow DGC on the compute queue 5647- radv: advertise NV_device_generated_commands_compute 5648- aco: rework printing shader stages 5649- radv: fix the per-patch data offset when TES isn't linked with TCS 5650- radv: stop declaring unused SGPR arguments for PS epilogs 5651- radv: add radv_shader_info::is_monolithic 5652- radv: use info->uses_view_index directly when declaring shader arguments 5653- radv: do not inline push constants for non-monolithic shaders 5654- radv: force indirect descriptor sets for non-monolithic shaders 5655- radv: always declare some arguments for non-monolithic VS/TCS shaders 5656- radv: add a new shader argument for non-monolithic shaders PC 5657- ac: allow to mark shader arguments as preserved 5658- radv: preserve shader arguments for non-monolithic VS/TCS on GFX9+ 5659- aco: disable shared VGPRs for non-monolithic shaders on GFX9+ 5660- aco: ensure to initialize exec manually for VS as LS on GFX9+ 5661- aco: add support for compiling VS+TCS separately on GFX9+ 5662- radv: always declare some arguments for non-monolithic {VS,TES}/GS shaders 5663- radv: preserve shader arguments for non-monolithic {VS,TES}/GS on GFX9+ 5664- aco: ensure to initialize exec manually for non-monolithic {VS,TES}/GS on GFX9+ 5665- aco: add support for compiling {VS,TES}+GS separately on GFX9+ 5666- radv,aco: remove unused clip/cull distances variables 5667- radv: rename tcs_shader to tcs in radv_emit_tcs_epilog_state() 5668- radv: small cleanups in radv_emit_patch_control_points() 5669- radv: fix emitting TCS epilogs if TES and GS are linked on GFX9+ 5670- radv: remove the pipeline dependency for emitting VGT_GS_MODE 5671- aco: fix emitting TCS epilogs end on GFX9+ 5672- radv: re-order IO slot layout for stages that aren't linked 5673- amd/ci: update list of failures/flakes for glcts-vangogh-valve 5674- ci: uprev vkd3d-proton 5675- ci: uprev Fossilize 5676- ci: add comment explaining which image tags to update for Fossilize 5677- radv: preserve shader argument for separate compilation of NGG shaders 5678- aco: flag blocks with long-jump as export_end for separate compilation 5679- aco: adjust fix_exports() for VS/TES as NGG and non-monolithic shaders 5680- aco: allow separate compilation of NGG shaders 5681- zink/ci: add zink-radv-polaris10-valve 5682- radv/ci: re-enable vkcts-polaris10-valve 5683- radv: fix capturing indirect dispatches with SQTT 5684- radv/ci: re-enable vkd3d-polaris10-valve 5685- ci: do not fail vkd3d-proton job when the expectations match 5686- radv/amdgpu: fix executing secondaries without IB2 5687- radv/amdgpu: do not copy the original chain link for IBs 5688- radv: avoid emitting SQTT markers for DGC calls 5689- radv: add support for DGC with SQTT 5690- zink/ci: merge GLCTS testing with GLESx for RADV 5691- zink/ci: merge piglit testing with deqp-runner for RADV 5692- radv: fix interactions with primitives generated queries and pipeline stats 5693- radv: skip DGC calls when the indirect sequence count is zero with a predicate 5694- radv: avoid emitting THREAD_TRACE_MARKER for predicated draws/dispatches 5695- radv: adjust next stage for VS prologs and merged shaders compiled separately 5696- radv: adjust emitted prolog regs for merged shaders compiled separately 5697- radv: do not use pre-compiled prologs when VS is compiled separately 5698- radv: remove useless PIPELINE_CREATE_2_LIBRARY_BIT check for retained shaders 5699- radv: fix enabling DGCC 5700- radv: fix emitting SQTT userdata when CAM is needed 5701- radv: fix capturing RGP on RDNA3 with more than one Shader Engine 5702- zink/ci: update list of expected failures for POLARIS10/NAVI10 5703- radv: set THREAD_TRACE_TOKEN_MASK.BOP_EVENTS_TOKEN_INCLUDE on GFX10.3+ 5704- radv: disable unsupported hw shader stages for RGP on GFX11+ 5705- radv: fix instruction timing on GFX11 5706- ac/rgp: use correct API stage string for mesh/task shaders 5707- radv: set THREAD_TRACE_MARKER_ENABLE for mesh/task draws 5708- radv: emit relocation for mesh/task shaders 5709- issue_templates/Bug Report: fix outdated URL for GFXReconstruct 5710- ac,radv,radeonsi: rework SPM counters configuration and share it 5711- ac/perfcounter: add new SQ_WGP block for GFX11+ 5712- ac/spm: add SPM counters configuration for GFX11 5713- radv: enable the PKT3 CAM bit for some SPM register writes 5714- radv,radeonsi: use AC_SPM_SEGMENT_TYPE_xxx instead of magic values 5715- ac/spm: remove useless SPM block setting for GFX9 and older GPUs 5716- ac/spm: add SPM block definition for GFX10-GFX10.3 5717- ac/gpu_info: init num_cu_per_sh from the kernel 5718- ac/perfcounter: set the number of instances of GL1C to 4 5719- ac/perfcounter: compute the number of global instances of TCP,SQ,GL1C and GL2C 5720- ac/spm: fix checking if the counter instance is valid 5721- ac/spm: rework how segment muxsel RAM are filled 5722- ac/spm: initialize and set instance mapping for counters 5723- radv: reserve more CS space in SQTT/SPM paths 5724- ac/spm: use block flags to initialize instance mapping 5725- ac/spm: select correct segment type for per-SE blocks 5726- radv,radeonsi: make sure to emit GRBM_GFX_INDEX before SQ select registers 5727- ac/spm: fix number of instances of GL2C 5728- ac,radv,radeonsi: prepare support for multi-instance SPM SQ counters 5729- ac,radv,radeonsi: prepare support for multi-instance SPM generic counters 5730- ac/spm: move the counter instance to ac_spm_counter_create_info 5731- ac/spm: enable support for multi-instance counters 5732- radv: fix checking if RGP is enabled with others tracing tools 5733- radv: fix missing ISA with RGP and GPL 5734- ac/perfcounter: add SG_WQP group for GFX11 5735- ac/perfcounter: add GFX11 groups 5736- drirc: remove Path of Exile workarounds 5737- radv: remove drirc workarounds for Path Of Exile 5738- radv: remove absolute_depth_bias workaround 5739- ac/gpu_info: define AMD_MAX_WGP 5740- ac/spm: add new segment types for GFX11 5741- ac/spm: add support for GFX11 5742- radv: add SPM support for GFX11 5743- radv: enable cache counters for RGP on GFX11 5744- ci: update to vulkan-cts-1.3.6.3 5745- radv/ci: skip dEQP-VK.robustness.* on Vangogh due to weird GPU hangs 5746- nir: rename atomic_add_gs_invocation_count_amd to make it more generic 5747- ac/nir: add lowering for mesh shader queries 5748- ac/nir: add lowering for task shader queries 5749- radv: add GDS counters offset for mesh/task queries 5750- radv: adjust lowering of intrinsic queries for mesh/task shaders 5751- radv: enable lowering of mesh/task shader queries when enabled 5752- radv: declare shader_query_state for mesh/task shaders 5753- radv: stop skip emitting CB states when there is no color attachment 5754- radv: re-enable DCC with mipmaps on GFX11 5755- radv: fix COMPUTE_SHADER_INVOCATIONS query on compute queue 5756- radv: emit missing PA_{SC,SU}_LINE_STIPPLE_xxx regs in gfx preamble 5757- radv: fix alignment of DGC command buffers 5758- radv/ci: update list of expected failures on PITCAIRN 5759- radv/ci: update list of flakes for NAVI10/VEGA10 5760- radv/amdgpu: fix alignment of command buffers 5761- radv: enable DCC for MSAA images on GFX11 5762- zink/ci: update list of expectations for zink-anv-tgl 5763- zink/ci: bump zink-anv-tgl-full timeout to 1h45m 5764- radv/ci: rename GFX1100 lists to NAVI31 5765- radv: fix emulated geometry shader primitives/invocations queries 5766- radv/ci: remove duplicate skipped tests for RAVEN/STONEY 5767- radv/ci: exclude dEQP-VK.texture.explicit_lod.2d.sizes.128x128_* for all jobs 5768- radv: fix synchronization with emulated GS primitives/invocations queries 5769- radv/ci: remove no longer existing test for VANGOGH 5770- radv/ci: cleanup list of expected failures for NAVI10/NAVI21/VEGA10 5771- radv: always write the sample positions when a new descriptor BO is created 5772- radv: fill the scratch BO in radv_fill_shader_rings() 5773- radv: fix gang submissions with chaining 5774- radv: fix re-emitting streamout descriptors for NGG streamout 5775- radv: fix IB alignment 5776- zink: use warn_missing_feature for missing modifier support 5777- radv: fix destroying GDS/OA BOs 5778- radv: allocate only 1 GDS OA counter for gfx10 NGG streamout 5779- ac/nir: only consider overflow for valid feedback buffers 5780- radv/ci: update list of expected failures on RAVEN 5781- radv/ci: update list of flakes for VANGOGH 5782- radv/ci: update list of flakes for STONEY 5783- radv: disable primitive restart for non-indexed draws on GFX11 5784- radv: enable radv_disable_aniso_single_level=true for Zink too 5785- amd/llvm,aco,radv: implement NGG streamout with GDS_STRMOUT registers on GFX11 5786- radv: mark GDS as needed for XFB queries with NGG streamout on GFX11 5787- radv: skip GDS allocation for NGG streamout on GFX11 5788- zink/ci: remove expected failures that are skipped for RADV 5789- ci: update CTS to vulkan-cts-1.3.7.0 5790- ci: bump the number of tests per group from 500 to 5000 for Vulkan drivers 5791- ci: bump DEQP_FRACTION for some jobs 5792- radv: set ENABLE_PING_PONG_BIN_ORDER for GFX11.5 5793- radv: initialize video decoder for GFX11.5 5794- ac/gpu_info: query the maximum number of IBs per submit from the kernel 5795- Revert "radv: fix finding shaders by PC" 5796- radv: fix missing predicate bit for WRITE_DATA helper 5797- ac/gpu_info: fix querying the maximum number of IBs per ring 5798- radv: remove outdated RADV_DEBUG=vmfaults support 5799- amd: update amdgpu_drm.h 5800- amd: add has_gpuvm_fault_query 5801- radv/amdgpu: add support quering the last GPUVM fault 5802- radv: query and report the last GPUVM fault with RADV_DEBUG=hang 5803- radv: report the last GPUVM fault when a device lost is detected 5804- ac/gpu_info: remove bogus assertion about number of COMPUTE/SDMA queues 5805- radv: fix a synchronization issue with primitives generated query on RDNA1-2 5806- radv: bind the non-dynamic graphics state from the pipeline unconditionally 5807- radv: fix compute shader invocations query on compute queue on GFX6 5808- radv: emit COMPUTE_PIPELINESTAT_ENABLE for CS invocations on ACE 5809- nir: fix inserting the break instruction for partial loop unrolling 5810- radv: fix registering queues for RGP with compute only 5811- radv: set radv_zero_vram=true for Unreal Engine 4/5 5812- radv: fix a descriptor leak with debug names and host base descriptor set 5813- radv: add a missing async compute workaround for Tonga/Iceland 5814- radv: disable TC-compatible HTILE on Tonga and Iceland 5815- radv: set radv_invariant_geom=true for War Thunder 5816- radv: do not set OREO_MODE to fix rare corruption on GFX11 5817 5818Saroj Kumar (4): 5819 5820- radeonsi: Add perfetto support in radeonsi 5821- radeonsi: Add u_trace init code in radeonsi 5822- radeonsi: Add tracepoints in radeonsi driver 5823- radeonsi: fixes compilaton error when perfetto is disabled 5824 5825Sathishkumar S (2): 5826 5827- radeonsi/vcn: support variable number of bs_bufs 5828- radeonsi/vcn: num bs_bufs must be proportional to num jpeg engines 5829 5830Semjon Kravtsenko (1): 5831 5832- glx: Assign unique serial number to GLXBadFBConfig error 5833 5834Seppo Yli-Olli (1): 5835 5836- zink: Fix SyntaxWarning in zink_extensions script 5837 5838Sergi Blanch Torne (7): 5839 5840- Introduce ci-kdl builder and launcher. 5841- Integrate ci-kdl in the building process and launch process. 5842- ci: disable Collabora's LAVA lab for maintance 5843- Revert "ci: disable Collabora's LAVA lab for maintance" 5844- Revert "ci: disable Collabora's LAVA lab for maintance" 5845- ci: disable Collabora's LAVA lab for maintance 5846- Revert "ci: disable Collabora's LAVA lab for maintance" 5847 5848Sid Pranjale (1): 5849 5850- nvk: Enable VK_EXT_load_store_op_none 5851 5852Sil Vilerino (20): 5853 5854- util: Blake3 - Identify arm64ec as aarch64 instead of x64 5855- d3d12: Fix Map/Unmap of YUV resources 5856- d3d12: Fix H264 interlaced decode 5857- d3d12: Video Decode - Remove unnecessary copy for texture array case 5858- util/vl_vlc: Use UINT64_MAX instead of ~0UL with MSVC compiler 5859- d3d12: Extend video screen AV1 encode tile support checking 5860- aux/tc: Add ASSERTED to unreferenced release build variable 5861- d3d12: Video - Relax ID3D12VideoDevice QI version for decode, process 5862- frontends/va: Add profile param when querying PIPE_VIDEO_CAP_ENC_QUALITY_LEVEL 5863- d3d12: Upgrade to D3D12 Agility SDK 1.611 Video interface 5864- d3d12: Fixes AV1 tx_mode_support reporting and unsupported tx_mode overriding 5865- d3d12: Video Decode - Wait for GPU completion before destroying decoder in-flight objects 5866- d3d12: Do not destroy codec when destroying video buffer 5867- d3d12: AV1 encode - Add lower resolution fallback check for uniform tile support 5868- d3d12: AV1 encode - add fallback for app passing unsupported pic_params.InterpolationFilter 5869- d3d12: AV1 Encode - Fix VAConfigAttribEncMaxRefFrames reporting 5870- frontend/va: Add support for VAConfigAttribEncMaxTileRows/Cols 5871- d3d12: Add support for PIPE_VIDEO_CAP_ENC_MAX_TILE_ROWS/COLS 5872- d3d12: Allocate d3d12_video_buffer with higher alignment for compatibility 5873- d3d12: d3d12_video_buffer_create_impl - Fix resource importing 5874 5875Simon Ser (7): 5876 5877- wayland: enable use of wayland-protocols as a subproject 5878- vulkan/wsi/wayland: add support for IMMEDIATE 5879- vulkan/wsi/wayland: fix unset present_mode 5880- radv/winsys: check amdgpu_create_bo_from_user_mem() for EINVAL 5881- egl: extract EGLDevice setup in dedicated function 5882- egl: move dri2_setup_device() after dri2_setup_extensions() 5883- egl: ensure a render node is passed to _eglFindDevice() 5884 5885Simon Zeni (1): 5886 5887- nouveau/winsys: use mmap instead of mmap64 in nouveau_bo 5888 5889SoroushIMG (1): 5890 5891- pvr: fix mipmap size calculation for bc formats 5892 5893Sviatoslav Peleshko (9): 5894 5895- dri: Use RGB internal formats for RGBX formats 5896- intel/isl: Don't over-allocate CLEAR_COLOR size to use whole cache line 5897- anv: Do fast clear color initialization more delicately 5898- zink: Change zink_vertex_elements_hw_state::b.strides to VkDeviceSize 5899- intel/fs: Check if the whole ubo load range is in the push const range 5900- zink: Store zink_vertex_elements_hw_state::b.strides by binding id 5901- intel/fs: Fix "packed word exception" condition for register regioning 5902- intel/eu/validate: Validate "packed word exception" stricter 5903- nir/loop_analyze: Fix inverted condition handling in iterations calculation 5904 5905Sylvain Munaut (9): 5906 5907- egl/dri2: Add a couple of missing mutex release in error path 5908- mesa: Enable ARB_texture_border_clamp in GL Core 5909- include: Fix the PFN declarations to be pointers as they should 5910- glx: Add missing MesaGLInteropGLXFlushObjects 5911- glx: Export the MESA GL Interop functions through glXGetProcAddress 5912- egl: Export the MESA GL Interop functions through eglGetProcAddress 5913- glx: Remove MESA_depth_float_bit from enum 5914- glx: Advertise GLX_MESA_gl_interop extension if support present 5915- egl: Advertise EGL_MESA_gl_interop extension if support present 5916 5917Tapani Pälli (34): 5918 5919- intel/blorp: add a new flag to communicate PSS sync need 5920- anv: implement required PSS sync for Wa_18019816803 5921- iris: implement required PSS sync for Wa_18019816803 5922- vulkan/runtime: change assert to match specification needs 5923- anv: remove assert, size is asserted in the runtime 5924- anv: refactor batch_set_preemption to use batch_emit_pipe_control 5925- anv: implement a dummy depth flush for Wa_14016712196 5926- iris: implement a dummy depth flush for Wa_14016712196 5927- mesa: fix some TexParameter and SamplerParameter cases 5928- mesa: remove GL_UNSIGNED_BYTE as supported for snorm reads 5929- ci: add a fix for KHR-GLES3.packed_pixels.*snorm tests 5930- anv: implement Wa_14018912822 5931- iris: implement Wa_14018912822 5932- driconf: use lower_depth_range_rate for The Spirit and The Mouse 5933- mesa: disable snorm readpix clamping with EXT_render_snorm 5934- iris: modify Wa_14014414195 to use intel_needs_workaround 5935- mesa: some cleanups for texparam extension checks 5936- iris: avoid issues with undefined clip distance 5937- crocus: avoid issues with undefined clip distance 5938- anv: refactor to fix pipe control debugging 5939- anv: fix a leak of fp64_nir shader 5940- iris: use intel_needs_workaround for Wa_14014414195 part 2 5941- iris: correct dst alpha blend factor in Wa_14018912822 5942- iris/anv: move Wa_14018912822 as a drirc workaround 5943- iris: flush data cache when flushing HDC on GFX < 12 5944- anv: HDC flush is available only for GFX_VER 12+ 5945- iris: HDC flush is available only for GFX_VER 12+ 5946- intel/genxml: remove HDC from gen11.xml, it is not available 5947- mesa/st: ignore StencilSampling if stencil not part of the format 5948- intel/dev: expand existing fix for all gfx12 with small EU count 5949- egl: fix leaking drmDevicePtr in _eglFindDevice 5950- iris: add data cache flush for pre hiz op 5951- anv/drirc: add option to disable FCV optimization 5952- drirc: Set limit_trig_input_range option for Valheim 5953 5954Tatsuyuki Ishi (8): 5955 5956- radv/amdgpu: Remove unused bo_list variable from cs_submit. 5957- radv/winsys: Remove unused struct radv_winsys_bo_list. 5958- radv/amdgpu: Do not pass in a BO handle when clearing PRT VA region. 5959- radv: Fix IB size for RADV_DEBUG=hang. 5960- radv: Fix dumping vertex descriptors with RADV_DEBUG=hang. 5961- radv/amdgpu: Use rwlock to protect access to virtual BOs. 5962- zink: Fix missing sparse buffer bind synchronization. 5963- zink: Fix waiting for texture commit semaphores. 5964 5965Thomas H.P. Andersen (65): 5966 5967- tgsi: remove unused tgsi_shader_info.num_tokens 5968- tgsi: remove unused tgsi_shader_info.array_max 5969- tgsi: remove unused tgsi_shader_info.num_memory_instructions 5970- tgsi: remove unused tgsi_shader_info.colors_read 5971- tgsi: remove unused tgsi_shader_info.colors_written 5972- tgsi: remove unused tgsi_shader_info.reads_position 5973- tgsi: remove unused tgsi_shader_info.reads_samplemask 5974- svga: remove unused struct field 5975- tgsi: remove unused tgsi_shader_info.reads_tess_factors 5976- tgsi: remove unused tgsi_shader_info fields 5977- tgsi: remove unused tgsi_shader_info fields 5978- tgsi: remove unused tgsi_shader_info.uses_drawid 5979- tgsi: remove unused tgsi_shader_info fields 5980- tgsi: remove unused tgsi_shader_info.uses_subgroup_info 5981- tgsi: remove unused tgsi_shader_info.writes_primid 5982- tgsi: remove unused tgsi_shader_info.uses_doubles 5983- tgsi: remove unused tgsi_shader_info.uses_derivatives 5984- tgsi: remove unused tgsi_shader_info.uses_bindless_samplers 5985- tgsi: remove unused tgsi_shader_info.uses_bindless_images 5986- tgsi: remove unused tgsi_shader_info.clipdist_writemask 5987- tgsi: remove unused tgsi_shader_info.culldist_writemask 5988- tgsi: remove unused tgsi_shader_info.images_load 5989- tgsi: remove unused tgsi_shader_info.images_store 5990- tgsi: remove unused tgsi_shader_info.images_atomic 5991- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_load 5992- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_store 5993- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_atomic 5994- tgsi: remove unused tgsi_shader_info.uses_bindless_image_load 5995- tgsi: remove unused tgsi_shader_info.uses_bindless_image_store 5996- tgsi: remove unused tgsi_shader_info.uses_bindless_image_atomic 5997- tgsi: remove unused tgsi_shader_info.indirect_files_read 5998- tgsi: remove unused tgsi_shader_info.indirect_files_written 5999- tgsi: remove unused tgsi_shader_info.const_buffers_indirect 6000- tgsi: remove unused tgsi_shader_info.max_depth 6001- tgsi: drop two unused functions 6002- nvk: use common physical device enumeration 6003- nvk: fix implicit-fallthrough warnings with clang 6004- nvk: delete commented code 6005- nvk: fix mem leaks 6006- nvk: use common descriptor set layout code 6007- nvk: use common pipeline layout code 6008- nvk: advertise KHR_shader_non_semantic_info 6009- nvk: advertise KHR_image_format_list 6010- nvk: advertise EXT_private_data 6011- nvk: advertise KHR_sampler_mirror_clamp_to_edge 6012- nvk: KHR_descriptor_update_template 6013- nvk: CmdPushDescriptorSetWithTemplateKHR 6014- nvk: drop dead assignment 6015- nvk: drop dead assignment 6016- nvk: fix initialization override 6017- nvk: sort extensions 6018- nvk: advertize KHR_relaxed_block_layout 6019- nvk: add check for VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT 6020- nvk: advertise EXT_image_2d_view_of_3d 6021- nvk: fix maxPushDescriptors 6022- nvk: call correct macro to clear views 6023- nouveau/mme: use fermi enum in fermi builder 6024- nvk: add warning on non-nouveau drm driver 6025- nvk: Implement VK_KHR_draw_indirect_count on Turing+ 6026- nvk: set device info before use in nvk_get_device_extensions 6027- nvk: simplify code by using new helpers 6028- nvk: remove duplicated device features 6029- nvk: EXT_conditional_rendering 6030- nvk: advertise VK_EXT_tooling_info 6031- nvk: set optimization level to 3 6032 6033Thong Thai (3): 6034 6035- radeonsi: enable vcn encoder rgb input support 6036- Update radeon_vcn_enc.c 6037- frontends/va/config: report max width and height for encoding/decoding 6038 6039Timothy Arceri (27): 6040 6041- glsl: fix validation of ES vertex attribs 6042- nir/opt_copy_prop_vars: don't clone copies if branch empty 6043- nir/opt_copy_prop_vars: speedup cloning of copy tables 6044- nir/opt_copy_prop_vars: remove var hash entry on kill alias 6045- nir/opt_copy_prop_vars: skip cloning of copies arrays until needed 6046- nir/opt_copy_prop_vars: drop reuse of dynamic arrays 6047- glsl: fix spirv sso validation 6048- glsl: mark structs containing images as bindless 6049- util: add radeonsi workaround for Nowhere Patrol 6050- glsl: fix out params in glsl to nir 6051- glsl_to_nir: add more unhandled function types 6052- nir: replace use of nir_src_copy() 6053- nir: remove unused nir_src_copy() 6054- nir: remove unused param from nir_alu_src_copy() 6055- glsl: remove field from gl_shader_program 6056- glsl: move get_varying_type() declaration earlier 6057- glsl: add nir version of validate_first_and_last_interface_explicit_locations() 6058- glsl: switch to nir validate_first_and_last_interface_explicit_locations() 6059- glsl: remove unused validate_first_and_last_interface_explicit_locations() 6060- nir: fix typo in comment 6061- nir: copy explicit_invariant flag to nir vars 6062- glsl: move interpolation_string() to linker_util 6063- glsl: move is_gl_identifier() to linker_util 6064- nir: add used field to nir variables 6065- glsl: implement cross_validate_outputs_to_inputs() in nir linker 6066- glsl: switch to nir linkers cross_validate_outputs_to_inputs() 6067- glsl: remove now unused varying linker code 6068 6069Timur Kristóf (39): 6070 6071- aco: Fix subgroup_id intrinsic on GFX10.3+. 6072- ac/nir: Simplify arg unpacking when shift is zero. 6073- ac/nir: Add new pass to lower intrinsics to shader args. 6074- radv: Move radv_select_hw_stage to radv_shader_info. 6075- radv: Use ac_nir_lower_intrinsics_to_args. 6076- radeonsi: Move si_select_hw_stage to si_shader_info. 6077- radeonsi: Use ac_nir_lower_intrinsics_to_args. 6078- aco: Remove subgroup_id and num_subgroups intrinsics. 6079- ac/llvm: Remove subgroup_id and num_subgroups intrinsics. 6080- aco: Refactor select_program to smaller functions. 6081- nir/opt_dead_cf: Remove if branches with undef condition. 6082- ac/nir: Add done arg to ac_nir_export_position. 6083- ac/nir: Slightly refactor how pos0 exports are added when missing. 6084- ac/nir/ngg: Wait for attribute stores before VS/TES/GS pos0 export. 6085- ac/nir/ngg: Refactor mesh shader primitive export. 6086- ac/nir/ngg: Wait for attribute ring stores in mesh shaders. 6087- ac/nir/ngg: Extract nogs_export_vertex_params function. 6088- ac/gpu_info: Add some SDMA related information. 6089- ac: Clarify SDMA opcode defines. 6090- ac: Add amd_ip_type argument to ac_parse_ib and ac_parse_ib_chunk. 6091- ac: Rename ac_do_parse_ib to parse_pkt3_ib. 6092- ac: Print IP type for IBs. 6093- ac: Add rudimentary implementation of printing SDMA IBs. 6094- radv: Rename SDMA file to radv_sdma.c 6095- radv: Use const device argument in radv_sdma_copy_buffer. 6096- radv: Use const on vi_alpha_is_on_msb arguments. 6097- radv: Only call si_cp_dma_wait_for_idle on GFX and ACE queues. 6098- radv: Move radv_cp_wait_mem to radv_cs.h and add queue family argument. 6099- radv: Refactor WRITE_DATA helper function. 6100- radv: Use new WRITE_DATA helper in more places. 6101- radv: Add queue family argument to some functions. 6102- radv: Wait for bottom of pipe in ACE gang wait postamble. 6103- radv: Simplify gang CS and semaphore initialization. 6104- radv: Allow gang submit use cases other than task shaders. 6105- radv: Slightly refactor gang semaphore functions. 6106- radv: Add gang follower semaphore functions. 6107- radv: Support SDMA in radv_cs_write_data_head. 6108- radv: Support SDMA in radv_cp_wait_mem. 6109- radv: Support SDMA in si_cs_emit_write_event_eop. 6110 6111Vignesh Raman (4): 6112 6113- ci: add Vignesh Raman into restricted traces access list 6114- Do explicit cast to suppress clang warnings 6115- ci: enforce -Wimplicit-const-int-float-conversion for clang 6116- ci: Uprev crosvm 6117 6118Vinson Lee (8): 6119 6120- nvk: Fix assert 6121- lavapipe: Fix struct initialization 6122- intel/decoder: Fix memory leak on error path 6123- nv50: Remove unused value 6124- vk/wsi/x11: Remove dead code 6125- freedreno/replay: Fix implicit-function-declaration error 6126- anv: Fix transfer type assert 6127- broadcom/qpu: Remove duplicate variable opcode 6128 6129Vitaliy Triang3l Kuzmin (3): 6130 6131- r600/asm: Fix AR force_add_cf setting if a clause is not open 6132- r600/asm: Make sure MOVA and SET_CF_IDX are in the same clause 6133- r600: Replace R600_BIG_ENDIAN with UTIL_ARCH_BIG_ENDIAN 6134 6135Vlad Schiller (15): 6136 6137- pvr: Implement VK_EXT_tooling_info 6138- pvr: Add 'info' PVR_DEBUG flag 6139- pvr: Implement VK_KHR_format_feature_flags2 6140- pvr: Remove PVR_WINSYS_BO_FLAG_ZERO_ON_ALLOC flag 6141- pvr: Add VK_KHR_driver_properties 6142- pvr: Use correct index when writing query availability data 6143- pvr: Enable VK_EXT_scalar_block_layout 6144- pvr: Enable KHR_image_format_list 6145- pvr: Enable VK_KHR_uniform_buffer_standard_layout 6146- pvr: Implement VK_KHR_external_fence 6147- pvr: Implement VK_KHR_external_semaphore 6148- pvr: Enable VK_KHR_bind_memory2 extension 6149- pvr: Implement VK_EXT_texel_buffer_alignment 6150- pvr: Implement VK_EXT_host_query_reset 6151- pvr: Fix VK_EXT_texel_buffer_alignment 6152 6153WinLinux1028 (1): 6154 6155- radeonsi: prefix function with si\_ to prevent name collision 6156 6157Xaver Hugl (1): 6158 6159- vulkan wsi: add support for PresentOptionAsyncMayTear 6160 6161Yiwei Zhang (46): 6162 6163- venus: handle query feedback creation failure 6164- venus: ensure consistency of query overflow behavior 6165- venus: add a missing barrier before copying query feedback 6166- venus: refactor query feedback cmd record 6167- venus: reduce to use 4K mem suballoc align on platforms known to fit 6168- turnip: flush cache for dstBuffer in vkCmdCopyQueryPoolResults 6169- lvp: avoid reading immutable sampler from desc write info 6170- ci/venus: update venus-lavapipe expectations 6171- venus: fix a cmd builder render_pass state leak across reset 6172- venus: fix cmd state leak across implicit reset 6173- venus: log and doc the broken query feedback in suspended render pass 6174- venus: move transient storage from cmd to pool 6175- venus: remove redundant fb tracking from cmd builder 6176- venus: use tracked queue_family_index from the cmd pool 6177- venus: cleanup vn_cmd_begin_render_pass usage 6178- venus: add helpers to track subpass view mask 6179- venus: avoid redundant tracking of render pass 6180- venus: refactor more cmd states into cmd builder 6181- venus: use in_render_pass to skip present_src counting 6182- ci/venus: remove fixed tests that no longer run 6183- ci/venus: reenable pipeline cts 6184- venus: suppress a false logging 6185- venus: add no_sparse debug option to disable sparse resource support 6186- venus: set deviceMemoryReport feature 6187- venus: expose at least one cached memory type 6188- venus: expose KHR_external_fence/sempahore_fd extensions 6189- venus: fix a device memory report leak 6190- vulkan: remove a dup entry from vk_image_usage_to_ahb_usage 6191- vulkan/android: improve vkQueueSignalReleaseImageANDROID 6192- vulkan/android: add missing AHARDWAREBUFFER_USAGE_GPU_DATA_BUFFER usage 6193- vulkan/android: drop vk_buffer dependency from common AHB impl 6194- venus: use common vk_queue object 6195- venus: use common ANB implementation 6196- venus: use more common vk_queue related implementations 6197- venus: drop device, family, index, flags tracking from vn_queue 6198- venus: fix re-export of imported classic 3d resources 6199- venus: remove redundant bo roundtrip and add more docs 6200- venus: track VkPhysicalDeviceMemoryProperties instead 6201- venus: refactor vn_device_memory to prepare for async alloc 6202- venus: make device memory alloc async 6203- venus: enable Vulkan 1.3 for Android 13 and above 6204- zink: sync queue access for vkQueueWaitIdle 6205- venus: properly expose KHR_external_fence/sempahore_fd 6206- ci/venus: mark more flaky tests after recent cts uprev 6207- venus: fix query feedback batch leak and race upon submission 6208- zink: apply can_do_invalid_linear_modifier to Venus 6209 6210Yogesh Mohan Marimuthu (12): 6211 6212- gallium: remove start_slot parameter from pipe_context::set_vertex_buffers 6213- ac/surface: add astc block size to bpe_to_format() function 6214- util: move ASTCLutHolder from mesa/main to util 6215- vulkan/formats,zink: move vk_format_from_pipe_format() function 6216- vulkan/runtime: add compute astc decoder helper functions 6217- vulkan add 3D texture support for compute astc decoder 6218- radv: integrate meta astc compute decoder to radv 6219- radeonsi: add more documentation for dpbb debug env variable 6220- docs: remove document for unused variable dfsm from AMD_DEBUG 6221- radeonsi: correct old comment in si_emit_framebuffer_state() 6222- radeonsi: In gfx6_init_gfx_preamble_state() use gfx_level only from sctx 6223- radeonsi: add radeonsi to GL_RENDERER string 6224 6225Yonggang Luo (43): 6226 6227- lima: Convert to use nir_foreach_function_impl when possible 6228- freedreno: Switch to use nir_foreach_function_impl in tu_shader.cc 6229- zink: Convert to use nir_foreach_function_impl when possible 6230- lavapipe: Convert to use nir_foreach_function_impl 6231- lavapipe: fixes indent of function lvp_inline_uniforms 6232- microsoft/compiler: convert to use nir_foreach_function_with_impl in function emit_module 6233- microsoft/clc/compiler: Convert to use nir_foreach_function_impl when possible 6234- radeonsi: Convert to use nir_foreach_function_impl 6235- ac: Switch to use nir_foreach_function_impl in function analyze_shader_before_culling 6236- util: Move pipe_swizzle from p_defines.h to u_formats.h 6237- util: Move PIPE_MASK_* from p_defines.h to u_formats.h 6238- util: Move pipe_color_union from p_defines.h into u_formats.h 6239- util: Move u_pack_color.h and dbughelp.h into src/util from/src/gallium/auxiliary/util/ 6240- util: Remove include "pipe/\*.h" in src/util/* files 6241- util:Move only gallium used u_debug_refcnt.* and u_debug_describe.* into src/gallium/auxiliary/util/ 6242- util/meson: Getting mesa util core to be self contained 6243- pvr: decouple vulkan driver and compiler from gallium 6244- freedreno: decouple compiler and vulkan driver from gallium 6245- glx: decouple from gallium 6246- meson: Remove arm_neon_workaround 6247- nouveau/drm-shim: Decouple from gallium 6248- ac/radv: decouple radv vulkan driver and compiler from gallium 6249- etnaviv: decouple drm from gallium 6250- asahi: decouple layout from gallium 6251- compiler: Move WRITEMASK_* from prog_instruction.h into shader_enums.h 6252- intel/blorp: Use float directly to avoid #include "mesa/main/format_utils.h" 6253- intel/blorp: brw_sampler_prog_key_data::swizzles is only and should only accessed in crocus 6254- intel/brw: Define and use BRW_SWIZZLE_* instead of SWIZZLE_* 6255- crocus: #include "program/prog_instruction.h" for SWIZZLE_* 6256- intel/compiler,intel/blorp,intel/vulkan: decouple vulkan driver and compiler from gallium 6257- util/treewide: Use alignas(x) instead __attribute__((aligned(x))) 6258- v3dv: Use alignas(8) over 64 bit atomic value 6259- svga: use alignas over struct MKSGuestStatInfoEntry 6260- radv: Fixes mingw linkage error undefined reference to \`radv_GetCalibratedTimestampsEXT' 6261- v3d: Use DIV_ROUND_UP instead div_round_up 6262- freedreno: Use shared DIV_ROUND_UP instead div_round_up 6263- sfn: Use 4 instead of ATOMIC_COUNTER_SIZE 6264- intel/brw: use 4 instead of MAX_VERTEX_STREAMS to avoid #include "mesa/main/config.h" 6265- d3d12: replace use of MAX_VERTEX_STREAMS with PIPE_MAX_VERTEX_STREAMS 6266- compiler: use 4 instead ATOMIC_COUNTER_SIZE in glsl_types.h to avoid #include "mesa/main/config.h" 6267- compiler/glsl: Move glsl_print_type from glsl_types.* to ir_print_visitor.cpp 6268- util: Deduplicate macros between u_math.h and macros.h 6269- nvk: Should use alignment instead of align 6270 6271Yusuf Khan (4): 6272 6273- nouveau/ws: remove the drm.h header 6274- nvk: implement GetDeviceMemoryCommitment 6275- nvk: support GetImageSparseMemoryRequirements2 6276- nvk: expose KHR_driver_properties 6277 6278Zhang Ning (1): 6279 6280- Revert "intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR" 6281 6282antonino (14): 6283 6284- virgl: add ci flake 6285- freedreno: add ci flake 6286- zink: remove unused indices from \`nir_load_push_constant` calls 6287- zink/nir: add a zink specific intrinsic for push constants 6288- vulkan/wsi: add \`vk_wsi_force_swapchain_to_current_extent` driconf 6289- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "The Talos Principle" 6290- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "Serious Sam Fusion" 6291- vulkan: Extend vkGet/SetPrivateDataEXT handling to all platforms 6292- vulkan: Extend vkGet/SetPrivateDataEXT handling to VkSurface 6293- vulkan: Handle vkSetDebugUtilsObjectNameEXT on WSI objects 6294- zink: store bindless var when creating it to avoid creating it again 6295- nir: fix several crashes in \`nir_lower_tex` 6296- nir: don't take the derivative of the array index in \`nir_lower_tex` 6297- vulkan: use instance allocator for \`object_name` in some objects 6298 6299cheyang (1): 6300 6301- isaspec : fix isaspec build error in aosp 6302 6303georgeouzou (1): 6304 6305- nvk: Support VK_EXT_line_rasterization 6306 6307jazzfool (1): 6308 6309- zink: Hash only first 32 bits of zink_gfx_pipeline_state with full DS3 6310 6311lorn10 (1): 6312 6313- docs: Update Clover's env variable documentation 6314 6315norablackcat (2): 6316 6317- spirv/nir_to_spirv: add expect assume op codes 6318- rusticl: add cl_khr_expect_assume 6319 6320timmac-qmc (1): 6321 6322- glsl: fix potential crash with DisableUniformArrayResize 6323 6324twisted89 (1): 6325 6326- util/driconf: add workarounds for the Chronicles of Riddick 6327 6328wangra (1): 6329 6330- tu/kgsl: Fix bitfield of DITHER_MODE_MRT6 6331 6332xurui (1): 6333 6334- glx: There is no need to psc++ 6335