1Mesa 24.1.0 Release Notes / 2024-05-22 2====================================== 3 4Mesa 24.1.0 is a new development release. People who are concerned 5with stability and reliability should stick with a previous release or 6wait for Mesa 24.1.1. 7 8Mesa 24.1.0 implements the OpenGL 4.6 API, but the version reported by 9glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / 10glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. 11Some drivers don't support all the features required in OpenGL 4.6. OpenGL 124.6 is **only** available if requested at context creation. 13Compatibility contexts may report a lower version depending on each driver. 14 15Mesa 24.1.0 implements the Vulkan 1.3 API, but the version reported by 16the apiVersion property of the VkPhysicalDeviceProperties struct 17depends on the particular driver being used. 18 19SHA256 checksum 20--------------- 21 22:: 23 24 b7eac8c79244806b1c276eeeacc329e4a5b31a370804c4b0c7cd16837783f78b mesa-24.1.0.tar.xz 25 26 27New features 28------------ 29 30- VK_EXT_map_memory_placed on RADV, ANV and NVK 31- VK_KHR_shader_subgroup_rotate on RADV and ANV and NVK 32- VK_KHR_load_store_op_none on RADV, ANV, NVK and Turnip 33- VK_KHR_line_rasterization on RADV, ANV, NVK and Turnip 34- VK_KHR_index_type_uint8 on RADV, ANV, NVK and Turnip 35- VK_KHR_shader_expect_assume on all Vulkan drivers 36- VK_KHR_shader_maximal_reconvergence on RADV, ANV and NVK 37- VK_KHR_shader_quad_control on RADV 38- OpenGL 4.6 on Asahi 39- OpenGL ES 3.2 on Asahi 40- Mali G610 and G310 on Panfrost 41- Mali T600 on Panfrost 42- VK_KHR_shader_subgroup_uniform_control_flow on NVK 43- alphaToOne/extendedDynamicState3AlphaToOneEnable on RADV 44- VK_EXT_device_address_binding_report on RADV 45- VK_EXT_external_memory_dma_buf for lavapipe 46- VK_EXT_queue_family_foreign for lavapipe 47- VK_EXT_shader_object on RADV 48- VK_EXT_nested_command_buffer on NVK and RADV 49- VK_EXT_queue_family_foreign on NVK 50- VK_EXT_image_drm_format_modifier on NVK 51 52 53Bug fixes 54--------- 55 56- anv: unbounded shader cache 57- radv: Crash due to nir validation fail in Enshrouded 58- bisected: turnip: deqp regressions 59- android: sRGB configs no longer exist after !27709 60- [24.1-rc4] fatal error: intel/dev/intel_wa.h: No such file or directory 61- vcn: rewinding attached video in Totem cause [mmhub] page fault 62- When using amd gpu deinterlace, tv bt709 properties mapping to 2 chroma 63- ci: switch from CI_JOB_JWT to id_tokens 64- VCN decoding freezes the whole system 65- [RDNA2 [AV1] [VAAPI] hw decoding glitches in Thorium 123.0.6312.133 after https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28960 66- nvk: Tracker issue for gamescope support 67- nvk: Implement VK_EXT_image_drm_format_modifier 68- WSI: Support VK_IMAGE_ASPECT_MEMORY_PLANE_i_BIT_EXT for DRM Modifiers in Vulkan 69- [Regression][Bisected] EGL/Wayland: QT applications terminated by SIGSEGV (Address boundary error) when using dGPU 70- radv: Enshrouded GPU hang on RX 6800 71- NVK Zink: Wrong color in Unigine Valley benchmark 72- panforst: T604 issue with using u32 for flat varyings 73- [anv] FINISHME: support YUV colorspace with DRM format modifiers 74- gen9/11 test became flaky: piglit.spec.!opengl 1_4.blendminmax 75- 24.0.6: build fails 76- mesa 24 intel A770 KOTOR black shadow smoke scenes 77- [bisected][regression] kitty fails to start due to \`glfwWindowHint(GLFW_SRGB_CAPABLE,true)` 78- r600: bisected 5eb0136a3c561 breaks a number of piglits 79- Graphical glitches in RPCS3 after updating Vulkan Intel drivers 80- [R600] OpenGL and VDPAU regression in Mesa 23.3.0 - some bitmaps get distorted. 81- VAAPI radeonsi: VBAQ broken with HEVC 82- radv/video: 10-bit support 83- radv: vkCmdWaitEvents2 is broken 84- Zink: enabled extensions and features may not match 85- glRasterPos: "Assertion \`prog->base_serialized_nir' failed." if a shader is loaded from the shader cache 86- radv: mesa-9999/src/amd/vulkan/radv_image_view.c:147: radv_set_mutable_tex_desc_fields: Assertion \`(plane->surface.u.gfx9.surf_pitch * plane->surface.bpe) % 256 == 0' failed. 87- ACO doesn't hide lds_param_load latencies 88- ACO doesn't form a VMEM clause for image stores in one case on GFX11 89- r600: Valheim hangs CAYMAN gpu (regression/bisected) 90- r600: Artifacts in Oxygen Not Included around air ducts and pipes (regression, bisected) 91- radv: UMR wave parsing format is outdated 92- radv: GetImageMemoryRequirements2 does not look at VkImagePlaneMemoryRequirementsInfo 93- RADV, regression : Objects randomly appear/disappear on Unreal Engine 4 titles using D3D12 backend on Polaris 94- mesa 23.1.0-rc3 flickering textures/lighting in Unreal 4 games Polaris10 95- ACO tests SIGSEGV in debian-vulkan job with LTO enabled 96- radv: Address binding report for images is incorrect. 97- blorp: avoid dirtying push constants in 3D 98- anv: flaky vkd3d-proton test_buffer_feedback_instructions_sm51 99- FTBFS: commit aaccc25a4dd9ccfc134e51a7e81168334d63a909 broke mesa snapshot build 100- d3d12_screen.cpp:60:10: fatal error: ShlObj.h: No such file or directory 101- r300: crash when compiling some GSK shaders 102- anv: vkd3d-proton test_stress_suballocation failure 103- d3d12: Zwift renders with bad textures/lighting 104- nir_opt_remove_phis breaks divergence analysis 105- intel: Require 64KB alignment when using CCS and multiple engines 106- NVK: Misrendering with Civilization 6 107- radv: RDR2 might need zerovram 108- intel-clc build failure, i think? 109- Issues rendering gtk4 window decorations on v3d on Fedora-40/mesa-24.0 110- clc: Failure when linking with llvm+clang 18.1 (-Dshared-llvm=disabled) 111- LLVM-18 build issue 112- vulkan/wsi/x11: VK_SUBOPTIMAL_KHR is never reported by the swapchain 113- Broken vaapi encoding on Radeon RX 6900XT 114- RUSTICL creating a shared reference to mutable static is discouraged and will become a hard error 115- anv: GPU hang on Assassin's Creed Valhalla while running benchmark 116- nvk: dota 2 crashes after ~5 seconds in game 117- dzn: conflicting defines with DirectX headers 1.613.0 118- VAAPI: Incorrect HEVC block size reported with radeonsi 119- radv: WWE 2K24 has very quirky DCC issues on RDNA2 120- anv: Dirt 5 crashes at tryCreatingPipelineStateFromCache 121- freedreno: remove headergen2 122- freedreno: remove headergen2 123- vulkan/wsi: crash in dEQP-VK.wsi.wayland.swapchain.simulate_oom.min_image_count 124- Document that Zink on MoltenVK is not expected to work 125- KiCAD 3D Viewer - rounded pads rendered incorrectly (texture mapping or stencil test error) 126- OpenSCAD rendering incorrect and inconsistent on radeonsi 127- intel/fs: regression on MTL with 64bit values in UBO 128- ci: split debian-build-testing? 129- [freedreno] Black background on SuperTux Kart with postmarketOS and Oneplus 6T 130- [radv] Half-Life Alyx renders solid black for reflective surfaces 131- iris: iris_resource_get_handle returns wrong modifier 132- [RX 7900 XTX] Helldivers 2 cause GPU reset 133- radeon: Crash in radeon_bo_can_reclaim_slab 134- regression/bisected: commit 4e3f3c10e14d8778781c81f39ced659b5ec2f148 broke mesa snapshot build 135- RV530 renders improperly at non 4:3 resolutions. 136- anv: new cooperative matrix failures with CTS 1.3.8.0 137- nvk: Missing implementation of VkImageSwapchainCreateInfoKHR and VkBindImageMemorySwapchainInfoKHR 138- mesa > 23.1.9 [opencl,video_cards_nouveau] fails to build due to missing symbol vl_video_buffer_is_format_supported 139- intel/meson: Make intel_stub_gpu work with \`meson devenv` 140- Follow-up from "iris: Fix plane indexing and handling on image import" 141- nvk,nak: Implement shaderStorageImageMultisample 142- nvk,nak: Implement VK_KHR_shader_subgroup_uniform_control_flow 143- \`[gfxhub0] no-retry page fault` triggered by \`AMD_TEST=testdmaperf` on gfx90c APU 144- nvk: glcts hangs 145- v3d: Line rendering broken when smoothing is enabled 146- PowerVR reports minMemoryMapAlignment of 64 147- RADV: GPU crash when setting 'RADV_DEBUG=allbos' 148- [intel] mesa ftbfs with time_t64 149- d3d12_resource.cpp:307:49: error: no matching function for call to 'ID3D12Heap::GetDesc()' 150- radv regression between a337a0c8072d0be487e43c2b7b132e003c6d5a5e and 83f741124b66818053b6b1b2f7e42f5217a27004 151- [build failure] [armhf] - error: #error "_TIME_BITS=64 is allowed only with _FILE_OFFSET_BITS=64" 152- R400 should have native support for sin/cos in VS 153- [radv] Crash when VkGraphicsPipelineCreateInfo::flags = ~0u 154- intel: all workarounds disabled with ATS skus 155- vulkan: GPL now broken 156- Gen4 assertion \`force_writemask_all' failed. 157- src/gallium/auxiliary/rtasm/rtasm_x86sse.c:198:10: runtime error: store to misaligned address 0x7fabba0cd011 for type 'int', which requires 4 byte alignment 158- [radv] Holographic projection texture glitch in Rage 2 159- RustiCL: Callbacks are not called upon errors 160- MTL: regressions in vulkancts due to BO CCS allocations 161- zink: spec\@ext_external_objects\@vk-image-overwrite fail 162- vaapi: radeonsi: surface_region.{x,y} is not honored in processing when source is RGB 163- nvk: Implement VK_EXT_shader_object 164- nvk: Implement VK_EXT_graphics_pipeline_library 165- turnip: UBWC disabled for MSAA 166- KHR-Single-GL46.arrays_of_arrays_gl.AtomicUsage fails on MTL 167- GTF-GL46.gtf42.GL3Tests.texture_storage.texture_storage_texture_as_framebuffer_attachment fails on MTL 168- nvk: Implement VK_KHR_maintenance5 169- [intel][anv][build][regression] - genX_grl.h:27:10: fatal error: grl/grl_cl_kernel.h: No such file or directory 170- RX 6600 VDPAU not recognizing HEVC_MAIN_10 correctly 171- Running an app on another AMD GPU (offload, DRI_PRIME) produces corrupted frames on Wayland. 172- regression in radeonsi since 9aa205668bcbf701f8f694551c284cd8e4cc17a3 (crashes in vbo_save_playback_vertex_list) 173- clang/libclc related Mesa build failures 174- Ninja Install Error 175- anv: add a dri config to enable implicit fencing on external memory interop 176- VDPAU declares a texture as "immutable" without also setting its ImmutableLevels attribute. 177- Segfault in glsl_to_nir.cpp nir_visitor::visit when assigning interface block 178- [rusticl]WARNING: Project targets '>= 1.1.0' but uses feature deprecated since '1.0.0': module rust has been stabilized. drop "unstable-" prefix from the module name 179- RX6600 hardware HEVC video decode fails for VDPAU but works for VA-API. (Can lock up GPU!) 180- Rusticl panics when getting program build logs using opencl.hpp 181- ue5 game issues lighting Rog Ally 7080u (z1e) 182- Missing textures in RoboCop: Rogue City with mesh shaders enabled 183- Intel/anv: Allow pre-compiled shader caches to be reused across multiple devices 184- radv: Multiview PSO forgets to export layer in some cases. 185- -Dintel-rt=enabled fails to build on 32-bit 186- MTL: regressions in vulkancts due to BO CCS allocations 187- intel: build failures 188- regression/bisected commit 4de62731f4db56360026cbb6a3b8566f86f22466 broke HW acceleration in the Google Chrome 189- i386 intel build failure: meson.build:45:6: ERROR: Unknown variable "prog_intel_clc". 190- rusticl: clEnqueueFillBuffer (among others) fails on buffers created from GL object. 191- MTL raytracing regression 192- [ANV/DG2] Unexpectedly slow replay of RenderDoc frame capture of Resident Evil 4 Remake 193- zink: flickering artifacts in Selaco 194- [ADL] gpu hang on dEQP-VK.synchronization.internally_synchronized_objects.pipeline_cache_graphics 195- Turnip spam on non-turnip devices 196- Intermittent compiler failures when building valhall tests 197- panfrost: graphical artifacts on T604 (T600) 198- Dying Light native artifacts on Intel A770 199- r300: Amnesia: The Dark Descent heavy corruption 200- [ANV/DG2] Age of Empires IV fullscreen "banding" artefacts 201- [mtl][anv] dEQP-VK.pipeline.monolithic.depth.format.d32_sfloat.compare_ops.* failures when run multithreaded 202- [mtl][anv] flaky tests in pipeline.monolithic.extended_dynamic_state*stencil_state_face* series 203- Broken colors/dual-source blending on PinePhone (Pro) since 23.1.0 204- r600/sfn: "Indexed locks of kcache banks 14 and 15 are ignored" in the ALU clause documentation 205- turnip: Logarithmic-time subgroup reductions using brcst.active and getlast.w8 instructions on a6xx gen4+ 206- GTF-GL46.gtf42.GL3Tests.texture_storage.texture_storage_compressed_texture_data regression 207- microsoft/compiler: Missing globally-coherent logic 208- Regression between 23.0.4 and 23.1.0: texture glitches in osgEarth 209- [Broadcom] Warning when runnin every OpenGL game on Vulkan using ZINK 210- radeonsi unsynchronized flips/tearing with KMS DRM rendering on 780M 211- radeonsi has an unchecked hard dependency on libelf 212- DR crashes with mesa 24 and rusticl (radeonsi) 213- Piglit tests assert on gen9 with zink 214- vlc crashes when playing 1920x1080 video with Radeon RX6600 hardware acceleration and deinterlacing enabled. 215- [radeonsi] Regression: graphical artifacting on water texture in OpenGOAL 216- Assertion when creating dmabuf-compatible VkImage on Tigerlake 217- Palworld fails to launch on Intel Arc unless "force_vk_vendor" is set to "-1". 218- panfrost: implement line smoothing 219- r300: backend DCE fails in piglit glsl-vs-copy-propagation-1.shader_test 220- [AMDGPU RDNA3] Antialiasing is broken in Blender 221- MTL: vulkan cooperative matrix tests gpu hang on MTL 222- nvk: Implement VK_KHR_zero_initialize_workgroup_memory 223- Assassin's Creed Odyssey wrong colors on Arc A770 224- VAAPI: EFC on VCN2 produces broken H264 video and crashes the HEVC encoder 225- etnaviv, modesetting, and glxgears 226- The Finals fails to launch with DX12 on Intel Arc unless "force_vk_vendor" is set to -1. 227- nvk: \`VK_KHR_zero_initialize_workgroup_memory` and \`VK_KHR_shader_subgroup_extended_types` not marked as complete in features.txt 228- nvk: Implement variableMultisampleRate 229- VA-API CI tests freeze 230- radv: games render with garbage output on RX5600M through PRIME with DCC 231- Warning when use ALIGN over uint64_t and uintptr_t 232 233 234Changes 235------- 236 237Adrian Perez de Castro (1): 238 239- Revert "egl/wayland: Remove EGL_WL_create_wayland_buffer_from_image" 240 241Agate, Jesse (6): 242 243- amd/vpelib: Studio Range Handling 244- amd/vpelib: White Screen Fix 245- amd/vpelib: VPT Failing Test Cases 246- amd/vpelib: VPE integration for HLG 247- amd/vpelib: Add PQ Norm to VPE interface 248- amd/vpelib: Refactor norm factor logic 249 250Alan Liu (4): 251 252- radeonsi/vpe: Add environment variable to set embbuf number 253- radeonsi/vpe: Don't map and unmap emb_buffer every time in process_frame 254- amd/vpelib: remove unused header file 255- radeonsi/vpe: support vpe 1.1 256 257Alejandro Piñeiro (3): 258 259- broadcom/compiler: fix coverity warning (unitialized pointer read) 260- v3dv/bo: use mtx_lock/unlock on cache_init too 261- v3dv: expose VK_EXT_depth_clip_enable 262 263Alexandre Marquet (1): 264 265- pan/mdg: quirk to disable auto32 266 267Alyssa Rosenzweig (328): 268 269- nir/lower_ssbo: rewrite 270- nir/lower_blend: return progress 271- nir/lower_io_to_temporaries: return prog 272- nir/lower_clip_cull_distance_arrays: return prog 273- nir: return prog from drawpixels 274- nir/lower_bitmap: return prog 275- nir/lower_alpha_test: rewrite with intrinsics_pass 276- nir/lower_point_size_mov: return prog 277- nir/lower_passthrough_edgeflags: return progress 278- nir/lower_io_arrays_to_elements: return prog 279- nir/lower_flatshade: fix metadata 280- glsl: return progress in point size linking 281- glsl: don't use NIR_PASS_V 282- glsl: fix metadata in gl_nir_zero_initialize_clip_distance 283- mesa/st: return progress in st_nir_lower_wpos_ytransform 284- mesa/st: use instructions_pass for plane lowering 285- mesa/st: return progress lowering builtins 286- mesa/st: don't use NIR_PASS_V 287- agx: rm deadcode 288- agx: ingest undefs in the backend 289- agx: stop lowering in opt loop 290- agx: only lower vars to ssa once 291- agx: fix metadata in layer lowering 292- agx: unset silly nir opts 293- agx: return progress from passes 294- asahi: return progress from passes 295- asahi: avoid silly internal NIR_PASS in gs lowering 296- asahi: don't use NIR_PASS for removing entrypoints 297- asahi: don't use NIR_PASS_V 298- nir/passthrough_gs: plug leak 299- compiler,gallium: move u_decomposed_prim to common 300- nir/passthrough_gs: flesh out gs_in_prim 301- compiler: add a vs.tes_agx bit 302- asahi: add more uapi stubs 303- asahi: gut macOS related code 304- asahi: lower poly stipple 305- asahi: Implement skeleton for tessellation 306- asahi: fix metadata for images with VS lowered to GS 307- asahi: implement VBO robustness 308- asahi: implement reset queries 309- asahi: enable robustness 310- asahi: fix unbound ssbos 311- asahi: optimize more when linking libagx 312- asahi: decode uniform_high records 313- agx: implement load_subgroup_invocation 314- agx: lower more subgroups 315- agx: introduce ballot pseudo 316- agx: fuse ballot+cmp 317- nir: add active_subgroup_invocation_agx sysval 318- agx: implement active_subgroup_invocation_agx 319- agx: optimize first_invocation 320- agx: optimize vote_eq 321- asahi: fix prim restart unrolling with indirects 322- asahi: delete bogus assertion 323- asahi: plug passthrough tcs leak 324- asahi: rework meta shader infra 325- asahi: plug geometry shader leaks 326- asahi: plug pre-gs leak 327- asahi: plug early_serialized_nir leak 328- asahi: plug so target leak 329- asahi: plug glsl type leak 330- asahi: plug geometry heap leak 331- asahi: fix UB in qbo's 332- agx: add some more bitop tests 333- Revert "asahi: don't canonicalize nans/flush denorms when copying" 334- asahi: sync with query mismatches 335- asahi: enable tcs caching 336- asahi: don't sync for uninitialized buffer 337- asahi: fix valid buffer tracking for SSBO/image/XFB 338- asahi: handle read-only SSBOs 339- asahi: honour discard_whole | persistent 340- agx: only run early tests if needed 341- docs/asahi: fix strided linear note 342- ail: add tests for linear<-->twiddled copies 343- ail: port tiling routines to c++ 344- ail: use template for tiled memcpy 345- agx: don't inline imms into stack_store 346- agx: optimize b2x(inot) 347- agx: reassociate bcsel with ior/iand 348- asahi: implement pipeline stats as a checkbox 349- asahi: log geometry shaders separate from xfb 350- asahi: don't use util_resource_size 351- asahi: fix vbo dirty track 352- asahi: force inline ppp update logic 353- asahi: skip set if tested 354- asahi: rm dead 355- asahi: track bit count, not word count 356- asahi: enable compblit behind dbg flag 357- asahi: allow disk cache with compblit 358- asahi: assert invariant 359- asahi: drop silly else 360- asahi: rewrite queries 361- asahi: split up stage uniform upload 362- asahi: dirty track stage uniforms 363- asahi: collapse stage uniform upload 364- asahi: optimize "no changes" case 365- asahi: optimize no changes descriptor case 366- asahi: move some code into dirty tracking 367- asahi: drop any_draws 368- asahi: fix instance count with indirect draw 369- asahi: collapse if 370- asahi: hoist xfb code 371- asahi: hoist layer id code 372- asahi: rm blank 373- asahi: track batches with incoherent writes 374- asahi: optimize memory_barrier 375- asahi,agx: use intrinsics pass 376- agx: clamp register file based on workgroup size 377- agx: improve scratch size accounting 378- asahi: add has_scratch to shader key 379- agx: set nr_preamble_gprs for preamble scratch 380- asahi: allocate preamble scratch 381- agx: allow 16-bit immediate on stack load/store 382- agx: print register vectors 383- agx: introduce "memory variables" 384- agx: add spill/fill lowering pass 385- agx: unit test memory parallel copies 386- agx: unit test spill/fill lowering 387- agx: add parallel copy printing 388- agx: add =spill debug option 389- asahi: bump max threads per wg 390- asahi: drop xfb hack 391- asahi: allow vertex/geom/tess side effects 392- agx: fix buffer overflow with varying slots 393- asahi,agx: use hw clip distance 394- asahi: fix dirty tracking issue 395- asahi: rip out existing MDI+GS implementation 396- libagx: fix buggy align macro 397- asahi: make GS flatshade_first more dynamic 398- libagx: use native static_assert on host 399- libagx: use real PACKED macro 400- libagx: static assert some sizes 401- libagx: generalize vertex_id_for_topology 402- asahi: simplify IA mode handling 403- asahi: add shader_info::outputs for gs lower 404- asahi: add geometry parameters for separable GS 405- asahi: rework shader stage handling a bit 406- asahi: separate GS from VS 407- asahi: rm arrayed output lowering 408- asahi: allow bindful GS textures 409- asahi: shrink GS key 410- asahi: infer stage in descriptor update 411- asahi: be a bit more methodical with shader stages 412- nir: rm load_vert_id_in_prim_agx 413- asahi: allow lowering bindings after lowering textures 414- asahi: collapse indirection with GS 415- asahi: support stage override in sysval lower 416- asahi: set gs_grid[0] even for direct draws 417- asahi: use load_instance_id in gs lowering 418- asahi: fix vertex out size calc 419- asahi: invert geometry shaders 420- asahi: implement GS disk caching 421- asahi: rm dead 422- asahi: simplify expressions involving xfb 423- asahi: avoid silly psiz writes even with gs 424- asahi: eliminate tri fan % 425- asahi: make provoking vertex dynamic 426- asahi: make gs topology dynamic 427- asahi: support GS in shaderdb 428- asahi: always support ARB_clip_control 429- asahi: make clip_halfz dynamic 430- asahi: rm ia key 431- agx: remove discard -> zs_emit lower 432- agx: rm dead sample count argument 433- agx: call agx_nir_lower_sample_mask earlier 434- agx: rm unused backend nr_samples 435- agx: rm unused opt_ixor_bcsel 436- agx: sink wait_pix 437- asahi: Implement ARB_texture_barrier by decompression 438- asahi: quelch gcc warning 439- agx: rm ridiculous dependency 440- agx: decouple compiler from genxml 441- agx: use #pragma once 442- asahi/lib: use #pragma once 443- ail: use #pragma once 444- asahi: use #pragma once 445- asahi: clean up format table renderability 446- asahi: split out genxml/ directory 447- agx: move SSBO lowering 448- agx: call texture lowering in the driver 449- agx: move texture lowering into lib 450- agx: decouple from libagx 451- asahi: reorder compiler before clc 452- asahi: precompile helper program 453- agx: add "is helper program?" key bit 454- asahi: advertise GL4.6 and ES3.2 455- docs: update for GL4.6 and ES3.2 on asahi 456- vulkan: add vk_index_type_to_restart helper 457- tu: use vk_index_to_restart 458- anv,hasvk: use vk_index_to_restart 459- util/hash_table: add u64 foreach macro 460- util/ralloc: add memdup 461- treewide: use ralloc_memdup 462- panfrost: Add a library to build CSF command streams 463- panfrost: Add support for the CSF job frontend 464- nir/opt_shrink_vectors: hoist alu helpers 465- nir/opt_shrink_vectors: shrink some intrinsics from start 466- util: add _mesa_hash_table_u64_num_entries 467- nir/print: do not print empty lists on intrinsics 468- util/hash_table: add DERIVE macro 469- panfrost: derive ht 470- asahi: derive ht 471- nvk: derive ht 472- radeonsi: derive ht 473- v3d: derive ht 474- glsl_types: derive ht 475- asahi: bump maximum samplers for Blender 476- asahi: allow more samplers for shaderdb 477- asahi: move more code out of agx_preprocess_nir 478- asahi/lib: fix overread with stateful 479- asahi: fix overread with samplers 480- asahi: clarify how unroll index buffers are offsetted 481- asahi: zero more in the unroll path 482- asahi: fix unit mismatch with unroll path 483- asahi: fix stage accounting for meta compute shaders 484- asahi: export build_meta_shader 485- asahi: add flush_query_writers helper 486- asahi: add helper to classify queries 487- asahi: accelerate QBO copies 488- asahi: fix depth bias interactions with points/lines 489- asahi: implement CDM stream linking for GS 490- asahi: be robust against tess batch changes 491- asahi: stop merging VS and TCS 492- asahi: drop TCS key 493- asahi: drop asahi_vs_next_stage 494- libagx: improve static assert message 495- asahi/clc: fix mem leaks 496- agx/opt_cse: alloc less 497- agx: fix stack smash with spilling 498- agx: fix allocating phi sources past the reg file 499- agx: add more asserts 500- agx: add num_successors helper 501- agx: fix 16-bit mem swaps 502- agx: scalarize vector phis 503- agx: allow vector phis to pass validation 504- agx: assert phis don't have .kill set 505- agx: fix bogus implicit cast with 2d msaa arrays 506- agx: sink harder 507- agx: implement live range splits of phis 508- agx: don't leak shuffle copies 509- agx: add more iterator macros 510- agx: add temp_like helper 511- agx: add before_function cursor 512- agx: add limit for max sources per non-phi 513- agx: coalesce phi webs 514- agx: try to coalesce moves 515- agx: drop scratch regs for spilling 516- agx: validate phi sources for consistency 517- agx: add SSA reindexing pass 518- agx: add SSA repair pass 519- agx: add Braun-Hack spiller pass 520- agx: switch to Braun-Hack spiller 521- agx: use dense reg_to_ssa map 522- agx: make add_successor public 523- agx: add helpers for multiblock unit tests 524- agx: add tests for SSA repair 525- agx: move spill/fills accounting to shaderdb 526- agx: enable indirect temps 527- agx: generalize remat code 528- agx: implement get_sr remat 529- asahi: use less bindless samplers 530- agx: add more shaderdb stats 531- agx: fix lowering uniforms with abs/neg 532- agx: restrict high uniforms with textures 533- agx: extract "accepts uniform?" ISA query 534- agx: model 64-bit uniform restriction on ALU 535- agx: extract agx_is_float_src 536- agx: promote constants to uniforms 537- agx: compact 32-bit constants 538- agx: test constant compaction 539- agx: implement load_subgroup_id 540- libagx: polyfill glsl ballot() 541- libagx: accelerate restart unroll across a subgroup 542- libagx: accelerate prim restart unroll across wg 543- libagx: deal with silly NIR 544- libagx: parallelize prefix sum over 1024 threads 545- agx: use funop short form 546- agx: split select opt into its own pass 547- agx: vectorize uniform_store 548- agx: start a crude cycle model 549- agx/opt_preamble: improve preamble cost function 550- agx/opt_preamble: restrain ourselves 551- agx/opt_preamble: preamble cycle estimates 552- agx/opt_preamble: improve rewrite cost est 553- docs/asahi: document UVS 554- nir: add offset to load_coefficients_agx 555- nir: add intrinsics for lowered VS outputs 556- asahi: add agx_push_packed 557- asahi: drop =varyings debug 558- asahi: extract agx_cf_binding 559- agx: explicitly assign coeff registers 560- agx: pack indirect CF 561- agx: handle indirect varyings 562- asahi: advertise indirect fs inputs 563- agx: rm unnecessary iter hack 564- agx: pack indirect st_vary 565- agx: inline imm into st_vary 566- asahi: rewrite varying linking 567- asahi: drop now-empty base key 568- asahi: make point size replacement dynamic 569- asahi: stop using GLSL indirect lowering 570- agx/lower_vbo: dce as we go 571- asahi: drop dead linked_so code 572- asahi: use ht derive more 573- asahi: fix _packed USC structs 574- asahi: delete layer id code 575- asahi: don't set writes_memory for tib spilling 576- agx: optimize out wait_pix in some cases 577- agx: inline sampler states 578- agx: always reserve sampler #0 for txf 579- asahi: fix bit sizes in point sprite lower 580- nir: add samples_log2_agx sysval 581- nir: add export/load_exported_agx intrinsics 582- agx: wire up samples_log2 sr 583- agx: generalize preloaded cache 584- agx: implement exports 585- agx: document non-monolithic ABI 586- asahi: add agx_usc_push_packed helper 587- asahi: constify agx_build_tilebuffer_layout 588- asahi: don't allocate tib space for gaps 589- nir: add intrinsics for non-monolithic agx shaders 590- agx: drop shader stage assertion 591- asahi: static assert blend key size 592- agx: add agx_shader_part data structure 593- agx: add main_size info 594- asahi: add fast linker 595- asahi/clc: stop padding binaries 596- asahi: switch to VS/FS prolog/epilog system 597 598Amber (3): 599 600- tu: wideLines support for a7xx. 601- tu: Add MESA_VK_DYNAMIC_RS_LINE_WIDTH to tu_rast_state. 602- tu: re-emit vertex buffer on MESA_VK_DYNAMIC_VI_BINDINGS_VALID dirty. 603 604Amber Harmonia (1): 605 606- freedreno/common: Fix register stomper ranges for A7XX 607 608Andres Calderon Jaramillo (1): 609 610- radeonsi: get enc/dec caps from kernel only on amdgpu 611 612Antoine Coutant (2): 613 614- clc: retrieve libclang path at runtime. 615- drisw: fix build without dri3 616 617Anton Bambura (2): 618 619- panfrost: Enable Mali-T600 620- docs/panfrost: Document Mali-T600 support 621 622Antonio Gomes (6): 623 624- mesa/st: Skip querying PCI values in interop_query_device_info if version >= 4 625- rusticl/gl: Bump mesa_glinterop_device_info to version 4 626- gallium: Add new PIPE_CAP_CL_GL_SHARING 627- iris: Set PIPE_CAP_CL_GL_SHARING to true 628- radeonsi: Set PIPE_CAP_CL_GL_SHARING to true 629- rusticl/device: Verify for PIPE_CAP_CL_GL_SHARING when enabling gl_sharing 630 631Arthur Huillet (1): 632 633- nvk: remove useless MME scratch 26 usage 634 635Asahi Lina (17): 636 637- asahi: libagx: introduce AGX_STATIC_ASSERT 638- agx: Rename some SRs 639- nir: Add AGX-specific helper opcodes 640- agx: Hook up AGX helper NIR intrinsics 641- agx: Hook up helper intrinsics into CL 642- agx: Add scaffolding to build the helper shader at device init 643- agx: compiler: Add fence_helper_exit_agx barrier 644- agx: compiler: Export scratch size to the driver 645- agx: compiler: Enable stack_adjust 646- asahi: libagx: Move PACKED and GLOBAL macros to libagx.h 647- asahi: cmdbuf: Fix scratch bucket offset/size 648- asahi: Implement scratch allocation 649- asahi: scratch: Add feature to debug core IDs 650- asahi: Hook up scratch 651- asahi: Allocate scratch for shaders 652- asahi: Enable scratch debugging 653- asahi: batch: Trace before waiting for syncobj 654 655Assadian, Navid (2): 656 657- amd/vpelib: Apply inverse gamut remap to background 658- amd/vpelib: Use uint64 for buffer size 659 660Axel Davy (5): 661 662- frontend/nine: Fix ff ps key 663- frontend/nine: Fix programmable vs check 664- frontend/nine: Fix missing light flag check 665- frontend/nine: Fix destruction race 666- frontend/nine: Reset should EndScene 667 668Bas Nieuwenhuizen (10): 669 670- util/disk_cache: Add marker on cache usage. 671- radv: Remove ray_launch_size_addr_amd system value. 672- radv: Add winsys argument to buffer map/unmap. 673- radv/winsys: Use radv_buffer_map wrapper. 674- radv/amdgpu: Use mmap directly. 675- radv: Support for mapping a buffer at a fixed address. 676- radv: Implement reserving the VA range on unmap. 677- radv: Expose VK_EXT_map_memory_placed. 678- radv: Fix differing aspect masks for multiplane image copies. 679- radv: Use zerovram for Enshrouded. 680 681Benjamin Lee (14): 682 683- nak: support predicate swaps on SM50 684- nak: support predicate sel on SM50 685- nak: fix frnd on SM50 686- nak: implement FSWZADD on SM50 687- nak: implement FLO on SM50 688- nak: fix iabs on SM50 with an explicit i2i op 689- nak: implement rro op on SM50 690- nak: use rro when emitting mufu on SM50 691- nak: implement kill op on SM50 692- nak: implement cs2r op on SM50 693- nak: handle nop ops from NAK IR on SM50 694- nak: fix lod mode encoding for SM50 tld op 695- nak: fix tex offset encoding on SM50 696- nvk: disable shaderResourceMinLod on pre-sm70 697 698Benjamin Tissoires (3): 699 700- CI: add mr-label-maker.yml config 701- .mr-label-maker.yml: fix wrong label 702- CI: add a test for checking the validity of .mr-label-maker.yml 703 704Biju Das (1): 705 706- gallium: Add Renesas rzg2l-du DRM entry point 707 708Blisto (1): 709 710- driconf: set vk_x11_strict_image_count for Atlas Fallen Vulkan 711 712Bob Beckett (2): 713 714- panfrost: Add an entry for panthor in the renderonly_drivers[] array 715- panfrost: Add the gallium glue to get panfrost loaded when panthor is detected 716 717Boris Brezillon (193): 718 719- panvk: Fix tracing 720- panvk: Fix access to unitialized panvk_pipeline_layout::num_sets field 721- panfrost: Kill unused forward declarations in pan_texture.h 722- panfrost: Add a per-gen panfrost_format_from_pipe_format() helper 723- panfrost: Add a per-gen panfrost_blendable_format_from_pipe_format() helper 724- panfrost: Make panfrost_format_to_bifrost_blend() a per-gen helper 725- panfrost: Add panfrost_[blendable]_format_table() helpers 726- panfrost: Move panfrost_is_yuv() to pan_format.h 727- panfrost: Move YUV-debugging out of panfrost_new_texture() 728- panfrost: Stop passing a panfrost_device to panfrost_new_texture() 729- panfrost: Don't pass a panfrost_device to panfrost_format_supports_afbc() 730- panfrost: Don't pass a panfrost_device to panfrost_afbc_can_tile() 731- panfrost: Stop passing a panfrost_device to pan_blend_get_internal_desc() 732- panfrost: Stop exposing pan_blend_create_shader() 733- panfrost: Stop passing a panfrost_device to pan_blend_create_shader() 734- panfrost: Stop passing a panfrost_device to pan_inline_rt_conversion() 735- panfrost: Make the pan_blend logic panfrost_device-agnostic 736- panfrost: Get rid of unused panfrost_device arguments in pan_blitter.c 737- panfrost: Pass the tile buffer budget through pan_fb_info 738- panfrost: Pass the sample position array through pan_fb_info 739- panfrost: Pass no_hierarchical_tiling info through pan_tiler_context 740- panfrost: Pass tiler heap info through pan_tiler_context 741- panvk: Inline pan_wls_mem_size() 742- panfrost: Make pan_desc.{c,h} panfrost_device agnostic 743- panfrost: Drop unused panfrost_device forward declaration in pan_shader.h 744- panfrost: Make pan_layout.c panfrost_device agnostic 745- panfrost: Make pan_sample.c panfrost_device agnostic 746- panfrost: Make pan_encoder.h panfrost_device agnostic 747- panfrost: Remove unused header inclusions from pan_blitter.h 748- panfrost: Make pan_blitter.h includable from non per-gen files 749- panfrost: Make pan_blitter.{c,h} panfrost_device agnostic 750- panfrost: Make pan_indirect_dispatch panfrost_device agnostic 751- panfrost: Make pan_pool.h panfrost_{device,bo} agnostic 752- panfrost: Make pan_props.c panfrost_device agnostic 753- panfrost: Make pan_texture.{c,h} panfrost_bo agnostic 754- panfrost: Make pan_desc.{c,h} panfrost_bo agnostic 755- panfrost: Remove uneeded pan_device.h inclusions 756- panfrost: Make panfrost_texfeatures.c panfrost_device agnostic 757- panfrost: Make pan_perf panfrost_device agnostic 758- panfrost: Add a helper to retrieve a panfrost_bo from a pan_kmod_bo 759- panvk: Get rid of unused pdev arguments passed to some meta helpers 760- panvk: Stop passing panfrost_device around in internal meta helpers 761- panvk: Store various physical device properties at the physical_device level 762- panvk: Use vk_device::drm_fd instead of going back to the physical device 763- panvk: Move panfrost_device and panvk_meta to panvk_device 764- panvk: Add a decode context at the panvk_device level 765- panvk: Instantiate our own blitter/blend_shader caches 766- panvk: Add pan_kmod_{vm,dev} objects to panvk_device 767- panvk: Add the concept of private BO 768- panvk: Transition panvk_pool to panvk_priv_bo 769- panvk: Transition panvk_descriptor_set to panvk_priv_bo 770- panvk: Transition panvk_pipeline to panvk_priv_bo 771- panvk: Transition panvk_{image,buffer}_view to panvk_priv_bo 772- panvk: Track blit src/dst using pan_kmod_bo objects 773- panvk: Keep a ref to a pan_kmod_bo in panvk_image 774- panvk: Keep a ref to a pan_kmod_bo in panvk_buffer 775- panvk: Keep tiler_heap and sample_positions BOs at the panvk_device level 776- panvk: Move away from panfrost_{bo,device} 777- panfrost: Move pan_{bo,device}.{c,h} to the gallium driver dir 778- panfrost: Clamp the render area to the damage region 779- panfrost: v4 doesn't have Blend descriptors 780- panfrost: Pad compute jobs with zeros on v4 781- pan/va: Add missing valhall_enums dep to valhall_disasm 782- pan/kmod: Fix typo in pan_kmod_vm_op_check() helper 783- pan/kmod: Add a PAN_KMOD_VM_FLAG_TRACK_ACTIVITY flag 784- pan/kmod: Reject pre 1.1 panfrost kernel drivers 785- panfrost: Rework the way we compute thread info 786- panfrost: Prepare support for GPU variants 787- pan/perf: Reject panthor kernel driver 788- drm-uapi: Add panthor uAPI 789- pan/kmod: Add a backend for panthor 790- panfrost: Add v10 support to libpanfrost 791- pan/genxml: Various CS related improvements in v10.xml 792- pan/decode: Introduce the concept of usermode queue 793- panfrost: Don't allocate a tiler heap buffer on v10+ 794- pan/genxml: Make sure pan_pack() evaluates 'dst' only once 795- panfrost: Relax position result alignment constraint on v10+ 796- panfrost: Add arch-specific context init/cleanup hooks 797- panfrost: Add a panfrost_context_reinit() helper 798- panfrost: Add a cleanup_batch() method to panfrost_vtable 799- panfrost: Enable v10 in the gallium driver 800- panfrost: Advertize G610 support 801- panfrost: Advertize G310 support 802- panfrost: Update the release note to mention G310/G610 addition 803- vk/meta: Add the PUSH_DESCRIPTOR_BIT flag when creating blit pipeline layouts 804- vk/meta: Fix base_type selection in build_{clear,blit}_shader() 805- panvk: Fix call ordering in panvk_DestroyDevice() 806- panvk: clang-format the source files 807- panvk: Kill the panvk_pack_color() prototype 808- panvk: Add VKAPI_{ATTR,CALL} specifiers to all panvk-specific entrypoints 809- panvk: Do not handle invalid NULL memory object in BindImageMemory2() 810- panvk: Get rid of unused panvk_image_get_plane_size() helper 811- panvk: Get rid of the custom device lost handling 812- panvk: Fix allocation scope of command buffer sub-objects 813- panvk: Add missing util_dynarray_init() in panvk_cmd_open_batch() 814- panvk: Don't open-code panvk_cmd_open_batch() in CmdBeginRenderPass2() 815- panvk: Don't allocate a TEXTURE descriptor in CreateImageView() 816- panvk: s/panvk_event_op/panvk_cmd_event_op/ 817- panvk: Allocate descriptor set arrays using vk_multialloc_zalloc() 818- panvk: Don't pass a device where we don't need one 819- panvk: Get rid of unused panvk_cmd_buffer fields 820- panvk: Kill panvk_{Create,Destroy}SamplerYcbcrConversion() 821- panvk: Drop panvk_framebuffer 822- panvk: Get rid of panvk_pipeline_cache 823- panvk: Make panvk_buffer_view inherit from vk_buffer_view 824- panvk: Make panvk_device_memory inherit from vk_device_memory 825- panvk: Make pan_AllocateMemory() robust to errors 826- panvk: Add extra checks to panvk_MapMemory() 827- panvk: Implement {Map,Unmap}Memory2KHR 828- panvk: Make panvk_sampler inherit from vk_sampler 829- panvk: Fix GetPhysicalDeviceProperties2() to report accurate info 830- panvk: Get rid of fields we already have in vk_xxx objects 831- panvk: Disable global offset on varying and non-VS attribute descriptors 832- panfrost: Move the image attribute offset adjustment to a NIR pass 833- panvk: Implement dynamic rendering entry points 834- nir: Extend nir_get_io_offset_src_number() to support load_push_constant 835- nir: Extend nir_lower_mem_access_bit_sizes() to support push constants 836- pan/bi: Lower push constant accesses 837- pan/bi: Lower load_push_constant with dynamic indexing 838- pan/bi: Update the push constant count when emitting load_push_constant 839- panvk: Move some macros to panvk_macros.h 840- panvk: Move image related definitions to panvk_image.{h,c} 841- panvk: Move the VkBuffer logic to its own source file 842- panvk: Move the VkBufferView logic to its own file 843- panvk: Move the VkDeviceMemory logic to panvk_device_memory.{c,h} 844- panvk: Move the VkSampler logic to its own file 845- panvk: Move panvk_pipeline definition to panvk_pipeline.h 846- panvk: Move VkImageView logic to its own source files 847- panvk: Move the VkEvent logic to panvk_event.{c,h} 848- panvk: Move panvk_descriptor_{set,pool} definitions to panvk_descriptor_set.h 849- panvk: Move VkDescriptorSetLayout logic to panvk_descriptor_set_layout.{c,h} 850- panvk: Move VkPipelineLayout logic to its own file 851- panvk: Move shader related definitions to panvk_[vX\_]shader.{c,h} 852- panvk: Kill panvk_[vX\_]cs.{c,h} 853- panvk: Move panvk_{draw,dispatch}_info definitions to panvk_vX_cmd_buffer.c 854- panvk: Move the VkCommandPool logic to panvk_cmd_pool.{c,h} 855- panvk: Move VkQueue logic to panvk_[vX\_]queue.{c,h} 856- panvk: Add a panvk_arch_dispatch_ret() variant 857- panvk: Make the device creation/destruction per-arch 858- panvk: Move the VkInstance logic to panvk_instance.{c,h} 859- panvk: Move the VkPhysicalDevice logic to panvk_physical_device.{c,h} 860- panvk: Move panvk_meta definitions to panvk_meta.h 861- panvk: Move panvk_device definition to panvk_device.h 862- panvk: Move the panvk_cmd_buffer definitions in panvk_cmd_buffer.h 863- panvk: Move the panvk_priv_bo logic to panvk_priv_bo.{c,h} 864- panvk: Move panvk_wsi definitions to panvk_wsi.h 865- panvk: Kill panvk_private.h 866- panvk: Make panvk_buffer_view per-gen 867- panvk: Make panvk_image_view per-gen 868- panvk: Make panvk_sampler a per-gen 869- panvk: Make panvk_cmd_buffer per-gen 870- panvk: Make panvk_shader per-gen 871- panvk: Make panvk_descriptor_set per-gen 872- panvk: Make panvk_descriptor_set_layout per-gen 873- panvk: Make panvk_pipeline per-gen 874- panvk: Make panvk_queue per-gen 875- panvk: Make panvk_pipeline_layout per-gen 876- panvk: Fix attach-less rendering 877- panvk: Fix the colorAttachmentCount check in begin_rendering_init_fbinfo() 878- pan/bi: Support fragment store_output() with a non-zero offset 879- panvk: Don't assume VkGraphicsPipelineCreateInfo::pColorBlendState != NULL 880- pan/bi: Allow subpass sampler dims 881- panvk: Fix input attachment support 882- panvk: Fill pan_tls_info::wls::instances 883- panvk: Make sure the sample_pattern is set in the tiler descriptor 884- panvk: We don't support resolve operations yet 885- pan/bi: Extend bi_emit_texc() to support wider direct tex/sampler idx 886- panvk: Don't assume pViewportState != NULL 887- panvk: Fix img2buf copies with image X offset not aligned on 16 pixels 888- panvk: Fix has_non_vs_attribute() test in panvk_draw_prepare_vs_attribs() 889- panvk: Make sure we pick a valid wrap_mode_r value for unnormalizedCoordinates 890- panvk: Fix depth/stencil image views 891- panvk: Make sure we have a decode context created when we need one 892- panvk: Don't advertize vertex_buffer cap on sRGB formats 893- panvk: Swizzle the border color on v7 when the format is BGR 894- panvk: Re-order things in panvk_physical_device_init() 895- panvk: Fill maxCustomBorderColorSamplers 896- panvk: Skip tiler jobs when the vertex shader doesn't write the position 897- panvk: Make sure we use the proper format for views of depth+stencil images 898- panvk: Abort on fault when PANVK_DEBUG=sync 899- panvk/ci: Make sure we catch GPU faults 900- panvk/ci: Enable dEQP-VK.pipeline.monolithic.* 901- panvk: Add support for KHR_push_descriptor 902- panvk/ci: Re-enable copy_and_blit tests 903- panvk: Stop declaring one push constant array per graphics stage 904- panvk: Pass the push constant array to draw/dispatch calls 905- panvk: Stop lowering push constant loads to UBO loads 906- panvk: Dissociate UBO and push_constant emission 907- nir/lower_blend: Fix nir_blend_logicop() for 8/16-bit integer formats 908- panfrost: do not write outside num_wg_sysval 909- panfrost: Add the BO containing fragment program descriptor to the batch 910- pan/kmod: Fix a syncobj leak in the panthor backend 911- pan/kmod: Make default allocator thread-safe 912 913Boyuan Zhang (5): 914 915- radeonsi/vcn: only use multi slices reflist when available 916- meson: bump the minimal required vdpau version to 1.4 917- ac/gpu_info: Add vcn dec and enc version query 918- radeonsi/vcn: choose rc_per_pic by encode verison 919- radeonsi/vcn: mark rc_per_pic as obsoleted 920 921Błażej Szczygieł (2): 922 923- gallivm/ssbo: replace run time loop by compile time loop 924- gallivm/ssbo: mask offset with exec_mask instead of building the 'if' 925 926Caio Oliveira (268): 927 928- intel/compiler/xe2: Implement instruction compaction for DPAS. 929- intel/compiler: Add couple of tests for fs_combine_constants 930- intel/compiler: Fix rebuilding the CFG in fs_combine_constants 931- intel: Use an intel enum for cmat scope 932- intel/compiler: Enable lower_rotate_to_shuffle in subgroup lowering 933- anv: Advertise VK_KHR_shader_subgroup_rotate 934- iris: Remove unused brw_* includes 935- iris: Remove prototypes for unsupported Gfx versions 936- iris: Remove unused paramater 937- iris: Call blorp_finish() when destroying context 938- crocus: Call blorp_finish() when destroying context 939- intel/compiler: Rename brw_image_param to isl_image_param 940- intel/compiler: Rename BRW_WM_MSAA_* enums to INTEL_MSAA_* 941- intel/compiler: Rename BRW_TESS_* enums to INTEL_TESS_* 942- intel/compiler: Rename DISPATCH_MODE_* enums to INTEL_DISPATCH_MODE_* 943- intel/compiler: Rename brw_vue_map to intel_vue_map 944- intel/compiler: Rename brw_cs_dispatch_info to intel_cs_dispatch_info 945- intel/compiler: Move disassemble functions to own header file 946- intel/compiler: Include brw_disasm_info.h where its used 947- intel/compiler: Merge intel_disasm.[ch] into corresponding brw files 948- intel: Rename i965_{asm,disasm} tools to brw_{asm,disasm} 949- intel/blorp: Don't require specific prog_data type in callback 950- intel/blorp: Remove brw\_ prefix when not applicable 951- intel/blorp: Simplify blorp_compile_fs() interface 952- intel/blorp: Simplify blorp_compile_cs() interface 953- intel/blorp: Use a struct to return blorp_compile_*() results 954- intel/blorp: Remove outdated reference in comment 955- intel/blorp: Move brw_blorp_get_urb_length helper 956- intel/blorp: Avoid brw types in blorp_priv.h 957- intel/blorp: Move brw_compiler.h include to where is needed 958- intel/blorp: Use a Meson dependency for blorp 959- intel: Add missing dependencies on blorp 960- intel/decoder: Move decoder to a separate module 961- intel/compiler: Collect NIR-only passes in intel_nir.h 962- intel/compiler: Rename the passes and files related to intel_nir.h 963- intel/compiler: Rename brw_gfx_ver_enum.h to intel_gfx_ver_enum.h 964- intel: Remove brw\_ prefix from process debug function 965- intel/isl: Include compiler generic header 966- anv: Remove lower_atomics from storage image lowering opts 967- iris: Remove no-ops from storage image lowering 968- intel/compiler: Use "intel" prefix for walk_order enum 969- iris: Add stage to iris_compiled_shader 970- iris: Don't use prog_data to guard 3DSTATE_CONSTANT_* code 971- iris: Reduce dependency on brw_*_prog_data structs 972- iris: Take ownership of prog_data when applying it 973- iris: Use uint32_t instead of brw_param_builtin 974- iris: Move compiler creation to iris_program.c 975- iris: Add IRIS_MAX_* constants to replace BRW_MAX_* usage 976- iris: Add helper to access use_tcs_multi_patch 977- iris: Add helper for indirect_ubos_use_sampler 978- iris: Move iris_get_compiler_options to iris_program.c 979- iris: Include brw_compiler.h only when needed 980- intel/meson: Remove usage of meson.source_root and meson.build_root 981- intel/meson: Fix warning about broken str.format 982- intel/elk: Fork Gfx8- compiler by copying existing code 983- intel/elk: Compile ELK library, tests and tools 984- intel/elk: Remove compiler specific devinfo hash 985- intel/elk: Remove a bunch of files that don't apply for Gfx8- 986- intel/elk: Use common code in intel/compiler 987- intel/elk: Remove stages not used in Gfx8- 988- intel/elk: Remove DPAS lowering 989- intel/elk: Rename files to use elk prefix 990- intel/elk: Rename header guards 991- intel/elk: Update doxygen-like file comments 992- intel/elk: Rename C++ namespace 993- intel/elk: Rename symbols 994- intel/elk: Don't include elk_eu_defines.h in elk_nir.h 995- intel/elk: Create separate header for opcodes 996- intel/blorp: Move brw specific code to a separate file 997- intel/blorp: Explicitly include brw_compiler.h header 998- intel/blorp: Add ELK support 999- intel/blorp: Remove Gfx9+ references in elk code 1000- intel/decoder: Add ELK support 1001- crocus: Use ELK compiler 1002- hasvk: Use ELK compiler 1003- iris: Rename screen->compiler to screen->brw 1004- iris: Use ELK compiler for Gfx8 1005- intel/tools: Add ELK support for aubinator 1006- intel/tools: Add ELK support for aubinator_error_decode 1007- intel/tools: Add ELK support for intel_hang_replay 1008- intel/tools: Add ELK support for aubinator_viewer 1009- intel/tools: Add ELK support for intel_hang_viewer 1010- intel: Use _brw suffix for genX headers that rely on brw 1011- intel/meson: Rename libintel_compiler to libintel_compiler_brw 1012- intel/tools: Add extra compiler device sha only for Gfx9+ 1013- intel/elk: Move nir_options to its own c/h file pair 1014- intel-clc: Use correct set of nir_options when building for Gfx8 1015- intel/elk: Use anonymous namespace in fs_combine_constants 1016- intel/elk: Remove tests for Gfx9+ 1017- intel/brw: Remove assembler tests for Gfx8- 1018- intel/brw: Remove EU compaction tests for Gfx8- 1019- intel/brw: Remove EU validation tests for Gfx8- 1020- intel/brw: Remove pass test cases for Gfx8- 1021- intel/brw: Assert Gfx9+ 1022- intel/compiler: Remove has_render_target_reads from wm_prog_data 1023- intel/brw: Remove Gfx8- passes from optimize() 1024- intel/brw: Pull opt_copy_propagation out of fs_visitor 1025- intel/brw: Pull opt_cmod_propagation out of fs_visitor 1026- intel/brw: Pull opt_saturate_propagation out of fs_visitor 1027- intel/brw: Pull dead_code_eliminate out of fs_visitor 1028- intel/brw: Pull opt_combine_constants out of fs_visitor 1029- intel/brw: Pull opt_cse out of fs_visitor 1030- intel/brw: Pull bank_conflicts out of fs_visitor 1031- intel/brw: Pull peephole_sel out of fs_visitor 1032- intel/brw: Pull redundant_halt out of fs_visitor 1033- intel/brw: Pull opt_algebraic out of fs_visitor 1034- intel/brw: Pull split/compact virtual_grf opts out of fs_visitor 1035- intel/brw: Pull opt_split_sends out of fs_visitor 1036- intel/brw: Pull opt_zero_samples out of fs_visitor 1037- intel/brw: Pull eliminate_find_live_channel out of fs_visitor 1038- intel/brw: Pull remove_extra_rounding_modes out of fs_visitor 1039- intel/brw: Pull register_coalesce out of fs_visitor 1040- intel/brw: Pull lower_constant_loads out of fs_visitor 1041- intel/brw: Pull lower_pack out of fs_visitor 1042- intel/brw: Pull lower_simd_width out of fs_visitor 1043- intel/brw: Pull lower_barycentrics out of fs_visitor 1044- intel/brw: Pull lower_logical_sends out of fs_visitor 1045- intel/brw: Pull fixup_nomask_control_flow out of fs_visitor 1046- intel/brw: Pull lower_integer_multiplication out of fs_visitor 1047- intel/brw: Pull lower_sub_sat out of fs_visitor 1048- intel/brw: Pull lower_derivatives out of fs_visitor 1049- intel/brw: Pull lower_regioning out of fs_visitor 1050- intel/brw: Pull fixup_sends_duplicate_payload out of fs_visitor 1051- intel/brw: Pull lower_uniform_pull_constant_loads out of fs_visitor 1052- intel/brw: Pull lower_find_live_channel out of fs_visitor 1053- intel/brw: Pull lower_load_payload out of fs_visitor 1054- intel/brw: Use references for a couple of backend_shader passes 1055- intel/brw: Simplify OPT macro usage in fs_visitor::optimize 1056- intel/brw: Pull fixup_3src_null_dest out of fs_visitor 1057- intel/brw: Pull emit_dummy_memory_fence_before_eot out of fs_visitor 1058- intel/brw: Pull emit_dummy_mov_instruction out of fs_visitor 1059- intel/brw: Pull lower_scoreboard out of fs_visitor 1060- intel/brw: Pull optimize() out of fs_visitor 1061- intel/brw: Move optimize and small optimizations to brw_fs_opt.cpp 1062- intel/brw: Move virtual GRF opts into their own file 1063- intel/brw: Move fs algebraic to its own file 1064- intel/brw: Move small lowering passes into brw_fs_lower.cpp 1065- intel/brw: Move lower_integer_multiplication to its own file 1066- intel/brw: Expose flag_mask/bit_mask fs helpers 1067- intel/brw: Move lower_simd_width to its own file 1068- intel/brw: Move workarounds to a separate file 1069- intel/blorp: Remove Gfx8- references in BRW code 1070- intel/brw: Move brw_compile_* functions out of vec4-specific files 1071- intel/brw: Move type_size_* functions out of vec4-specific file 1072- intel/brw: Always use scalar shaders 1073- intel/brw: Remove vec4 backend 1074- intel/brw: Remove now unused vec4-only opcodes 1075- intel/brw: Remove unused legacy shader stages 1076- intel/brw: Remove Gfx8- code from disassembler 1077- intel/brw: Remove Gfx8- code from assembler 1078- intel/brw: Remove Gfx8- code from brw_compile_* functions 1079- intel/brw: Remove Gfx8- code from scheduler 1080- intel/brw: Remove Gfx8- code from register allocator 1081- intel/brw: Remove Gfx8- code from thread payload 1082- intel/brw: Remove Gfx8- code from NIR conversion 1083- intel/brw: Remove Gfx8- code from lower storage image pass 1084- intel/brw: Remove Gfx8- code from lower logical sends 1085- intel/brw: Remove Gfx8- code from generator 1086- intel/brw: Remove Gfx8- code from backend passes 1087- intel/brw: Remove Gfx8- code from EU compaction 1088- intel/brw: Remove Gfx8- code from IR performance analysis 1089- intel/brw: Remove Gfx8- code from EU emission 1090- intel/brw: Remove Gfx8- code from EU validation 1091- intel/brw: Remove Gfx8- code from NIR passes 1092- intel/brw: Remove Gfx4-5 manual compression selection 1093- intel/brw: Remove Gfx8- code from EU codegen helpers 1094- intel/brw: Remove Gfx8- code from NIR options 1095- intel/brw: Remove Gfx8- code from register type helpers 1096- intel/brw: Remove Gfx8- specific EU inst helpers 1097- intel/brw: Remove Gfx8- code from inst FC and F macros 1098- intel/brw: Replace inst F8 macro with F macro 1099- intel/brw: Remove Gfx8- code from inst F20 macros 1100- intel/brw: Remove Gfx8- code from inst FD20 and FV20 macros 1101- intel/brw: Remove Gfx8- code from inst FI macros 1102- intel/brw: Remove Gfx8- code from inst BRW_IA*_ADDR_IMM macros 1103- intel/brw: Remove Gfx8- code from inst FFDC, FDC and FD macros 1104- intel/brw: Update comments for FK macro 1105- intel/brw: Replace inst FF macro with F or F20 macros 1106- intel/brw: Remove F16TO32 and F32TO16 opcodes 1107- intel/brw: Remove Gfx8- code from builder 1108- intel/brw: Remove Gfx8- code from fs_inst 1109- intel/brw: Remove Gfx8- code from VUE map 1110- intel/brw: Remove Gfx8- code from SIMD lowering 1111- intel/brw: Remove Gfx8- code from visitor 1112- intel/brw: Remove Gfx8- remaining opcodes 1113- intel/brw: Remove MRF type 1114- intel/brw: Inline brw_nir_apply_sampler_key code 1115- intel/brw: Remove unused attrib workarounds 1116- intel/brw: Remove edgeflag_is_last VS parameter 1117- intel/brw: Remove Gfx8- fields from \*_prog_key structs 1118- intel/brw: Remove Gfx8- fields from \*_prog_data structs 1119- intel/brw: Use a single register set 1120- intel/brw: Remove runtime_check_aads_emit 1121- intel/brw: Remove automatic_exec_sizes 1122- intel/brw: Use fs_visitor instead of backend_shader in various passes 1123- intel/brw: Fold fs_instruction_scheduler into instruction_scheduler 1124- intel/brw: Change cfg_t to refer to fs_visitor 1125- intel/brw: Move dump_* functions into fs_visitor 1126- intel/brw: Fold backend_shader into fs_visitor 1127- intel/brw: Remove extra stage_prog_data field in fs_visitor 1128- intel/brw: Remove brw_shader.h 1129- intel/meson: Add dependencies for brw and elk 1130- intel/compiler: Remove nir_print_instr hack in disasm_info 1131- intel/brw: Use C++ for brw_disasm_info.c 1132- intel/brw: Hide the definition of cfg_t et al from C code 1133- intel/brw: Use fs_inst in cfg_t 1134- intel/brw: Use fs_inst explicitly in various passes 1135- intel/brw: Use fs_inst in disasm_annotate() 1136- intel/brw: Move functions from backend_instruction into fs_inst 1137- intel/brw: Fold backend_instruction into fs_inst 1138- intel/brw: Remove typedefs from fs_builder 1139- intel/brw: Fold backend_reg into fs_reg 1140- intel/brw: Simplify usage of reg immediate helpers 1141- intel/compiler: Fix SIMD lowering when instruction needs a larger SIMD 1142- intel/elk: Remove split sends 1143- intel/elk: Remove DPAS opcode 1144- intel/elk: Remove BTD and RT opcodes 1145- intel/elk: Remove DP4A opcode 1146- intel/elk: Remove ROR and ROL opcodes 1147- intel/elk: Remove IADD3 opcode 1148- intel/elk: Remove EU compaction logic for Gfx9+ 1149- intel/elk: Remove encoding for Gfx9+ 1150- intel/elk: Remove SYNC opcode and SWSB annotations 1151- intel/elk: Remove Gfx12 SFIDs and related LSC code 1152- intel/elk: Remove Gfx9+ sampler messages and modes 1153- intel/elk: Rename symbols for A64 OWord Block R/W messages 1154- intel/elk: Remove Gfx9+ dataport messages 1155- intel/elk: Remove FB_READ opcodes 1156- intel/elk: Remove Gfx12.5 URB message 1157- intel/elk: Remove ex_desc and ex_mlen from elk_inst 1158- intel/elk: Remove Xe2 logical sends lowering 1159- intel/elk: Remove unused sources from ELK_SHADER_OPCODE_SEND 1160- intel/elk: Remove unused SEND features 1161- intel/elk: Remove validation code for Gfx9+ 1162- intel/elk: Remove Gfx9+ from nir conversion 1163- intel/elk: Remove Gfx9+ from compile/run functions 1164- intel/elk: Remove FB_WRITE_LOGICAL_SRC_SRC_STENCIL 1165- intel/elk: Remove Gfx9+ from passes 1166- intel/elk: Remove Gfx9+ from thread payload 1167- intel/elk: Remove Gfx9+ from EU emission 1168- intel/elk: Remove coarse pixel handling 1169- intel/elk: Remove Gfx9+ from FS generator 1170- intel/elk: Remove Gfx9+ from Reg related code 1171- intel/elk: Remove Gfx9+ from asm grammar 1172- intel/elk: Remove Gfx9+ from disasm 1173- intel/elk: Remove Gfx9+ from NIR auxiliary code 1174- intel/elk: Remove use_tcs_multi_patch 1175- intel/elk: Remove Gfx9+-only passes 1176- intel/elk: Remove uses of intel_device_info_is_9lp() 1177- intel/elk: Remove remaining Gfx9+ code 1178- intel/elk: Remove multi-polygon support 1179- intel/elk: Clean up unused code in elk_compiler.h 1180- intel/brw: Use hstride instead of stride for accumulator 1181- intel/brw: Use helper to create accumulator register 1182- intel/brw: Fix validation of accumulator register 1183- anv: Enable VK_KHR_shader_maximal_reconvergence 1184- intel/tools: Make intel_stub_gpu work when using meson devenv 1185- intel/brw: Implement quad_vote_any and quad_vote_all 1186- intel/brw: Use predicates for quad_vote_any and quad_vote_all when available 1187- anv: Enable VK_KHR_shader_quad_control 1188- intel/brw: Handle Xe2 in brw_fs_opt_zero_samples 1189- intel/brw: Remove vestiges of sources on IF opcode, only valid on Gfx6 1190- intel/brw: Add a src array for the common case in fs_inst 1191- intel/brw: Refactor FS validation macros 1192- intel/brw: Remove two duplicated validate calls in optimizer 1193- intel/brw: Move validate out of fs_visitor 1194- intel/brw: Support FIXED_GRF when generating code for CLUSTER_BROADCAST 1195- intel/brw: Lower VGRFs to FIXED_GRFs earlier 1196 1197Casey Bowman (1): 1198 1199- anv: Override VendorID for Hitman 3 1200 1201Charlie Turner (2): 1202 1203- amd, radeonsi: Lower minimum supported video dimensions for AV1 1204- {vulkan,radv,anv}/video: fix issue in H264 scaling lists derivation 1205 1206Chia-I Wu (7): 1207 1208- radv: fix pipeline stats mask 1209- meson: fix a build ereror 1210- radv: hide the sparse queue when radv_legacy_sparse_binding 1211- radv: hide the sparse queue on older kernels 1212- radv: set VK_SYNC_FEATURE_GPU_MULTI_WAIT 1213- aco: fix nir_op_pack_32_4x8 handling 1214- radv: fix 2d/3d image copy on compute queue 1215 1216Chris Rankin (4): 1217 1218- vdpau: Declare texture object as immutable using helper function. 1219- vdpau: Refactor query for video surface formats. 1220- meson: bump the minimal required vdpau version to 1.5 1221- frontends/vdpau: Add support for VDPAU AV1 decoding. 1222 1223Christian Duerr (1): 1224 1225- panfrost: Fix dual-source blending 1226 1227Christian Gmeiner (100): 1228 1229- .gitignore: Add .venv folder 1230- etnaviv/isa: Add missing dep of encode.py/decode.py calls on isa.py 1231- isaspec: encode.py: Include assert.h 1232- isaspec: encode.py: Include util/log.h 1233- etnaviv: Remove no_oneconst_limit from etna_inst 1234- isaspec: encode: Constify encode.type 1235- isaspec: encode: Constify bitset_params 1236- etnaviv: Remove not used etna_assemble_set_imm(..) 1237- etnaviv: Fix how we determine the max supported number of varyings 1238- etnaviv: isa: Remove duplicate #instruction-alu-atomic 1239- etnaviv: isa: Add dsx and dsy opcodes 1240- etnaviv: isa: Add frc opcode 1241- etnaviv: isa: Add norm_dp2, norm_dp3 and norm_dp4 opcodes 1242- etnaviv: isa: Add bit_extract opcode 1243- etnaviv: isa: Correct dp2 opcode 1244- etnaviv: isa: Add branch_any opcode 1245- etnaviv: isa: Name cond enum value 22 1246- etnaviv: isa: Add movai opcode 1247- etnaviv: isa: Add bit_rev opcode 1248- etnaviv: isa: Add texldb opcode 1249- etnaviv: isa: Add texldl opcode 1250- etnaviv: isa: Add texldd opcode 1251- etnaviv: isa: Remove note about GC3000 1252- etnaviv: isa: Add div opcode 1253- etnaviv: isa: Reorder instructions 1254- etnaviv: isa: Rename reg_group u2 to u 1255- etnaviv: isa: Add internal register group 1256- etnaviv: isa: Add movar opcode 1257- etnaviv: isa: Move {TEX_SWIZ} 1258- etnaviv: isa: Correct SRC0_AMODE 1259- etnaviv: isa: Correct #instruction-cf-src1-src2 bitset name 1260- etnaviv: isa: Correct #instruction-alu-no-dst-maybe-src1-src2 name 1261- etnaviv: isa: Correct #instruction-alu-no-dst-has-src0-src1 expr name 1262- etnaviv: isa: Combine branch and branch_if 1263- etnaviv: isa: Support unary branch instruction 1264- etnaviv: isa: Support unary texkill instruction 1265- etnaviv: isa: Support multiple encodings for texldl 1266- etnaviv: isa: Fix #instruction-tex-src0-src1-src2 bitset 1267- etnaviv: isa: Support multiple encodings for texldb 1268- isaspec: Remove not used isa_decode_hook 1269- isaspec: deocde: Hide all the internals ISA details 1270- isaspec: decode: Add isa specific functions 1271- isaspec: decode: Make isa_decode_bitset(..) private 1272- freedreno/isa: Rework meson dependency for libir3decode 1273- etnaviv: isa: Rework meson dependency for libetnaviv_decode 1274- isaspec: deocde: Make isa_bitset arrays static 1275- isaspec: deocde: Make isa_decode_field(..) private 1276- isaspec: decode: Add libisaspec 1277- isaspec: deocde: Remove generic functions from public interface 1278- etnaviv: isa: Define a dontcare bit in atomic instructions 1279- etnaviv: isa: Add name attributes 1280- etnaviv: isa: Generate c header containing enums 1281- etnaviv: isa: Generate opcode enum 1282- etnaviv: isa: Add an empty libetnaviv_encode 1283- etnaviv: Link against libetnaviv_encode 1284- etnaviv: Move struct etna_inst to src/etnaviv 1285- etnaviv: isa: Make use of generated enums 1286- etnaviv: isa: Add rouding to etna_inst 1287- etnaviv: Set dst.use for MOVAR 1288- etnaviv: isa: Add encode support 1289- etnaviv: isa: Add isa_assemble_instruction(..) 1290- etnaviv: Switch to isa_assemble_instruction(..) 1291- etnaviv: Move swizzle related macros to scr/etnaviv 1292- etnaviv: Switch to macros from isa.h 1293- etnaviv: Remove isa.xml.h 1294- etnaviv: Do not set tex.amode for rounding 1295- ci/etnaviv: Remove duplicates 1296- ci/etnaviv: Do not skip tex-miplevel piglits 1297- etnaviv: Remove offline shader compiler 1298- etnaviv: Introduce common etna_core_info 1299- etnaviv: drm: Make use of etna_core_info 1300- etnaviv: drm: Add etna_gpu_get_core_info(..) 1301- etnaviv: Switch to etna_core_info 1302- etnaviv: Move hw header to common place 1303- etnaviv: Introduce etna_feature enum 1304- etnaviv: common: Add feature bitset 1305- etnaviv: drm: Initialize etna_core_info based on kernel features 1306- etnaviv: Switch to etna_core APIs 1307- etnaviv: drm: Query some id values in etna_gpu_new(..) 1308- etnaviv: hwdb: Import gc_feature_database from NXP 1309- etnaviv: hwdb: Import gc_feature_database from Amlogic 1310- etnaviv: hwdb: Import gc_feature_database from ST 1311- ci: Install python3-pycparser in build container 1312- etnaviv: hwdb: Generate hwdb.h 1313- etnaviv: hwdb: Add etna_query_feature_db(..) 1314- etnaviv: drm: Make use of hwdb 1315- etnaviv: common: Add enum etna_core_type 1316- etnaviv: common: Add some limit values 1317- etnaviv: hwdb: Fill limits 1318- etnaviv: drm: Fill limits 1319- etnaviv: Copy values from etna_core_info 1320- etnaviv: drm: Remove fallback value for ETNA_GPU_NUM_CONSTANTS 1321- etnaviv: Drop not needed check if seamless cube map is supported 1322- etnaviv: hwdb: Drop stdint.h dependency 1323- nvk: Remove duplicate DRM_NODE_RENDER check 1324- meson: Add missing newline at eof 1325- etnaviv: Switch to etna_core_disable_feature(..) 1326- etnaviv: Fix disabling of features 1327- etnaviv: drm: Drop NPU-related params 1328- clc: Always use spir for 32 bit 1329 1330Collabora's Gfx CI Team (4): 1331 1332- Uprev Piglit to e9316bcd12544aaf7e753ce37fe50d64165d9598 1333- Uprev Piglit to 2a1c49a81cd9a6bf5d0c3a9b87225be94771ca96 1334- Uprev Piglit to 1e631479c0b477006dd7561c55e06269d2878d8d 1335- Uprev Piglit to dd6f7eaf82e8dd442da28b346c236141cbcce0b1 1336 1337Connor Abbott (56): 1338 1339- freedreno: Add a7xx crashdump-related registers and enums 1340- ir3/ra: Add specialized shared register RA/spilling 1341- ir3: Set branchstack earlier 1342- ir3: Rewrite (jp) and branchstack handling 1343- ir3: Calculate physical edges correctly 1344- ir3: Fix comment thinko 1345- ir3/ra: Fix bug with collect source handling 1346- tu: Add more info to ldg inline uniform path 1347- ir3/a7xx: Fix load_global_ir3 with immediate offset 1348- ir3: Initial support for pushing globals with ldg.k 1349- tu: Follow pipeline compatibility rules for dynamic descriptors 1350- tu: Reenable MSAA UBWC on a6xx gen1 1351- tu: Enable UBWC for SNORM formats on a740+ 1352- tu: Enable UBWC for storage images on a7xx 1353- vk/graphics_state: Remove bogus assert in CmdSetSampleMaskEXT 1354- vk/graphics_state: Add stubs required by VK_EXT_shader_objects 1355- freedreno/afuc: Decode (peek) modifier 1356- freedreno/afuc: Add missing ALU encode case for bic 1357- freedreno/afuc: Bump max instructions for a7xx 1358- freedreno/afuc: Fix setbit/clrbit parsing 1359- freedreno/afuc: Use left recursion in parser 1360- freedreno/afuc: Improve jump table handling 1361- freedreno/afuc: Add .align directive 1362- freedreno/afuc: Add more general T_IDENTIFIER in lexer 1363- freedreno/afuc: Add support for multiple sections when assembling 1364- freedreno/afuc: Allow -e option on a7xx 1365- freedreno/afuc: Emulate THREAD_SYNC on a660 1366- freedreno/afuc: Run entire bootstrap routine 1367- freedreno/afuc: Add a7xx test case 1368- freedreno/afuc: Add magic control reg values for a740 1369- freedreno/afuc: Add section on reassembling firmwares and relocations 1370- freedreno/a7xx: Add CP_CCHE_INVALIDATE 1371- tu: Implement CCHE invalidation 1372- nir/divergence_analysis: Add ir3-specific intrinsics 1373- nir/divergence_analysis: Add uniform_load_tears option 1374- nir/divergence_analysis: Fix load_view_index divergence in VS 1375- ir3: Allow single-predecessor phis 1376- ir3: Run divergence analysis at the end 1377- ir3: Remove loop shared copy check 1378- ir3: Use divergence analysis for (jp) and physical CFG 1379- freedreno/afuc: Switch to using the GPU ID in the firmware 1380- freedreno/afuc: Add a7xx new-style branch instructions 1381- freedreno/afuc: Add initial support for a750 1382- freedreno: Make has_ibo_ubwc a7xx specific 1383- freedreno,tu: Disable UBWC for storage images on a750 1384- ir3/legalize: Fix intra-block state propagation with loops 1385- ir3: Rewrite nop insertion 1386- docs/android: Fix example meson cross file 1387- docs/android: Improve instructions for replacing driver 1388- ir3: Don't use non-contiguous component masks for FS 1389- ir3: Don't pack FS inlocs 1390- freedreno/a7xx: Register updates from kgsl 1391- ir3: Add scan_clusters_macro to ir3_valid_flags() 1392- ir3: Add scan_clusters.macro to is_subgroup_cond_mov_macro() 1393- ir3/ra: Don't demote movmsk instructions to non-shared 1394- docs/android: Fix example meson cross file again 1395 1396Constantine Shablia (6): 1397 1398- panvk: implement vkGetBufferDeviceAddress 1399- panvk: advertise bufferDeviceAddress 1400- vulkan/runtime: fix typo 1401- mesa: fix typo 1402- pan/bi: fix 1D array tex coord lowering 1403- panfrost: report correct MAX_VARYINGS 1404 1405Corentin Noël (37): 1406 1407- zink: Avoid the use of negative array offsets 1408- zink: Use memmove when dealing with overlapping memory 1409- glsl: Make sure to not cast ir_dereference_variable into ir_variable 1410- glsl: Make sure that the variable is a ir_variable before unreferencing it 1411- zink: Initialize zink_shader_object 1412- zink: Initialize zink_bindless_descriptor to zero on creation 1413- zink: Initialize pipe_query_result 1414- zink: Do not shadow the variable ret 1415- zink: Avoid variable shadowing everywhere 1416- zink: Only call reapply_color_write if EXT_color_write_enable is available 1417- ci_run_n_monitor: Allow the upstream format to not exist 1418- zink: use symbolic values instead of 0 1419- zink: do not use undefined stage mask if on missing KHR_synchronization2 1420- glsl: Ensure that we are dealing with ir_variable and ir_rvalue 1421- venus: sync protocol for VK_EXT_attachment_feedback_loop_layout 1422- venus: enable VK_EXT_attachment_feedback_loop_layout 1423- zink: Return early if the file descriptor could not have been duplicated/acquired 1424- ci: Update virglrenderer and crosvm 1425- zink: Make wrap_surface return a zink_ctx_surface directly 1426- zink: Use an intermediary variable for create_surface 1427- zink: Separate the template from the wrapped surface 1428- zink: Return early if the source could not have been acquired 1429- zink: Move zink_surface_destroy before zink_create_surface 1430- zink: Make sure to not leak anything on surface creation failure 1431- zink: Change zink_get_surface to return a zink_surface 1432- zink: Add error logging on surface creation failure 1433- st_pbo/compute: Use the correct structure type when allocating a specialized key 1434- zink: Make sure to initialize all the fields of VkMemoryBarrier 1435- dri/kopper: Assume a non-null drawable in flush_frontbuffer 1436- zink: Removed unused function 1437- zink: Removed unused num_texel_buffers member 1438- zink: Removed unused push_valid member 1439- zink: Remove ctx from zink_gfx_program 1440- ci: Change propagated variables into an array 1441- ci: Add VK_DRIVER_FILES passthrough from jobs to tests 1442- ci: Allow to pass LIBGL_ALWAYS_SOFTWARE to the guest environment 1443- ci: Add zink-venus-lvp job 1444 1445Daniel Almeida (17): 1446 1447- nak/sm50 add support for suld 1448- nak/sm50: add support for suatom 1449- nak/sm50: add support for isberd 1450- nak: sm50: add support for OpOut 1451- nak: sm50: fadd: ensure src[0] is in a register 1452- nak/sm50: legalize: display instruction on panic 1453- nak/sm50: add support for brev 1454- nak: sm50: fix some legalization issues 1455- nak/sm50: add a memstream abstraction 1456- nak/sm50: add an annotate debug flag 1457- nak/sm50: support annotations through OpAnnotate 1458- nak/sm50: sprinkle OpAnnotate in optimization passes 1459- meson,ci: Add the paste crate 1460- nil: Add the start of a Rust library 1461- nil: Rewrite nil_format in rust 1462- nil: Re-implement nil_image in Rust 1463- nil: Rewrite the TIC code in Rust 1464 1465Daniel Schürmann (52): 1466 1467- aco/insert_exec_mask: unify exec restore code after divergent control flow 1468- aco/insert_exec_mask: replace phi for loop restore mask with explicit copies 1469- aco/insert_exec_mask: only create loop phis for exec mask if necessary 1470- aco: give spiller more room to assign spilled SGPRs to VGPRs 1471- spirv: Fix SpvOpExpectKHR 1472- vulkan: enable VK_KHR_shader_expect_assume 1473- spirv: Update headers and grammar JSON 1474- aco/insert_exec_mask: Fix unconditional demote at top-level control flow. 1475- aco/insert_exec_mask: tiny refactor 1476- aco: always terminate quads if they have been demoted entirely 1477- aco/insert_exec_mask: Reduce latency when switching to WQM. 1478- spirv: implement SPV_KHR_maximal_reconvergence 1479- aco: enable WQM if demote is used with maximal reconvergence 1480- radv: enable VK_KHR_shader_maximal_reconvergence 1481- spirv: implement SPV_KHR_quad_control 1482- radv: enable VK_KHR_shader_quad_control 1483- radv: fix initialization of radv_shader_layout->use_dynamic_descriptors 1484- aco: rematerialize constants in every basic block during optimizer 1485- aco: reorder code and use namespaces in aco_interface.cpp 1486- aco/util: small_vec few additions 1487- aco: use small_vec as Block::edge_vec for predecessors and successors 1488- aco/spill: refactor SSA repairing 1489- aco/spill: don't allocate extra spill_id for phi operands in add_coupling_code() 1490- aco/spill: add spills_entry interferences only when necessary 1491- aco/spill: refactor adding spilled vars into separate function add_to_spills() 1492- aco/spill: keep live-out variables spilled at branch blocks 1493- aco/spill: don't prefer to spill phis at merge blocks 1494- aco/spill: add interferences with variables spilled at loop headers 1495- aco/spill: avoid re-spilling loop-carried variables in process_block() 1496- aco/spill: avoid re-spilling loop-carried variables in add_coupling_code() 1497- aco/spill: keep loop-carried variables spilled at loop headers 1498- aco/spill: keep loop-carried variables spilled at merge blocks 1499- aco/spill: select more loop-carried variables to be spilled 1500- aco/spill: keep loop variables spilled during nested loops 1501- aco: use instr_class::branch to identify SOPP branches 1502- aco: remove SOPP_instruction::block member 1503- aco: unify different SALU types into single struct SALU_instruction 1504- aco/builder: use accessor functions instead of casting to subtypes 1505- aco: change return type of create_instruction() to Instruction* 1506- aco: defer instruction size from aco::Format in create_instruction() 1507- aco: remove create_instruction() template parameter 1508- aco: move create_instruction() to aco_ir.cpp 1509- aco/spill: Fix assertion for nested loops 1510- aco/spill: pass live_vars to spill_ctx 1511- aco/spill: compute live-in variables from live-out 1512- aco/spill: maintain valid live vars at any point 1513- aco/spill: use live variables instead of next_use_distances in add_coupling_code() 1514- aco/spill: gather information about average use distances 1515- aco/spill: use average use distances in process_block() 1516- aco/spill: use average use distances in init_live_in_vars() for merge blocks 1517- aco/spill: use average use distances to spill loop variables 1518- aco/ra: fix kill flags after renaming fixed Operands 1519 1520Daniel Stone (50): 1521 1522- egl: Return BAD_CONFIG when robust access unsupported 1523- st/dri: Use correct pipe_resource for GL texture image export 1524- dri: Redeclare __DRI_IMAGE_FORMAT_* as PIPE_FORMAT_* 1525- st/dri2: Remove __DRI_IMAGE_FORMAT conversion 1526- st/dri2: Pass pipe_format to driCreateConfigs 1527- st/dri2: Use u_format to get config format information 1528- util: Add util_format_get_component_shift 1529- st/dri: Remove format tables from driCreateConfigs 1530- st/dri: Completely remove mesa_format from config setup 1531- st/dri: Add transient HAS_ZS() helper 1532- st/dri: Rework depth/stencil format selection 1533- st/dri: Use pipe_format for Z/S modes 1534- st/dri: Check format properties from format helpers 1535- st/dri: Store pipe_format in gl_config 1536- egl/wayland: Remove format-query fallback 1537- st/dri: Reuse stored renderbuffer format 1538- st/dri: Reuse stored texture format 1539- dri/kopper: Move format -> FourCC translation up a level 1540- dri/kopper: Add translations for sRGB formats 1541- dri/kopper: Reorder format tables 1542- dri/kopper: Flatten pipe_format/DRIImage/FourCC conversion 1543- egl/wayland: Query image FourCC for linear copies 1544- egl/wayland: s/DRI_IMAGE_FORMAT/pipe_format/g 1545- egl/wayland: Add opaque-equivalent FourCCs 1546- egl/wayland: Fix EGL_EXT_present_opaque 1547- egl/wayland: Use pipe_format to look up configs 1548- egl/wayland: Use FourCC to look up wl_buffer support 1549- egl/wayland: Add helper to check server format support 1550- egl/wayland: Use helper to look up visual 1551- egl/wayland: Eliminate double loop for configs 1552- egl/wayland: Simplify alternate-format fallback for configs 1553- egl/wayland: Remove WL_SHM_* format listings 1554- egl/wayland: Use pipe_format for format names 1555- egl/wayland: Remove shift/size masks 1556- egl: Fail display creation if no EGLConfigs created 1557- egl/wayland: Remove check for EGLConfig presence 1558- egl/gbm: Remove check for EGLConfig presence 1559- egl/x11: Remove check for EGLConfig presence 1560- egl/android: Remove check for EGLConfig presence 1561- egl/{surfaceless,device}: Remove check for EGLConfig presence 1562- egl: Automatically set EGLConfig ID 1563- egl: Use pipe_format for pbuffer configs 1564- gbm/dri: Query DRIImage for FourCC directly 1565- gbm: Remove hardcoded color-channel data 1566- egl/android: Remove hard-coded color-channel data 1567- egl/x11: Compare config shifts/sizes locally 1568- egl: Remove shifts/sizes from dri2_add_config argument 1569- st/dri: Use pipe_format from config directly 1570- egl/dri: Use pipe_format instead of DRI_IMAGE_FORMAT 1571- egl/wayland: Remove EGL_WL_create_wayland_buffer_from_image 1572 1573Danylo Piliaiev (37): 1574 1575- freedreno/replay: Delete all buffers after each submission 1576- freedreno/replay: Correctly free iova on msm backend 1577- freedreno/replay: Add WSL backend for Windows 1578- ir3: Fix "print" meta instruction synchronization 1579- ir3: Add fullsync and fullnop ir3 dbg options for over-syncing 1580- freedreno/replay: Make meta "print" instruction take any number of regs 1581- tu: Do not print anything on systems without Adreno GPU 1582- tu/a7xx: Make A7XX_RB_UNKNOWN_8E06 value configurable per-gen 1583- tu: Define and set to zero all SP_*_VGPR_CONFIG regs 1584- ir3: Add ldg.k instruction 1585- tu/a7xx: Correctly set A7XX_HLSQ_UNKNOWN_A9AE.SYSVAL_REGS_COUNT 1586- tu/a7xx: Do not preload shaders, HW does it by default 1587- tu: Use SS6_INDIRECT consts upload path for 3d blits 1588- turnip,ir3/a750: Implement consts loading via preamble 1589- tu: Use SS6_INDIRECT for VS params 1590- turnip,ir3/a750: Implement inline uniforms via ldg.k 1591- tu/a750: Consider vertex attr buff in gmem allocation 1592- freedreno,tu: Move varying interp and varying repl modes to xml 1593- freedreno/devices: Update magic regs for a7xx 1594- tu: Exclude more a7xx regs from stomping 1595- tu: Add workaround for D3D11 games accessing UBO out of bounds 1596- tu/a7xx: Write even more magic regs to fix rendering issues on Android 1597- tu: Do not emit zero-sized fs params 1598- freedreno/a7xx: Fix base_align for non-UBWC depth-stencil 1599- tu/autotuner: Use CP_EVENT_WRITE7 for submission fence 1600- tu: Update prim restart state when we switch from/to indexed draw 1601- tu: Fix dynamic state not always being emitted 1602- meson: Correctly get sizeof_pointer with cross-compilers 1603- freedreno/devices: Do not write to 8E79 on a750, KGSL has it protected 1604- freedreno/replay: Use real queueid for submissions and waits 1605- freedreno,tu/a7xx: Add PC_TESS_PARAM_SIZE and PC_TESS_FACTOR_SIZE 1606- tu: Update RP state depending on pipeline in first RP draw 1607- tu: Emit non-draw-state state at the first draw call 1608- freedreno/devices: Add A740v3 from Quest 3 1609- util/vma: Add function to get max continuous free size 1610- freedreno/replay: Allocated maximum available size for cs overriding 1611- ir3: Do not set clip/cull mask if no one writes clip/cull 1612 1613Dario Mylonopoulos (1): 1614 1615- llvmpipe: fixed race condition in lp_rast_destroy that causes a crash on windows 1616 1617Dave Airlie (46): 1618 1619- vulkan/video: drop unused function. 1620- vulkan/video: rename some of the parameter tracking structs. 1621- vulkan/video: start to wrap the video structs for deep copies. 1622- vulkan/video: start deep copying the parameters structures 1623- vulkan/video: constify the encoding apis. 1624- radv/video: refactor sq start/end code to avoid decode hangs. 1625- radv: don't submit empty command buffers on encoder ring. 1626- gallivm: fix coroutines with llvm 18 1627- gallivm: passing fp16_split_fp64 to fp16 lowering. 1628- nvk: allow 3d compressed textures 1629- nvk: mem cannot be null in binding buffers/images. 1630- zink: use sparse residency for buffers. 1631- vulkan: update registry/includes to 1.3.277 1632- vulkan/video: add AV1 decode support to common code 1633- radv: fix correct padding on uvd 1634- radv: init decoder ip block earlier. 1635- radv/uvd: uvd kernel checks for full dpb allocation. 1636- radv: don't submit 0 length on UVD either. 1637- egl: don't bind zink under dri2/3 1638- glx/dri3: handle zink fallback if loader picks it. 1639- loader: handle picking zink for nouveau for certain GPUs. 1640- nouveau/winsys: fix bda heap leak. 1641- nvk: fix dri options leak. 1642- egl/dri2: if zink is preferred from dri3 skip dri2 paths. 1643- radv/video: fix filling out decode operations. 1644- radv/video: use vcn ip version in more places. 1645- radv: rename it_ptr to it_probs_ptr in advance of adding av1 1646- radv/video: use proper struct sizes for decoder structs. 1647- radv/video: add VK_KHR_video_decode_av1 support. 1648- nvk: free leaked cmd_buffer descriptors state. 1649- nvk: only unmap heap bos that were mapped 1650- nvk: enable a mappable bar heap when rebar is disabled. 1651- radv/video: fix h265 decode with unaligned w/h 1652- mesa: reorder st context teardown 1653- vulkan/video: copy the profile over for h264 encode. 1654- radv/video: export unified queue header/tail functions. 1655- radv: add direct cs emit for a dword. 1656- radv: add encoder queue support pieces and encoder queries. 1657- radv/video: add parameter patching calls. 1658- radv/video: add initial support for encoding with h264. 1659- radv/video: add h265 encode support 1660- radv/video: enable video encoding behind perftest flag 1661- radv/video: handle encode control parameters better. 1662- radv/video: don't advertise timestamp bits for decode/encode 1663- egl/dri2: don't bind dri2 for zink 1664- radv/video/encode: fix quality params on v2 hw. 1665 1666David (Ming Qiang) Wu (1): 1667 1668- frontends/va: make vlVaSyncSurface blocking 1669 1670David Heidelberg (57): 1671 1672- ci/deqp: uprev deqp-runner for Linux too to 0.18.0 1673- ci/lima: update expectations, failing tests are being skipped 1674- ci: bump kernel to 6.6.12, modularize i915, add Transparent Huge Pages 1675- ci: shorter kernel tag, included Vivante NPU patches 1676- ci: disable Valve farm in Keywords 1677- ci: bump libdrm to 2.4.120 1678- ci/VK-GL-CTS: add patches to fix dEQP-VK.glsl.derivate crashes 1679- ci: Valve farm (Keywords location) works again 1680- meson: upgrade zlib wrap to 1.3.1 1681- util: use crc32_z instead of crc32 and bump zlib dep to 1.2.9 1682- ci: bump kernel to 6.6.16 + enable X2APIC 1683- ci/freedreno: add fail found by new Piglit 1684- ci/etnaviv: update expectations 1685- ci: temporarily disable Collabora farm 1686- ci: enable Collabora farm 1687- ci: re-enable Collabora farm after maintenance 1688- ci/intel: decompose anv-tgl-test so we can specify custom devices for TGL 1689- ci/intel: add acer-cp514-2h-11{30,60}g7-volteer 1690- ci/intel: move machine definition to the intel-tgl-skqp job 1691- ci/intel: split asus-cx9400-volteer into acer-cp514-2h-11{30,60}g7-volteer 1692- drm-shim: Avoid invalid file and time bits combination 1693- intel/tools: avoid invalid time and file bits combination 1694- ci/deqp: backport Implement support for the EGL_EXT_config_select_group extension GL-CTS patch 1695- ci/freedreno: update expectations comment 1696- ci/deqp: add EGL patch for correct suite (GLES, not GL) 1697- nine: convert licenses block to SPDX 1698- nine: fill missing licenses headers and copyrights 1699- nine: drop useless and a bit too long line 1700- ci: uprev kernel to 6.6.21 1701- ci/freedreno: disable workarounds for Adreno 618, 630, and 660 1702- ci/freedreno: mark fails resolved by "drm/msm/gem: Add metadata uapi" 1703- ci: reduce irrelevant output to a simple list of libraries 1704- util: move gen_zipped_file into generic util and rename to gen_zipped_xml_file 1705- ci/r300: implement rules for d3d9 testing 1706- ci/svga: add missed test and gl-rules include 1707- r300: convert to SPDX license block and fix small typos 1708- r300: add missing licence to the r300_public.h 1709- r300: add missing copyright header 1710- docs: we support EGL 1.5 for a long time 1711- ci/amd: meld radv-traces into radv-raven-traces 1712- ci/amd: drop old PIGLIT_REPLAY_DESCRIPTION_FILE surpassed by PIGLIT_TRACES_FILE 1713- frontend/nine: fix typos 1714- r600: update licensing to SPDX header 1715- r600: add license header to r600_formats.h 1716- r600: add license info to the r600_opcodes.h 1717- r600: add license information to the sfn_shader_gs.h 1718- r600: fix typos 1719- ci: disable sona devices, all devices are offline 1720- ci/intel: sona device_type is back online 1721- ci: temporarily disable Android test builds 1722- ci: disable Igalia farm 1723- meson: implement split-debug 1724- freedreno/ci: move the disabled jobs from include to the main file 1725- ci/deqp: correct EGL_EXT_config_select_group detection 1726- egl/x11: Move RGBA visuals in the second config selection group 1727- winsys/i915: depends on intel_wa.h 1728- subprojects: uprev perfetto to v45.0 1729 1730David Rosca (31): 1731 1732- radeonsi/vcn: Fix H264 slice header when encoding I frames 1733- frontends/va: Fix updating AV1 rate control parameters 1734- radeonsi/vcn: Don't reinitialize encode session on bitrate/fps change 1735- frontends/va: Only set VP9 segmentation fields when segmentation is enabled 1736- frontends/va: Separate QP for I/P/B frames 1737- radeonsi/vcn: Use temporal_layer_index to select temporal layer 1738- radeonsi/vcn: Implement separate QP for I/P/B frames 1739- radv/video: Set maxActiveReferencePictures to 16 for H264/5 1740- frontends/vdpau: Fix cdef strengths and lr_unit_shift in AV1 decode 1741- frontends/vdpau: Support creating VDP_CHROMA_TYPE_420_16 surfaces 1742- radv/video: Fix setting slice QP 1743- radv/video: Set correct bitstream buffer size 1744- radv/video: Set VBV buffer size and level 1745- radv/video: Select temporal layer when encoding each frame 1746- radv/video: Set maxSublayerCount to 4 for H265 1747- radv/video: Avoid resetting rate control every frame 1748- radv/video: Implement per picture type min/max QP 1749- radv/video: Set correct bit depth and format for 10bit input 1750- radv/video: Check encode profiles and bit depth in capabilities query 1751- radv/video: Report maxBitrate in encode capabilities 1752- radeonsi/vcn: Allocate session buffer in VRAM 1753- radeonsi/vcn: Fix 10bit HEVC VPS general_profile_compatibility_flags 1754- radeonsi/vcn: Only enable VBAQ with rate control mode 1755- frontends/va: Fix AV1 slice_data_offset with multiple slice data buffers 1756- Revert "radeonsi/vcn: AV1 skip the redundant bs resize" 1757- frontends/va: Only increment slice offset after first slice parameters 1758- radeonsi: Update buffer for other planes in si_alloc_resource 1759- frontends/va: Store slice types for H264 decode 1760- radeonsi/vcn: Ensure DPB has as many buffers as references 1761- radeonsi/vcn: Allow duplicate buffers in DPB 1762- radeonsi/vcn: Ensure at least one reference for H264 P/B frames 1763 1764David Stern (1): 1765 1766- vulkan/wsi/x11: Explicitly discard errors from xcb_present_pixmap. 1767 1768David Tobolik (1): 1769 1770- rusticl: implement cl_khr_suggested_local_work_size 1771 1772Derek Foreman (10): 1773 1774- egl/wayland: Fix possible buffer leak 1775- loader/wayland: Add named queue fallback 1776- egl/wayland: Give names to our Wayland event queues 1777- vulkan/wsi/wayland: Give names to our Wayland event queues 1778- vulkan/wsi/wayland: Remove confusing comment 1779- vulkan/wsi/wayland: Adjust presentation id locking 1780- vulkan/wsi/wayland: Use wl_display_dispatch_queue_timeout 1781- vulkan/wsi/wayland: More descriptive name for swapchain queue 1782- vulkan/wsi/wayland: Fix use after free 1783- vulkan/wsi/wayland: Remove unused get_min_image_count_for_mode_group 1784 1785Dmitry Baryshkov (11): 1786 1787- freedreno/drm: don't crash for unsupported devices 1788- freedreno/regs: define the wide bus enable bit in DSI_VID_CFG0 1789- freedreno/registers: fix generation dependencies 1790- freedreno/registers: add missing copyright imports 1791- freedreno/registers: inline mdp4_csc group 1792- freedreno/registers: fix WB doffsets array in mdp5.xml 1793- freedreno/registers: support processing display display headers 1794- freedreno/registers: limit the rules schema 1795- freedreno/registers: drop unsupported features from schema 1796- freedreno/rnn: drop headergen2 1797- freedreno/rnn: drop custom aprintf function 1798 1799Dmitry Osipenko (2): 1800 1801- virtio/vdrm: Fix lockup in vdrm_host_sync() 1802- iris: Use Mesa internal drm-uapi headers 1803 1804Dylan Baker (8): 1805 1806- intel/vulkan: assume() that we don't use "ISL_NUM_FORMATS" 1807- intel/hasvk: assume() we don't get ISL_NUM_FORMATS 1808- meson: drop intel-cl deprecation of 'false' 1809- meson: rework intel-rt option to be a feature 1810- meson: Allow building intel-clc for the host if it can be run 1811- intel/brw: track last successful pass and leave the loop early 1812- nvk: drop meson version check that is always true 1813- nouveau: require cbindgen >= 0.25 1814 1815Echo J (9): 1816 1817- nvk: Set ICD version to 1.3 1818- nvk: Implement the VR-related display extensions 1819- nak: Rip out a few dead_code statements 1820- nvk: Add NVK to the Vulkan device name 1821- nvk: Advertise VK_VALVE_mutable_descriptor_type 1822- nvk: Implement calibrated timestamps 1823- vulkan: Add implicit pipeline caching support 1824- nvk: Use implicit pipeline cache 1825- nvk: Don't advertise residencyAlignedMipSize on MaxwellB+ 1826 1827Emma Anholt (2): 1828 1829- ci: Add full-run xfails missed in the 1.3.7.0 CTS update. 1830- ci: Disable VK full runs that time out since 1.3.7.0 (hasvk, anv-tgl, a630) 1831 1832Emmanuel Vadot (1): 1833 1834- util: Allow kcmp on FreeBSD 1835 1836Eric Engestrom (282): 1837 1838- VERSION: bump to 24.1 1839- docs: reset new_features.txt 1840- docs: update calendar for 24.0.0-rc1 1841- ci: make sure we evaluate the python-test rules first 1842- docs: fix syntax highlighting on non-code text snippet 1843- docs: fix syntax highlighting on shell commands 1844- ci/deqp: ensure that in \`default` builds, wayland + x11 + xcb are all built 1845- zink+anv/ci: add known failures 1846- ci: fix job dependency error in MRs for bin/ci/* scripts 1847- nouveau/ci: don't run nouveau (gl) tests on nvk changes 1848- amd/ci: simplify deqp config 1849- amd/ci: add flakes seen today 1850- docs: update calendar for 24.0.0-rc2 1851- zink+radv/ci: drop duplicates flakes lines 1852- CODEOWNERS: add myself as as person of contact for CI changes 1853- CODEOWNERS: remove myself as a person of contact for a few things 1854- radv/ci: sort navi21 flakes 1855- amd/ci: add flakes seen today 1856- amd/ci: consider much more of dEQP-VK.query_pool.statistics_query.host_query_reset.* to be flaky 1857- r300/ci: add flakes 1858- ci/deqp: backport fix for zlib.net not allowing tarball download anymore 1859- rpi3/ci: update piglit & deqp expectations 1860- rpi4/ci: skip more of the dEQP-VK.ssbo.phys.layout.* tests that timeout occasionally 1861- rpi3/ci: add flake seen today 1862- rpi4/ci: add timeouts seen today 1863- rpi5/ci: add flake seen today 1864- docs: add release notes for 23.3.4 1865- docs: update calendar for 23.3.4 1866- docs: add sha256sum for 23.3.4 1867- docs: update calendar for 24.0.0-rc3 1868- ci_run_n_monitor: drop always-true condition 1869- ci_run_n_monitor: allow passing multiple targets 1870- ci/deqp: fix default target check when target is not specified 1871- ci/deqp: simplify version log dump 1872- ci/deqp: avoid storing the huge list of vk tests on android builds 1873- ci/deqp: move editable part to the top of the file 1874- ci/deqp: split vk and gl builds 1875- ci/deqp: drop the implicit DEQP_TARGET; explicitly set \`default` in VK builds 1876- ci/deqp: only compile EGL tests in GL builds, not VK builds 1877- ci/deqp: only compile the test binaries that are relevant to the build 1878- ci/deqp: only keep the mustpass lists that are relevant to the build 1879- ci: bump the image tags to rebuild all the deqp variants 1880- Revert "bin/ci: Add GitLab basic token validation" 1881- Reapply "bin/ci: Add GitLab basic token validation" 1882- util: rename __check_suid() to __normal_user() 1883- tree-wide: use __normal_user() everywhere instead of writing the check manually 1884- zink+anv/ci: add a couple more flakes 1885- util: simplify logic in __normal_user() 1886- util: check for setgid() as well in __normal_user() 1887- ci: always skip dEQP-VK.info.device_extensions 1888- vk/util: fix 'beta' check for physical device features 1889- vk/util: fix 'beta' check for physical device properties 1890- ci: when specifying a driver remove all other ones 1891- docs: update calendar for 24.0.0 1892- docs: add release notes for 24.0.0 1893- docs: add sha256sum for 24.0.0 1894- docs/release-calendar: add planned 24.0.x bugfix releases 1895- docs: add release notes for 23.3.5 1896- docs: update calendar for 23.3.5 1897- docs: add sha256sum for 23.3.5 1898- v3d-rpi4-gl: reduce the parallelism from 10 to 8 1899- docs/calendar: add 24.1 branchpoint and release schedule 1900- ci: drop dash in image tags dates 1901- ci: enforce maximum image tag length 1902- ci: reduce maximum image tags length from 30 to 20 1903- ci: explain purpose of the word after the date in image tags 1904- panfrost: fix UB caused by shifting signed int too far 1905- ci_run_n_monitor: avoid spamming a ton of "new status: created" for all the jobs at the beginning 1906- ci: build panvk in debian-vulkan job 1907- nouveau/tests: fix null dereference 1908- ci: build nvk in debian-vulkan job 1909- v3dv/ci: test the WSI on rpi4 and rpi5 1910- radv: enable VK_EXT_headless_surface on all platforms except Windows 1911- v3dv: enable VK_EXT_headless_surface on all platforms except Windows 1912- tu: enable VK_EXT_headless_surface on all platforms except Windows 1913- anv: enable VK_EXT_headless_surface on all platforms except Windows 1914- hasvk: enable VK_EXT_headless_surface on all platforms except Windows 1915- dzn: enable VK_EXT_headless_surface on all platforms except Windows 1916- nvk: enable VK_EXT_headless_surface on all platforms except Windows 1917- panvk: enable VK_EXT_headless_surface on all platforms except Windows 1918- vn: enable VK_EXT_headless_surface on all platforms except Windows 1919- lvp: enable VK_EXT_headless_surface on all platforms except Windows 1920- pvr: enable VK_EXT_headless_surface on all platforms except Windows 1921- ci_run_n_monitor: warn user if they forgot to push the branch 1922- ci_run_n_monitor: add some types for gitlab objects 1923- ci_run_n_monitor: update job when it goes through enable_job() 1924- ci_run_n_monitor: add method to get a pipeline job by its id 1925- ci_run_n_monitor: track new job when retrying a job 1926- ci_run_n_monitor: refresh job state when starting it 1927- gitlab_gql: print error returned by server in --print-merged-yaml 1928- ci_run_n_monitor: implicitly include \`parallel:` jobs 1929- ci_run_n_monitor: print the target regex before adding the X/N bit 1930- docs: add release notes for 24.0.1 1931- docs: add sha256sum for 24.0.1 1932- docs: add release notes for 23.3.6 1933- docs: update calendar for 23.3.6 1934- docs: add sha256sum for 23.3.6 1935- docs: update calendar for 24.0.1 1936- ci_run_n_monitor: explain why/when there might be no tracked remote 1937- ci_run_n_monitor: allow detached heads as well 1938- docs: add release notes for 24.0.2 1939- docs: add sha256sum for 24.0.2 1940- docs: update calendar for 24.0.2 1941- ci_run_n_monitor: fix handling of optional jobs again 1942- ci_run_n_monitor: read job logs as utf-8 1943- vk/util: trivial cleanups in vk_icd_gen.py 1944- vk/util: print a nice error in vk_icd_gen.py when VK_HEADER_VERSION is not defined 1945- ci/android: use a specific version of android-cuttlefish 1946- ci: document which image tags to bump when touching build-mold.sh 1947- ci: uprev mold to the latest release 1948- ci/image-tags: move KERNEL_ROOTFS_TAG to group the test images together 1949- ci/deqp: only apply the android patches to the android build 1950- ci/deqp: build deqp-egl using mold as well 1951- ci/deqp: make deql-egl for android less of a special case 1952- ci/deqp: control the GL release independently of VK 1953- ci/deqp: control the GLES release independently of GL 1954- r300/ci: group tex-miplevel-selection flakes together 1955- r300/ci: add another tex-miplevel-selection flake 1956- iris/ci: add pbuffer flakes for amly, same as apl and glk 1957- panfrost/ci: skip dEQP-GLES31.functional.copy_image.non_compressed.* on t760 as they hang 1958- rpi3/ci: update expectations for vc4-rpi3-gl-piglit-full:arm32 2/4 1959- freedreno/ci: add another a618 flake 1960- zink+anv: update expectations 1961- r300/ci: add flakes 1962- radeonsi/ci: add vangogh piglit flake 1963- zink+radv: update navi31 expectations (one test fixed) 1964- softpipe: update expectations 1965- ci/deqp: drop zlib url patch 1966- ci/deqp: split vk/gl/gles patches 1967- ci/deqp-runner: inline never-used DEQP_VARIANT variable 1968- ci/deqp: use the proper gl/gles releases for deqp-gl*, deqp-gles*, deqp-egl 1969- ci/venus-lavapipe: drop unused DEQP_VER that's being overwritten by DEQP_SUITE anyway 1970- ci/lavapipe: fold \`DEQP_VER: vk` and drop .deqp-test-vk 1971- docs: delay 24.1 branchpoint by 2 weeks 1972- vk/update-aliases: drop VK_ERROR\_ prefix substitution 1973- ci/deqp-runner: do a release build instead of debug 1974- ci/deqp-runner: set android rust target in the caller (debian/x86_64_test-android.sh) 1975- ci/deqp-runner: bring "install from crate" & "install from git" to feature parity 1976- ci/deqp-runner: update repo url 1977- ci/deqp-runner: fix list of image tags to update 1978- ci/image-tags: re-generate all the images building deqp-runner 1979- docs: add release notes for 24.0.3 1980- docs: add sha256sum for 24.0.3 1981- ci/deqp: document which build produces which binary 1982- ci: include all the src/\**/gitlab-ci.yml files 1983- nouveau: add missing vl lib 1984- nouveau/ci: fix yaml indentation 1985- nouveau/ci: only trigger jobs for relevant changes 1986- Revert "nouveau: add missing vl lib" 1987- ci/deqp: backport fix for dEQP-VK.wsi.direct_drm.* bug 1988- vc4/ci: add flake 1989- radeonsi/ci: udpate expected failures 1990- r300: mark new fails 1991- v3dv/ci: update expectations 1992- v3d/ci: mark spec\@ext_framebuffer_blit\@fbo-blit-check-limits as fixed 1993- vc4/ci: add another \`spec\@!opengl 1.1\@depthstencil-default_fb-drawpixels` flake 1994- vc4/ci: add another \`spec\@arb_vertex_buffer_object\@vbo-subdata-many draw` flake 1995- v3dv/ci: mark the \`dEQP-VK.wsi.*.maintenance1.deferred_alloc.*` flakes seen so far as happening on all platforms 1996- v3dv/ci: add other flakes seen during nightly run 1997- ci: fix shader-db job existence condition 1998- v3dv/ci: assume dEQP-VK.wsi.wayland.swapchain.simulate_oom.* have been fixed 1999- v3dv/ci: add more flakes 2000- v3dv/ci: assume list of dEQP-VK.wsi.*.maintenance1.present_modes.* flakes is the same between xcb & xlib and between rpi4 & rpi5 2001- ci: enable MESA_VK_ABORT_ON_DEVICE_LOSS globally 2002- ci/deqp-runner: split gl & gles groups to use the correct binary 2003- ci/deqp-runner: print deqp-gles version log as well 2004- ci: deduplicate converting the current job runtime into %M:%S 2005- ci: convert the job start date into a timestamp only once 2006- ci: simplify unnecessarily complex printf 2007- radv/ci: sort tahiti flakes 2008- radv/ci: add a bunch of flakes seen recently 2009- v3dv/ci: track regression 2010- rpi/ci: add flakes 2011- radv/ci: add more flakes 2012- v3dv/ci: add more flakes 2013- docs: update calendar for 24.0.3 2014- docs: update calendar for 24.0.4 2015- docs: add release notes for 24.0.4 2016- docs: add sha256sum for 24.0.4 2017- v3dv/ci: another batch of flakes 2018- radv/ci: another batch of flakes 2019- radv/ci: another batch of flakes 2020- radv/ci: dEQP-VK.spirv_assembly.type.vec4.i8.mod_geom Fail -> Crash on tahiti 2021- ci: don't run rustfmt on every core change 2022- ci_run_n_monitor: explain how to pass multiple targets without having to use regexes 2023- rpi/ci: another batch of flakes 2024- docs: mesa also implements gles 3.0+ 2025- docs/egl: various wording improvements 2026- ci: take kws farm offline 2027- ci: restore kws farm 2028- radv/ci: simplify tahiti flakes list 2029- ci: fold .test-check into its only user, python-test 2030- ci: run python-test when editing the CI itself 2031- ci: run python-test automatically only in merge pipelines 2032- docs/macos: drop reference to former github mirror 2033- docs/nir: vec4 reference 2034- docs/envvars: fix reference 2035- docs/isl: fix references to ISL_AUX_USAGE_CCS_* 2036- docs/isl: stop trying to link to classic drivers code 2037- docs/isl: VK_FORMAT_xxx_PACKEDn is not a real format, don't try to link to it 2038- docs/isl: fix enum references 2039- docs: fix inline c identifier reference -> inline code 2040- isl: fix inline c identifier reference -> inline code 2041- nir: add missing stdint include 2042- docs/anv: fix envvar documentation 2043- docs/nvk: fix envvar documentation 2044- ci: mark vmware farm as offline 2045- ci: add missing rule to disable vmware farm 2046- ci: raise the log level threshold of spirv logs 2047- docs/envvars: document some vulkan loader env vars 2048- docs: replace references to the deprecated VK_ICD_FILENAMES with the new VK_DRIVER_FILES 2049- docs: replace references to the deprecated VK_INSTANCE_LAYERS with the new VK_LOADER_LAYERS_ENABLE 2050- docs/zink: format the envvar value as code instead of plain text 2051- meson: add VK_DRIVER_FILES to devenv, alongside the old VK_ICD_FILENAMES 2052- ci: drop unused VK_ICD_FILENAMES passthrough from jobs to tests 2053- ci: use the new VK_DRIVER_FILES env var 2054- ci/deqp: backport fix for dEQP-VK.pipeline.*.render_to_image.*.huge.* 2055- ci: fix nightly build 2056- ci: fix nightly build (v2) 2057- ci/llvmpipe: make sure manual jobs don't auto-retry 2058- ci/llvmpipe: fix out of date fails list 2059- ci/lavapipe: fix out of date fails list 2060- ci/lavapipe: skip test that sometimes times out 2061- ci: add nightly full run of llvmpipe 2062- ci: add nightly full run of lavapipe 2063- gallium/dri: reuse existing meson variables 2064- meson: regroup glvnd lines to get an easier-to-review diff in the next commit 2065- meson: turn \`glvnd` option into a feature 2066- ci: explicitly disable glvnd to avoid regression when making it auto 2067- meson: auto-enable glvnd when libglvnd is installed 2068- mr-label-maker: include \*/gitlab-ci-inc.yml in GitLab CI changes 2069- mr-label-maker: be explicit about the various CI files 2070- docs: add release notes for 24.0.5 2071- docs: update calendar for 24.0.5 2072- docs: add sha256sum for 24.0.5 2073- ci: delete mistaken duplicate llvmpipe-{fails,skips}.txt 2074- etnaviv: avoid re-defining prog_python 2075- egl: drop dead dri2_dpy param in dri2_wl_visual_idx_from_config() 2076- lavapipe: add 1 new failure and 1 new timeout since CTS uprev to 1.3.8.0 2077- vk/overlay-layer: drop unused imports 2078- vk/overlay-layer: fix None checks 2079- vk/overlay-layer: simplify print and make it more readable 2080- docs/rusticl: add an intro explaining what Rusticl is 2081- wsi/x11: drop unused param in x11_present_to_x11_sw() 2082- radv: initialize a couple of variables 2083- util: simplify loop logic in util_format_get_first_non_void_channel() 2084- util/futex: replace double-cast check with a simple sign check 2085- docs/ci: explain how gitlab considers "changes" when pushing on a fork branch 2086- rpi5/ci: sort flakes 2087- rpi5/ci: add flakes from last night's run 2088- rpi4/ci: sort flakes 2089- rpi4/ci: add new flakes from last night's run 2090- radeonsi/ci: update vangogh expectations after piglit uprev 2091- llvmpipe/ci: update expectations after piglit uprev 2092- VERSION: bump for 24.1.0-rc1 2093- .pick_status.json: Update to 4660ee1deaace6457bf5fbf3fc8810e4a2453cb5 2094- ci: fix container rules on release branches and tags 2095- .pick_status.json: Update to 84632dce93f44e8d88cda47648cfd4cc0958918f 2096- .pick_status.json: Update to 8248cc0bf45d0d7558cc3d77a63dcd078a96aa66 2097- ci: pass MESA_VK_ABORT_ON_DEVICE_LOSS through to the DUT 2098- .pick_status.json: Update to 86281ef15fca378ef48bcb072a762168e537820d 2099- .pick_status.json: Update to 47f6e24ad5dfcb59dd1511800aee8c56b4f8fee4 2100- meson: simplify \`-gsplit-dwarf` compiler argument check 2101- meson: move tsan-blacklist.txt to build-support with the other build support files 2102- VERSION: bump for 24.1.0-rc2 2103- .pick_status.json: Update to 603982ea802b3846e91a943b413a7baf430e875d 2104- .pick_status.json: Update to 569c2fcf952a3ec13ddf77c0058e769bf68f3aaf 2105- .pick_status.json: Update to 9666756f603f0285d8a93ef93db1c7ec702b671f 2106- .pick_status.json: Update to b8e79d2769b4a4aed7e2103cf0405acc5bdadb86 2107- VERSION: bump for 24.1.0-rc3 2108- .pick_status.json: Update to 18c53157318d6c8e572062f6bb768dfb621a55fd 2109- .pick_status.json: Update to 406dda70e7c9baa59c975eb64025e7c3b210c3bc 2110- .pick_status.json: Update to 5502ecd7716045e76f13f007a4aa5f5653c80ecd 2111- util/format: add missing null check in util_format_is_srgb() 2112- .pick_status.json: Update to d516721cd0cb16d0b601c42c01de0fdcc4ae887b 2113- .pick_status.json: Update to aa9244c8f6bfa3fb33cf233104b00fc44fc9459f 2114- .pick_status.json: Mark a45f1990860db3a8da6d7251bb627a314dfb8423 as denominated 2115- VERSION: bump for 24.1.0-rc4 2116- .pick_status.json: Update to b2282e3a571f18b48b8b717ec32da1d0ed93f1b5 2117- .pick_status.json: Update to 471ac97a4af751226bc51076130deae252bb481e 2118- .pick_status.json: Update to 2487a875527f636565a7b39036690fbf7c5d46db 2119- .pick_status.json: Update to 3584fc64828ad2ad4d486572ec915aab8321aadd 2120 2121Eric R. Smith (13): 2122 2123- panfrost: fix panfrost drm-shim 2124- panfrost: add lowering pass for multisampled images 2125- panfrost: support multi-sampled image load/store 2126- panfrost: protect alpha calculation from accessing non-existent component 2127- panfrost: make drm-shim work again for panfrost 2128- panfrost: make sure blends always have 4 components 2129- panfrost: mark indirect compute buffer as read 2130- gallium: handle copy_image of depth textures 2131- panfrost: fix polygon offset calculation for floating point Z 2132- panfrost: fix a GPU/CPU synchronization problem 2133- panfrost: mark separate_stencil as valid when surface is valid 2134- panfrost: fix an incorrect stencil clear optimization 2135- panfrost: add a barrier when launching xfb jobs in CSF 2136 2137Erico Nunes (2): 2138 2139- Revert "ci: lima farm is down" 2140- ci: enable shader-db on lima 2141 2142Erik Faye-Lund (32): 2143 2144- panfrost: add support for forcing sample-counts 2145- panfrost: pass reduced primitive type instead of points 2146- panfrost: add line_smooth shader-key and lowering 2147- panfrost: clean up active_prim update 2148- panfrost: implement line-smoothing 2149- mesa/main: add support for EXT_texture_storage 2150- mesa: fix error-handling for ETC2/RGTC textures 2151- glapi: move EXT_texture_storage to the right position 2152- targets/va: override LIBVA_DRIVERS_PATH in devenv 2153- mesa/main: fix _mesa_base_tex_format for BGRA 2154- mesa/main: mark GL_BGRA as color-renderable 2155- mesa/main: mark GL_BGRA8_EXT as color-renderable 2156- mesa/main: work around chrome/firefox bug 2157- mesa/main: allow GL_BGRA for FBOs 2158- panvk: do not handle illegal null 2159- glsl: Make error_value a real ir_rvalue type 2160- panfrost: give afbc-packing its own flag 2161- panfrost: add driconf infrastructure 2162- panfrost: add pan_force_afbc_packing driconf 2163- mesa: prefer read-format of RG for snorm 2164- gallium: remove always-false parameter 2165- panvk: use integers instead of strings 2166- panfrost: silence compiler warning 2167- panfrost: add tiler-heap driconfs 2168- panvk: wire up version-overriding 2169- panfrost: implement a driver-specific max-miplevel 2170- panfrost: use perf_debug_ctx instead of perf_debug 2171- panfrost: perf_debug_ctx -> perf_debug 2172- panfrost: use util_debug_message for perf_debug 2173- panfrost: do not deref potentially null pointer 2174- panfrost: correct first-tracking for signature 2175- panvk: avoid dereferencing a null-pointer 2176 2177Erik Kurzinger (2): 2178 2179- wsi/wayland: don't use explicit sync with sw 2180- wsi/x11: support explicit sync 2181 2182Faith Ekstrand (284): 2183 2184- nvk: Add an explicit mapping from shader stages to cbuf bindings 2185- nvk: Return an nvk_cbuf_map from nvk_lower_nir() 2186- nvk: Use s instead of set_idx in CmdBindDescriptorSets 2187- nvk: Rework descriptor set binding 2188- nvk: Make dynamic cbuf indices relative to the descriptor set 2189- nvk: Handle missing descriptor sets in nvk_nir_lower_descriptors 2190- nvk: Invalidate state after secondary command buffers 2191- nvk: Set a minimum of one patch control point 2192- nak: Disallow gl_FragData and set MRT correctly 2193- nak: Add explicit padding to nak_shader_info 2194- nvk: Emit SET_ANTI_ALIAS at draw time when no render targets are bound 2195- nvk: Move SET_HYBRID_ANTI_ALIAS_CONTROL to draw time 2196- nvk: Advertise variableMultisampleRate and EDS3RasterizationSamples 2197- nvk: Add a couple more features to features.txt 2198- nak: Stop passing --explicit-padding to bindgen 2199- nak: Implement nir_op_pack_half_2x16_rtz_split 2200- nak: Implement nir_op_ufind_msb_rev 2201- nak: Rename OpBrev to OpBRev 2202- nak: Implement nir_op_bfm 2203- nouveau/mme/fermi: Stop truncating iadd immediates 2204- nouveau/mme: Stop using isaspec 2205- nvk: Set framebufferIntegerColorSampleCounts 2206- nvk: Unref shaders on pipeline free 2207- nvk: Add a #define for max shared memory size 2208- nvk: Properly configure the min/max shared mem size 2209- nvk: Implement VK_KHR_zero_initialize_workgroup_memory 2210- nir,spirv: Add support for SPV_NV_shader_sm_builtins 2211- nak: Add support for SPV_NV_shader_sm_builtins 2212- nvk: Advertise VK_NV_shader_sm_builtins 2213- nvk/draw: Map cbuf slots to shaders, not cbuf_maps 2214- nak: Refactor shader upload math 2215- nvk: Wire up nir_opt_large_constants 2216- nak: Enable NIR fuse_ffmaN 2217- nak: Legalize OpBMsk 2218- nvk: Don't exnore ExternalImageFormatInfo 2219- nvk: Set maxInlineUniformTotalSize 2220- nak: Fix TCS output reads 2221- anv: Add helpers for getting the surface state from an image view 2222- anv: Advertise VK_EXT_attachment_feedback_loop_layout 2223- nak: Choose S2R vs CS2R based on sysval index 2224- nak: Add a source barrier intrinsic 2225- nak: Loop to ensure we get accurate shader clocks 2226- nvk: Stop requiring dedicated allocations 2227- nvk: Advertise Vulkan 1.3 2228- nvk: Do a second submit to check for errors in the sync case 2229- nvk: Whitespace fixes 2230- nvk: Disable all cbufs in nvk_queue_init_context_draw_state() 2231- nvk: Call lower_compute_system_values after zer_initialize_workgroup_memory 2232- nak/nir: Stop lowering load_local_invocation_index 2233- nil: Set the level offset to 0 in nil_image_for_level 2234- nvk: Fix whitespace in nvk_image.c 2235- nouveau/winsys: Re-order channel creation 2236- nouveau/winsys: Allow only allocating a subset of engines 2237- nvk/queue: Pull DRM specfc stuff into nvk_queue_drm.c 2238- nvk/queue: Refactor the push builder a bit 2239- nvk: Move the nouveau_ws_context to nvk_queue 2240- nvk: Add an array of queue families to nvk_physical_device 2241- nvk/queue: Rework context state init 2242- nvk/queue: Only initialize the necessary engines 2243- nvk: Use VM_BIND for contiguous heaps instead of copying 2244- nvk: Only map heaps that explicitly request maps 2245- nvk: Add an upload queue 2246- nvk: Add an upload queue to nvk_device 2247- nvk: Use the upload queue for shader uploads 2248- nvk: Don't set CONSTANT_BUFFER_SELECTOR with a zero size 2249- nvk/heap: Use nvk_heap_bo::addr instead of bo->offset 2250- nvk/heap: Rework over-allocation 2251- nvk: Convert shader addresses to offsets in nvk_shader.c 2252- vulkan: Update XML and headers to 1.3.278 2253- nvk: Use nouveau_ws_bo_new_mapped() for descriptors 2254- nouveau/winsys: Add a fixed_addr to nouveau_ws_bo_map 2255- nvk: Implement VK_EXT_map_memory_placed 2256- nvk: Invalidate the texture cache before MSAA resolves 2257- nvk: Don't use WAIT_AVAILABLE in nvk_upload_queue_sync 2258- drm-uapi: Sync nouveau_drm.h 2259- nouveau/winsys: Add a vram_used query 2260- nvk: Add a nouveau_ws_device to nvk_physical_device 2261- nvk: Add a hand-rolled nvk_memory_heap struct 2262- nvk: Use 3/4 of total system memory for the VRAM heap 2263- nvk: Add an available query to nvk_memory_heap 2264- nvk: implement EXT_memory_budget 2265- nouveau/winsys: Getch the BAR size from the kernel 2266- nvk/heap: Upload shaders on the CPU when we have a map 2267- nvk: Upload shaders on the CPU when we have ReBAR 2268- nvk: Expose a host-visible VRAM type when we have REBAR 2269- nvk: Only expose VK_KHR_present_id/wait when we have WSI 2270- nvk: Advertise VK+KHR_incremental_present 2271- nil: Add PIPE_FORMAT_R5G5B5A1_UNORM 2272- nak: Add writes_point_size to nak_shader_info 2273- nvk: Handle missing gl_PointSize in the last geometry stage 2274- nvk/copy: Handle VK_REMAINING_ARRAY_LAYERS 2275- vulkan/meta: Handle VK_REMAINING_ARRAY_LAYERS in blit and resolve 2276- nvk: Use VkPipelineCreateFlags2 flag names 2277- nvk: Advertise VK_KHR_maintenance5 2278- vulkan: Add a vk_get_subgroup_size() helper 2279- vulkan: Move the descriptor set limit to vk_limits.h 2280- vulkan: Add runtime code for VK_EXT_shader_object 2281- vulkan: Add a vk_render_pass_state_has_attachment_info() helper 2282- vulkan: Rework vk_render_pass_state::attachments 2283- vulkan: Add a new dynamic state for render pass attachments 2284- vulkan: Add a vk_pipeline base struct 2285- vulkan: Add push constant ranges to vk_pipeline_layout 2286- vulkan: Add a BLAKE3 hash to vk_descriptor_set_layout 2287- vulkan: Add generic graphics and compute VkPipeline implementations 2288- nvk: Populate vk_descriptor_set_layout::blake3 2289- nvk/shader: Refactor some helpers 2290- nvk: Move populate_fs_key to nvk_shader.c 2291- nvk: Pass an array of descriptor sets to nvk_lower_nir 2292- nvk: Move nir_lower_patch_vertices to nvk_lower_nir() 2293- nvk: Use vk_render_pass_state::attachments for write masks 2294- nvk: Switch to shader objects 2295- nvk: Advertise VK_KHR_graphics_pipeline_library 2296- nvk: Advertise VK_EXT_shader_object 2297- nak: Implement nir_op_iadd3 on SM70+ 2298- nir: Add an imad opcode 2299- nak: Move NAK_FS_OUT_COLOR next to the enum 2300- nak: Add support for imad on Volta+ and enable it in simple cases 2301- nvk: Advertise a CTS version of 1.3.7.3 2302- nvk: Drop the non-conformant warning on Turing-Ada 2303- nvk: Don't print the NVK_I_WANT_A_BROKEN_VULKAN_DRIVER warning in release builds 2304- meson: Rename nouveau-experimental to nouveau and build by default on x86 2305- vulkan/pipeline: Whitespace fix 2306- vulkan/pipeline: Handle fully compiled library shaders properly 2307- nvk: Advertise VK_KHR_pipeline_library 2308- docs/nvk: Update the conformance status section 2309- docs/nvk: Update the NVK_DEBUG docs 2310- docs/nvk: Document NAK_DEBUG 2311- nil: Enable A8_UNORM for storage buffers 2312- vulkan/pipeline: Always init pipeline cache objects 2313- nak: Fix printing of OpIsberd 2314- nak/sm50: Fix encoding of immediates in OpFFma 2315- nak/sm50: Use OpBfe instead of OpBRev for nir_op_find_lsb 2316- nak: Support F2I for 8-bit integers on SM50 2317- nvk: Return os_page_size for minMemoryMapAlignment 2318- nouveau: Import g_nv_name_released.h from NVIDIA OGK 2319- nvk: Report official GPU names from NVIDIA when we have them 2320- nvk: Use row_stride_B instead of width for render and copies 2321- nil: Rework tiling calculations 2322- nil: Add a concept of width to tile sizes 2323- nil: Add a concept of sliced 3D image views 2324- nvk: Use "real" 3D image views 2325- nvk/queue: Add a push_bind helper 2326- nvk: Refactor opaque image binds 2327- nvk/queue: Add support for non-opaque sparse binds 2328- nak: Rename resident to fault 2329- nak: Plumb is_sparse through from NIR for texture ops 2330- nak/nir: Add sparse support to shrink_image_load() 2331- nak: Wire up sparse residency for texture ops 2332- nil: Fix a typo in a comment 2333- nvk: Document the register name for the helper load workaround 2334- nvk: Always wait for the FALCON in set_priv_reg 2335- nvk: Disable the Out Of Range Address exception 2336- nvk: Drop a bunch of dev->pdev and just use pdev 2337- nvk: Add and use more cmd_buffer_*_cls helpers 2338- nvk: Replace more dev->pdev with nvk_device_physical() 2339- nvk: Drop nvk_device::pdev 2340- zink: Remove interpolateAtSample() when not multi-sampling 2341- nil: Move Z slice offset calculations to a helper 2342- nvk: Add a nil_image helper variable in BeginRendering 2343- nvk: Manually offset array and Z slices in BeginRendering 2344- nil: Advertise support for PIPE_FORMAT_R5G6B5_UNORM 2345- nil: Whitespace fix 2346- nil: Add support for larger textures on Pascal+ 2347- nil: Add a helper to view a MSAA image as samples 2348- nil: Expose nil_pix_extent_sa() 2349- nvk: Use a HW generation names instead of chipsets 2350- nvk: Stop pretending to handle Intel image intrinsics 2351- nvk: Use different descriptor layouts for storage vs. sampled images 2352- nvk: Implement shaderStorageImageMultisample 2353- zink: Rework sparse texture lowering 2354- nvk: Ignore rasterizationSamples when handling sampleShadingEnable 2355- nvk: Always set SET_ATTRIBUTE_POINT_SIZE 2356- Revert "nvk: Enable VK_KHR_shader_subgroup_uniform_control_flow" 2357- nvk: Move the mutableDescriptorType enable 2358- nir: Take a nir_def in nir_goto_if() 2359- nir/print: Inline print_ssa_use() 2360- nir/builder: Correctly handle decl_reg or undef as the first instruction 2361- nir: Improve the comment for nir_block::imm_dom 2362- nir: Add a sort_unstructured_blocks() helper 2363- nir: Validate that unstructured blocks are in reverse PDFS order 2364- nir/lower_reg: Remove dead reg_decl intrinsics 2365- nir/lower_reg: Support unstructured control-flow 2366- nir/repair_ssa: Support unstructured control-flow 2367- nir/gather_types: Support unstructured control-flow 2368- nir: Mark divergent regs in phis_to_regs_block() 2369- nir: Add a lower_terminate_to_demote pass 2370- nak: Add a copy_fs_outputs_nv intrinsic 2371- nak: Move barrier removal into its own pass 2372- nak: Add a condition to bar_break_nv 2373- nak/nir: Add a control-flow lowering pass 2374- nak: Add more NIR wrappers for walking the NIR CFG 2375- nak: Add NIR helpers for jump instructions 2376- nak: Add helpers for emitting jumps 2377- nak: Handle unstructured NIR 2378- nak: Use the new lowering pass on SM70+ 2379- nak: Remove the old barriers pass 2380- nak/nir: Use nir_lower_terminate_to_demote() 2381- nvk: Advertise VK_KHR_shader_maximal_reconvergence 2382- nvk: Advertise VK_KHR_shader_subgroup_uniform_control_flow 2383- nak/nir: Emit nir_intrinsic_ald_nv directly for system values 2384- nak/nir: Rename load_interpolated_input 2385- nak/nir: Add a load_fs_input hepler for flat inputs 2386- nak/nir: Emit nir_intrinsic_ipa_nv directly for FS system values 2387- nak/nir: Use nir_io_semantics for varyings and attributes 2388- nak: Break lower_fs_inputs into its own file 2389- nak/nir: Clean up lower_fs_inputs a bit 2390- nak: Call nir_lower_io_to_temporaries for FS outputs 2391- nak/nir: Use nir_io_semantics for FS outputs 2392- nak: Drop lower_io_arrays_to_elements_no_indirects for FS outputs 2393- nak: Simplify over-all I/O lowering 2394- nak: Don't write undefined FS outputs 2395- nak: Plumb through LDC modes 2396- nak: Implement load_ubo with an indirect cbuf index 2397- nvk: Support VkBindMemoryStatusKHR 2398- nvk: Advertise VK_KHR_maintenance6 2399- nir: Delete the rest of the CF list when adding a halt 2400- nak: Don't do a scope break cascade for nir_jump_halt 2401- nil: Add a CSV version of the format table 2402- nil: Re-organize the format table 2403- nil: Switch to using the CSV generated table 2404- nil: Drop bogus color formats from non-renderable luminance/alpha formats 2405- nil: Remove 2-bit SNORM from the format table 2406- nil: Drop unneeded types from formats 2407- nvk: Use the page-aligned BO size for the descriptor pool 2408- nvk: Use a VMA heap for descriptor memory 2409- nvk: Use a linked list for descriptor sets in a pool 2410- nvk: Add a _pad field to nvk_cbuf 2411- nvk: Delete dead descriptor code 2412- nvk: Add a _pad field to nvk_fs_key 2413- nvk: Add a bunch of -Wpadded errors 2414- vulkan: Add a bunch of -Wpadded errors 2415- nouveau: Move .rustfmt.toml from NAK to src/nouveau/ 2416- nouveau: Use hyphenated arguments to class_parser.py 2417- nouveau/headers: Add initial Rust bindings 2418- nouveau/headers: Add Rust bindings for texture headers 2419- ci: Add cbindgen to the build images 2420- nil: Move nil_tic_format to nil_format_table.h 2421- nil: Move to a single header file 2422- nvk: Stop using nvk_extent4d short names 2423- nil: Rename nil_tiling::gob_height_8 to gob_height_is_8 2424- nak/bitview: Add a SetField<f32> implementation 2425- nil: Delete unused USAGE bits 2426- nil: Make the Rust library the primary build target 2427- nil: Add Extent/Offset4D::new() helpers 2428- nil: Drop the nil_extent/offset4d() helpers 2429- nil: Take a format in el_to_B() 2430- nil: Enforce units via Rust types 2431- nil: cbindgen is required 2432- nvk: Improve the unsupported handle type error 2433- nvk: Restrict shaderFloat16 to Ampere+ for now 2434- nouveau/headers: Move the classes into a submodule for Rust 2435- nouveau/headers: Generate Rust for QMDs 2436- nak: Add helpers for filling QMDs 2437- nvk: QMDs are 64 dwords 2438- nvk: Use the NAK helpers to fill QMDs 2439- nouveau: Import the hwref headers from Nvidia OGK 2440- nouveau/headers: Add the MMU headers to the Rust crate 2441- nil: Use the enums from the hwref headers for PTE kinds 2442- nil: add s8 pte kind 2443- nil: Be more speicific about Maxwell in the format table 2444- nil: Advertise S8_UINT on MaxwellB+ 2445- nvk: Hash ycbcr conversions in the descriptor set layout hash 2446- nvk: Re-emit sample locations when rasterization samples changes 2447- nvk/meta: Restore set_sizes[0] 2448- nvk/upload_queue: Only upload one line of data 2449- vulkan/wsi: Bind memory planes, not YCbCr planes. 2450- nvk: Improve the GetMemoryFdKHR error 2451- nouveau/winsys: Take a reference to BOs found in the cache 2452- nouveau/winsys: Make BO_LOCAL and BO_GART separate flags 2453- nvk: Allow GART for dma-bufs 2454- nil: Use the right PTE kind for Z32 pre-Turing 2455- nvk: Set color/Z compression based on nil_image::compressed 2456- nil: Default to NV_MMU_PTE_KIND_GENERIC_MEMORY on Turing+ 2457- nvk: Allow VK_IMAGE_ASPECT_MEMORY_PLANE_0_BIT 2458- drm-uapi: Sync nouveau_drm.h 2459- nouveau/winsys: Add back nouveau_ws_bo_new_tiled() 2460- nvk: Support image creation with modifiers 2461- nvk: Set tile mode and PTE kind on dedicated dma-buf BOs 2462- nvk: Implement DRM format modifier queries 2463- nvk: Advertise VK_EXT_queue_family_foreign 2464- nvk: Advertise VK_EXT_image_drm_format_modifier 2465- nvk/wsi: Advertise modifier support 2466- zink: Set workarounds.can_do_invalid_linear_modifier for NVK 2467- nvk/meta: Save and restore set_dynamic_buffer_start 2468 2469Felix DeGrood (6): 2470 2471- driconf: Change vendorid on Palworld for Intel 2472- driconf: Fake vendorid for RDR2 2473- mesa-overlay: defer listening to socket until first frame 2474- driconf: add SotTR DX12 to Intel XeSS workaround 2475- iris: Increase target batch size to 128 KB 2476- intel/ds: add pipe control reasons to perfetto flushes 2477 2478Francisco Jerez (36): 2479 2480- intel/fs: Use full 32-bit sample masks when immediate. 2481- intel/eu/validate: SEND instructions don't have immediate encodings on Gen12+. 2482- intel/eu/gfx12.5+: Don't fail validation with ARF register restriction error for indirect addressing. 2483- intel/compiler/xe2: Add Xe2 bounds to FF() macro. 2484- intel/compiler/xe2: Implement codegen of general instruction controls. 2485- intel/compiler/xe2: Implement codegen of 2-source instruction operands. 2486- intel/compiler/xe2: Implement codegen of indirect immediates. 2487- intel/compiler/xe2: Implement codegen of three-source instructions. 2488- intel/compiler: Add assume() checks to brw_compact_inst_(set\_)bits(). 2489- intel/compiler/xe2: Implement codegen of compact instructions. 2490- intel/compiler/xe2: Implement instruction compaction. 2491- intel/compiler/xe2: Fix for NibCtrl field removal. 2492- intel/compiler/xe2: Fix for the removal of most predication modes. 2493- intel/compiler/xe2: Add extra flag registers. 2494- intel/compiler/xe2: Fix for the removal of AccWrCtrl. 2495- intel/ir/xe2+: Add support for 32 SBID tokens to performance model. 2496- intel/fs/xe2+: Disable bank conflict mitigation pass for now. 2497- intel/eu/xe2+: Translate brw_reg fields in REG_SIZE units to physical 512b GRF units during codegen. 2498- intel/fs: Set the default execution group to 0 when not representable by the platform. 2499- intel/fs: Emit QUAD_SWIZZLE instructions with WE_all for derivative lowering. 2500- intel/fs/xe2+: Allow SIMD16 MULH instructions. 2501- intel/brw/xe2: Render target reads have been removed from the hardware. 2502- intel/brw/xe2+: Update encoding of FB write descriptor message control. 2503- intel/brw/xe2+: Update encoding of FB write extended descriptor. 2504- intel/brw/xe2+: Double allowed SIMD width of FB write SEND messages. 2505- intel/brw/xe2+: Allow FS stencil output in SIMD16 dispatch mode. 2506- intel/brw/xe2+: Allow dual-source blending in SIMD16 mode. 2507- intel/blorp/xe2+: Don't use replicated-data clears. 2508- intel/brw/gfx12: Setup PS thread payload registers required for ALU-based pixel interpolation. 2509- intel/brw/xe2+: Setup PS thread payload registers required for ALU-based pixel interpolation. 2510- iris/xe2+: Disable coherent framebuffer fetch. 2511- intel/brw/xehp+: Replace lsc_msg_desc_dest_len()/lsc_msg_desc_src0_len() with helpers to do the computation. 2512- intel/eu/xehp+: Don't initialize mlen and rlen descriptor fields from lsc_msg_desc*(). 2513- intel/brw/xehp+: Drop redundant arguments of lsc_msg_desc*(). 2514- intel/fs/gfx20+: Implement sub-dword integer regioning restrictions. 2515- intel/fs/gfx20+: Handle subdword integer regioning restrictions in copy propagation. 2516 2517Frank Binns (3): 2518 2519- CODEOWNERS: update Imagination maintainers 2520- pvr: fix up some includes 2521- pvr: split out device info into per GPU headers 2522 2523Friedrich Vock (27): 2524 2525- radv/rt: Add workaround to make leaves always active 2526- radv: Fix shader replay allocation condition 2527- nir: Make is_trivial_deref_cast public 2528- nir: Handle casts in nir_opt_copy_prop_vars 2529- radv/amdgpu: Fix build on BSD 2530- winsys/amdgpu: Fix build on BSD 2531- util: Provide a secure_getenv fallback for platforms without it 2532- vulkan: Use secure_getenv for trigger files 2533- aux/trace: Guard triggers behind __normal_user 2534- vtn: Use secure_getenv for shader dumping 2535- mesa/main: Use secure_getenv for shader dumping 2536- radv: Use secure_getenv in radv_builtin_cache_path 2537- radv: Use secure_getenv for RADV_THREAD_TRACE_TRIGGER 2538- util/disk_cache: Use secure_getenv to determine cache directories 2539- radv/rt: Write inactive node data in ALWAYS_ACTIVE workaround 2540- radv/rt: Optimize update shader VGPR usage 2541- radv,driconf: Enable active AS leaf workaround for Jedi Survivor 2542- radv/rt: Handle monolithic pipelines in capture/replay 2543- vulkan/runtime: Allow more than 8 DRM devices 2544- radv: Set SCRATCH_EN for RT pipelines based on dynamic stack size 2545- radv/rt: Fix frontface culling with emulated RT 2546- radv/rt: Force active leaves for every updateable accel struct 2547- radv,driconf: Remove active accel struct workaround 2548- radv: Only enable SEs that the device reports 2549- radeonsi: Only enable SEs that the device reports 2550- aco/tests: Insert p_logical_start/end in reduce_temp tests 2551- aco/spill: Insert p_start_linear_vgpr right after p_logical_end 2552 2553Ganesh Belgur Ramachandra (1): 2554 2555- compiler,glsl: fix warning when -finstrument-functions is used 2556 2557Georg Lehmann (60): 2558 2559- aco: reassign split vector to SOPC 2560- aco: stop scheduling at p_logical_end 2561- nir: add ballot_relaxed and as_uniform intrinsics 2562- aco: implement as_uniform and ballot_relaxed 2563- ac/llvm: implement as_uniform and ballot_relaxed 2564- nir: add lowering for boolean shuffle 2565- radv: lower boolean shuffle 2566- radeonsi: lower boolean shuffle 2567- aco: remove boolean shuffle isel 2568- aco: fix printing dpp8 2569- aco: validate v_permlane opsel correctly 2570- aco: support v_permlane64_b32 2571- aco/gfx11: use v_nop to resolve VcmpxPermlaneHazard 2572- aco/gfx11: resolve VcmpxPermlaneHazard for v_permlane64 2573- aco: implement rotate 2574- radv: enable VK_KHR_shader_subgroup_rotate 2575- radv: report rotate subgroup feature bits 2576- anv: report rotate subgroup feature bits 2577- aco/gfx11+: disable v_pk_fmac_f16_dpp 2578- aco: add packed fma dpp note to README-ISA 2579- aco: don't remove branches that skip v_writelane_b32 2580- aco/print_ir: don't use alloca for input modifiers 2581- aco: print neg prettier for packed math 2582- aco: don't print hi() for permlane opsel 2583- aco: print permlane16 bc/fi 2584- aco: print exec/vcc_lo/hi for single dword access 2585- aco/gfx11+: limit hard clauses to 32 instructions 2586- radv/gfx11+: add rtwave32 perftest option 2587- aco: use fmamk/ak instead of fma with inline constant for more VOPD 2588- nir: remove rotate scope 2589- nir/divergence_analysis: fix subgroup mask 2590- aco: create pseudo instructions with correct struct 2591- aco/post-ra: rename overwritten_subdword to allow additional uses 2592- aco/post-ra: assume scc is going to be overwritten by phis at end of blocks 2593- aco: store if pseudo instr needs scratch reg 2594- aco/post-ra: track pseudo scratch sgpr/scc clobber 2595- aco/ssa_elimination: check if pseudo scratch reg overwrittes regs used for v_cmpx opt 2596- aco/builder: improve v_mul_imm for negative imm 2597- aco/builder: use 24bit mul if low bits of imm are zero 2598- aco/optimizer: combine v_mul_i32_i24 and add to mad 2599- aco: avoid full 32bit imul for uniform reduce/scan 2600- aco: don't combine mul+add_clamp to mad_clamp 2601- aco/ra: use SDWA for 16bit instructions when the second byte is blocked 2602- aco/vn: remove instruction hash templates 2603- aco: use v1 definition for v_interp_p1lv_f16 2604- aco/assembler: add vintrp high_16bit support 2605- aco: swap opsel and wait_exp for vinterp 2606- aco: support high_16bits FS IO 2607- aco/tests: add assembler tests for interp high_16bits 2608- aco/gfx9: all non legacy opsel instructions only write 16bits 2609- aco: use v_interp_p2_f16 opsel 2610- aco: add ra test for hi v_interp_p2_f16 2611- radv: sink alu 2612- radv: move alu 2613- nir: don't try to optimize exclusive min/max scan to inclusive 2614- nir: rename to nir_opt_16bit_tex_image 2615- ac/nir: add ac_nir_opt_pack_half 2616- radv: use ac_nir_opt_pack_half 2617- radv, radeonsi: don't use D16 for f2f16_rtz 2618- zink: use bitcasts instead of pack/unpack double opcodes 2619 2620George Ouzounoudis (1): 2621 2622- vulkan: Fix null pointer dereferencing on sample locations state 2623 2624Gert Wollny (76): 2625 2626- r600: lower dround_even also on hardware that supports fp64 2627- virgl: Use better reporting for mirror_clamp features 2628- ci: Upref virglrenderer 2629- zink: Factor out create buffer from resource_object_create 2630- zink: shorten lifetime of success variable in resource_object_create 2631- zink: Factor out create sampler conversion in resource_object_create 2632- zink: factor out get_format_feature_flags in resource_object_create 2633- zink: factor out get_image_memory_requirement in resource_object_create 2634- zink: reduce number of #ifdefs in resource_create_object 2635- zink: extract get_export_flags from resource_object_create 2636- zink: extract function allocate_bo from resource_create_object 2637- zink: redesign the allocation try loop to test all heaps 2638- zink: extract function create_image from resource_object_create 2639- zink: extract function update_alloc_info from resource_object_create 2640- zink: extract update_obj_info from resource_object_create 2641- zink: extract debug_resource_mem from resource_object_create 2642- zink: drop duplicate assignment to obj->alignment 2643- zink: extract allocate_bo_and_update_obj from resource_object_create 2644- zink: Move more code to create_image and create_buffer 2645- zink: simplify call to get_export_flags 2646- zink: remove duplicate arguments and use VkMemoryRequirements locally 2647- zink: use enums as return values in resource_object_create 2648- radv: Fix compilation with gcc-13 and tsan enabled 2649- nir/lower_int64: Fix compilation with gcc-13 and tsan enabled 2650- nir/builder: Fix compilation with gcc-13 when tsan is enabled 2651- zink: Fix return type and values of create_buffer and create_images 2652- zink: extract check_unordered_exec from zink_get_cmdbuf 2653- zink: remove duplicate check and assignment in zink_resource_image_needs_barrier 2654- zink: extract emit_memory_barrier from zink_resource_image_barrier 2655- zink: extract emit_memory_barrier::for_buffer from zink_resource_buffer_barrier 2656- zink: extract update_unordered_access_and_get_cmdbuf 2657- zink/sync: remove duplicate assignments in UNSYNCHRONIZED case 2658- zink: move zink_resource_copies_reset out of exportable_lock 2659- zink: remove invalid scope in bo allocation loop 2660- r600: handle indirect access to kcache 14 and 15 2661- zink/nir_to_spirv: emit ViewportIndex cap also for inputs 2662- zink: use only ZINK_BIND_DESCRIPTOR 2663- zink: decrease aggressiveness of increasing descriptor data space adaptive 2664- zink/nir-to-spirv: Make sure sampleid for InterpolateAtSample is int 2665- nir-to-spirv: Cast SSBO input pointer when needed 2666- zink: set handle type also for user memory 2667- zink: acquire - maybe clear timeout after waiting for presentation fence 2668- nir_to_spirv: Allow LOD for external images 2669- zink: ctx->last_fence really wants to be a batch_state, so accomodate it 2670- zink: another fence that is better off as a batch state 2671- ntv: remove store_def_raw 2672- ntv: remove store_ssa_def 2673- ntv: pass def->index to store_def 2674- ntv: simplify increasing the number of dest componnets for sparse tex 2675- zink/ntv: introduce structure using the source params 2676- zink/ntv: extract get_tex_srcs 2677- zink/ntv: use new struct to pass texture parameters 2678- zink/ntv: extract find_sampler_and_texture_index 2679- zink/ntv: simplify evaluation of sampled_type 2680- zink/ntv: extract get_texture_load 2681- zink/ntv: extract get_texop_dest_type 2682- zink/ntv: Extract move_tex:proj_into_coord 2683- zink/ntv: replace if-chain with switch in emit_tex 2684- zink/ntv: extract picking the image to load from 2685- zink/ntv: extract emit_tex_readop as function that reads texture pixel data 2686- zink/ntv: pull result out of cases and use a common store_def 2687- zink: if AcquireNextImageKHR fails with VK_NOT_READY or VK_TIMEOUT retry 2688- meson: Add blacklist when compiling with tsan 2689- futex: disable futexes when compiling with tsan 2690- util/u_queue: read fence->signalled locked with TSAN 2691- tsan-blacklist: ignore race when reading lp_fence signalled status 2692- llvmpipe: Don't emit certain debug code when TSAN is enabled 2693- tsan-blacklist: Ignore race in get_max_abs_timeout_ns 2694- tsan-blacklist: surpress two race conditions in TC 2695- r600/sfn: Add array element parent also to array 2696- r600/sfn: Use dependecies to order barriers and LDS/RAT instructions 2697- r600/sfn: call nir_lower_doubles explicitely 2698- r600/sfn: when emitting fp64 op2 groups pre-load values 2699- r600/sfn: Don't put b2f64 conversion into ALU group 2700- zink/kopper: Wait for last QueuePresentKHR to finish before acquiring for readback 2701- mesa/st: don't use base shader serialization when uniforms are not packed 2702 2703Guilherme Gallo (25): 2704 2705- ci/lava: Turn the r8152 issue check into a counter 2706- ci/lava: Detect r8152 issue during boot phase 2707- ci/lava: Detect hard resets during test phase 2708- bin/ci: Don't submit jobs on integration test 2709- ci/lava: Ignore DUT feedback messages 2710- ci/lava: Fix the integration test 2711- bin/ci: Propagate the token to GitlabGQL 2712- bin/ci: Move get_token_from_default_dir to common 2713- bin/ci: Refactor read_token function 2714- bin/ci: Add GitLab basic token validation 2715- ci/lava: Broader R8152 error handling 2716- radv+zink/ci: Update xfiles based on nightly run 2717- radv/ci: Update xfiles based on nightly run 2718- v3d/ci: Update xfiles based on nightly run 2719- freedreno/ci: Update xfiles based on nightly run 2720- etnaviv/ci: Update xfiles based on nightly run 2721- r300/ci: Update xfiles based on nightly run 2722- ci/a618: Rebalance a618-limozeen jobs 2723- ci/a618: Add zink-tu-a618-full 2724- ci/lava: A few formatting cleanups 2725- ci/lava: Introduce unretriable exception handling 2726- ci/lava: Don't run jobs if the remaining execution time is too short 2727- ci/lava: Fix how exception entry in structured log 2728- ci: Add S3 id_token for all jobs 2729- ci: Use id_tokens for JWT auth 2730 2731Gurchetan Singh (6): 2732 2733- mesa/util: Check __ANDROID__ when for detecting for Android 2734- mesa/util: add <linux/fcntl.h> 2735- mesa/util: use DETECT_OS_ANDROID in anon_file.c 2736- mesa/vulkan: use a simpler path for header in trampoline gen 2737- mesa/vulkan: use DETECT_OS_ANDROID 2738- vk_image.c: #ifndef _WIN32 --> DETECT_OS_LINUX + DETECT_OS_BSD 2739 2740Haihao Xiang (1): 2741 2742- anv: Fix typo in transition_color_buffer 2743 2744Hannes Mann (2): 2745 2746- gallium/pipe: Add contiguous planes per-surface attribute 2747- frontends/va: Only export one handle for contiguous planes 2748 2749Hans-Kristian Arntzen (20): 2750 2751- wsi/x11: Remove unused vk_alpha in get_dri3_modifiers. 2752- wsi/x11: Compare modifiers before signalling SUBOPTIMAL. 2753- wsi/x11: Add drirc option to ignore SUBOPTIMAL. 2754- wsi/x11: Add workaround for Detroit Become Human. 2755- wsi/x11: Rewrite implementation to always use threads. 2756- wsi/x11: Implement VK_EXT_swapchain_maintenance1. 2757- wsi/x11: Keep track of multiple presentation requests. 2758- wsi/x11: Make chain->status atomic. 2759- wsi/wl: Refactor out code to update current present ID. 2760- wsi/wl: Improve fallback for present_wait. 2761- wsi/common: Allow KHR_present_wait on WL. 2762- wsi/x11: Disable vk_xwayland_wait_ready by default on most drivers. 2763- wsi/x11: Rewrite logic for how we consider minImageCount. 2764- radv: export multiview in VS/TES/GS for depth-only rendering 2765- wsi/wl: Fix deadlock in dispatch_queue_timeout. 2766- wsi/wayland: Replace surface pilfer with retired bool. 2767- wsi/wayland: Init outstanding list earlier. 2768- wsi/x11: Return OUT_OF_DATE on sw resize. 2769- vulkan/runtime: Check correct callback list for binding report. 2770- radv: Store range rather than bo_size in VkBuffer/VkImage. 2771 2772Helen Koike (6): 2773 2774- ci/ci_run_n_monitor: move get_gitlab_pipeline_from_url() to gitlab_common 2775- ci/ci_gantt_chart: add tool to analyse pipeline execution time 2776- ci/ci_gantt_chart: add timeout vertical line 2777- ci/ci_gantt_chart: add option to save output to a file 2778- ci/ci_gantt_chart: show duration on hover 2779- ci/ci_post_gantt: add script that post gantt to Marge's messages 2780 2781Hsieh, Mike (4): 2782 2783- amd/vpelib: add new tf enum and add flag for geometric scaling 2784- amd/vpelib: skip gamma remap and cs conversion when geometric scaling 2785- amd/vpelib: geometric scaling fix 2786- amd/vpelib: Add UID for 3d Lut and control logic 2787 2788Hyunjun Ko (3): 2789 2790- anv/video: fix out-of-bounds read 2791- anv/video: fix scan order for scaling lists on H265 decoding. 2792- anv/video: Fix to set correct offset and size for parsing h265 slice header. 2793 2794Iago Toral Quiroga (49): 2795 2796- broadcom/compiler: fix incorrect flags setup in non-uniform if path 2797- broadcom/compiler: fix incorrect flags update for subgroup elect 2798- broadcom/compiler: add new SFU instructions in V3D 7.x 2799- broadcom/compiler: don't move subgroup reduction instructions above setmsf 2800- broadcom/compiler: support subgroup ballot 2801- broadcom/compiler: support subgroup shuffle 2802- broadcom/compiler: support subgroup vote 2803- broadcom/compiler: support subgroup quad 2804- v3dv: expose more subgroup features on V3D 7.x 2805- broadcom/compiler: be more careful with unifa in non-uniform control flow 2806- brodcom/compiler: implement non-compute TSY barrier 2807- broadcom/compiler: support subgroup reduction operations from fragment shaders 2808- v3dv: allow subgroup operations in fragment shaders 2809- broadcom/compiler: fix lane selection for subgroups in fragment shaders 2810- v3d,v3dv: fix BO allocation for shared vars 2811- v3dv: fix subpass clear with draw call for multi-layered framebuffers 2812- v3dv: always set view index before drawing 2813- v3dv: fix copying v3dv_end_query_info into primaries from secondaries 2814- v3dv: refactor checking and adding pending jobs 2815- v3dv: add a helper to constrain clip window to render area 2816- v3dv: add helper to check if we need to use a draw for a depth/stencil clear 2817- v3dv: add helper to build a render pass for dynamic rendering 2818- v3dv: add a helper to setup a framebuffer for dynamic rendering 2819- v3dv: add a vk_render_pass_state to pipelines 2820- v3dv: don't assume that pipelines have a render pass 2821- v3dv: implement vkCmdBeginRendering and vkCmdEndRendering 2822- v3dv: implement dynamic rendering resume/suspend 2823- v3dv: rename SECONDARY job type to INCOMPLETE 2824- v3dv: fix resume address patching for secondary command buffers 2825- v3dv: handle render pass continue flag with dynamic passes 2826- v3dv: also emit subpass clears with secondary command buffers 2827- v3dv: enable VK_KHR_dynamic_rendering 2828- broadcom/ci: skips for tests that don't check for extension support correctly 2829- broadcom/ci: add new expected test failures 2830- broadcom/ci: add a test that fails only in CI 2831- broadcom/ci: add skips for unsupported features 2832- v3dv: fix image creation when exceeding maxResourceSize 2833- v3d: implement fix for GFXH-1602 2834- broadcom/compiler: fix workaround for GFXH-1602 2835- v3dv: require multisync kernel 2836- v3dv: drop single sync kernel interface 2837- v3dv: add a v3dv_job_clone helper 2838- v3dv: fix job pointers from cloned CLs 2839- v3dv: store the offset of the BRANCH instruction in a CL 2840- v3dv: fix job suspend with command buffer simultaneous use flag 2841- broadcom/compiler: enable perquad with uses_wide_subgroup_intrinsics 2842- v3d/simulator: size counter_values array correctly on V3D 7.x 2843- broadcom/ci: document external causes for some CTS 1.3.8 failures 2844- v3dv: fix VK_KHR_vertex_attribute_divisor 2845 2846Ian Romanick (54): 2847 2848- nir: Minor clean up in nir_alu_srcs_negative_equal 2849- intel/compiler: Disable DPAS instructions on MTL 2850- intel/compiler: Use u_foreach_bit64 in brw_get_compiler_config_value 2851- intel/compiler: Track lower_dpas flag in brw_get_compiler_config_value 2852- intel/compiler: Track mue_compaction and mue_header_packing flags in brw_get_compiler_config_value 2853- intel/fs: Fix shift counts for 8- and 16-bit types 2854- intel/rt: Don't directly generate umul_32x16 2855- intel/compiler/xe2: Update get_sampler_lowered_simd_width 2856- intel/fs: Move opcode modification before the switch that emits srcs 2857- intel/compiler/xe2: Use new sample_*_mlod messages 2858- nir: Pack texture LOD and array index to a single 32-bit value 2859- intel/compiler/xe2: Emit texture instructions w/ combined LOD and array index 2860- intel/compiler/xe2: Set SIMD mode for sampler messages 2861- nir: Add documentation for subgroup\_.._mask 2862- intel/fs: Delete stale comment in nir_intrinsic_ballot implementation 2863- nir: Mark nir_intrinsic_load_global_block_intel as divergent 2864- intel/fs: Enable nir_opt_uniform_atomics in all shader stages 2865- intel/fs: Use constant of same type to write flag 2866- intel/fs: Add fast path for ballot(true) 2867- nir: Initial framework for optimizing uniform subgroup operations 2868- intel/fs: Use nir_opt_uniform_subgroup 2869- nir: Optimize uniform iadd, fadd, and ixor reduction operations 2870- nir: Optimize uniform vote_all and vote_any 2871- i915: Fix value returned for PIPE_CAP_MAX_TEXTURE_CUBE_LEVELS 2872- intel/brw: Silence "statement may fall through" warning 2873- intel/brw: Correctly dump subnr for FIXED_GRF in INTEL_DEBUG=optimizer 2874- intel/compiler: Enforce 64-bit RepCtrl restriction in eu_validate 2875- intel/brw: Integer multiply w/ DW and W sources is not commutative 2876- intel/brw: Combine constants for src0 of integer multiply too 2877- intel/brw: Combine constants for src0 of POW instructions too 2878- intel/brw: Avoid a silly add with zero in assign_curb_setup 2879- intel/fs: Don't allow 0 stride on MOV destination 2880- intel/brw/xe2: Correctly disassemble RT write subtypes 2881- intel/brw: Fix handling of accumulator register numbers 2882- intel/brw: Allow SIMD16 F and HF type conversion moves 2883- intel/brw: Remove last vestiges of could_coissue 2884- intel/brw: Clear write_accumulator flag when changing the destination 2885- intel/brw: Use enums for DPAS source regioning 2886- nir: intel/brw: Change the order of sources for nir_dpas_intel 2887- intel/brw/xe2+: DPAS must be SIMD16 now 2888- intel/brw/xe2+: Use phys_nr and phys_subnr in DPAS encoding 2889- intel/brw/xe2: Update brw_nir_analyze_ubo_ranges to account for 512b physical registers 2890- intel/brw/xe2: Update uniform handling to account for 512b physical registers 2891- intel/compiler: Ensure load_barycentric_at_sample and load_interpolated_input remain together 2892- intel/brw: Don't call nir_opt_remove_phis before nir_convert_from_ssa 2893- intel/elk: Don't call nir_opt_remove_phis before nir_convert_from_ssa 2894- intel/brw: Delete stray nir_opt_dce 2895- intel/elk: Delete stray nir_opt_dce 2896- intel/brw/xe2+: Implement Wa 22016140776 2897- intel/brw/xe2+: Only apply Wa 22016140776 to math instructions 2898- intel/brw: Fix handling of cmat_signed_mask 2899- nir: intel/brw: Remove cmat_signed_mask from dpas_intel intrinsic 2900- intel/brw: Fix optimize_extract_to_float for i2f of unsigned extract 2901- intel/elk: Fix optimize_extract_to_float for i2f of unsigned extract 2902 2903Isaac Marovitz (1): 2904 2905- asahi: Add >16 Sampler Access for Ryujinx 2906 2907Iván Briano (10): 2908 2909- anv: flush query clears for all gens 2910- anv, hasvk: pMutableDescriptorTypeLists can be out of range on pool creation 2911- compiler/types: fix serialization of cooperative matrix 2912- intel/cmat: fix stride calculation in cmat load/store 2913- nir/algebraic: avoid double lowering of some fp64 operations 2914- nir/lower_doubles: preserve sign of zero if we are asked to 2915- nir/lower_doubles: preserve NaN when asked to do so 2916- anv, hasvk: check requirements for USAGE_INPUT_ATTACHMENT properly 2917- anv: check requirements for VK_IMAGE_USAGE_FRAGMENT_SHADING_RATE 2918- anv: fix casting to graphics_pipeline_base 2919 2920JCWasmx86 (1): 2921 2922- meson: Fix invalid kwarg name 2923 2924Jan Beich (1): 2925 2926- util: mimic KCMP_FILE via KERN_FILE on DragonFly and FreeBSD 2927 2928Jani Nikula (1): 2929 2930- docs: fix doc build 'intel/dev/intel_device_info_gen.h' file not found 2931 2932Javier Martinez Canillas (2): 2933 2934- clc: silence a warn_unused_result 2935- gallium: Add ssd130x to the list of kmsro drivers 2936 2937Jesse Natalie (115): 2938 2939- ci/windows: Update WARP to 1.0.9 NuGet 2940- mesa: Consider mesa format in addition to internal format for mip/cube completeness 2941- ci/windows: Rev Vulkan SDK and piglit 2942- d3d12: Set up spirv-as and fix expectations 2943- microsoft/compiler: Declare shader model 6.8 / validator 1.8 2944- microsoft/compiler: Handle comparison bias/gradient sampling 2945- dzn: Add a debug option to enable experimental shader models 2946- microsoft/compiler: Add feature flags for new comparison sampling ops 2947- dzn: Implement maintenance3 VariableDescriptorCountLayoutSupport 2948- dzn: Fix enhanced barrier layout for depth blits 2949- dzn: Handle VkBindImageMemorySwapchainInfoKHR 2950- dzn: Disable depth/stencil for partial binding from dynamic rendering 2951- spirv2dxil: Fix the spirv2dxil command line tool 2952- spirv2dxil: Handle aliasing/overlapping UBO/SSBO variables 2953- util: Detect arm64ec as aarch64 (and x86_64) 2954- glsl: Work around MSVC arm64 optimizer bug 2955- dzn: Don't set view instancing mask until after the PSO 2956- dzn: Fix path passed to CreateDeviceFactory 2957- d3d12: Fix path passed to CreateDeviceFactory 2958- microsoft/compiler: Use double pack/unpack instead of int for reduce ops on doubles 2959- dzn: Add a stencil blit fallback 2960- dzn: Add missing condition to immutable sampler init loop 2961- dzn: Add missing blit source barriers for enhanced barriers 2962- microsoft/compiler: Respect ACCESS_COHERENT in UAV variable data 2963- microsoft/compiler: Add a pass for promoting ACCESS_COHERENT on loads/stores 2964- spirv2dxil: Lower the Vulkan memory model and coherent loads/stores 2965- dzn: Add missing handling of VK_PIPELINE_STAGE_2_DRAW_INDIRECT_BIT 2966- dzn: Add barrier to copy source for DispatchIndirect copies 2967- dzn: Support non-static samplers for meta 2968- dzn: Add a debug flag for forcing off native view instancing 2969- dzn: Don't resolve for RESOLVE_MODE_NONE 2970- dzn: Use correct format for depth/stencil resolves 2971- dzn: Use blits for all non-averaging resolves 2972- microsoft/compiler: Only use simplified subgroup ID algorithm for compute 2973- d3d12: Subgroup ballot 2974- microsoft/compiler: Relax assert for SPIR-V barriers 2975- spirv2dxil: Remove dead branches early during shader compilation 2976- spirv2dxil: Trivial fixes for tessellation shaders 2977- dzn: Simultaneous-access is mutually exclusive with MSAA 2978- dzn: Fix tessellation shader insertion into PSO desc 2979- dzn: Add a driconf option to disable dzn for specific apps and use it for RDR2 2980- microsoft/compiler: For emulating scan, ensure all threads are active when reading cross-lane 2981- microsoft/compiler: Fix wave size control for SM6.6+ 2982- microsoft/compiler: Fix wave size control for SM6.8+ 2983- wgl: Support a single-buffered winsys framebuffer 2984- wgl: Flush frontbuffer when calling swapbuffers on single-buffered fb 2985- wgl: Add no-gdi-single-buffered and gdi-double-buffered PFDs 2986- wgl: Enable WGL_ARB_pixel_format_float 2987- wgl: Add HDR pixel formats 2988- winsys/d3d12: Support single-buffered mode 2989- d3d12: Support R16G16B16A16_FLOAT display targets 2990- microsoft/compiler: Fix SM6.6 non-bindless handle annotation for UAV counter 2991- dzn: Fix conditions for barrier in texture-converting copy case 2992- wgl: Check for stw_device->screen before trying to destroy it 2993- spirv2dxil: Set push constant register space to nonzero 2994- microsoft/compiler: Remove deref load/store/atomic ops that statically go out of array bounds 2995- microsoft/compiler: Remove code after discard/terminate in later optimization steps 2996- wgl: Initialize DEVMODE struct 2997- d3d12: Point sprite lowering pass needs to handle arrays 2998- nir_lower_tex_shadow: For old-style shadows, use vec4(result, 0, 0, 1) 2999- spirv2dxil: Support buffer_device_address 3000- dzn: Support bufferDeviceAddress 3001- wgl: Delete unused context param to swap 3002- wgl: Check for null before dereferencing ctx in swap 3003- nir_tests: Add /bigobj when compiling with MSVC 3004- dzn: Include vulkan_core.h instead of vulkan.h in the device enum header 3005- dzn: Initialize memoryTypeBits for querying properties on imported handles 3006- microsoft/compiler: domainLocation component index needs to be i8 3007- microsoft/compiler: Disable GS streams workaround for validator 1.8 3008- ci/windows: Update DirectX-Headers, Agility SDK, zlib, DXC, and WARP 3009- ci/debian: Update DirectX-Headers 3010- nir: Handle ptr_as_array for build_deref_follower 3011- microsoft/compiler: Don't store static-indexing handles that are dynamically emitted 3012- microsoft/clc: When possible, compute a part-constant "pointer" value for kernel inputs 3013- microsoft/compiler: Simplify code emitting CL globals 3014- clc: Move libclc helpers back to microsoft/clc 3015- microsoft/clc: Add linkage capability to libclc build to silence warning 3016- microsoft/clc: Adjust order of UAV binding assignment 3017- microsoft/clc: Install clon12compiler 3018- wgl: The default swap interval is supposed to be 1 3019- d3d12: Fix d3d12_lower_triangle_strip if multiple vars are in a single location 3020- microsoft/compiler: When sorting variables, put unused variables last 3021- microsoft/compiler: Move kill-unused/undefined varying pass from spirv to common 3022- microsoft/compiler: Simplify I/O component type enum handling 3023- microsoft/compiler: Expect front-facing var as an input 3024- microsoft/compiler: Improve linking helpers 3025- microsoft/compiler: Don't duplicate work from gather_info in var sorting 3026- d3d12: Move some lowering passes to pre-variant 3027- d3d12: Lower uniforms to UBO by nir options 3028- d3d12: Minor logging improvements 3029- d3d12: Fix var splitting pass writemasks 3030- d3d12: Explicitly add tess factor vars to tess signatures 3031- d3d12: Forward front-facing for passthrough GS 3032- d3d12: Capture always_active_io in varying data 3033- d3d12: Use TES inputs rather than VS outputs for TCS variant key 3034- d3d12: Add primitive ID sysval to input bitmask (for GS in) 3035- d3d12: Gather info less and before the final compilation steps 3036- d3d12: Remove variables instead of adding them for linking 3037- d3d12: Don't compile useless variants during shader creation 3038- microsoft/compiler: Add a fractional var mask for variable sorting 3039- d3d12: Set fractional var masks 3040- d3d12: Add a debug flag for loading WinPixGpuCapturer.dll 3041- ci/windows: Bump Agility SDK to 1.613.2 for ExecuteIndirect validation fix 3042- microsoft/compiler: Handle base vertex/instance sysvals as DXIL intrinsics 3043- spirv2dxil: Support passing first vertex / base instance to DXIL backend 3044- spirv2dxil: Output more specific metadata for whether draw sysvals are needed 3045- dzn: Delete dzn structs for indirect draw args and use D3D ones 3046- dzn: Query options21 3047- dzn: Understand whether first-vertex and base-instance are needed for a pipeline 3048- dzn: Update pipeline cache params to take all options into account 3049- dzn: Rework indirect drawing keys for shaders and command signatures 3050- dzn: Add a hash table of command signatures with non-default strides 3051- dzn: Don't copy app indirect args if we don't need to 3052- glsl: Use a stable attr sort for VS in / FS out 3053- d3d12: Include <shlobj.h> with lowercase name 3054 3055Job Noorman (43): 3056 3057- tu: support l1 dcache size on musl 3058- ir3: fix setting shared flag on parallel copy arguments 3059- ir3: optimize read_first.macro to a mov 3060- ir3: fix printing of brcst.active and quad_shuffle 3061- ir3: optimize subgroup operations using brcst.active 3062- ir3: set reconvergence for scan_clusters.macro 3063- ir3: add disassembly for flat.b 3064- ir3: update a0/a1 users when cloning instructions 3065- ir3: fix alignment of spill slots 3066- ir3: validate instruction block pointer 3067- ir3: add terminators to blocks 3068- ir3: fix instruction count before kill_sched 3069- ir3: print branch sources 3070- ir3: remove OPC_B and brtype from cat0 3071- ir3: remove comp1/2 from cat0 3072- ir3: allow liveness calculation for different register types 3073- ir3: allow finding SSA uses for a subset of uses 3074- ir3: implement RA for predicate registers 3075- ir3: validate no registers are invalid after RA 3076- ir3: integrate predicates into RA validation 3077- ir3: optimize bitwise ops that can directly write predicates 3078- ir3: insert predicate conversions after their source 3079- ir3: fold negations into cmps.ne zero 3080- nir: add search helper is_only_used_by_if 3081- ir3: fold and/or and negations into branches 3082- freedreno/ci: Update pixmark-piano-v2 hash 3083- ir3: fix freeing incorrect register in loops 3084- ir3: fix returning false instead of NULL 3085- freedreno/registers: fix installation of schema 3086- zink: print shaderdb info via debug message callback 3087- ir3: calculate SSA uses at the start of predicates RA 3088- ir3: fix finding uses of reloaded defs in predicates RA 3089- ir3-disasm: run clang-format 3090- ir3-disasm: remove unused #includes 3091- ir3-disasm: add options to specify GPU by chip ID or name 3092- ir3-disasm: add option to disassemble hex number 3093- freedreno,computerator: support initialization of buffers 3094- ir3: remove unnecessary tessellation epilogue 3095- ir3: model predt/predf without sources 3096- ir3: add support for precolored sources in predicate RA 3097- ir3: add support for predication 3098- freedreno/drm-shim: add a730, a740, and a750 3099- freedreno/drm-shim: remove duplicate entry for a630 3100 3101Jonathan Gray (4): 3102 3103- intel/dev: update DG2 device names 3104- intel/dev: update DG2 device names 3105- intel/dev: update DG2 device names 3106- intel/dev: 0x7d45 is mtl-u not mtl-h 3107 3108Jonathan Marek (1): 3109 3110- tu/a750: Basic a750 support 3111 3112Jordan Justen (26): 3113 3114- intel/dev/common: Add xe2 support to get_l3_list() 3115- intel/dev: Add ARL platform enums 3116- intel/dev: Add intel_device_info_is_mtl_or_arl() 3117- intel/l3: Define l3 config for ARL 3118- iris: Extend MTL modifiers to ARL devices 3119- intel/i915: ARL also supports the set-PAT uapi 3120- intel/dev: Define engine prefetch for ARL 3121- isl: Define MOCS for ARL 3122- isl: Handle ARL in isl_drm_modifier_get_score() 3123- intel/compiler: Lower DPAS instructions on ARL except ARL-H 3124- anv/drirc: Extend option to disable FCV optimization to ARL 3125- anv/query: Follow MTL code paths on ARL 3126- intel/dev: Add device info for ARL 3127- intel/compiler: Set branch shader required-width as 16 for xe2 3128- intel/compiler: Implement nir_intrinsic_load_topology_id_intel for xe2 3129- intel/compiler: Verify SIMD16 is used for xe2 BTD/RT dispatch 3130- intel/dev: Add 2 additional ADL-N PCI ids 3131- intel/compiler: Adjust fs_visitor::emit_cs_terminate() for Xe2 3132- intel/dev: Adjust device strings for ATS-M devices 3133- intel/dev: Add ATS-M PCI ID for Data Center GPU Flex 170G 3134- intel/compiler/fs: Restore SIMD32 restriction for ray_queries on Xe2 3135- intel/compiler: nib_ctrl no longer exists on Xe2+ 3136- intel/dev/mesa_defs.json: Add LNL WA entries 3137- intel/dev: Add 0x56be and 0x56bf DG2 PCI IDs 3138- intel/dev: Change ATS-M 0x56c2 string from 170G to 170V 3139- intel/brw: Avoid getting a stride of 0 for nir_intrinsic_exclusive_scan 3140 3141Jose Maria Casanova Crespo (7): 3142 3143- ci: Adds /usr/local/bin to PATH at piglit-traces.sh 3144- v3d: Fix indentation at v3d_flush_jobs_writing_resource 3145- v3d: Only flush jobs that write texture from different job submission. 3146- v3d: Early return if job is not writing the resource 3147- v3d: Implement GL_ARB_texture_barrier 3148- broadcom/compiler: needs_quad_helper_invocation enable PER_QUAD TMU access 3149- ci: re-enable Igalia farm 3150 3151Joshua Ashton (34): 3152 3153- winsys/amdgpu: Hook up guilt to amdgpu_ctx_set_sw_reset_status 3154- winsys/amdgpu: Limit usage of query_reset_state2 3155- radv/amdgpu: Handle -ENODATA and -ETIME from cs_submit 3156- radv: Mark device loss if QueueSubmit failed immediately 3157- radv: Remove check_status 3158- radv/amdgpu: Remove ctx_query_reset_status 3159- radv: Add radv_get_tdr_timeout_for_ip helper 3160- radv: Ensure vkGetQueryPoolResults returns in finite-time 3161- android: Use system = 'android' in crossfile 3162- meson: Enable zink in gallium_drivers by default 3163- meson: Enable d3d12 in gallium_drivers by default on Windows 3164- anv: Enable EXT_swapchain_maintenance1 3165- v3dv: Enable EXT_swapchain_maintenance1 3166- lavapipe: Enable EXT_swapchain_maintenance1 3167- v3dv: Enable EXT_swapchain_colorspace 3168- lavapipe: Enable EXT_swapchain_colorspace 3169- wsi: Pass wsi_drm_image_params to wsi_configure_native_image 3170- wsi: Pass wsi_drm_image_params to wsi_configure_prime_image 3171- wsi: Add explicit_sync to wsi_image_info 3172- wsi: Add explicit_sync to wsi_drm_image_params 3173- build: Add linux-drm-syncobj-v1 wayland protocol 3174- wsi: Track if timeline semaphores are supported 3175- wsi: Add acquired member to wsi_image 3176- wsi: Track CPU side present ordering via a serial 3177- wsi: Get timeline semaphore exportable handle types 3178- wsi: Add common infrastructure for explicit sync 3179- ci: Bump wayland-protocols version to 1.34 3180- ci: Bump DEBIAN_BASE_TAG for now 3181- meson: Update wayland-protocols wrap to 1.34 3182- meson: Bump wayland-protocols requirement to 1.34 3183- wsi: Implement linux-drm-syncobj-v1 3184- tu: Expose VK_EXT_surface/swapchain_maintenance1 3185- radv: Enable KHR_video_queue if encode is enabled 3186- radv: Properly initialize imageCreateFlags in GetPhysicalDeviceVideoFormatPropertiesKHR 3187 3188José Expósito (2): 3189 3190- zink: add render-passes HUD query 3191- meson: Update proc_macro2 meson.build patch 3192 3193José Roberto de Souza (88): 3194 3195- intel/isl/xe2: Disable route of Sampler LD message to LSC 3196- anv: Fix PAT entry for userptr in integrated GPUs 3197- intel/genxml/xe2: Remove L3ALLOC 3198- intel/dev: Reduce usage of intel_device_info_compute_system_memory() 3199- intel: Make memory heaps consistent between KMDs 3200- anv: Fix calculation of syncs required in Xe KMD 3201- iris: Avoid read of uninitialized value in blorp_clear_stencil_as_rgba() 3202- iris: Fix return of iris_wait_syncobj() 3203- iris: Wait for drm_xe_exec_queue to be idle before destroying it 3204- intel/common: Add functions to handle async vm bind 3205- anv: Start to use intel_bind_timeline 3206- iris: Start to use intel_bind_timeline 3207- anv: Switch to truly asynchronous VM binding in Xe KMD 3208- iris: Switch to truly asynchronous VM binding in Xe KMD 3209- intel: Fix intel_get_mesh_urb_config() 3210- anv: Drop include to common/i915/intel_gem.h 3211- intel/common: Fix location of C++ support macro in intel_gem.h 3212- intel: Remove circular dependency between intel/dev and intel/common 3213- intel/common: Add intel_engines_supported_count() 3214- anv: Use intel_engines_supported_count() 3215- iris: Use intel_engines_supported_count() 3216- intel: Sync i915_drm.h 3217- intel/common: Implement i915_engines_is_guc_semaphore_functional() 3218- intel: Sync xe_drm.h 3219- intel/common: Implement xe_engines_is_guc_semaphore_functional() 3220- iris: Fix iris_batch_is_banned() check 3221- anv: Use DRM_XE_VM_BIND_OP_UNMAP_ALL to unbind whole bos 3222- docs/anv: Add recommended GuC firmware version 3223- iris: Set (EXEC_OBJECT_SUPPORTS_48B_ADDRESS | EXEC_OBJECT_PINNED) in a single place 3224- iris: Remove iris_bo::kflags 3225- iris: Move i915 set and get tiling uAPI calls to i915 specific code 3226- iris: Remove more i915_drm.h includes from common code 3227- intel: Move intel_define.h to i915/intel_define.h 3228- intel/common: Remove more i915_drm.h includes from common code 3229- intel/tools/error_decode: Add function to try to open error dump file 3230- intel/tools/error_decode: Simply error message handling 3231- intel/tools/error_decode: Add support to search for Xe KMD error dumps 3232- intel/tools/error_decode: Detect and split error dump file parsing by KMD 3233- intel: Sync xe_drm.h 3234- anv/xe: Add VMs to error dump 3235- iris/xe: Add VMs to error dump 3236- intel/tools/error_decode: Move code that can be shared between i915 and Xe error decoders 3237- intel/tools/error_decode: Parse Xe KMD error dump file 3238- intel/tools: Fix compilation in 32 bits 3239- intel/nullhw: Fix 32bits compilation warnings 3240- iris: Add IRIS_HEAP_DEVICE_LOCAL_CPU_VISIBLE_SMALL_BAR heap type 3241- iris: Force lmem cpu accessible for bos with clear-color 3242- iris/xe: Consider pat_index while unbinding the bo 3243- anv: Call flush_pipeline_select_gpgpu() for compute engines in compute code paths 3244- anv: Skip cmd_buffer_emit_bt_pool_base_address() in blitter and video engines 3245- intel: Drop pre-production steppings 3246- anv: Fix Xe KMD userptr unbind 3247- intel/dev: Nuke 'ver == 10' check 3248- intel/dev: Nuke display_ver 3249- intel: Enable Xe KMD support by default 3250- iris: Set BO_ALLOC_NO_SUBALLOC when allocating bo for slab 3251- anv: Replace the 2 sparse booleans by 1 enum 3252- anv: Set VK_QUEUE_PROTECTED_BIT during queue families initialization 3253- anv: Set VM control to true in Xe KMD 3254- intel/tools/error_decode: Fix parsing in Xe decoder 3255- intel/tools/error_decode: Add function to print batch in Xe decoder 3256- intel/tools/error_decode: Parse HW context in Xe decoder 3257- iris: Move tiling_to_modifier() implementation to i915 folder 3258- iris: Remove i915_drm.h include from iris_indirect_gen.c 3259- intel/decoder: Fix binding table pointer entry being marked as invalid 3260- anv: Set STATE_COMPUTE_MODE mask bit when zeroing compute mode 3261- intel/genxml: Add more instdone registers 3262- intel/genxml/gfx125: Fix definition of INTERFACE_DESCRIPTOR_DATA::Thread group dispatch size 3263- intel/genxml/xe2: Update definition of INTERFACE_DESCRIPTOR_DATA 3264- anv: Create protected engine context when i915 supports vm control 3265- anv: Remove protected memory types from default_buffer_mem_types 3266- intel/tools/error2hangdump: Print out_filename when failed to open it 3267- intel/tools/error2hangdump: Replace drm_i915_gem_engine_class by intel_engine_class 3268- intel/tools: Move Xe KMD error decode functions to a separated file 3269- intel/tools: Move ascii85_decode_char() to error_decode_lib 3270- intel/tools: Move more Xe KMD error decode functions to error_decode_xe_lib 3271- intel/tools/error2hangdump: Move code that will be shared with Xe parser to error2hangdump_lib 3272- intel/tools/error2hangdump: Move i915 parser to a function 3273- intel/tools/error2hangdump: Add Xe KMD support 3274- anv: Add missing ANV_BO_ALLOC_INTERNAL 3275- iris: Add comments to BO_ALLOC flags 3276- iris: Avoid creation of slabs and cache buckets of lmem heaps in integrated gpus 3277- iris: Avoid allocation of not needed iris_bucket_cache 3278- intel/tools/aubinator_error_decode: Move definition of option_color to header 3279- intel/decoder: Add intel_print_group_custom_spacing() 3280- intel/tools: Parse INSTDONE registers in Xe KMD error dump 3281- intel: Sync xe_drm.h 3282- intel/dev: Read GFX IP version during runtime 3283 3284Juan A. Suarez Romero (32): 3285 3286- Revert "v3d: use kmsro to create drm screen on real hw" 3287- v3d: show warning on creating a v3d screen on real hw 3288- v3d/vc4/ci: reset the list of timeout tests 3289- Revert "v3d: show warning on creating a v3d screen on real hw" 3290- broadcom/simulator: protect simulator BO rallocs with mutexes 3291- v3d/ci: run OpenGL 3.1 tests 3292- v3dv/ci: increase timeout for full jobs in 30min 3293- ci: disable Igalia farm 3294- Revert "ci: disable Igalia farm" 3295- Revert "ci: disable Igalia farm" 3296- v3d/ci: update expected results 3297- v3d/ci: update expected list 3298- Revert "v3d/ci: update expected list" 3299- vc4/ci: update expected list 3300- v3d/ci: add new failures 3301- v3dv/ci: update expected list 3302- v3dv/ci: remove crashes from expected list 3303- v3d,v3d: use new simulator 3304- v3dv: disable Early Z for multisampled 16-bit depth buffers 3305- v3d: disable Early Z for multisampled 16-bit depth buffers 3306- broadcom/compiler: fix SFU check for 7.1 3307- v3dv: mark some promoted extensions as supported 3308- v3d: add load_fep_w_v3d intrinsic 3309- v3d: fix line coords with perspective projection 3310- compiler,gallium: move u_reduced_prim to common 3311- v3dv: assume that rasterization state can be NULL 3312- v3dv: enable smooth line rendering 3313- broadcom/ci: add new expected failures 3314- v3d: configure polygon mode when enabled 3315- broadcom/ci: update expected results 3316- v3dv/ci: update expected list 3317- nir/lower_clip: update inputs/ouputs read/written bitmask 3318 3319Juston Li (13): 3320 3321- venus: refactor query feedback cmds 3322- venus: acquire mutex when recycling query feedback cmds 3323- venus: free query batches for VK_COMMAND_POOL_RESET_RELEASE_RESOURCES_BIT 3324- venus: add comments for query feedback batch free list 3325- venus: recycle linked query feedback immediately during submission 3326- venus: handle empty resolved query feedback list 3327- venus: fix image reqs cache store locking 3328- venus: extract cache hash/equals functions into common 3329- venus: image format properties cache 3330- venus: move feedback on empty last batch to prior batch 3331- venus: fix VkDeviceGroupSubmitInfo cmd counts from feedback 3332- venus: extend device format prop cache with VkFormatProperties3 3333- Revert "zink: store last pipeline directly for zink_gfx_program::last_pipeline" 3334 3335Kai Wasserbäch (2): 3336 3337- fix(FTBFS): clc: adapt to new LLVM 19 DiagnosticHandlerTy 3338- fix(FTBFS): clover: adapt to new LLVM 19 DiagnosticHandlerTy 3339 3340Karol Herbst (131): 3341 3342- nak/algebraic: merge run and main function 3343- nak/algebraic: write code to an output file 3344- clc: use spirv triple starting with llvm-17 3345- clc: add support for the native spir-v backend 3346- rusticl/kernel: run opt/lower_memcpy later to fix a crash 3347- rusticl/kernel: add a few comments in regards to pass ordering 3348- rusticl/kernel: no need to reset the scratch size anymore 3349- nir/printf: remove treat_doubles_as_floats 3350- clc: require LLVM-14 3351- clc: merge blocks handling optional features 3352- clc: require LLVM-15 3353- nir: rework and fix rotate lowering 3354- rusticl/program: rework debug logging option 3355- rusticl/spirv: do not attempt to parse spirv after failed link 3356- rusticl/spirv: use bool::then_Some inside SPIRVBin::link 3357- rusticl/program: add clc_validator_options helper function 3358- rusticl/program: add debug option to validate internal spirvs 3359- nak/opt_out: fix comparison in try_combine_outs 3360- nak: simplify phi_dsts 3361- nak: make it compile with clippy 3362- rusticl/meson: use rust_abi instead of rust_crate_type 3363- rust/spirv: fix clippy lint on unneeded late initialization 3364- rusticl/kernel: check that local size on dispatch doesn't exceed limits 3365- nak/meson: specify rust flags globally and allow some clippy lints 3366- nak: fix clippy::extra_unused_lifetimes warnings 3367- nak: fix clippy::mem_replace_with_default warnings 3368- nak: fix clippy::useless_conversion warnings 3369- nak: fix clippy::needless_lifetimes warnings 3370- nak: fix clippy::needless_borrow warnings 3371- nak: fix clippy::while_let_loop warnings 3372- nak: fix clippy::match_like_matches_macro warnings 3373- nak: fix clippy::needless_return warnings 3374- nak: fix clippy::redundant_closure warnings 3375- nak: fix clippy::unwrap_or_default warnings 3376- nak: fix clippy::manual_while_let_some warnings 3377- nak: fix clippy::clone_on_copy warnings 3378- nak: fix clippy::single_match warnings 3379- rusticl/util: add a wrapper around "thread-safe" C types 3380- rusticl/mesa/device: convert to ThreadSafeCPtr 3381- rusticl/mesa/screen: convert to ThreadSafeCPtr 3382- rusticl/mesa: add thread-safe wrapper for pipe_image_views 3383- rusticl/context: store SVM pointers as usize 3384- rusticl/gl: mark GLCtxManager as Send + Sync 3385- rusticl/mem: make Mem Send/Sync by storing mapping ptrs as usize 3386- rusticl/program: mark NirKernelBuild as Send and Sync 3387- meson: remove opencl-external-clang-headers option and rely on shared-llvm 3388- clc: force fPIC for every user when using shared LLVM 3389- nir/lower_cl_images: record image_buffers and msaa_images 3390- rusticl/mem: properly handle buffers 3391- rusticl/mem: support GL_TEXTURE_BUFFER 3392- rust/api: add RustTypes enum 3393- rusticl/util: support nested structs in offset_of! 3394- rusticl/api: allow CLObjectBase to be placed anywhere 3395- rusticl/icd: move get_ref_vec_from_arr into the Rusticl type 3396- rusticl/icd: move refcnt() and get rid of needless atomic ops 3397- rusticl/icd: move retain() and release() 3398- rusticl/icd: move get_arc_vec_from_arr and rename it 3399- rusticl/icd: fold leak_ref into its only consumer 3400- rusticl/icd: move get_ref() 3401- rusticl/device: deduplicate devices with sorting 3402- rusticl/icd: move from_arc() and rename it 3403- rusticl/event: drop from_cl_arr and use arcs_from_arr 3404- rusticl/icd: move get_arc() and rename it 3405- rusticl/icd: split Arc part out of CLObject into new trait 3406- rusticl/device: get rid of pointless Arc overhead 3407- rusticl/icd: actually allow dispatching CL types 3408- rusticl/mem: split into Buffer and Image 3409- rusticl/mem: use pattern matching in is_parent_buffer 3410- rusticl/mem: move fill methods into concrete types 3411- core/memory: drop Arc for &Arc<Queue> function parameters 3412- rusticl/mem: move map methods into concrete types 3413- rusticl/mem: move shadow sync methods into concrete types 3414- rusticl/mem: split unmap into Buffer and Image versions 3415- rusticl/mem: move copy and write buffer impls into Buffer 3416- rusticl/mem: split read_to_user_rect into Buffer and Image versions 3417- rusticl/mem: split write_from_user_rect into buffer and image 3418- rusticl/mem: move copy_to_rect into Buffer 3419- rusticl/mem: split copy_to into Buffer and Image 3420- rusticl/mem: split Buffer::copy_to into Buffer and Image versions 3421- rusticl/mem: split Image::copy_to into Buffer and Image versions 3422- rusticl/mem: get rid of pixel_size 3423- rusticl/mem: move tx_image into Image 3424- rusticl/mem: fold tx_raw into tx 3425- rusticl/image: call tx on the parent buffer directly 3426- rusticl/mem: move is_parent_buffer into Image 3427- rusticl/mem: move tx into Buffer 3428- rusticl/mem: remove get_res 3429- rusticl/mem: move comment describing how mapping works 3430- rusticl/mem: reorganize Image::map 3431- rusticl/mem: move MemBase::map into the users 3432- rusticl/mem: move tx_raw_async methods into Buffer and Image 3433- rusticl/mem: move Buffer and Image specific fields into the subtypes 3434- rusticl/mem: reorganize MemBase::from_gl a little 3435- rusticl/mem: move pipe_image_host_access into Image 3436- rusticl/kernel: recalculate scratch and shared memory after opts 3437- rusticl/program: fix CL_PROGRAM_BINARIES for devs with no builds 3438- meson/rusticl: import rust instead of unstable-rust 3439- clc: include opencl-c.h for extensions needing it 3440- meson: do not pull in clc for clover 3441- intel: Only build shaders with anv and iris 3442- zink: lower unaligned memory accesses 3443- rusticl/context: complete conversion of SVM pointers to usize 3444- rusticl/memory: store host_ptr as usize 3445- rusticl/memory: make closures Send and Sync 3446- rusticl/event: make EventSig Send + Sync 3447- rusticl/spirv: mark SPIRVBin as Send and Sync 3448- rusticl/kernel: make it Send and Sync 3449- rusticl/icd: verify all cl classes are Send and Sync 3450- rusticl/meson: remove -Aclippy::arc-with-non-send-sync flag 3451- rusticl/kernel: make builds private 3452- rusticl/event: we need to call the CL_COMPLETE callback on errors as well 3453- rusticl/kernel: assign sampler locations before DCEing variables 3454- rusticl/device: support query_memory_info to retrieve available memory 3455- drm-uapi: Sync nouveau_drm.h 3456- nvk: use c.get_supported_arguments for compiler flags 3457- nouveau: import libdrm_nouveau 3458- nouveau: call glsl_type_singleton_init_or_ref earlier 3459- nouveau/drm: drop immediate parameter from nouveau_pushbuf_new 3460- nouveau/drm: rely on nouveau_pushbuf::channel being always set 3461- nouveau/drm: drop unsued chan argument from nouveau_pushbuf_kick 3462- nouveau/drm: remove nouveau_client::id 3463- rusticl/util: make create_pipe_box indepentend of pipe_box's field types 3464- meson: fix link failure with llvm-18 3465- rusticl/program: handle -cl-no-subgroup-ifp 3466- nouveau: fix potential double-free in nouveau_drm_screen_create 3467- nir: fix nir_shader_get_function_for_name for functions without names. 3468- rusticl: use stream uploader for cb0 if prefered 3469- rusticl/icd: remove CLObject 3470- event: break long dependency chains on drop 3471- rusticl/mesa/context: flush context before destruction 3472- nir/lower_cl_images: set binding also for samplers 3473 3474Kenneth Graunke (76): 3475 3476- iris: Don't return timestamps modulo 36-bits 3477- intel/dev: Fix typo (ajust -> adjust) 3478- iris: Implement query_memory_info() on discrete cards 3479- intel/nir: Pass devinfo and prog_data to brw_nir_lower_cs_intrinsics 3480- intel: Add driver support for hardware generated local invocation IDs 3481- intel: Use hardware generated compute shader local invocation IDs 3482- driconf: Advertise GL_EXT_shader_image_load_store on iris for SVP13 3483- iris: Implement INTEL_DEBUG=heaps 3484- intel/fs: Don't include sync.nop in instruction count statistics 3485- intel/fs: Don't rely on CSE for VARYING_PULL_CONSTANT_LOAD 3486- intel/brw: Delete enum brw_urb_write_flags 3487- intel/brw: Delete more unused defines 3488- intel/brw: Delete legacy SFIDs 3489- intel/brw: Delete SIMD4x2 URB opcodes 3490- intel/brw: Delete more unused compression stuff 3491- intel/brw: Delete SINCOS 3492- intel/brw: Delete constant_buffer_0_is_relative 3493- intel/brw: Delete compiler->supports_shader_constants 3494- intel/brw: Delete enum gfx6_gather_sampler_wa 3495- intel/brw: Delete brw_wm_prog_key::line_aa 3496- intel/brw: Delete unnecessary brw_wm_prog_data fields 3497- intel/brw: Delete some swizzling functions 3498- intel/brw: Delete brw_eu_util.c 3499- intel/brw: Change unit tests to use TEX_LOGICAL instead of TEX 3500- intel/brw: Delete SHADER_OPCODE_TXF_CMS[_LOGICAL] 3501- intel/brw: Delete SHADER_OPCODE_TXF_UMS 3502- intel/brw: Allow CSE on TXF_CMS_W_GFX12_LOGICAL 3503- intel/brw: Delete legacy texture opcodes 3504- intel/brw: Mark FIND[_LAST]_LIVE_CHANNEL as not writing the flag 3505- intel/brw: Replace CS_OPCODE_CS_TERMINATE with SHADER_OPCODE_SEND 3506- intel/brw: Avoid copy propagating any fixed registers into EOTs 3507- intel/brw: Handle SHADER_OPCODE_SEND without src[3] in copy prop 3508- intel/brw: Add assertions that EOT messages live in g112+ 3509- intel/brw: Copy the smaller payload in fixup_sends_duplicate_payload 3510- intel/brw: Make register coalescing obey the g112-g127 restriction 3511- intel/brw: Call constant combining after copy propagation/algebraic 3512- intel/brw: Remove SIMD lowering to a larger SIMD size 3513- intel/brw: Unindent code after previous change 3514- iris: Fix tessellation evaluation shaders that use scratch 3515- intel/brw: Emit better code for read_invocation(x, constant) 3516- iris: Remove suballocation in iris_flush_resource() 3517- iris: Eliminate prototype introduced in the previous patch 3518- ra: Add debug functions for printing spill costs and benefits 3519- intel/fs: Avoid generating useless UNDEFs for every SSA def 3520- intel/brw: Split out 64-bit lowering from algebraic optimizations 3521- intel/brw: Don't consider UNIFORM_PULL_CONSTANT_LOAD a send-from-GRF 3522- intel/brw: Eliminate top-level FIND_LIVE_CHANNEL & BROADCAST once 3523- intel/brw: Fix check for 64-bit SEL lowering types 3524- intel/brw: Assert that min/max are not happening in 64-bit SEL lowering 3525- intel/brw: Use correct execution pipe for lowering SEL on DF 3526- intel/brw: Unify DF and Q/UQ lowering for MOV 3527- Revert "intel/brw: Don't consider UNIFORM_PULL_CONSTANT_LOAD a send-from-GRF" 3528- intel/brw: Fix opt_split_sends() to allow for FIXED_GRF send sources 3529- intel/brw: Fix register coalescing's LOAD_PAYLOAD dst offset handling 3530- intel/brw: Fix destination stride assertion in copy propagation 3531- intel/brw: Allow changing types for LOAD_PAYLOAD with 1 source 3532- intel/brw: Delete brw_fs_lower_minmax 3533- anv, hasvk: Save the original instance ID 3534- anv, hasvk: Move multiview remapping loop below output stores 3535- anv, hasvk: Fix nir_lower_multiview to re-emit outputs before EmitVertex 3536- intel/brw: Stop checking mlen on math opcodes in CSE pass 3537- intel/brw: Rearrange fs_inst fields 3538- intel/brw: Fix generate_mov_indirect to check has_64bit_int not float 3539- intel/brw: Fix lower_regioning for BROADCAST, MOV_INDIRECT on Q types 3540- intel/brw: Update comments for indirect MOV splitting 3541- intel/brw: Don't mention gfx7 limitations in shuffle comments 3542- intel/brw: Drop dead CHV checks. 3543- intel/brw: Drop align16 support in brw_broadcast() 3544- intel/brw: Drop gfx7 scratch message setup code 3545- intel/brw: Delete if_depth_in_loop 3546- intel/brw: Delete fs_visitor::vgrf helper 3547- intel/brw: Drop default size of 1 from bld.vgrf() calls 3548- intel/brw: Use SHADER_OPCODE_SEND for coherent framebuffer reads 3549- intel/brw: Replace FS_OPCODE_LINTERP with BRW_OPCODE_PLN 3550- intel/brw: Make an fs_builder::SYNC helper 3551- isl: Set MOCS to uncached for Gfx12.0 blitter sources/destinations 3552 3553Konrad Dybcio (1): 3554 3555- freedreno/registers: Add some HWCG regs 3556 3557Konstantin (8): 3558 3559- util/printf: Include stdio.h 3560- util/printf: Expose util_printf_prev_tok 3561- ac/debug: Handle the output of recent umr versions 3562- radv/debug: Canonicalize shader addr 3563- radv: Canonicalize addresses in radv_find_shader 3564- radv/debug: Try to find unbound shaders 3565- radv/debug: Dump descriptor binding information 3566- ac/parse_ib: Always print the value of the whole register 3567 3568Konstantin Seurer (105): 3569 3570- nak/repair_ssa: Remap PHI sources as well 3571- ac/llvm: Enable helper invocations for quad OPs 3572- radv: Vectorize load_global_constant 3573- lavapipe: Fix DGC vertex buffer handling 3574- gallivm: Use saturating fpto*i conversions 3575- lavapipe: Mark vertex elements dirty if the stride changed 3576- lavapipe: Report the correct preprocess buffer size 3577- radv: Implement NIR debug printf 3578- llvmpipe: Stop refcounting sample functions 3579- llvmpipe: Compile sample functioins on demand 3580- radv/rt: Use doubles inside intersect_ray_amd_software_tri 3581- llvmpipe: Fix building with llvm11 3582- nir/print: Don't print shared_size twice 3583- nir/print: Rename workgroup-size to workgroup_size 3584- radv/radix_sort: clang-format 3585- radv: Reduce the amount of radv_device_to_handle calls 3586- radv: Make radv_write_user_event_marker non-static 3587- radv: Emit user events during acceleration structure builds 3588- radv: Skip unused acceleration structure build paths 3589- radv/sqtt: Set SeparateCompiled for monolithic RT pipelines 3590- radv/sqtt: Handle ray tracing pipelines with no traversal shader 3591- radv/rt: Lower ray payloads like hit attribs 3592- radv/rra: Rename rra_chunk_type to rra_chunk_version 3593- radv/rra: Use memcpy for chunk descriptions 3594- radv/rra: Remove useless variable 3595- radv/rra: Refactor error handling 3596- radv/rra: Dump basic ray history tokens 3597- docs: Document RADV_RRA_TRACE_HISTORY_SIZE 3598- radv/rra: Implement ahit/isec counters 3599- amd/common: Use the correct register table for GFX10_3 3600- radv: Wire up ac_gather_context_rolls 3601- zink: Always set mfence->submit_count to the fence submit_count 3602- Revert "zink: always force flushes when originating from api frontend" 3603- llvmpipe: Use full subgroups when possible 3604- gallivm: Consider the initial mask when terminating loops 3605- lavapipe: Advertise VK_KHR_shader_maximal_reconvergence 3606- ci: Update llvmpipe trace checksums 3607- ac/parse_ib: Add and use print_addr 3608- ac/parse_ib: Dump the ADDR field of PKT3_SET_BASE 3609- ac/parse_ib: Annotate addresses with UAF/OOB info 3610- ac/parse_ib: Handle 32bit PKT3_DISPATCH_INDIRECT addrs 3611- ac/parese_ib: Handle more packets 3612- radv/rra: Avoid reading past the ray history buffer 3613- radv/meta: Add shader - device mapping for radv_build_printf 3614- vulkan/cmd_queue: Implement CmdBuildAccelerationStructuresKHR 3615- lavapipe: Implement VK_KHR_acceleration_structure 3616- lavapipe: Add ray traversal code 3617- lavapipe: Implement VK_KHR_ray_query 3618- lavapipe: Advertise VK_KHR_deferred_host_operations 3619- lavapipe: Advertise VK_KHR_acceleration_structure 3620- lavapipe: Advertise VK_KHR_ray_query 3621- lavapipe/ci: Document ray query failures 3622- docs: Document lavapipe ray tracing features 3623- vulkan: Implement DebugMarkerSetObjectNameEXT 3624- radv/rt: Implement RADV_DEBUG=shaderstats 3625- radv/rt: Add radv_ray_tracing_stage_info 3626- radv/rt: Fixup constant args 3627- aco: Only fix used variables to registers 3628- radv/rt: Avoid passing unused data to the next stage 3629- radv/rt: Inline constant trace_ray srcs into the traversal shader 3630- radv/rt: Inline constant information about ray flags 3631- radv/rt: Fix raygen_imported condition 3632- zink: Handle aoa derefs of images 3633- ac: Annotate context rolls 3634- ac/parse_ib: Replace the parameter list with ac_ib_parser 3635- ac/parse_ib: Implement annotations 3636- radv: Add support for IB annotations 3637- radv: Add an IB annotation layer 3638- ac: Improve context roll readability 3639- radv: Use radv_buffer_map for parsing IBs 3640- radv/rt: Use 32-bit offsets for load_sbt_entry 3641- radv: Skip more acceleration structure build markers 3642- radv/printf: Use fprintf instead of printf 3643- nir/print: Fix printing booleans with bit_size>1 3644- nir/serialize: Encode data for temporaries 3645- nir: Add lavapipe ray tracing intrinsics 3646- llvmpipe: Fix function call handling 3647- lavapipe: Add lvp_spirv_to_nir 3648- lavapipe: Make lvp_shader_init non-static 3649- lavapipe: Make lvp_create_pipeline_nir non-static 3650- lavapipe: Lower mem_constant variables 3651- lavapipe: Defer binding compute state 3652- lavapipe: Remove unused ray tracing variables 3653- lavapipe: Add more ray tracing helpers 3654- lavapipe: Pass lvp_ray_flags into lvp_aabb_intersection_cb 3655- lavapipe: Use the pipeline type in get_pcbuf_size 3656- lavapipe: Inline fill_ubo0 3657- lavapipe: Add an api_stage parameter to update_pcbuf 3658- lavapipe: Fix a memory leak in lvp_push_internal_buffer 3659- lavapipe: Implement VK_KHR_ray_tracing_pipeline 3660- lavapipe: Implement KHR_ray_tracing_maintenance1 3661- lavapipe: Implement VK_EXT_pipeline_library_group_handles 3662- lavapipe: Implement VK_KHR_ray_tracing_position_fetch 3663- radv: Destroy leaf_updateable_pipeline 3664- lavapipe: Handle accel struct queries in handle_copy_query_pool_results 3665- lavapipe: Implement ray_tracing_maintenance1 queries 3666- lavapipe: Do nort use NIR_PASS during lowering 3667- lavapipe: Handle multiple planes in GetDescriptorEXT 3668- lavapipe: Explicitely support ycbcr formats 3669- Revert "gallivm/ssbo: mask offset with exec_mask instead of building the 'if'" 3670- radv: Handle all dependencies of CmdWaitEvents2 3671- nir/print: Do not access invalid indices of load_uniform 3672- radv: Fix radv_shader_arena_block list corruption 3673- radv: Remove arenas from capture_replay_arena_vas 3674- radv: Zero initialize capture replay group handles 3675 3676Krzysztof Kurek (1): 3677 3678- panfrost: fix shift overflow in \`bi_fold_constant` 3679 3680Leo Liu (2): 3681 3682- radeonsi: fix video processing path without VPE enabled 3683- ac/gpu_info: Fix broken UVD firmware query 3684 3685Lepton Wu (1): 3686 3687- llvmpipe: Set "+64bit" for X86_64 3688 3689Lin, Ricky (1): 3690 3691- amd/vpelib: Rename the parameters of init vpe function 3692 3693Lionel Landwerlin (186): 3694 3695- anv: fix disabled Wa_14017076903/18022508906 3696- intel/aux_map: fix fallback unmapping range on failure 3697- anv: hide vendor ID for The Finals 3698- intel/decoder: make vertex data decoding optional 3699- intel/decoder: don't ignore BT entries at offset 0 3700- intel/genxml: add CCS_INSTDONE register 3701- intel/genxml: add GAM done register description 3702- intel/hang_viewer: add aux-tt view 3703- anv: export descriptor flushing functions 3704- anv: fix include guards 3705- anv: fix missing header 3706- anv: move generated draw flush helper to its own file 3707- anv: move draw commands to their own file 3708- anv: move compute/ray-tracing commands to their own file 3709- anv: rename video command file 3710- nir/alu_srcs_negative_equal: bail earlier if possible 3711- nir/comparison_pre_tests: update expectations 3712- anv: using a single struct for kernel upload 3713- anv: fix pipeline executable properties with graphics libraries 3714- isl: add print helpers for debug 3715- anv: implement undocumented tile cache flush requirements 3716- anv: reorder anv_astc_emu.c 3717- anv: remove unused perfetto declarations 3718- anv: rename layers entrypoints 3719- anv: add BO flag for internal driver allocations 3720- anv: track total state stream allocated blocks from the pool 3721- anv: track imported ray tracing pipeline groups 3722- anv: initial RMV support 3723- vulkan/runtime: handle new image layout 3724- anv: don't prevent L1 untyped cache flush in 3D mode 3725- anv: promote EXT_index_type_uint8 to KHR 3726- anv: promote EXT_line_rasterization to KHR 3727- anv: promote EXT_load_store_op_none to KHR 3728- anv: add missing alignment for AUX-TT mapping 3729- intel/ds: track predication of blorp operations 3730- vulkan/runtime: add helper to query attachment layout 3731- anv: ensure consistent layout transitions in render passes 3732- anv: add check that in renderpass barriers apply to attachments 3733- anv: handle image feedback loop usage 3734- anv: implement VK_EXT_attachment_feedback_loop_dynamic_state 3735- anv/hasvk: don't report error when intel_get_device_info_from_fd fails 3736- anv: factor out aux-tt binding logic for future reuse 3737- anv: rename aux_tt image field 3738- anv: retain ccs image binding address 3739- anv: fix transfer barriers flushes with compute queue 3740- vulkan/runtime: handle new dynamic states for attachment remapping 3741- docs/features: drop gen8+/gen9+ on Anv 3742- docs/features: synchronize new features for Anv 3743- vulkan/multialloc: bump max number to 16 3744- vulkan/runtime: rework VK_KHR_dynamic_rendering_local_read state tracking 3745- anv: reduce cache flushing for indirect commands on Gfx12.5+ 3746- anv: don't unmap AUX ranges at BO delete 3747- isl: printout sparse usage 3748- isl: add a no-aux-align usage flag 3749- anv: move ALLOC_HOST_CACHED_COHERENT as define 3750- anv: use address helper to compute address u64 value 3751- intel/aux_map: add BSpec reference 3752- intel/aux_map: add helper to compute offset in aux data 3753- anv: re-introduce BO CCS allocations 3754- intel/dev: fix missing dependency on generated packing heaers 3755- anv: factor out post submit queue debug code 3756- intel/fs: indent lowering code to make it more readable 3757- intel/fs: rerun divergence prior to lowering non-uniform interpolate at sample 3758- anv: fix incorrect flushing on shader query copy 3759- meson: add a new option to enable intel-clc without building RT shaders 3760- intel/compiler: make default NIR compiler options visible 3761- intel-clc: move ISA generation to its own function 3762- intel/clc: add ability to output NIR 3763- intel-clc: print text input 3764- genxml: enable opencl code generation 3765- genxml: generate opencl packing headers 3766- genxml: remove NDEBUG_UNUSED 3767- intel/ds: new tracepoints for generated commands 3768- meson: add option to install intel-clc 3769- ci: build a host version of mesa for cross builds 3770- anv: rewrite internal shaders using OpenCL 3771- intel/shaders: add iris variant of indirect draws generation shader 3772- intel/shaders: enable gfx8 support 3773- iris: make binding table shifting values available outside iris_state.c 3774- iris: make KSP helper available outside iris_state.c 3775- iris: make URB programming available outside iris_state.c 3776- iris: factor out index buffer emission 3777- iris: add an option for not emit draw parameters 3778- iris: enable generated indirect draws 3779- meson: enforce build of intel-clc with anv/iris 3780- anv: remove redundant asserts 3781- anv: don't allocate aux padded BOs with host pointers 3782- anv: fix buffer marker cache flush issues on MTL 3783- anv: enable query clear/copy using shaders on MTL/ARL 3784- anv: fixup push descriptor shader analysis 3785- anv: factor out descriptor buffer flushing 3786- anv: reenable ANV_ALWAYS_BINDLESS 3787- anv: remove unused definition 3788- anv: fix Wa_16013994831 macros 3789- anv: fix emission of Wa_14015055625 3790- genxml: generate opencl temporary variables with private qualifier 3791- intel/clc: lower temp function/shader variables together 3792- intel/clc: workaround LLVM17 opaque pointers 3793- anv: disable Wa_16013994831 3794- ci/anv: add more testing for optimization paths 3795- intel/ci: bump anv/tgl fraction to 6 3796- intel/nir: only consider ray query variables in lowering 3797- anv: limit depth flush on dynamic render pass suspend 3798- anv: add missing generated file dep 3799- anv: optimize push descriptor updates 3800- anv: add new heap/pool for descriptor buffers 3801- anv: create new helper for small allocations 3802- anv: add a second dynamic state heap for descriptor buffers 3803- anv: move aux-tt to general state pool 3804- anv: allocate slice_hash for descriptor buffer 3805- anv: allocate border colors for descriptor buffers 3806- anv: allocate fsr states for descriptor buffer 3807- anv: implement data write entry points for EXT_descriptor_buffer 3808- anv: compute a sampler hash based on parameters 3809- anv: add embedded sampler parameters in descriptor set layout hash 3810- intel/fs: add plumbing for embedded samplers 3811- nir: add additional flag to resource_intel for embedded samplers 3812- anv: add embedded sampler support 3813- anv: add new helper to update binding table pool offset 3814- anv: add descriptor set layout support for descriptor buffers 3815- anv: add pipeline/shader support for descriptor buffers 3816- anv: handle push descriptor writes with descriptor buffers 3817- anv: implement descriptor buffer binding 3818- anv: disable mutable combined image/sampler in descriptor buffer 3819- anv: expose VK_EXT_descriptor_buffer 3820- anv: fix non matching image/view format attachment resolve 3821- anv: fix helper usage for CmdUpdateBuffer() 3822- anv: remove some wrapping around mmap 3823- anv: add support for VK_EXT_map_memory_placed 3824- anv: delay internal shader upload to when needed 3825- anv: fix companion command buffer initialization 3826- anv: fix incorrect ISL usage in buffer view creation 3827- anv/iris/blorp: use the right MOCS values for each engine 3828- anv: try to keep the pipeline in GPGPU mode when buffer transfer ops 3829- anv: don't copy the null descriptor from the GPU memory 3830- intel/fs: fixup sampler header message 3831- anv: return unsupported for FSR images on Gfx12.0 3832- intel/fs: remove some unused send helpers 3833- anv: ignore descriptor alignment for inline uniforms 3834- intel/fs: bump max simd size of some messages for xe2 3835- anv: track embedded sampler counts in layouts 3836- anv: allocate pipeline bindings tables dynamically on the heap 3837- anv: avoid partially compiled warning with GPL 3838- blorp: handle a few allocation failure cases 3839- anv: fix invalid border color free 3840- anv: fix block pool allocation failure 3841- anv: fix temporary state pool allocation failures 3842- anv: fix bitfield checks in gfx runtime flushing 3843- anv: fix query clearing with blorp compute operations 3844- blorp: add support for cached dynamic states 3845- anv: reduce blorp dynamic state emissions 3846- anv: optimize emission of dynamic state with blorp 3847- anv: fix protected memory allocations 3848- anv: pull surface state copies for secondary in one loop 3849- anv: disable protected content around surface state copies 3850- anv: disable generated draws in protected command buffers 3851- anv: update protection fault property 3852- anv: fix incorrect blorp dynamic state heap usage 3853- intel/fs: printout a couple of more late compile steps 3854- intel/fs: fixup instruction scheduling last grf write tracking 3855- anv: add missing data flush out of L3 for transform feedback writes 3856- anv: mark descriptors & pipeline dirty after blorp compute 3857- isl: set NullPageCoherencyEnable for depth/stencil sparse surfaces 3858- anv: only check patch_control_points changes in runtime flush 3859- anv: increase maxResourceDescriptorBufferRange on DG2+ 3860- anv: reuse vk_common_GetImageSubresourceLayout 3861- anv: move all format props checks to anv_get_image_format_properties() 3862- drirc: rename hasvk only option 3863- vulkan: track compression control flags on vk_image 3864- anv: implement VK_EXT_image_compression_control 3865- anv: disable capture replay with descriptor buffer 3866- anv: remove useless dynamic state allocation for samplers 3867- anv: add capture/replay support for image with descriptor buffers 3868- anv: add capture/replay support for buffer with descriptor buffers 3869- anv: add a new reserved pool for capture/release 3870- anv: enable shader border color capture/replay 3871- anv: enable capture/replay with descriptor buffers 3872- anv: disable dual source blending state if not used in shader 3873- intel/brw: fixup wm_prog_data_barycentric_modes() 3874- anv: fixup alloc failure handling in reserved_array_pool 3875- anv: fix leak of custom border colors 3876- anv: fix ycbcr plane indexing with indirect descriptors 3877- brw: add more condition for reducing sampler simdness 3878- anv: fix push constant subgroup_id location 3879- nir/divergence: add missing load_printf_buffer_address 3880- anv: use weak_ref mode for global pipeline caches 3881 3882Louis-Francis Ratté-Boulianne (5): 3883 3884- panfrost: factor out method to check whether we can discard resource 3885- panfrost: add copy_resource flag to pan_resource_modifier_convert 3886- panfrost: add can_discard flag to pan_legalize_afbc_format 3887- panfrost: Legalize before updating part of a AFBC-packed texture 3888- panfrost: Add AFBC packing support for RG formats 3889 3890Luc Ma (1): 3891 3892- gallium/u_blitter: Fix a few uninitialized fb_state 3893 3894Luca Bacci (1): 3895 3896- meson,windows: Use relative paths in Vulkan ICD manifest files 3897 3898Lucas Fryzek (19): 3899 3900- egl/wayland/sw: don't invert y \`wl_surface_damage_buffer` 3901- drisw/winsys: Flip y coordinate when creating pipe boxes 3902- drisw: clamp damage region to texture bounds 3903- llvmpipe: explicitly reject (most) yuv formats 3904- gallium: Add dmabuf arg to memory fd allocation API 3905- llvmpipe: Implement dmabuf handling 3906- drisw: reuse kopper image extension vtable if modifiers/dmabuf is supported 3907- llvmpipe: conditionally export PIPE_CAP_DMABUF 3908- lavapipe: support VK_EXTERNAL_MEMORY_HANDLE_TYPE_DMA_BUF_BIT_EXT 3909- lavapipe: EXT_external_memory_dma_buf 3910- llvmpipe: make it possible to import and bind unbacked resources 3911- lavapipe: include drm_fourcc.h 3912- lavapipe: check drm modifier info during image create 3913- lavapipe: EXT_image_drm_format_modifier 3914- venus/ci: Add patch for modifiers test to check import/export bits 3915- lp: Wrap udmabuf usage in HAVE_LIBDRM ifdef 3916- ci/lp: Remove ext buffer YUV tests from fails 3917- llvmpipe: Only return null resource handle when dt is not mapped 3918- llvmpipe: Only use udmabuf if header is found 3919 3920Lucas Stach (18): 3921 3922- etnaviv: disable 64bpp render/sampler formats 3923- etnaviv: track resource sharing 3924- etnaviv: only add shared resources to implicit flush list 3925- etnaviv: implicitly update shared texture resources 3926- etnaviv: don't use int filter for depth textures 3927- etnaviv: tex_desc: emit texture comparator 3928- etnaviv: fix fixpoint conversion of negative values 3929- ci/etnaviv: update expectations 3930- etnaviv: fix depth writes without testing 3931- etnaviv: rs: take src dimensions into account when increasing height alignment 3932- etnaviv: use correct blit box sizes when copying resource 3933- etnaviv: fix separate depth/stencil clears 3934- etnaviv: trigger TS derivation after slow clear 3935- etnaviv: split TS and non-TS RS clear commands 3936- etnaviv: ci: update expectation with fixed depth/stencil clears 3937- etnaviv: rs: treat depth-only clear to X8Z24 surfaces as full clear 3938- ci/etnaviv: update expectation after piglit uprev 3939- etnaviv: flip the switch on MSAA support 3940 3941Luigi Santivetti (1): 3942 3943- pvr: return the OS page size for minMemoryMapAlignment 3944 3945Lynne (1): 3946 3947- radv/av1: limit profile and bit depth to supported values 3948 3949M Henning (10): 3950 3951- nvk: Don't clobber vb0 after repeated blits 3952- nak: Remove assert on nir->info.outputs_written 3953- nvk: Early-out impossible descriptor allocations 3954- nak: Fix ldg/stg/atomg encoding to use globalmem 3955- nak: Set fewer bits in writes_color 3956- nak: Use undef for unset FSOut components 3957- nak: Remove old union_find implementation 3958- nak: Rewrite union_find and use it in repair_ssa 3959- nak: Count GLOBAL_SIZE_OFFSET in bytes, not words 3960- nvk: Don't use a descriptor cbuf if it's too large 3961 3962Manuel Stoeckl (1): 3963 3964- util/disk_cache: try getenv(HOME) before getpwuid->pw_dir 3965 3966Marcin Ślusarz (1): 3967 3968- intel/compiler/xe2: fix decoding of sampler simd mode 3969 3970Marek Olšák (244): 3971 3972- gallium/u_vbuf: replace unnecessary dst_index with "i" 3973- gallium: remove unbind_trailing_count from set_vertex_buffers 3974- cso: don't unbind vertex buffers when enabling/disabling u_vbuf 3975- winsys/amdgpu: merge loops decrementing num_active_ioctls & unreferencing bufs 3976- winsys/amdgpu: cosmetic touchups 3977- winsys/amdgpu: don't clear buffer list elements after IB submission 3978- winsys/amdgpu: add more fence_reference helpers 3979- winsys/amdgpu: don't clear fence list elements after IB submission 3980- winsys/amdgpu: remove misplaced duplicated comment 3981- winsys/amdgpu: represent IB_MAX_SUBMIT_DWORDS in bytes 3982- winsys/amdgpu: represent max_ib_size_dw in bytes 3983- winsys/amdgpu: cosmetic touchups around IB sizes 3984- amd: unify NIR options between RADV and radeonsi 3985- ac/nir: don't write TCS outputs to memory if no_varying is set 3986- ac/nir: rename clipdist_enable_mask -> clip_cull_dist_mask 3987- ac/nir: optimize out multiplications in small line culling 3988- ac/nir: simplify code at the beginning of ac_nir_gs_shader_query 3989- ac,radeonsi: emulate GS primitive pipeline stat on gfx11 because of culling 3990- radeonsi: report more detailed output stats for shader-db 3991- radeonsi: expose shader profiles to other .c files 3992- radeonsi: don't use staging uploads for buffers & shaders with all VRAM visible 3993- radeonsi: deduplicate gfx10_ngg_get_vertices_per_prim / get_num_vert_per_prim 3994- radeonsi: change GS_STATE_PROVOKING_VTX_INDEX to 1 bit PROVOKING_VTX_FIRST 3995- radeonsi: split si_update_ngg_prim_state_sgpr into 2 functions 3996- radeonsi: pack GS_STATE_ESGS_VERTEX_STRIDE better to save 2 bits 3997- radeonsi: remove no-op additions for viewport0_y_inverted 3998- radeonsi: remove unused preloaded instance_divisor_constbuf 3999- radeonsi: rename \*trivial_vs_prolog -> \*trivial_vs_inputs 4000- radeonsi/gfx11: clean up MAX_ALLOWED_TILES_IN_WAVE programming 4001- radeonsi/ci: update gfx11 flakes 4002- radeonsi/gfx11: flush DB before Z/S clear to work around dEQP failures 4003- radeonsi: don't flush CS before and after every blitter invocation 4004- mesa,gallium: move the thread scheduler to src/util 4005- gallium: rename PIPE\_.._PIN_THREADS_TO_L3_CACHE -> .._UPDATE_THREAD_SCHEDULING 4006- st/mesa: rename ST_L3_PINNING_DISABLED -> ST_THREAD_SCHEDULER_DISABLED 4007- util: add mesa_pin_threads environment variable that sets a static affinity mask 4008- glthread: apply the thread scheduling policy when the context is created 4009- glthread: apply the thread scheduling policy when a batch executes synchronously 4010- gallium/hud: add "csv" option to print values to stdout as CSV 4011- nir: remove INTERP_MODE_COLOR 4012- nir: relax validation failure for generic TCS outputs with no_varying 4013- nir: remove and replace underused option pack_varying_options 4014- nir: replace lower_io_variables with a GLSL NIR flag 4015- nir: add a lower_mediump_io callback into options 4016- nir: add vertex divergence into nir_divergence_analysis 4017- winsys/amdgpu: fix a race condition when reading ws->num_buffers 4018- winsys/amdgpu: add real buffers of slab entries in the CS thread 4019- winsys/amdgpu: change the signature of amdgpu_add_bo_fences_to_dependencies 4020- winsys/amdgpu: move code out of amdgpu_add_bo_fences_to_dependencies for reuse 4021- winsys/amdgpu: merge 2 loops iterating over slab entries in amdgpu_cs_submit_ib 4022- winsys/amdgpu: merge 2 loops iterating over sparse BOs in amdgpu_cs_submit_ib 4023- winsys/amdgpu: merge 2 loops iterating over real BOs in amdgpu_cs_submit_ib 4024- winsys/amdgpu: skip code checking RADEON_USAGE_SYNCHRONIZED for slabs 4025- winsys/amdgpu: simplify amdgpu_do_add_buffer to remove memset 4026- winsys/amdgpu: don't ref/unref slab BOs in amdgpu_cs_submit_ib 4027- radeonsi: use num_vertex_buffers instead of ARRAY_SIZE 4028- radeonsi/ci: add gfx11 flakes 4029- gallium: always set vertex elements before setting vertex buffers 4030- gallium/u_blitter: set take_ownership=true for set_vertex_buffers 4031- st/mesa: set take_ownership=true for set_vertex_buffers in st_draw_quad 4032- gallium/util: add take_ownership parameter into util_draw_vertex_buffer 4033- st/mesa: set take_ownership=true for util_draw_vertex_buffer in st_DrawTex 4034- st/mesa: set take_ownership=true for set_vertex_buffers in st_pbo_draw 4035- gallium/hud: set take_ownership=true for set_vertex_buffers 4036- cso: remove CSO_UNBIND_VERTEX_BUFFER0 4037- gallium/u_threaded: remove the count=0 path from tc_call_set_vertex_buffers 4038- gallium/u_threaded: allow drivers to change tc_call_set_vertex_buffers function 4039- gallium: remove take_ownership from set_vertex_buffers, assume it's true 4040- gallium/noop: don't leak resources due to take_ownership 4041- radeonsi,aco: remove the VS prolog 4042- gallium/u_threaded: expose helpers for filling set_vertex_buffers externally 4043- st/mesa: rename attribs -> arrays in st_atom_array to indicate non-zero strides 4044- st/mesa: do (inputs_read & enabled_arrays) outside setup_arrays 4045- st/mesa: do (inputs_read & ~enabled_arrays) outside st_setup_current 4046- st/mesa: move a piece of _mesa_draw_array_attrib out of the loop in setup_arrays 4047- st/mesa: cosmetic touchups in st_atom_array.cpp 4048- st/mesa: change the update enum of vertex elements 4049- st/mesa: move st_update_functions into st_context 4050- st/mesa: constify the pipe_draw_info parameter and remove obsolete comments 4051- mesa: inline {Create,Draw}GalliumVertexState callbacks 4052- mesa: inline _mesa_set_vao_immutable 4053- mesa: add gl_vertex_array_object::NonIdentityBufferAttribMapping 4054- util/idalloc: make deleting invalid IDs a no-op 4055- mesa: remove unused _mesa_HashTable code 4056- mesa: clean up unnecessary _mesa_HashTable locked/unlocked wrappers 4057- mesa: re-format main/hash.h, move inlines to the end, some code to main/hash.c 4058- mesa: fold _mesa_HashDeleteAll into _mesa_DeleteHashTable 4059- mesa: remove _mesa_HashTable::InDeleteAll 4060- st/mesa: merge 3 unlikely blocks in _mesa_get_bufferobj_reference 4061- st/mesa: remove !obj checking in _mesa_get_bufferobj_reference when it's useless 4062- mesa: fix incorrect _mesa_HashInsertLocked parameter in _mesa_EndList 4063- mesa: use util_idalloc_alloc_range for _mesa_HashFindFreeKeyBlock 4064- winsys/amdgpu: convert amdgpu_cs.c to .cpp 4065- winsys/amdgpu: enable unlimited number of parallel queues for VCN 4066- util/idalloc: optimize foreach by tracking the greatest non-zero element 4067- mesa: declare _mesa_HashTable::id_alloc as non-pointer 4068- mesa: declare _mesa_HashTable inside structures instead of as a pointer 4069- mesa: remove isGenName parameter from _mesa_HashInsert 4070- mesa: use util_idalloc_foreach for looping in _mesa_HashTable 4071- mesa: replace _mesa_HashTable::ht with util_sparse_array for faster lookups 4072- d3d12: make DrawTransformFeedback not depend on the vertex buffer offset 4073- mesa: don't use the slow VAO path except for drivers that want to use it 4074- st/mesa: add VAO fast path C++ template variants for st_update_array callback 4075- st/mesa: optimize st_update_arrays using lots of C++ template variants 4076- glthread: re-enable thread scheduling in st/mesa when glthread is disabled 4077- glthread: use _mesa_glthread_fence_call() instead of duplicating that code 4078- glthread: add no_error variants of glDrawElements* 4079- glthread: add no_error variants of glDrawArrays* 4080- glthread: remove cmd_size from constant-sized calls 4081- glthread: clean up how vertex stride is packed 4082- glthread: pack "size" in Pointer calls as 16 bits 4083- mesa: deduplicate get_index_size_shift code 4084- mesa: deduplicate is_index_type_valid code 4085- glthread: pack the primitive type to 8 bits 4086- glthread: pack the index type to 8 bits 4087- glthread: rewrite glDrawElements call packing 4088- glthread: rewrite glDrawArrays call packing 4089- glapi: fix type names for glthread and handle all types 4090- glthread: sort fixed-sized parameters before returning them 4091- glthread: move global marshal_XML.py functions into class marshal_function 4092- glthread: precompute fixed_params and variable_params lists 4093- gltrhead: merge 3 blocks conditional on marshal_sync in print_async_body 4094- glthread: separate unmarshal function generation into print_unmarshal_func 4095- glthread: separate marshal code generation into print_marshal_async_code 4096- glthread: remove "if True" from print_marshal_async_code 4097- glapi: pass pointer size to python for glthread from meson 4098- glthread: pack glVertexAttribPointer calls better 4099- glthread: fix multi draws with a negative draw count 4100- glthread: pack uploaded user vertex buffers and offsets better 4101- glthread: deduplicate batch finalization code 4102- glthread: don't check cmd_size for small variable-sized calls 4103- glthread: use marshal_count instead of count for more functions 4104- glthread: rewrite glBindBuffer packing 4105- glthread: add a packed variant of glDrawElements with 16-bit count and indices 4106- glthread: add a packed version of DrawElementsUserBuf 4107- glthread: generate packed versions of gl*Pointer/Offset calls 4108- amd: update addrlib 4109- mesa: deduplicate initialization of gl_pixelstore_attrib 4110- mesa: move struct gl_pixelstore_attrib into glthread.h 4111- glthread: track glPixelStore(GL_UNPACK_*) 4112- glthread: execute small glBitmap asynchronously 4113- glthread: execute small glDrawPixels asynchronously 4114- glthread: invert _mesa_glthread_has_no_{un}pack_buffer by removing the negation 4115- amd/registers: add correct gfx11.x enums for BINNING_MODE 4116- radeonsi: disable binning correctly on gfx11.5 4117- radeonsi/gfx11: fix programming of PA_SC_BINNER_CNTL_1.MAX_ALLOC_COUNT 4118- radeonsi/gfx10.3: add a GPU hang workaround for legacy tess+GS 4119- radeonsi: allocate only one set of tessellation rings per device 4120- radeonsi/gfx11: program the attribute ring right before draws 4121- radeonsi: program tessellation rings right before draws 4122- radeonsi/gfx11: program SAMPLE_MASK_TRACKER_WATERMARK optimally for APUs 4123- ac: use the gfx11 shadowed register tables for gfx11.5 4124- radeonsi/gfx11: add missing DCC_RD_POLICY setting 4125- radeonsi: add radeonsi_cache_rb_gl2 option enabling GL2 caching for CB and DB 4126- nir/divergence_analysis: change function prototypes 4127- nir/divergence_analysis: load_primitive_id is convergent within a primitive 4128- nir/divergence_analysis: load_instance_id is convergent within a primitive 4129- nir/divergence_analysis: handle derefs of system values 4130- nir: print nir_io_semantics::invariant 4131- nir: add nir_block::divergent to indicate a divergent entry condition 4132- ac/llvm: fix SSBO bounds checking by using raw instead of struct opcodes 4133- radeonsi: fix the DMA compute shader 4134- radeonsi: don't test so many wave limits for AMD_TEST=testdmaperf 4135- nir: add a utility computing post-dominance of SSA uses 4136- nir: add nir_opt_varyings, new pass optimizing and compacting varyings 4137- nir/tests: add tests for nir_opt_varyings 4138- radeonsi: set the lower_mediump_io callback for GLSL 4139- radeonsi: set trivial NIR options for nir_opt_varyings 4140- radeonsi: enable uniform propagation for varyings except VP/Energy 4141- radeonsi: add test failures due to incorrect tests for nir_opt_varyings 4142- st/mesa: get dual slot input info from NIR if IO is lowered 4143- st/mesa: lower sysvals slightly sooner 4144- st/mesa: skip a few NIR passes that don't work with lowered IO 4145- glsl/linker,st/mesa: enable nir_opt_varyings and lower IO in the linker 4146- amd/ci: update stoney results 4147- r300: port scanout pitch alignment from the DDX to fix DRI3 4148- r300: enable tiling for scanout to fix DRI3 performance 4149- radeonsi/ci: run GLCTS, ESCTS, and dEQP from the glcts directory 4150- radeonsi/ci: update failures 4151- Unbreak Viewperf by reverting "util: use crc32_z instead of crc32 and bump zlib dep to 1.2.9" 4152- gallium: use u_box_3d to initialize pipe_box instead of non-designated initializers 4153- gallium: increase the size of pipe_box y, height fields to allow bigger textures 4154- nir: rename AMD XFB intrinsics to \*_gfx11_amd 4155- nir,amd: add nir_intrinsic_load_debug_log_desc_amd and its use 4156- aco: implement aco_is_gpu_supported using switch statement 4157- aco: add a helper printing shader asm by disassembling via LLVM 4158- ac/llvm: remove remnants of gfx10 NGG streamout 4159- radeonsi: implement the shader debug log from ac_nir_store_debug_log_amd 4160- nir/validate: validate interp_mode of load_barycentric_* 4161- nir/lower_io: add nir_io_semantics::interp_explicit_strict 4162- nir/validate: validate more fields of nir_io_semantics 4163- tgsi_to_nir: translate TG4 4164- nir/opt_varyings: don't generate IO with unsupported bit sizes 4165- nir/opt_varyings: simplify nir_io_semantics::num_slots of directly-indexed slots 4166- nir/opt_varyings: handle load_input_vertex 4167- ac/surface: add radeon_surf::thick_tiling 4168- ac/nir: allow 16-bit results for resinfo 4169- ac/llvm: simplify extracting an element in get_image_coords 4170- ac/llvm: add support for 16-bit coordinates (A16) for image (non-sampler) opcodes 4171- ac/llvm: allow image loads to return less than 4 components, trim DMASK 4172- ac/llvm: remove handling of input and output loads/stores that are lowered 4173- ac/llvm: remove unused fields of ac_shader_abi 4174- ac/llvm: simplify the optimization barrier and apply it to the whole vector 4175- ac: add helper ac_get_ip_type_string to remove duplication 4176- nir: add more build helpers 4177- nir: allow FP16 in nir_format_linear_to_srgb 4178- nir: add nir_intrinsic_optimization_barrier_sgpr_amd 4179- nir: change "user_data_amd" sysval from 4 to 8 components 4180- nir/use_dominance: set the root as post-dominator of unmovable instructions 4181- util: add new format helpers 4182- util: import pipe_box and its helpers 4183- ac/llvm: fix assertions for texture instructions with 16-bit LOD bias 4184- ac/llvm: always trim components of texture instructions, trim DMASK 4185- ac/surface: constify and reindent NIR meta address-from-coord function params 4186- radeonsi/ci: update gfx11 failures 4187- radeonsi/gfx11: don't prefetch constants in binaries into the instruction cache 4188- radeonsi/gfx11: enable DCC fast clears for 8-bit and 16-bit formats 4189- radeonsi: use the same nir_lower_subgroups_options as RADV 4190- radeonsi: add the radeonsi_optimize_io option into the shader cache key 4191- radeonsi: check has_stable_pstate in the winsys 4192- radeonsi: move TCS epilog key bits to the key->ge.opt section 4193- radeonsi: fix initialization of occlusion query buffers for disabled RBs 4194- radeonsi: don't expose samples_identical and don't lower FMASK if it's disabled 4195- radeonsi: allow input NIR to use descriptors in image opcodes 4196- radeonsi: move blitter resource_copy_region implementation to si_gfx_copy_image 4197- radeonsi: move blitter clear_render_target impl into si_gfx_clear_render_target 4198- radeonsi: preserve NaNs in draw-based resource_copy_region 4199- radeonsi: use simpler UINT fallback formats for draw-based resource_copy_region 4200- radeonsi: remove si_use_compute_copy_for_float_formats 4201- radeonsi: change allow_flat_shading to make it a single condition 4202- radeonsi: don't call resource_copy_region in pipe->blit 4203- radeonsi/gfx11: implement DCC clear to "single" for fast non-0/1 clears 4204- radeonsi: disable VRS flat shading for selected 8xMSAA and thick tiling cases 4205- radeonsi: don't use si_get_flush_flags() for flushing images 4206- radeonsi: don't flush CB in si_launch_grid_internal_images if not needed 4207- radeonsi: don't flush CB and DB if there have been no draw calls 4208- radeonsi: enable fast FB clears for conditional rendering 4209- radeonsi: make clear_render_target clear DCC directly instead of via pipe->clear() 4210- radeonsi: don't add whether NIR is used into the shader key 4211- radeonsi: only expose 8 EQAA samples due to shader limitations 4212- radeonsi: always run nir_opt_16bit_tex_image 4213- radeonsi: use ip_type in debug code instead of hardcoding GFX 4214- radeonsi: implement user_data_amd for 5, 6, and 7 components correctly 4215- util: shift the mask in BITSET_TEST_RANGE_INSIDE_WORD to be relative to b 4216 4217Mark Collins (32): 4218 4219- tu/kgsl: Fix sync_wait'd FD in kgsl_syncobj_wait 4220- tu/a7xx: Update CCU layout logic for A7XX 4221- tu: Allow GMEM on A7XX when TU_DEBUG=gmem 4222- tu: Set A7XX registers in \`tu6_tile_render_begin` 4223- tu: Set \`CP_THREAD_CONTROL::CONCURRENT_BIN_DISABLE` in A7XX HW init 4224- tu: Only set PC/VFD PWR_CNTL regs on A6XX 4225- tu: Use \`CP_SET_PSEUDO_REG` for A7XX VSC stream regs 4226- tu/autotune: Use \`CP_EVENT_WRITE7::ZPASS_DONE` on A7XX 4227- tu: Set \`RB_UNKNOWN_88E4` for A7XX event blits 4228- freedreno/devices: Update A7XX tile values 4229- tu: Use full size color CCU in sysmem mode 4230- tu: Update CCU layout selection logic for seperate stencil stores 4231- tu: Allow event blit to resolve depth stencil formats 4232- tu: Fix 2D blit path for GMEM stores on A7XX 4233- tu: Use \`Z24_UNORM_S8_UINT_AS_R8G8B8A8` for A7XX GMEM D24S8 blits/clear 4234- tu: Disable LRZ properly on A7XX 4235- tu: Set RB_CCU_CNTL during HW init on A7XX 4236- tu: Fix CP_BLIT sync on A7XX 4237- tu: Clear \`VSC_UNKNOWN_0D08` on A7XX 4238- tu: Add blit cache flushing for input attachments 4239- tu: Unconditionally enable GMEM on A7XX 4240- fd/replay: Fix wrbuffer name extraction 4241- fd/replay: Dump wrbuf into cwd rather than exe directory 4242- fd/replay: Clamp dumped wrbuf to buffer size 4243- fd/replay: Clear wrbufs after submitting cmdstreams for DRM 4244- fd/replay: Add wrbuf support for KGSL/DXG 4245- fd/replay: Error when VMA AS allocation fails 4246- fd/replay+rddecompiler: Add option to clear wrbufs at start 4247- fd/rddecompiler: Disable IR3 cache for replay context 4248- fd/decode: Build generate_rd executable rather 4249- fd/replay: Use generate_rd as default CS generator 4250- fd/decode: Fix "OPTSIONS" typo in help messages 4251 4252Mark Janes (18): 4253 4254- hasvk: add missing linker arguments 4255- util: add parson for handling json files 4256- intel/dev: specify struct intel_device_info type details in python 4257- intel/dev: generate declarations for struct intel_device_info 4258- intel/tools: add intel device meson dependencies 4259- intel/dev: implement json serialization for intel_device_info 4260- intel/dev/tools: add json as an output format for intel_dev_info 4261- intel/tools: load json device info in drm_shim 4262- intel/dev: improve meson invocation for intel_device_info gen 4263- intel/compiler: generate a hash function to use with the shader cache 4264- iris: use device info sha in device renderer string 4265- anv: use intel_device_info to set device UUID 4266- intel/tools: move intel_dev_info to intel/tools 4267- intel/tools: add shader compiler hash key to json devinfo format 4268- pan/va: Add missing valhall_enums dep to bifrost_tests 4269- intel/dev: declare workarounds required by ATSM platforms 4270- intel/dev: remove pci revision from shader cache key 4271- intel/compiler: drop unused ray-tracing fields from cache hash 4272 4273Martell Malone (3): 4274 4275- nine: r500 under 20 fragments cap is a warning 4276- nine: detect emulation fallback of d3d coordinates 4277- nine: update verbiage for enduser device messages 4278 4279Martin Krastev (5): 4280 4281- svga/ci: land vmware mesa-ci lava farm 4282- svga/ci: workaround vmware farm's inability to use public DNS 8.8.8.8 4283- svga/ci: re-enable vmware farm 4284- svga/ci: add two new piglit flakes to svga 4285- svga/ci: disable vmware farm 4286 4287Martin Roukala (né Peres) (10): 4288 4289- radeonsi/ci: update vangogh's expectations after piglit uprev 4290- zink/ci: update navi31's expectations after piglit uprev 4291- zink/ci: update polaris10's expectations after piglit uprev 4292- radv/ci: switch vkcts-polaris10 from mupuf to KWS' farm 4293- radv/ci: add a vkcts-tahiti job 4294- radv/ci: add a vkd3d-tahiti job 4295- ci/b2c: rename .b2c-test-{vk,gl} to .b2c-x86_64-test-{vk,gl} 4296- ci/b2c: rename .deqp-test-valve into .b2c-deqp-test 4297- ci/b2c: allow setting the DTB to be used 4298- ci/valve: remove the traces runner 4299 4300Mary Guillemard (37): 4301 4302- nouveau: nvidia_header: Add AMPERE_A in vk_push_print 4303- nouveau: nvidia_header: Add TURING_COMPUTE_A and AMPERE_COMPUTE_A in vk_push_print 4304- nouveau: nvidia_header: Add AMPERE_COMPUTE_B in vk_push_print 4305- nouveau: nvidia-headers: Add compute array parsing to class_parser.py 4306- nouveau: nvidia-headers: Add nv_push_dump tool 4307- nouveau: mme: Add a dumper 4308- agx: Add more bitops in agx_bitop_table 4309- agx: Remove and/or/xor pseudo ops 4310- agx: Fuse not into and/or/xor 4311- agx: Add a bitop optimizer pass 4312- pan/bi: assert indices when offsets are present in bi_emit_tex_valhall 4313- pan/lib: Remove variables in blitter 4314- pan/bi: Rework indices for tex on Valhall 4315- pan/bi: Rework indices for image on Valhall 4316- pan/bi: Rework indices for attributes on Valhall 4317- pan/bi: Lower ubo table in indices for Valhall 4318- panfrost, pan/lib: Move pan_resource_table to panfrost 4319- nvk: Always copy conditional rendering value before compare 4320- drm-shim: Add io region handling in mmap 4321- panfrost: Add support for Panthor in drm-shim 4322- docs: Document Mali-G610 in drm-shim section 4323- panfrost: group up stubbed params in drm-shim 4324- nouveau: Add support for TERT opcodes in vk_push_print 4325- nouveau: Fix NINC TERT handling in vk_push_print 4326- nak: Support unaligned swizzles in 8/16 bits vec srcs 4327- nak: move folding logic to Src::fold_imm 4328- nak: Add F16 and F16v2 sources 4329- nak: Improve copy propagation pass to handle F16 4330- nak: Add 16-bits float operations 4331- nvk: Advertise shaderFloat16 4332- nvk: Allow various alu op to be vectorized for 2xfp16 4333- nak: Allow SHF to use immediate encoding for shift 4334- panvk: Return os_page_size for minMemoryMapAlignment 4335- panvk: Fix driver UUID not being filled 4336- panvk: Move to vk_properties 4337- panvk: Advertise VK_KHR_driver_properties 4338- nak: Pass has_mod to all form of src2 requiring it 4339 4340Mary Strodl (2): 4341 4342- rusticl: set OCL_ICD_VENDORS as directory, not file 4343- NirShader: don't fail on null constant_buffer 4344 4345Matt Turner (5): 4346 4347- util: Add DETECT_ARCH_HPPA macro 4348- util/tests: Disable half-float NaN test on hppa/old-mips 4349- meson: Limit intel_vk_rt to x86_64 4350- anv/drirc: Add option to control implicit sync on external BOs 4351- intel: Build float64 shader only for Vulkan 4352 4353Matthew Waters (1): 4354 4355- teximage: allow glCopyTex{Sub}Image[123]D into R/RG textures with OpenGL ES 2.0 4356 4357Max R (11): 4358 4359- d3d10umd: Fix compilation 4360- winsys/gdi: Handle R8G8B8 formats 4361- winsys/gdi: Custom acquisition of hDC 4362- d3d10umd: Use flush_frontbuffer for Present 4363- virgl: Fix compilation on MSVC 4364- virgl: Fix crash when no VE bound 4365- virgl: Implement PIPE_QUERY_GPU_FINISHED 4366- virgl: Allow importing resources without known templ 4367- virgl: Pass cmd_buf to flush_frontbuffer 4368- d3d10umd, meson: Allow naming d3d10umd DLLs 4369- d3d10umd: Rename d3d10sw target to d3d10umd 4370 4371Michel Dänzer (2): 4372 4373- egl/wayland: Flush after blitting to linear copy 4374- wsi/wayland: Dispatch event queue in wsi_wl_swapchain_queue_present 4375 4376Mike Blumenkrantz (313): 4377 4378- vk/cmdbuf: add back deleted maint6 workgraph bits 4379- lavapipe: use pushconstants2 for dgc 4380- lavapipe: fix devenv icd filename 4381- zink: fix separate shader patch variable location adjustment 4382- lavapipe: delete extra descriptor buffer layout validation 4383- zink: use local screen variable in surface creation 4384- zink: hook up maint6 4385- zink: use maint6 for multi-layer compressed surface creation 4386- zink: set more dynamic states when using shader objects 4387- lavapipe: KHR_dynamic_rendering_local_read 4388- zink: always map descriptor buffers as COHERENT 4389- zink: fix descriptor buffer unmaps on screen destroy 4390- lavapipe: RM2024 extension promotions 4391- zink: add a tu flake 4392- zink: prune dmabuf export tracking when adding resource binds 4393- zink: fix sparse bo placement 4394- zink: zero allocate resident_defs array in ntv 4395- zink: move sparse lowering up in file 4396- zink: run sparse lowering after all optimization passes 4397- zink: add back (safe) optimizations after sparse lowering 4398- zink: split out sparse_residency_code_and lowering 4399- mesa: plumb errors through to texture allocation 4400- zink: adjust swizzled deref loads by the variable component offset 4401- nir/lower_io: fix handling for compact arrays with indirect derefs 4402- zink: only add arrays to indirect non-tcs variables 4403- zink: promote a conditional on gfx shader destroy 4404- zink: clamp zink_gfx_lib_cache::stages_present for generated tcs 4405- zink: promote gpl libs freeing during shader destroy out of prog loop 4406- zink: don't add VK_IMAGE_CREATE_2D_ARRAY_COMPATIBLE_BIT for sparse textures 4407- zink: add a ci skip 4408- ci: bump VVL to snapshot-2024wk06 4409- zink: update vvl expectations 4410- mesa: check driver format support for certain GetInternalformativ queries 4411- zink: always enable glsl_correct_derivatives_after_discard 4412- zink: add a750 baseline 4413- zink: delete maxDescriptorBufferBindings checks 4414- zink: flag the use_img as unsync access with buf2img copies, not the swapchain 4415- zink: pre-check formats for samplecount support 4416- zink: validate sample count on image create 4417- zink: add an assert for dummy fb surface creation 4418- zink: compute bo unique_id on use, not creation 4419- zink: avoid infinite recursion on (very) small BAR systems in bo alloc 4420- egl/x11/swrast: deduplicate ANGLE_sync_control_rate enablement 4421- drisw: hook up EXT_buffer_age 4422- drisw/egl: delete unused buffer age handling 4423- vk/wsi/x11/sw: use swapchain depth for putimage 4424- mesa: add more driver support checks for more format queries 4425- zink: add checks/compat for low-spec descriptor buffer implementations 4426- zink: add a second fence disambiguation case 4427- zink: force host-visible allocations for MAP_COHERENT resources 4428- zink: hook up KHR_dynamic_rendering_local_read 4429- zink: use KHR_dynamic_rendering_local_read 4430- ci: make clang-format job warn on failure instead of killing the pipeline 4431- zink: handle stencil_fallback in zink_clear_depth_stencil 4432- zink: don't destroy the current batch state on context destroy 4433- zink: only scan active batch states for free states if > 1 exist 4434- zink: fix longstanding issue with active batch state recycling 4435- zink: assert that batch_id is valid in zink_screen_check_last_finished() 4436- zink: move flagging rp_changed in zink_update_fbfetch() to caller 4437- zink: don't pre-init dummy fbfetch surface when missing nullDescriptor feature 4438- zink: also set null fbfetch surfaces when no fb surface is bound 4439- zink: break out null fbfetch init for descriptor buffer 4440- zink: create/resize dummy surfaces on-demand 4441- zink: start out with 256x256 sized dummy surfaces 4442- zink: don't pre-init null fbfetch info 4443- zink: clamp in_rp clears to fb size 4444- zink: fix (dynamic rendering) execution of scissored clears during flush 4445- zink: fix swapchain readback conditional 4446- zink: lock buffer age when chundering swapchain for readback 4447- zink: flag acquired swapchain image as readback target on acquire, not present 4448- zink: make kopper_swapchain_image::acquired the resource that acquired it 4449- zink: add a swapchain readback case for reading differently-acquired image 4450- zink: make readback attempts count towards ZINK_READBACK_THRESHOLD 4451- zink: update swapchain readback cache on create 4452- zink: set and manage a flag indicating that swapchain readback needs updating 4453- zink: only update swapchain readback cache on create if necessary 4454- zink: only update swapchain readback cache when necessary 4455- zink: use new flag to determine whether swapchain readback cache is usable 4456- zink: update nv blob baseline 4457- zink: add nvk baseline 4458- ci: disable clang-format job 4459- zink: apply all storage memory masks to control barriers if no modes are specified 4460- zink: emit SpvCapabilityImageMSArray for ms arrayed storage images 4461- zink: null out bo usage when allocating from slab 4462- zink: fix unsynchronized read-mapping of device-local buffers 4463- zink: delete unused buffer map conditional 4464- zink: force max buffer alignment on return ptrs for mapped staging buffers 4465- gallium: add a nboxes param to flush_frontbuffer 4466- winsys/sw: propagate nboxes to displaytarget_display() 4467- drisw: plumb through a swapBuffersWithDamage interface 4468- egl/wayland/sw: move swrast_update_buffers() directly into swapbuffers 4469- egl/wayland/sw: move dri2_wl_swrast_commit_backbuffer() directly into swapbuffers 4470- egl/wayland: unify back/current swapping between zink and swrast 4471- egl/wayland/sw: split out surface attach from dri2_wl_swrast_commit_backbuffer() 4472- egl/wayland/sw: call dri2_wl_swrast_attach_backbuffer() before swap 4473- egl/wayland/sw: trigger damage from put_image2 4474- egl/wayland/sw: move partial->full copy promotion to swapbuffers 4475- egl/wayland/sw: fix no-op updating of current backbuffer 4476- egl/wayland/sw: pass damage region through from put_image2 to wl_surface_damage 4477- egl/wayland/sw: clamp putimage geometry to surface size 4478- drisw/xlib: loop over all the boxes in display() hook 4479- drisw/winsys: loop over all the boxes in display() 4480- drisw: pass all frontend swapbuffer damage rects through 4481- egl/kopper: plumb through SwapBuffersWithDamage 4482- egl/kopper: advertise EXT_swap_buffers_with_damage only in non-sw mode 4483- egl/wayland: split out kopper vtable 4484- egl/wayland: add a separate hook for kopper buffer age 4485- egl/wayland: split out kopper swapbuffers functions 4486- egl/kopper: call swrast buffer age query for kopper+swrast 4487- kopper: set drawable buffer age 4488- egl/wayland/kopper: actually call kopper swapbuffer functions 4489- egl/wayland: split out kopper update_buffers 4490- egl/wayland: delete swrast references to zink 4491- zink: fix stencil-only blitting with stencil fallback 4492- zink: make zink_kopper_present_info public 4493- zink: use a slab allocator for zink_kopper_present_info 4494- zink: hook up VK_KHR_incremental_present 4495- zink: use VK_KHR_incremental_present to propagate damage rects 4496- zink: hook up KHR_partial_update 4497- vulkan/dispatch_table: add an uncompacted version of the table 4498- zink: use uncompacted vk_dispatch_table 4499- egl/dri2: use the right egl platform enum 4500- glx: only print zink failure-to-load messages if explicitly requested 4501- zink: stop enabling EXT_conservative_rasterization 4502- lavapipe bump descriptor buffer address space limits 4503- zink: fix PIPE_CAP_MAX_SHADER_PATCH_VARYINGS 4504- zink: call CmdSetRasterizationStreamEXT when using shader objects 4505- nvk: bump NVK_PUSH_MAX_SYNCS to 256 4506- zink: update nvk baseline 4507- util/blitter: iterate samples in stencil_fallback 4508- mesa: fix CopyTexImage format compatibility checks for ES 4509- zink: update nvk baseline with nvk changes 4510- driconf: add radv_zero_vram for Crystal Project (1637730) 4511- zink: update nv baseline 4512- zink: track whether shaders use load_barycentric_at_sample 4513- zink: apply zink_shader::uses_sample to fs variant updating 4514- zink: destroy batch states after copy context 4515- zink: set VkExternalMemoryBufferCreateInfo for opaque fds too 4516- zink: simplify vb masking on bind 4517- mesa: force rendertarget usage on required-renderable formats 4518- zink: try getting sparse page size again without storage bit on fail 4519- u/inlines: constify util_res_sample_count() 4520- zink: only add STORAGE bit for sparse images based on multisample usage 4521- zink: nvk baseline updates 4522- zink: set the sparse format usage flags directly based on queried props 4523- zink: delete faked_e5sparse 4524- zink: rename optimal_key in update_gfx_program_optimal() 4525- zink: use the sanitized key in update_gfx_program_optimal() 4526- zink: always sync and replace separable progs even with ZINK_DEBUG=noopt 4527- zink: add even more strict checks for separate shader usage 4528- zink: be even stricter with shader object usage about blocking invalid usage 4529- zink: remove stale comments for DRLR usage 4530- zink: add a pass to strip out multisample storage image ops 4531- zink: don't deref swapchain image array with UINT32_MAX 4532- zink: handle image_deref_samples when stripping MS image instrs 4533- zink: iterate all the modes when doing separate shader fixups 4534- mesa/st: add 'base_serialized_nir' 4535- mesa/st: add is_draw_shader param to st_finalize_nir 4536- mesa/st: when creating draw shader variants, use the base nir and skip driver opts 4537- mesa/st: use sanitized shader keys for feedback draws 4538- zink: do io fixup on patch variables too 4539- zink: defer present barrier to flush if a clear is pending 4540- zink: clamp present region size 4541- zink: clamp swapchain renderarea instead of asserting 4542- zink: set dynamic rendering color attachment layouts 4543- radv: inline radv_device_fault_detection_enabled 4544- ci: bump VVL to v1.3.281 4545- nir/divergence: add zink intrinsics 4546- nir/opt_varyings: update alu type when rewriting src/dest for moved ops 4547- zink: only check that CUBE_COMPATIBLE for images doesn't subtract flags 4548- zink: don't use set_foreach_remove with dmabuf_exports 4549- zink: make descriptor pool creation more robust 4550- zink: fix shaderdb pipeline compile 4551- zink: delete some ntv dead code 4552- zink: always sort io variables by location after re-creating them 4553- zink: use outputs_written mask to detect edge flag usage 4554- zink: update xfb info after lower_to_scalar 4555- zink: run scan_nir before variable rework 4556- zink: apply component offset for CLIP/CULL DIST1 location derefs 4557- zink: manually calc clip/cull distance sizes 4558- zink: add a helper to detect clip/cull dist locations 4559- zink: always use shader sizes for clip/cull dist variables 4560- zink: fix generated variable expansion 4561- zink: check for arrayness rather than tess io vars for indirect array vars 4562- zink: track a mask of arrayed io locations on shaders 4563- zink: call gather_info during shader creation 4564- zink: always check patch io during rework_io_vars 4565- zink: don't clobber indirect array reads with missing components 4566- zink: fix io slot calculation for vertex inputs in add_derefs 4567- zink: fix add_derefs case for compact arrays 4568- zink: only use location_frac for deref array indexing for compact variables 4569- llvmpipe: fix DRAW_USE_LLVM=0 4570- nir/lower_wpos_ytransform: move new value load to start of function, reuse 4571- nir/lower_wpos_ytransform: reuse input zw components for fragcoord rewrite 4572- nir/lower_wpos_ytransform: update comment to reflect variable usage 4573- nir/lower_wpos_ytransform: scalarize emit_wpos_adjustment 4574- nir/lower_wpos_ytransform: fix for lowered io 4575- glsl: handle xfb resources for spirv before running varying opts 4576- mesa: clamp binary pointer in ShaderBinary if length==0 4577- gallium: rework PIPE_CAP_POINT_SIZE_FIXED 4578- zink: delete some maintenance5 psiz pruning 4579- zink: fix add_derefs for partial interp loads of derefs 4580- zink: assert that ntv interp handling isn't doing implicit component expansion 4581- egl/x11: disable swapbufferswithdamage for zink without kopper 4582- glx/egl: fix LIBGL_KOPPER_DISABLE 4583- glsl: set PSIZ bit in outputs_written when injecting a 1.0 psiz write 4584- nir/lower_clamp_color_outputs: fix use with lowered io 4585- nir/lower_flatshade: break out location checking 4586- nir/lower_flatshade: fix with lowered io 4587- nir/lower_alpha_test: fix use with lowered io 4588- nir/lower_two_sided_color: rework for lowered io 4589- nir/lower_drawpixels: fix for lowered io 4590- nir/lower_clip_disable: fix for lowered io 4591- nir/lower_point_size_mov: rework. 4592- nir/lower_point_size_mov: fix for lowered io 4593- nir/texcoord_replace: fix scalarized io handling 4594- nir/dominance: fix comment 4595- drisw: reorder image extensions 4596- sw_winsys: add displaytarget_create_mapped 4597- winsys/null: implement displaytarget_create_mapped 4598- winsys/drisw: implement displaytarget_create_mapped 4599- winsys/drisw: implement dmabuf handling 4600- lavapipe: add a function for asserting external memory handle types 4601- winsys: add WINSYS_HANDLE_TYPE_UNBACKED 4602- winsys: add more stride members to winsys_handle 4603- lavapipe: EXT_queue_family_foreign 4604- lavapipe: rework mem handle type assert to handle dmabuf 4605- lavapipe: handle drm image format queries 4606- lavapipe: handle drm image imports 4607- docs: update lavapipe features 4608- nir: add compact_arrays to nir_shader_compiler_options 4609- nir/gather_info: fix gathering for compact arrayed builtins 4610- zink: set compact_arrays in compiler options 4611- microsoft/compiler: set compact_arrays in compiler options 4612- lavapipe: don't clamp index buffer size for null index buffer draws 4613- v3d: set use_clipdist_array=true for lower_clip? 4614- nir/lower_clip: surgerize for lowered io 4615- nir/lower_clip: handle scalarized io 4616- zink: block LA formats with srgb 4617- llvmpipe: clamp 32bit query results to low 32 bits rather than MIN 4618- lavapipe: clamp 32bit query results to low 32 bits rather than MIN 4619- agx: set compact_arrays in compiler options 4620- v3d: set compact_arrays in compiler options 4621- intel: set compact_arrays in compiler options 4622- freedreno: set compact_arrays in compiler options 4623- glsl: stop using PIPE_CAP_NIR_COMPACT_ARRAYS and check compact_arrays 4624- ttn: stop using PIPE_CAP_NIR_COMPACT_ARRAYS and check compact_arrays 4625- glsl: move an assert from st_context over to avoid using PIPE_CAP_NIR_COMPACT_ARRAYS 4626- mesa: delete LowerCombinedClipCullDistance from consts 4627- st/program: stop using PIPE_CAP_NIR_COMPACT_ARRAYS and use compact_arrays 4628- nine: stop checking PIPE_CAP_NIR_COMPACT_ARRAYS and use compact_arrays 4629- gallium: delete PIPE_CAP_NIR_COMPACT_ARRAYS 4630- zink: set indirect io compiler flags 4631- zink: set lower_to_scalar 4632- zink: rework rework_io_vars 4633- zink: set nir_io_glsl_lower_derefs in compiler options 4634- zink: add a pass to fix vertex input locations 4635- zink: enable opt_varyings with ZINK_DEBUG=ioopt 4636- zink: ci updates 4637- nir/remove_unused_io_vars: check all components to determine variable liveness 4638- ci: kill piano trace globally 4639- nir: print i/o variables in location order 4640- lavapipe: disable stencil test if no stencil attachment 4641- egl: fix defines for zink's dri3 check 4642- egl/android: fix zink loading 4643- egl: use os_get_option for MESA_LOADER_DRIVER_OVERRIDE 4644- zink: disable buffer reordering correctly on shader image binds 4645- nir/print: stop trying to match i/o vars using base/driver_location 4646- zink: add ZINK_DEBUG=nopc to completely disable precompilation 4647- zink: destroy shaderdb pipelines 4648- zink: add VK_PIPELINE_CREATE_CAPTURE_STATISTICS_BIT_KHR for shaderdb 4649- brw/lower_a2c: fix for scalarized fs outputs 4650- zink: copy shader name when copying shader info 4651- zink: run nir_lower_io_to_scalar (mostly) unconditionally and earlier 4652- zink: vectorize io loads/stores when possible 4653- zink: ci updates 4654- zink: prune some piglit cts fails 4655- loader: delete unused param from pipe_loader_vk_probe_dri() 4656- glx: fix some indentation 4657- glx: add an 'implicit' param to createScreen 4658- glx: pass implicit load param through allocation 4659- dri: plumb a 'implicit' param through createNewScreen interfaces 4660- gbm: plumb an 'implicit' param through device creation 4661- frontends/dri: plumb an 'implicit' param through screen init 4662- pipe-loader: plumb a flag for implicit driver load through screen creation 4663- zink: don't print error messages when failing an implicit driver load 4664- glx: silence more implicit-load zink errors 4665- mesa/st: don't use serialized_nir for cached shaders 4666- zink: make NOREORDER mode context-based 4667- zink: disable command reordering for compute-only contexts 4668- nir: store variable names to io instrs during io lowering 4669- nir/lower_io_to_scalar: preserve variable names when splitting io 4670- nir/clone: preserve intrinsic name field across clones 4671- nir/print: print io instr->name if available 4672- zink: preserve/merge variable names when generating new variables 4673- glthread: check for invalid primitive modes in DrawElementsBaseVertex 4674- zink: reconstruct features pnext after determining extension support 4675- zink: prune zink_shader::programs under lock 4676- zink: fully wait on all program fences during ctx destroy 4677- kopper: fix bufferage/swapinterval handling for non-window swapchains 4678- zink: slightly better swapinterval failure handling 4679- kopper: don't set drawable buffer age 4680- zink: clean up accidental debug print 4681- egl/x11: disable dri3 with LIBGL_KOPPER_DRI2=1 as expected 4682- zink: add a batch ref for committed sparse resources 4683- u_blitter: stop leaking saved blitter states on no-op blits 4684- freedreno/replay: use inttypes format string for 64bit 4685- frontends/dri: only release pipe when screen init fails 4686- frontends/dri: always init opencl_func_mutex in InitScreen hooks 4687- zink: clean up semaphore arrays on batch state destroy 4688- egl/dri2: fix error returns on dri2_initialize_x11_dri3 fail 4689- nir/lower_aaline: fix for scalarized outputs 4690- nir/linking: fix nir_assign_io_var_locations for scalarized dual blend 4691 4692Mike Hsieh (1): 4693 4694- amd/vpelib: Add param check for geometric scaling and refactor 4695 4696Mohamed Ahmed (14): 4697 4698- nil: change image_level_size() to take tiling in account 4699- nil: Add helper function to get tile size in pixels 4700- nil: Add helpers for conversion from pixel values to tiles 4701- nil: Expose tiling_extent_B() 4702- nil: Add support for sparse resident images 4703- nvk: add sparse queries 4704- nvk: enable sparse residency features 4705- nak: wire up shader resource residency intrinsics 4706- nak: wire up sparse image loads 4707- nvk: advertise shader resource residency 4708- nil: Add a nil_image::compressed bit 4709- nil: Add some helpers for DRM format modifiers 4710- nil: Support creating images with DRM modifiers 4711- nvk: enable rendering to DRM_FORMAT_MOD_LINEAR images 4712 4713Mykhailo Skorokhodov (2): 4714 4715- egl/wayland: Fix sRGB format look up for config 4716- ci/lima: expect fail of window_8888_colorspace_srgb on wayland 4717 4718Nanley Chery (13): 4719 4720- iris: Don't memset the extra_aux memory range 4721- iris: Don't memset CCS on integrated gfx12 4722- iris: Enable pass-through state init for gfx12 CCS 4723- isl: Pick a better initial state for zeroed MCS 4724- iris: Copy main ISL surf when reallocating in place 4725- iris: Report the correct modifier for Tile4 images 4726- iris: Use resource_get_param in resource_get_handle 4727- intel/isl: Remove inconsistency when choosing Tile64 4728- intel/isl: Remove inconsistency when encoding Tile64 4729- intel/isl: Remove a CCS_D check from gfx12+ code 4730- intel/isl: Enable a 64KB alignment WA for flat-CCS 4731- intel/isl: Use Tile64 to align images for CCS WA 4732- intel/isl: Disable miptails to align LODs for CCS WA 4733 4734Neil Armstrong (1): 4735 4736- freedreno: Add a750 clock gating control related registers 4737 4738Nikita Popov (1): 4739 4740- Pass no-verify-fixpoint option to instcombine in LLVM 18 4741 4742Oskar Viljasaar (3): 4743 4744- vulkan/properties: Start looping from the next member in GPDP2 4745- tu: Use common physical device properties infrastructure 4746- compiler/types: Fix glsl_dvec*_type() helpers 4747 4748Patrick Lerda (16): 4749 4750- glsl/nir: fix gl_nir_cross_validate_outputs_to_inputs() memory leak 4751- r300: fix vertex_buffer related refcnt imbalance 4752- r300: fix r300_destroy_context() related memory leaks 4753- r300: fix memory leaks when register allocation fails 4754- r300: fix constants_remap_table memory leak 4755- radeonsi/gfx10: fix main_shader_part_ngg_es memory leak 4756- r300: enable R400 cos and sin hardware vertex shader opcodes 4757- ac/llvm,radeonsi: fix memory leaks triggered by ac_nir_translate() errors 4758- r300: fix NIR passes regression 4759- r300: fix constants_remap_table memory leak related to the dummy shader path 4760- r300: fix r300_draw_elements() behavior 4761- panfrost: remove panfrost_create_shader_state() related dead code 4762- gallium/auxiliary/vl: fix typo which negatively impacts the src_stride initialization 4763- clover: fix pipe_box update regression 4764- clover: fix memory leak related to optimize 4765- r600: fix vertex state update clover regression 4766 4767Paul Gofman (3): 4768 4769- glsl: allow out arrays in #110 with allow_glsl_120_subset_in_110 4770- driconf: add a workaround for Joe Danger 2 4771- driconf: add a workaround for Joe Danger 4772 4773Paulo Zanoni (35): 4774 4775- zink: fix bind size handling in buffer_bo_commit() 4776- anv/sparse: add an extra step before anv_sparse_bind_resource_memory() 4777- anv/sparse: allow binding operations to match the resource size 4778- anv+zink/ci: remove recently fixed tests from the crash list 4779- anv/sparse: don't issue a single bind operation per vm_bind ioctl 4780- anv/sparse: leave the semaphore waits and signals to the vm_bind ioctl 4781- anv/sparse: don't use the bind_timeline when doing sparse binding 4782- anv: change the vm_bind-related kmd_backend vfuncs to return VkResult 4783- anv: add an anv_pipe_bits bit to allow invalidating the TLB 4784- anv/trtt: invalidate the TLB after writing TR-TT entries 4785- anv/trtt: update GFX_TRTT_VA_RANGE for LNL 4786- anv: don't leak device->vma_samplers 4787- anv: set shaderFloat64 to true when fp64_workaround_enabled 4788- driconf/anv: set fp64_workaround_enabled to DIRT 5 4789- anv/xe: don't leak xe_syncs during trtt submission 4790- anv/xe: don't overwrite the result from vk_sync_wait() 4791- vulkan: don't zero-initialize STACK_ARRAY()'s stack array 4792- anv, iris: add missing CS_STALL bit for GPGPU texture invalidation 4793- anv: reduce struct anv_image_memory_range from 32 to 24 bytes 4794- vulkan: reduce struct vk_object_base by 8 bytes 4795- anv/sparse: remove useless isl_surf_get_tile_info() call 4796- anv/sparse: remove unnecessary popcount assertions 4797- anv/sparse: adjust sparse_bind_image_memory debug messages 4798- anv/sparse: remove unused dump_vk_sparse_memory_bind() 4799- anv/sparse: replace device->using_sparse with device->num_sparse_resources 4800- anv/sparse: rework anv_free_sparse_bindings() error handling 4801- anv/xe: extract anv_vm_bind_to_drm_xe_vm_bind() 4802- anv/xe: add a 'flags' parameter to the vm_bind() kmd_backend function 4803- anv/xe: slightly improve error handling for the vm_bind ioctl 4804- anv/xe: assert we're using drm_syncobjs only once 4805- anv/xe: de-duplicate xe_exec_fill_sync() 4806- anv/xe: rename and refactor xe_exec_fill_sync() 4807- anv/sparse: fail the right way in anv_GetDeviceImageSparseMemoryRequirements() 4808- anv: const-correct anv_{image,buffer}_is_sparse() 4809- isl: add ISL_TILING_64_XE2 to isl_tiling_to_name() 4810 4811Pavel Ondračka (38): 4812 4813- r300: fix reusing of color varying slots for generic ones 4814- r300: skip draw if vertex shader does not write gl_Position 4815- r300/ci: switch to deqp-runner suite 4816- r300/ci: add the KHR gles2 tests 4817- r300/ci: move streaming-texture-leak from fails to skips 4818- r300: fix writemask for nir_intrinsic_load_ubo_vec4 4819- r300: skip backend DCE for vertex shaders 4820- r300: remove R3xx/R4xx backend absulute modifier lowering 4821- r300/ci: add dEQP on RV380 4822- r300: remove backend SLE and SGT support 4823- r300: add r300_is_only_used_as_float helper 4824- r300: optimize out more modifiers produced later 4825- r300: lower comparison ops early in NIR 4826- r300: remove SGE, SNE, SLT, SGE lowering in the backend 4827- r300: remove the remaining of backend constant folding 4828- r300: remove backend support for SUB 4829- r300/ci: update piglit fails 4830- r300: remove compiler tests 4831- r300/ci: add two more observed piglit flakes 4832- r300: fix vs output register indexing 4833- r300: add explicit flrp lowering 4834- ci: install xwayland in x86_64_test-gl 4835- ci: build nine in debian-testing 4836- ci: build nine tests 4837- r300/ci: enable nine tests 4838- r300: explicitly check if sin/cos input is already in correct range 4839- r300: move sin/cos input fixups to finalize_nir 4840- r300: remove some late NIR passes 4841- nir/lower_vec_to_regs: always set cursor before inserting decl_reg 4842- r300: check for the extra restrictions on presubtract swizzles 4843- r300: move presubtract pass later 4844- r300: optimize swizzle for inline constants 4845- r300: inline unoptimized_ra ntr option 4846- r300: get rid of the unused ubo_vec4_max ntr option 4847- r300: remove the ntr lower_cmp option 4848- r300: move lower_fabb option out of the options struct 4849- r300: remove nir_to_rc_options wrapper 4850- r300/ci: failures list update 4851 4852Peyton Lee (6): 4853 4854- radeonsi/vpe: remove wait source surface fence and while loop 4855- radeonsi/vpe: disable info log 4856- radeonsi/vpe: move flush to si_vpe_processor_end_frame 4857- radeonsi/vpe: support multi-buffer 4858- radeonsi/vpe: pre-allocate stream struceutre 4859- radeonsi/vpe: add support for p010 4860 4861Philip Rebohle (1): 4862 4863- radv: Remove dead shared variables after optimization loop. 4864 4865Philipp Zabel (6): 4866 4867- rusticl: work around reference-to-mutable-static warnings 4868- etnaviv: common: Add PIPE_3D feature bit 4869- etnaviv: Avoid duplicate query of ETNA_GPU_FEATURES_0 parameter 4870- etnaviv: hwdb: Add VIP_V7 and NN_XYDP0 feature bits 4871- etnaviv: Add nn_core_version field to etna_specs 4872- etnaviv/nn: Extend post-multiplier for v8 architecture 4873 4874Pierre-Eric Pelloux-Prayer (21): 4875 4876- radeonsi: compute epitch when modifying surf_pitch 4877- Revert "ci/radeonsi: disable VA-API testing on raven" 4878- radeonsi: emit cache flushes before draw registers 4879- radeonsi: adjust flags for si_compute_shorten_ubyte_buffer 4880- winsys/amdgpu: use syncobj rather than amdgpu fence 4881- ac, radeonsi: remove has_syncobj, has_fence_to_handle 4882- radeonsi: try to disable dcc if compute_blit is the only option 4883- meson: require libelf when radeonsi is built 4884- egl/drm: flush before calling get_back_bo 4885- radv: don't remove the blit queue from the device queues 4886- winsys/amdgpu: unmap user fence BO before destroy 4887- winsys/amdgpu: remove unused amdgpu_fence_is_syncobj 4888- wsi/wl: flush connection on swapchain failure 4889- mesa: deal with vbo_save_vertex_list::modes being NULL 4890- wsi/wl: check wsi_wl_surface's validity before use 4891- egl/wayland: use __DRI_IMAGE_PRIME_LINEAR_BUFFER in get_back_bo 4892- winsys/radeon: pass priv instead NULL to radeon_bo_can_reclaim 4893- radeonsi: preserve alpha if needed in kill_ps_outputs_cb 4894- amd: fix addrlib regression 4895- aco: don't use python 3.7+ feature in aco_opcodes.py 4896- radv: don't use python 3.9 feature in radv_annotate_layer_gen.py 4897 4898Qiang Yu (1): 4899 4900- radeonsi: split RADEON_USAGE_NEEDS_IMPLICIT_SYNC into CB and DB flags 4901 4902Ray Smith (2): 4903 4904- panfrost: Don't try to set bifrost blendable format on midgard 4905- panfrost: Fix format tables for v4 and v5 4906 4907Rhys Perry (84): 4908 4909- radv: do nir_shader_gather_info after radv_nir_lower_rt_abi 4910- nir/lower_non_uniform: set non_uniform=false when lowering is not needed 4911- nir/lower_shader_calls: remove CF before nir_opt_if 4912- aco: fix labelling of s_not with constant 4913- aco: add VOPD format 4914- aco: add VOPD statistic 4915- aco: refactor schedule_ilp main loop 4916- aco: implement VOPD scheduler 4917- aco: enable VOPD scheduler 4918- aco: fix >8 byte linear vgpr copies 4919- aco/tests: fix to_hw_instr.swap_linear_vgpr 4920- aco: refactor create_vopd_instruction 4921- aco: swap operands to create VOPD instructions 4922- aco: turn v_mov_b32 into addition to create VOPD instructions 4923- aco: improve printing of VOPD instructions 4924- aco/tests: add tests for VOPD operand swapping 4925- aco/tests: use raw strings in form_hard_clauses.nsa 4926- radv: support minmax filter for more formats 4927- aco/ra: don't initialize assigned in initializer list 4928- aco/ra: fix GFX9- writelane 4929- aco: don't combine linear and normal VGPR copies 4930- aco/ra: disable p_start_linear_vgpr allocation hint 4931- aco: allow p_start_linear_vgpr to use multiple operands 4932- aco: require linear vgpr uses to be late kill 4933- aco: only allow linear vgpr kills in top-level blocks 4934- aco/ra: constify various RegisterFile 4935- aco/ra: move parallelcopy creation into helper 4936- aco/ra: change get_reg_bounds() helper 4937- aco/ra: rework linear VGPR allocation 4938- aco/ra: disable live range splitting of linear vgprs 4939- aco/ra: emit linear VGPR parallel copy separately 4940- aco/tests: add tests for linear VGPR register allocation 4941- aco: optimize for purely linear VGPR copies 4942- nir/algebraic: don't create 64-bit min/max/ior if lowered 4943- nir/algebraic: remove duplicated iand(ien, ine)/ior(ieq, ieq) patterns 4944- nir/algebraic: optimize 64-bit comparisons with zero'd halves to 32-bit 4945- nir/lower_int64: allow 64-bit comparisons when lowering minmax 4946- nir/search: fix nir_replace_instr() debug code 4947- aco: don't pass constant to is_overwritten_since() 4948- radv: don't advertise DGC with LLVM 4949- radv: stop using 5/8 component SSBO stores 4950- radv,aco: allow VS prologs to increase VGPR usage 4951- aco: don't reuse misaligned attribute destination VGPRs in VS prologs 4952- aco/util: add small_vec 4953- radv: use dual_color_blend_by_location with Half-Life Alyx 4954- aco/cssa: reset equal_anc_out if merging fails 4955- aco/cssa: update comments 4956- aco: fix GFX6 buffer_load_dwordx4 opcode number 4957- aco: rename opcode->instruction 4958- aco: refactor VOPC opcode list 4959- aco: use single tuple for all opcode numbers 4960- aco: use op() 4961- aco: move dot/wmma instructions into VOP3P list 4962- aco: unify MIMG opcode lists 4963- aco/gfx11: fix scratch ST mode assembly 4964- aco: split instruction assembly into functions 4965- aco: always emit float mode for merged shaders compiled separately 4966- aco: avoid breaking clauses with waitcnt 4967- nir: add mqsad_4x8, shfr and nir_opt_mqsad 4968- aco: implement mqsad_4x8 and shfr 4969- ac/llvm: implement mqsad_4x8 and shfr 4970- amd: set has_shfr32=true 4971- radv: optimize msad_4x8 to mqsad_4x8 4972- radv: memset radv_pipeline_cache_object data 4973- nir: add nir_remove_after_cf_node helper 4974- aco: remove unreachable merge blocks 4975- aco: ensure loop exits exist in NIR 4976- aco: save/reset/combine has_divergent_continue in uniform branches 4977- nir,aco: add test intrinsics 4978- aco/tests: add isel test helpers 4979- aco/tests: add control flow tests 4980- aco: assume no unreachable blocks 4981- aco: don't include the clause in VMEM_CLAUSE_MAX_GRAB_DIST 4982- aco: remove occupancy check in dealloc_vgprs() 4983- aco/tests: don't assume constructor order 4984- aco/tests: remove LLVM 11 code 4985- radv: cache RT stage info 4986- aco: include LDSDIR in latency/etc stats 4987- aco: make store clauses more aggressively 4988- aco: schedule LDSDIR instructions 4989- aco: schedule LDS instructions 4990- aco: split vop3p results 4991- aco/waitcnt: fix DS/VMEM ordered writes when mixed 4992- aco: create lcssa phis for continue_or_break loops when necessary 4993 4994Rob Clark (31): 4995 4996- freedreno/a6xx: fix comment 4997- freedreno/registers: Pass full args to dump_c() 4998- freedreno/registers: De-duplicate xml_reg_files 4999- freedreno/registers: Don't re-parse files 5000- freedreno/registers: Generate copyright comment blurb 5001- freedreno/registers: Add basic kernel header support 5002- freedreno/registers: A couple newline changes 5003- tu/drm/virtio: Fix dmabuf import 5004- freedreno/drm: Submit should hold ref to device 5005- freedreno/drm: Fix teardown crash harder 5006- freedreno/decode: Fix prefetch handling for IB1 crash 5007- freedreno: Fix MSAA z/s layout in GMEM 5008- freedreno/crashdec: Find potential fault buffers 5009- tu: Give suballoc bo's a name 5010- freedreno/a6xx: Add dual_color_blend_by_location 5011- freedreno/a6xx: Fix z/s preserving sysmem clear blit 5012- freedreno/pps: Don't re-init perfcntrs 5013- freedreno: Add bo usage hints 5014- freedreno/drm: Add perfetto memory tracing 5015- tu: Add perfetto memory tracing 5016- pps: Enable memory traces 5017- pps: Config tweaks to avoid loosing traces 5018- freedreno/registers: Add license header 5019- egl/android: Fix gl_config dereference 5020- freedreno/drm/virtio: Fix deadlock on exit 5021- freedreno+virgl: Add missing driconf 5022- freedreno: Update a618 xfails 5023- ci: Add deqp fix for pipeline_statistics_3 tests 5024- tu: Fix a6xx lineWidthGranularity 5025- egl/android: Fix sRGB visuals 5026- freedreno/ir3: Fix ldg/stg offset 5027 5028Robert Beckett (1): 5029 5030- vulkan/wsi: fix force_bgra8_unorm_first 5031 5032Robert Mader (5): 5033 5034- crocus: Support offset query for multi-planar planes 5035- panfrost: Use pipe resource helper 5036- egl: Implement EGL_EXT_config_select_group 5037- egl: Implement EGL_MESA_x11_native_visual_id 5038- egl/x11: Allow all RGB visuals to match 32-bit RGBA EGLConfigs 5039 5040Robin Kertels (3): 5041 5042- nvk: Enable EXT_nested_command_buffer. 5043- nak: Enable lowering rotate to shuffle. 5044- nvk: Advertise VK_KHR_shader_subgroup_rotate. 5045 5046Rohan Garg (35): 5047 5048- anv: refactor emit_dynamic_buffer_binding_table_entry 5049- isl,blorp,anv: introduce ISL_TILING_64_XE2 for Xe2+ platforms 5050- anv: untyped data port flush required when a pipeline sets the VK_ACCESS_2_SHADER_STORAGE_READ_BIT 5051- anv: factor out common code for determining surface usage from a VkDescriptorType 5052- anv: cleanup duplicate robustness flag calculations 5053- anv: add a command streamer stall on Xe2+ when switching pipelines 5054- intel/compiler: Xe2+ can do URB load/store with a byte offset 5055- anv: drop duplicated 3DSTATE_SLICE_TABLE_STATE_POINTERS emission 5056- anv, blorp: Set COMPUTE_WALKER Message SIMD field 5057- intel/genxml: update PIPE_CONTROL so that we can decode it on the CCS 5058- iris,anv: WA 1509820217 is no impact for Xe2+ 5059- intel/brw: Use the dimensions supplied in the instruction 5060- intel/brw: Cleanup send generation 5061- intel/brw: Update written size depending on the LSC message 5062- intel/brw: Set the right cache control bits for xe2 5063- intel/brw: Adjust src1 length bits for xe2+ 5064- anv,blorp: implement restrictions from WA 1406738321 5065- anv: 3D surfaces have fewer layers for higher miplevels 5066- isl: enable CCS for 3D surfaces on gen12.5 and above 5067- intel/brw: account for sources when determining if a operation uses half floats 5068- intel/brw: Xe2+ can do SIMD16 for extended math on HF types 5069- intel/brw: update disassembly for MATH pipe 5070- intel/brw: adjust the copy propgation pass to account for wider GRF's on Xe2+ 5071- intel/brw: minor rework to de duplicate variable assignment 5072- intel/brw: Handle typed surface and atomic messages for xe2+ 5073- intel/brw: Lower DWORD scattered read writes to lsc 5074- intel/eu/validate: Allow SIMD16 for mixed mode float operations on xe2+ 5075- iris: slow clear higher miplevels on single sampled 8bpp resources that have TILE64 5076- intel/blorp: add fast clear rectangle dimensions for single sampled TILE64 CCS surfaces 5077- isl: allow CCS on single sampled TILE64 surfaces 5078- anv: Enable HiZ on multi-LOD depth buffers. 5079- anv: use u_foreach_bit to iterate over the the view mask like we do for transition_clear_color 5080- anv: formatting fix when printing pipe controls 5081- anv: allocate space for generated indirect draw id's using the temporary allocation helper 5082- Revert "iris: slow clear higher miplevels on single sampled 8bpp resources that have TILE64" 5083 5084Roland Scheidegger (2): 5085 5086- auxiliary/draw: fix streamout overflow calculation 5087- auxiliary/rtasm: fix unaligned stores 5088 5089Romain Naour (1): 5090 5091- glxext: don't try zink if not enabled in mesa 5092 5093Ruijing Dong (6): 5094 5095- radeonsi/vcn: data structure av1 enc long term reference. 5096- radeonsi/vcn: vcn4 av1 long term ref support 5097- frontends/va: get av1 encoding ref frame infos for L0. 5098- radeonsi/vcn: add enc surface alignment caps 5099- frontends/va: add surface alignment attribute 5100- radeonsi/vcn: update to use correct padding size. 5101 5102Ryan Neph (3): 5103 5104- venus: fix shmem leak on vn_ring_destroy 5105- virgl: use PIPE_MAX_SAMPLERS in bind_samplers_states 5106- venus: reclaim signal semaphore feedback resources for wasteful clients 5107 5108Sagar Ghuge (28): 5109 5110- intel/fs: Track instance id in gs_thread_payload 5111- vulkan/runtime: Track VkSharingMode in vk_image 5112- anv: Disable compression if we have concurrent sharing mode 5113- intel/compiler/xe2: Handle 6-bit message type for Gfx20+ 5114- intel/compiler: Add texture operation lowering pass 5115- intel/compiler: Use nir_tex_src_backend1 to pack LOD and array index 5116- nir: Drop intel specific lowering code 5117- intel/compiler: Lower texture operation to combine LOD and AI 5118- intel/dev: Update max_subslices_per_slice comment 5119- intel/compiler: Fix disassembly of URB message descriptor on Xe2+ 5120- anv: Drop warnings for engine initialization failure 5121- anv: Set timestampValidBits to 64bits 5122- intel/compiler: Trim vector properly till array index 5123- intel/compiler: Adjust sample_b parameter according to new layout 5124- intel/compiler: Pack LOD/bias and array index on TG4 messages 5125- intel/compiler: Pack texture LOD and offset to a single 32-bit value 5126- intel/compiler: Add helper method to decide if header is required 5127- intel/compiler: Add gather4_i/l/[_c]/b sampler message 5128- intel/compiler: Add texture gather offset LOD/Bias message support 5129- nir: Allow nir_texop_tg4 in implicit derivative 5130- intel/compiler: Enable packing of offset with LOD or Bias 5131- anv: Implement VK_AMD_texture_gather_bias_lod 5132- anv/xe: Consider pat_index while unbinding the bo 5133- anv: Fix typo in DestinationAlphaBlendFactor value 5134- anv: Use appropriate argument format for indirect draw 5135- isl: Update isl_swizzle_supports_rendering comment 5136- isl: Update shader channel select for missing components 5137- intel/compiler: Disassemble mlen/rlen/ex_mlen in units of registers 5138 5139Saleemkhan Jamadar (1): 5140 5141- radeonsi/vcn: set jpeg reg version for gfx 1151 5142 5143Samuel Pitoiset (419): 5144 5145- radv: constify stages in radv_rt_fill_group_info() 5146- radv/rt: re-use radv_ray_tracing_stage::sha1 for hashing RT pipelines 5147- radv: correctly return VK_ERROR_OUT_OF_DEVICE_MEMORY when mapping a BO fails 5148- radv/nir: pass radv_shader_stage to some radv_nir_xxx() functions 5149- radv/nir: remove useless struct for nir_shader typedef 5150- radv: remove one unused parameter in radv_fill_shader_info_ngg() 5151- radv: move radv_pipeline_key::mesh_fast_launch_2 to the per-device cache key 5152- radv: add radv_shader_stage_key to radv_shader_stage 5153- radv: use radv_shader_stage_key directly with pre-existing fields 5154- radv: add optimisations_disabled to radv_shader_stage_key 5155- radv: remove unecessary radv_nir_compiler_options::key 5156- radv: remove unused lower_rt_instruction_monolithic_state::key 5157- radv: stop passing the pipeline key when compiling compute/rt shaders 5158- radv: re-organize radv_pipeline_key 5159- radv: add vertex_robustness1 to radv_shader_stage_key 5160- radv: introduce radv_graphics_state_key 5161- zink/ci: skip more arb_shader_image_load_store.* on Polaris10/Navi10 5162- radv: add keep_statistic_info to radv_shader_stage_key 5163- radv: add shader_version to radv_shader_stage_key 5164- radv: pass radv_shader_stage_key to radv_pipeline_stage_init() 5165- radv: make sure to retain shaders key for imported shaders with GPL 5166- radv: cleanup radv_generate_pipeline_key() 5167- radv: add radv_pipeline_get_shader_key() 5168- radv/rt: cleanup radv_parse_rt_stage() 5169- radv: hash radv_shader_stage_key 5170- radv: stop hashing radv_pipeline_key for compute/rt pipelines 5171- radv: remove the pipeline key for compute pipelines 5172- radv: remove the pipeline key for ray tracing pipelines 5173- radv: remove an extra new line in radv_shader.h 5174- radv: pass radv_graphics_state_key to radv_hash_shaders() 5175- radv: remove radv_generate_pipeline_key() 5176- radv: rename radv_pipeline_key to radv_graphics_pipeline_key 5177- radv: delay emitting streamout enable at draw time 5178- aco: silent checking if clrxdisasm is available 5179- radv: fix indirect dispatches on the compute queue on GFX7 5180- radv: fix indirect draws with NULL index buffer on GFX10 5181- radv: remove unused parameter to gather_shader_info_mesh() 5182- radv: add a per-stage key field for mesh shaders with a task shader 5183- vulkan: bump headers/registry to 1.3.276 5184- lavapipe: fix build since vulkan spec update 5185- vulkan: promote VK_EXT_line_rasterization to KHR 5186- vulkan: promote VK_EXT_index_type_uint8 to KHR 5187- radv: add a helper for binding the custom blend mode 5188- radv: add a helper to get the VGT_GS_OUT value 5189- radv: prevent accessing NULL pipelines when emitting VBO with ESO 5190- radv: re-emit the TCS epilog when a new TCS is bound 5191- radv: enable prologs/epilogs in-memory cache for shader objects 5192- radv: add required NV entrypoints for VK_EXT_shader_object 5193- radv: initialize default dynamic state when beginning a new cmdbuf 5194- radv: add radv_shader_stage::next_stage field 5195- radv: add radv_shader_layout::dynamic_offset_count 5196- radv: add support for creating/destroying shader objects 5197- radv: make some pipeline graphics helpers non-static for ESO 5198- radv: add support for binding/emitting shader objects 5199- radv: advertise VK_EXT_shader_object on GFX6-8 5200- radv: advertise VK_KHR_load_store_op_none 5201- radv: promote VK_EXT_line_rasterization to KHR 5202- radv: advertise VK_KHR_line_rasterization 5203- radv: promote VK_EXT_index_type_uint8 to KHR 5204- radv: advertise VK_KHR_index_type_uint8 5205- radv: use device->vk.enabled_features instead of iterating twice 5206- radv: fix segfault when getting device vm fault info 5207- radv/ci: enable RADV_PERFTEST=shader_object for vkcts-polaris10-valve 5208- radv: refactor gfx103_pipeline_emit_vgt_draw_payload_cntl() 5209- radv: refactor gfx103_pipeline_emit_vrs_state() 5210- radv: use the non-emitted graphics pipeline for the needed dynamic states 5211- radv: fix the late scissor emission workaround with ESO on GFX9 5212- radv: set NGG fields in vgt_shader_key for ESO on GFX10+ 5213- radv: do not ignore RADV_DYNAMIC_FRAGMENT_SHADING_RATE for ESO on GFX10.3+ 5214- radv: emit more default states for ESO on GFX10.3+ 5215- radv: export alpha-to-coverage via MRTZ for ESO on GFX11 5216- radv: fix detecting invalid binaries with ESO 5217- radv: fix emitting tess domain origin for merged TES+GS on GFX9 5218- radv: emit required programming for tess on GFX10+ in radv_emit_hw_vs() 5219- radv: rebind mesh/task shaders when restoring meta context 5220- radv: determine next stage for mesh/task with ESO 5221- radv: ignore unneeded dynamic states with mesh shaders and ESO 5222- radv: determine the last VGT api stage with mesh shaders and ESO 5223- radv: bind and emit mesh/task shaders with ESO 5224- radv: prevent crashes when a task shader is compiled unlinked with ESO 5225- radv: init the shader key in radv_shader_stage_init() for ESO 5226- radv: add support for VK_SHADER_CREATE_NO_TASK_SHADER_BIT_EXT 5227- radv: add a helper to know if device fault detection is enabled 5228- radv: refactor dumping GPU hang reports by using chunks 5229- radv: add support for keeping GPU hang reports in memory 5230- radv: export GPU hang reports through VK_EXT_device_fault 5231- radv: enable deviceFaultVendorBinary if RADV_DEBUG=hang is set 5232- radv: remove radv_graphics_state_key::dynamic_patch_control_points 5233- radv: determine the workgroup size for TCS earlier 5234- radv: set the default workgroup size for VS as LS 5235- radv: constify radv_device in radv_emit_shader_pointer() 5236- radv: check active NIR stages before trying to merge shaders on GFX9+ 5237- radv: only merge shader info stages if both stages exist on GFX9+ 5238- radv: rework shader arguments for separate compilation of VS+TCS on GFX9+ 5239- radv: always mark drawid/base_instance used with ESO 5240- radv: force TCS stage for VS as LS compiled separately on GFX9+ 5241- radv: always emit PGM_RSRC1_HS when emitting the TCS epilog state 5242- radv: add support for emitting VS+TCS compiled separately on GFX9+ 5243- radv: do not allow to enable VK_EXT_shader_object with LLVM 5244- radv: add a workaround for mipmaps and minLOD on GFX6-8 5245- radv/sqtt: fix describing queue submits for RGP 5246- radv: limit maxIndirectCommandsTokenCount to 512 5247- radv: remove one indentation level in radv_fill_shader_info_ngg() 5248- radv: squash GFX10/GFX10.3 NGG restrictions in the same condition 5249- radv: always set GS as NGG if present on GFX11 5250- radv: use next_stage to determine the NGG stage 5251- radv: check for MESA_SHADER_TESS_EVAL in radv_fill_shader_info_ngg() 5252- radv: determine the ES stage earlier when processing binary config 5253- radv: determine the workgroup size for GS non-NGG earlier 5254- radv: set the default workgroup size for VS/TES as ES 5255- radv: change the user SGPR idx of AC_UD_TES_STATE 5256- radv: add a new user SGPR for the ESGS ring item size 5257- radv/nir: lower esgs_vertex_stride for GS compiled separately on GFX9+ 5258- radv: rework shader arguments for separate compilation of VS+GS on GFX9+ 5259- radv: declare streamout buffers for VS+GS compiled separately on GFX9+ 5260- radv: force GS stage for VS as ES compiled separately on GFX9+ 5261- radv: add support for emitting VS+GS compiled separately on GFX9+ 5262- radv/ci: remove VKD3D_CONFIG=dxr11 for navi21/navi31 5263- radv: remove unused radv_indirect_command_layout::state_offset 5264- radv: only load 3x32-bit elements when emitting draws with mesh shader 5265- docs: fix RADV_DEBUG=nonggc description 5266- radv: add RADV_DEBUG=nongg_gs for GFX10/GFX10.3 5267- radv: add radv_disable_ngg_gs and enable it for Persona 3 Reload 5268- radv: fix RGP barrier reason for RP barriers inserted by the runtime 5269- radv: force GS stage for TES as ES compiled separately on GFX9+ 5270- radv: declare streamout buffers for TES+GS compiled separately on GFX9+ 5271- radv: declare AC_UD_TES_STATE for separate compilation of GS on GFX9+ 5272- radv: bind the vertex input SGPR only for relevant stages 5273- radv: add support for emitting TES+GS compiled separately on GFX9+ 5274- radv: allow RADV_PERFTEST=shader_object on GFX9/VEGA10 5275- radv/ci: enable RADV_PERFTEST=shader_object on VEGA10 5276- radv: cleanup radv_shader_combine_cfg_vs_tcs() 5277- radv: fix emitting VS prologs for merged shaders compiled separately on GFX10+ 5278- radv: clear RADV_CMD_DIRTY_SHADERS when resetting the shader object state 5279- radv: clear the custom blend mode when resetting gfx pipeline state 5280- radv: fix re-emitting DB_RENDER_CONTROL when resetting gfx pipeline state 5281- radv: make sure to reset the GS copy shader with ESO 5282- radv: fix selecting shader variants with ESO 5283- radv: fix setting the rasterized primitive for ESO 5284- radv: enable GS_FAST_LAUNCH=2 by default for RDNA3 APUs (Phoenix) 5285- radv: only configure {XYZ_DIM,DRAW_INDEX}_REG for mesh shaders if enabled 5286- radv: re-enable GS_FAST_LAUNCH=2 by default on GFX11 5287- radv: stop using conditional rendering internally when preprocessing DGC 5288- radv: disable conditional rendering if enabled when preprocessing DGC 5289- radv: pass the ES stage when emitting geometry shader with ESO 5290- radv: determine the ES stage for merged NGG shaders compiled separately 5291- radv: prefix radv_vgt_shader_key::streamout with ngg 5292- radv: set radv_vgt_shader_key::ngg_streamout for ESO 5293- radv: determine the number of invocations only for VS/TES as NGG 5294- radv: store the number of outputs for VS/TES as NGG 5295- radv: use radv_shader_info for computing NGG LDS layout 5296- radv: fix a compilation warning in radv_bind_graphics_shaders() 5297- spirv: only consider IO variables when adjusting patch locations for TES 5298- radv: move mesh_fast_launch_2 to radv_physical_device 5299- radv: initialize disk cache slightly later when creating a physical device 5300- radv: introduce a per physical device cache key 5301- radv: fix binary shaders compatibility with ESO 5302- radv: fix indirect dispatches on compute queue with conditional rendering on GFX7 5303- radv: remove the union in radv_shader_object 5304- radv: fix a big memleak with VK_EXT_shader_object 5305- radv: free NIR shaders when creating linked shaders with ESO 5306- radv: simplify binding the GS copy shader with ESO 5307- radv: rename radv_emit_shaders() to radv_emit_graphics_shaders() 5308- radv: simplify emitting VGT_ESGS_RING_ITEMSIZE for ESO 5309- radv: re-emit more states when a shader compiled separately is bound 5310- radv: only enable emulated mesh/task shader invocations on GFX10.3 5311- radv: add support for mesh primitives queries on GFX11 5312- radv: add support for task shader invocations queries on GFX11 5313- radv: remove a TODO about adding mesh/task queries on GFX11 5314- radv: store/reset conditional rendering user info in the helpers 5315- radv: add support for conditional rendering on the compute queue with DGC 5316- radv: remove unused parameter in gfx10_get_ngg_query_info() 5317- radv: do not set gs.has_pipeline_stat_query twice for NGG GS 5318- radv: use so.num_outputs to determine if NGG shaders need XFB queries 5319- radv: determine NGG query info before linking shader info 5320- radv: pass gfx10_ngg_info to gfx10_get_ngg_info() 5321- radv: pass radv_shader_info to gfx10_get_ngg_info() 5322- radv: determine NGG culling info before NGG info 5323- radv: compute NGG scratch LDS base in gfx10_get_ngg_info() 5324- radv: compute the total LDS usage in gfx10_get_ngg_info() 5325- radv: disable VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 with minmax filter on GFX6 5326- radv/ci: enable RADV_PERFTEST=shader_object for vkcts-tahiti-valve 5327- radv: clean up MAX_ALLOWED_TILES_IN_WAVE programming 5328- radv: add missing RADV_DEBUG_NO_NGG_GS to the physical device cache key 5329- radv: fix conditional rendering with direct mesh+task draws and multiview 5330- radv: move conditional rendering for compute in radv_cmd_state 5331- radv: get the pipeline layout info from the push constant token with DGC 5332- radv: add a helper to calculate the compute resource limits 5333- radv: add a function to get compute pipeline metadata for DGC 5334- radv: add support for VK_PIPELINE_CREATE_INDIRECT_BINDABLE_BIT_NV 5335- radv: implement vkGetPipelineIndirectXXX() for DGC 5336- radv: implement vkCmdUpdatePipelineIndirectBufferNV() 5337- radv: implement indirect compute pipeline binds with DGC 5338- radv: handle indirect pipeline binds with scratch and DGC 5339- radv: force shader BOs to be local BOS with DGC indirect compute pipelines 5340- radv: enable deviceGeneratedComputePipelines 5341- radv: fix conditional rendering on compute queue on GFX6 5342- radv: add missing conditional rendering for indirect dispatches on GFX6 5343- radv: add a helper to emit PKT3_COND_EXEC 5344- radv: add a new user SGPR for NGG shaders compiled separately with ESO 5345- radv: lower lds_ngg_{gs_out_vertex_base,_scratch_base} with ESO 5346- radv: add support for emitting NGG shaders with ESO 5347- radv: allow RADV_PERFTEST=shader_object on GFX11 5348- radv: enable radv_zero_vram for RAGE2 5349- radv: preserve streamout_buffers user SGPR for VS/TES + GS compiled separately 5350- radv: always use ace_cs for the gang CS variable 5351- radv: refactor emitting the view index for task shaders 5352- radv: allocate a 32-bit value for the MEC fw bug with indirect mesh+task earlier 5353- radv: stop passing radv_cmd_buffer to draw functions with task shaders 5354- radv/ci: remove RT tests from the VANGOGH skip list 5355- radv/ci: remove dEQP-VK.robustness.* from the VANGOGH skip list 5356- radv: disable NGG in more situations with ESO on GFX10/GFX10.3 5357- radv: implement has_vgt_flush_ngg_legacy_bug for ESO 5358- radv: allow RADV_PERFTEST=shader_object on GFX10/GFX10.3 5359- radv,aco: stop duplicating PS/TCS epilog fields 5360- radv: add a helper to emit PS/TCS epilogs 5361- radv/ci: enable RADV_PERFTEST=shader_object for VKCTS jobs on GFX10+ 5362- radv/ci: mark dEQP-VK.shader_object.binding.mesh_swap_task as flake on NAVI21 5363- radv: stop using the custom blend mode for PS epilogs 5364- radv: re-emit RB+ state with PS epilogs only when the col format changes 5365- radv: fix emitting default blend state for PS without epilogs and ESO 5366- radv: allow RADV_PERFTEST=shader_object on all GFX9 GPUs 5367- radvi/ci: enable RADV_PERFTEST=shader_object for RENOIR 5368- util/u_debug: fix parsing of "all" again 5369- ci: use Linux kernel 6.6 for RADV 5370- radv/ci: update list of flakes for VKCTS jobs 5371- zink/ci: update list of flakes for RADV jobs 5372- ac/nir: fix exporting NGG streamout outputs with implicit PrimId from VS/TES 5373- radv: fix determining if PrimId is used for merged shaders compiled separately 5374- radv: determine radv_vgt_shader_key::has_ngg_xxx with the last VGT shader 5375- radv: rework generating vgt_shader_key for pipelines 5376- radv: fix wave32 support with ESO 5377- radv: add helpers to bind the GS copy shader and the RT prolog 5378- radv: fix RADV_PERFTEST=dmashaders with ESO 5379- radv: emit VGT_GS_OUT_PRIM_TYPE as part of the dynamic primitive topology 5380- radv: disable binning correctly on GFX11.5 5381- radv: fix programming of PA_SC_BINNER_CNTL_1.MAX_ALLOC_COUNT on GFX11 5382- radv: program SAMPLE_MASK_TRACKER_WATERMARK optimally for GFX11 APUs 5383- radv: add a GPU hang workaround for legacy tess+GS for GFX10.3 5384- radv: fix occlusion queries with MSAA and no attachments 5385- radv: add radv_force_pstate_peak_gfx11_dgpu and enable it for Helldivers 2 5386- zink/ci: enable RADV_PERFTEST=shader_object for polaris10 5387- radv: add a workaround for null IBO on GFX6 5388- zink/ci: update CI lists 5389- radv: always export MRTZ in FS epilogs with ESO on GFX11 5390- radv: trigger a new PS epilog when the framebuffer is dirty with ESO 5391- zink/ci: allow RADV_PERFTEST=shader_object on NAVI31 5392- radv: invalidate L2 metadata for VK_ACCESS_2_MEMORY_READ_BIT 5393- radv: make sure to disable NGG culling with TES when the FS stage is unknown 5394- zink/ci: enable RADV_PERFTEST=shader_object for NAVI10/VANGOGH 5395- radv/rmv: add missing logging when sparse BOs are destroyed 5396- radv/rmv: add missing logging when events are destroyed 5397- radv/rmv: fix logging of per-queue destroyed BOs 5398- radv/rmv: fix logging sparse residency 5399- radv/winsys: move BO size to radeon_winsys_bo 5400- radv/rmv: remove BO size parameter in radv_rmv_log_bo_allocate() 5401- radv: make some create resources helpers static 5402- radv/rmv: remove unnecessary is_internal parameter to some helpers 5403- radv: add radv_bo_{create,destroy}() helpers 5404- radv/rmv: prevent logging BOs allocated in GDS/OA domains 5405- radv/rmv: log allocated/destroyed BOs in radv_buffer_{create,destroy}() 5406- radv: add radv_bo_virtual_bind() helper 5407- radv: fix conditional rendering with mesh+task and multiview (again) 5408- radv: remove useless RADV_DEBUG=nomemorycache 5409- radv: implement alpha-to-one 5410- radv: advertise alphaToOne 5411- radv: advertise extendedDynamicState3AlphaToOneEnable with ACO 5412- docs: add alpha-to-one features for RADV 5413- radv: rename radv_physical_device variables to pdev everywhere 5414- radv/winsys: rename gpu_info to pci_ids in the null winsys 5415- radv: rename radeon_info variables to gpu_info everywhere 5416- radv: rename radv_physical_device::rad_info to info 5417- radv: remove radv_device::physical_device 5418- radv: remove radv_queue::device 5419- radv: remove radv_cmd_buffer::device 5420- radv: remove radv_device::instance 5421- radv: remove radv_physical_device::instance 5422- radv: declare radv_cmd_update_descriptor_xxx() in radv_descriptor_set.h 5423- radv: declare format related functions in radv_formats.h 5424- radv: pass a radv_physical_device to radv_use_llvm_for_stage() 5425- radv: move radv_device_supports_etc() to radv_physical_device.c 5426- radv: move some VK_DEFINE_NONDISP_HANDLE_CASTS to radv_descriptor_set.h 5427- radv: add radv_sampler.h 5428- radv: add radv_event.h 5429- radv: add radv_buffer_view.h 5430- radv: add radv_buffer.h 5431- radv: add radv_video.h 5432- radv: add radv_image.h 5433- radv: add radv_image_view.h 5434- radv: add radv_query.h 5435- radv: add radv_perfcounter.h 5436- radv: add radv_device_generated_commands.h 5437- radv: enable radv_zero_vram for Red Dead Redemption 2 5438- vulkan/debug_utils: add a helper for reporting address binding 5439- radv: implement VK_EXT_device_address_binding_report 5440- radv: advertise VK_EXT_device_address_binding_report 5441- radv: move radv_prim_vertex_count to si_cmd_buffer.c 5442- radv: move radv_userdata_locations to radv_shader_args.h 5443- radv: move radv_shader_{layout,stage} to radv_shader.h 5444- radv: add radv_device_memory.h 5445- radv: add radv_instance.h 5446- radv: add radv_queue.h 5447- radv: add radv_physical_device.h 5448- radv: add radv_rra.h 5449- radv: add radv_device.h 5450- radv: add radv_pipeline_cache.h 5451- radv: add radv_pipeline.h 5452- radv: add radv_pipeline_compute.h 5453- radv: add radv_pipeline_rt.h 5454- radv: add radv_pipeline_graphics.h 5455- radv: add radv_wsi.h 5456- radv: add radv_sqtt.h 5457- radv: add radv_shader_object.h 5458- radv: add radv_spm.h 5459- radv: add radv_cmd_buffer.h 5460- radv: add radv_rmv.h 5461- radv: add radv_cp_reg_shadowing.h 5462- radv: add radv_printf.h 5463- radv: move radv_get_tdr_timeout_for_ip() to radv_query.h 5464- radv: move radv_queue_ring() to radv_queue.c 5465- radv: add radv_nir_to_llvm.h 5466- radv: add radv_android.h 5467- radv: add radv_shader_info.h 5468- radv: move CP DMA related code to radv_cp_dma.c/h 5469- radv: move more cmd buffer related code to radv_cmd_buffer.c 5470- radv: merge radv_write_guardband() with radv_emit_guardband_state() 5471- radv: merge radv_write_scissors() with radv_emit_scissor() 5472- radv: move radv_get_viewport_xform() to radv_pipeline_graphics.c 5473- radv: move radv_create_gfx_config() to radv_device.c 5474- radv: move radv_emit_{compute,graphics}() to radv_queue.c 5475- radv: move code related to sample positions to radv_device.c 5476- radv: rename si_cmd_buffer.c to radv_cs.c 5477- radv: remove unused radv_printflike() 5478- radv: remove pre-declarations needed for WSI entrypoints 5479- radv: remove remaining forward declarations and comments in radv_private.h 5480- radv: replace RADV_FROM_HANDLE by VK_FROM_HANDLE 5481- radv: add missing endif comment for some headers 5482- radv: rename remaining phys_dev occurrences to pdev 5483- radv: replace radv_minify() by u_minify() 5484- radv: replace align_{u32,u64}() by align{64}() 5485- radv: replace align_u32_npot() by ALIGN_NPOT 5486- radv: replace radv_float_to_{u,s}fixed() by util_{un}signed_fixed() 5487- util: add util_is_aligned() 5488- radv: replace radv_is_aligned() by util_is_aligned() 5489- radv: move RADV_SUPPORT_CALIBRATED_TIMESTAMPS to radv_physical_device.c 5490- radv: move RADV_API_VERSION to radv_instance.h 5491- radv: move CLOCK_MONOTONIC_RAW define to radv_physical_device.h 5492- radv: move RADV_USE_WSI_PLATFORM define to radv_wsi.h 5493- radv: remove radv_private.h 5494- radv: make radv_get_vgt_index_size() static 5495- radv: move radv_get_user_sgpr() to radv_shader.c 5496- radv: move radv_queue_family_to_ring() to radv_queue.c 5497- radv: remove old comment in radv_cs.c 5498- radv: move radv_printf_data to radv_printf.h 5499- radv: make sure the heap budget is less than or equal to the heap size 5500- radv: use SPDX-License-Identifier 5501- radv: enable VK_EXT_shader_object by default 5502- aco: use SPDX-License-Identifier 5503- ci: uprev vkd3d-proton to c3b385606a93baed42482d822805e0d9c2f3f603 5504- docs: mark VK_KHR_maintenance6 as DONE for RADV 5505- radv: determine if the cache is disabled at device creation time 5506- radv: add skip_shaders_cache also for compute/rt pipelines 5507- radv: stop using a graphics pipeline for generating the graphics key 5508- radv/rt: constify device in radv_init_rt_stage_hashes() 5509- radv/rt: handle creation feedback like graphics/compute pipelines 5510- radv/rt: stop passing pCreateInfo to radv_ray_tracing_pipeline_cache_search() 5511- radv/rmv: fix missing image bind logging for WSI images 5512- radv: fix missing addr binding report for WSI image binds 5513- radv: fix addr binding report for disjoint image binds 5514- radv/rmv: fix image binds logging for disjoint images 5515- radv: add a helper to set image bindings 5516- radv: fix missing unbind report when an image is destroyed 5517- radv: fix missing unbind report when a buffer is destroyed 5518- radv/rt: remove dead code about intersection shaders in radv_pipeline_get_shader_key() 5519- radv: add a helper for hashing pipelines 5520- radv: rework and add a helper for hashing a compute pipeline 5521- radv: stop ignoring shader stages that don't need to be imported with GPL 5522- radv: add missing SQTT markers when an indirect indexed draw is used with DGC 5523- radv/rt: use radv_pipeline_hash_shader_stage() 5524- radv/rt: stop computing unused hash for the traversal shader 5525- radv: use canonicalized VA for VM fault reports 5526- radv: simplify importing pipeline layout with GPL 5527- radv: return early when PS is NULL in radv_pipeline_init_blend_state() 5528- radv: simplify checking for PS epilogs in radv_pipeline_init_blend_state() 5529- radv: remove unused parameter in radv_skip_graphics_pipeline_compile() 5530- radv: simplify the check for exporting multiview in the last VGT stage 5531- radv/rt: remove unnecessary pipeline parameter to radv_rt_fill_group_info() 5532- radv/rt: remove unnecessary pipeline parameter to radv_generate_rt_shaders_key() 5533- radv/rt: initialize shader group capture/replay in a separate function 5534- radv/rt: rework handle_from_stages to pass hashes directly 5535- radv/rt: insert shaders to cache right after they are compiled 5536- radv/rt: add radv_rt_pipeline_compile() 5537- radv: clear color attachments without exports before compaction 5538- ci: uprev CTS to vulkan-cts-1.3.8.0 5539- radv/ci: add one more flake since CTS 1.3.8.0 for RENOIR 5540- radv/ci: update lists for TAHITI and Zink/Polaris10 5541- radV/ci: reduce the parallelism of navi21 to 3 5542- radv: fix waiting for occlusion queries on GFX6-8 5543- radv: return per plane requirements for disjoint images 5544- zink/ci: update CI lists since piglit uprev 5545- radv/rt: remove unnecessary param to radv_ray_tracing_pipeline_cache_insert() 5546- radv/rt: move radv_ray_tracing_pipeline::sha1 to radv_pipeline 5547- radv: use radv_pipeline::sha1 for graphics/compute pipelines 5548- radv: rework pipeline cache search helpers 5549- radv: add RADV_DEBUG=psocachestats to report per-pipeline cache hits/misses 5550- vulkan: pass cmdbuf level to vk_command_buffer_ops::create() 5551- radv/amdgpu: do not use IB2 for nested command buffers 5552- radv: track if nested command buffers uses indirect draws 5553- radv: advertise VK_EXT_nested_command_buffer 5554- ac,radeonsi: add helpers to compute the number of tess patches/lds size 5555- radv: rework the number of tess patches computation 5556- ac: allow to use 64K of LDS for tessellation on GFX9+ 5557- ci: uprev CTS to 1.3.8.2 5558- radv: fix image format properties with fragment shading rate usage 5559- radv: remove bogus VkShaderCreateInfoEXT::flags being 0 assert for compute 5560- radv: allow 3d views with VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT 5561- radv: mark some formats as unsupported on GFX8/CARRIZO 5562- radv: set image view descriptors as buffer for non-graphics GPU 5563- radv: only set ALPHA_IS_ON_MSB if the image has DCC on GFX6-9 5564 5565Sathishkumar S (5): 5566 5567- ac/gpu_info: query the number of ip instance 5568- radeonsi/vcn: avoid hard-coding the number of jpeg instance 5569- ac/gpu_info: fix regression in vulkan hw decode 5570- radeonsi/vcn: use num_instances from radeon_info 5571- ac/gpu_info: update multimedia info 5572 5573Sean Anderson (2): 5574 5575- gallium: lima: Don't round height when checking alignment 5576- Add Xilinx ZynqMP KMSRO entrypoint 5577 5578Sebastian Wick (4): 5579 5580- radeonsi: Destroy queues before the aux contexts 5581- util: Add timespec_sub_saturate to avoid negative time for deadlines 5582- loader/wayland: Add fallback wl_display_dispatch_queue_timeout 5583- vulkan/wsi/wayland: Use dispatch_queue_timeout in acquire_next_image 5584 5585Sergi Blanch Torne (10): 5586 5587- ci: disable Collabora's farm due to maintance 5588- Revert "ci: disable Collabora's farm due to maintance" 5589- ci: disable Collabora's farm due to maintance 5590- Revert "ci: disable Collabora's farm due to maintance" 5591- ci: Nightly run expectations update 5592- ci: disable Collabora's farm due to maintance 5593- Revert "ci: disable Collabora's farm due to maintance" 5594- Uprev Piglit to f7ece74a107a2f99b2f494d978c84f8d51faa703 5595- ci: kernel stored in a different s3 bucket 5596- ci: identify and label S3 buckets 5597 5598Shih, Jude (4): 5599 5600- amd/vpelib: Need a debug flag to support 2tap downscaling 5601- amd/vpelib: Add VPE prefix on API to avoid naming conflict 5602- amd/vpelib: Solve the downscaling problem for 2 tap 5603- amd/vpelib: Solve link error due to missing static for one function 5604 5605Sil Vilerino (25): 5606 5607- d3d12: Do not assume multi-subregion support when querying for driver encode support 5608- d3d12: Implement cap for PIPE_VIDEO_CAP_ENC_INTRA_REFRESH 5609- d3d12: Clean up H264 video decode interlaced code path 5610- frontends/va, d3d12: Fix PIPE_VIDEO_SLICE_MODE_MAX_SLICE_SICE -> PIPE_VIDEO_SLICE_MODE_MAX_SLICE_SIZE typo 5611- d3d12: d3d12_video_encoder_negotiate_current_h264_slices_configuration to use correct mode when intra-refresh is on 5612- d3d12: Do not use PIPE_BIND_DISPLAY_TARGET for d3d12_video_buffer 5613- d3d12: AV1 encode - Configure CQP using qp and new qp_inter parameters 5614- d3d12: H264 encode - Update CQP using current frame type as per VA frontend change 5615- d3d12: HEVC encode - Update CQP using current frame type as per VA frontend change 5616- frontend/va: Support media only post proc without compositor using shaders or surfaces 5617- frontend/va: Use get_resources in VaDeriveImage for media only devices without get_surfaces support 5618- d3d12: Add partial media, compute, graphics support with CORE and GENERIC feature levels 5619- d3d12: Refactor graphics functions from context and blit to separate files 5620- d3d12: Add GetDesc wrapper for ID3D12Heap 5621- d3d12: Only check D3D12_FEATURE_DATA_PLACED_RESOURCE_SUPPORT_INFO for D3D_FEATURE_LEVEL_1_0_GENERIC 5622- d3d12: Video Encode - Add driver workaround for rate control reconfiguration 5623- d3d12: Implement PIPE_VIDEO_CAP_ENC_SURFACE_ALIGNMENT 5624- d3d12: Bump directx-headers dependency to v613 5625- d3d12: Support H264 slice L0/L1 active number override 5626- d3d12: Support HEVC slice L0/L1 active number override 5627- d3d12: Fix leak of batch->bos on video-only builds 5628- d3d12: Fix leak dxil_module::serialized_dependency_table 5629- d3d12: Fix util_blitter_destroy destruction ordering 5630- vl_win32_screen_create: Take ownership of winsys injected to created d3d12_screen 5631- d3d12/ci: Add vainfo with appverifier CI check 5632 5633Simon Ser (2): 5634 5635- egl/wayland: ensure wl_drm is available before use 5636- egl/wayland: explain why implicit modifier downgrade is allowed 5637 5638Stéphane Cerveau (1): 5639 5640- vulkan/video: hevc: b-frames can be reference or not 5641 5642Surafel Assefa (1): 5643 5644- radeonsi: Adds return on failure to get plane info 5645 5646Sviatoslav Peleshko (6): 5647 5648- nir: Use alu source components count in nir_alu_srcs_negative_equal 5649- anv,driconf: Add sampler coordinate precision workaround for AoE 4 5650- driconf: Apply dual color blending workaround to Dying Light 5651- anv: Store host-located copy of NULL surface state for faster memcpy 5652- dri: Flush the context after flush_resource when creating shareable image 5653- anv: Fix descriptor sampler offsets assignment 5654 5655Tapani Pälli (35): 5656 5657- hasvk: remove cmd_buffer_ray_query_globals function decl 5658- hasvk: remove gfx9 specific code from emit_sample_pattern 5659- hasvk: remove softpin (GFX_VERx10 >= 90) related code 5660- hasvk: remove gfx9 specific cs stall from emit_ps_depth_count 5661- anv: check for wa 16013994831 in emit_so_memcpy_end 5662- iris: expand pre-hiz data cache flush to gfx >= 125 5663- anv: expand pre-hiz data cache flush to gfx >= 125 5664- iris: replace constant cache invalidate with hdc flush 5665- anv: move \*bits_for_access_flags to genX_cmd_buffer 5666- anv: use workaround framework for Wa_22018402687 5667- intel/blorp: add a TODO note about stencil buffer resolve 5668- intel: refactor urb configuration, add intel_urb_config 5669- intel/common: provide a helper for urb setup comparison 5670- blorp/crocus: refactor blorp_emit_urb_config 5671- iris: implement Wa_16014912113 5672- anv: implement Wa_16014912113 5673- blorp: implement Wa_16014912113 callback for drivers 5674- anv: flush tile cache independent of format with HIZ-CCS flush 5675- anv: revert cache flushing changes for indirect commands 5676- intel/blorp: disable use of REP16 independent of format 5677- iris: make sure DS and TE are sent in pairs on >= gfx125 5678- iris: make sure aux is disabled for external objects 5679- anv: make sure aux is disabled for memory objects 5680- hasvk: make sure aux is disabled for memory objects 5681- crocus: make sure aux is disabled for memory objects 5682- anv: use workaround framework for Wa_16013000631 5683- anv: setup distribution granularity with Wa_14019166699 5684- iris: refactor function that checks primitive id usage 5685- iris: setup distribution granularity with Wa_14019166699 5686- anv: disable fcv optimization on >= gfx125 5687- intel/blorp: remove unused blorp batch flag 5688- intel/compiler: add assert for Wa_22017182272 5689- anv: add dirty tracking for push constant data 5690- iris: change stream uploader default size to 2MB 5691- anv: skip gfx push constants alloc optimization on gfx9/11 5692 5693Tatsuyuki Ishi (8): 5694 5695- radv: Recompute max_waves after postprocessing RT config 5696- radv: never set DISABLE_WR_CONFIRM for CP DMA clears and copies 5697- util: Optimize mesa_hex_to_bytes 5698- radv: Add radv_spirv_to_nir_options that summarize early gfx states. 5699- radv: Rename cache_search_nir to cache_lookup_nir_handle. 5700- radv: Re-sort RADV_PERFTEST env vars in docs 5701- radv: Implement NIR caching behind RADV_PERFTEST=nircache. 5702- radv: Remove radv_queue::device again 5703 5704Teng, Jin Chung (1): 5705 5706- d3d12: HEVC Encode - Query slice config mode based on user slice setting 5707 5708Thomas H.P. Andersen (6): 5709 5710- nvk: promote VK_EXT_index_type_uint8 to KHR 5711- nvk: promote VK_EXT_line_rasterization to KHR 5712- nvk: promote load_store_op_none to KHR 5713- docs: update features.txt for nvk 5714- nvk: support driconf option force_vk_vendor 5715- driconf: override vendor id for X4 Foundations on NVK 5716 5717Thong Thai (2): 5718 5719- radeonsi/vcn: remove EFC support for renoir 5720- frontends/va/postproc: do not use efc if image is to be translated 5721 5722Timothy Arceri (78): 5723 5724- glsl: add nir version of validate_geometry_shader_emissions() 5725- glsl: use nir version of geom stream validation 5726- glsl: remove now unused GLSL IR validate_geometry_shader_emissions() 5727- glsl: don't tree graft globals 5728- Revert "ci: Enable GALLIUM_DUMP_CPU=true only in the clang job" 5729- glsl: add basic params for AMD_gpu_shader_half_float extension 5730- glsl: add half float support to the parser 5731- glsl: add explicit half float conversion support 5732- glsl: update assert to allow for half float support 5733- glsl: add glsl_type_is_float_16() helper 5734- glsl: add implicit half float conversions 5735- glsl: add ubo packing support for half floats 5736- glsl: skip conversion of half float back to float for GL queries 5737- glsl: add some new helpers for half float builtin functions 5738- glsl: add half float angle and trigonometry functions 5739- glsl: add half float exponential functions 5740- glsl: add f2f16() helper to ir_builder 5741- glsl: add half float support for common functions 5742- glsl: add support for half float packing functions 5743- glsl: add half float geometric functions 5744- glsl: add half float matrix functions 5745- glsl: add half float vector relational functions 5746- glsl: allow half float varyings 5747- glsl: add half float interpolation functions 5748- glsl: add half float derivative functions 5749- glsl: add half float AMD_shader_trinary_minmax functions 5750- compiler/types: Add a contains_32bit helper 5751- gallium: add PIPE_CAP_FP16 for AMD_gpu_shader_half_float 5752- glsl: add missing error check for half float varying 5753- nir: allow gather info to handle nir_deref_type_array_wildcard 5754- glsl: support array wildcards in lower named interface blocks 5755- glsl: split var copies before lowering named interfaces 5756- glsl: fix potential crash in expression flattening 5757- glsl: move some lowering to the compiler 5758- glsl_to_nir: merge function param handling 5759- glsl_to_nir: support conversion of struct/array function params 5760- glsl_to_nir: support conversion of struct/array function returns 5761- glsl_to_nir: support conversion of opaque function params 5762- glsl: don't inline functions in glsl ir 5763- nir: add some nir_parameter fields 5764- glsl: add missing define to linker_util.h 5765- glsl: add nir version of function recursion detection 5766- glsl: move function inlining out of glsl_to_nir() 5767- glsl: make use of nir recursion detection 5768- glsl: implement nir version of lower discard flow 5769- glsl: make use of nir lower discard flow 5770- glsl: remove now unused glsl ir lower discard pass 5771- glsl: make an explicitly safe version of visit_exec_list() 5772- glsl_to_nir: never convert instructions after jump 5773- glsl: remove unrequired do_lower_jumps() call 5774- glsl: move invariant builtin validation to the nir linker 5775- nir: add max_array_access data field 5776- nir: add implicit_sized_array data field 5777- glsl: add resize_tes_inputs() to the nir linker 5778- nir: add variable field from_ssbo_unsized_array 5779- glsl: don't remove redefined per vertex block 5780- glsl: add nir implemenation of block validation 5781- glsl: switch to NIR block validation 5782- glsl: call new nir resize_tes_inputs() pass 5783- glsl: remove now unused resize_tes_inputs() 5784- glsl: remove now unused glsl ir block validation 5785- glsl: move some linking calls to gl_nir_link_glsl() 5786- glsl: switch verify_subroutine_associated_funcs() to nir 5787- nir: add subroutine fields to nir_function 5788- glsl: move link_assign_subroutine_types() to the nir linker 5789- glsl: move check_explicit_uniform_locations() to NIR linker 5790- glsl: move mode_string() to helper 5791- glsl: add some data members to nir_variable 5792- glsl: make validate_intrastage_arrays() usable across files 5793- glsl: move cross_validate_uniforms() to the nir linker 5794- glsl: use shader info to store gs verts 5795- glsl: use info from shader when linking 5796- glsl: move validate_{stage}_shader_executable() to the nir linker 5797- glsl: remove now unused do_dead_functions() 5798- glsl: remove FragDepthLayout field 5799- glsl: remove ActiveStreamMask field 5800- glsl: remove UsesEndPrimitive field 5801- glsl: inline _mesa_copy_linked_program_data() 5802 5803Timur Kristóf (121): 5804 5805- radv: Correctly select SDMA support for PRIME blit. 5806- nir: Fix divergence of reductions. 5807- nir: Fix divergence analysis of load_patch_vertices_in. 5808- nir: Cleanup divergence analysis for mesh shaders. 5809- nir: Clean up divergence analysis for TES patch input loads. 5810- aco: Eliminate SCC copies when possible. 5811- radv: Lower mesh shader draw ID to zero when they have a task shader. 5812- radv: Extract input and output stride info to new functions. 5813- radv: Use mapped driver locations for determining I/O strides. 5814- aco: Allow passing constant operand to is_overwritten_since. 5815- radv/llvm: Remove dead code. 5816- radv: Allow NGG culling with LLVM. 5817- compiler: Add helper for counting tess level components. 5818- ac/nir/tess: Always record tess level info and use it at the end. 5819- ac/nir/tess: Don't record mapped tess level location. 5820- ac/nir/tess: Split tess factor write into multiple functions. 5821- ac/nir/tess: Emit tess factor output independently of whether it can be passed by registers. 5822- ac/nir/tess: Refactor how the end of HS is emitted. 5823- aco: Use common helper for counting tess level components. 5824- aco: Use tess factors when TCS jumps to epilog. 5825- radv: Declare tess_lvl_in/out args for TCS epilogs. 5826- radv: Always pass tess factors to epilogs in registers. 5827- radv, aco: Delete now dead TCS epilog code. 5828- nir: Add two new AMD specific tess intrinsics. 5829- radeonsi: Implement new intrinsics for monolithic shaders. 5830- radv: Copy TES primitive mode to TCS info. 5831- radv: Implement new tess intrinsics. 5832- radv: Call nir_opt_dead_cf in radv_optimize_nir_algebraic. 5833- ac/nir/tess: Emit tess factor stores based on new intrinsics. 5834- radv: Completely delete TCS epilogs. 5835- radv, aco: Remove the code that jumped to RADV's TCS epilogs. 5836- ac/llvm, radeonsi: Handle tess_rel_patch_id in common code. 5837- radeonsi: Put HS output count in TCS offchip layout, not patch data offset. 5838- radeonsi: Implement dynamic TCS intrinsics for non-monolithic shaders. 5839- radeonsi: Delete TCS epilogs entirely. 5840- aco: Delete all TCS epilog code. 5841- radeonsi: Add number of VS outputs to TCS output layout. 5842- radeonsi: Remove tess bits from VS state. 5843- radeonsi: Use one more bit for number of patches in TCS offchip layout. 5844- ac/nir/tess: Remove dead code that was meant for epilogs. 5845- radv: Add number of LS and HS outputs to tcs_offchip_layout. 5846- radv: Change input patch size in TCS offchip layout to match RadeonSI. 5847- radv: Change number of patches in TCS offchip layout to match RadeonSI. 5848- radv: Include output patch size in TCS offchip layout. 5849- radv: Reuse TCS offchip layout between TCS and TES. 5850- nir/gather_info: Record per-primitive outputs without variables. 5851- nir: Record per-primitive inputs without variables. 5852- nir/recompute_io_bases: Sort per-primitive PS inputs last. 5853- ac/nir: Introduce ac_nir_calc_io_offset_mapped. 5854- ac/nir/tess: Load tess factors from variable when they are passed in registers. 5855- ac/nir/tess: Clarify when a TCS output is stored in LDS or VRAM. 5856- ac/nir/tess: Return undef when loading an unwritten TCS output. 5857- ac/nir/tess: Map TCS LDS IO locations without gaps. 5858- ac/nir/tess: Calculate reserved LDS outputs based on IO info. 5859- ac/nir/tess: Remove superfluous args for reserved TCS outputs. 5860- ac/nir/tess: Clarify when VS-TCS I/O can use registers. 5861- radv: Only add extra dword to LS-HS stride when there are LS outputs. 5862- radv: Pass key structures to gather intrinsic info. 5863- radv: Extract gather_load_vs_input_info function. 5864- radv: Slightly refactor gather_intrinsic_store_output_info. 5865- radv: Record PS input clip/cull mask instead of number. 5866- radv: Use NIR IO semantics to determine GS output info. 5867- radv: Add helper for determining per-attribute vertex buffer descriptors. 5868- radv: Add helper to determine usage of VS prologs. 5869- radv: Remove unused VS input usage mask. 5870- radv: Use NIR IO semantics to determine VS input info. 5871- radv: Use IO semantic location for shader output info. 5872- aco/optimizer_postRA: Remove a check from SCC no-compare optimization. 5873- radv: Use NIR IO semantics to determine FS input info. 5874- radv: Remove I/O variables after nir_lower_io. 5875- radv: Slightly refactor the determination of max_ps_params. 5876- radv: Increase maximum allowed PS params for enabling NGG culling. 5877- radv: Remove unused gfx_level from gfx10_emit_ge_pc_alloc. 5878- ac/nir/ngg: Don't create dummy output variable for primitive ID. 5879- ac/nir/ngg: Use IO semantics for determining instance rate inputs. 5880- ac/nir/ngg: Rename confusing driver_location variable in mesh shader lowering. 5881- radv: Use NIR IO semantics for VS input location mapping. 5882- radv: Don't set driver locations for mesh shaders. 5883- radv: Don't set driver locations for FS outputs. 5884- radv: Don't set driver locations for last pre-rasterization stage. 5885- radv: Keep track of TCS outputs that need LDS. 5886- radv: Remove dead code for creating per-patch IO mask. 5887- radv: Add radv_gather_unlinked_io_mask to shader info header. 5888- radv: Always use fixed I/O locations for TCS outputs in VRAM. 5889- radv: Clean up gathering linked I/O info. 5890- nir/print: Print per-primitive and explicit strict IO info. 5891- nir/recompute_io_bases: Fix per-primitive inputs. 5892- nir/gather_info: Clear per-primitive I/O masks at the beginning. 5893- nir/lower_io_to_scalar: Support explicit (and per-vertex) FS inputs. 5894- nir/lower_io_to_scalar: Support per-primitive outputs. 5895- nir/opt_varyings: Allow optimizing primitive ID for MS -> FS. 5896- nir/opt_varyings: Support per-primitive I/O. 5897- nir/opt_varyings: Fix explicit and per-vertex FS inputs. 5898- nir/opt_varyings: Add early return when producer stage is task. 5899- nir/opt_varyings: Only propagate constant MS outputs, not other uniforms. 5900- nir/opt_varyings: Debug print during relocate_slot. 5901- nir/opt_varyings: Fix relocate_slot so it doesn't mix up 32-bit and 16-bit I/O. 5902- nir/opt_varyings: Add workaround for RADV mesh shader multiview. 5903- ac/nir/ngg: Remove support for loading mesh shader outputs. 5904- ac/nir/ngg: Refactor MS primitive indices for scalarized IO. 5905- ac/nir/ngg: Slightly refactor mesh shader cull flag stores. 5906- ac/nir/ngg: Use just one IO semantics variable in MS output store. 5907- ac/nir/ngg: Refactor update_ms_output_info. 5908- ac/nir/ngg: Refactor MS output store into two functions. 5909- ac/nir/ngg: Split 16-bit MS output stores by components. 5910- ac/nir/ngg: Enable packing 16-bit mesh shader outputs. 5911- radv: Run DCE before deleting I/O variables. 5912- radv: Only consider interpolated inputs as 16-bit float. 5913- radv: Refactor emitting PS input types. 5914- radv: Remove superfluous bool arg from slot_to_ps_input. 5915- radv: Allow using high 16 bits of PS input slots. 5916- radv: Rename per_vertex_shaded_mask to explicit_strict_shaded_mask. 5917- radv: Rename LDS related variables in get_tcs_num_patches. 5918- radv: Calculate VRAM tess patch size independently of LDS size. 5919- ac/nir/tess: Split I/O mapping to two functions. 5920- ac/nir/tess: Use LDS IO mapping when loading tess levels from LDS. 5921- ac/nir/ngg: Implement packed 16-bit VS/TES outputs in non-dedicated slots. 5922- ac/nir/ngg: Implement packed 16-bit GS outputs in non-dedicated slots. 5923- ac/nir/lower_legacy_vs: Implement packed 16-bit VS/TES outputs in non-dedicated slots. 5924- ac/nir/lower_legacy_gs: Implement packed 16-bit GS outputs in non-dedicated slots. 5925- ac/nir/ngg: Fix packing 16-bit MS outputs. 5926 5927Tomeu Vizoso (20): 5928 5929- ci: disable Igalia farm 5930- gallium/util: Fix pipe_buffer_copy 5931- mesa: Import TensorFlow Lite headers 5932- teflon: Initial commit 5933- etnaviv: Update headers from rnndb 5934- etnaviv: Add a bunch of new params for NPUs 5935- etnaviv: Don't emit boilerplate for compute only contexts 5936- etnaviv: Use NN cores to accelerate convolutions 5937- etnaviv: Use TP cores to accelerate tensor transformations 5938- teflon: Add table with known supported models to docs 5939- etnaviv: Don't init the blitter in compute-only contexts 5940- etnaviv/nn: Implement zero run length encoding of weights 5941- teflon: Enable convolutions with number of output channels not divisible by 8 5942- etnaviv/nn: Ensure tile_y is > 0 5943- etnaviv/nn: Fix calculation of remaining out channels 5944- etnaviv/nn: Move unused field to its right place in the struct 5945- etnaviv/nn: Enable image cache 5946- etnaviv/nn: Don't shortcut ZRL bits calculation 5947- etnaviv/nn: Keep track of the sign bit when decrementing to zero 5948- etnaviv/nn: Make parallel jobs disabled by default 5949 5950Tranquillity Codes (1): 5951 5952- intel: Skip ioctls for querying device info when hardware is unsupported 5953 5954Valentine Burley (27): 5955 5956- tu: Promote VK_EXT_index_type_uint8 to KHR 5957- tu: Promote VK_EXT_load_store_op_none to KHR 5958- tu: Promote VK_EXT_line_rasterization to KHR 5959- docs: Update features.txt for anv, nvk and tu 5960- nvk: Enable VK_KHR_shader_subgroup_uniform_control_flow 5961- nvk: Advertise VK_KHR_vertex_attribute_divisor 5962- nvk: Reorder device features 5963- tu: Implement VK_KHR_map_memory2 5964- tu: Advertise VK_KHR_vertex_attribute_divisor 5965- tu: Reorder device features 5966- nvk: Fix missing implementation of creating images from swapchains 5967- nvk: Expose VK_EXT_display_control 5968- nvk: Expose VK_EXT_surface/swapchain_maintenance1 5969- nvk: Expose VK_EXT_swapchain_colorspace 5970- docs/features: Add missing VK_EXT_surface/swapchain_maintenance1 entry 5971- tu/rmv: Remove tu_rmv_DebugMarkerSetObjectNameEXT 5972- nvk: Trivially expose three VK_GOOGLE extensions 5973- tu: Expose VK_KHR_surface_protected_capabilities 5974- tu: Trivially expose three VK_GOOGLE extensions 5975- docs: Update features.txt for tu 5976- docs: Update features.txt and new_features.txt for anv and nvk 5977- nvk: Add support for version 2 of all descriptor binding commands 5978- tu: Move tu_BindImageMemory2() to tu_image.cc 5979- tu: Replace TU_HAS_SURFACE with TU_USE_WSI_PLATFORM 5980- tu: Fix missing implementation of creating images from swapchains 5981- tu: Replace TU_FROM_HANDLE with VK_FROM_HANDLE 5982- drm-shim: Stub syncobj reset ioctl 5983 5984Vasily Khoruzhick (4): 5985 5986- lima: ppir: always use vec4 for output register 5987- lima: ppir: use dummy program if FS has empty body 5988- lima: gpir: abort compilation if load_uniform instrinsic src isn't const 5989- lima: update expected CI failures 5990 5991Vignesh Raman (5): 5992 5993- ci: Add kmod 5994- ci: disable Collabora's farm due to maintenance 5995- Split debian-build-testing job 5996- ci: Implement support for replaying ANGLE restricted traces 5997- ci: handle missing dri libraries during listing 5998 5999Vinson Lee (2): 6000 6001- intel/disasm: Remove duplicate variable reg_file 6002- intel/clc: Fix file descriptor leak 6003 6004Visan, Tiberiu (1): 6005 6006- amd/vpelib: revert SRGB to 709 6007 6008Vlad Schiller (2): 6009 6010- pvr: Implement VK_EXT_memory_budget 6011- pvr: Implement VK_KHR_index_type_uint8 6012 6013Yifan Zhang (2): 6014 6015- amd: Add code to enable gfx11.5.1 6016- radv: initialize video decoder for GFX11.5.1 6017 6018Yiwei Zhang (105): 6019 6020- venus: avoid redundant layout transition for optimal internal layout 6021- venus: populate oom from ring submit alloc failures 6022- vulkan/wsi/wayland: fix returns and avoid leaks for failed swapchain 6023- venus: ensure object id is unique 6024- venus: fix pipeline layout lifetime 6025- venus: drop some redundant comment 6026- venus: fix pipeline derivatives 6027- venus: fix to respect the final pipeline layout 6028- venus: allow tls ring submission to utilize the entire ring shmem 6029- venus: default to enable GPL 6030- venus: force async pipeline create on threads creating descriptor pools 6031- venus: use obj handle instead of id in device memory report 6032- anv: refactor wsi_memory_allocate_info handling 6033- anv: optimize the implicit fencing support of external memory 6034- anv: extend implicit fencing support for case requiring implicit write 6035- vulkan/util: drop redundant code gen from vk_extensions_gen.py 6036- vulkan/runtime: refactor to use DETECT_OS_ANDROID instead of ANDROID 6037- v3dv: refactor to use DETECT_OS_ANDROID instead of ANDROID 6038- venus: refactor to use DETECT_OS_ANDROID instead of ANDROID 6039- hasvk: refactor to use DETECT_OS_ANDROID instead of ANDROID 6040- anv: refactor to use DETECT_OS_ANDROID instead of ANDROID 6041- radv: refactor to use DETECT_OS_ANDROID instead of ANDROID 6042- turnip: refactor to use DETECT_OS_ANDROID instead of ANDROID 6043- egl: refactor to use DETECT_OS_ANDROID instead of ANDROID 6044- gallium: refactor to use DETECT_OS_ANDROID 6045- util: refactor to use DETECT_OS_ANDROID 6046- meson: drop -DANDROID 6047- venus: update tracepoints to align with later optimizations 6048- venus: fix the cmd stride used for qfb recording 6049- venus: rewrite fence feedback interception to minimize batches 6050- venus: refactor to add vn_cached_storage 6051- venus: use vn_cached_storage for vn_queue_submission allocs 6052- venus: misc cleanups for queue submission 6053- venus: simplify feedback types tracking during submission 6054- venus: massive feedback renamings for consistency and clarity 6055- venus: refactor to add vn_queue_submission_setup_batch 6056- venus: simplify to drop the struct vn_feedback_cmds accessor 6057- venus: refactor semaphore feedback 6058- venus: add vn_set_temp_cmd helper to initialize feedback batch cmd 6059- venus: fix to ensure sfb cmds can get recycled 6060- venus: mandate a few venus capsets long required before 1.0 6061- venus: sync protocol for VK_KHR_fragment_shading_rate 6062- venus: add VK_KHR_fragment_shading_rate 6063- vulkan: fix runtime libraries' dep against generated headers 6064- venus: fix ffb batch prepare for a corner case and avoid a memcpy UB 6065- vulkan: remove unused wsi_common_entrypoints include and dep 6066- vulkan: properly ensure wsi_entrypoints header gen order 6067- vulkan: remove header files from lib source files 6068- vulkan: refactor the runtime header gen order dependency 6069- anv/hasvk: default image_read_without_format to true 6070- venus: qfb to track cmd handle directly 6071- venus: combine query record and reset 6072- venus: massive qfb renamings 6073- venus: minor cmd count related refactors 6074- venus: drop vn_get_temp_cmd_ptr 6075- venus: simplify vn_cmd_reset and apply more code sharing 6076- venus: refactor query record recycle 6077- venus: rewrite qfb vn_feedback helpers 6078- venus: refactor vn_queue_submission_add_query_feedback 6079- venus: add vn_queue_submission_get_resolved_query_records 6080- venus: optimize to further batch query records 6081- venus: roundtrip now belongs to ring 6082- venus: minor naming cleanups 6083- venus: ensure shmem is attached to renderer before use for guest vram 6084- venus: avoid excessive ring notifications 6085- venus: further reduce idle timeout from 5ms to 1ms 6086- venus: add enum vn_relax_reason 6087- venus: avoid constant busy wait for query result waiting 6088- venus: deprecate unused perf env vars 6089- venus: decorate cmd enqueue macro internals with compiler hints 6090- venus: add a more relaxed polling strategy 6091- venus: cleanup 2 TODOs from 1.3 support 6092- venus: remove obsolete TODOs 6093- venus: use STACK_ARRAY to simplify modifier query 6094- venus: use STACK_ARRAY to simplify BindBufferMemory2 6095- venus: use STACK_ARRAY to simplify BindImageMemory2 6096- venus: use STACK_ARRAY to simplify render pass creation 6097- venus: use STACK_ARRAY to simplify physical device enumeration 6098- venus: use STACK_ARRAY to simplify set layout creation 6099- venus: use STACK_ARRAY to simplify sync wait 6100- venus: rely on enum vn_descriptor_type for internal trackings 6101- venus: move async_set_allocation check outside helpers 6102- venus: set alloc to skip earlier for reserved and invalid bindings 6103- venus: optimize mutable state restore 6104- venus: misc set alloc and cleanup refactors 6105- venus: drop vn_should_sanitize_descriptor_set_writes 6106- venus: refactor descriptor set update and push 6107- venus: use STACK_ARRAY to simplify descriptor set update and push 6108- venus: use more relaxed profile for TLS ring seqno wait 6109- venus: avoid the redundant template entry 6110- venus: fix to drop an extra ; 6111- venus: simplify push descriptor update with template 6112- venus: optimize set update template data population 6113- venus: simplify need and ignore rules for desc image info 6114- venus: use STACK_ARRAY to simplify set template update and push 6115- venus: clean up legacy descriptor update template bits 6116- venus: fix swapchain image memory bind 6117- venus: fix VkDeviceGroupSubmitInfo::deviceMask for feedback cmds 6118- venus: avoid client allocators for ring internals 6119- venus: fix to destroy all pipeline handles on early error paths 6120- turnip: msm: clean up iova on error path 6121- turnip: msm: fix racy gem close for re-imported dma-buf 6122- turnip: virtio: fix error path in virtio_bo_init 6123- turnip: virtio: fix iova leak upon found already imported dmabuf 6124- turnip: virtio: fix racy gem close for re-imported dma-buf 6125 6126Yogesh Mohan Marimuthu (6): 6127 6128- winsys/amdgpu: sws instead of ws for amdgpu_screen_winsys 6129- winsys/amdgpu: rws instead of ws for radeon_winsys 6130- winsys/amdgpu: aws instead of ws for amdgpu_winsys 6131- winsys/amdgpu: use _destroy_locked() for failure to create winsys 6132- winsys/amdgpu: remove tab space 6133- winsys/amdgpu: add more comments for winsys create in header file 6134 6135Yonggang Luo (37): 6136 6137- util: Add function util_is_power_of_two_nonzero_uintptr and macro IS_POT_NONZERO 6138- asahi,panfrost: Use IS_POT_NONZERO to replace util_is_power_of_two_nonzero for different size 6139- treewide: Use util_is_power_of_two_nonzero{64|_uintptr} when needed 6140- svga: Cleanup duplicate ALIGN macro defines 6141- nouveau: Use align64 instead of ALIGN over input layer_size_B 6142- treewide: Use align64 instead of ALIGN for 64 bit value parameter 6143- util: Update ALIGN prototype to match align 6144- compiler/spirv: The spirv shader is binary, should write in binary mode 6145- compiler/spirv: There is not need unqualify const in function vtn_string_literal 6146- compiler/spirv: vtn_add_printf_string support for handling OpBitcast 6147- zink: Update zink-anv-tgl flakes 6148- treewide: Remove vulkan/runtime vulkan/util prefix in include path 6149- freedreno/vulkan: Use vk_dynamic_graphics_state_init instead of direct assignment 6150- vulkan/runtime: Mark vk_default_dynamic_graphics_state to be private 6151- Revert "meson/vulkan/util: allow venus to drop compiler deps" 6152- vulkan: allow building venus without libcompiler 6153- glx: Remove DEBUG code in xfont.c 6154- panfrost/shared: avoid use gallium helper in pan_minmax_cache.* 6155- panfrost/meson: remove redundant gallium include from meson files 6156- treewide: Replace the invalid usage #if DEBUG with #ifdef DEBUG 6157- util: Cleanup strtod.(h|c) by introduce _mesa_get_locale 6158- meson: Extract with_mesa_debug and with_mesa_ndebug for latter usage 6159- meson: Define MESA_DEBUG for latter usage 6160- treewide: Replace usage of macro DEBUG with MESA_DEBUG when possible 6161- meson: Remove the non-used -DDEBUG manually 6162- intel/meson: Remove redundant inc_gallium 6163- radv: Remove redundant inc_gallium 6164- radv: Rename src/amd/vulkan/vk_format.h to src/amd/vulkan/radv_formats.h 6165- vulkan: Move vk_format_is_alpha and vk_format_is_alpha_on_msb into vk_format.h from pvr 6166- pvr: inline and remove vk_format_get_channel_width 6167- pvr: Merge imagination/vulkan/vk_format.h into imagination/vulkan/pvr_formats.h 6168- pvr: Add pvr\_ prefix for vk_format_* functions in pvr_formats.h 6169- util: Fixes futex_wait on win32 6170- util: futex_wait use TIME_MONOTONIC on win32 for consistence with other platform 6171- util: Turn futex_wake parameter to int32_t for consistence across platforms 6172- broadcom/common: Now "util/box.h" is under src, so remove the FIXME 6173- nouveau: Fixes error: unused import: \`crate::nvh_classes_cl906f::\*` 6174 6175Yusuf Khan (4): 6176 6177- nvk: remove some dead code files 6178- nvk: fix valve segfault from setting a descriptor set from NULL 6179- crocus: fix potential null pointer dereference if transfer_mapping fails 6180- nouveau: Fix crash when destination or source screen fences are null 6181 6182Zack Rusin (1): 6183 6184- svga: Fix instanced draw detection 6185 6186Zan Dobersek (11): 6187 6188- freedreno: add fd_rd_output facilities for gzip-compressed RD dumps 6189- tu/msm: fix RD_CHIP_ID size used when dumping RD 6190- tu: tu_device should clean up its global bo 6191- vulkan/rmv: enable logging miscellaneous internal resources 6192- tu: add RMV support 6193- freedreno/fdl: avoid overflow in layout size computations 6194- tu: fix memory leaks in tu_shader 6195- fd: enable prefixing the RD output filename 6196- tu/autotune: use SAMPLE_COUNT_END_OFFSET when writing the ending sample count 6197- tu: RB_SAMPLE_COUNT_ADDR is also used on a7xx 6198- tu/query: improve CP_EVENT_WRITE7::ZPASS_DONE usage 6199 6200antonino (1): 6201 6202- zink: plug leak in \`zink_create_quads_emulation_gs` 6203 6204chyyran (1): 6205 6206- util/format/fxt1: include "u_format_pack.h" instead of "util/format/u_format_pack.h" 6207 6208daoxiang.gong (1): 6209 6210- zink - Fix for minLod and maxLod when mipmap filter is disabled 6211 6212duncan.hopkins (19): 6213 6214- compiler/clc: fix compiler issue on MacOS with st_mtim[e] in stat. 6215- egl: MacOS platform guard around pthread_condattr_setclock() 6216- egl: Added DRI3 code guards. 6217- egl: Changed EGLNativeDisplayType size check to make sure it is big enough instead of exactly the same size. 6218- gallium/dri: Switch xf86drm.h for util/libdrm.h to allow for the no-op shim to be used. 6219- gallium/dri: Added XCB dependency to frontends/dri/libdrm build. Fix header issues with xcb.h being used. 6220- util: Updated util/libdrm.h stubs with drmGetMagic() 6221- dri: guarded DRI code. 6222- glx: Switched DRI2 functions over to use Apple specific alternatives and extension name. 6223- meson: relaxed some meson restrictions on MacOS/Apple allowing for wider build support. 6224- apple: Extended Apple feature support using GLX_USE_APPLE. 6225- apple: Meson defines GLX_USE_APPLE to allow for Gallium drivers to work on MacOS. 6226- zink: Fixed header location and compiling issue with [[deprecated]] from newer MoltenVK versions. 6227- zink: use portability EXT on Apple. 6228- zink: stopped the use of VkFormatProperties3 if the reported API is less than 1.3 or VK_KHR_format_feature_flags2 not present. 6229- zink: removed \`MESA_PRIM_QUADS` from the supported \`PIPE_CAP_SUPPORTED_PRIM_MODES`. 6230- zink: Avoid issues when kopper tries using XCB WSI on Apple. 6231- zink/apple: added \`moltenvk-dir` search to allow MoltenVK to be sourced from brew. 6232- zink/apple: update docs to reflect the current status of Zink on macOS. 6233 6234nyanmisaka (2): 6235 6236- frontends/va: Report vendor and device ID through VADisplayPCIID 6237- radeonsi/uvd_enc: update to use correct padding size 6238 6239qbojj (1): 6240 6241- vulkan: Fix calculation of flags in vk_graphics_pipeline_state_fill 6242 6243thfrwn (1): 6244 6245- mesa: fix off-by-one for newblock allocation in dlist_alloc 6246