Mesa 24.1.0 Release Notes / 2024-05-22
======================================

Mesa 24.1.0 is a new development release. People who are concerned
with stability and reliability should stick with a previous release or
wait for Mesa 24.1.1.

Mesa 24.1.0 implements the OpenGL 4.6 API, but the version reported by
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
Some drivers don't support all the features required in OpenGL 4.6. OpenGL
4.6 is **only** available if requested at context creation.
Compatibility contexts may report a lower version depending on each driver.

Mesa 24.1.0 implements the Vulkan 1.3 API, but the version reported by
the apiVersion property of the VkPhysicalDeviceProperties struct
depends on the particular driver being used.
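
The apiVersion value is a packed 32-bit integer, so it has to be decoded
before it can be compared against 1.3. As an illustration only (this is not
Mesa code), the Python sketch below unpacks it using the bit layout that the
Vulkan specification defines for ``VK_MAKE_API_VERSION``. The function names
``make_vk_api_version`` and ``decode_vk_api_version`` are hypothetical helpers
introduced for this example.

```python
def make_vk_api_version(variant: int, major: int, minor: int, patch: int) -> int:
    """Pack version fields the way VK_MAKE_API_VERSION does (hypothetical helper)."""
    return (variant << 29) | (major << 22) | (minor << 12) | patch


def decode_vk_api_version(api_version: int) -> tuple:
    """Split a packed apiVersion into (variant, major, minor, patch)."""
    variant = (api_version >> 29) & 0x7   # bits 31..29
    major = (api_version >> 22) & 0x7F    # bits 28..22
    minor = (api_version >> 12) & 0x3FF   # bits 21..12
    patch = api_version & 0xFFF           # bits 11..0
    return variant, major, minor, patch


def supports_at_least(api_version: int, major: int, minor: int) -> bool:
    """True if the reported version is at least major.minor (patch ignored)."""
    _, got_major, got_minor, _ = decode_vk_api_version(api_version)
    return (got_major, got_minor) >= (major, minor)
```

An application would pass the apiVersion it read from
VkPhysicalDeviceProperties to ``supports_at_least(value, 1, 3)`` and fall back
to an older code path when the driver reports less than Vulkan 1.3.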

SHA256 checksum
---------------

::

    b7eac8c79244806b1c276eeeacc329e4a5b31a370804c4b0c7cd16837783f78b  mesa-24.1.0.tar.xz

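The checksum can be compared against a downloaded tarball before building.
A minimal sketch, assuming the file has been saved locally; the helper names
``sha256_of_file`` and ``matches_expected`` are assumptions made for this
example and are not part of the release.

```python
import hashlib

# Checksum published in these release notes for mesa-24.1.0.tar.xz.
EXPECTED_SHA256 = "b7eac8c79244806b1c276eeeacc329e4a5b31a370804c4b0c7cd16837783f78b"


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large tarballs are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Compare the file's digest with the expected hex string, ignoring case."""
    return sha256_of_file(path) == expected.strip().lower()
```

Calling ``matches_expected("mesa-24.1.0.tar.xz")`` should return ``True`` only
if the local file is byte-for-byte identical to the released tarball.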

New features
------------

- VK_EXT_map_memory_placed on RADV, ANV and NVK
- VK_KHR_shader_subgroup_rotate on RADV, ANV and NVK
- VK_KHR_load_store_op_none on RADV, ANV, NVK and Turnip
- VK_KHR_line_rasterization on RADV, ANV, NVK and Turnip
- VK_KHR_index_type_uint8 on RADV, ANV, NVK and Turnip
- VK_KHR_shader_expect_assume on all Vulkan drivers
- VK_KHR_shader_maximal_reconvergence on RADV, ANV and NVK
- VK_KHR_shader_quad_control on RADV
- OpenGL 4.6 on Asahi
- OpenGL ES 3.2 on Asahi
- Mali G610 and G310 on Panfrost
- Mali T600 on Panfrost
- VK_KHR_shader_subgroup_uniform_control_flow on NVK
- alphaToOne/extendedDynamicState3AlphaToOneEnable on RADV
- VK_EXT_device_address_binding_report on RADV
- VK_EXT_external_memory_dma_buf for lavapipe
- VK_EXT_queue_family_foreign for lavapipe
- VK_EXT_shader_object on RADV
- VK_EXT_nested_command_buffer on NVK and RADV
- VK_EXT_queue_family_foreign on NVK
- VK_EXT_image_drm_format_modifier on NVK


Bug fixes
---------

- anv: unbounded shader cache
- radv: Crash due to nir validation fail in Enshrouded
- bisected: turnip: deqp regressions
- android: sRGB configs no longer exist after !27709
- [24.1-rc4] fatal error: intel/dev/intel_wa.h: No such file or directory
- vcn: rewinding attached video in Totem cause [mmhub] page fault
- When using amd gpu deinterlace, tv bt709 properties mapping to 2 chroma
- ci: switch from CI_JOB_JWT to id_tokens
- VCN decoding freezes the whole system
- [RDNA2 [AV1] [VAAPI] hw decoding glitches in Thorium 123.0.6312.133 after https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28960
- nvk: Tracker issue for gamescope support
- nvk: Implement VK_EXT_image_drm_format_modifier
- WSI: Support VK_IMAGE_ASPECT_MEMORY_PLANE_i_BIT_EXT for DRM Modifiers in Vulkan
- [Regression][Bisected] EGL/Wayland: QT applications terminated by SIGSEGV (Address boundary error) when using dGPU
- radv: Enshrouded GPU hang on RX 6800
- NVK Zink: Wrong color in Unigine Valley benchmark
- panforst: T604 issue with using u32 for flat varyings
- [anv] FINISHME: support YUV colorspace with DRM format modifiers
- gen9/11 test became flaky: piglit.spec.!opengl 1_4.blendminmax
- 24.0.6: build fails
- mesa 24 intel A770 KOTOR black shadow smoke scenes
- [bisected][regression] kitty fails to start due to \`glfwWindowHint(GLFW_SRGB_CAPABLE,true)`
- r600: bisected   5eb0136a3c561 breaks a number of piglits
- Graphical glitches in RPCS3 after updating Vulkan Intel drivers
- [R600] OpenGL and VDPAU regression in Mesa 23.3.0 - some bitmaps get distorted.
- VAAPI radeonsi: VBAQ broken with HEVC
- radv/video: 10-bit support
- radv: vkCmdWaitEvents2 is broken
- Zink: enabled extensions and features may not match
- glRasterPos: "Assertion \`prog->base_serialized_nir' failed." if a shader is loaded from the shader cache
- radv: mesa-9999/src/amd/vulkan/radv_image_view.c:147: radv_set_mutable_tex_desc_fields: Assertion \`(plane->surface.u.gfx9.surf_pitch * plane->surface.bpe) % 256 == 0' failed.
- ACO doesn't hide lds_param_load latencies
- ACO doesn't form a VMEM clause for image stores in one case on GFX11
- r600: Valheim hangs CAYMAN gpu (regression/bisected)
- r600: Artifacts in Oxygen Not Included around air ducts and pipes (regression, bisected)
- radv: UMR wave parsing format is outdated
- radv: GetImageMemoryRequirements2 does not look at VkImagePlaneMemoryRequirementsInfo
- RADV, regression : Objects randomly appear/disappear on Unreal Engine 4 titles using D3D12 backend on Polaris
- mesa 23.1.0-rc3 flickering textures/lighting in Unreal 4 games Polaris10
- ACO tests SIGSEGV in debian-vulkan job with LTO enabled
- radv: Address binding report for images is incorrect.
- blorp: avoid dirtying push constants in 3D
- anv: flaky vkd3d-proton test_buffer_feedback_instructions_sm51
- FTBFS: commit aaccc25a4dd9ccfc134e51a7e81168334d63a909 broke mesa snapshot build
- d3d12_screen.cpp:60:10: fatal error: ShlObj.h: No such file or directory
- r300: crash when compiling some GSK shaders
- anv: vkd3d-proton test_stress_suballocation failure
- d3d12: Zwift renders with bad textures/lighting
- nir_opt_remove_phis breaks divergence analysis
- intel: Require 64KB alignment when using CCS and multiple engines
- NVK: Misrendering with Civilization 6
- radv: RDR2 might need zerovram
- intel-clc build failure, i think?
- Issues rendering gtk4 window decorations on v3d on Fedora-40/mesa-24.0
- clc: Failure when linking with llvm+clang 18.1 (-Dshared-llvm=disabled)
- LLVM-18 build issue
- vulkan/wsi/x11: VK_SUBOPTIMAL_KHR is never reported by the swapchain
- Broken vaapi encoding on Radeon RX 6900XT
- RUSTICL creating a shared reference to mutable static is discouraged and will become a hard error
- anv: GPU hang on Assassin's Creed Valhalla while running benchmark
- nvk: dota 2 crashes after ~5 seconds in game
- dzn: conflicting defines with DirectX headers 1.613.0
- VAAPI: Incorrect HEVC block size reported with radeonsi
- radv: WWE 2K24 has very quirky DCC issues on RDNA2
- anv: Dirt 5 crashes at tryCreatingPipelineStateFromCache
- freedreno: remove headergen2
- freedreno: remove headergen2
- vulkan/wsi: crash in dEQP-VK.wsi.wayland.swapchain.simulate_oom.min_image_count
- Document that Zink on MoltenVK is not expected to work
- KiCAD 3D Viewer - rounded pads rendered incorrectly (texture mapping or stencil test error)
- OpenSCAD rendering incorrect and inconsistent on radeonsi
- intel/fs: regression on MTL with 64bit values in UBO
- ci: split debian-build-testing?
- [freedreno] Black background on SuperTux Kart with postmarketOS and Oneplus 6T
- [radv] Half-Life Alyx renders solid black for reflective surfaces
- iris: iris_resource_get_handle returns wrong modifier
- [RX 7900 XTX] Helldivers 2 cause GPU reset
- radeon: Crash in radeon_bo_can_reclaim_slab
- regression/bisected: commit 4e3f3c10e14d8778781c81f39ced659b5ec2f148 broke mesa snapshot build
- RV530 renders improperly at non 4:3 resolutions.
- anv: new cooperative matrix failures with CTS 1.3.8.0
- nvk: Missing implementation of VkImageSwapchainCreateInfoKHR and VkBindImageMemorySwapchainInfoKHR
- mesa > 23.1.9 [opencl,video_cards_nouveau]  fails to build due to missing symbol vl_video_buffer_is_format_supported
- intel/meson: Make intel_stub_gpu work with \`meson devenv`
- Follow-up from "iris: Fix plane indexing and handling on image import"
- nvk,nak: Implement shaderStorageImageMultisample
- nvk,nak: Implement VK_KHR_shader_subgroup_uniform_control_flow
- \`[gfxhub0] no-retry page fault` triggered by \`AMD_TEST=testdmaperf` on gfx90c APU
- nvk: glcts hangs
- v3d: Line rendering broken when smoothing is enabled
- PowerVR reports minMemoryMapAlignment of 64
- RADV: GPU crash when setting 'RADV_DEBUG=allbos'
- [intel] mesa ftbfs with time_t64
- d3d12_resource.cpp:307:49: error: no matching function for call to 'ID3D12Heap::GetDesc()'
- radv regression between a337a0c8072d0be487e43c2b7b132e003c6d5a5e and 83f741124b66818053b6b1b2f7e42f5217a27004
- [build failure] [armhf] - error: #error "_TIME_BITS=64 is allowed only with _FILE_OFFSET_BITS=64"
- R400 should have native support for sin/cos in VS
- [radv] Crash when VkGraphicsPipelineCreateInfo::flags = ~0u
- intel: all workarounds disabled with ATS skus
- vulkan: GPL now broken
- Gen4 assertion \`force_writemask_all' failed.
- src/gallium/auxiliary/rtasm/rtasm_x86sse.c:198:10: runtime error: store to misaligned address 0x7fabba0cd011 for type 'int', which requires 4 byte alignment
- [radv] Holographic projection texture glitch in Rage 2
- RustiCL: Callbacks are not called upon errors
- MTL: regressions in vulkancts due to BO CCS allocations
- zink: spec\@ext_external_objects\@vk-image-overwrite fail
- vaapi: radeonsi: surface_region.{x,y} is not honored in processing when source is RGB
- nvk: Implement VK_EXT_shader_object
- nvk: Implement VK_EXT_graphics_pipeline_library
- turnip: UBWC disabled for MSAA
- KHR-Single-GL46.arrays_of_arrays_gl.AtomicUsage fails on MTL
- GTF-GL46.gtf42.GL3Tests.texture_storage.texture_storage_texture_as_framebuffer_attachment fails on MTL
- nvk: Implement VK_KHR_maintenance5
- [intel][anv][build][regression] - genX_grl.h:27:10: fatal error: grl/grl_cl_kernel.h: No such file or directory
- RX 6600 VDPAU not recognizing HEVC_MAIN_10 correctly
- Running an app on another AMD GPU (offload, DRI_PRIME) produces corrupted frames on Wayland.
- regression in radeonsi since 9aa205668bcbf701f8f694551c284cd8e4cc17a3 (crashes in vbo_save_playback_vertex_list)
- clang/libclc related Mesa build failures
- Ninja Install Error
- anv: add a dri config to enable implicit fencing on external memory interop
- VDPAU declares a texture as "immutable" without also setting its ImmutableLevels attribute.
- Segfault in glsl_to_nir.cpp nir_visitor::visit when assigning interface block
- [rusticl]WARNING: Project targets '>= 1.1.0' but uses feature deprecated since '1.0.0': module rust has been stabilized. drop "unstable-" prefix from the module name
- RX6600 hardware HEVC video decode fails for VDPAU but works for VA-API. (Can lock up GPU!)
- Rusticl panics when getting program build logs using opencl.hpp
- ue5 game issues lighting Rog Ally 7080u (z1e)
- Missing textures in RoboCop: Rogue City with mesh shaders enabled
- Intel/anv: Allow pre-compiled shader caches to be reused across multiple devices
- radv: Multiview PSO forgets to export layer in some cases.
- -Dintel-rt=enabled fails to build on 32-bit
- MTL: regressions in vulkancts due to BO CCS allocations
- intel: build failures
- regression/bisected commit 4de62731f4db56360026cbb6a3b8566f86f22466 broke HW acceleration in the Google Chrome
- i386 intel build failure: meson.build:45:6: ERROR: Unknown variable "prog_intel_clc".
- rusticl: clEnqueueFillBuffer (among others) fails on buffers created from GL object.
- MTL raytracing regression
- [ANV/DG2] Unexpectedly slow replay of RenderDoc frame capture of Resident Evil 4 Remake
- zink: flickering artifacts in Selaco
- [ADL] gpu hang on dEQP-VK.synchronization.internally_synchronized_objects.pipeline_cache_graphics
- Turnip spam on non-turnip devices
- Intermittent compiler failures when building valhall tests
- panfrost: graphical artifacts on T604 (T600)
- Dying Light native artifacts on Intel A770
- r300: Amnesia: The Dark Descent heavy corruption
- [ANV/DG2] Age of Empires IV fullscreen "banding" artefacts
- [mtl][anv] dEQP-VK.pipeline.monolithic.depth.format.d32_sfloat.compare_ops.* failures when run multithreaded
- [mtl][anv] flaky tests in pipeline.monolithic.extended_dynamic_state*stencil_state_face* series
- Broken colors/dual-source blending on PinePhone (Pro) since 23.1.0
- r600/sfn: "Indexed locks of kcache banks 14 and 15 are ignored" in the ALU clause documentation
- turnip: Logarithmic-time subgroup reductions using brcst.active and getlast.w8 instructions on a6xx gen4+
- GTF-GL46.gtf42.GL3Tests.texture_storage.texture_storage_compressed_texture_data regression
- microsoft/compiler: Missing globally-coherent logic
- Regression between 23.0.4 and 23.1.0: texture glitches in osgEarth
- [Broadcom]  Warning when runnin every OpenGL game on Vulkan using ZINK
- radeonsi unsynchronized flips/tearing with KMS DRM rendering on 780M
- radeonsi has an unchecked hard dependency on libelf
- DR crashes with mesa 24 and rusticl (radeonsi)
- Piglit tests assert on gen9 with zink
- vlc crashes when playing 1920x1080 video with Radeon RX6600 hardware acceleration and deinterlacing enabled.
- [radeonsi] Regression: graphical artifacting on water texture in OpenGOAL
- Assertion when creating dmabuf-compatible VkImage on Tigerlake
- Palworld fails to launch on Intel Arc unless "force_vk_vendor" is set to "-1".
- panfrost: implement line smoothing
- r300: backend DCE fails in piglit glsl-vs-copy-propagation-1.shader_test
- [AMDGPU RDNA3] Antialiasing is broken in Blender
- MTL: vulkan cooperative matrix tests gpu hang on MTL
- nvk: Implement VK_KHR_zero_initialize_workgroup_memory
- Assassin's Creed Odyssey wrong colors on Arc A770
- VAAPI: EFC on VCN2 produces broken H264 video and crashes the HEVC encoder
- etnaviv, modesetting, and glxgears
- The Finals fails to launch with DX12 on Intel Arc unless "force_vk_vendor" is set to -1.
- nvk: \`VK_KHR_zero_initialize_workgroup_memory` and \`VK_KHR_shader_subgroup_extended_types` not marked as complete in features.txt
- nvk: Implement variableMultisampleRate
- VA-API CI tests freeze
- radv: games render with garbage output on RX5600M through PRIME with DCC
- Warning when use ALIGN over uint64_t and uintptr_t


Changes
-------

Adrian Perez de Castro (1):

- Revert "egl/wayland: Remove EGL_WL_create_wayland_buffer_from_image"

Agate, Jesse (6):

- amd/vpelib: Studio Range Handling
- amd/vpelib: White Screen Fix
- amd/vpelib: VPT Failing Test Cases
- amd/vpelib: VPE integration for HLG
- amd/vpelib: Add PQ Norm to VPE interface
- amd/vpelib: Refactor norm factor logic

Alan Liu (4):

- radeonsi/vpe: Add environment variable to set embbuf number
- radeonsi/vpe: Don't map and unmap emb_buffer every time in process_frame
- amd/vpelib: remove unused header file
- radeonsi/vpe: support vpe 1.1

Alejandro Piñeiro (3):

- broadcom/compiler: fix coverity warning (unitialized pointer read)
- v3dv/bo: use mtx_lock/unlock on cache_init too
- v3dv: expose VK_EXT_depth_clip_enable

Alexandre Marquet (1):

- pan/mdg: quirk to disable auto32

Alyssa Rosenzweig (328):

- nir/lower_ssbo: rewrite
- nir/lower_blend: return progress
- nir/lower_io_to_temporaries: return prog
- nir/lower_clip_cull_distance_arrays: return prog
- nir: return prog from drawpixels
- nir/lower_bitmap: return prog
- nir/lower_alpha_test: rewrite with intrinsics_pass
- nir/lower_point_size_mov: return prog
- nir/lower_passthrough_edgeflags: return progress
- nir/lower_io_arrays_to_elements: return prog
- nir/lower_flatshade: fix metadata
- glsl: return progress in point size linking
- glsl: don't use NIR_PASS_V
- glsl: fix metadata in gl_nir_zero_initialize_clip_distance
- mesa/st: return progress in st_nir_lower_wpos_ytransform
- mesa/st: use instructions_pass for plane lowering
- mesa/st: return progress lowering builtins
- mesa/st: don't use NIR_PASS_V
- agx: rm deadcode
- agx: ingest undefs in the backend
- agx: stop lowering in opt loop
- agx: only lower vars to ssa once
- agx: fix metadata in layer lowering
- agx: unset silly nir opts
- agx: return progress from passes
- asahi: return progress from passes
- asahi: avoid silly internal NIR_PASS in gs lowering
- asahi: don't use NIR_PASS for removing entrypoints
- asahi: don't use NIR_PASS_V
- nir/passthrough_gs: plug leak
- compiler,gallium: move u_decomposed_prim to common
- nir/passthrough_gs: flesh out gs_in_prim
- compiler: add a vs.tes_agx bit
- asahi: add more uapi stubs
- asahi: gut macOS related code
- asahi: lower poly stipple
- asahi: Implement skeleton for tessellation
- asahi: fix metadata for images with VS lowered to GS
- asahi: implement VBO robustness
- asahi: implement reset queries
- asahi: enable robustness
- asahi: fix unbound ssbos
- asahi: optimize more when linking libagx
- asahi: decode uniform_high records
- agx: implement load_subgroup_invocation
- agx: lower more subgroups
- agx: introduce ballot pseudo
- agx: fuse ballot+cmp
- nir: add active_subgroup_invocation_agx sysval
- agx: implement active_subgroup_invocation_agx
- agx: optimize first_invocation
- agx: optimize vote_eq
- asahi: fix prim restart unrolling with indirects
- asahi: delete bogus assertion
- asahi: plug passthrough tcs leak
- asahi: rework meta shader infra
- asahi: plug geometry shader leaks
- asahi: plug pre-gs leak
- asahi: plug early_serialized_nir leak
- asahi: plug so target leak
- asahi: plug glsl type leak
- asahi: plug geometry heap leak
- asahi: fix UB in qbo's
- agx: add some more bitop tests
- Revert "asahi: don't canonicalize nans/flush denorms when copying"
- asahi: sync with query mismatches
- asahi: enable tcs caching
- asahi: don't sync for uninitialized buffer
- asahi: fix valid buffer tracking for SSBO/image/XFB
- asahi: handle read-only SSBOs
- asahi: honour discard_whole | persistent
- agx: only run early tests if needed
- docs/asahi: fix strided linear note
- ail: add tests for linear<-->twiddled copies
- ail: port tiling routines to c++
- ail: use template for tiled memcpy
- agx: don't inline imms into stack_store
- agx: optimize b2x(inot)
- agx: reassociate bcsel with ior/iand
- asahi: implement pipeline stats as a checkbox
- asahi: log geometry shaders separate from xfb
- asahi: don't use util_resource_size
- asahi: fix vbo dirty track
- asahi: force inline ppp update logic
- asahi: skip set if tested
- asahi: rm dead
- asahi: track bit count, not word count
- asahi: enable compblit behind dbg flag
- asahi: allow disk cache with compblit
- asahi: assert invariant
- asahi: drop silly else
- asahi: rewrite queries
- asahi: split up stage uniform upload
- asahi: dirty track stage uniforms
- asahi: collapse stage uniform upload
- asahi: optimize "no changes" case
- asahi: optimize no changes descriptor case
- asahi: move some code into dirty tracking
- asahi: drop any_draws
- asahi: fix instance count with indirect draw
- asahi: collapse if
- asahi: hoist xfb code
- asahi: hoist layer id code
- asahi: rm blank
- asahi: track batches with incoherent writes
- asahi: optimize memory_barrier
- asahi,agx: use intrinsics pass
- agx: clamp register file based on workgroup size
- agx: improve scratch size accounting
- asahi: add has_scratch to shader key
- agx: set nr_preamble_gprs for preamble scratch
- asahi: allocate preamble scratch
- agx: allow 16-bit immediate on stack load/store
- agx: print register vectors
- agx: introduce "memory variables"
- agx: add spill/fill lowering pass
- agx: unit test memory parallel copies
- agx: unit test spill/fill lowering
- agx: add parallel copy printing
- agx: add =spill debug option
- asahi: bump max threads per wg
- asahi: drop xfb hack
- asahi: allow vertex/geom/tess side effects
- agx: fix buffer overflow with varying slots
- asahi,agx: use hw clip distance
- asahi: fix dirty tracking issue
- asahi: rip out existing MDI+GS implementation
- libagx: fix buggy align macro
- asahi: make GS flatshade_first more dynamic
- libagx: use native static_assert on host
- libagx: use real PACKED macro
- libagx: static assert some sizes
- libagx: generalize vertex_id_for_topology
- asahi: simplify IA mode handling
- asahi: add shader_info::outputs for gs lower
- asahi: add geometry parameters for separable GS
- asahi: rework shader stage handling a bit
- asahi: separate GS from VS
- asahi: rm arrayed output lowering
- asahi: allow bindful GS textures
- asahi: shrink GS key
- asahi: infer stage in descriptor update
- asahi: be a bit more methodical with shader stages
- nir: rm load_vert_id_in_prim_agx
- asahi: allow lowering bindings after lowering textures
- asahi: collapse indirection with GS
- asahi: support stage override in sysval lower
- asahi: set gs_grid[0] even for direct draws
- asahi: use load_instance_id in gs lowering
- asahi: fix vertex out size calc
- asahi: invert geometry shaders
- asahi: implement GS disk caching
- asahi: rm dead
- asahi: simplify expressions involving xfb
- asahi: avoid silly psiz writes even with gs
- asahi: eliminate tri fan %
- asahi: make provoking vertex dynamic
- asahi: make gs topology dynamic
- asahi: support GS in shaderdb
- asahi: always support ARB_clip_control
- asahi: make clip_halfz dynamic
- asahi: rm ia key
- agx: remove discard -> zs_emit lower
- agx: rm dead sample count argument
- agx: call agx_nir_lower_sample_mask earlier
- agx: rm unused backend nr_samples
- agx: rm unused opt_ixor_bcsel
- agx: sink wait_pix
- asahi: Implement ARB_texture_barrier by decompression
- asahi: quelch gcc warning
- agx: rm ridiculous dependency
- agx: decouple compiler from genxml
- agx: use #pragma once
- asahi/lib: use #pragma once
- ail: use #pragma once
- asahi: use #pragma once
- asahi: clean up format table renderability
- asahi: split out genxml/ directory
- agx: move SSBO lowering
- agx: call texture lowering in the driver
- agx: move texture lowering into lib
- agx: decouple from libagx
- asahi: reorder compiler before clc
- asahi: precompile helper program
- agx: add "is helper program?" key bit
- asahi: advertise GL4.6 and ES3.2
- docs: update for GL4.6 and ES3.2 on asahi
- vulkan: add vk_index_type_to_restart helper
- tu: use vk_index_to_restart
- anv,hasvk: use vk_index_to_restart
- util/hash_table: add u64 foreach macro
- util/ralloc: add memdup
- treewide: use ralloc_memdup
- panfrost: Add a library to build CSF command streams
- panfrost: Add support for the CSF job frontend
- nir/opt_shrink_vectors: hoist alu helpers
- nir/opt_shrink_vectors: shrink some intrinsics from start
- util: add _mesa_hash_table_u64_num_entries
- nir/print: do not print empty lists on intrinsics
- util/hash_table: add DERIVE macro
- panfrost: derive ht
- asahi: derive ht
- nvk: derive ht
- radeonsi: derive ht
- v3d: derive ht
- glsl_types: derive ht
- asahi: bump maximum samplers for Blender
- asahi: allow more samplers for shaderdb
- asahi: move more code out of agx_preprocess_nir
- asahi/lib: fix overread with stateful
- asahi: fix overread with samplers
- asahi: clarify how unroll index buffers are offsetted
- asahi: zero more in the unroll path
- asahi: fix unit mismatch with unroll path
- asahi: fix stage accounting for meta compute shaders
- asahi: export build_meta_shader
- asahi: add flush_query_writers helper
- asahi: add helper to classify queries
- asahi: accelerate QBO copies
- asahi: fix depth bias interactions with points/lines
- asahi: implement CDM stream linking for GS
- asahi: be robust against tess batch changes
- asahi: stop merging VS and TCS
- asahi: drop TCS key
- asahi: drop asahi_vs_next_stage
- libagx: improve static assert message
- asahi/clc: fix mem leaks
- agx/opt_cse: alloc less
- agx: fix stack smash with spilling
- agx: fix allocating phi sources past the reg file
- agx: add more asserts
- agx: add num_successors helper
- agx: fix 16-bit mem swaps
- agx: scalarize vector phis
- agx: allow vector phis to pass validation
- agx: assert phis don't have .kill set
- agx: fix bogus implicit cast with 2d msaa arrays
- agx: sink harder
- agx: implement live range splits of phis
- agx: don't leak shuffle copies
- agx: add more iterator macros
- agx: add temp_like helper
- agx: add before_function cursor
- agx: add limit for max sources per non-phi
- agx: coalesce phi webs
- agx: try to coalesce moves
- agx: drop scratch regs for spilling
- agx: validate phi sources for consistency
- agx: add SSA reindexing pass
- agx: add SSA repair pass
- agx: add Braun-Hack spiller pass
- agx: switch to Braun-Hack spiller
- agx: use dense reg_to_ssa map
- agx: make add_successor public
- agx: add helpers for multiblock unit tests
- agx: add tests for SSA repair
- agx: move spill/fills accounting to shaderdb
- agx: enable indirect temps
- agx: generalize remat code
- agx: implement get_sr remat
- asahi: use less bindless samplers
- agx: add more shaderdb stats
- agx: fix lowering uniforms with abs/neg
- agx: restrict high uniforms with textures
- agx: extract "accepts uniform?" ISA query
- agx: model 64-bit uniform restriction on ALU
- agx: extract agx_is_float_src
- agx: promote constants to uniforms
- agx: compact 32-bit constants
- agx: test constant compaction
- agx: implement load_subgroup_id
- libagx: polyfill glsl ballot()
- libagx: accelerate restart unroll across a subgroup
- libagx: accelerate prim restart unroll across wg
- libagx: deal with silly NIR
- libagx: parallelize prefix sum over 1024 threads
- agx: use funop short form
- agx: split select opt into its own pass
- agx: vectorize uniform_store
- agx: start a crude cycle model
- agx/opt_preamble: improve preamble cost function
- agx/opt_preamble: restrain ourselves
- agx/opt_preamble: preamble cycle estimates
- agx/opt_preamble: improve rewrite cost est
- docs/asahi: document UVS
- nir: add offset to load_coefficients_agx
- nir: add intrinsics for lowered VS outputs
- asahi: add agx_push_packed
- asahi: drop =varyings debug
- asahi: extract agx_cf_binding
- agx: explicitly assign coeff registers
- agx: pack indirect CF
- agx: handle indirect varyings
- asahi: advertise indirect fs inputs
- agx: rm unnecessary iter hack
- agx: pack indirect st_vary
- agx: inline imm into st_vary
- asahi: rewrite varying linking
- asahi: drop now-empty base key
- asahi: make point size replacement dynamic
- asahi: stop using GLSL indirect lowering
- agx/lower_vbo: dce as we go
- asahi: drop dead linked_so code
- asahi: use ht derive more
- asahi: fix _packed USC structs
- asahi: delete layer id code
- asahi: don't set writes_memory for tib spilling
- agx: optimize out wait_pix in some cases
- agx: inline sampler states
- agx: always reserve sampler #0 for txf
- asahi: fix bit sizes in point sprite lower
- nir: add samples_log2_agx sysval
- nir: add export/load_exported_agx intrinsics
- agx: wire up samples_log2 sr
- agx: generalize preloaded cache
- agx: implement exports
- agx: document non-monolithic ABI
- asahi: add agx_usc_push_packed helper
- asahi: constify agx_build_tilebuffer_layout
- asahi: don't allocate tib space for gaps
- nir: add intrinsics for non-monolithic agx shaders
- agx: drop shader stage assertion
- asahi: static assert blend key size
- agx: add agx_shader_part data structure
- agx: add main_size info
- asahi: add fast linker
- asahi/clc: stop padding binaries
- asahi: switch to VS/FS prolog/epilog system

Amber (3):

- tu: wideLines support for a7xx.
- tu: Add MESA_VK_DYNAMIC_RS_LINE_WIDTH to tu_rast_state.
- tu: re-emit vertex buffer on MESA_VK_DYNAMIC_VI_BINDINGS_VALID dirty.

Amber Harmonia (1):

- freedreno/common: Fix register stomper ranges for A7XX

Andres Calderon Jaramillo (1):

- radeonsi: get enc/dec caps from kernel only on amdgpu

Antoine Coutant (2):

- clc: retrieve libclang path at runtime.
- drisw: fix build without dri3

Anton Bambura (2):

- panfrost: Enable Mali-T600
- docs/panfrost: Document Mali-T600 support

Antonio Gomes (6):

- mesa/st: Skip querying PCI values in interop_query_device_info if version >= 4
- rusticl/gl: Bump mesa_glinterop_device_info to version 4
- gallium: Add new PIPE_CAP_CL_GL_SHARING
- iris: Set PIPE_CAP_CL_GL_SHARING to true
- radeonsi: Set PIPE_CAP_CL_GL_SHARING to true
- rusticl/device: Verify for PIPE_CAP_CL_GL_SHARING when enabling gl_sharing

Arthur Huillet (1):

- nvk: remove useless MME scratch 26 usage

Asahi Lina (17):

- asahi: libagx: introduce AGX_STATIC_ASSERT
- agx: Rename some SRs
- nir: Add AGX-specific helper opcodes
- agx: Hook up AGX helper NIR intrinsics
- agx: Hook up helper intrinsics into CL
- agx: Add scaffolding to build the helper shader at device init
- agx: compiler: Add fence_helper_exit_agx barrier
- agx: compiler: Export scratch size to the driver
- agx: compiler: Enable stack_adjust
- asahi: libagx: Move PACKED and GLOBAL macros to libagx.h
- asahi: cmdbuf: Fix scratch bucket offset/size
- asahi: Implement scratch allocation
- asahi: scratch: Add feature to debug core IDs
- asahi: Hook up scratch
- asahi: Allocate scratch for shaders
- asahi: Enable scratch debugging
- asahi: batch: Trace before waiting for syncobj

Assadian, Navid (2):

- amd/vpelib: Apply inverse gamut remap to background
- amd/vpelib: Use uint64 for buffer size

Axel Davy (5):

- frontend/nine: Fix ff ps key
- frontend/nine: Fix programmable vs check
- frontend/nine: Fix missing light flag check
- frontend/nine: Fix destruction race
- frontend/nine: Reset should EndScene

Bas Nieuwenhuizen (10):

- util/disk_cache: Add marker on cache usage.
- radv: Remove ray_launch_size_addr_amd system value.
- radv: Add winsys argument to buffer map/unmap.
- radv/winsys: Use radv_buffer_map wrapper.
- radv/amdgpu: Use mmap directly.
- radv: Support for mapping a buffer at a fixed address.
- radv: Implement reserving the VA range on unmap.
- radv: Expose VK_EXT_map_memory_placed.
- radv: Fix differing aspect masks for multiplane image copies.
- radv: Use zerovram for Enshrouded.

Benjamin Lee (14):

- nak: support predicate swaps on SM50
- nak: support predicate sel on SM50
- nak: fix frnd on SM50
- nak: implement FSWZADD on SM50
- nak: implement FLO on SM50
- nak: fix iabs on SM50 with an explicit i2i op
- nak: implement rro op on SM50
- nak: use rro when emitting mufu on SM50
- nak: implement kill op on SM50
- nak: implement cs2r op on SM50
- nak: handle nop ops from NAK IR on SM50
- nak: fix lod mode encoding for SM50 tld op
- nak: fix tex offset encoding on SM50
- nvk: disable shaderResourceMinLod on pre-sm70

Benjamin Tissoires (3):

- CI: add mr-label-maker.yml config
- .mr-label-maker.yml: fix wrong label
- CI: add a test for checking the validity of .mr-label-maker.yml

Biju Das (1):

- gallium: Add Renesas rzg2l-du DRM entry point

Blisto (1):

- driconf: set vk_x11_strict_image_count for Atlas Fallen Vulkan

Bob Beckett (2):

- panfrost: Add an entry for panthor in the renderonly_drivers[] array
- panfrost: Add the gallium glue to get panfrost loaded when panthor is detected

717Boris Brezillon (193):
718
719- panvk: Fix tracing
720- panvk: Fix access to unitialized panvk_pipeline_layout::num_sets field
721- panfrost: Kill unused forward declarations in pan_texture.h
722- panfrost: Add a per-gen panfrost_format_from_pipe_format() helper
723- panfrost: Add a per-gen panfrost_blendable_format_from_pipe_format() helper
724- panfrost: Make panfrost_format_to_bifrost_blend() a per-gen helper
725- panfrost: Add panfrost_[blendable]_format_table() helpers
726- panfrost: Move panfrost_is_yuv() to pan_format.h
727- panfrost: Move YUV-debugging out of panfrost_new_texture()
728- panfrost: Stop passing a panfrost_device to panfrost_new_texture()
729- panfrost: Don't pass a panfrost_device to panfrost_format_supports_afbc()
730- panfrost: Don't pass a panfrost_device to panfrost_afbc_can_tile()
731- panfrost: Stop passing a panfrost_device to pan_blend_get_internal_desc()
732- panfrost: Stop exposing pan_blend_create_shader()
733- panfrost: Stop passing a panfrost_device to pan_blend_create_shader()
734- panfrost: Stop passing a panfrost_device to pan_inline_rt_conversion()
735- panfrost: Make the pan_blend logic panfrost_device-agnostic
736- panfrost: Get rid of unused panfrost_device arguments in pan_blitter.c
737- panfrost: Pass the tile buffer budget through pan_fb_info
738- panfrost: Pass the sample position array through pan_fb_info
739- panfrost: Pass no_hierarchical_tiling info through pan_tiler_context
740- panfrost: Pass tiler heap info through pan_tiler_context
741- panvk: Inline pan_wls_mem_size()
742- panfrost: Make pan_desc.{c,h} panfrost_device agnostic
743- panfrost: Drop unused panfrost_device forward declaration in pan_shader.h
744- panfrost: Make pan_layout.c panfrost_device agnostic
745- panfrost: Make pan_sample.c panfrost_device agnostic
746- panfrost: Make pan_encoder.h panfrost_device agnostic
747- panfrost: Remove unused header inclusions from pan_blitter.h
748- panfrost: Make pan_blitter.h includable from non per-gen files
749- panfrost: Make pan_blitter.{c,h} panfrost_device agnostic
750- panfrost: Make pan_indirect_dispatch panfrost_device agnostic
751- panfrost: Make pan_pool.h panfrost_{device,bo} agnostic
752- panfrost: Make pan_props.c panfrost_device agnostic
753- panfrost: Make pan_texture.{c,h} panfrost_bo agnostic
754- panfrost: Make pan_desc.{c,h} panfrost_bo agnostic
755- panfrost: Remove uneeded pan_device.h inclusions
756- panfrost: Make panfrost_texfeatures.c panfrost_device agnostic
757- panfrost: Make pan_perf panfrost_device agnostic
758- panfrost: Add a helper to retrieve a panfrost_bo from a pan_kmod_bo
759- panvk: Get rid of unused pdev arguments passed to some meta helpers
- panvk: Stop passing panfrost_device around in internal meta helpers
- panvk: Store various physical device properties at the physical_device level
- panvk: Use vk_device::drm_fd instead of going back to the physical device
- panvk: Move panfrost_device and panvk_meta to panvk_device
- panvk: Add a decode context at the panvk_device level
- panvk: Instantiate our own blitter/blend_shader caches
- panvk: Add pan_kmod_{vm,dev} objects to panvk_device
- panvk: Add the concept of private BO
- panvk: Transition panvk_pool to panvk_priv_bo
- panvk: Transition panvk_descriptor_set to panvk_priv_bo
- panvk: Transition panvk_pipeline to panvk_priv_bo
- panvk: Transition panvk_{image,buffer}_view to panvk_priv_bo
- panvk: Track blit src/dst using pan_kmod_bo objects
- panvk: Keep a ref to a pan_kmod_bo in panvk_image
- panvk: Keep a ref to a pan_kmod_bo in panvk_buffer
- panvk: Keep tiler_heap and sample_positions BOs at the panvk_device level
- panvk: Move away from panfrost_{bo,device}
- panfrost: Move pan_{bo,device}.{c,h} to the gallium driver dir
- panfrost: Clamp the render area to the damage region
- panfrost: v4 doesn't have Blend descriptors
- panfrost: Pad compute jobs with zeros on v4
- pan/va: Add missing valhall_enums dep to valhall_disasm
- pan/kmod: Fix typo in pan_kmod_vm_op_check() helper
- pan/kmod: Add a PAN_KMOD_VM_FLAG_TRACK_ACTIVITY flag
- pan/kmod: Reject pre 1.1 panfrost kernel drivers
- panfrost: Rework the way we compute thread info
- panfrost: Prepare support for GPU variants
- pan/perf: Reject panthor kernel driver
- drm-uapi: Add panthor uAPI
- pan/kmod: Add a backend for panthor
- panfrost: Add v10 support to libpanfrost
- pan/genxml: Various CS related improvements in v10.xml
- pan/decode: Introduce the concept of usermode queue
- panfrost: Don't allocate a tiler heap buffer on v10+
- pan/genxml: Make sure pan_pack() evaluates 'dst' only once
- panfrost: Relax position result alignment constraint on v10+
- panfrost: Add arch-specific context init/cleanup hooks
- panfrost: Add a panfrost_context_reinit() helper
- panfrost: Add a cleanup_batch() method to panfrost_vtable
- panfrost: Enable v10 in the gallium driver
- panfrost: Advertize G610 support
- panfrost: Advertize G310 support
- panfrost: Update the release note to mention G310/G610 addition
- vk/meta: Add the PUSH_DESCRIPTOR_BIT flag when creating blit pipeline layouts
- vk/meta: Fix base_type selection in build_{clear,blit}_shader()
- panvk: Fix call ordering in panvk_DestroyDevice()
- panvk: clang-format the source files
- panvk: Kill the panvk_pack_color() prototype
- panvk: Add VKAPI_{ATTR,CALL} specifiers to all panvk-specific entrypoints
- panvk: Do not handle invalid NULL memory object in BindImageMemory2()
- panvk: Get rid of unused panvk_image_get_plane_size() helper
- panvk: Get rid of the custom device lost handling
- panvk: Fix allocation scope of command buffer sub-objects
- panvk: Add missing util_dynarray_init() in panvk_cmd_open_batch()
- panvk: Don't open-code panvk_cmd_open_batch() in CmdBeginRenderPass2()
- panvk: Don't allocate a TEXTURE descriptor in CreateImageView()
- panvk: s/panvk_event_op/panvk_cmd_event_op/
- panvk: Allocate descriptor set arrays using vk_multialloc_zalloc()
- panvk: Don't pass a device where we don't need one
- panvk: Get rid of unused panvk_cmd_buffer fields
- panvk: Kill panvk_{Create,Destroy}SamplerYcbcrConversion()
- panvk: Drop panvk_framebuffer
- panvk: Get rid of panvk_pipeline_cache
- panvk: Make panvk_buffer_view inherit from vk_buffer_view
- panvk: Make panvk_device_memory inherit from vk_device_memory
- panvk: Make pan_AllocateMemory() robust to errors
- panvk: Add extra checks to panvk_MapMemory()
- panvk: Implement {Map,Unmap}Memory2KHR
- panvk: Make panvk_sampler inherit from vk_sampler
- panvk: Fix GetPhysicalDeviceProperties2() to report accurate info
- panvk: Get rid of fields we already have in vk_xxx objects
- panvk: Disable global offset on varying and non-VS attribute descriptors
- panfrost: Move the image attribute offset adjustment to a NIR pass
- panvk: Implement dynamic rendering entry points
- nir: Extend nir_get_io_offset_src_number() to support load_push_constant
- nir: Extend nir_lower_mem_access_bit_sizes() to support push constants
- pan/bi: Lower push constant accesses
- pan/bi: Lower load_push_constant with dynamic indexing
- pan/bi: Update the push constant count when emitting load_push_constant
- panvk: Move some macros to panvk_macros.h
- panvk: Move image related definitions to panvk_image.{h,c}
- panvk: Move the VkBuffer logic to its own source file
- panvk: Move the VkBufferView logic to its own file
- panvk: Move the VkDeviceMemory logic to panvk_device_memory.{c,h}
- panvk: Move the VkSampler logic to its own file
- panvk: Move panvk_pipeline definition to panvk_pipeline.h
- panvk: Move VkImageView logic to its own source files
- panvk: Move the VkEvent logic to panvk_event.{c,h}
- panvk: Move panvk_descriptor_{set,pool} definitions to panvk_descriptor_set.h
- panvk: Move VkDescriptorSetLayout logic to panvk_descriptor_set_layout.{c,h}
- panvk: Move VkPipelineLayout logic to its own file
- panvk: Move shader related definitions to panvk_[vX\_]shader.{c,h}
- panvk: Kill panvk_[vX\_]cs.{c,h}
- panvk: Move panvk_{draw,dispatch}_info definitions to panvk_vX_cmd_buffer.c
- panvk: Move the VkCommandPool logic to panvk_cmd_pool.{c,h}
- panvk: Move VkQueue logic to panvk_[vX\_]queue.{c,h}
- panvk: Add a panvk_arch_dispatch_ret() variant
- panvk: Make the device creation/destruction per-arch
- panvk: Move the VkInstance logic to panvk_instance.{c,h}
- panvk: Move the VkPhysicalDevice logic to panvk_physical_device.{c,h}
- panvk: Move panvk_meta definitions to panvk_meta.h
- panvk: Move panvk_device definition to panvk_device.h
- panvk: Move the panvk_cmd_buffer definitions in panvk_cmd_buffer.h
- panvk: Move the panvk_priv_bo logic to panvk_priv_bo.{c,h}
- panvk: Move panvk_wsi definitions to panvk_wsi.h
- panvk: Kill panvk_private.h
- panvk: Make panvk_buffer_view per-gen
- panvk: Make panvk_image_view per-gen
- panvk: Make panvk_sampler per-gen
- panvk: Make panvk_cmd_buffer per-gen
- panvk: Make panvk_shader per-gen
- panvk: Make panvk_descriptor_set per-gen
- panvk: Make panvk_descriptor_set_layout per-gen
- panvk: Make panvk_pipeline per-gen
- panvk: Make panvk_queue per-gen
- panvk: Make panvk_pipeline_layout per-gen
- panvk: Fix attach-less rendering
- panvk: Fix the colorAttachmentCount check in begin_rendering_init_fbinfo()
- pan/bi: Support fragment store_output() with a non-zero offset
- panvk: Don't assume VkGraphicsPipelineCreateInfo::pColorBlendState != NULL
- pan/bi: Allow subpass sampler dims
- panvk: Fix input attachment support
- panvk: Fill pan_tls_info::wls::instances
- panvk: Make sure the sample_pattern is set in the tiler descriptor
- panvk: We don't support resolve operations yet
- pan/bi: Extend bi_emit_texc() to support wider direct tex/sampler idx
- panvk: Don't assume pViewportState != NULL
- panvk: Fix img2buf copies with image X offset not aligned on 16 pixels
- panvk: Fix has_non_vs_attribute() test in panvk_draw_prepare_vs_attribs()
- panvk: Make sure we pick a valid wrap_mode_r value for unnormalizedCoordinates
- panvk: Fix depth/stencil image views
- panvk: Make sure we have a decode context created when we need one
- panvk: Don't advertize vertex_buffer cap on sRGB formats
- panvk: Swizzle the border color on v7 when the format is BGR
- panvk: Re-order things in panvk_physical_device_init()
- panvk: Fill maxCustomBorderColorSamplers
- panvk: Skip tiler jobs when the vertex shader doesn't write the position
- panvk: Make sure we use the proper format for views of depth+stencil images
- panvk: Abort on fault when PANVK_DEBUG=sync
- panvk/ci: Make sure we catch GPU faults
- panvk/ci: Enable dEQP-VK.pipeline.monolithic.*
- panvk: Add support for KHR_push_descriptor
- panvk/ci: Re-enable copy_and_blit tests
- panvk: Stop declaring one push constant array per graphics stage
- panvk: Pass the push constant array to draw/dispatch calls
- panvk: Stop lowering push constant loads to UBO loads
- panvk: Dissociate UBO and push_constant emission
- nir/lower_blend: Fix nir_blend_logicop() for 8/16-bit integer formats
- panfrost: do not write outside num_wg_sysval
- panfrost: Add the BO containing fragment program descriptor to the batch
- pan/kmod: Fix a syncobj leak in the panthor backend
- pan/kmod: Make default allocator thread-safe

Boyuan Zhang (5):

- radeonsi/vcn: only use multi slices reflist when available
- meson: bump the minimal required vdpau version to 1.4
- ac/gpu_info: Add vcn dec and enc version query
- radeonsi/vcn: choose rc_per_pic by encode version
- radeonsi/vcn: mark rc_per_pic as obsoleted

Błażej Szczygieł (2):

- gallivm/ssbo: replace run time loop by compile time loop
- gallivm/ssbo: mask offset with exec_mask instead of building the 'if'

Caio Oliveira (268):

- intel/compiler/xe2: Implement instruction compaction for DPAS.
- intel/compiler: Add a couple of tests for fs_combine_constants
- intel/compiler: Fix rebuilding the CFG in fs_combine_constants
- intel: Use an intel enum for cmat scope
- intel/compiler: Enable lower_rotate_to_shuffle in subgroup lowering
- anv: Advertise VK_KHR_shader_subgroup_rotate
- iris: Remove unused brw_* includes
- iris: Remove prototypes for unsupported Gfx versions
- iris: Remove unused parameter
- iris: Call blorp_finish() when destroying context
- crocus: Call blorp_finish() when destroying context
- intel/compiler: Rename brw_image_param to isl_image_param
- intel/compiler: Rename BRW_WM_MSAA_* enums to INTEL_MSAA_*
- intel/compiler: Rename BRW_TESS_* enums to INTEL_TESS_*
- intel/compiler: Rename DISPATCH_MODE_* enums to INTEL_DISPATCH_MODE_*
- intel/compiler: Rename brw_vue_map to intel_vue_map
- intel/compiler: Rename brw_cs_dispatch_info to intel_cs_dispatch_info
- intel/compiler: Move disassemble functions to own header file
- intel/compiler: Include brw_disasm_info.h where it's used
- intel/compiler: Merge intel_disasm.[ch] into corresponding brw files
- intel: Rename i965_{asm,disasm} tools to brw_{asm,disasm}
- intel/blorp: Don't require specific prog_data type in callback
- intel/blorp: Remove brw\_ prefix when not applicable
- intel/blorp: Simplify blorp_compile_fs() interface
- intel/blorp: Simplify blorp_compile_cs() interface
- intel/blorp: Use a struct to return blorp_compile_*() results
- intel/blorp: Remove outdated reference in comment
- intel/blorp: Move brw_blorp_get_urb_length helper
- intel/blorp: Avoid brw types in blorp_priv.h
- intel/blorp: Move brw_compiler.h include to where it is needed
- intel/blorp: Use a Meson dependency for blorp
- intel: Add missing dependencies on blorp
- intel/decoder: Move decoder to a separate module
- intel/compiler: Collect NIR-only passes in intel_nir.h
- intel/compiler: Rename the passes and files related to intel_nir.h
- intel/compiler: Rename brw_gfx_ver_enum.h to intel_gfx_ver_enum.h
- intel: Remove brw\_ prefix from process debug function
- intel/isl: Include compiler generic header
- anv: Remove lower_atomics from storage image lowering opts
- iris: Remove no-ops from storage image lowering
- intel/compiler: Use "intel" prefix for walk_order enum
- iris: Add stage to iris_compiled_shader
- iris: Don't use prog_data to guard 3DSTATE_CONSTANT_* code
- iris: Reduce dependency on brw_*_prog_data structs
- iris: Take ownership of prog_data when applying it
- iris: Use uint32_t instead of brw_param_builtin
- iris: Move compiler creation to iris_program.c
- iris: Add IRIS_MAX_* constants to replace BRW_MAX_* usage
- iris: Add helper to access use_tcs_multi_patch
- iris: Add helper for indirect_ubos_use_sampler
- iris: Move iris_get_compiler_options to iris_program.c
- iris: Include brw_compiler.h only when needed
- intel/meson: Remove usage of meson.source_root and meson.build_root
- intel/meson: Fix warning about broken str.format
- intel/elk: Fork Gfx8- compiler by copying existing code
- intel/elk: Compile ELK library, tests and tools
- intel/elk: Remove compiler specific devinfo hash
- intel/elk: Remove a bunch of files that don't apply for Gfx8-
- intel/elk: Use common code in intel/compiler
- intel/elk: Remove stages not used in Gfx8-
- intel/elk: Remove DPAS lowering
- intel/elk: Rename files to use elk prefix
- intel/elk: Rename header guards
- intel/elk: Update doxygen-like file comments
- intel/elk: Rename C++ namespace
- intel/elk: Rename symbols
- intel/elk: Don't include elk_eu_defines.h in elk_nir.h
- intel/elk: Create separate header for opcodes
- intel/blorp: Move brw specific code to a separate file
- intel/blorp: Explicitly include brw_compiler.h header
- intel/blorp: Add ELK support
- intel/blorp: Remove Gfx9+ references in elk code
- intel/decoder: Add ELK support
- crocus: Use ELK compiler
- hasvk: Use ELK compiler
- iris: Rename screen->compiler to screen->brw
- iris: Use ELK compiler for Gfx8
- intel/tools: Add ELK support for aubinator
- intel/tools: Add ELK support for aubinator_error_decode
- intel/tools: Add ELK support for intel_hang_replay
- intel/tools: Add ELK support for aubinator_viewer
- intel/tools: Add ELK support for intel_hang_viewer
- intel: Use _brw suffix for genX headers that rely on brw
- intel/meson: Rename libintel_compiler to libintel_compiler_brw
- intel/tools: Add extra compiler device sha only for Gfx9+
- intel/elk: Move nir_options to its own c/h file pair
- intel-clc: Use correct set of nir_options when building for Gfx8
- intel/elk: Use anonymous namespace in fs_combine_constants
- intel/elk: Remove tests for Gfx9+
- intel/brw: Remove assembler tests for Gfx8-
- intel/brw: Remove EU compaction tests for Gfx8-
- intel/brw: Remove EU validation tests for Gfx8-
- intel/brw: Remove pass test cases for Gfx8-
- intel/brw: Assert Gfx9+
- intel/compiler: Remove has_render_target_reads from wm_prog_data
- intel/brw: Remove Gfx8- passes from optimize()
- intel/brw: Pull opt_copy_propagation out of fs_visitor
- intel/brw: Pull opt_cmod_propagation out of fs_visitor
- intel/brw: Pull opt_saturate_propagation out of fs_visitor
- intel/brw: Pull dead_code_eliminate out of fs_visitor
- intel/brw: Pull opt_combine_constants out of fs_visitor
- intel/brw: Pull opt_cse out of fs_visitor
- intel/brw: Pull bank_conflicts out of fs_visitor
- intel/brw: Pull peephole_sel out of fs_visitor
- intel/brw: Pull redundant_halt out of fs_visitor
- intel/brw: Pull opt_algebraic out of fs_visitor
- intel/brw: Pull split/compact virtual_grf opts out of fs_visitor
- intel/brw: Pull opt_split_sends out of fs_visitor
- intel/brw: Pull opt_zero_samples out of fs_visitor
- intel/brw: Pull eliminate_find_live_channel out of fs_visitor
- intel/brw: Pull remove_extra_rounding_modes out of fs_visitor
- intel/brw: Pull register_coalesce out of fs_visitor
- intel/brw: Pull lower_constant_loads out of fs_visitor
- intel/brw: Pull lower_pack out of fs_visitor
- intel/brw: Pull lower_simd_width out of fs_visitor
- intel/brw: Pull lower_barycentrics out of fs_visitor
- intel/brw: Pull lower_logical_sends out of fs_visitor
- intel/brw: Pull fixup_nomask_control_flow out of fs_visitor
- intel/brw: Pull lower_integer_multiplication out of fs_visitor
- intel/brw: Pull lower_sub_sat out of fs_visitor
- intel/brw: Pull lower_derivatives out of fs_visitor
- intel/brw: Pull lower_regioning out of fs_visitor
- intel/brw: Pull fixup_sends_duplicate_payload out of fs_visitor
- intel/brw: Pull lower_uniform_pull_constant_loads out of fs_visitor
- intel/brw: Pull lower_find_live_channel out of fs_visitor
- intel/brw: Pull lower_load_payload out of fs_visitor
- intel/brw: Use references for a couple of backend_shader passes
- intel/brw: Simplify OPT macro usage in fs_visitor::optimize
- intel/brw: Pull fixup_3src_null_dest out of fs_visitor
- intel/brw: Pull emit_dummy_memory_fence_before_eot out of fs_visitor
- intel/brw: Pull emit_dummy_mov_instruction out of fs_visitor
- intel/brw: Pull lower_scoreboard out of fs_visitor
- intel/brw: Pull optimize() out of fs_visitor
- intel/brw: Move optimize and small optimizations to brw_fs_opt.cpp
- intel/brw: Move virtual GRF opts into their own file
- intel/brw: Move fs algebraic to its own file
- intel/brw: Move small lowering passes into brw_fs_lower.cpp
- intel/brw: Move lower_integer_multiplication to its own file
- intel/brw: Expose flag_mask/bit_mask fs helpers
- intel/brw: Move lower_simd_width to its own file
- intel/brw: Move workarounds to a separate file
- intel/blorp: Remove Gfx8- references in BRW code
- intel/brw: Move brw_compile_* functions out of vec4-specific files
- intel/brw: Move type_size_* functions out of vec4-specific file
- intel/brw: Always use scalar shaders
- intel/brw: Remove vec4 backend
- intel/brw: Remove now unused vec4-only opcodes
- intel/brw: Remove unused legacy shader stages
- intel/brw: Remove Gfx8- code from disassembler
- intel/brw: Remove Gfx8- code from assembler
- intel/brw: Remove Gfx8- code from brw_compile_* functions
- intel/brw: Remove Gfx8- code from scheduler
- intel/brw: Remove Gfx8- code from register allocator
- intel/brw: Remove Gfx8- code from thread payload
- intel/brw: Remove Gfx8- code from NIR conversion
- intel/brw: Remove Gfx8- code from lower storage image pass
- intel/brw: Remove Gfx8- code from lower logical sends
- intel/brw: Remove Gfx8- code from generator
- intel/brw: Remove Gfx8- code from backend passes
- intel/brw: Remove Gfx8- code from EU compaction
- intel/brw: Remove Gfx8- code from IR performance analysis
- intel/brw: Remove Gfx8- code from EU emission
- intel/brw: Remove Gfx8- code from EU validation
- intel/brw: Remove Gfx8- code from NIR passes
- intel/brw: Remove Gfx4-5 manual compression selection
- intel/brw: Remove Gfx8- code from EU codegen helpers
- intel/brw: Remove Gfx8- code from NIR options
- intel/brw: Remove Gfx8- code from register type helpers
- intel/brw: Remove Gfx8- specific EU inst helpers
- intel/brw: Remove Gfx8- code from inst FC and F macros
- intel/brw: Replace inst F8 macro with F macro
- intel/brw: Remove Gfx8- code from inst F20 macros
- intel/brw: Remove Gfx8- code from inst FD20 and FV20 macros
- intel/brw: Remove Gfx8- code from inst FI macros
- intel/brw: Remove Gfx8- code from inst BRW_IA*_ADDR_IMM macros
- intel/brw: Remove Gfx8- code from inst FFDC, FDC and FD macros
- intel/brw: Update comments for FK macro
- intel/brw: Replace inst FF macro with F or F20 macros
- intel/brw: Remove F16TO32 and F32TO16 opcodes
- intel/brw: Remove Gfx8- code from builder
- intel/brw: Remove Gfx8- code from fs_inst
- intel/brw: Remove Gfx8- code from VUE map
- intel/brw: Remove Gfx8- code from SIMD lowering
- intel/brw: Remove Gfx8- code from visitor
- intel/brw: Remove Gfx8- remaining opcodes
- intel/brw: Remove MRF type
- intel/brw: Inline brw_nir_apply_sampler_key code
- intel/brw: Remove unused attrib workarounds
- intel/brw: Remove edgeflag_is_last VS parameter
- intel/brw: Remove Gfx8- fields from \*_prog_key structs
- intel/brw: Remove Gfx8- fields from \*_prog_data structs
- intel/brw: Use a single register set
- intel/brw: Remove runtime_check_aads_emit
- intel/brw: Remove automatic_exec_sizes
- intel/brw: Use fs_visitor instead of backend_shader in various passes
- intel/brw: Fold fs_instruction_scheduler into instruction_scheduler
- intel/brw: Change cfg_t to refer to fs_visitor
- intel/brw: Move dump_* functions into fs_visitor
- intel/brw: Fold backend_shader into fs_visitor
- intel/brw: Remove extra stage_prog_data field in fs_visitor
- intel/brw: Remove brw_shader.h
- intel/meson: Add dependencies for brw and elk
- intel/compiler: Remove nir_print_instr hack in disasm_info
- intel/brw: Use C++ for brw_disasm_info.c
- intel/brw: Hide the definition of cfg_t et al from C code
- intel/brw: Use fs_inst in cfg_t
- intel/brw: Use fs_inst explicitly in various passes
- intel/brw: Use fs_inst in disasm_annotate()
- intel/brw: Move functions from backend_instruction into fs_inst
- intel/brw: Fold backend_instruction into fs_inst
- intel/brw: Remove typedefs from fs_builder
- intel/brw: Fold backend_reg into fs_reg
- intel/brw: Simplify usage of reg immediate helpers
- intel/compiler: Fix SIMD lowering when instruction needs a larger SIMD
- intel/elk: Remove split sends
- intel/elk: Remove DPAS opcode
- intel/elk: Remove BTD and RT opcodes
- intel/elk: Remove DP4A opcode
- intel/elk: Remove ROR and ROL opcodes
- intel/elk: Remove IADD3 opcode
- intel/elk: Remove EU compaction logic for Gfx9+
- intel/elk: Remove encoding for Gfx9+
- intel/elk: Remove SYNC opcode and SWSB annotations
- intel/elk: Remove Gfx12 SFIDs and related LSC code
- intel/elk: Remove Gfx9+ sampler messages and modes
- intel/elk: Rename symbols for A64 OWord Block R/W messages
- intel/elk: Remove Gfx9+ dataport messages
- intel/elk: Remove FB_READ opcodes
- intel/elk: Remove Gfx12.5 URB message
- intel/elk: Remove ex_desc and ex_mlen from elk_inst
- intel/elk: Remove Xe2 logical sends lowering
- intel/elk: Remove unused sources from ELK_SHADER_OPCODE_SEND
- intel/elk: Remove unused SEND features
- intel/elk: Remove validation code for Gfx9+
- intel/elk: Remove Gfx9+ from nir conversion
- intel/elk: Remove Gfx9+ from compile/run functions
- intel/elk: Remove FB_WRITE_LOGICAL_SRC_SRC_STENCIL
- intel/elk: Remove Gfx9+ from passes
- intel/elk: Remove Gfx9+ from thread payload
- intel/elk: Remove Gfx9+ from EU emission
- intel/elk: Remove coarse pixel handling
- intel/elk: Remove Gfx9+ from FS generator
- intel/elk: Remove Gfx9+ from Reg related code
- intel/elk: Remove Gfx9+ from asm grammar
- intel/elk: Remove Gfx9+ from disasm
- intel/elk: Remove Gfx9+ from NIR auxiliary code
- intel/elk: Remove use_tcs_multi_patch
- intel/elk: Remove Gfx9+-only passes
- intel/elk: Remove uses of intel_device_info_is_9lp()
- intel/elk: Remove remaining Gfx9+ code
- intel/elk: Remove multi-polygon support
- intel/elk: Clean up unused code in elk_compiler.h
- intel/brw: Use hstride instead of stride for accumulator
- intel/brw: Use helper to create accumulator register
- intel/brw: Fix validation of accumulator register
- anv: Enable VK_KHR_shader_maximal_reconvergence
- intel/tools: Make intel_stub_gpu work when using meson devenv
- intel/brw: Implement quad_vote_any and quad_vote_all
- intel/brw: Use predicates for quad_vote_any and quad_vote_all when available
- anv: Enable VK_KHR_shader_quad_control
- intel/brw: Handle Xe2 in brw_fs_opt_zero_samples
- intel/brw: Remove vestiges of sources on IF opcode, only valid on Gfx6
- intel/brw: Add a src array for the common case in fs_inst
- intel/brw: Refactor FS validation macros
- intel/brw: Remove two duplicated validate calls in optimizer
- intel/brw: Move validate out of fs_visitor
- intel/brw: Support FIXED_GRF when generating code for CLUSTER_BROADCAST
- intel/brw: Lower VGRFs to FIXED_GRFs earlier

Casey Bowman (1):

- anv: Override VendorID for Hitman 3

Charlie Turner (2):

- amd, radeonsi: Lower minimum supported video dimensions for AV1
- {vulkan,radv,anv}/video: fix issue in H264 scaling lists derivation

Chia-I Wu (7):

- radv: fix pipeline stats mask
- meson: fix a build error
- radv: hide the sparse queue when radv_legacy_sparse_binding
- radv: hide the sparse queue on older kernels
- radv: set VK_SYNC_FEATURE_GPU_MULTI_WAIT
- aco: fix nir_op_pack_32_4x8 handling
- radv: fix 2d/3d image copy on compute queue

Chris Rankin (4):

- vdpau: Declare texture object as immutable using helper function.
- vdpau: Refactor query for video surface formats.
- meson: bump the minimal required vdpau version to 1.5
- frontends/vdpau: Add support for VDPAU AV1 decoding.

Christian Duerr (1):

- panfrost: Fix dual-source blending

Christian Gmeiner (100):

- .gitignore: Add .venv folder
- etnaviv/isa: Add missing dep of encode.py/decode.py calls on isa.py
- isaspec: encode.py: Include assert.h
- isaspec: encode.py: Include util/log.h
- etnaviv: Remove no_oneconst_limit from etna_inst
- isaspec: encode: Constify encode.type
- isaspec: encode: Constify bitset_params
- etnaviv: Remove not used etna_assemble_set_imm(..)
- etnaviv: Fix how we determine the max supported number of varyings
- etnaviv: isa: Remove duplicate #instruction-alu-atomic
- etnaviv: isa: Add dsx and dsy opcodes
- etnaviv: isa: Add frc opcode
- etnaviv: isa: Add norm_dp2, norm_dp3 and norm_dp4 opcodes
- etnaviv: isa: Add bit_extract opcode
- etnaviv: isa: Correct dp2 opcode
- etnaviv: isa: Add branch_any opcode
- etnaviv: isa: Name cond enum value 22
- etnaviv: isa: Add movai opcode
- etnaviv: isa: Add bit_rev opcode
- etnaviv: isa: Add texldb opcode
- etnaviv: isa: Add texldl opcode
- etnaviv: isa: Add texldd opcode
- etnaviv: isa: Remove note about GC3000
- etnaviv: isa: Add div opcode
- etnaviv: isa: Reorder instructions
- etnaviv: isa: Rename reg_group u2 to u
- etnaviv: isa: Add internal register group
- etnaviv: isa: Add movar opcode
- etnaviv: isa: Move {TEX_SWIZ}
- etnaviv: isa: Correct SRC0_AMODE
- etnaviv: isa: Correct #instruction-cf-src1-src2 bitset name
- etnaviv: isa: Correct #instruction-alu-no-dst-maybe-src1-src2 name
- etnaviv: isa: Correct #instruction-alu-no-dst-has-src0-src1 expr name
- etnaviv: isa: Combine branch and branch_if
- etnaviv: isa: Support unary branch instruction
- etnaviv: isa: Support unary texkill instruction
- etnaviv: isa: Support multiple encodings for texldl
- etnaviv: isa: Fix #instruction-tex-src0-src1-src2 bitset
- etnaviv: isa: Support multiple encodings for texldb
- isaspec: Remove not used isa_decode_hook
- isaspec: decode: Hide all the internals ISA details
- isaspec: decode: Add isa specific functions
- isaspec: decode: Make isa_decode_bitset(..) private
- freedreno/isa: Rework meson dependency for libir3decode
- etnaviv: isa: Rework meson dependency for libetnaviv_decode
- isaspec: decode: Make isa_bitset arrays static
- isaspec: decode: Make isa_decode_field(..) private
- isaspec: decode: Add libisaspec
- isaspec: decode: Remove generic functions from public interface
- etnaviv: isa: Define a dontcare bit in atomic instructions
- etnaviv: isa: Add name attributes
- etnaviv: isa: Generate c header containing enums
- etnaviv: isa: Generate opcode enum
- etnaviv: isa: Add an empty libetnaviv_encode
- etnaviv: Link against libetnaviv_encode
- etnaviv: Move struct etna_inst to src/etnaviv
- etnaviv: isa: Make use of generated enums
- etnaviv: isa: Add rounding to etna_inst
- etnaviv: Set dst.use for MOVAR
- etnaviv: isa: Add encode support
- etnaviv: isa: Add isa_assemble_instruction(..)
- etnaviv: Switch to isa_assemble_instruction(..)
- etnaviv: Move swizzle related macros to src/etnaviv
- etnaviv: Switch to macros from isa.h
- etnaviv: Remove isa.xml.h
- etnaviv: Do not set tex.amode for rounding
- ci/etnaviv: Remove duplicates
- ci/etnaviv: Do not skip tex-miplevel piglits
- etnaviv: Remove offline shader compiler
- etnaviv: Introduce common etna_core_info
- etnaviv: drm: Make use of etna_core_info
- etnaviv: drm: Add etna_gpu_get_core_info(..)
- etnaviv: Switch to etna_core_info
- etnaviv: Move hw header to common place
- etnaviv: Introduce etna_feature enum
- etnaviv: common: Add feature bitset
- etnaviv: drm: Initialize etna_core_info based on kernel features
- etnaviv: Switch to etna_core APIs
- etnaviv: drm: Query some id values in etna_gpu_new(..)
- etnaviv: hwdb: Import gc_feature_database from NXP
- etnaviv: hwdb: Import gc_feature_database from Amlogic
- etnaviv: hwdb: Import gc_feature_database from ST
- ci: Install python3-pycparser in build container
- etnaviv: hwdb: Generate hwdb.h
- etnaviv: hwdb: Add etna_query_feature_db(..)
- etnaviv: drm: Make use of hwdb
- etnaviv: common: Add enum etna_core_type
- etnaviv: common: Add some limit values
- etnaviv: hwdb: Fill limits
- etnaviv: drm: Fill limits
- etnaviv: Copy values from etna_core_info
- etnaviv: drm: Remove fallback value for ETNA_GPU_NUM_CONSTANTS
- etnaviv: Drop not needed check if seamless cube map is supported
- etnaviv: hwdb: Drop stdint.h dependency
- nvk: Remove duplicate DRM_NODE_RENDER check
- meson: Add missing newline at eof
- etnaviv: Switch to etna_core_disable_feature(..)
- etnaviv: Fix disabling of features
- etnaviv: drm: Drop NPU-related params
- clc: Always use spir for 32 bit

Collabora's Gfx CI Team (4):

- Uprev Piglit to e9316bcd12544aaf7e753ce37fe50d64165d9598
- Uprev Piglit to 2a1c49a81cd9a6bf5d0c3a9b87225be94771ca96
- Uprev Piglit to 1e631479c0b477006dd7561c55e06269d2878d8d
- Uprev Piglit to dd6f7eaf82e8dd442da28b346c236141cbcce0b1

Connor Abbott (56):

- freedreno: Add a7xx crashdump-related registers and enums
- ir3/ra: Add specialized shared register RA/spilling
- ir3: Set branchstack earlier
- ir3: Rewrite (jp) and branchstack handling
- ir3: Calculate physical edges correctly
- ir3: Fix comment thinko
- ir3/ra: Fix bug with collect source handling
- tu: Add more info to ldg inline uniform path
- ir3/a7xx: Fix load_global_ir3 with immediate offset
- ir3: Initial support for pushing globals with ldg.k
- tu: Follow pipeline compatibility rules for dynamic descriptors
1350- tu: Reenable MSAA UBWC on a6xx gen1
1351- tu: Enable UBWC for SNORM formats on a740+
1352- tu: Enable UBWC for storage images on a7xx
1353- vk/graphics_state: Remove bogus assert in CmdSetSampleMaskEXT
1354- vk/graphics_state: Add stubs required by VK_EXT_shader_objects
1355- freedreno/afuc: Decode (peek) modifier
1356- freedreno/afuc: Add missing ALU encode case for bic
1357- freedreno/afuc: Bump max instructions for a7xx
1358- freedreno/afuc: Fix setbit/clrbit parsing
1359- freedreno/afuc: Use left recursion in parser
1360- freedreno/afuc: Improve jump table handling
1361- freedreno/afuc: Add .align directive
1362- freedreno/afuc: Add more general T_IDENTIFIER in lexer
1363- freedreno/afuc: Add support for multiple sections when assembling
1364- freedreno/afuc: Allow -e option on a7xx
1365- freedreno/afuc: Emulate THREAD_SYNC on a660
1366- freedreno/afuc: Run entire bootstrap routine
1367- freedreno/afuc: Add a7xx test case
1368- freedreno/afuc: Add magic control reg values for a740
1369- freedreno/afuc: Add section on reassembling firmwares and relocations
1370- freedreno/a7xx: Add CP_CCHE_INVALIDATE
1371- tu: Implement CCHE invalidation
1372- nir/divergence_analysis: Add ir3-specific intrinsics
1373- nir/divergence_analysis: Add uniform_load_tears option
1374- nir/divergence_analysis: Fix load_view_index divergence in VS
1375- ir3: Allow single-predecessor phis
1376- ir3: Run divergence analysis at the end
1377- ir3: Remove loop shared copy check
1378- ir3: Use divergence analysis for (jp) and physical CFG
1379- freedreno/afuc: Switch to using the GPU ID in the firmware
1380- freedreno/afuc: Add a7xx new-style branch instructions
1381- freedreno/afuc: Add initial support for a750
1382- freedreno: Make has_ibo_ubwc a7xx specific
1383- freedreno,tu: Disable UBWC for storage images on a750
1384- ir3/legalize: Fix intra-block state propagation with loops
1385- ir3: Rewrite nop insertion
1386- docs/android: Fix example meson cross file
1387- docs/android: Improve instructions for replacing driver
1388- ir3: Don't use non-contiguous component masks for FS
1389- ir3: Don't pack FS inlocs
1390- freedreno/a7xx: Register updates from kgsl
1391- ir3: Add scan_clusters_macro to ir3_valid_flags()
1392- ir3: Add scan_clusters.macro to is_subgroup_cond_mov_macro()
1393- ir3/ra: Don't demote movmsk instructions to non-shared
1394- docs/android: Fix example meson cross file again
1395
1396Constantine Shablia (6):
1397
1398- panvk: implement vkGetBufferDeviceAddress
1399- panvk: advertise bufferDeviceAddress
1400- vulkan/runtime: fix typo
1401- mesa: fix typo
1402- pan/bi: fix 1D array tex coord lowering
1403- panfrost: report correct MAX_VARYINGS
1404
1405Corentin Noël (37):
1406
1407- zink: Avoid the use of negative array offsets
1408- zink: Use memmove when dealing with overlapping memory
1409- glsl: Make sure to not cast ir_dereference_variable into ir_variable
1410- glsl: Make sure that the variable is a ir_variable before unreferencing it
1411- zink: Initialize zink_shader_object
1412- zink: Initialize zink_bindless_descriptor to zero on creation
1413- zink: Initialize pipe_query_result
1414- zink: Do not shadow the variable ret
1415- zink: Avoid variable shadowing everywhere
1416- zink: Only call reapply_color_write if EXT_color_write_enable is available
1417- ci_run_n_monitor: Allow the upstream format to not exist
1418- zink: use symbolic values instead of 0
1419- zink: do not use undefined stage mask if on missing KHR_synchronization2
1420- glsl: Ensure that we are dealing with ir_variable and ir_rvalue
1421- venus: sync protocol for VK_EXT_attachment_feedback_loop_layout
1422- venus: enable VK_EXT_attachment_feedback_loop_layout
1423- zink: Return early if the file descriptor could not have been duplicated/acquired
1424- ci: Update virglrenderer and crosvm
1425- zink: Make wrap_surface return a zink_ctx_surface directly
1426- zink: Use an intermediary variable for create_surface
1427- zink: Separate the template from the wrapped surface
1428- zink: Return early if the source could not have been acquired
1429- zink: Move zink_surface_destroy before zink_create_surface
1430- zink: Make sure to not leak anything on surface creation failure
1431- zink: Change zink_get_surface to return a zink_surface
1432- zink: Add error logging on surface creation failure
1433- st_pbo/compute: Use the correct structure type when allocating a specialized key
1434- zink: Make sure to initialize all the fields of VkMemoryBarrier
1435- dri/kopper: Assume a non-null drawable in flush_frontbuffer
1436- zink: Removed unused function
1437- zink: Removed unused num_texel_buffers member
1438- zink: Removed unused push_valid member
1439- zink: Remove ctx from zink_gfx_program
1440- ci: Change propagated variables into an array
1441- ci: Add VK_DRIVER_FILES passthrough from jobs to tests
1442- ci: Allow to pass LIBGL_ALWAYS_SOFTWARE to the guest environment
1443- ci: Add zink-venus-lvp job
1444
1445Daniel Almeida (17):
1446
1447- nak/sm50 add support for suld
1448- nak/sm50: add support for suatom
1449- nak/sm50: add support for isberd
1450- nak: sm50: add support for OpOut
1451- nak: sm50: fadd: ensure src[0] is in a register
1452- nak/sm50: legalize: display instruction on panic
1453- nak/sm50: add support for brev
1454- nak: sm50: fix some legalization issues
1455- nak/sm50: add a memstream abstraction
1456- nak/sm50: add an annotate debug flag
1457- nak/sm50: support annotations through OpAnnotate
1458- nak/sm50: sprinkle OpAnnotate in optimization passes
1459- meson,ci: Add the paste crate
1460- nil: Add the start of a Rust library
1461- nil: Rewrite nil_format in rust
1462- nil: Re-implement nil_image in Rust
1463- nil: Rewrite the TIC code in Rust
1464
1465Daniel Schürmann (52):
1466
1467- aco/insert_exec_mask: unify exec restore code after divergent control flow
1468- aco/insert_exec_mask: replace phi for loop restore mask with explicit copies
1469- aco/insert_exec_mask: only create loop phis for exec mask if necessary
1470- aco: give spiller more room to assign spilled SGPRs to VGPRs
1471- spirv: Fix SpvOpExpectKHR
1472- vulkan: enable VK_KHR_shader_expect_assume
1473- spirv: Update headers and grammar JSON
1474- aco/insert_exec_mask: Fix unconditional demote at top-level control flow.
1475- aco/insert_exec_mask: tiny refactor
1476- aco: always terminate quads if they have been demoted entirely
1477- aco/insert_exec_mask: Reduce latency when switching to WQM.
1478- spirv: implement SPV_KHR_maximal_reconvergence
1479- aco: enable WQM if demote is used with maximal reconvergence
1480- radv: enable VK_KHR_shader_maximal_reconvergence
1481- spirv: implement SPV_KHR_quad_control
1482- radv: enable VK_KHR_shader_quad_control
1483- radv: fix initialization of radv_shader_layout->use_dynamic_descriptors
1484- aco: rematerialize constants in every basic block during optimizer
1485- aco: reorder code and use namespaces in aco_interface.cpp
1486- aco/util: small_vec few additions
1487- aco: use small_vec as Block::edge_vec for predecessors and successors
1488- aco/spill: refactor SSA repairing
1489- aco/spill: don't allocate extra spill_id for phi operands in add_coupling_code()
1490- aco/spill: add spills_entry interferences only when necessary
1491- aco/spill: refactor adding spilled vars into separate function add_to_spills()
1492- aco/spill: keep live-out variables spilled at branch blocks
1493- aco/spill: don't prefer to spill phis at merge blocks
1494- aco/spill: add interferences with variables spilled at loop headers
1495- aco/spill: avoid re-spilling loop-carried variables in process_block()
1496- aco/spill: avoid re-spilling loop-carried variables in add_coupling_code()
1497- aco/spill: keep loop-carried variables spilled at loop headers
1498- aco/spill: keep loop-carried variables spilled at merge blocks
1499- aco/spill: select more loop-carried variables to be spilled
1500- aco/spill: keep loop variables spilled during nested loops
1501- aco: use instr_class::branch to identify SOPP branches
1502- aco: remove SOPP_instruction::block member
1503- aco: unify different SALU types into single struct SALU_instruction
1504- aco/builder: use accessor functions instead of casting to subtypes
1505- aco: change return type of create_instruction() to Instruction*
1506- aco: defer instruction size from aco::Format in create_instruction()
1507- aco: remove create_instruction() template parameter
1508- aco: move create_instruction() to aco_ir.cpp
1509- aco/spill: Fix assertion for nested loops
1510- aco/spill: pass live_vars to spill_ctx
1511- aco/spill: compute live-in variables from live-out
1512- aco/spill: maintain valid live vars at any point
1513- aco/spill: use live variables instead of next_use_distances in add_coupling_code()
1514- aco/spill: gather information about average use distances
1515- aco/spill: use average use distances in process_block()
1516- aco/spill: use average use distances in init_live_in_vars() for merge blocks
1517- aco/spill: use average use distances to spill loop variables
1518- aco/ra: fix kill flags after renaming fixed Operands
1519
1520Daniel Stone (50):
1521
1522- egl: Return BAD_CONFIG when robust access unsupported
1523- st/dri: Use correct pipe_resource for GL texture image export
1524- dri: Redeclare __DRI_IMAGE_FORMAT_* as PIPE_FORMAT_*
1525- st/dri2: Remove __DRI_IMAGE_FORMAT conversion
1526- st/dri2: Pass pipe_format to driCreateConfigs
1527- st/dri2: Use u_format to get config format information
1528- util: Add util_format_get_component_shift
1529- st/dri: Remove format tables from driCreateConfigs
1530- st/dri: Completely remove mesa_format from config setup
1531- st/dri: Add transient HAS_ZS() helper
1532- st/dri: Rework depth/stencil format selection
1533- st/dri: Use pipe_format for Z/S modes
1534- st/dri: Check format properties from format helpers
1535- st/dri: Store pipe_format in gl_config
1536- egl/wayland: Remove format-query fallback
1537- st/dri: Reuse stored renderbuffer format
1538- st/dri: Reuse stored texture format
1539- dri/kopper: Move format -> FourCC translation up a level
1540- dri/kopper: Add translations for sRGB formats
1541- dri/kopper: Reorder format tables
1542- dri/kopper: Flatten pipe_format/DRIImage/FourCC conversion
1543- egl/wayland: Query image FourCC for linear copies
1544- egl/wayland: s/DRI_IMAGE_FORMAT/pipe_format/g
1545- egl/wayland: Add opaque-equivalent FourCCs
1546- egl/wayland: Fix EGL_EXT_present_opaque
1547- egl/wayland: Use pipe_format to look up configs
1548- egl/wayland: Use FourCC to look up wl_buffer support
1549- egl/wayland: Add helper to check server format support
1550- egl/wayland: Use helper to look up visual
1551- egl/wayland: Eliminate double loop for configs
1552- egl/wayland: Simplify alternate-format fallback for configs
1553- egl/wayland: Remove WL_SHM_* format listings
1554- egl/wayland: Use pipe_format for format names
1555- egl/wayland: Remove shift/size masks
1556- egl: Fail display creation if no EGLConfigs created
1557- egl/wayland: Remove check for EGLConfig presence
1558- egl/gbm: Remove check for EGLConfig presence
1559- egl/x11: Remove check for EGLConfig presence
1560- egl/android: Remove check for EGLConfig presence
1561- egl/{surfaceless,device}: Remove check for EGLConfig presence
1562- egl: Automatically set EGLConfig ID
1563- egl: Use pipe_format for pbuffer configs
1564- gbm/dri: Query DRIImage for FourCC directly
1565- gbm: Remove hardcoded color-channel data
1566- egl/android: Remove hard-coded color-channel data
1567- egl/x11: Compare config shifts/sizes locally
1568- egl: Remove shifts/sizes from dri2_add_config argument
1569- st/dri: Use pipe_format from config directly
1570- egl/dri: Use pipe_format instead of DRI_IMAGE_FORMAT
1571- egl/wayland: Remove EGL_WL_create_wayland_buffer_from_image
1572
1573Danylo Piliaiev (37):
1574
1575- freedreno/replay: Delete all buffers after each submission
1576- freedreno/replay: Correctly free iova on msm backend
1577- freedreno/replay: Add WSL backend for Windows
1578- ir3: Fix "print" meta instruction synchronization
1579- ir3: Add fullsync and fullnop ir3 dbg options for over-syncing
1580- freedreno/replay: Make meta "print" instruction take any number of regs
1581- tu: Do not print anything on systems without Adreno GPU
1582- tu/a7xx: Make A7XX_RB_UNKNOWN_8E06 value configurable per-gen
1583- tu: Define and set to zero all SP_*_VGPR_CONFIG regs
1584- ir3: Add ldg.k instruction
1585- tu/a7xx: Correctly set A7XX_HLSQ_UNKNOWN_A9AE.SYSVAL_REGS_COUNT
1586- tu/a7xx: Do not preload shaders, HW does it by default
1587- tu: Use SS6_INDIRECT consts upload path for 3d blits
1588- turnip,ir3/a750: Implement consts loading via preamble
1589- tu: Use SS6_INDIRECT for VS params
1590- turnip,ir3/a750: Implement inline uniforms via ldg.k
1591- tu/a750: Consider vertex attr buff in gmem allocation
1592- freedreno,tu: Move varying interp and varying repl modes to xml
1593- freedreno/devices: Update magic regs for a7xx
1594- tu: Exclude more a7xx regs from stomping
1595- tu: Add workaround for D3D11 games accessing UBO out of bounds
1596- tu/a7xx: Write even more magic regs to fix rendering issues on Android
1597- tu: Do not emit zero-sized fs params
1598- freedreno/a7xx: Fix base_align for non-UBWC depth-stencil
1599- tu/autotuner: Use CP_EVENT_WRITE7 for submission fence
1600- tu: Update prim restart state when we switch from/to indexed draw
1601- tu: Fix dynamic state not always being emitted
1602- meson: Correctly get sizeof_pointer with cross-compilers
1603- freedreno/devices: Do not write to 8E79 on a750, KGSL has it protected
1604- freedreno/replay: Use real queueid for submissions and waits
1605- freedreno,tu/a7xx: Add PC_TESS_PARAM_SIZE and PC_TESS_FACTOR_SIZE
1606- tu: Update RP state depending on pipeline in first RP draw
1607- tu: Emit non-draw-state state at the first draw call
1608- freedreno/devices: Add A740v3 from Quest 3
1609- util/vma: Add function to get max continuous free size
1610- freedreno/replay: Allocated maximum available size for cs overriding
1611- ir3: Do not set clip/cull mask if no one writes clip/cull
1612
1613Dario Mylonopoulos (1):
1614
1615- llvmpipe: fixed race condition in lp_rast_destroy that causes a crash on windows
1616
1617Dave Airlie (46):
1618
1619- vulkan/video: drop unused function.
1620- vulkan/video: rename some of the parameter tracking structs.
1621- vulkan/video: start to wrap the video structs for deep copies.
1622- vulkan/video: start deep copying the parameters structures
1623- vulkan/video: constify the encoding apis.
1624- radv/video: refactor sq start/end code to avoid decode hangs.
1625- radv: don't submit empty command buffers on encoder ring.
1626- gallivm: fix coroutines with llvm 18
1627- gallivm: passing fp16_split_fp64 to fp16 lowering.
1628- nvk: allow 3d compressed textures
1629- nvk: mem cannot be null in binding buffers/images.
1630- zink: use sparse residency for buffers.
1631- vulkan: update registry/includes to 1.3.277
1632- vulkan/video: add AV1 decode support to common code
1633- radv: fix correct padding on uvd
1634- radv: init decoder ip block earlier.
1635- radv/uvd: uvd kernel checks for full dpb allocation.
1636- radv: don't submit 0 length on UVD either.
1637- egl: don't bind zink under dri2/3
1638- glx/dri3: handle zink fallback if loader picks it.
1639- loader: handle picking zink for nouveau for certain GPUs.
1640- nouveau/winsys: fix bda heap leak.
1641- nvk: fix dri options leak.
1642- egl/dri2: if zink is preferred from dri3 skip dri2 paths.
1643- radv/video: fix filling out decode operations.
1644- radv/video: use vcn ip version in more places.
1645- radv: rename it_ptr to it_probs_ptr in advance of adding av1
1646- radv/video: use proper struct sizes for decoder structs.
1647- radv/video: add VK_KHR_video_decode_av1 support.
1648- nvk: free leaked cmd_buffer descriptors state.
1649- nvk: only unmap heap bos that were mapped
1650- nvk: enable a mappable bar heap when rebar is disabled.
1651- radv/video: fix h265 decode with unaligned w/h
1652- mesa: reorder st context teardown
1653- vulkan/video: copy the profile over for h264 encode.
1654- radv/video: export unified queue header/tail functions.
1655- radv: add direct cs emit for a dword.
1656- radv: add encoder queue support pieces and encoder queries.
1657- radv/video: add parameter patching calls.
1658- radv/video: add initial support for encoding with h264.
1659- radv/video: add h265 encode support
1660- radv/video: enable video encoding behind perftest flag
1661- radv/video: handle encode control parameters better.
1662- radv/video: don't advertise timestamp bits for decode/encode
1663- egl/dri2: don't bind dri2 for zink
1664- radv/video/encode: fix quality params on v2 hw.
1665
1666David (Ming Qiang) Wu (1):
1667
1668- frontends/va: make vlVaSyncSurface blocking
1669
1670David Heidelberg (57):
1671
1672- ci/deqp: uprev deqp-runner for Linux too to 0.18.0
1673- ci/lima: update expectations, failing tests are being skipped
1674- ci: bump kernel to 6.6.12, modularize i915, add Transparent Huge Pages
1675- ci: shorter kernel tag, included Vivante NPU patches
1676- ci: disable Valve farm in Keywords
1677- ci: bump libdrm to 2.4.120
1678- ci/VK-GL-CTS: add patches to fix dEQP-VK.glsl.derivate crashes
1679- ci: Valve farm (Keywords location) works again
1680- meson: upgrade zlib wrap to 1.3.1
1681- util: use crc32_z instead of crc32 and bump zlib dep to 1.2.9
1682- ci: bump kernel to 6.6.16 + enable X2APIC
1683- ci/freedreno: add fail found by new Piglit
1684- ci/etnaviv: update expectations
1685- ci: temporarily disable Collabora farm
1686- ci: enable Collabora farm
1687- ci: re-enable Collabora farm after maintenance
1688- ci/intel: decompose anv-tgl-test so we can specify custom devices for TGL
1689- ci/intel: add acer-cp514-2h-11{30,60}g7-volteer
1690- ci/intel: move machine definition to the intel-tgl-skqp job
1691- ci/intel: split asus-cx9400-volteer into acer-cp514-2h-11{30,60}g7-volteer
1692- drm-shim: Avoid invalid file and time bits combination
1693- intel/tools: avoid invalid time and file bits combination
1694- ci/deqp: backport Implement support for the EGL_EXT_config_select_group extension GL-CTS patch
1695- ci/freedreno: update expectations comment
1696- ci/deqp: add EGL patch for correct suite (GLES, not GL)
1697- nine: convert licenses block to SPDX
1698- nine: fill missing licenses headers and copyrights
1699- nine: drop useless and a bit too long line
1700- ci: uprev kernel to 6.6.21
1701- ci/freedreno: disable workarounds for Adreno 618, 630, and 660
1702- ci/freedreno: mark fails resolved by "drm/msm/gem: Add metadata uapi"
1703- ci: reduce irrelevant output to a simple list of libraries
1704- util: move gen_zipped_file into generic util and rename to gen_zipped_xml_file
1705- ci/r300: implement rules for d3d9 testing
1706- ci/svga: add missed test and gl-rules include
1707- r300: convert to SPDX license block and fix small typos
1708- r300: add missing licence to the r300_public.h
1709- r300: add missing copyright header
1710- docs: we support EGL 1.5 for a long time
1711- ci/amd: meld radv-traces into radv-raven-traces
1712- ci/amd: drop old PIGLIT_REPLAY_DESCRIPTION_FILE surpassed by PIGLIT_TRACES_FILE
1713- frontend/nine: fix typos
1714- r600: update licensing to SPDX header
1715- r600: add license header to r600_formats.h
1716- r600: add license info to the r600_opcodes.h
1717- r600: add license information to the sfn_shader_gs.h
1718- r600: fix typos
1719- ci: disable sona devices, all devices are offline
1720- ci/intel: sona device_type is back online
1721- ci: temporarily disable Android test builds
1722- ci: disable Igalia farm
1723- meson: implement split-debug
1724- freedreno/ci: move the disabled jobs from include to the main file
1725- ci/deqp: correct EGL_EXT_config_select_group detection
1726- egl/x11: Move RGBA visuals in the second config selection group
1727- winsys/i915: depends on intel_wa.h
1728- subprojects: uprev perfetto to v45.0
1729
1730David Rosca (31):
1731
1732- radeonsi/vcn: Fix H264 slice header when encoding I frames
1733- frontends/va: Fix updating AV1 rate control parameters
1734- radeonsi/vcn: Don't reinitialize encode session on bitrate/fps change
1735- frontends/va: Only set VP9 segmentation fields when segmentation is enabled
1736- frontends/va: Separate QP for I/P/B frames
1737- radeonsi/vcn: Use temporal_layer_index to select temporal layer
1738- radeonsi/vcn: Implement separate QP for I/P/B frames
1739- radv/video: Set maxActiveReferencePictures to 16 for H264/5
1740- frontends/vdpau: Fix cdef strengths and lr_unit_shift in AV1 decode
1741- frontends/vdpau: Support creating VDP_CHROMA_TYPE_420_16 surfaces
1742- radv/video: Fix setting slice QP
1743- radv/video: Set correct bitstream buffer size
1744- radv/video: Set VBV buffer size and level
1745- radv/video: Select temporal layer when encoding each frame
1746- radv/video: Set maxSublayerCount to 4 for H265
1747- radv/video: Avoid resetting rate control every frame
1748- radv/video: Implement per picture type min/max QP
1749- radv/video: Set correct bit depth and format for 10bit input
1750- radv/video: Check encode profiles and bit depth in capabilities query
1751- radv/video: Report maxBitrate in encode capabilities
1752- radeonsi/vcn: Allocate session buffer in VRAM
1753- radeonsi/vcn: Fix 10bit HEVC VPS general_profile_compatibility_flags
1754- radeonsi/vcn: Only enable VBAQ with rate control mode
1755- frontends/va: Fix AV1 slice_data_offset with multiple slice data buffers
1756- Revert "radeonsi/vcn: AV1 skip the redundant bs resize"
1757- frontends/va: Only increment slice offset after first slice parameters
1758- radeonsi: Update buffer for other planes in si_alloc_resource
1759- frontends/va: Store slice types for H264 decode
1760- radeonsi/vcn: Ensure DPB has as many buffers as references
1761- radeonsi/vcn: Allow duplicate buffers in DPB
1762- radeonsi/vcn: Ensure at least one reference for H264 P/B frames
1763
1764David Stern (1):
1765
1766- vulkan/wsi/x11: Explicitly discard errors from xcb_present_pixmap.
1767
1768David Tobolik (1):
1769
1770- rusticl: implement cl_khr_suggested_local_work_size
1771
1772Derek Foreman (10):
1773
1774- egl/wayland: Fix possible buffer leak
1775- loader/wayland: Add named queue fallback
1776- egl/wayland: Give names to our Wayland event queues
1777- vulkan/wsi/wayland: Give names to our Wayland event queues
1778- vulkan/wsi/wayland: Remove confusing comment
1779- vulkan/wsi/wayland: Adjust presentation id locking
1780- vulkan/wsi/wayland: Use wl_display_dispatch_queue_timeout
1781- vulkan/wsi/wayland: More descriptive name for swapchain queue
1782- vulkan/wsi/wayland: Fix use after free
1783- vulkan/wsi/wayland: Remove unused get_min_image_count_for_mode_group
1784
1785Dmitry Baryshkov (11):
1786
1787- freedreno/drm: don't crash for unsupported devices
1788- freedreno/regs: define the wide bus enable bit in DSI_VID_CFG0
1789- freedreno/registers: fix generation dependencies
1790- freedreno/registers: add missing copyright imports
1791- freedreno/registers: inline mdp4_csc group
1792- freedreno/registers: fix WB doffsets array in mdp5.xml
1793- freedreno/registers: support processing display headers
1794- freedreno/registers: limit the rules schema
1795- freedreno/registers: drop unsupported features from schema
1796- freedreno/rnn: drop headergen2
1797- freedreno/rnn: drop custom aprintf function
1798
1799Dmitry Osipenko (2):
1800
1801- virtio/vdrm: Fix lockup in vdrm_host_sync()
1802- iris: Use Mesa internal drm-uapi headers
1803
1804Dylan Baker (8):
1805
1806- intel/vulkan: assume() that we don't use "ISL_NUM_FORMATS"
1807- intel/hasvk: assume() we don't get ISL_NUM_FORMATS
1808- meson: drop intel-cl deprecation of 'false'
1809- meson: rework intel-rt option to be a feature
1810- meson: Allow building intel-clc for the host if it can be run
1811- intel/brw: track last successful pass and leave the loop early
1812- nvk: drop meson version check that is always true
1813- nouveau: require cbindgen >= 0.25
1814
1815Echo J (9):
1816
1817- nvk: Set ICD version to 1.3
1818- nvk: Implement the VR-related display extensions
1819- nak: Rip out a few dead_code statements
1820- nvk: Add NVK to the Vulkan device name
1821- nvk: Advertise VK_VALVE_mutable_descriptor_type
1822- nvk: Implement calibrated timestamps
1823- vulkan: Add implicit pipeline caching support
1824- nvk: Use implicit pipeline cache
1825- nvk: Don't advertise residencyAlignedMipSize on MaxwellB+
1826
1827Emma Anholt (2):
1828
1829- ci: Add full-run xfails missed in the 1.3.7.0 CTS update.
1830- ci: Disable VK full runs that time out since 1.3.7.0 (hasvk, anv-tgl, a630)
1831
1832Emmanuel Vadot (1):
1833
1834- util: Allow kcmp on FreeBSD
1835
1836Eric Engestrom (282):
1837
1838- VERSION: bump to 24.1
1839- docs: reset new_features.txt
1840- docs: update calendar for 24.0.0-rc1
1841- ci: make sure we evaluate the python-test rules first
1842- docs: fix syntax highlighting on non-code text snippet
1843- docs: fix syntax highlighting on shell commands
1844- ci/deqp: ensure that in \`default` builds, wayland + x11 + xcb are all built
1845- zink+anv/ci: add known failures
1846- ci: fix job dependency error in MRs for bin/ci/* scripts
1847- nouveau/ci: don't run nouveau (gl) tests on nvk changes
1848- amd/ci: simplify deqp config
1849- amd/ci: add flakes seen today
1850- docs: update calendar for 24.0.0-rc2
1851- zink+radv/ci: drop duplicates flakes lines
1852- CODEOWNERS: add myself as a person of contact for CI changes
1853- CODEOWNERS: remove myself as a person of contact for a few things
1854- radv/ci: sort navi21 flakes
1855- amd/ci: add flakes seen today
1856- amd/ci: consider much more of dEQP-VK.query_pool.statistics_query.host_query_reset.* to be flaky
1857- r300/ci: add flakes
1858- ci/deqp: backport fix for zlib.net not allowing tarball download anymore
1859- rpi3/ci: update piglit & deqp expectations
1860- rpi4/ci: skip more of the dEQP-VK.ssbo.phys.layout.* tests that timeout occasionally
1861- rpi3/ci: add flake seen today
1862- rpi4/ci: add timeouts seen today
1863- rpi5/ci: add flake seen today
1864- docs: add release notes for 23.3.4
1865- docs: update calendar for 23.3.4
1866- docs: add sha256sum for 23.3.4
1867- docs: update calendar for 24.0.0-rc3
1868- ci_run_n_monitor: drop always-true condition
1869- ci_run_n_monitor: allow passing multiple targets
1870- ci/deqp: fix default target check when target is not specified
1871- ci/deqp: simplify version log dump
1872- ci/deqp: avoid storing the huge list of vk tests on android builds
1873- ci/deqp: move editable part to the top of the file
1874- ci/deqp: split vk and gl builds
1875- ci/deqp: drop the implicit DEQP_TARGET; explicitly set \`default` in VK builds
1876- ci/deqp: only compile EGL tests in GL builds, not VK builds
1877- ci/deqp: only compile the test binaries that are relevant to the build
1878- ci/deqp: only keep the mustpass lists that are relevant to the build
1879- ci: bump the image tags to rebuild all the deqp variants
1880- Revert "bin/ci: Add GitLab basic token validation"
1881- Reapply "bin/ci: Add GitLab basic token validation"
1882- util: rename __check_suid() to __normal_user()
1883- tree-wide: use __normal_user() everywhere instead of writing the check manually
1884- zink+anv/ci: add a couple more flakes
1885- util: simplify logic in __normal_user()
1886- util: check for setgid() as well in __normal_user()
1887- ci: always skip dEQP-VK.info.device_extensions
1888- vk/util: fix 'beta' check for physical device features
1889- vk/util: fix 'beta' check for physical device properties
1890- ci: when specifying a driver remove all other ones
1891- docs: update calendar for 24.0.0
1892- docs: add release notes for 24.0.0
1893- docs: add sha256sum for 24.0.0
1894- docs/release-calendar: add planned 24.0.x bugfix releases
1895- docs: add release notes for 23.3.5
1896- docs: update calendar for 23.3.5
1897- docs: add sha256sum for 23.3.5
1898- v3d-rpi4-gl: reduce the parallelism from 10 to 8
1899- docs/calendar: add 24.1 branchpoint and release schedule
1900- ci: drop dash in image tags dates
1901- ci: enforce maximum image tag length
1902- ci: reduce maximum image tags length from 30 to 20
1903- ci: explain purpose of the word after the date in image tags
1904- panfrost: fix UB caused by shifting signed int too far
1905- ci_run_n_monitor: avoid spamming a ton of "new status: created" for all the jobs at the beginning
1906- ci: build panvk in debian-vulkan job
1907- nouveau/tests: fix null dereference
1908- ci: build nvk in debian-vulkan job
1909- v3dv/ci: test the WSI on rpi4 and rpi5
1910- radv: enable VK_EXT_headless_surface on all platforms except Windows
1911- v3dv: enable VK_EXT_headless_surface on all platforms except Windows
1912- tu: enable VK_EXT_headless_surface on all platforms except Windows
1913- anv: enable VK_EXT_headless_surface on all platforms except Windows
1914- hasvk: enable VK_EXT_headless_surface on all platforms except Windows
1915- dzn: enable VK_EXT_headless_surface on all platforms except Windows
1916- nvk: enable VK_EXT_headless_surface on all platforms except Windows
1917- panvk: enable VK_EXT_headless_surface on all platforms except Windows
1918- vn: enable VK_EXT_headless_surface on all platforms except Windows
1919- lvp: enable VK_EXT_headless_surface on all platforms except Windows
1920- pvr: enable VK_EXT_headless_surface on all platforms except Windows
1921- ci_run_n_monitor: warn user if they forgot to push the branch
1922- ci_run_n_monitor: add some types for gitlab objects
1923- ci_run_n_monitor: update job when it goes through enable_job()
1924- ci_run_n_monitor: add method to get a pipeline job by its id
1925- ci_run_n_monitor: track new job when retrying a job
1926- ci_run_n_monitor: refresh job state when starting it
1927- gitlab_gql: print error returned by server in --print-merged-yaml
1928- ci_run_n_monitor: implicitly include \`parallel:` jobs
1929- ci_run_n_monitor: print the target regex before adding the X/N bit
1930- docs: add release notes for 24.0.1
1931- docs: add sha256sum for 24.0.1
1932- docs: add release notes for 23.3.6
1933- docs: update calendar for 23.3.6
1934- docs: add sha256sum for 23.3.6
1935- docs: update calendar for 24.0.1
1936- ci_run_n_monitor: explain why/when there might be no tracked remote
1937- ci_run_n_monitor: allow detached heads as well
1938- docs: add release notes for 24.0.2
1939- docs: add sha256sum for 24.0.2
1940- docs: update calendar for 24.0.2
1941- ci_run_n_monitor: fix handling of optional jobs again
1942- ci_run_n_monitor: read job logs as utf-8
1943- vk/util: trivial cleanups in vk_icd_gen.py
1944- vk/util: print a nice error in vk_icd_gen.py when VK_HEADER_VERSION is not defined
1945- ci/android: use a specific version of android-cuttlefish
1946- ci: document which image tags to bump when touching build-mold.sh
1947- ci: uprev mold to the latest release
1948- ci/image-tags: move KERNEL_ROOTFS_TAG to group the test images together
1949- ci/deqp: only apply the android patches to the android build
1950- ci/deqp: build deqp-egl using mold as well
1951- ci/deqp: make deqp-egl for android less of a special case
1952- ci/deqp: control the GL release independently of VK
1953- ci/deqp: control the GLES release independently of GL
1954- r300/ci: group tex-miplevel-selection flakes together
1955- r300/ci: add another tex-miplevel-selection flake
1956- iris/ci: add pbuffer flakes for amly, same as apl and glk
1957- panfrost/ci: skip dEQP-GLES31.functional.copy_image.non_compressed.* on t760 as they hang
1958- rpi3/ci: update expectations for vc4-rpi3-gl-piglit-full:arm32 2/4
1959- freedreno/ci: add another a618 flake
1960- zink+anv: update expectations
1961- r300/ci: add flakes
1962- radeonsi/ci: add vangogh piglit flake
1963- zink+radv: update navi31 expectations (one test fixed)
1964- softpipe: update expectations
1965- ci/deqp: drop zlib url patch
1966- ci/deqp: split vk/gl/gles patches
1967- ci/deqp-runner: inline never-used DEQP_VARIANT variable
1968- ci/deqp: use the proper gl/gles releases for deqp-gl*, deqp-gles*, deqp-egl
1969- ci/venus-lavapipe: drop unused DEQP_VER that's being overwritten by DEQP_SUITE anyway
1970- ci/lavapipe: fold \`DEQP_VER: vk` and drop .deqp-test-vk
1971- docs: delay 24.1 branchpoint by 2 weeks
1972- vk/update-aliases: drop VK_ERROR\_ prefix substitution
1973- ci/deqp-runner: do a release build instead of debug
1974- ci/deqp-runner: set android rust target in the caller (debian/x86_64_test-android.sh)
1975- ci/deqp-runner: bring "install from crate" & "install from git" to feature parity
1976- ci/deqp-runner: update repo url
1977- ci/deqp-runner: fix list of image tags to update
1978- ci/image-tags: re-generate all the images building deqp-runner
1979- docs: add release notes for 24.0.3
1980- docs: add sha256sum for 24.0.3
1981- ci/deqp: document which build produces which binary
1982- ci: include all the src/\**/gitlab-ci.yml files
1983- nouveau: add missing vl lib
1984- nouveau/ci: fix yaml indentation
1985- nouveau/ci: only trigger jobs for relevant changes
1986- Revert "nouveau: add missing vl lib"
1987- ci/deqp: backport fix for dEQP-VK.wsi.direct_drm.* bug
1988- vc4/ci: add flake
1989- radeonsi/ci: update expected failures
1990- r300: mark new fails
1991- v3dv/ci: update expectations
1992- v3d/ci: mark spec\@ext_framebuffer_blit\@fbo-blit-check-limits as fixed
1993- vc4/ci: add another \`spec\@!opengl 1.1\@depthstencil-default_fb-drawpixels` flake
1994- vc4/ci: add another \`spec\@arb_vertex_buffer_object\@vbo-subdata-many draw` flake
1995- v3dv/ci: mark the \`dEQP-VK.wsi.*.maintenance1.deferred_alloc.*` flakes seen so far as happening on all platforms
1996- v3dv/ci: add other flakes seen during nightly run
1997- ci: fix shader-db job existence condition
1998- v3dv/ci: assume dEQP-VK.wsi.wayland.swapchain.simulate_oom.* have been fixed
1999- v3dv/ci: add more flakes
2000- v3dv/ci: assume list of dEQP-VK.wsi.*.maintenance1.present_modes.* flakes is the same between xcb & xlib and between rpi4 & rpi5
2001- ci: enable MESA_VK_ABORT_ON_DEVICE_LOSS globally
2002- ci/deqp-runner: split gl & gles groups to use the correct binary
2003- ci/deqp-runner: print deqp-gles version log as well
2004- ci: deduplicate converting the current job runtime into %M:%S
2005- ci: convert the job start date into a timestamp only once
2006- ci: simplify unnecessarily complex printf
2007- radv/ci: sort tahiti flakes
2008- radv/ci: add a bunch of flakes seen recently
2009- v3dv/ci: track regression
2010- rpi/ci: add flakes
2011- radv/ci: add more flakes
2012- v3dv/ci: add more flakes
2013- docs: update calendar for 24.0.3
2014- docs: update calendar for 24.0.4
2015- docs: add release notes for 24.0.4
2016- docs: add sha256sum for 24.0.4
2017- v3dv/ci: another batch of flakes
2018- radv/ci: another batch of flakes
2019- radv/ci: another batch of flakes
2020- radv/ci: dEQP-VK.spirv_assembly.type.vec4.i8.mod_geom Fail -> Crash on tahiti
2021- ci: don't run rustfmt on every core change
2022- ci_run_n_monitor: explain how to pass multiple targets without having to use regexes
2023- rpi/ci: another batch of flakes
2024- docs: mesa also implements gles 3.0+
2025- docs/egl: various wording improvements
2026- ci: take kws farm offline
2027- ci: restore kws farm
2028- radv/ci: simplify tahiti flakes list
2029- ci: fold .test-check into its only user, python-test
2030- ci: run python-test when editing the CI itself
2031- ci: run python-test automatically only in merge pipelines
2032- docs/macos: drop reference to former github mirror
2033- docs/nir: vec4 reference
2034- docs/envvars: fix reference
2035- docs/isl: fix references to ISL_AUX_USAGE_CCS_*
2036- docs/isl: stop trying to link to classic drivers code
2037- docs/isl: VK_FORMAT_xxx_PACKEDn is not a real format, don't try to link to it
2038- docs/isl: fix enum references
2039- docs: fix inline c identifier reference -> inline code
2040- isl: fix inline c identifier reference -> inline code
2041- nir: add missing stdint include
2042- docs/anv: fix envvar documentation
2043- docs/nvk: fix envvar documentation
2044- ci: mark vmware farm as offline
2045- ci: add missing rule to disable vmware farm
2046- ci: raise the log level threshold of spirv logs
2047- docs/envvars: document some vulkan loader env vars
2048- docs: replace references to the deprecated VK_ICD_FILENAMES with the new VK_DRIVER_FILES
2049- docs: replace references to the deprecated VK_INSTANCE_LAYERS with the new VK_LOADER_LAYERS_ENABLE
2050- docs/zink: format the envvar value as code instead of plain text
2051- meson: add VK_DRIVER_FILES to devenv, alongside the old VK_ICD_FILENAMES
2052- ci: drop unused VK_ICD_FILENAMES passthrough from jobs to tests
2053- ci: use the new VK_DRIVER_FILES env var
2054- ci/deqp: backport fix for dEQP-VK.pipeline.*.render_to_image.*.huge.*
2055- ci: fix nightly build
2056- ci: fix nightly build (v2)
2057- ci/llvmpipe: make sure manual jobs don't auto-retry
2058- ci/llvmpipe: fix out of date fails list
2059- ci/lavapipe: fix out of date fails list
2060- ci/lavapipe: skip test that sometimes times out
2061- ci: add nightly full run of llvmpipe
2062- ci: add nightly full run of lavapipe
2063- gallium/dri: reuse existing meson variables
2064- meson: regroup glvnd lines to get an easier-to-review diff in the next commit
2065- meson: turn \`glvnd` option into a feature
2066- ci: explicitly disable glvnd to avoid regression when making it auto
2067- meson: auto-enable glvnd when libglvnd is installed
2068- mr-label-maker: include \*/gitlab-ci-inc.yml in GitLab CI changes
2069- mr-label-maker: be explicit about the various CI files
2070- docs: add release notes for 24.0.5
2071- docs: update calendar for 24.0.5
2072- docs: add sha256sum for 24.0.5
2073- ci: delete mistaken duplicate llvmpipe-{fails,skips}.txt
2074- etnaviv: avoid re-defining prog_python
2075- egl: drop dead dri2_dpy param in dri2_wl_visual_idx_from_config()
2076- lavapipe: add 1 new failure and 1 new timeout since CTS uprev to 1.3.8.0
2077- vk/overlay-layer: drop unused imports
2078- vk/overlay-layer: fix None checks
2079- vk/overlay-layer: simplify print and make it more readable
2080- docs/rusticl: add an intro explaining what Rusticl is
2081- wsi/x11: drop unused param in x11_present_to_x11_sw()
2082- radv: initialize a couple of variables
2083- util: simplify loop logic in util_format_get_first_non_void_channel()
2084- util/futex: replace double-cast check with a simple sign check
2085- docs/ci: explain how gitlab considers "changes" when pushing on a fork branch
2086- rpi5/ci: sort flakes
2087- rpi5/ci: add flakes from last night's run
2088- rpi4/ci: sort flakes
2089- rpi4/ci: add new flakes from last night's run
2090- radeonsi/ci: update vangogh expectations after piglit uprev
2091- llvmpipe/ci: update expectations after piglit uprev
2092- VERSION: bump for 24.1.0-rc1
2093- .pick_status.json: Update to 4660ee1deaace6457bf5fbf3fc8810e4a2453cb5
2094- ci: fix container rules on release branches and tags
2095- .pick_status.json: Update to 84632dce93f44e8d88cda47648cfd4cc0958918f
2096- .pick_status.json: Update to 8248cc0bf45d0d7558cc3d77a63dcd078a96aa66
2097- ci: pass MESA_VK_ABORT_ON_DEVICE_LOSS through to the DUT
2098- .pick_status.json: Update to 86281ef15fca378ef48bcb072a762168e537820d
2099- .pick_status.json: Update to 47f6e24ad5dfcb59dd1511800aee8c56b4f8fee4
2100- meson: simplify \`-gsplit-dwarf` compiler argument check
2101- meson: move tsan-blacklist.txt to build-support with the other build support files
2102- VERSION: bump for 24.1.0-rc2
2103- .pick_status.json: Update to 603982ea802b3846e91a943b413a7baf430e875d
2104- .pick_status.json: Update to 569c2fcf952a3ec13ddf77c0058e769bf68f3aaf
2105- .pick_status.json: Update to 9666756f603f0285d8a93ef93db1c7ec702b671f
2106- .pick_status.json: Update to b8e79d2769b4a4aed7e2103cf0405acc5bdadb86
2107- VERSION: bump for 24.1.0-rc3
2108- .pick_status.json: Update to 18c53157318d6c8e572062f6bb768dfb621a55fd
2109- .pick_status.json: Update to 406dda70e7c9baa59c975eb64025e7c3b210c3bc
2110- .pick_status.json: Update to 5502ecd7716045e76f13f007a4aa5f5653c80ecd
2111- util/format: add missing null check in util_format_is_srgb()
2112- .pick_status.json: Update to d516721cd0cb16d0b601c42c01de0fdcc4ae887b
2113- .pick_status.json: Update to aa9244c8f6bfa3fb33cf233104b00fc44fc9459f
2114- .pick_status.json: Mark a45f1990860db3a8da6d7251bb627a314dfb8423 as denominated
2115- VERSION: bump for 24.1.0-rc4
2116- .pick_status.json: Update to b2282e3a571f18b48b8b717ec32da1d0ed93f1b5
2117- .pick_status.json: Update to 471ac97a4af751226bc51076130deae252bb481e
2118- .pick_status.json: Update to 2487a875527f636565a7b39036690fbf7c5d46db
2119- .pick_status.json: Update to 3584fc64828ad2ad4d486572ec915aab8321aadd
2120
2121Eric R. Smith (13):
2122
2123- panfrost: fix panfrost drm-shim
2124- panfrost: add lowering pass for multisampled images
2125- panfrost: support multi-sampled image load/store
2126- panfrost: protect alpha calculation from accessing non-existent component
2127- panfrost: make drm-shim work again for panfrost
2128- panfrost: make sure blends always have 4 components
2129- panfrost: mark indirect compute buffer as read
2130- gallium: handle copy_image of depth textures
2131- panfrost: fix polygon offset calculation for floating point Z
2132- panfrost: fix a GPU/CPU synchronization problem
2133- panfrost: mark separate_stencil as valid when surface is valid
2134- panfrost: fix an incorrect stencil clear optimization
2135- panfrost: add a barrier when launching xfb jobs in CSF
2136
2137Erico Nunes (2):
2138
2139- Revert "ci: lima farm is down"
2140- ci: enable shader-db on lima
2141
2142Erik Faye-Lund (32):
2143
2144- panfrost: add support for forcing sample-counts
2145- panfrost: pass reduced primitive type instead of points
2146- panfrost: add line_smooth shader-key and lowering
2147- panfrost: clean up active_prim update
2148- panfrost: implement line-smoothing
2149- mesa/main: add support for EXT_texture_storage
2150- mesa: fix error-handling for ETC2/RGTC textures
2151- glapi: move EXT_texture_storage to the right position
2152- targets/va: override LIBVA_DRIVERS_PATH in devenv
2153- mesa/main: fix _mesa_base_tex_format for BGRA
2154- mesa/main: mark GL_BGRA as color-renderable
2155- mesa/main: mark GL_BGRA8_EXT as color-renderable
2156- mesa/main: work around chrome/firefox bug
2157- mesa/main: allow GL_BGRA for FBOs
2158- panvk: do not handle illegal null
2159- glsl: Make error_value a real ir_rvalue type
2160- panfrost: give afbc-packing its own flag
2161- panfrost: add driconf infrastructure
2162- panfrost: add pan_force_afbc_packing driconf
2163- mesa: prefer read-format of RG for snorm
2164- gallium: remove always-false parameter
2165- panvk: use integers instead of strings
2166- panfrost: silence compiler warning
2167- panfrost: add tiler-heap driconfs
2168- panvk: wire up version-overriding
2169- panfrost: implement a driver-specific max-miplevel
2170- panfrost: use perf_debug_ctx instead of perf_debug
2171- panfrost: perf_debug_ctx -> perf_debug
2172- panfrost: use util_debug_message for perf_debug
2173- panfrost: do not deref potentially null pointer
2174- panfrost: correct first-tracking for signature
2175- panvk: avoid dereferencing a null-pointer
2176
2177Erik Kurzinger (2):
2178
2179- wsi/wayland: don't use explicit sync with sw
2180- wsi/x11: support explicit sync
2181
2182Faith Ekstrand (284):
2183
2184- nvk: Add an explicit mapping from shader stages to cbuf bindings
2185- nvk: Return an nvk_cbuf_map from nvk_lower_nir()
2186- nvk: Use s instead of set_idx in CmdBindDescriptorSets
2187- nvk: Rework descriptor set binding
2188- nvk: Make dynamic cbuf indices relative to the descriptor set
2189- nvk: Handle missing descriptor sets in nvk_nir_lower_descriptors
2190- nvk: Invalidate state after secondary command buffers
2191- nvk: Set a minimum of one patch control point
2192- nak: Disallow gl_FragData and set MRT correctly
2193- nak: Add explicit padding to nak_shader_info
2194- nvk: Emit SET_ANTI_ALIAS at draw time when no render targets are bound
2195- nvk: Move SET_HYBRID_ANTI_ALIAS_CONTROL to draw time
2196- nvk: Advertise variableMultisampleRate and EDS3RasterizationSamples
2197- nvk: Add a couple more features to features.txt
2198- nak: Stop passing --explicit-padding to bindgen
2199- nak: Implement nir_op_pack_half_2x16_rtz_split
2200- nak: Implement nir_op_ufind_msb_rev
2201- nak: Rename OpBrev to OpBRev
2202- nak: Implement nir_op_bfm
2203- nouveau/mme/fermi: Stop truncating iadd immediates
2204- nouveau/mme: Stop using isaspec
2205- nvk: Set framebufferIntegerColorSampleCounts
2206- nvk: Unref shaders on pipeline free
2207- nvk: Add a #define for max shared memory size
2208- nvk: Properly configure the min/max shared mem size
2209- nvk: Implement VK_KHR_zero_initialize_workgroup_memory
2210- nir,spirv: Add support for SPV_NV_shader_sm_builtins
2211- nak: Add support for SPV_NV_shader_sm_builtins
2212- nvk: Advertise VK_NV_shader_sm_builtins
2213- nvk/draw: Map cbuf slots to shaders, not cbuf_maps
2214- nak: Refactor shader upload math
2215- nvk: Wire up nir_opt_large_constants
2216- nak: Enable NIR fuse_ffmaN
2217- nak: Legalize OpBMsk
2218- nvk: Don't ignore ExternalImageFormatInfo
2219- nvk: Set maxInlineUniformTotalSize
2220- nak: Fix TCS output reads
2221- anv: Add helpers for getting the surface state from an image view
2222- anv: Advertise VK_EXT_attachment_feedback_loop_layout
2223- nak: Choose S2R vs CS2R based on sysval index
2224- nak: Add a source barrier intrinsic
2225- nak: Loop to ensure we get accurate shader clocks
2226- nvk: Stop requiring dedicated allocations
2227- nvk: Advertise Vulkan 1.3
2228- nvk: Do a second submit to check for errors in the sync case
2229- nvk: Whitespace fixes
2230- nvk: Disable all cbufs in nvk_queue_init_context_draw_state()
2231- nvk: Call lower_compute_system_values after zero_initialize_workgroup_memory
2232- nak/nir: Stop lowering load_local_invocation_index
2233- nil: Set the level offset to 0 in nil_image_for_level
2234- nvk: Fix whitespace in nvk_image.c
2235- nouveau/winsys: Re-order channel creation
2236- nouveau/winsys: Allow only allocating a subset of engines
2237- nvk/queue: Pull DRM-specific stuff into nvk_queue_drm.c
2238- nvk/queue: Refactor the push builder a bit
2239- nvk: Move the nouveau_ws_context to nvk_queue
2240- nvk: Add an array of queue families to nvk_physical_device
2241- nvk/queue: Rework context state init
2242- nvk/queue: Only initialize the necessary engines
2243- nvk: Use VM_BIND for contiguous heaps instead of copying
2244- nvk: Only map heaps that explicitly request maps
2245- nvk: Add an upload queue
2246- nvk: Add an upload queue to nvk_device
2247- nvk: Use the upload queue for shader uploads
2248- nvk: Don't set CONSTANT_BUFFER_SELECTOR with a zero size
2249- nvk/heap: Use nvk_heap_bo::addr instead of bo->offset
2250- nvk/heap: Rework over-allocation
2251- nvk: Convert shader addresses to offsets in nvk_shader.c
2252- vulkan: Update XML and headers to 1.3.278
2253- nvk: Use nouveau_ws_bo_new_mapped() for descriptors
2254- nouveau/winsys: Add a fixed_addr to nouveau_ws_bo_map
2255- nvk: Implement VK_EXT_map_memory_placed
2256- nvk: Invalidate the texture cache before MSAA resolves
2257- nvk: Don't use WAIT_AVAILABLE in nvk_upload_queue_sync
2258- drm-uapi: Sync nouveau_drm.h
2259- nouveau/winsys: Add a vram_used query
2260- nvk: Add a nouveau_ws_device to nvk_physical_device
2261- nvk: Add a hand-rolled nvk_memory_heap struct
2262- nvk: Use 3/4 of total system memory for the VRAM heap
2263- nvk: Add an available query to nvk_memory_heap
2264- nvk: implement EXT_memory_budget
2265- nouveau/winsys: Fetch the BAR size from the kernel
2266- nvk/heap: Upload shaders on the CPU when we have a map
2267- nvk: Upload shaders on the CPU when we have ReBAR
2268- nvk: Expose a host-visible VRAM type when we have ReBAR
2269- nvk: Only expose VK_KHR_present_id/wait when we have WSI
2270- nvk: Advertise VK_KHR_incremental_present
2271- nil: Add PIPE_FORMAT_R5G5B5A1_UNORM
2272- nak: Add writes_point_size to nak_shader_info
2273- nvk: Handle missing gl_PointSize in the last geometry stage
2274- nvk/copy: Handle VK_REMAINING_ARRAY_LAYERS
2275- vulkan/meta: Handle VK_REMAINING_ARRAY_LAYERS in blit and resolve
2276- nvk: Use VkPipelineCreateFlags2 flag names
2277- nvk: Advertise VK_KHR_maintenance5
2278- vulkan: Add a vk_get_subgroup_size() helper
2279- vulkan: Move the descriptor set limit to vk_limits.h
2280- vulkan: Add runtime code for VK_EXT_shader_object
2281- vulkan: Add a vk_render_pass_state_has_attachment_info() helper
2282- vulkan: Rework vk_render_pass_state::attachments
2283- vulkan: Add a new dynamic state for render pass attachments
2284- vulkan: Add a vk_pipeline base struct
2285- vulkan: Add push constant ranges to vk_pipeline_layout
2286- vulkan: Add a BLAKE3 hash to vk_descriptor_set_layout
2287- vulkan: Add generic graphics and compute VkPipeline implementations
2288- nvk: Populate vk_descriptor_set_layout::blake3
2289- nvk/shader: Refactor some helpers
2290- nvk: Move populate_fs_key to nvk_shader.c
2291- nvk: Pass an array of descriptor sets to nvk_lower_nir
2292- nvk: Move nir_lower_patch_vertices to nvk_lower_nir()
2293- nvk: Use vk_render_pass_state::attachments for write masks
2294- nvk: Switch to shader objects
2295- nvk: Advertise VK_KHR_graphics_pipeline_library
2296- nvk: Advertise VK_EXT_shader_object
2297- nak: Implement nir_op_iadd3 on SM70+
2298- nir: Add an imad opcode
2299- nak: Move NAK_FS_OUT_COLOR next to the enum
2300- nak: Add support for imad on Volta+ and enable it in simple cases
2301- nvk: Advertise a CTS version of 1.3.7.3
2302- nvk: Drop the non-conformant warning on Turing-Ada
2303- nvk: Don't print the NVK_I_WANT_A_BROKEN_VULKAN_DRIVER warning in release builds
2304- meson: Rename nouveau-experimental to nouveau and build by default on x86
2305- vulkan/pipeline: Whitespace fix
2306- vulkan/pipeline: Handle fully compiled library shaders properly
2307- nvk: Advertise VK_KHR_pipeline_library
2308- docs/nvk: Update the conformance status section
2309- docs/nvk: Update the NVK_DEBUG docs
2310- docs/nvk: Document NAK_DEBUG
2311- nil: Enable A8_UNORM for storage buffers
2312- vulkan/pipeline: Always init pipeline cache objects
2313- nak: Fix printing of OpIsberd
2314- nak/sm50: Fix encoding of immediates in OpFFma
2315- nak/sm50: Use OpBfe instead of OpBRev for nir_op_find_lsb
2316- nak: Support F2I for 8-bit integers on SM50
2317- nvk: Return os_page_size for minMemoryMapAlignment
2318- nouveau: Import g_nv_name_released.h from NVIDIA OGK
2319- nvk: Report official GPU names from NVIDIA when we have them
2320- nvk: Use row_stride_B instead of width for render and copies
2321- nil: Rework tiling calculations
2322- nil: Add a concept of width to tile sizes
2323- nil: Add a concept of sliced 3D image views
2324- nvk: Use "real" 3D image views
2325- nvk/queue: Add a push_bind helper
2326- nvk: Refactor opaque image binds
2327- nvk/queue: Add support for non-opaque sparse binds
2328- nak: Rename resident to fault
2329- nak: Plumb is_sparse through from NIR for texture ops
2330- nak/nir: Add sparse support to shrink_image_load()
2331- nak: Wire up sparse residency for texture ops
2332- nil: Fix a typo in a comment
2333- nvk: Document the register name for the helper load workaround
2334- nvk: Always wait for the FALCON in set_priv_reg
2335- nvk: Disable the Out Of Range Address exception
2336- nvk: Drop a bunch of dev->pdev and just use pdev
2337- nvk: Add and use more cmd_buffer_*_cls helpers
2338- nvk: Replace more dev->pdev with nvk_device_physical()
2339- nvk: Drop nvk_device::pdev
2340- zink: Remove interpolateAtSample() when not multi-sampling
2341- nil: Move Z slice offset calculations to a helper
2342- nvk: Add a nil_image helper variable in BeginRendering
2343- nvk: Manually offset array and Z slices in BeginRendering
2344- nil: Advertise support for PIPE_FORMAT_R5G6B5_UNORM
2345- nil: Whitespace fix
2346- nil: Add support for larger textures on Pascal+
2347- nil: Add a helper to view a MSAA image as samples
2348- nil: Expose nil_pix_extent_sa()
2349- nvk: Use HW generation names instead of chipsets
2350- nvk: Stop pretending to handle Intel image intrinsics
2351- nvk: Use different descriptor layouts for storage vs. sampled images
2352- nvk: Implement shaderStorageImageMultisample
2353- zink: Rework sparse texture lowering
2354- nvk: Ignore rasterizationSamples when handling sampleShadingEnable
2355- nvk: Always set SET_ATTRIBUTE_POINT_SIZE
2356- Revert "nvk: Enable VK_KHR_shader_subgroup_uniform_control_flow"
2357- nvk: Move the mutableDescriptorType enable
2358- nir: Take a nir_def in nir_goto_if()
2359- nir/print: Inline print_ssa_use()
2360- nir/builder: Correctly handle decl_reg or undef as the first instruction
2361- nir: Improve the comment for nir_block::imm_dom
2362- nir: Add a sort_unstructured_blocks() helper
2363- nir: Validate that unstructured blocks are in reverse PDFS order
2364- nir/lower_reg: Remove dead reg_decl intrinsics
2365- nir/lower_reg: Support unstructured control-flow
2366- nir/repair_ssa: Support unstructured control-flow
2367- nir/gather_types: Support unstructured control-flow
2368- nir: Mark divergent regs in phis_to_regs_block()
2369- nir: Add a lower_terminate_to_demote pass
2370- nak: Add a copy_fs_outputs_nv intrinsic
2371- nak: Move barrier removal into its own pass
2372- nak: Add a condition to bar_break_nv
2373- nak/nir: Add a control-flow lowering pass
2374- nak: Add more NIR wrappers for walking the NIR CFG
2375- nak: Add NIR helpers for jump instructions
2376- nak: Add helpers for emitting jumps
2377- nak: Handle unstructured NIR
2378- nak: Use the new lowering pass on SM70+
2379- nak: Remove the old barriers pass
2380- nak/nir: Use nir_lower_terminate_to_demote()
2381- nvk: Advertise VK_KHR_shader_maximal_reconvergence
2382- nvk: Advertise VK_KHR_shader_subgroup_uniform_control_flow
2383- nak/nir: Emit nir_intrinsic_ald_nv directly for system values
2384- nak/nir: Rename load_interpolated_input
2385- nak/nir: Add a load_fs_input helper for flat inputs
2386- nak/nir: Emit nir_intrinsic_ipa_nv directly for FS system values
2387- nak/nir: Use nir_io_semantics for varyings and attributes
2388- nak: Break lower_fs_inputs into its own file
2389- nak/nir: Clean up lower_fs_inputs a bit
2390- nak: Call nir_lower_io_to_temporaries for FS outputs
2391- nak/nir: Use nir_io_semantics for FS outputs
2392- nak: Drop lower_io_arrays_to_elements_no_indirects for FS outputs
2393- nak: Simplify over-all I/O lowering
2394- nak: Don't write undefined FS outputs
2395- nak: Plumb through LDC modes
2396- nak: Implement load_ubo with an indirect cbuf index
2397- nvk: Support VkBindMemoryStatusKHR
2398- nvk: Advertise VK_KHR_maintenance6
2399- nir: Delete the rest of the CF list when adding a halt
2400- nak: Don't do a scope break cascade for nir_jump_halt
2401- nil: Add a CSV version of the format table
2402- nil: Re-organize the format table
2403- nil: Switch to using the CSV generated table
2404- nil: Drop bogus color formats from non-renderable luminance/alpha formats
2405- nil: Remove 2-bit SNORM from the format table
2406- nil: Drop unneeded types from formats
2407- nvk: Use the page-aligned BO size for the descriptor pool
2408- nvk: Use a VMA heap for descriptor memory
2409- nvk: Use a linked list for descriptor sets in a pool
2410- nvk: Add a _pad field to nvk_cbuf
2411- nvk: Delete dead descriptor code
2412- nvk: Add a _pad field to nvk_fs_key
2413- nvk: Add a bunch of -Wpadded errors
2414- vulkan: Add a bunch of -Wpadded errors
2415- nouveau: Move .rustfmt.toml from NAK to src/nouveau/
2416- nouveau: Use hyphenated arguments to class_parser.py
2417- nouveau/headers: Add initial Rust bindings
2418- nouveau/headers: Add Rust bindings for texture headers
2419- ci: Add cbindgen to the build images
2420- nil: Move nil_tic_format to nil_format_table.h
2421- nil: Move to a single header file
2422- nvk: Stop using nvk_extent4d short names
2423- nil: Rename nil_tiling::gob_height_8 to gob_height_is_8
2424- nak/bitview: Add a SetField<f32> implementation
2425- nil: Delete unused USAGE bits
2426- nil: Make the Rust library the primary build target
2427- nil: Add Extent/Offset4D::new() helpers
2428- nil: Drop the nil_extent/offset4d() helpers
2429- nil: Take a format in el_to_B()
2430- nil: Enforce units via Rust types
2431- nil: cbindgen is required
2432- nvk: Improve the unsupported handle type error
2433- nvk: Restrict shaderFloat16 to Ampere+ for now
2434- nouveau/headers: Move the classes into a submodule for Rust
2435- nouveau/headers: Generate Rust for QMDs
2436- nak: Add helpers for filling QMDs
2437- nvk: QMDs are 64 dwords
2438- nvk: Use the NAK helpers to fill QMDs
2439- nouveau: Import the hwref headers from NVIDIA OGK
2440- nouveau/headers: Add the MMU headers to the Rust crate
2441- nil: Use the enums from the hwref headers for PTE kinds
2442- nil: add s8 pte kind
2443- nil: Be more specific about Maxwell in the format table
2444- nil: Advertise S8_UINT on MaxwellB+
2445- nvk: Hash ycbcr conversions in the descriptor set layout hash
2446- nvk: Re-emit sample locations when rasterization samples changes
2447- nvk/meta: Restore set_sizes[0]
2448- nvk/upload_queue: Only upload one line of data
2449- vulkan/wsi: Bind memory planes, not YCbCr planes.
2450- nvk: Improve the GetMemoryFdKHR error
2451- nouveau/winsys: Take a reference to BOs found in the cache
2452- nouveau/winsys: Make BO_LOCAL and BO_GART separate flags
2453- nvk: Allow GART for dma-bufs
2454- nil: Use the right PTE kind for Z32 pre-Turing
2455- nvk: Set color/Z compression based on nil_image::compressed
2456- nil: Default to NV_MMU_PTE_KIND_GENERIC_MEMORY on Turing+
2457- nvk: Allow VK_IMAGE_ASPECT_MEMORY_PLANE_0_BIT
2458- drm-uapi: Sync nouveau_drm.h
2459- nouveau/winsys: Add back nouveau_ws_bo_new_tiled()
2460- nvk: Support image creation with modifiers
2461- nvk: Set tile mode and PTE kind on dedicated dma-buf BOs
2462- nvk: Implement DRM format modifier queries
2463- nvk: Advertise VK_EXT_queue_family_foreign
2464- nvk: Advertise VK_EXT_image_drm_format_modifier
2465- nvk/wsi: Advertise modifier support
2466- zink: Set workarounds.can_do_invalid_linear_modifier for NVK
2467- nvk/meta: Save and restore set_dynamic_buffer_start
2468
2469Felix DeGrood (6):
2470
2471- driconf: Change vendorid on Palworld for Intel
2472- driconf: Fake vendorid for RDR2
2473- mesa-overlay: defer listening to socket until first frame
2474- driconf: add SotTR DX12 to Intel XeSS workaround
2475- iris: Increase target batch size to 128 KB
2476- intel/ds: add pipe control reasons to perfetto flushes
2477
2478Francisco Jerez (36):
2479
2480- intel/fs: Use full 32-bit sample masks when immediate.
2481- intel/eu/validate: SEND instructions don't have immediate encodings on Gen12+.
2482- intel/eu/gfx12.5+: Don't fail validation with ARF register restriction error for indirect addressing.
2483- intel/compiler/xe2: Add Xe2 bounds to FF() macro.
2484- intel/compiler/xe2: Implement codegen of general instruction controls.
2485- intel/compiler/xe2: Implement codegen of 2-source instruction operands.
2486- intel/compiler/xe2: Implement codegen of indirect immediates.
2487- intel/compiler/xe2: Implement codegen of three-source instructions.
2488- intel/compiler: Add assume() checks to brw_compact_inst_(set\_)bits().
2489- intel/compiler/xe2: Implement codegen of compact instructions.
2490- intel/compiler/xe2: Implement instruction compaction.
2491- intel/compiler/xe2: Fix for NibCtrl field removal.
2492- intel/compiler/xe2: Fix for the removal of most predication modes.
2493- intel/compiler/xe2: Add extra flag registers.
2494- intel/compiler/xe2: Fix for the removal of AccWrCtrl.
2495- intel/ir/xe2+: Add support for 32 SBID tokens to performance model.
2496- intel/fs/xe2+: Disable bank conflict mitigation pass for now.
2497- intel/eu/xe2+: Translate brw_reg fields in REG_SIZE units to physical 512b GRF units during codegen.
2498- intel/fs: Set the default execution group to 0 when not representable by the platform.
2499- intel/fs: Emit QUAD_SWIZZLE instructions with WE_all for derivative lowering.
2500- intel/fs/xe2+: Allow SIMD16 MULH instructions.
2501- intel/brw/xe2: Render target reads have been removed from the hardware.
2502- intel/brw/xe2+: Update encoding of FB write descriptor message control.
2503- intel/brw/xe2+: Update encoding of FB write extended descriptor.
2504- intel/brw/xe2+: Double allowed SIMD width of FB write SEND messages.
2505- intel/brw/xe2+: Allow FS stencil output in SIMD16 dispatch mode.
2506- intel/brw/xe2+: Allow dual-source blending in SIMD16 mode.
2507- intel/blorp/xe2+: Don't use replicated-data clears.
2508- intel/brw/gfx12: Setup PS thread payload registers required for ALU-based pixel interpolation.
2509- intel/brw/xe2+: Setup PS thread payload registers required for ALU-based pixel interpolation.
2510- iris/xe2+: Disable coherent framebuffer fetch.
2511- intel/brw/xehp+: Replace lsc_msg_desc_dest_len()/lsc_msg_desc_src0_len() with helpers to do the computation.
2512- intel/eu/xehp+: Don't initialize mlen and rlen descriptor fields from lsc_msg_desc*().
2513- intel/brw/xehp+: Drop redundant arguments of lsc_msg_desc*().
2514- intel/fs/gfx20+: Implement sub-dword integer regioning restrictions.
2515- intel/fs/gfx20+: Handle subdword integer regioning restrictions in copy propagation.
2516
2517Frank Binns (3):
2518
2519- CODEOWNERS: update Imagination maintainers
2520- pvr: fix up some includes
2521- pvr: split out device info into per GPU headers
2522
2523Friedrich Vock (27):
2524
2525- radv/rt: Add workaround to make leaves always active
2526- radv: Fix shader replay allocation condition
2527- nir: Make is_trivial_deref_cast public
2528- nir: Handle casts in nir_opt_copy_prop_vars
2529- radv/amdgpu: Fix build on BSD
2530- winsys/amdgpu: Fix build on BSD
2531- util: Provide a secure_getenv fallback for platforms without it
2532- vulkan: Use secure_getenv for trigger files
2533- aux/trace: Guard triggers behind __normal_user
2534- vtn: Use secure_getenv for shader dumping
2535- mesa/main: Use secure_getenv for shader dumping
2536- radv: Use secure_getenv in radv_builtin_cache_path
2537- radv: Use secure_getenv for RADV_THREAD_TRACE_TRIGGER
2538- util/disk_cache: Use secure_getenv to determine cache directories
2539- radv/rt: Write inactive node data in ALWAYS_ACTIVE workaround
2540- radv/rt: Optimize update shader VGPR usage
2541- radv,driconf: Enable active AS leaf workaround for Jedi Survivor
2542- radv/rt: Handle monolithic pipelines in capture/replay
2543- vulkan/runtime: Allow more than 8 DRM devices
2544- radv: Set SCRATCH_EN for RT pipelines based on dynamic stack size
2545- radv/rt: Fix frontface culling with emulated RT
2546- radv/rt: Force active leaves for every updateable accel struct
2547- radv,driconf: Remove active accel struct workaround
2548- radv: Only enable SEs that the device reports
2549- radeonsi: Only enable SEs that the device reports
2550- aco/tests: Insert p_logical_start/end in reduce_temp tests
2551- aco/spill: Insert p_start_linear_vgpr right after p_logical_end
2552
2553Ganesh Belgur Ramachandra (1):
2554
2555- compiler,glsl: fix warning when -finstrument-functions is used
2556
2557Georg Lehmann (60):
2558
2559- aco: reassign split vector to SOPC
2560- aco: stop scheduling at p_logical_end
2561- nir: add ballot_relaxed and as_uniform intrinsics
2562- aco: implement as_uniform and ballot_relaxed
2563- ac/llvm: implement as_uniform and ballot_relaxed
2564- nir: add lowering for boolean shuffle
2565- radv: lower boolean shuffle
2566- radeonsi: lower boolean shuffle
2567- aco: remove boolean shuffle isel
2568- aco: fix printing dpp8
2569- aco: validate v_permlane opsel correctly
2570- aco: support v_permlane64_b32
2571- aco/gfx11: use v_nop to resolve VcmpxPermlaneHazard
2572- aco/gfx11: resolve VcmpxPermlaneHazard for v_permlane64
2573- aco: implement rotate
2574- radv: enable VK_KHR_shader_subgroup_rotate
2575- radv: report rotate subgroup feature bits
2576- anv: report rotate subgroup feature bits
2577- aco/gfx11+: disable v_pk_fmac_f16_dpp
2578- aco: add packed fma dpp note to README-ISA
2579- aco: don't remove branches that skip v_writelane_b32
2580- aco/print_ir: don't use alloca for input modifiers
2581- aco: print neg prettier for packed math
2582- aco: don't print hi() for permlane opsel
2583- aco: print permlane16 bc/fi
2584- aco: print exec/vcc_lo/hi for single dword access
2585- aco/gfx11+: limit hard clauses to 32 instructions
2586- radv/gfx11+: add rtwave32 perftest option
2587- aco: use fmamk/ak instead of fma with inline constant for more VOPD
2588- nir: remove rotate scope
2589- nir/divergence_analysis: fix subgroup mask
2590- aco: create pseudo instructions with correct struct
2591- aco/post-ra: rename overwritten_subdword to allow additional uses
2592- aco/post-ra: assume scc is going to be overwritten by phis at end of blocks
2593- aco: store if pseudo instr needs scratch reg
2594- aco/post-ra: track pseudo scratch sgpr/scc clobber
2595- aco/ssa_elimination: check if pseudo scratch reg overwrites regs used for v_cmpx opt
2596- aco/builder: improve v_mul_imm for negative imm
2597- aco/builder: use 24bit mul if low bits of imm are zero
2598- aco/optimizer: combine v_mul_i32_i24 and add to mad
2599- aco: avoid full 32bit imul for uniform reduce/scan
2600- aco: don't combine mul+add_clamp to mad_clamp
2601- aco/ra: use SDWA for 16bit instructions when the second byte is blocked
2602- aco/vn: remove instruction hash templates
2603- aco: use v1 definition for v_interp_p1lv_f16
2604- aco/assembler: add vintrp high_16bit support
2605- aco: swap opsel and wait_exp for vinterp
2606- aco: support high_16bits FS IO
2607- aco/tests: add assembler tests for interp high_16bits
2608- aco/gfx9: all non legacy opsel instructions only write 16bits
2609- aco: use v_interp_p2_f16 opsel
2610- aco: add ra test for hi v_interp_p2_f16
2611- radv: sink alu
2612- radv: move alu
2613- nir: don't try to optimize exclusive min/max scan to inclusive
2614- nir: rename to nir_opt_16bit_tex_image
2615- ac/nir: add ac_nir_opt_pack_half
2616- radv: use ac_nir_opt_pack_half
2617- radv, radeonsi: don't use D16 for f2f16_rtz
2618- zink: use bitcasts instead of pack/unpack double opcodes
2619
2620George Ouzounoudis (1):
2621
2622- vulkan: Fix null pointer dereferencing on sample locations state
2623
2624Gert Wollny (76):
2625
2626- r600: lower dround_even also on hardware that supports fp64
2627- virgl: Use better reporting for mirror_clamp features
2628- ci: Uprev virglrenderer
2629- zink: Factor out create buffer from resource_object_create
2630- zink: shorten lifetime of success variable in resource_object_create
2631- zink: Factor out create sampler conversion in resource_object_create
2632- zink: factor out get_format_feature_flags in resource_object_create
2633- zink: factor out get_image_memory_requirement in resource_object_create
2634- zink: reduce number of #ifdefs in resource_create_object
2635- zink: extract get_export_flags from resource_object_create
2636- zink: extract function allocate_bo from resource_create_object
2637- zink: redesign the allocation try loop to test all heaps
2638- zink: extract function create_image from resource_object_create
2639- zink: extract function update_alloc_info from resource_object_create
2640- zink: extract update_obj_info from resource_object_create
2641- zink: extract debug_resource_mem from resource_object_create
2642- zink: drop duplicate assignment to obj->alignment
2643- zink: extract allocate_bo_and_update_obj from resource_object_create
2644- zink: Move more code to create_image and create_buffer
2645- zink: simplify call to get_export_flags
2646- zink: remove duplicate arguments and use VkMemoryRequirements locally
2647- zink: use enums as return values in resource_object_create
2648- radv: Fix compilation with gcc-13 and tsan enabled
2649- nir/lower_int64: Fix compilation with gcc-13 and tsan enabled
2650- nir/builder: Fix compilation with gcc-13 when tsan is enabled
2651- zink: Fix return type and values of create_buffer and create_images
2652- zink: extract check_unordered_exec from zink_get_cmdbuf
2653- zink: remove duplicate check and assignment in zink_resource_image_needs_barrier
2654- zink: extract emit_memory_barrier from zink_resource_image_barrier
2655- zink: extract emit_memory_barrier::for_buffer from zink_resource_buffer_barrier
2656- zink: extract update_unordered_access_and_get_cmdbuf
2657- zink/sync: remove duplicate assignments in UNSYNCHRONIZED case
2658- zink: move zink_resource_copies_reset out of exportable_lock
2659- zink: remove invalid scope in bo allocation loop
2660- r600: handle indirect access to kcache 14 and 15
2661- zink/nir_to_spirv: emit ViewportIndex cap also for inputs
2662- zink: use only ZINK_BIND_DESCRIPTOR
2663- zink: decrease aggressiveness of increasing descriptor data space adaptive
2664- zink/nir-to-spirv: Make sure sampleid for InterpolateAtSample is int
2665- nir-to-spirv: Cast SSBO input pointer when needed
2666- zink: set handle type also for user memory
2667- zink: acquire - maybe clear timeout after waiting for presentation fence
2668- nir_to_spirv: Allow LOD for external images
2669- zink: ctx->last_fence really wants to be a batch_state, so accommodate it
2670- zink: another fence that is better off as a batch state
2671- ntv: remove store_def_raw
2672- ntv: remove store_ssa_def
2673- ntv: pass def->index to store_def
2674- ntv: simplify increasing the number of dest components for sparse tex
2675- zink/ntv: introduce structure using the source params
2676- zink/ntv: extract get_tex_srcs
2677- zink/ntv: use new struct to pass texture parameters
2678- zink/ntv: extract find_sampler_and_texture_index
2679- zink/ntv: simplify evaluation of sampled_type
2680- zink/ntv: extract get_texture_load
2681- zink/ntv: extract get_texop_dest_type
2682- zink/ntv: Extract move_tex:proj_into_coord
2683- zink/ntv: replace if-chain with switch in emit_tex
2684- zink/ntv: extract picking the image to load from
2685- zink/ntv: extract emit_tex_readop as function that reads texture pixel data
2686- zink/ntv: pull result out of cases and use a common store_def
2687- zink: if AcquireNextImageKHR fails with VK_NOT_READY or VK_TIMEOUT retry
2688- meson: Add blacklist when compiling with tsan
2689- futex: disable futexes when compiling with tsan
2690- util/u_queue: read fence->signalled locked with TSAN
2691- tsan-blacklist: ignore race when reading lp_fence signalled status
2692- llvmpipe: Don't emit certain debug code when TSAN is enabled
2693- tsan-blacklist: Ignore race in get_max_abs_timeout_ns
2694- tsan-blacklist: suppress two race conditions in TC
2695- r600/sfn: Add array element parent also to array
2696- r600/sfn: Use dependencies to order barriers and LDS/RAT instructions
2697- r600/sfn: call nir_lower_doubles explicitly
2698- r600/sfn: when emitting fp64 op2 groups pre-load values
2699- r600/sfn: Don't put b2f64 conversion into ALU group
2700- zink/kopper: Wait for last QueuePresentKHR to finish before acquiring for readback
2701- mesa/st: don't use base shader serialization when uniforms are not packed
2702
2703Guilherme Gallo (25):
2704
2705- ci/lava: Turn the r8152 issue check into a counter
2706- ci/lava: Detect r8152 issue during boot phase
2707- ci/lava: Detect hard resets during test phase
2708- bin/ci: Don't submit jobs on integration test
2709- ci/lava: Ignore DUT feedback messages
2710- ci/lava: Fix the integration test
2711- bin/ci: Propagate the token to GitlabGQL
2712- bin/ci: Move get_token_from_default_dir to common
2713- bin/ci: Refactor read_token function
2714- bin/ci: Add GitLab basic token validation
2715- ci/lava: Broader R8152 error handling
2716- radv+zink/ci: Update xfiles based on nightly run
2717- radv/ci: Update xfiles based on nightly run
2718- v3d/ci: Update xfiles based on nightly run
2719- freedreno/ci: Update xfiles based on nightly run
2720- etnaviv/ci: Update xfiles based on nightly run
2721- r300/ci: Update xfiles based on nightly run
2722- ci/a618: Rebalance a618-limozeen jobs
2723- ci/a618: Add zink-tu-a618-full
2724- ci/lava: A few formatting cleanups
2725- ci/lava: Introduce unretriable exception handling
2726- ci/lava: Don't run jobs if the remaining execution time is too short
2727- ci/lava: Fix how exception entry in structured log
2728- ci: Add S3 id_token for all jobs
2729- ci: Use id_tokens for JWT auth
2730
2731Gurchetan Singh (6):
2732
2733- mesa/util: Check __ANDROID__ when detecting Android
2734- mesa/util: add <linux/fcntl.h>
2735- mesa/util: use DETECT_OS_ANDROID in anon_file.c
2736- mesa/vulkan: use a simpler path for header in trampoline gen
2737- mesa/vulkan: use DETECT_OS_ANDROID
2738- vk_image.c: #ifndef _WIN32 --> DETECT_OS_LINUX + DETECT_OS_BSD
2739
2740Haihao Xiang (1):
2741
2742- anv: Fix typo in transition_color_buffer
2743
2744Hannes Mann (2):
2745
2746- gallium/pipe: Add contiguous planes per-surface attribute
2747- frontends/va: Only export one handle for contiguous planes
2748
2749Hans-Kristian Arntzen (20):
2750
2751- wsi/x11: Remove unused vk_alpha in get_dri3_modifiers.
2752- wsi/x11: Compare modifiers before signalling SUBOPTIMAL.
2753- wsi/x11: Add drirc option to ignore SUBOPTIMAL.
2754- wsi/x11: Add workaround for Detroit: Become Human.
2755- wsi/x11: Rewrite implementation to always use threads.
2756- wsi/x11: Implement VK_EXT_swapchain_maintenance1.
2757- wsi/x11: Keep track of multiple presentation requests.
2758- wsi/x11: Make chain->status atomic.
2759- wsi/wl: Refactor out code to update current present ID.
2760- wsi/wl: Improve fallback for present_wait.
2761- wsi/common: Allow KHR_present_wait on WL.
2762- wsi/x11: Disable vk_xwayland_wait_ready by default on most drivers.
2763- wsi/x11: Rewrite logic for how we consider minImageCount.
2764- radv: export multiview in VS/TES/GS for depth-only rendering
2765- wsi/wl: Fix deadlock in dispatch_queue_timeout.
2766- wsi/wayland: Replace surface pilfer with retired bool.
2767- wsi/wayland: Init outstanding list earlier.
2768- wsi/x11: Return OUT_OF_DATE on sw resize.
2769- vulkan/runtime: Check correct callback list for binding report.
2770- radv: Store range rather than bo_size in VkBuffer/VkImage.
2771
2772Helen Koike (6):
2773
2774- ci/ci_run_n_monitor: move get_gitlab_pipeline_from_url() to gitlab_common
2775- ci/ci_gantt_chart: add tool to analyse pipeline execution time
2776- ci/ci_gantt_chart: add timeout vertical line
2777- ci/ci_gantt_chart: add option to save output to a file
2778- ci/ci_gantt_chart: show duration on hover
2779- ci/ci_post_gantt: add script that posts gantt to Marge's messages
2780
2781Hsieh, Mike (4):
2782
2783- amd/vpelib: add new tf enum and add flag for geometric scaling
2784- amd/vpelib: skip gamma remap and cs conversion when geometric scaling
2785- amd/vpelib: geometric scaling fix
2786- amd/vpelib: Add UID for 3d Lut and control logic
2787
2788Hyunjun Ko (3):
2789
2790- anv/video: fix out-of-bounds read
2791- anv/video: fix scan order for scaling lists on H265 decoding.
2792- anv/video: Fix to set correct offset and size for parsing h265 slice header.
2793
2794Iago Toral Quiroga (49):
2795
2796- broadcom/compiler: fix incorrect flags setup in non-uniform if path
2797- broadcom/compiler: fix incorrect flags update for subgroup elect
2798- broadcom/compiler: add new SFU instructions in V3D 7.x
2799- broadcom/compiler: don't move subgroup reduction instructions above setmsf
2800- broadcom/compiler: support subgroup ballot
2801- broadcom/compiler: support subgroup shuffle
2802- broadcom/compiler: support subgroup vote
2803- broadcom/compiler: support subgroup quad
2804- v3dv: expose more subgroup features on V3D 7.x
2805- broadcom/compiler: be more careful with unifa in non-uniform control flow
2806- broadcom/compiler: implement non-compute TSY barrier
2807- broadcom/compiler: support subgroup reduction operations from fragment shaders
2808- v3dv: allow subgroup operations in fragment shaders
2809- broadcom/compiler: fix lane selection for subgroups in fragment shaders
2810- v3d,v3dv: fix BO allocation for shared vars
2811- v3dv: fix subpass clear with draw call for multi-layered framebuffers
2812- v3dv: always set view index before drawing
2813- v3dv: fix copying v3dv_end_query_info into primaries from secondaries
2814- v3dv: refactor checking and adding pending jobs
2815- v3dv: add a helper to constrain clip window to render area
2816- v3dv: add helper to check if we need to use a draw for a depth/stencil clear
2817- v3dv: add helper to build a render pass for dynamic rendering
2818- v3dv: add a helper to setup a framebuffer for dynamic rendering
2819- v3dv: add a vk_render_pass_state to pipelines
2820- v3dv: don't assume that pipelines have a render pass
2821- v3dv: implement vkCmdBeginRendering and vkCmdEndRendering
2822- v3dv: implement dynamic rendering resume/suspend
2823- v3dv: rename SECONDARY job type to INCOMPLETE
2824- v3dv: fix resume address patching for secondary command buffers
2825- v3dv: handle render pass continue flag with dynamic passes
2826- v3dv: also emit subpass clears with secondary command buffers
2827- v3dv: enable VK_KHR_dynamic_rendering
2828- broadcom/ci: skips for tests that don't check for extension support correctly
2829- broadcom/ci: add new expected test failures
2830- broadcom/ci: add a test that fails only in CI
2831- broadcom/ci: add skips for unsupported features
2832- v3dv: fix image creation when exceeding maxResourceSize
2833- v3d: implement fix for GFXH-1602
2834- broadcom/compiler: fix workaround for GFXH-1602
2835- v3dv: require multisync kernel
2836- v3dv: drop single sync kernel interface
2837- v3dv: add a v3dv_job_clone helper
2838- v3dv: fix job pointers from cloned CLs
2839- v3dv: store the offset of the BRANCH instruction in a CL
2840- v3dv: fix job suspend with command buffer simultaneous use flag
2841- broadcom/compiler: enable perquad with uses_wide_subgroup_intrinsics
2842- v3d/simulator: size counter_values array correctly on V3D 7.x
2843- broadcom/ci: document external causes for some CTS 1.3.8 failures
2844- v3dv: fix VK_KHR_vertex_attribute_divisor
2845
2846Ian Romanick (54):
2847
2848- nir: Minor clean up in nir_alu_srcs_negative_equal
2849- intel/compiler: Disable DPAS instructions on MTL
2850- intel/compiler: Use u_foreach_bit64 in brw_get_compiler_config_value
2851- intel/compiler: Track lower_dpas flag in brw_get_compiler_config_value
2852- intel/compiler: Track mue_compaction and mue_header_packing flags in brw_get_compiler_config_value
2853- intel/fs: Fix shift counts for 8- and 16-bit types
2854- intel/rt: Don't directly generate umul_32x16
2855- intel/compiler/xe2: Update get_sampler_lowered_simd_width
2856- intel/fs: Move opcode modification before the switch that emits srcs
2857- intel/compiler/xe2: Use new sample_*_mlod messages
2858- nir: Pack texture LOD and array index to a single 32-bit value
2859- intel/compiler/xe2: Emit texture instructions w/ combined LOD and array index
2860- intel/compiler/xe2: Set SIMD mode for sampler messages
2861- nir: Add documentation for subgroup\_.._mask
2862- intel/fs: Delete stale comment in nir_intrinsic_ballot implementation
2863- nir: Mark nir_intrinsic_load_global_block_intel as divergent
2864- intel/fs: Enable nir_opt_uniform_atomics in all shader stages
2865- intel/fs: Use constant of same type to write flag
2866- intel/fs: Add fast path for ballot(true)
2867- nir: Initial framework for optimizing uniform subgroup operations
2868- intel/fs: Use nir_opt_uniform_subgroup
2869- nir: Optimize uniform iadd, fadd, and ixor reduction operations
2870- nir: Optimize uniform vote_all and vote_any
2871- i915: Fix value returned for PIPE_CAP_MAX_TEXTURE_CUBE_LEVELS
2872- intel/brw: Silence "statement may fall through" warning
2873- intel/brw: Correctly dump subnr for FIXED_GRF in INTEL_DEBUG=optimizer
2874- intel/compiler: Enforce 64-bit RepCtrl restriction in eu_validate
2875- intel/brw: Integer multiply w/ DW and W sources is not commutative
2876- intel/brw: Combine constants for src0 of integer multiply too
2877- intel/brw: Combine constants for src0 of POW instructions too
2878- intel/brw: Avoid a silly add with zero in assign_curb_setup
2879- intel/fs: Don't allow 0 stride on MOV destination
2880- intel/brw/xe2: Correctly disassemble RT write subtypes
2881- intel/brw: Fix handling of accumulator register numbers
2882- intel/brw: Allow SIMD16 F and HF type conversion moves
2883- intel/brw: Remove last vestiges of could_coissue
2884- intel/brw: Clear write_accumulator flag when changing the destination
2885- intel/brw: Use enums for DPAS source regioning
2886- nir: intel/brw: Change the order of sources for nir_dpas_intel
2887- intel/brw/xe2+: DPAS must be SIMD16 now
2888- intel/brw/xe2+: Use phys_nr and phys_subnr in DPAS encoding
2889- intel/brw/xe2: Update brw_nir_analyze_ubo_ranges to account for 512b physical registers
2890- intel/brw/xe2: Update uniform handling to account for 512b physical registers
2891- intel/compiler: Ensure load_barycentric_at_sample and load_interpolated_input remain together
2892- intel/brw: Don't call nir_opt_remove_phis before nir_convert_from_ssa
2893- intel/elk: Don't call nir_opt_remove_phis before nir_convert_from_ssa
2894- intel/brw: Delete stray nir_opt_dce
2895- intel/elk: Delete stray nir_opt_dce
2896- intel/brw/xe2+: Implement Wa 22016140776
2897- intel/brw/xe2+: Only apply Wa 22016140776 to math instructions
2898- intel/brw: Fix handling of cmat_signed_mask
2899- nir: intel/brw: Remove cmat_signed_mask from dpas_intel intrinsic
2900- intel/brw: Fix optimize_extract_to_float for i2f of unsigned extract
2901- intel/elk: Fix optimize_extract_to_float for i2f of unsigned extract
2902
2903Isaac Marovitz (1):
2904
2905- asahi: Add >16 Sampler Access for Ryujinx
2906
2907Iván Briano (10):
2908
2909- anv: flush query clears for all gens
2910- anv, hasvk: pMutableDescriptorTypeLists can be out of range on pool creation
2911- compiler/types: fix serialization of cooperative matrix
2912- intel/cmat: fix stride calculation in cmat load/store
2913- nir/algebraic: avoid double lowering of some fp64 operations
2914- nir/lower_doubles: preserve sign of zero if we are asked to
2915- nir/lower_doubles: preserve NaN when asked to do so
2916- anv, hasvk: check requirements for USAGE_INPUT_ATTACHMENT properly
2917- anv: check requirements for VK_IMAGE_USAGE_FRAGMENT_SHADING_RATE
2918- anv: fix casting to graphics_pipeline_base
2919
2920JCWasmx86 (1):
2921
2922- meson: Fix invalid kwarg name
2923
2924Jan Beich (1):
2925
2926- util: mimic KCMP_FILE via KERN_FILE on DragonFly and FreeBSD
2927
2928Jani Nikula (1):
2929
2930- docs: fix doc build 'intel/dev/intel_device_info_gen.h' file not found
2931
2932Javier Martinez Canillas (2):
2933
2934- clc: silence a warn_unused_result
2935- gallium: Add ssd130x to the list of kmsro drivers
2936
2937Jesse Natalie (115):
2938
2939- ci/windows: Update WARP to 1.0.9 NuGet
2940- mesa: Consider mesa format in addition to internal format for mip/cube completeness
2941- ci/windows: Rev Vulkan SDK and piglit
2942- d3d12: Set up spirv-as and fix expectations
2943- microsoft/compiler: Declare shader model 6.8 / validator 1.8
2944- microsoft/compiler: Handle comparison bias/gradient sampling
2945- dzn: Add a debug option to enable experimental shader models
2946- microsoft/compiler: Add feature flags for new comparison sampling ops
2947- dzn: Implement maintenance3 VariableDescriptorCountLayoutSupport
2948- dzn: Fix enhanced barrier layout for depth blits
2949- dzn: Handle VkBindImageMemorySwapchainInfoKHR
2950- dzn: Disable depth/stencil for partial binding from dynamic rendering
2951- spirv2dxil: Fix the spirv2dxil command line tool
2952- spirv2dxil: Handle aliasing/overlapping UBO/SSBO variables
2953- util: Detect arm64ec as aarch64 (and x86_64)
2954- glsl: Work around MSVC arm64 optimizer bug
2955- dzn: Don't set view instancing mask until after the PSO
2956- dzn: Fix path passed to CreateDeviceFactory
2957- d3d12: Fix path passed to CreateDeviceFactory
2958- microsoft/compiler: Use double pack/unpack instead of int for reduce ops on doubles
2959- dzn: Add a stencil blit fallback
2960- dzn: Add missing condition to immutable sampler init loop
2961- dzn: Add missing blit source barriers for enhanced barriers
2962- microsoft/compiler: Respect ACCESS_COHERENT in UAV variable data
2963- microsoft/compiler: Add a pass for promoting ACCESS_COHERENT on loads/stores
2964- spirv2dxil: Lower the Vulkan memory model and coherent loads/stores
2965- dzn: Add missing handling of VK_PIPELINE_STAGE_2_DRAW_INDIRECT_BIT
2966- dzn: Add barrier to copy source for DispatchIndirect copies
2967- dzn: Support non-static samplers for meta
2968- dzn: Add a debug flag for forcing off native view instancing
2969- dzn: Don't resolve for RESOLVE_MODE_NONE
2970- dzn: Use correct format for depth/stencil resolves
2971- dzn: Use blits for all non-averaging resolves
2972- microsoft/compiler: Only use simplified subgroup ID algorithm for compute
2973- d3d12: Subgroup ballot
2974- microsoft/compiler: Relax assert for SPIR-V barriers
2975- spirv2dxil: Remove dead branches early during shader compilation
2976- spirv2dxil: Trivial fixes for tessellation shaders
2977- dzn: Simultaneous-access is mutually exclusive with MSAA
2978- dzn: Fix tessellation shader insertion into PSO desc
2979- dzn: Add a driconf option to disable dzn for specific apps and use it for RDR2
2980- microsoft/compiler: For emulating scan, ensure all threads are active when reading cross-lane
2981- microsoft/compiler: Fix wave size control for SM6.6+
2982- microsoft/compiler: Fix wave size control for SM6.8+
2983- wgl: Support a single-buffered winsys framebuffer
2984- wgl: Flush frontbuffer when calling swapbuffers on single-buffered fb
2985- wgl: Add no-gdi-single-buffered and gdi-double-buffered PFDs
2986- wgl: Enable WGL_ARB_pixel_format_float
2987- wgl: Add HDR pixel formats
2988- winsys/d3d12: Support single-buffered mode
2989- d3d12: Support R16G16B16A16_FLOAT display targets
2990- microsoft/compiler: Fix SM6.6 non-bindless handle annotation for UAV counter
2991- dzn: Fix conditions for barrier in texture-converting copy case
2992- wgl: Check for stw_device->screen before trying to destroy it
2993- spirv2dxil: Set push constant register space to nonzero
2994- microsoft/compiler: Remove deref load/store/atomic ops that statically go out of array bounds
2995- microsoft/compiler: Remove code after discard/terminate in later optimization steps
2996- wgl: Initialize DEVMODE struct
2997- d3d12: Point sprite lowering pass needs to handle arrays
2998- nir_lower_tex_shadow: For old-style shadows, use vec4(result, 0, 0, 1)
2999- spirv2dxil: Support buffer_device_address
3000- dzn: Support bufferDeviceAddress
3001- wgl: Delete unused context param to swap
3002- wgl: Check for null before dereferencing ctx in swap
3003- nir_tests: Add /bigobj when compiling with MSVC
3004- dzn: Include vulkan_core.h instead of vulkan.h in the device enum header
3005- dzn: Initialize memoryTypeBits for querying properties on imported handles
3006- microsoft/compiler: domainLocation component index needs to be i8
3007- microsoft/compiler: Disable GS streams workaround for validator 1.8
3008- ci/windows: Update DirectX-Headers, Agility SDK, zlib, DXC, and WARP
3009- ci/debian: Update DirectX-Headers
3010- nir: Handle ptr_as_array for build_deref_follower
3011- microsoft/compiler: Don't store static-indexing handles that are dynamically emitted
3012- microsoft/clc: When possible, compute a part-constant "pointer" value for kernel inputs
3013- microsoft/compiler: Simplify code emitting CL globals
3014- clc: Move libclc helpers back to microsoft/clc
3015- microsoft/clc: Add linkage capability to libclc build to silence warning
3016- microsoft/clc: Adjust order of UAV binding assignment
3017- microsoft/clc: Install clon12compiler
3018- wgl: The default swap interval is supposed to be 1
3019- d3d12: Fix d3d12_lower_triangle_strip if multiple vars are in a single location
3020- microsoft/compiler: When sorting variables, put unused variables last
3021- microsoft/compiler: Move kill-unused/undefined varying pass from spirv to common
3022- microsoft/compiler: Simplify I/O component type enum handling
3023- microsoft/compiler: Expect front-facing var as an input
3024- microsoft/compiler: Improve linking helpers
3025- microsoft/compiler: Don't duplicate work from gather_info in var sorting
3026- d3d12: Move some lowering passes to pre-variant
3027- d3d12: Lower uniforms to UBO by nir options
3028- d3d12: Minor logging improvements
3029- d3d12: Fix var splitting pass writemasks
3030- d3d12: Explicitly add tess factor vars to tess signatures
3031- d3d12: Forward front-facing for passthrough GS
3032- d3d12: Capture always_active_io in varying data
3033- d3d12: Use TES inputs rather than VS outputs for TCS variant key
3034- d3d12: Add primitive ID sysval to input bitmask (for GS in)
3035- d3d12: Gather info less and before the final compilation steps
3036- d3d12: Remove variables instead of adding them for linking
3037- d3d12: Don't compile useless variants during shader creation
3038- microsoft/compiler: Add a fractional var mask for variable sorting
3039- d3d12: Set fractional var masks
3040- d3d12: Add a debug flag for loading WinPixGpuCapturer.dll
3041- ci/windows: Bump Agility SDK to 1.613.2 for ExecuteIndirect validation fix
3042- microsoft/compiler: Handle base vertex/instance sysvals as DXIL intrinsics
3043- spirv2dxil: Support passing first vertex / base instance to DXIL backend
3044- spirv2dxil: Output more specific metadata for whether draw sysvals are needed
3045- dzn: Delete dzn structs for indirect draw args and use D3D ones
3046- dzn: Query options21
3047- dzn: Understand whether first-vertex and base-instance are needed for a pipeline
3048- dzn: Update pipeline cache params to take all options into account
3049- dzn: Rework indirect drawing keys for shaders and command signatures
3050- dzn: Add a hash table of command signatures with non-default strides
3051- dzn: Don't copy app indirect args if we don't need to
3052- glsl: Use a stable attr sort for VS in / FS out
3053- d3d12: Include <shlobj.h> with lowercase name
3054
3055Job Noorman (43):
3056
3057- tu: support l1 dcache size on musl
3058- ir3: fix setting shared flag on parallel copy arguments
3059- ir3: optimize read_first.macro to a mov
3060- ir3: fix printing of brcst.active and quad_shuffle
3061- ir3: optimize subgroup operations using brcst.active
3062- ir3: set reconvergence for scan_clusters.macro
3063- ir3: add disassembly for flat.b
3064- ir3: update a0/a1 users when cloning instructions
3065- ir3: fix alignment of spill slots
3066- ir3: validate instruction block pointer
3067- ir3: add terminators to blocks
3068- ir3: fix instruction count before kill_sched
3069- ir3: print branch sources
3070- ir3: remove OPC_B and brtype from cat0
3071- ir3: remove comp1/2 from cat0
3072- ir3: allow liveness calculation for different register types
3073- ir3: allow finding SSA uses for a subset of uses
3074- ir3: implement RA for predicate registers
3075- ir3: validate no registers are invalid after RA
3076- ir3: integrate predicates into RA validation
3077- ir3: optimize bitwise ops that can directly write predicates
3078- ir3: insert predicate conversions after their source
3079- ir3: fold negations into cmps.ne zero
3080- nir: add search helper is_only_used_by_if
3081- ir3: fold and/or and negations into branches
3082- freedreno/ci: Update pixmark-piano-v2 hash
3083- ir3: fix freeing incorrect register in loops
3084- ir3: fix returning false instead of NULL
3085- freedreno/registers: fix installation of schema
3086- zink: print shaderdb info via debug message callback
3087- ir3: calculate SSA uses at the start of predicates RA
3088- ir3: fix finding uses of reloaded defs in predicates RA
3089- ir3-disasm: run clang-format
3090- ir3-disasm: remove unused #includes
3091- ir3-disasm: add options to specify GPU by chip ID or name
3092- ir3-disasm: add option to disassemble hex number
3093- freedreno,computerator: support initialization of buffers
3094- ir3: remove unnecessary tessellation epilogue
3095- ir3: model predt/predf without sources
3096- ir3: add support for precolored sources in predicate RA
3097- ir3: add support for predication
3098- freedreno/drm-shim: add a730, a740, and a750
3099- freedreno/drm-shim: remove duplicate entry for a630
3100
3101Jonathan Gray (4):
3102
3103- intel/dev: update DG2 device names
3104- intel/dev: update DG2 device names
3105- intel/dev: update DG2 device names
3106- intel/dev: 0x7d45 is mtl-u not mtl-h
3107
3108Jonathan Marek (1):
3109
3110- tu/a750: Basic a750 support
3111
3112Jordan Justen (26):
3113
3114- intel/dev/common: Add xe2 support to get_l3_list()
3115- intel/dev: Add ARL platform enums
3116- intel/dev: Add intel_device_info_is_mtl_or_arl()
3117- intel/l3: Define l3 config for ARL
3118- iris: Extend MTL modifiers to ARL devices
3119- intel/i915: ARL also supports the set-PAT uapi
3120- intel/dev: Define engine prefetch for ARL
3121- isl: Define MOCS for ARL
3122- isl: Handle ARL in isl_drm_modifier_get_score()
3123- intel/compiler: Lower DPAS instructions on ARL except ARL-H
3124- anv/drirc: Extend option to disable FCV optimization to ARL
3125- anv/query: Follow MTL code paths on ARL
3126- intel/dev: Add device info for ARL
3127- intel/compiler: Set branch shader required-width as 16 for xe2
3128- intel/compiler: Implement nir_intrinsic_load_topology_id_intel for xe2
3129- intel/compiler: Verify SIMD16 is used for xe2 BTD/RT dispatch
3130- intel/dev: Add 2 additional ADL-N PCI ids
3131- intel/compiler: Adjust fs_visitor::emit_cs_terminate() for Xe2
3132- intel/dev: Adjust device strings for ATS-M devices
3133- intel/dev: Add ATS-M PCI ID for Data Center GPU Flex 170G
3134- intel/compiler/fs: Restore SIMD32 restriction for ray_queries on Xe2
3135- intel/compiler: nib_ctrl no longer exists on Xe2+
3136- intel/dev/mesa_defs.json: Add LNL WA entries
3137- intel/dev: Add 0x56be and 0x56bf DG2 PCI IDs
3138- intel/dev: Change ATS-M 0x56c2 string from 170G to 170V
3139- intel/brw: Avoid getting a stride of 0 for nir_intrinsic_exclusive_scan
3140
3141Jose Maria Casanova Crespo (7):
3142
3143- ci: Adds /usr/local/bin to PATH at piglit-traces.sh
3144- v3d: Fix indentation at v3d_flush_jobs_writing_resource
3145- v3d: Only flush jobs that write texture from different job submission.
3146- v3d: Early return if job is not writing the resource
3147- v3d: Implement GL_ARB_texture_barrier
3148- broadcom/compiler: needs_quad_helper_invocation enable PER_QUAD TMU access
3149- ci: re-enable Igalia farm
3150
3151Joshua Ashton (34):
3152
3153- winsys/amdgpu: Hook up guilt to amdgpu_ctx_set_sw_reset_status
3154- winsys/amdgpu: Limit usage of query_reset_state2
3155- radv/amdgpu: Handle -ENODATA and -ETIME from cs_submit
3156- radv: Mark device loss if QueueSubmit failed immediately
3157- radv: Remove check_status
3158- radv/amdgpu: Remove ctx_query_reset_status
3159- radv: Add radv_get_tdr_timeout_for_ip helper
3160- radv: Ensure vkGetQueryPoolResults returns in finite-time
3161- android: Use system = 'android' in crossfile
3162- meson: Enable zink in gallium_drivers by default
3163- meson: Enable d3d12 in gallium_drivers by default on Windows
3164- anv: Enable EXT_swapchain_maintenance1
3165- v3dv: Enable EXT_swapchain_maintenance1
3166- lavapipe: Enable EXT_swapchain_maintenance1
3167- v3dv: Enable EXT_swapchain_colorspace
3168- lavapipe: Enable EXT_swapchain_colorspace
3169- wsi: Pass wsi_drm_image_params to wsi_configure_native_image
3170- wsi: Pass wsi_drm_image_params to wsi_configure_prime_image
3171- wsi: Add explicit_sync to wsi_image_info
3172- wsi: Add explicit_sync to wsi_drm_image_params
3173- build: Add linux-drm-syncobj-v1 wayland protocol
3174- wsi: Track if timeline semaphores are supported
3175- wsi: Add acquired member to wsi_image
3176- wsi: Track CPU side present ordering via a serial
3177- wsi: Get timeline semaphore exportable handle types
3178- wsi: Add common infrastructure for explicit sync
3179- ci: Bump wayland-protocols version to 1.34
3180- ci: Bump DEBIAN_BASE_TAG for now
3181- meson: Update wayland-protocols wrap to 1.34
3182- meson: Bump wayland-protocols requirement to 1.34
3183- wsi: Implement linux-drm-syncobj-v1
3184- tu: Expose VK_EXT_surface/swapchain_maintenance1
3185- radv: Enable KHR_video_queue if encode is enabled
3186- radv: Properly initialize imageCreateFlags in GetPhysicalDeviceVideoFormatPropertiesKHR
3187
3188José Expósito (2):
3189
3190- zink: add render-passes HUD query
3191- meson: Update proc_macro2 meson.build patch
3192
3193José Roberto de Souza (88):
3194
3195- intel/isl/xe2: Disable route of Sampler LD message to LSC
3196- anv: Fix PAT entry for userptr in integrated GPUs
3197- intel/genxml/xe2: Remove L3ALLOC
3198- intel/dev: Reduce usage of intel_device_info_compute_system_memory()
3199- intel: Make memory heaps consistent between KMDs
3200- anv: Fix calculation of syncs required in Xe KMD
3201- iris: Avoid read of uninitialized value in blorp_clear_stencil_as_rgba()
3202- iris: Fix return of iris_wait_syncobj()
3203- iris: Wait for drm_xe_exec_queue to be idle before destroying it
3204- intel/common: Add functions to handle async vm bind
3205- anv: Start to use intel_bind_timeline
3206- iris: Start to use intel_bind_timeline
3207- anv: Switch to truly asynchronous VM binding in Xe KMD
3208- iris: Switch to truly asynchronous VM binding in Xe KMD
3209- intel: Fix intel_get_mesh_urb_config()
3210- anv: Drop include to common/i915/intel_gem.h
3211- intel/common: Fix location of C++ support macro in intel_gem.h
3212- intel: Remove circular dependency between intel/dev and intel/common
3213- intel/common: Add intel_engines_supported_count()
3214- anv: Use intel_engines_supported_count()
3215- iris: Use intel_engines_supported_count()
3216- intel: Sync i915_drm.h
3217- intel/common: Implement i915_engines_is_guc_semaphore_functional()
3218- intel: Sync xe_drm.h
3219- intel/common: Implement xe_engines_is_guc_semaphore_functional()
3220- iris: Fix iris_batch_is_banned() check
3221- anv: Use DRM_XE_VM_BIND_OP_UNMAP_ALL to unbind whole bos
3222- docs/anv: Add recommended GuC firmware version
3223- iris: Set (EXEC_OBJECT_SUPPORTS_48B_ADDRESS | EXEC_OBJECT_PINNED) in a single place
3224- iris: Remove iris_bo::kflags
3225- iris: Move i915 set and get tiling uAPI calls to i915 specific code
3226- iris: Remove more i915_drm.h includes from common code
3227- intel: Move intel_define.h to i915/intel_define.h
3228- intel/common: Remove more i915_drm.h includes from common code
3229- intel/tools/error_decode: Add function to try to open error dump file
3230- intel/tools/error_decode: Simplify error message handling
3231- intel/tools/error_decode: Add support to search for Xe KMD error dumps
3232- intel/tools/error_decode: Detect and split error dump file parsing by KMD
3233- intel: Sync xe_drm.h
3234- anv/xe: Add VMs to error dump
3235- iris/xe: Add VMs to error dump
3236- intel/tools/error_decode: Move code that can be shared between i915 and Xe error decoders
3237- intel/tools/error_decode: Parse Xe KMD error dump file
3238- intel/tools: Fix compilation in 32 bits
3239- intel/nullhw: Fix 32bits compilation warnings
3240- iris: Add IRIS_HEAP_DEVICE_LOCAL_CPU_VISIBLE_SMALL_BAR heap type
3241- iris: Force lmem cpu accessible for bos with clear-color
3242- iris/xe: Consider pat_index while unbinding the bo
3243- anv: Call flush_pipeline_select_gpgpu() for compute engines in compute code paths
3244- anv: Skip cmd_buffer_emit_bt_pool_base_address() in blitter and video engines
3245- intel: Drop pre-production steppings
3246- anv: Fix Xe KMD userptr unbind
3247- intel/dev: Nuke 'ver == 10' check
3248- intel/dev: Nuke display_ver
3249- intel: Enable Xe KMD support by default
3250- iris: Set BO_ALLOC_NO_SUBALLOC when allocating bo for slab
3251- anv: Replace the 2 sparse booleans by 1 enum
3252- anv: Set VK_QUEUE_PROTECTED_BIT during queue families initialization
3253- anv: Set VM control to true in Xe KMD
3254- intel/tools/error_decode: Fix parsing in Xe decoder
3255- intel/tools/error_decode: Add function to print batch in Xe decoder
3256- intel/tools/error_decode: Parse HW context in Xe decoder
3257- iris: Move tiling_to_modifier() implementation to i915 folder
3258- iris: Remove i915_drm.h include from iris_indirect_gen.c
3259- intel/decoder: Fix binding table pointer entry being marked as invalid
3260- anv: Set STATE_COMPUTE_MODE mask bit when zeroing compute mode
3261- intel/genxml: Add more instdone registers
3262- intel/genxml/gfx125: Fix definition of INTERFACE_DESCRIPTOR_DATA::Thread group dispatch size
3263- intel/genxml/xe2: Update definition of INTERFACE_DESCRIPTOR_DATA
3264- anv: Create protected engine context when i915 supports vm control
3265- anv: Remove protected memory types from default_buffer_mem_types
3266- intel/tools/error2hangdump: Print out_filename when failed to open it
3267- intel/tools/error2hangdump: Replace drm_i915_gem_engine_class by intel_engine_class
3268- intel/tools: Move Xe KMD error decode functions to a separated file
3269- intel/tools: Move ascii85_decode_char() to error_decode_lib
3270- intel/tools: Move more Xe KMD error decode functions to error_decode_xe_lib
3271- intel/tools/error2hangdump: Move code that will be shared with Xe parser to error2hangdump_lib
3272- intel/tools/error2hangdump: Move i915 parser to a function
3273- intel/tools/error2hangdump: Add Xe KMD support
3274- anv: Add missing ANV_BO_ALLOC_INTERNAL
3275- iris: Add comments to BO_ALLOC flags
3276- iris: Avoid creation of slabs and cache buckets of lmem heaps in integrated gpus
3277- iris: Avoid allocation of not needed iris_bucket_cache
3278- intel/tools/aubinator_error_decode: Move definition of option_color to header
3279- intel/decoder: Add intel_print_group_custom_spacing()
3280- intel/tools: Parse INSTDONE registers in Xe KMD error dump
3281- intel: Sync xe_drm.h
3282- intel/dev: Read GFX IP version during runtime
3283
3284Juan A. Suarez Romero (32):
3285
3286- Revert "v3d: use kmsro to create drm screen on real hw"
3287- v3d: show warning on creating a v3d screen on real hw
3288- v3d/vc4/ci: reset the list of timeout tests
3289- Revert "v3d: show warning on creating a v3d screen on real hw"
3290- broadcom/simulator: protect simulator BO rallocs with mutexes
3291- v3d/ci: run OpenGL 3.1 tests
3292- v3dv/ci: increase timeout for full jobs in 30min
3293- ci: disable Igalia farm
3294- Revert "ci: disable Igalia farm"
3295- Revert "ci: disable Igalia farm"
3296- v3d/ci: update expected results
3297- v3d/ci: update expected list
3298- Revert "v3d/ci: update expected list"
3299- vc4/ci: update expected list
3300- v3d/ci: add new failures
3301- v3dv/ci: update expected list
3302- v3dv/ci: remove crashes from expected list
3303- v3d,v3d: use new simulator
3304- v3dv: disable Early Z for multisampled 16-bit depth buffers
3305- v3d: disable Early Z for multisampled 16-bit depth buffers
3306- broadcom/compiler: fix SFU check for 7.1
3307- v3dv: mark some promoted extensions as supported
3308- v3d: add load_fep_w_v3d intrinsic
3309- v3d: fix line coords with perspective projection
3310- compiler,gallium: move u_reduced_prim to common
3311- v3dv: assume that rasterization state can be NULL
3312- v3dv: enable smooth line rendering
3313- broadcom/ci: add new expected failures
3314- v3d: configure polygon mode when enabled
3315- broadcom/ci: update expected results
3316- v3dv/ci: update expected list
3317- nir/lower_clip: update inputs/outputs read/written bitmask
3318
3319Juston Li (13):
3320
3321- venus: refactor query feedback cmds
3322- venus: acquire mutex when recycling query feedback cmds
3323- venus: free query batches for VK_COMMAND_POOL_RESET_RELEASE_RESOURCES_BIT
3324- venus: add comments for query feedback batch free list
3325- venus: recycle linked query feedback immediately during submission
3326- venus: handle empty resolved query feedback list
3327- venus: fix image reqs cache store locking
3328- venus: extract cache hash/equals functions into common
3329- venus: image format properties cache
3330- venus: move feedback on empty last batch to prior batch
3331- venus: fix VkDeviceGroupSubmitInfo cmd counts from feedback
3332- venus: extend device format prop cache with VkFormatProperties3
3333- Revert "zink: store last pipeline directly for zink_gfx_program::last_pipeline"
3334
3335Kai Wasserbäch (2):
3336
3337- fix(FTBFS): clc: adapt to new LLVM 19 DiagnosticHandlerTy
3338- fix(FTBFS): clover: adapt to new LLVM 19 DiagnosticHandlerTy
3339
3340Karol Herbst (131):
3341
3342- nak/algebraic: merge run and main function
3343- nak/algebraic: write code to an output file
3344- clc: use spirv triple starting with llvm-17
3345- clc: add support for the native spir-v backend
3346- rusticl/kernel: run opt/lower_memcpy later to fix a crash
3347- rusticl/kernel: add a few comments in regards to pass ordering
3348- rusticl/kernel: no need to reset the scratch size anymore
3349- nir/printf: remove treat_doubles_as_floats
3350- clc: require LLVM-14
3351- clc: merge blocks handling optional features
3352- clc: require LLVM-15
3353- nir: rework and fix rotate lowering
3354- rusticl/program: rework debug logging option
3355- rusticl/spirv: do not attempt to parse spirv after failed link
3356- rusticl/spirv: use bool::then_Some inside SPIRVBin::link
3357- rusticl/program: add clc_validator_options helper function
3358- rusticl/program: add debug option to validate internal spirvs
3359- nak/opt_out: fix comparison in try_combine_outs
3360- nak: simplify phi_dsts
3361- nak: make it compile with clippy
3362- rusticl/meson: use rust_abi instead of rust_crate_type
3363- rust/spirv: fix clippy lint on unneeded late initialization
3364- rusticl/kernel: check that local size on dispatch doesn't exceed limits
3365- nak/meson: specify rust flags globally and allow some clippy lints
3366- nak: fix clippy::extra_unused_lifetimes warnings
3367- nak: fix clippy::mem_replace_with_default warnings
3368- nak: fix clippy::useless_conversion warnings
3369- nak: fix clippy::needless_lifetimes warnings
3370- nak: fix clippy::needless_borrow warnings
3371- nak: fix clippy::while_let_loop warnings
3372- nak: fix clippy::match_like_matches_macro warnings
3373- nak: fix clippy::needless_return warnings
3374- nak: fix clippy::redundant_closure warnings
3375- nak: fix clippy::unwrap_or_default warnings
3376- nak: fix clippy::manual_while_let_some warnings
3377- nak: fix clippy::clone_on_copy warnings
3378- nak: fix clippy::single_match warnings
3379- rusticl/util: add a wrapper around "thread-safe" C types
3380- rusticl/mesa/device: convert to ThreadSafeCPtr
3381- rusticl/mesa/screen: convert to ThreadSafeCPtr
3382- rusticl/mesa: add thread-safe wrapper for pipe_image_views
3383- rusticl/context: store SVM pointers as usize
3384- rusticl/gl: mark GLCtxManager as Send + Sync
3385- rusticl/mem: make Mem Send/Sync by storing mapping ptrs as usize
3386- rusticl/program: mark NirKernelBuild as Send and Sync
3387- meson: remove opencl-external-clang-headers option and rely on shared-llvm
3388- clc: force fPIC for every user when using shared LLVM
3389- nir/lower_cl_images: record image_buffers and msaa_images
3390- rusticl/mem: properly handle buffers
3391- rusticl/mem: support GL_TEXTURE_BUFFER
3392- rust/api: add RustTypes enum
3393- rusticl/util: support nested structs in offset_of!
3394- rusticl/api: allow CLObjectBase to be placed anywhere
3395- rusticl/icd: move get_ref_vec_from_arr into the Rusticl type
3396- rusticl/icd: move refcnt() and get rid of needless atomic ops
3397- rusticl/icd: move retain() and release()
3398- rusticl/icd: move get_arc_vec_from_arr and rename it
3399- rusticl/icd: fold leak_ref into its only consumer
3400- rusticl/icd: move get_ref()
3401- rusticl/device: deduplicate devices with sorting
3402- rusticl/icd: move from_arc() and rename it
3403- rusticl/event: drop from_cl_arr and use arcs_from_arr
3404- rusticl/icd: move get_arc() and rename it
3405- rusticl/icd: split Arc part out of CLObject into new trait
3406- rusticl/device: get rid of pointless Arc overhead
3407- rusticl/icd: actually allow dispatching CL types
3408- rusticl/mem: split into Buffer and Image
3409- rusticl/mem: use pattern matching in is_parent_buffer
3410- rusticl/mem: move fill methods into concrete types
3411- core/memory: drop Arc for &Arc<Queue> function parameters
3412- rusticl/mem: move map methods into concrete types
3413- rusticl/mem: move shadow sync methods into concrete types
3414- rusticl/mem: split unmap into Buffer and Image versions
3415- rusticl/mem: move copy and write buffer impls into Buffer
3416- rusticl/mem: split read_to_user_rect into Buffer and Image versions
3417- rusticl/mem: split write_from_user_rect into buffer and image
3418- rusticl/mem: move copy_to_rect into Buffer
3419- rusticl/mem: split copy_to into Buffer and Image
3420- rusticl/mem: split Buffer::copy_to into Buffer and Image versions
3421- rusticl/mem: split Image::copy_to into Buffer and Image versions
3422- rusticl/mem: get rid of pixel_size
3423- rusticl/mem: move tx_image into Image
3424- rusticl/mem: fold tx_raw into tx
3425- rusticl/image: call tx on the parent buffer directly
3426- rusticl/mem: move is_parent_buffer into Image
3427- rusticl/mem: move tx into Buffer
3428- rusticl/mem: remove get_res
3429- rusticl/mem: move comment describing how mapping works
3430- rusticl/mem: reorganize Image::map
3431- rusticl/mem: move MemBase::map into the users
3432- rusticl/mem: move tx_raw_async methods into Buffer and Image
3433- rusticl/mem: move Buffer and Image specific fields into the subtypes
3434- rusticl/mem: reorganize MemBase::from_gl a little
3435- rusticl/mem: move pipe_image_host_access into Image
3436- rusticl/kernel: recalculate scratch and shared memory after opts
3437- rusticl/program: fix CL_PROGRAM_BINARIES for devs with no builds
3438- meson/rusticl: import rust instead of unstable-rust
3439- clc: include opencl-c.h for extensions needing it
3440- meson: do not pull in clc for clover
3441- intel: Only build shaders with anv and iris
3442- zink: lower unaligned memory accesses
3443- rusticl/context: complete conversion of SVM pointers to usize
3444- rusticl/memory: store host_ptr as usize
3445- rusticl/memory: make closures Send and Sync
3446- rusticl/event: make EventSig Send + Sync
3447- rusticl/spirv: mark SPIRVBin as Send and Sync
3448- rusticl/kernel: make it Send and Sync
3449- rusticl/icd: verify all cl classes are Send and Sync
3450- rusticl/meson: remove -Aclippy::arc-with-non-send-sync flag
3451- rusticl/kernel: make builds private
3452- rusticl/event: we need to call the CL_COMPLETE callback on errors as well
3453- rusticl/kernel: assign sampler locations before DCEing variables
3454- rusticl/device: support query_memory_info to retrieve available memory
3455- drm-uapi: Sync nouveau_drm.h
3456- nvk: use c.get_supported_arguments for compiler flags
3457- nouveau: import libdrm_nouveau
3458- nouveau: call glsl_type_singleton_init_or_ref earlier
3459- nouveau/drm: drop immediate parameter from nouveau_pushbuf_new
3460- nouveau/drm: rely on nouveau_pushbuf::channel being always set
3461- nouveau/drm: drop unused chan argument from nouveau_pushbuf_kick
3462- nouveau/drm: remove nouveau_client::id
3463- rusticl/util: make create_pipe_box independent of pipe_box's field types
3464- meson: fix link failure with llvm-18
3465- rusticl/program: handle -cl-no-subgroup-ifp
3466- nouveau: fix potential double-free in nouveau_drm_screen_create
3467- nir: fix nir_shader_get_function_for_name for functions without names.
3468- rusticl: use stream uploader for cb0 if preferred
3469- rusticl/icd: remove CLObject
3470- event: break long dependency chains on drop
3471- rusticl/mesa/context: flush context before destruction
3472- nir/lower_cl_images: set binding also for samplers
3473
3474Kenneth Graunke (76):
3475
3476- iris: Don't return timestamps modulo 36-bits
3477- intel/dev: Fix typo (ajust -> adjust)
3478- iris: Implement query_memory_info() on discrete cards
3479- intel/nir: Pass devinfo and prog_data to brw_nir_lower_cs_intrinsics
3480- intel: Add driver support for hardware generated local invocation IDs
3481- intel: Use hardware generated compute shader local invocation IDs
3482- driconf: Advertise GL_EXT_shader_image_load_store on iris for SVP13
3483- iris: Implement INTEL_DEBUG=heaps
3484- intel/fs: Don't include sync.nop in instruction count statistics
3485- intel/fs: Don't rely on CSE for VARYING_PULL_CONSTANT_LOAD
3486- intel/brw: Delete enum brw_urb_write_flags
3487- intel/brw: Delete more unused defines
3488- intel/brw: Delete legacy SFIDs
3489- intel/brw: Delete SIMD4x2 URB opcodes
3490- intel/brw: Delete more unused compression stuff
3491- intel/brw: Delete SINCOS
3492- intel/brw: Delete constant_buffer_0_is_relative
3493- intel/brw: Delete compiler->supports_shader_constants
3494- intel/brw: Delete enum gfx6_gather_sampler_wa
3495- intel/brw: Delete brw_wm_prog_key::line_aa
3496- intel/brw: Delete unnecessary brw_wm_prog_data fields
3497- intel/brw: Delete some swizzling functions
3498- intel/brw: Delete brw_eu_util.c
3499- intel/brw: Change unit tests to use TEX_LOGICAL instead of TEX
3500- intel/brw: Delete SHADER_OPCODE_TXF_CMS[_LOGICAL]
3501- intel/brw: Delete SHADER_OPCODE_TXF_UMS
3502- intel/brw: Allow CSE on TXF_CMS_W_GFX12_LOGICAL
3503- intel/brw: Delete legacy texture opcodes
3504- intel/brw: Mark FIND[_LAST]_LIVE_CHANNEL as not writing the flag
3505- intel/brw: Replace CS_OPCODE_CS_TERMINATE with SHADER_OPCODE_SEND
3506- intel/brw: Avoid copy propagating any fixed registers into EOTs
3507- intel/brw: Handle SHADER_OPCODE_SEND without src[3] in copy prop
3508- intel/brw: Add assertions that EOT messages live in g112+
3509- intel/brw: Copy the smaller payload in fixup_sends_duplicate_payload
3510- intel/brw: Make register coalescing obey the g112-g127 restriction
3511- intel/brw: Call constant combining after copy propagation/algebraic
3512- intel/brw: Remove SIMD lowering to a larger SIMD size
3513- intel/brw: Unindent code after previous change
3514- iris: Fix tessellation evaluation shaders that use scratch
3515- intel/brw: Emit better code for read_invocation(x, constant)
3516- iris: Remove suballocation in iris_flush_resource()
3517- iris: Eliminate prototype introduced in the previous patch
3518- ra: Add debug functions for printing spill costs and benefits
3519- intel/fs: Avoid generating useless UNDEFs for every SSA def
3520- intel/brw: Split out 64-bit lowering from algebraic optimizations
3521- intel/brw: Don't consider UNIFORM_PULL_CONSTANT_LOAD a send-from-GRF
3522- intel/brw: Eliminate top-level FIND_LIVE_CHANNEL & BROADCAST once
3523- intel/brw: Fix check for 64-bit SEL lowering types
3524- intel/brw: Assert that min/max are not happening in 64-bit SEL lowering
3525- intel/brw: Use correct execution pipe for lowering SEL on DF
3526- intel/brw: Unify DF and Q/UQ lowering for MOV
3527- Revert "intel/brw: Don't consider UNIFORM_PULL_CONSTANT_LOAD a send-from-GRF"
3528- intel/brw: Fix opt_split_sends() to allow for FIXED_GRF send sources
3529- intel/brw: Fix register coalescing's LOAD_PAYLOAD dst offset handling
3530- intel/brw: Fix destination stride assertion in copy propagation
3531- intel/brw: Allow changing types for LOAD_PAYLOAD with 1 source
3532- intel/brw: Delete brw_fs_lower_minmax
3533- anv, hasvk: Save the original instance ID
3534- anv, hasvk: Move multiview remapping loop below output stores
3535- anv, hasvk: Fix nir_lower_multiview to re-emit outputs before EmitVertex
3536- intel/brw: Stop checking mlen on math opcodes in CSE pass
3537- intel/brw: Rearrange fs_inst fields
3538- intel/brw: Fix generate_mov_indirect to check has_64bit_int not float
3539- intel/brw: Fix lower_regioning for BROADCAST, MOV_INDIRECT on Q types
3540- intel/brw: Update comments for indirect MOV splitting
3541- intel/brw: Don't mention gfx7 limitations in shuffle comments
3542- intel/brw: Drop dead CHV checks.
3543- intel/brw: Drop align16 support in brw_broadcast()
3544- intel/brw: Drop gfx7 scratch message setup code
3545- intel/brw: Delete if_depth_in_loop
3546- intel/brw: Delete fs_visitor::vgrf helper
3547- intel/brw: Drop default size of 1 from bld.vgrf() calls
3548- intel/brw: Use SHADER_OPCODE_SEND for coherent framebuffer reads
3549- intel/brw: Replace FS_OPCODE_LINTERP with BRW_OPCODE_PLN
3550- intel/brw: Make an fs_builder::SYNC helper
3551- isl: Set MOCS to uncached for Gfx12.0 blitter sources/destinations
3552
3553Konrad Dybcio (1):
3554
3555- freedreno/registers: Add some HWCG regs
3556
3557Konstantin (8):
3558
3559- util/printf: Include stdio.h
3560- util/printf: Expose util_printf_prev_tok
3561- ac/debug: Handle the output of recent umr versions
3562- radv/debug: Canonicalize shader addr
3563- radv: Canonicalize addresses in radv_find_shader
3564- radv/debug: Try to find unbound shaders
3565- radv/debug: Dump descriptor binding information
3566- ac/parse_ib: Always print the value of the whole register
3567
3568Konstantin Seurer (105):
3569
3570- nak/repair_ssa: Remap PHI sources as well
3571- ac/llvm: Enable helper invocations for quad OPs
3572- radv: Vectorize load_global_constant
3573- lavapipe: Fix DGC vertex buffer handling
3574- gallivm: Use saturating fpto*i conversions
3575- lavapipe: Mark vertex elements dirty if the stride changed
3576- lavapipe: Report the correct preprocess buffer size
3577- radv: Implement NIR debug printf
3578- llvmpipe: Stop refcounting sample functions
3579- llvmpipe: Compile sample functions on demand
3580- radv/rt: Use doubles inside intersect_ray_amd_software_tri
3581- llvmpipe: Fix building with llvm11
3582- nir/print: Don't print shared_size twice
3583- nir/print: Rename workgroup-size to workgroup_size
3584- radv/radix_sort: clang-format
3585- radv: Reduce the amount of radv_device_to_handle calls
3586- radv: Make radv_write_user_event_marker non-static
3587- radv: Emit user events during acceleration structure builds
3588- radv: Skip unused acceleration structure build paths
3589- radv/sqtt: Set SeparateCompiled for monolithic RT pipelines
3590- radv/sqtt: Handle ray tracing pipelines with no traversal shader
3591- radv/rt: Lower ray payloads like hit attribs
3592- radv/rra: Rename rra_chunk_type to rra_chunk_version
3593- radv/rra: Use memcpy for chunk descriptions
3594- radv/rra: Remove useless variable
3595- radv/rra: Refactor error handling
3596- radv/rra: Dump basic ray history tokens
3597- docs: Document RADV_RRA_TRACE_HISTORY_SIZE
3598- radv/rra: Implement ahit/isec counters
3599- amd/common: Use the correct register table for GFX10_3
3600- radv: Wire up ac_gather_context_rolls
3601- zink: Always set mfence->submit_count to the fence submit_count
3602- Revert "zink: always force flushes when originating from api frontend"
3603- llvmpipe: Use full subgroups when possible
3604- gallivm: Consider the initial mask when terminating loops
3605- lavapipe: Advertise VK_KHR_shader_maximal_reconvergence
3606- ci: Update llvmpipe trace checksums
3607- ac/parse_ib: Add and use print_addr
3608- ac/parse_ib: Dump the ADDR field of PKT3_SET_BASE
3609- ac/parse_ib: Annotate addresses with UAF/OOB info
3610- ac/parse_ib: Handle 32bit PKT3_DISPATCH_INDIRECT addrs
3611- ac/parse_ib: Handle more packets
3612- radv/rra: Avoid reading past the ray history buffer
3613- radv/meta: Add shader - device mapping for radv_build_printf
3614- vulkan/cmd_queue: Implement CmdBuildAccelerationStructuresKHR
3615- lavapipe: Implement VK_KHR_acceleration_structure
3616- lavapipe: Add ray traversal code
3617- lavapipe: Implement VK_KHR_ray_query
3618- lavapipe: Advertise VK_KHR_deferred_host_operations
3619- lavapipe: Advertise VK_KHR_acceleration_structure
3620- lavapipe: Advertise VK_KHR_ray_query
3621- lavapipe/ci: Document ray query failures
3622- docs: Document lavapipe ray tracing features
3623- vulkan: Implement DebugMarkerSetObjectNameEXT
3624- radv/rt: Implement RADV_DEBUG=shaderstats
3625- radv/rt: Add radv_ray_tracing_stage_info
3626- radv/rt: Fixup constant args
3627- aco: Only fix used variables to registers
3628- radv/rt: Avoid passing unused data to the next stage
3629- radv/rt: Inline constant trace_ray srcs into the traversal shader
3630- radv/rt: Inline constant information about ray flags
3631- radv/rt: Fix raygen_imported condition
3632- zink: Handle aoa derefs of images
3633- ac: Annotate context rolls
3634- ac/parse_ib: Replace the parameter list with ac_ib_parser
3635- ac/parse_ib: Implement annotations
3636- radv: Add support for IB annotations
3637- radv: Add an IB annotation layer
3638- ac: Improve context roll readability
3639- radv: Use radv_buffer_map for parsing IBs
3640- radv/rt: Use 32-bit offsets for load_sbt_entry
3641- radv: Skip more acceleration structure build markers
3642- radv/printf: Use fprintf instead of printf
3643- nir/print: Fix printing booleans with bit_size>1
3644- nir/serialize: Encode data for temporaries
3645- nir: Add lavapipe ray tracing intrinsics
3646- llvmpipe: Fix function call handling
3647- lavapipe: Add lvp_spirv_to_nir
3648- lavapipe: Make lvp_shader_init non-static
3649- lavapipe: Make lvp_create_pipeline_nir non-static
3650- lavapipe: Lower mem_constant variables
3651- lavapipe: Defer binding compute state
3652- lavapipe: Remove unused ray tracing variables
3653- lavapipe: Add more ray tracing helpers
3654- lavapipe: Pass lvp_ray_flags into lvp_aabb_intersection_cb
3655- lavapipe: Use the pipeline type in get_pcbuf_size
3656- lavapipe: Inline fill_ubo0
3657- lavapipe: Add an api_stage parameter to update_pcbuf
3658- lavapipe: Fix a memory leak in lvp_push_internal_buffer
3659- lavapipe: Implement VK_KHR_ray_tracing_pipeline
3660- lavapipe: Implement KHR_ray_tracing_maintenance1
3661- lavapipe: Implement VK_EXT_pipeline_library_group_handles
3662- lavapipe: Implement VK_KHR_ray_tracing_position_fetch
3663- radv: Destroy leaf_updateable_pipeline
3664- lavapipe: Handle accel struct queries in handle_copy_query_pool_results
3665- lavapipe: Implement ray_tracing_maintenance1 queries
3666- lavapipe: Do not use NIR_PASS during lowering
3667- lavapipe: Handle multiple planes in GetDescriptorEXT
3668- lavapipe: Explicitly support ycbcr formats
3669- Revert "gallivm/ssbo: mask offset with exec_mask instead of building the 'if'"
3670- radv: Handle all dependencies of CmdWaitEvents2
3671- nir/print: Do not access invalid indices of load_uniform
3672- radv: Fix radv_shader_arena_block list corruption
3673- radv: Remove arenas from capture_replay_arena_vas
3674- radv: Zero initialize capture replay group handles
3675
3676Krzysztof Kurek (1):
3677
3678- panfrost: fix shift overflow in \`bi_fold_constant\`
3679
3680Leo Liu (2):
3681
3682- radeonsi: fix video processing path without VPE enabled
3683- ac/gpu_info: Fix broken UVD firmware query
3684
3685Lepton Wu (1):
3686
3687- llvmpipe: Set "+64bit" for X86_64
3688
3689Lin, Ricky (1):
3690
3691- amd/vpelib: Rename the parameters of init vpe function
3692
3693Lionel Landwerlin (186):
3694
3695- anv: fix disabled Wa_14017076903/18022508906
3696- intel/aux_map: fix fallback unmapping range on failure
3697- anv: hide vendor ID for The Finals
3698- intel/decoder: make vertex data decoding optional
3699- intel/decoder: don't ignore BT entries at offset 0
3700- intel/genxml: add CCS_INSTDONE register
3701- intel/genxml: add GAM done register description
3702- intel/hang_viewer: add aux-tt view
3703- anv: export descriptor flushing functions
3704- anv: fix include guards
3705- anv: fix missing header
3706- anv: move generated draw flush helper to its own file
3707- anv: move draw commands to their own file
- anv: move compute/ray-tracing commands to their own file
- anv: rename video command file
- nir/alu_srcs_negative_equal: bail earlier if possible
- nir/comparison_pre_tests: update expectations
- anv: using a single struct for kernel upload
- anv: fix pipeline executable properties with graphics libraries
- isl: add print helpers for debug
- anv: implement undocumented tile cache flush requirements
- anv: reorder anv_astc_emu.c
- anv: remove unused perfetto declarations
- anv: rename layers entrypoints
- anv: add BO flag for internal driver allocations
- anv: track total state stream allocated blocks from the pool
- anv: track imported ray tracing pipeline groups
- anv: initial RMV support
- vulkan/runtime: handle new image layout
- anv: don't prevent L1 untyped cache flush in 3D mode
- anv: promote EXT_index_type_uint8 to KHR
- anv: promote EXT_line_rasterization to KHR
- anv: promote EXT_load_store_op_none to KHR
- anv: add missing alignment for AUX-TT mapping
- intel/ds: track predication of blorp operations
- vulkan/runtime: add helper to query attachment layout
- anv: ensure consistent layout transitions in render passes
- anv: add check that in renderpass barriers apply to attachments
- anv: handle image feedback loop usage
- anv: implement VK_EXT_attachment_feedback_loop_dynamic_state
- anv/hasvk: don't report error when intel_get_device_info_from_fd fails
- anv: factor out aux-tt binding logic for future reuse
- anv: rename aux_tt image field
- anv: retain ccs image binding address
- anv: fix transfer barriers flushes with compute queue
- vulkan/runtime: handle new dynamic states for attachment remapping
- docs/features: drop gen8+/gen9+ on Anv
- docs/features: synchronize new features for Anv
- vulkan/multialloc: bump max number to 16
- vulkan/runtime: rework VK_KHR_dynamic_rendering_local_read state tracking
- anv: reduce cache flushing for indirect commands on Gfx12.5+
- anv: don't unmap AUX ranges at BO delete
- isl: printout sparse usage
- isl: add a no-aux-align usage flag
- anv: move ALLOC_HOST_CACHED_COHERENT as define
- anv: use address helper to compute address u64 value
- intel/aux_map: add BSpec reference
- intel/aux_map: add helper to compute offset in aux data
- anv: re-introduce BO CCS allocations
- intel/dev: fix missing dependency on generated packing heaers
- anv: factor out post submit queue debug code
- intel/fs: indent lowering code to make it more readable
- intel/fs: rerun divergence prior to lowering non-uniform interpolate at sample
- anv: fix incorrect flushing on shader query copy
- meson: add a new option to enable intel-clc without building RT shaders
- intel/compiler: make default NIR compiler options visible
- intel-clc: move ISA generation to its own function
- intel/clc: add ability to output NIR
- intel-clc: print text input
- genxml: enable opencl code generation
- genxml: generate opencl packing headers
- genxml: remove NDEBUG_UNUSED
- intel/ds: new tracepoints for generated commands
- meson: add option to install intel-clc
- ci: build a host version of mesa for cross builds
- anv: rewrite internal shaders using OpenCL
- intel/shaders: add iris variant of indirect draws generation shader
- intel/shaders: enable gfx8 support
- iris: make binding table shifting values available outside iris_state.c
- iris: make KSP helper available outside iris_state.c
- iris: make URB programming available outside iris_state.c
- iris: factor out index buffer emission
- iris: add an option for not emit draw parameters
- iris: enable generated indirect draws
- meson: enforce build of intel-clc with anv/iris
- anv: remove redundant asserts
- anv: don't allocate aux padded BOs with host pointers
- anv: fix buffer marker cache flush issues on MTL
- anv: enable query clear/copy using shaders on MTL/ARL
- anv: fixup push descriptor shader analysis
- anv: factor out descriptor buffer flushing
- anv: reenable ANV_ALWAYS_BINDLESS
- anv: remove unused definition
- anv: fix Wa_16013994831 macros
- anv: fix emission of Wa_14015055625
- genxml: generate opencl temporary variables with private qualifier
- intel/clc: lower temp function/shader variables together
- intel/clc: workaround LLVM17 opaque pointers
- anv: disable Wa_16013994831
- ci/anv: add more testing for optimization paths
- intel/ci: bump anv/tgl fraction to 6
- intel/nir: only consider ray query variables in lowering
- anv: limit depth flush on dynamic render pass suspend
- anv: add missing generated file dep
- anv: optimize push descriptor updates
- anv: add new heap/pool for descriptor buffers
- anv: create new helper for small allocations
- anv: add a second dynamic state heap for descriptor buffers
- anv: move aux-tt to general state pool
- anv: allocate slice_hash for descriptor buffer
- anv: allocate border colors for descriptor buffers
- anv: allocate fsr states for descriptor buffer
- anv: implement data write entry points for EXT_descriptor_buffer
- anv: compute a sampler hash based on parameters
- anv: add embedded sampler parameters in descriptor set layout hash
- intel/fs: add plumbing for embedded samplers
- nir: add additional flag to resource_intel for embedded samplers
- anv: add embedded sampler support
- anv: add new helper to update  binding table pool offset
- anv: add descriptor set layout support for descriptor buffers
- anv: add pipeline/shader support for descriptor buffers
- anv: handle push descriptor writes with descriptor buffers
- anv: implement descriptor buffer binding
- anv: disable mutable combined image/sampler in descriptor buffer
- anv: expose VK_EXT_descriptor_buffer
- anv: fix non matching image/view format attachment resolve
- anv: fix helper usage for CmdUpdateBuffer()
- anv: remove some wrapping around mmap
- anv: add support for VK_EXT_map_memory_placed
- anv: delay internal shader upload to when needed
- anv: fix companion command buffer initialization
- anv: fix incorrect ISL usage in buffer view creation
- anv/iris/blorp: use the right MOCS values for each engine
- anv: try to keep the pipeline in GPGPU mode when buffer transfer ops
- anv: don't copy the null descriptor from the GPU memory
- intel/fs: fixup sampler header message
- anv: return unsupported for FSR images on Gfx12.0
- intel/fs: remove some unused send helpers
- anv: ignore descriptor alignment for inline uniforms
- intel/fs: bump max simd size of some messages for xe2
- anv: track embedded sampler counts in layouts
- anv: allocate pipeline bindings tables dynamically on the heap
- anv: avoid partially compiled warning with GPL
- blorp: handle a few allocation failure cases
- anv: fix invalid border color free
- anv: fix block pool allocation failure
- anv: fix temporary state pool allocation failures
- anv: fix bitfield checks in gfx runtime flushing
- anv: fix query clearing with blorp compute operations
- blorp: add support for cached dynamic states
- anv: reduce blorp dynamic state emissions
- anv: optimize emission of dynamic state with blorp
- anv: fix protected memory allocations
- anv: pull surface state copies for secondary in one loop
- anv: disable protected content around surface state copies
- anv: disable generated draws in protected command buffers
- anv: update protection fault property
- anv: fix incorrect blorp dynamic state heap usage
- intel/fs: printout a couple of more late compile steps
- intel/fs: fixup instruction scheduling last grf write tracking
- anv: add missing data flush out of L3 for transform feedback writes
- anv: mark descriptors & pipeline dirty after blorp compute
- isl: set NullPageCoherencyEnable for depth/stencil sparse surfaces
- anv: only check patch_control_points changes in runtime flush
- anv: increase maxResourceDescriptorBufferRange on DG2+
- anv: reuse vk_common_GetImageSubresourceLayout
- anv: move all format props checks to anv_get_image_format_properties()
- drirc: rename hasvk only option
- vulkan: track compression control flags on vk_image
- anv: implement VK_EXT_image_compression_control
- anv: disable capture replay with descriptor buffer
- anv: remove useless dynamic state allocation for samplers
- anv: add capture/replay support for image with descriptor buffers
- anv: add capture/replay support for buffer with descriptor buffers
- anv: add a new reserved pool for capture/release
- anv: enable shader border color capture/replay
- anv: enable capture/replay with descriptor buffers
- anv: disable dual source blending state if not used in shader
- intel/brw: fixup wm_prog_data_barycentric_modes()
- anv: fixup alloc failure handling in reserved_array_pool
- anv: fix leak of custom border colors
- anv: fix ycbcr plane indexing with indirect descriptors
- brw: add more condition for reducing sampler simdness
- anv: fix push constant subgroup_id location
- nir/divergence: add missing load_printf_buffer_address
- anv: use weak_ref mode for global pipeline caches

Louis-Francis Ratté-Boulianne (5):

- panfrost: factor out method to check whether we can discard resource
- panfrost: add copy_resource flag to pan_resource_modifier_convert
- panfrost: add can_discard flag to pan_legalize_afbc_format
- panfrost: Legalize before updating part of a AFBC-packed texture
- panfrost: Add AFBC packing support for RG formats

Luc Ma (1):

- gallium/u_blitter: Fix a few uninitialized fb_state

Luca Bacci (1):

- meson,windows: Use relative paths in Vulkan ICD manifest files

Lucas Fryzek (19):

- egl/wayland/sw: don't invert y \`wl_surface_damage_buffer`
- drisw/winsys: Flip y coordinate when creating pipe boxes
- drisw: clamp damage region to texture bounds
- llvmpipe: explicitly reject (most) yuv formats
- gallium: Add dmabuf arg to memory fd allocation API
- llvmpipe: Implement dmabuf handling
- drisw: reuse kopper image extension vtable if modifiers/dmabuf is supported
- llvmpipe: conditionally export PIPE_CAP_DMABUF
- lavapipe: support VK_EXTERNAL_MEMORY_HANDLE_TYPE_DMA_BUF_BIT_EXT
- lavapipe: EXT_external_memory_dma_buf
- llvmpipe: make it possible to import and bind unbacked resources
- lavapipe: include drm_fourcc.h
- lavapipe: check drm modifier info during image create
- lavapipe: EXT_image_drm_format_modifier
- venus/ci: Add patch for modifiers test to check import/export bits
- lp: Wrap udmabuf usage in HAVE_LIBDRM ifdef
- ci/lp: Remove ext buffer YUV tests from fails
- llvmpipe: Only return null resource handle when dt is not mapped
- llvmpipe: Only use udmabuf if header is found

Lucas Stach (18):

- etnaviv: disable 64bpp render/sampler formats
- etnaviv: track resource sharing
- etnaviv: only add shared resources to implicit flush list
- etnaviv: implicitly update shared texture resources
- etnaviv: don't use int filter for depth textures
- etnaviv: tex_desc: emit texture comparator
- etnaviv: fix fixpoint conversion of negative values
- ci/etnaviv: update expectations
- etnaviv: fix depth writes without testing
- etnaviv: rs: take src dimensions into account when increasing height alignment
- etnaviv: use correct blit box sizes when copying resource
- etnaviv: fix separate depth/stencil clears
- etnaviv: trigger TS derivation after slow clear
- etnaviv: split TS and non-TS RS clear commands
- etnaviv: ci: update expectation with fixed depth/stencil clears
- etnaviv: rs: treat depth-only clear to X8Z24 surfaces as full clear
- ci/etnaviv: update expectation after piglit uprev
- etnaviv: flip the switch on MSAA support

Luigi Santivetti (1):

- pvr: return the OS page size for minMemoryMapAlignment

Lynne (1):

- radv/av1: limit profile and bit depth to supported values

M Henning (10):

- nvk: Don't clobber vb0 after repeated blits
- nak: Remove assert on nir->info.outputs_written
- nvk: Early-out impossible descriptor allocations
- nak: Fix ldg/stg/atomg encoding to use globalmem
- nak: Set fewer bits in writes_color
- nak: Use undef for unset FSOut components
- nak: Remove old union_find implementation
- nak: Rewrite union_find and use it in repair_ssa
- nak: Count GLOBAL_SIZE_OFFSET in bytes, not words
- nvk: Don't use a descriptor cbuf if it's too large

Manuel Stoeckl (1):

- util/disk_cache: try getenv(HOME) before getpwuid->pw_dir

Marcin Ślusarz (1):

- intel/compiler/xe2: fix decoding of sampler simd mode

Marek Olšák (244):

- gallium/u_vbuf: replace unnecessary dst_index with "i"
- gallium: remove unbind_trailing_count from set_vertex_buffers
- cso: don't unbind vertex buffers when enabling/disabling u_vbuf
- winsys/amdgpu: merge loops decrementing num_active_ioctls & unreferencing bufs
- winsys/amdgpu: cosmetic touchups
- winsys/amdgpu: don't clear buffer list elements after IB submission
- winsys/amdgpu: add more fence_reference helpers
- winsys/amdgpu: don't clear fence list elements after IB submission
- winsys/amdgpu: remove misplaced duplicated comment
- winsys/amdgpu: represent IB_MAX_SUBMIT_DWORDS in bytes
- winsys/amdgpu: represent max_ib_size_dw in bytes
- winsys/amdgpu: cosmetic touchups around IB sizes
- amd: unify NIR options between RADV and radeonsi
- ac/nir: don't write TCS outputs to memory if no_varying is set
- ac/nir: rename clipdist_enable_mask -> clip_cull_dist_mask
- ac/nir: optimize out multiplications in small line culling
- ac/nir: simplify code at the beginning of ac_nir_gs_shader_query
- ac,radeonsi: emulate GS primitive pipeline stat on gfx11 because of culling
- radeonsi: report more detailed output stats for shader-db
- radeonsi: expose shader profiles to other .c files
- radeonsi: don't use staging uploads for buffers & shaders with all VRAM visible
- radeonsi: deduplicate gfx10_ngg_get_vertices_per_prim / get_num_vert_per_prim
- radeonsi: change GS_STATE_PROVOKING_VTX_INDEX to 1 bit PROVOKING_VTX_FIRST
- radeonsi: split si_update_ngg_prim_state_sgpr into 2 functions
- radeonsi: pack GS_STATE_ESGS_VERTEX_STRIDE better to save 2 bits
- radeonsi: remove no-op additions for viewport0_y_inverted
- radeonsi: remove unused preloaded instance_divisor_constbuf
- radeonsi: rename \*trivial_vs_prolog -> \*trivial_vs_inputs
- radeonsi/gfx11: clean up MAX_ALLOWED_TILES_IN_WAVE programming
- radeonsi/ci: update gfx11 flakes
- radeonsi/gfx11: flush DB before Z/S clear to work around dEQP failures
- radeonsi: don't flush CS before and after every blitter invocation
- mesa,gallium: move the thread scheduler to src/util
- gallium: rename PIPE\_.._PIN_THREADS_TO_L3_CACHE -> .._UPDATE_THREAD_SCHEDULING
- st/mesa: rename ST_L3_PINNING_DISABLED -> ST_THREAD_SCHEDULER_DISABLED
- util: add mesa_pin_threads environment variable that sets a static affinity mask
- glthread: apply the thread scheduling policy when the context is created
- glthread: apply the thread scheduling policy when a batch executes synchronously
- gallium/hud: add "csv" option to print values to stdout as CSV
- nir: remove INTERP_MODE_COLOR
- nir: relax validation failure for generic TCS outputs with no_varying
- nir: remove and replace underused option pack_varying_options
- nir: replace lower_io_variables with a GLSL NIR flag
- nir: add a lower_mediump_io callback into options
- nir: add vertex divergence into nir_divergence_analysis
- winsys/amdgpu: fix a race condition when reading ws->num_buffers
- winsys/amdgpu: add real buffers of slab entries in the CS thread
- winsys/amdgpu: change the signature of amdgpu_add_bo_fences_to_dependencies
- winsys/amdgpu: move code out of amdgpu_add_bo_fences_to_dependencies for reuse
- winsys/amdgpu: merge 2 loops iterating over slab entries in amdgpu_cs_submit_ib
- winsys/amdgpu: merge 2 loops iterating over sparse BOs in amdgpu_cs_submit_ib
- winsys/amdgpu: merge 2 loops iterating over real BOs in amdgpu_cs_submit_ib
- winsys/amdgpu: skip code checking RADEON_USAGE_SYNCHRONIZED for slabs
- winsys/amdgpu: simplify amdgpu_do_add_buffer to remove memset
- winsys/amdgpu: don't ref/unref slab BOs in amdgpu_cs_submit_ib
- radeonsi: use num_vertex_buffers instead of ARRAY_SIZE
- radeonsi/ci: add gfx11 flakes
- gallium: always set vertex elements before setting vertex buffers
- gallium/u_blitter: set take_ownership=true for set_vertex_buffers
- st/mesa: set take_ownership=true for set_vertex_buffers in st_draw_quad
- gallium/util: add take_ownership parameter into util_draw_vertex_buffer
- st/mesa: set take_ownership=true for util_draw_vertex_buffer in st_DrawTex
- st/mesa: set take_ownership=true for set_vertex_buffers in st_pbo_draw
- gallium/hud: set take_ownership=true for set_vertex_buffers
- cso: remove CSO_UNBIND_VERTEX_BUFFER0
- gallium/u_threaded: remove the count=0 path from tc_call_set_vertex_buffers
- gallium/u_threaded: allow drivers to change tc_call_set_vertex_buffers function
- gallium: remove take_ownership from set_vertex_buffers, assume it's true
- gallium/noop: don't leak resources due to take_ownership
- radeonsi,aco: remove the VS prolog
- gallium/u_threaded: expose helpers for filling set_vertex_buffers externally
- st/mesa: rename attribs -> arrays in st_atom_array to indicate non-zero strides
- st/mesa: do (inputs_read & enabled_arrays) outside setup_arrays
- st/mesa: do (inputs_read & ~enabled_arrays) outside st_setup_current
- st/mesa: move a piece of _mesa_draw_array_attrib out of the loop in setup_arrays
- st/mesa: cosmetic touchups in st_atom_array.cpp
- st/mesa: change the update enum of vertex elements
- st/mesa: move st_update_functions into st_context
- st/mesa: constify the pipe_draw_info parameter and remove obsolete comments
- mesa: inline {Create,Draw}GalliumVertexState callbacks
- mesa: inline _mesa_set_vao_immutable
- mesa: add gl_vertex_array_object::NonIdentityBufferAttribMapping
- util/idalloc: make deleting invalid IDs a no-op
- mesa: remove unused _mesa_HashTable code
- mesa: clean up unnecessary _mesa_HashTable locked/unlocked wrappers
- mesa: re-format main/hash.h, move inlines to the end, some code to main/hash.c
- mesa: fold _mesa_HashDeleteAll into _mesa_DeleteHashTable
- mesa: remove _mesa_HashTable::InDeleteAll
- st/mesa: merge 3 unlikely blocks in _mesa_get_bufferobj_reference
- st/mesa: remove !obj checking in _mesa_get_bufferobj_reference when it's useless
- mesa: fix incorrect _mesa_HashInsertLocked parameter in _mesa_EndList
- mesa: use util_idalloc_alloc_range for _mesa_HashFindFreeKeyBlock
- winsys/amdgpu: convert amdgpu_cs.c to .cpp
- winsys/amdgpu: enable unlimited number of parallel queues for VCN
- util/idalloc: optimize foreach by tracking the greatest non-zero element
- mesa: declare _mesa_HashTable::id_alloc as non-pointer
- mesa: declare _mesa_HashTable inside structures instead of as a pointer
- mesa: remove isGenName parameter from _mesa_HashInsert
- mesa: use util_idalloc_foreach for looping in _mesa_HashTable
- mesa: replace _mesa_HashTable::ht with util_sparse_array for faster lookups
- d3d12: make DrawTransformFeedback not depend on the vertex buffer offset
- mesa: don't use the slow VAO path except for drivers that want to use it
- st/mesa: add VAO fast path C++ template variants for st_update_array callback
- st/mesa: optimize st_update_arrays using lots of C++ template variants
- glthread: re-enable thread scheduling in st/mesa when glthread is disabled
- glthread: use _mesa_glthread_fence_call() instead of duplicating that code
- glthread: add no_error variants of glDrawElements*
- glthread: add no_error variants of glDrawArrays*
- glthread: remove cmd_size from constant-sized calls
- glthread: clean up how vertex stride is packed
- glthread: pack "size" in Pointer calls as 16 bits
- mesa: deduplicate get_index_size_shift code
- mesa: deduplicate is_index_type_valid code
- glthread: pack the primitive type to 8 bits
- glthread: pack the index type to 8 bits
- glthread: rewrite glDrawElements call packing
- glthread: rewrite glDrawArrays call packing
- glapi: fix type names for glthread and handle all types
- glthread: sort fixed-sized parameters before returning them
- glthread: move global marshal_XML.py functions into class marshal_function
- glthread: precompute fixed_params and variable_params lists
- gltrhead: merge 3 blocks conditional on marshal_sync in print_async_body
- glthread: separate unmarshal function generation into print_unmarshal_func
- glthread: separate marshal code generation into print_marshal_async_code
- glthread: remove "if True" from print_marshal_async_code
- glapi: pass pointer size to python for glthread from meson
- glthread: pack glVertexAttribPointer calls better
- glthread: fix multi draws with a negative draw count
- glthread: pack uploaded user vertex buffers and offsets better
- glthread: deduplicate batch finalization code
- glthread: don't check cmd_size for small variable-sized calls
- glthread: use marshal_count instead of count for more functions
- glthread: rewrite glBindBuffer packing
- glthread: add a packed variant of glDrawElements with 16-bit count and indices
- glthread: add a packed version of DrawElementsUserBuf
- glthread: generate packed versions of gl*Pointer/Offset calls
- amd: update addrlib
- mesa: deduplicate initialization of gl_pixelstore_attrib
- mesa: move struct gl_pixelstore_attrib into glthread.h
- glthread: track glPixelStore(GL_UNPACK_*)
- glthread: execute small glBitmap asynchronously
- glthread: execute small glDrawPixels asynchronously
- glthread: invert _mesa_glthread_has_no_{un}pack_buffer by removing the negation
- amd/registers: add correct gfx11.x enums for BINNING_MODE
- radeonsi: disable binning correctly on gfx11.5
- radeonsi/gfx11: fix programming of PA_SC_BINNER_CNTL_1.MAX_ALLOC_COUNT
- radeonsi/gfx10.3: add a GPU hang workaround for legacy tess+GS
- radeonsi: allocate only one set of tessellation rings per device
- radeonsi/gfx11: program the attribute ring right before draws
- radeonsi: program tessellation rings right before draws
- radeonsi/gfx11: program SAMPLE_MASK_TRACKER_WATERMARK optimally for APUs
- ac: use the gfx11 shadowed register tables for gfx11.5
- radeonsi/gfx11: add missing DCC_RD_POLICY setting
- radeonsi: add radeonsi_cache_rb_gl2 option enabling GL2 caching for CB and DB
- nir/divergence_analysis: change function prototypes
- nir/divergence_analysis: load_primitive_id is convergent within a primitive
- nir/divergence_analysis: load_instance_id is convergent within a primitive
- nir/divergence_analysis: handle derefs of system values
- nir: print nir_io_semantics::invariant
- nir: add nir_block::divergent to indicate a divergent entry condition
- ac/llvm: fix SSBO bounds checking by using raw instead of struct opcodes
- radeonsi: fix the DMA compute shader
- radeonsi: don't test so many wave limits for AMD_TEST=testdmaperf
- nir: add a utility computing post-dominance of SSA uses
- nir: add nir_opt_varyings, new pass optimizing and compacting varyings
- nir/tests: add tests for nir_opt_varyings
- radeonsi: set the lower_mediump_io callback for GLSL
- radeonsi: set trivial NIR options for nir_opt_varyings
- radeonsi: enable uniform propagation for varyings except VP/Energy
- radeonsi: add test failures due to incorrect tests for nir_opt_varyings
- st/mesa: get dual slot input info from NIR if IO is lowered
- st/mesa: lower sysvals slightly sooner
- st/mesa: skip a few NIR passes that don't work with lowered IO
- glsl/linker,st/mesa: enable nir_opt_varyings and lower IO in the linker
- amd/ci: update stoney results
- r300: port scanout pitch alignment from the DDX to fix DRI3
- r300: enable tiling for scanout to fix DRI3 performance
- radeonsi/ci: run GLCTS, ESCTS, and dEQP from the glcts directory
- radeonsi/ci: update failures
- Unbreak Viewperf by reverting "util: use crc32_z instead of crc32 and bump zlib dep to 1.2.9"
- gallium: use u_box_3d to initialize pipe_box instead of non-designated initializers
- gallium: increase the size of pipe_box y, height fields to allow bigger textures
- nir: rename AMD XFB intrinsics to \*_gfx11_amd
- nir,amd: add nir_intrinsic_load_debug_log_desc_amd and its use
- aco: implement aco_is_gpu_supported using switch statement
- aco: add a helper printing shader asm by disassembling via LLVM
- ac/llvm: remove remnants of gfx10 NGG streamout
- radeonsi: implement the shader debug log from ac_nir_store_debug_log_amd
- nir/validate: validate interp_mode of load_barycentric_*
- nir/lower_io: add nir_io_semantics::interp_explicit_strict
- nir/validate: validate more fields of nir_io_semantics
- tgsi_to_nir: translate TG4
- nir/opt_varyings: don't generate IO with unsupported bit sizes
- nir/opt_varyings: simplify nir_io_semantics::num_slots of directly-indexed slots
- nir/opt_varyings: handle load_input_vertex
- ac/surface: add radeon_surf::thick_tiling
- ac/nir: allow 16-bit results for resinfo
- ac/llvm: simplify extracting an element in get_image_coords
- ac/llvm: add support for 16-bit coordinates (A16) for image (non-sampler) opcodes
- ac/llvm: allow image loads to return less than 4 components, trim DMASK
- ac/llvm: remove handling of input and output loads/stores that are lowered
- ac/llvm: remove unused fields of ac_shader_abi
- ac/llvm: simplify the optimization barrier and apply it to the whole vector
- ac: add helper ac_get_ip_type_string to remove duplication
- nir: add more build helpers
- nir: allow FP16 in nir_format_linear_to_srgb
- nir: add nir_intrinsic_optimization_barrier_sgpr_amd
- nir: change "user_data_amd" sysval from 4 to 8 components
- nir/use_dominance: set the root as post-dominator of unmovable instructions
- util: add new format helpers
- util: import pipe_box and its helpers
- ac/llvm: fix assertions for texture instructions with 16-bit LOD bias
- ac/llvm: always trim components of texture instructions, trim DMASK
- ac/surface: constify and reindent NIR meta address-from-coord function params
- radeonsi/ci: update gfx11 failures
- radeonsi/gfx11: don't prefetch constants in binaries into the instruction cache
- radeonsi/gfx11: enable DCC fast clears for 8-bit and 16-bit formats
- radeonsi: use the same nir_lower_subgroups_options as RADV
- radeonsi: add the radeonsi_optimize_io option into the shader cache key
- radeonsi: check has_stable_pstate in the winsys
- radeonsi: move TCS epilog key bits to the key->ge.opt section
- radeonsi: fix initialization of occlusion query buffers for disabled RBs
- radeonsi: don't expose samples_identical and don't lower FMASK if it's disabled
- radeonsi: allow input NIR to use descriptors in image opcodes
- radeonsi: move blitter resource_copy_region implementation to si_gfx_copy_image
- radeonsi: move blitter clear_render_target impl into si_gfx_clear_render_target
- radeonsi: preserve NaNs in draw-based resource_copy_region
- radeonsi: use simpler UINT fallback formats for draw-based resource_copy_region
- radeonsi: remove si_use_compute_copy_for_float_formats
- radeonsi: change allow_flat_shading to make it a single condition
- radeonsi: don't call resource_copy_region in pipe->blit
- radeonsi/gfx11: implement DCC clear to "single" for fast non-0/1 clears
- radeonsi: disable VRS flat shading for selected 8xMSAA and thick tiling cases
- radeonsi: don't use si_get_flush_flags() for flushing images
- radeonsi: don't flush CB in si_launch_grid_internal_images if not needed
- radeonsi: don't flush CB and DB if there have been no draw calls
- radeonsi: enable fast FB clears for conditional rendering
- radeonsi: make clear_render_target clear DCC directly instead of via pipe->clear()
- radeonsi: don't add whether NIR is used into the shader key
- radeonsi: only expose 8 EQAA samples due to shader limitations
- radeonsi: always run nir_opt_16bit_tex_image
- radeonsi: use ip_type in debug code instead of hardcoding GFX
- radeonsi: implement user_data_amd for 5, 6, and 7 components correctly
- util: shift the mask in BITSET_TEST_RANGE_INSIDE_WORD to be relative to b

Mark Collins (32):

- tu/kgsl: Fix sync_wait'd FD in kgsl_syncobj_wait
- tu/a7xx: Update CCU layout logic for A7XX
- tu: Allow GMEM on A7XX when TU_DEBUG=gmem
- tu: Set A7XX registers in \`tu6_tile_render_begin`
- tu: Set \`CP_THREAD_CONTROL::CONCURRENT_BIN_DISABLE` in A7XX HW init
- tu: Only set PC/VFD PWR_CNTL regs on A6XX
- tu: Use \`CP_SET_PSEUDO_REG` for A7XX VSC stream regs
- tu/autotune: Use \`CP_EVENT_WRITE7::ZPASS_DONE` on A7XX
- tu: Set \`RB_UNKNOWN_88E4` for A7XX event blits
- freedreno/devices: Update A7XX tile values
- tu: Use full size color CCU in sysmem mode
- tu: Update CCU layout selection logic for seperate stencil stores
- tu: Allow event blit to resolve depth stencil formats
- tu: Fix 2D blit path for GMEM stores on A7XX
- tu: Use \`Z24_UNORM_S8_UINT_AS_R8G8B8A8` for A7XX GMEM D24S8 blits/clear
- tu: Disable LRZ properly on A7XX
- tu: Set RB_CCU_CNTL during HW init on A7XX
- tu: Fix CP_BLIT sync on A7XX
- tu: Clear \`VSC_UNKNOWN_0D08` on A7XX
- tu: Add blit cache flushing for input attachments
- tu: Unconditionally enable GMEM on A7XX
- fd/replay: Fix wrbuffer name extraction
- fd/replay: Dump wrbuf into cwd rather than exe directory
- fd/replay: Clamp dumped wrbuf to buffer size
- fd/replay: Clear wrbufs after submitting cmdstreams for DRM
- fd/replay: Add wrbuf support for KGSL/DXG
- fd/replay: Error when VMA AS allocation fails
- fd/replay+rddecompiler: Add option to clear wrbufs at start
- fd/rddecompiler: Disable IR3 cache for replay context
- fd/decode: Build generate_rd executable rather
- fd/replay: Use generate_rd as default CS generator
- fd/decode: Fix "OPTSIONS" typo in help messages

Mark Janes (18):

- hasvk: add missing linker arguments
- util: add parson for handling json files
- intel/dev: specify struct intel_device_info type details in python
- intel/dev: generate declarations for struct intel_device_info
- intel/tools: add intel device meson dependencies
- intel/dev: implement json serialization for intel_device_info
- intel/dev/tools: add json as an output format for intel_dev_info
- intel/tools: load json device info in drm_shim
- intel/dev: improve meson invocation for intel_device_info gen
- intel/compiler: generate a hash function to use with the shader cache
- iris: use device info sha in device renderer string
- anv: use intel_device_info to set device UUID
- intel/tools: move intel_dev_info to intel/tools
- intel/tools: add shader compiler hash key to json devinfo format
- pan/va: Add missing valhall_enums dep to bifrost_tests
- intel/dev: declare workarounds required by ATSM platforms
- intel/dev: remove pci revision from shader cache key
- intel/compiler: drop unused ray-tracing fields from cache hash

Martell Malone (3):

- nine: r500 under 20 fragments cap is a warning
- nine: detect emulation fallback of d3d coordinates
- nine: update verbiage for enduser device messages

Martin Krastev (5):

- svga/ci: land vmware mesa-ci lava farm
- svga/ci: workaround vmware farm's inability to use public DNS 8.8.8.8
- svga/ci: re-enable vmware farm
- svga/ci: add two new piglit flakes to svga
- svga/ci: disable vmware farm

Martin Roukala (né Peres) (10):

- radeonsi/ci: update vangogh's expectations after piglit uprev
- zink/ci: update navi31's expectations after piglit uprev
- zink/ci: update polaris10's expectations after piglit uprev
- radv/ci: switch vkcts-polaris10 from mupuf to KWS' farm
- radv/ci: add a vkcts-tahiti job
- radv/ci: add a vkd3d-tahiti job
- ci/b2c: rename .b2c-test-{vk,gl} to .b2c-x86_64-test-{vk,gl}
- ci/b2c: rename .deqp-test-valve into .b2c-deqp-test
- ci/b2c: allow setting the DTB to be used
- ci/valve: remove the traces runner

Mary Guillemard (37):

- nouveau: nvidia_header: Add AMPERE_A in vk_push_print
- nouveau: nvidia_header: Add TURING_COMPUTE_A and AMPERE_COMPUTE_A in vk_push_print
- nouveau: nvidia_header: Add AMPERE_COMPUTE_B in vk_push_print
- nouveau: nvidia-headers: Add compute array parsing to class_parser.py
- nouveau: nvidia-headers: Add nv_push_dump tool
- nouveau: mme: Add a dumper
- agx: Add more bitops in agx_bitop_table
- agx: Remove and/or/xor pseudo ops
- agx: Fuse not into and/or/xor
- agx: Add a bitop optimizer pass
- pan/bi: assert indices when offsets are present in bi_emit_tex_valhall
- pan/lib: Remove variables in blitter
- pan/bi: Rework indices for tex on Valhall
- pan/bi: Rework indices for image on Valhall
- pan/bi: Rework indices for attributes on Valhall
- pan/bi: Lower ubo table in indices for Valhall
- panfrost, pan/lib: Move pan_resource_table to panfrost
- nvk: Always copy conditional rendering value before compare
- drm-shim: Add io region handling in mmap
- panfrost: Add support for Panthor in drm-shim
- docs: Document Mali-G610 in drm-shim section
- panfrost: group up stubbed params in drm-shim
- nouveau: Add support for TERT opcodes in vk_push_print
- nouveau: Fix NINC TERT handling in vk_push_print
- nak: Support unaligned swizzles in 8/16 bits vec srcs
- nak: move folding logic to Src::fold_imm
- nak: Add F16 and F16v2 sources
- nak: Improve copy propagation pass to handle F16
- nak: Add 16-bits float operations
- nvk: Advertise shaderFloat16
- nvk: Allow various alu op to be vectorized for 2xfp16
- nak: Allow SHF to use immediate encoding for shift
- panvk: Return os_page_size for minMemoryMapAlignment
- panvk: Fix driver UUID not being filled
- panvk: Move to vk_properties
- panvk: Advertise VK_KHR_driver_properties
- nak: Pass has_mod to all form of src2 requiring it

Mary Strodl (2):

- rusticl: set OCL_ICD_VENDORS as directory, not file
- NirShader: don't fail on null constant_buffer

Matt Turner (5):

- util: Add DETECT_ARCH_HPPA macro
- util/tests: Disable half-float NaN test on hppa/old-mips
- meson: Limit intel_vk_rt to x86_64
- anv/drirc: Add option to control implicit sync on external BOs
- intel: Build float64 shader only for Vulkan
4352
4353Matthew Waters (1):
4354
4355- teximage: allow glCopyTex{Sub}Image[123]D into R/RG textures with OpenGL ES 2.0
4356
4357Max R (11):
4358
4359- d3d10umd: Fix compilation
4360- winsys/gdi: Handle R8G8B8 formats
4361- winsys/gdi: Custom acquisition of hDC
4362- d3d10umd: Use flush_frontbuffer for Present
4363- virgl: Fix compilation on MSVC
4364- virgl: Fix crash when no VE bound
4365- virgl: Implement PIPE_QUERY_GPU_FINISHED
4366- virgl: Allow importing resources without known templ
4367- virgl: Pass cmd_buf to flush_frontbuffer
4368- d3d10umd, meson: Allow naming d3d10umd DLLs
4369- d3d10umd: Rename d3d10sw target to d3d10umd
4370
4371Michel Dänzer (2):
4372
4373- egl/wayland: Flush after blitting to linear copy
4374- wsi/wayland: Dispatch event queue in wsi_wl_swapchain_queue_present
4375
4376Mike Blumenkrantz (313):
4377
4378- vk/cmdbuf: add back deleted maint6 workgraph bits
4379- lavapipe: use pushconstants2 for dgc
4380- lavapipe: fix devenv icd filename
4381- zink: fix separate shader patch variable location adjustment
4382- lavapipe: delete extra descriptor buffer layout validation
4383- zink: use local screen variable in surface creation
4384- zink: hook up maint6
4385- zink: use maint6 for multi-layer compressed surface creation
4386- zink: set more dynamic states when using shader objects
4387- lavapipe: KHR_dynamic_rendering_local_read
4388- zink: always map descriptor buffers as COHERENT
4389- zink: fix descriptor buffer unmaps on screen destroy
4390- lavapipe: RM2024 extension promotions
4391- zink: add a tu flake
4392- zink: prune dmabuf export tracking when adding resource binds
4393- zink: fix sparse bo placement
4394- zink: zero allocate resident_defs array in ntv
4395- zink: move sparse lowering up in file
4396- zink: run sparse lowering after all optimization passes
4397- zink: add back (safe) optimizations after sparse lowering
4398- zink: split out sparse_residency_code_and lowering
4399- mesa: plumb errors through to texture allocation
- zink: adjust swizzled deref loads by the variable component offset
- nir/lower_io: fix handling for compact arrays with indirect derefs
- zink: only add arrays to indirect non-tcs variables
- zink: promote a conditional on gfx shader destroy
- zink: clamp zink_gfx_lib_cache::stages_present for generated tcs
- zink: promote gpl libs freeing during shader destroy out of prog loop
- zink: don't add VK_IMAGE_CREATE_2D_ARRAY_COMPATIBLE_BIT for sparse textures
- zink: add a ci skip
- ci: bump VVL to snapshot-2024wk06
- zink: update vvl expectations
- mesa: check driver format support for certain GetInternalformativ queries
- zink: always enable glsl_correct_derivatives_after_discard
- zink: add a750 baseline
- zink: delete maxDescriptorBufferBindings checks
- zink: flag the use_img as unsync access with buf2img copies, not the swapchain
- zink: pre-check formats for samplecount support
- zink: validate sample count on image create
- zink: add an assert for dummy fb surface creation
- zink: compute bo unique_id on use, not creation
- zink: avoid infinite recursion on (very) small BAR systems in bo alloc
- egl/x11/swrast: deduplicate ANGLE_sync_control_rate enablement
- drisw: hook up EXT_buffer_age
- drisw/egl: delete unused buffer age handling
- vk/wsi/x11/sw: use swapchain depth for putimage
- mesa: add more driver support checks for more format queries
- zink: add checks/compat for low-spec descriptor buffer implementations
- zink: add a second fence disambiguation case
- zink: force host-visible allocations for MAP_COHERENT resources
- zink: hook up KHR_dynamic_rendering_local_read
- zink: use KHR_dynamic_rendering_local_read
- ci: make clang-format job warn on failure instead of killing the pipeline
- zink: handle stencil_fallback in zink_clear_depth_stencil
- zink: don't destroy the current batch state on context destroy
- zink: only scan active batch states for free states if > 1 exist
- zink: fix longstanding issue with active batch state recycling
- zink: assert that batch_id is valid in zink_screen_check_last_finished()
- zink: move flagging rp_changed in zink_update_fbfetch() to caller
- zink: don't pre-init dummy fbfetch surface when missing nullDescriptor feature
- zink: also set null fbfetch surfaces when no fb surface is bound
- zink: break out null fbfetch init for descriptor buffer
- zink: create/resize dummy surfaces on-demand
- zink: start out with 256x256 sized dummy surfaces
- zink: don't pre-init null fbfetch info
- zink: clamp in_rp clears to fb size
- zink: fix (dynamic rendering) execution of scissored clears during flush
- zink: fix swapchain readback conditional
- zink: lock buffer age when chundering swapchain for readback
- zink: flag acquired swapchain image as readback target on acquire, not present
- zink: make kopper_swapchain_image::acquired the resource that acquired it
- zink: add a swapchain readback case for reading differently-acquired image
- zink: make readback attempts count towards ZINK_READBACK_THRESHOLD
- zink: update swapchain readback cache on create
- zink: set and manage a flag indicating that swapchain readback needs updating
- zink: only update swapchain readback cache on create if necessary
- zink: only update swapchain readback cache when necessary
- zink: use new flag to determine whether swapchain readback cache is usable
- zink: update nv blob baseline
- zink: add nvk baseline
- ci: disable clang-format job
- zink: apply all storage memory masks to control barriers if no modes are specified
- zink: emit SpvCapabilityImageMSArray for ms arrayed storage images
- zink: null out bo usage when allocating from slab
- zink: fix unsynchronized read-mapping of device-local buffers
- zink: delete unused buffer map conditional
- zink: force max buffer alignment on return ptrs for mapped staging buffers
- gallium: add a nboxes param to flush_frontbuffer
- winsys/sw: propagate nboxes to displaytarget_display()
- drisw: plumb through a swapBuffersWithDamage interface
- egl/wayland/sw: move swrast_update_buffers() directly into swapbuffers
- egl/wayland/sw: move dri2_wl_swrast_commit_backbuffer() directly into swapbuffers
- egl/wayland: unify back/current swapping between zink and swrast
- egl/wayland/sw: split out surface attach from dri2_wl_swrast_commit_backbuffer()
- egl/wayland/sw: call dri2_wl_swrast_attach_backbuffer() before swap
- egl/wayland/sw: trigger damage from put_image2
- egl/wayland/sw: move partial->full copy promotion to swapbuffers
- egl/wayland/sw: fix no-op updating of current backbuffer
- egl/wayland/sw: pass damage region through from put_image2 to wl_surface_damage
- egl/wayland/sw: clamp putimage geometry to surface size
- drisw/xlib: loop over all the boxes in display() hook
- drisw/winsys: loop over all the boxes in display()
- drisw: pass all frontend swapbuffer damage rects through
- egl/kopper: plumb through SwapBuffersWithDamage
- egl/kopper: advertise EXT_swap_buffers_with_damage only in non-sw mode
- egl/wayland: split out kopper vtable
- egl/wayland: add a separate hook for kopper buffer age
- egl/wayland: split out kopper swapbuffers functions
- egl/kopper: call swrast buffer age query for kopper+swrast
- kopper: set drawable buffer age
- egl/wayland/kopper: actually call kopper swapbuffer functions
- egl/wayland: split out kopper update_buffers
- egl/wayland: delete swrast references to zink
- zink: fix stencil-only blitting with stencil fallback
- zink: make zink_kopper_present_info public
- zink: use a slab allocator for zink_kopper_present_info
- zink: hook up VK_KHR_incremental_present
- zink: use VK_KHR_incremental_present to propagate damage rects
- zink: hook up KHR_partial_update
- vulkan/dispatch_table: add an uncompacted version of the table
- zink: use uncompacted vk_dispatch_table
- egl/dri2: use the right egl platform enum
- glx: only print zink failure-to-load messages if explicitly requested
- zink: stop enabling EXT_conservative_rasterization
- lavapipe bump descriptor buffer address space limits
- zink: fix PIPE_CAP_MAX_SHADER_PATCH_VARYINGS
- zink: call CmdSetRasterizationStreamEXT when using shader objects
- nvk: bump NVK_PUSH_MAX_SYNCS to 256
- zink: update nvk baseline
- util/blitter: iterate samples in stencil_fallback
- mesa: fix CopyTexImage format compatibility checks for ES
- zink: update nvk baseline with nvk changes
- driconf: add radv_zero_vram for Crystal Project (1637730)
- zink: update nv baseline
- zink: track whether shaders use load_barycentric_at_sample
- zink: apply zink_shader::uses_sample to fs variant updating
- zink: destroy batch states after copy context
- zink: set VkExternalMemoryBufferCreateInfo for opaque fds too
- zink: simplify vb masking on bind
- mesa: force rendertarget usage on required-renderable formats
- zink: try getting sparse page size again without storage bit on fail
- u/inlines: constify util_res_sample_count()
- zink: only add STORAGE bit for sparse images based on multisample usage
- zink: nvk baseline updates
- zink: set the sparse format usage flags directly based on queried props
- zink: delete faked_e5sparse
- zink: rename optimal_key in update_gfx_program_optimal()
- zink: use the sanitized key in update_gfx_program_optimal()
- zink: always sync and replace separable progs even with ZINK_DEBUG=noopt
- zink: add even more strict checks for separate shader usage
- zink: be even stricter with shader object usage about blocking invalid usage
- zink: remove stale comments for DRLR usage
- zink: add a pass to strip out multisample storage image ops
- zink: don't deref swapchain image array with UINT32_MAX
- zink: handle image_deref_samples when stripping MS image instrs
- zink: iterate all the modes when doing separate shader fixups
- mesa/st: add 'base_serialized_nir'
- mesa/st: add is_draw_shader param to st_finalize_nir
- mesa/st: when creating draw shader variants, use the base nir and skip driver opts
- mesa/st: use sanitized shader keys for feedback draws
- zink: do io fixup on patch variables too
- zink: defer present barrier to flush if a clear is pending
- zink: clamp present region size
- zink: clamp swapchain renderarea instead of asserting
- zink: set dynamic rendering color attachment layouts
- radv: inline radv_device_fault_detection_enabled
- ci: bump VVL to v1.3.281
- nir/divergence: add zink intrinsics
- nir/opt_varyings: update alu type when rewriting src/dest for moved ops
- zink: only check that CUBE_COMPATIBLE for images doesn't subtract flags
- zink: don't use set_foreach_remove with dmabuf_exports
- zink: make descriptor pool creation more robust
- zink: fix shaderdb pipeline compile
- zink: delete some ntv dead code
- zink: always sort io variables by location after re-creating them
- zink: use outputs_written mask to detect edge flag usage
- zink: update xfb info after lower_to_scalar
- zink: run scan_nir before variable rework
- zink: apply component offset for CLIP/CULL DIST1 location derefs
- zink: manually calc clip/cull distance sizes
- zink: add a helper to detect clip/cull dist locations
- zink: always use shader sizes for clip/cull dist variables
- zink: fix generated variable expansion
- zink: check for arrayness rather than tess io vars for indirect array vars
- zink: track a mask of arrayed io locations on shaders
- zink: call gather_info during shader creation
- zink: always check patch io during rework_io_vars
- zink: don't clobber indirect array reads with missing components
- zink: fix io slot calculation for vertex inputs in add_derefs
- zink: fix add_derefs case for compact arrays
- zink: only use location_frac for deref array indexing for compact variables
- llvmpipe: fix DRAW_USE_LLVM=0
- nir/lower_wpos_ytransform: move new value load to start of function, reuse
- nir/lower_wpos_ytransform: reuse input zw components for fragcoord rewrite
- nir/lower_wpos_ytransform: update comment to reflect variable usage
- nir/lower_wpos_ytransform: scalarize emit_wpos_adjustment
- nir/lower_wpos_ytransform: fix for lowered io
- glsl: handle xfb resources for spirv before running varying opts
- mesa: clamp binary pointer in ShaderBinary if length==0
- gallium: rework PIPE_CAP_POINT_SIZE_FIXED
- zink: delete some maintenance5 psiz pruning
- zink: fix add_derefs for partial interp loads of derefs
- zink: assert that ntv interp handling isn't doing implicit component expansion
- egl/x11: disable swapbufferswithdamage for zink without kopper
- glx/egl: fix LIBGL_KOPPER_DISABLE
- glsl: set PSIZ bit in outputs_written when injecting a 1.0 psiz write
- nir/lower_clamp_color_outputs: fix use with lowered io
- nir/lower_flatshade: break out location checking
- nir/lower_flatshade: fix with lowered io
- nir/lower_alpha_test: fix use with lowered io
- nir/lower_two_sided_color: rework for lowered io
- nir/lower_drawpixels: fix for lowered io
- nir/lower_clip_disable: fix for lowered io
- nir/lower_point_size_mov: rework.
- nir/lower_point_size_mov: fix for lowered io
- nir/texcoord_replace: fix scalarized io handling
- nir/dominance: fix comment
- drisw: reorder image extensions
- sw_winsys: add displaytarget_create_mapped
- winsys/null: implement displaytarget_create_mapped
- winsys/drisw: implement displaytarget_create_mapped
- winsys/drisw: implement dmabuf handling
- lavapipe: add a function for asserting external memory handle types
- winsys: add WINSYS_HANDLE_TYPE_UNBACKED
- winsys: add more stride members to winsys_handle
- lavapipe: EXT_queue_family_foreign
- lavapipe: rework mem handle type assert to handle dmabuf
- lavapipe: handle drm image format queries
- lavapipe: handle drm image imports
- docs: update lavapipe features
- nir: add compact_arrays to nir_shader_compiler_options
- nir/gather_info: fix gathering for compact arrayed builtins
- zink: set compact_arrays in compiler options
- microsoft/compiler: set compact_arrays in compiler options
- lavapipe: don't clamp index buffer size for null index buffer draws
- v3d: set use_clipdist_array=true for lower_clip?
- nir/lower_clip: surgerize for lowered io
- nir/lower_clip: handle scalarized io
- zink: block LA formats with srgb
- llvmpipe: clamp 32bit query results to low 32 bits rather than MIN
- lavapipe: clamp 32bit query results to low 32 bits rather than MIN
- agx: set compact_arrays in compiler options
- v3d: set compact_arrays in compiler options
- intel: set compact_arrays in compiler options
- freedreno: set compact_arrays in compiler options
- glsl: stop using PIPE_CAP_NIR_COMPACT_ARRAYS and check compact_arrays
- ttn: stop using PIPE_CAP_NIR_COMPACT_ARRAYS and check compact_arrays
- glsl: move an assert from st_context over to avoid using PIPE_CAP_NIR_COMPACT_ARRAYS
- mesa: delete LowerCombinedClipCullDistance from consts
- st/program: stop using PIPE_CAP_NIR_COMPACT_ARRAYS and use compact_arrays
- nine: stop checking PIPE_CAP_NIR_COMPACT_ARRAYS and use compact_arrays
- gallium: delete PIPE_CAP_NIR_COMPACT_ARRAYS
- zink: set indirect io compiler flags
- zink: set lower_to_scalar
- zink: rework rework_io_vars
- zink: set nir_io_glsl_lower_derefs in compiler options
- zink: add a pass to fix vertex input locations
- zink: enable opt_varyings with ZINK_DEBUG=ioopt
- zink: ci updates
- nir/remove_unused_io_vars: check all components to determine variable liveness
- ci: kill piano trace globally
- nir: print i/o variables in location order
- lavapipe: disable stencil test if no stencil attachment
- egl: fix defines for zink's dri3 check
- egl/android: fix zink loading
- egl: use os_get_option for MESA_LOADER_DRIVER_OVERRIDE
- zink: disable buffer reordering correctly on shader image binds
- nir/print: stop trying to match i/o vars using base/driver_location
- zink: add ZINK_DEBUG=nopc to completely disable precompilation
- zink: destroy shaderdb pipelines
- zink: add VK_PIPELINE_CREATE_CAPTURE_STATISTICS_BIT_KHR for shaderdb
- brw/lower_a2c: fix for scalarized fs outputs
- zink: copy shader name when copying shader info
- zink: run nir_lower_io_to_scalar (mostly) unconditionally and earlier
- zink: vectorize io loads/stores when possible
- zink: ci updates
- zink: prune some piglit cts fails
- loader: delete unused param from pipe_loader_vk_probe_dri()
- glx: fix some indentation
- glx: add an 'implicit' param to createScreen
- glx: pass implicit load param through allocation
- dri: plumb a 'implicit' param through createNewScreen interfaces
- gbm: plumb an 'implicit' param through device creation
- frontends/dri: plumb an 'implicit' param through screen init
- pipe-loader: plumb a flag for implicit driver load through screen creation
- zink: don't print error messages when failing an implicit driver load
- glx: silence more implicit-load zink errors
- mesa/st: don't use serialized_nir for cached shaders
- zink: make NOREORDER mode context-based
- zink: disable command reordering for compute-only contexts
- nir: store variable names to io instrs during io lowering
- nir/lower_io_to_scalar: preserve variable names when splitting io
- nir/clone: preserve intrinsic name field across clones
- nir/print: print io instr->name if available
- zink: preserve/merge variable names when generating new variables
- glthread: check for invalid primitive modes in DrawElementsBaseVertex
- zink: reconstruct features pnext after determining extension support
- zink: prune zink_shader::programs under lock
- zink: fully wait on all program fences during ctx destroy
- kopper: fix bufferage/swapinterval handling for non-window swapchains
- zink: slightly better swapinterval failure handling
- kopper: don't set drawable buffer age
- zink: clean up accidental debug print
- egl/x11: disable dri3 with LIBGL_KOPPER_DRI2=1 as expected
- zink: add a batch ref for committed sparse resources
- u_blitter: stop leaking saved blitter states on no-op blits
- freedreno/replay: use inttypes format string for 64bit
- frontends/dri: only release pipe when screen init fails
- frontends/dri: always init opencl_func_mutex in InitScreen hooks
- zink: clean up semaphore arrays on batch state destroy
- egl/dri2: fix error returns on dri2_initialize_x11_dri3 fail
- nir/lower_aaline: fix for scalarized outputs
- nir/linking: fix nir_assign_io_var_locations for scalarized dual blend

Mike Hsieh (1):

- amd/vpelib: Add param check for geometric scaling and refactor

Mohamed Ahmed (14):

- nil: change image_level_size() to take tiling in account
- nil: Add helper function to get tile size in pixels
- nil: Add helpers for conversion from pixel values to tiles
- nil: Expose tiling_extent_B()
- nil: Add support for sparse resident images
- nvk: add sparse queries
- nvk: enable sparse residency features
- nak: wire up shader resource residency intrinsics
- nak: wire up sparse image loads
- nvk: advertise shader resource residency
- nil: Add a nil_image::compressed bit
- nil: Add some helpers for DRM format modifiers
- nil: Support creating images with DRM modifiers
- nvk: enable rendering to DRM_FORMAT_MOD_LINEAR images

Mykhailo Skorokhodov (2):

- egl/wayland: Fix sRGB format look up for config
- ci/lima: expect fail of window_8888_colorspace_srgb on wayland

Nanley Chery (13):

- iris: Don't memset the extra_aux memory range
- iris: Don't memset CCS on integrated gfx12
- iris: Enable pass-through state init for gfx12 CCS
- isl: Pick a better initial state for zeroed MCS
- iris: Copy main ISL surf when reallocating in place
- iris: Report the correct modifier for Tile4 images
- iris: Use resource_get_param in resource_get_handle
- intel/isl: Remove inconsistency when choosing Tile64
- intel/isl: Remove inconsistency when encoding Tile64
- intel/isl: Remove a CCS_D check from gfx12+ code
- intel/isl: Enable a 64KB alignment WA for flat-CCS
- intel/isl: Use Tile64 to align images for CCS WA
- intel/isl: Disable miptails to align LODs for CCS WA

Neil Armstrong (1):

- freedreno: Add a750 clock gating control related registers

Nikita Popov (1):

- Pass no-verify-fixpoint option to instcombine in LLVM 18

Oskar Viljasaar (3):

- vulkan/properties: Start looping from the next member in GPDP2
- tu: Use common physical device properties infrastructure
- compiler/types: Fix glsl_dvec*_type() helpers

Patrick Lerda (16):

- glsl/nir: fix gl_nir_cross_validate_outputs_to_inputs() memory leak
- r300: fix vertex_buffer related refcnt imbalance
- r300: fix r300_destroy_context() related memory leaks
- r300: fix memory leaks when register allocation fails
- r300: fix constants_remap_table memory leak
- radeonsi/gfx10: fix main_shader_part_ngg_es memory leak
- r300: enable R400 cos and sin hardware vertex shader opcodes
- ac/llvm,radeonsi: fix memory leaks triggered by ac_nir_translate() errors
- r300: fix NIR passes regression
- r300: fix constants_remap_table memory leak related to the dummy shader path
- r300: fix r300_draw_elements() behavior
- panfrost: remove panfrost_create_shader_state() related dead code
- gallium/auxiliary/vl: fix typo which negatively impacts the src_stride initialization
- clover: fix pipe_box update regression
- clover: fix memory leak related to optimize
- r600: fix vertex state update clover regression

Paul Gofman (3):

- glsl: allow out arrays in #110 with allow_glsl_120_subset_in_110
- driconf: add a workaround for Joe Danger 2
- driconf: add a workaround for Joe Danger

Paulo Zanoni (35):

- zink: fix bind size handling in buffer_bo_commit()
- anv/sparse: add an extra step before anv_sparse_bind_resource_memory()
- anv/sparse: allow binding operations to match the resource size
- anv+zink/ci: remove recently fixed tests from the crash list
- anv/sparse: don't issue a single bind operation per vm_bind ioctl
- anv/sparse: leave the semaphore waits and signals to the vm_bind ioctl
- anv/sparse: don't use the bind_timeline when doing sparse binding
- anv: change the vm_bind-related kmd_backend vfuncs to return VkResult
- anv: add an anv_pipe_bits bit to allow invalidating the TLB
- anv/trtt: invalidate the TLB after writing TR-TT entries
- anv/trtt: update GFX_TRTT_VA_RANGE for LNL
- anv: don't leak device->vma_samplers
- anv: set shaderFloat64 to true when fp64_workaround_enabled
- driconf/anv: set fp64_workaround_enabled to DIRT 5
- anv/xe: don't leak xe_syncs during trtt submission
- anv/xe: don't overwrite the result from vk_sync_wait()
- vulkan: don't zero-initialize STACK_ARRAY()'s stack array
- anv, iris: add missing CS_STALL bit for GPGPU texture invalidation
- anv: reduce struct anv_image_memory_range from 32 to 24 bytes
- vulkan: reduce struct vk_object_base by 8 bytes
- anv/sparse: remove useless isl_surf_get_tile_info() call
- anv/sparse: remove unnecessary popcount assertions
- anv/sparse: adjust sparse_bind_image_memory debug messages
- anv/sparse: remove unused dump_vk_sparse_memory_bind()
- anv/sparse: replace device->using_sparse with device->num_sparse_resources
- anv/sparse: rework anv_free_sparse_bindings() error handling
- anv/xe: extract anv_vm_bind_to_drm_xe_vm_bind()
- anv/xe: add a 'flags' parameter to the vm_bind() kmd_backend function
- anv/xe: slightly improve error handling for the vm_bind ioctl
- anv/xe: assert we're using drm_syncobjs only once
- anv/xe: de-duplicate xe_exec_fill_sync()
- anv/xe: rename and refactor xe_exec_fill_sync()
- anv/sparse: fail the right way in anv_GetDeviceImageSparseMemoryRequirements()
- anv: const-correct anv_{image,buffer}_is_sparse()
- isl: add ISL_TILING_64_XE2 to isl_tiling_to_name()

Pavel Ondračka (38):

- r300: fix reusing of color varying slots for generic ones
- r300: skip draw if vertex shader does not write gl_Position
- r300/ci: switch to deqp-runner suite
- r300/ci: add the KHR gles2 tests
- r300/ci: move streaming-texture-leak from fails to skips
- r300: fix writemask for nir_intrinsic_load_ubo_vec4
- r300: skip backend DCE for vertex shaders
4820- r300: remove R3xx/R4xx backend absulute modifier lowering
4821- r300/ci: add dEQP on RV380
4822- r300: remove backend SLE and SGT support
4823- r300: add r300_is_only_used_as_float helper
4824- r300: optimize out more modifiers produced later
4825- r300: lower comparison ops early in NIR
4826- r300: remove SGE, SNE, SLT, SGE lowering in the backend
4827- r300: remove the remaining of backend constant folding
4828- r300: remove backend support for SUB
4829- r300/ci: update piglit fails
4830- r300: remove compiler tests
4831- r300/ci: add two more observed piglit flakes
4832- r300: fix vs output register indexing
4833- r300: add explicit flrp lowering
4834- ci: install xwayland in x86_64_test-gl
4835- ci: build nine in debian-testing
4836- ci: build nine tests
4837- r300/ci: enable nine tests
4838- r300: explicitly check if sin/cos input is already in correct range
4839- r300: move sin/cos input fixups to finalize_nir
4840- r300: remove some late NIR passes
4841- nir/lower_vec_to_regs: always set cursor before inserting decl_reg
4842- r300: check for the extra restrictions on presubtract swizzles
4843- r300: move presubtract pass later
4844- r300: optimize swizzle for inline constants
4845- r300: inline unoptimized_ra ntr option
4846- r300: get rid of the unused ubo_vec4_max ntr option
4847- r300: remove the ntr lower_cmp option
4848- r300: move lower_fabb option out of the options struct
4849- r300: remove nir_to_rc_options wrapper
4850- r300/ci: failures list update
4851
4852Peyton Lee (6):
4853
4854- radeonsi/vpe: remove wait source surface fence and while loop
4855- radeonsi/vpe: disable info log
4856- radeonsi/vpe: move flush to si_vpe_processor_end_frame
4857- radeonsi/vpe: support multi-buffer
4858- radeonsi/vpe: pre-allocate stream struceutre
4859- radeonsi/vpe: add support for p010
4860
4861Philip Rebohle (1):
4862
4863- radv: Remove dead shared variables after optimization loop.
4864
4865Philipp Zabel (6):
4866
4867- rusticl: work around reference-to-mutable-static warnings
4868- etnaviv: common: Add PIPE_3D feature bit
4869- etnaviv: Avoid duplicate query of ETNA_GPU_FEATURES_0 parameter
4870- etnaviv: hwdb: Add VIP_V7 and NN_XYDP0 feature bits
4871- etnaviv: Add nn_core_version field to etna_specs
4872- etnaviv/nn: Extend post-multiplier for v8 architecture
4873
4874Pierre-Eric Pelloux-Prayer (21):
4875
4876- radeonsi: compute epitch when modifying surf_pitch
4877- Revert "ci/radeonsi: disable VA-API testing on raven"
4878- radeonsi: emit cache flushes before draw registers
4879- radeonsi: adjust flags for si_compute_shorten_ubyte_buffer
- winsys/amdgpu: use syncobj rather than amdgpu fence
- ac, radeonsi: remove has_syncobj, has_fence_to_handle
- radeonsi: try to disable dcc if compute_blit is the only option
- meson: require libelf when radeonsi is built
- egl/drm: flush before calling get_back_bo
- radv: don't remove the blit queue from the device queues
- winsys/amdgpu: unmap user fence BO before destroy
- winsys/amdgpu: remove unused amdgpu_fence_is_syncobj
- wsi/wl: flush connection on swapchain failure
- mesa: deal with vbo_save_vertex_list::modes being NULL
- wsi/wl: check wsi_wl_surface's validity before use
- egl/wayland: use __DRI_IMAGE_PRIME_LINEAR_BUFFER in get_back_bo
- winsys/radeon: pass priv instead NULL to radeon_bo_can_reclaim
- radeonsi: preserve alpha if needed in kill_ps_outputs_cb
- amd: fix addrlib regression
- aco: don't use python 3.7+ feature in aco_opcodes.py
- radv: don't use python 3.9 feature in radv_annotate_layer_gen.py

Qiang Yu (1):

- radeonsi: split RADEON_USAGE_NEEDS_IMPLICIT_SYNC into CB and DB flags

Ray Smith (2):

- panfrost: Don't try to set bifrost blendable format on midgard
- panfrost: Fix format tables for v4 and v5

Rhys Perry (84):

- radv: do nir_shader_gather_info after radv_nir_lower_rt_abi
- nir/lower_non_uniform: set non_uniform=false when lowering is not needed
- nir/lower_shader_calls: remove CF before nir_opt_if
- aco: fix labelling of s_not with constant
- aco: add VOPD format
- aco: add VOPD statistic
- aco: refactor schedule_ilp main loop
- aco: implement VOPD scheduler
- aco: enable VOPD scheduler
- aco: fix >8 byte linear vgpr copies
- aco/tests: fix to_hw_instr.swap_linear_vgpr
- aco: refactor create_vopd_instruction
- aco: swap operands to create VOPD instructions
- aco: turn v_mov_b32 into addition to create VOPD instructions
- aco: improve printing of VOPD instructions
- aco/tests: add tests for VOPD operand swapping
- aco/tests: use raw strings in form_hard_clauses.nsa
- radv: support minmax filter for more formats
- aco/ra: don't initialize assigned in initializer list
- aco/ra: fix GFX9- writelane
- aco: don't combine linear and normal VGPR copies
- aco/ra: disable p_start_linear_vgpr allocation hint
- aco: allow p_start_linear_vgpr to use multiple operands
- aco: require linear vgpr uses to be late kill
- aco: only allow linear vgpr kills in top-level blocks
- aco/ra: constify various RegisterFile
- aco/ra: move parallelcopy creation into helper
- aco/ra: change get_reg_bounds() helper
- aco/ra: rework linear VGPR allocation
- aco/ra: disable live range splitting of linear vgprs
- aco/ra: emit linear VGPR parallel copy separately
- aco/tests: add tests for linear VGPR register allocation
- aco: optimize for purely linear VGPR copies
- nir/algebraic: don't create 64-bit min/max/ior if lowered
- nir/algebraic: remove duplicated iand(ien, ine)/ior(ieq, ieq) patterns
- nir/algebraic: optimize 64-bit comparisons with zero'd halves to 32-bit
- nir/lower_int64: allow 64-bit comparisons when lowering minmax
- nir/search: fix nir_replace_instr() debug code
- aco: don't pass constant to is_overwritten_since()
- radv: don't advertise DGC with LLVM
- radv: stop using 5/8 component SSBO stores
- radv,aco: allow VS prologs to increase VGPR usage
- aco: don't reuse misaligned attribute destination VGPRs in VS prologs
- aco/util: add small_vec
- radv: use dual_color_blend_by_location with Half-Life Alyx
- aco/cssa: reset equal_anc_out if merging fails
- aco/cssa: update comments
- aco: fix GFX6 buffer_load_dwordx4 opcode number
- aco: rename opcode->instruction
- aco: refactor VOPC opcode list
- aco: use single tuple for all opcode numbers
- aco: use op()
- aco: move dot/wmma instructions into VOP3P list
- aco: unify MIMG opcode lists
- aco/gfx11: fix scratch ST mode assembly
- aco: split instruction assembly into functions
- aco: always emit float mode for merged shaders compiled separately
- aco: avoid breaking clauses with waitcnt
- nir: add mqsad_4x8, shfr and nir_opt_mqsad
- aco: implement mqsad_4x8 and shfr
- ac/llvm: implement mqsad_4x8 and shfr
- amd: set has_shfr32=true
- radv: optimize msad_4x8 to mqsad_4x8
- radv: memset radv_pipeline_cache_object data
- nir: add nir_remove_after_cf_node helper
- aco: remove unreachable merge blocks
- aco: ensure loop exits exist in NIR
- aco: save/reset/combine has_divergent_continue in uniform branches
- nir,aco: add test intrinsics
- aco/tests: add isel test helpers
- aco/tests: add control flow tests
- aco: assume no unreachable blocks
- aco: don't include the clause in VMEM_CLAUSE_MAX_GRAB_DIST
- aco: remove occupancy check in dealloc_vgprs()
- aco/tests: don't assume constructor order
- aco/tests: remove LLVM 11 code
- radv: cache RT stage info
- aco: include LDSDIR in latency/etc stats
- aco: make store clauses more aggressively
- aco: schedule LDSDIR instructions
- aco: schedule LDS instructions
- aco: split vop3p results
- aco/waitcnt: fix DS/VMEM ordered writes when mixed
- aco: create lcssa phis for continue_or_break loops when necessary

Rob Clark (31):

- freedreno/a6xx: fix comment
- freedreno/registers: Pass full args to dump_c()
- freedreno/registers: De-duplicate xml_reg_files
- freedreno/registers: Don't re-parse files
- freedreno/registers: Generate copyright comment blurb
- freedreno/registers: Add basic kernel header support
- freedreno/registers: A couple newline changes
- tu/drm/virtio: Fix dmabuf import
- freedreno/drm: Submit should hold ref to device
- freedreno/drm: Fix teardown crash harder
- freedreno/decode: Fix prefetch handling for IB1 crash
- freedreno: Fix MSAA z/s layout in GMEM
- freedreno/crashdec: Find potential fault buffers
- tu: Give suballoc bo's a name
- freedreno/a6xx: Add dual_color_blend_by_location
- freedreno/a6xx: Fix z/s preserving sysmem clear blit
- freedreno/pps: Don't re-init perfcntrs
- freedreno: Add bo usage hints
- freedreno/drm: Add perfetto memory tracing
- tu: Add perfetto memory tracing
- pps: Enable memory traces
- pps: Config tweaks to avoid loosing traces
- freedreno/registers: Add license header
- egl/android: Fix gl_config dereference
- freedreno/drm/virtio: Fix deadlock on exit
- freedreno+virgl: Add missing driconf
- freedreno: Update a618 xfails
- ci: Add deqp fix for pipeline_statistics_3 tests
- tu: Fix a6xx lineWidthGranularity
- egl/android: Fix sRGB visuals
- freedreno/ir3: Fix ldg/stg offset

Robert Beckett (1):

- vulkan/wsi: fix force_bgra8_unorm_first

Robert Mader (5):

- crocus: Support offset query for multi-planar planes
- panfrost: Use pipe resource helper
- egl: Implement EGL_EXT_config_select_group
- egl: Implement EGL_MESA_x11_native_visual_id
- egl/x11: Allow all RGB visuals to match 32-bit RGBA EGLConfigs

Robin Kertels (3):

- nvk: Enable EXT_nested_command_buffer.
- nak: Enable lowering rotate to shuffle.
- nvk: Advertise VK_KHR_shader_subgroup_rotate.

Rohan Garg (35):

- anv: refactor emit_dynamic_buffer_binding_table_entry
- isl,blorp,anv: introduce ISL_TILING_64_XE2 for Xe2+ platforms
- anv: untyped data port flush required when a pipeline sets the VK_ACCESS_2_SHADER_STORAGE_READ_BIT
- anv: factor out common code for determining surface usage from a VkDescriptorType
- anv: cleanup duplicate robustness flag calculations
- anv: add a command streamer stall on Xe2+ when switching pipelines
- intel/compiler: Xe2+ can do URB load/store with a byte offset
- anv: drop duplicated 3DSTATE_SLICE_TABLE_STATE_POINTERS emission
- anv, blorp: Set COMPUTE_WALKER Message SIMD field
- intel/genxml: update PIPE_CONTROL so that we can decode it on the CCS
- iris,anv: WA 1509820217 is no impact for Xe2+
- intel/brw: Use the dimensions supplied in the instruction
- intel/brw: Cleanup send generation
- intel/brw: Update written size depending on the LSC message
- intel/brw: Set the right cache control bits for xe2
- intel/brw: Adjust src1 length bits for xe2+
- anv,blorp: implement restrictions from WA 1406738321
- anv: 3D surfaces have fewer layers for higher miplevels
- isl: enable CCS for 3D surfaces on gen12.5 and above
- intel/brw: account for sources when determining if a operation uses half floats
- intel/brw: Xe2+ can do SIMD16 for extended math on HF types
- intel/brw: update disassembly for MATH pipe
- intel/brw: adjust the copy propgation pass to account for wider GRF's on Xe2+
- intel/brw: minor rework to de duplicate variable assignment
- intel/brw: Handle typed surface and atomic messages for xe2+
- intel/brw: Lower DWORD scattered read writes to lsc
- intel/eu/validate: Allow SIMD16 for mixed mode float operations on xe2+
- iris: slow clear higher miplevels on single sampled 8bpp resources that have TILE64
- intel/blorp: add fast clear rectangle dimensions for single sampled TILE64 CCS surfaces
- isl: allow CCS on single sampled TILE64 surfaces
- anv: Enable HiZ on multi-LOD depth buffers.
- anv: use u_foreach_bit to iterate over the the view mask like we do for transition_clear_color
- anv: formatting fix when printing pipe controls
- anv: allocate space for generated indirect draw id's using the temporary allocation helper
- Revert "iris: slow clear higher miplevels on single sampled 8bpp resources that have TILE64"

Roland Scheidegger (2):

- auxiliary/draw: fix streamout overflow calculation
- auxiliary/rtasm: fix unaligned stores

Romain Naour (1):

- glxext: don't try zink if not enabled in mesa

Ruijing Dong (6):

- radeonsi/vcn: data structure av1 enc long term reference.
- radeonsi/vcn: vcn4 av1 long term ref support
- frontends/va: get av1 encoding ref frame infos for L0.
- radeonsi/vcn: add enc surface alignment caps
- frontends/va: add surface alignment attribute
- radeonsi/vcn: update to use correct padding size.

Ryan Neph (3):

- venus: fix shmem leak on vn_ring_destroy
- virgl: use PIPE_MAX_SAMPLERS in bind_samplers_states
- venus: reclaim signal semaphore feedback resources for wasteful clients

Sagar Ghuge (28):

- intel/fs: Track instance id in gs_thread_payload
- vulkan/runtime: Track VkSharingMode in vk_image
- anv: Disable compression if we have concurrent sharing mode
- intel/compiler/xe2: Handle 6-bit message type for Gfx20+
- intel/compiler: Add texture operation lowering pass
- intel/compiler: Use nir_tex_src_backend1 to pack LOD and array index
- nir: Drop intel specific lowering code
- intel/compiler: Lower texture operation to combine LOD and AI
- intel/dev: Update max_subslices_per_slice comment
- intel/compiler: Fix disassembly of URB message descriptor on Xe2+
- anv: Drop warnings for engine initialization failure
- anv: Set timestampValidBits to 64bits
- intel/compiler: Trim vector properly till array index
- intel/compiler: Adjust sample_b parameter according to new layout
- intel/compiler: Pack LOD/bias and array index on TG4 messages
- intel/compiler: Pack texture LOD and offset to a single 32-bit value
- intel/compiler: Add helper method to decide if header is required
- intel/compiler: Add gather4_i/l/[_c]/b sampler message
- intel/compiler: Add texture gather offset LOD/Bias message support
- nir: Allow nir_texop_tg4 in implicit derivative
- intel/compiler: Enable packing of offset with LOD or Bias
- anv: Implement VK_AMD_texture_gather_bias_lod
- anv/xe: Consider pat_index while unbinding the bo
- anv: Fix typo in DestinationAlphaBlendFactor value
- anv: Use appropriate argument format for indirect draw
- isl: Update isl_swizzle_supports_rendering comment
- isl: Update shader channel select for missing components
- intel/compiler: Disassemble mlen/rlen/ex_mlen in units of registers

Saleemkhan Jamadar (1):

- radeonsi/vcn: set jpeg reg version for gfx 1151

Samuel Pitoiset (419):

- radv: constify stages in radv_rt_fill_group_info()
- radv/rt: re-use radv_ray_tracing_stage::sha1 for hashing RT pipelines
- radv: correctly return VK_ERROR_OUT_OF_DEVICE_MEMORY when mapping a BO fails
- radv/nir: pass radv_shader_stage to some radv_nir_xxx() functions
- radv/nir: remove useless struct for nir_shader typedef
- radv: remove one unused parameter in radv_fill_shader_info_ngg()
- radv: move radv_pipeline_key::mesh_fast_launch_2 to the per-device cache key
- radv: add radv_shader_stage_key to radv_shader_stage
- radv: use radv_shader_stage_key directly with pre-existing fields
- radv: add optimisations_disabled to radv_shader_stage_key
- radv: remove unecessary radv_nir_compiler_options::key
- radv: remove unused lower_rt_instruction_monolithic_state::key
- radv: stop passing the pipeline key when compiling compute/rt shaders
- radv: re-organize radv_pipeline_key
- radv: add vertex_robustness1 to radv_shader_stage_key
- radv: introduce radv_graphics_state_key
- zink/ci: skip more arb_shader_image_load_store.* on Polaris10/Navi10
- radv: add keep_statistic_info to radv_shader_stage_key
- radv: add shader_version to radv_shader_stage_key
- radv: pass radv_shader_stage_key to radv_pipeline_stage_init()
- radv: make sure to retain shaders key for imported shaders with GPL
- radv: cleanup radv_generate_pipeline_key()
- radv: add radv_pipeline_get_shader_key()
- radv/rt: cleanup radv_parse_rt_stage()
- radv: hash radv_shader_stage_key
- radv: stop hashing radv_pipeline_key for compute/rt pipelines
- radv: remove the pipeline key for compute pipelines
- radv: remove the pipeline key for ray tracing pipelines
- radv: remove an extra new line in radv_shader.h
- radv: pass radv_graphics_state_key to radv_hash_shaders()
- radv: remove radv_generate_pipeline_key()
- radv: rename radv_pipeline_key to radv_graphics_pipeline_key
- radv: delay emitting streamout enable at draw time
- aco: silent checking if clrxdisasm is available
- radv: fix indirect dispatches on the compute queue on GFX7
- radv: fix indirect draws with NULL index buffer on GFX10
- radv: remove unused parameter to gather_shader_info_mesh()
- radv: add a per-stage key field for mesh shaders with a task shader
- vulkan: bump headers/registry to 1.3.276
- lavapipe: fix build since vulkan spec update
- vulkan: promote VK_EXT_line_rasterization to KHR
- vulkan: promote VK_EXT_index_type_uint8 to KHR
- radv: add a helper for binding the custom blend mode
- radv: add a helper to get the VGT_GS_OUT value
- radv: prevent accessing NULL pipelines when emitting VBO with ESO
- radv: re-emit the TCS epilog when a new TCS is bound
- radv: enable prologs/epilogs in-memory cache for shader objects
- radv: add required NV entrypoints for VK_EXT_shader_object
- radv: initialize default dynamic state when beginning a new cmdbuf
- radv: add radv_shader_stage::next_stage field
- radv: add radv_shader_layout::dynamic_offset_count
- radv: add support for creating/destroying shader objects
- radv: make some pipeline graphics helpers non-static for ESO
- radv: add support for binding/emitting shader objects
- radv: advertise VK_EXT_shader_object on GFX6-8
- radv: advertise VK_KHR_load_store_op_none
- radv: promote VK_EXT_line_rasterization to KHR
- radv: advertise VK_KHR_line_rasterization
- radv: promote VK_EXT_index_type_uint8 to KHR
- radv: advertise VK_KHR_index_type_uint8
- radv: use device->vk.enabled_features instead of iterating twice
- radv: fix segfault when getting device vm fault info
- radv/ci: enable RADV_PERFTEST=shader_object for vkcts-polaris10-valve
- radv: refactor gfx103_pipeline_emit_vgt_draw_payload_cntl()
- radv: refactor gfx103_pipeline_emit_vrs_state()
- radv: use the non-emitted graphics pipeline for the needed dynamic states
- radv: fix the late scissor emission workaround with ESO on GFX9
- radv: set NGG fields in vgt_shader_key for ESO on GFX10+
- radv: do not ignore RADV_DYNAMIC_FRAGMENT_SHADING_RATE for ESO on GFX10.3+
- radv: emit more default states for ESO on GFX10.3+
- radv: export alpha-to-coverage via MRTZ for ESO on GFX11
- radv: fix detecting invalid binaries with ESO
- radv: fix emitting tess domain origin for merged TES+GS on GFX9
- radv: emit required programming for tess on GFX10+ in radv_emit_hw_vs()
- radv: rebind mesh/task shaders when restoring meta context
- radv: determine next stage for mesh/task with ESO
- radv: ignore unneeded dynamic states with mesh shaders and ESO
- radv: determine the last VGT api stage with mesh shaders and ESO
- radv: bind and emit mesh/task shaders with ESO
- radv: prevent crashes when a task shader is compiled unlinked with ESO
- radv: init the shader key in radv_shader_stage_init() for ESO
- radv: add support for VK_SHADER_CREATE_NO_TASK_SHADER_BIT_EXT
- radv: add a helper to know if device fault detection is enabled
- radv: refactor dumping GPU hang reports by using chunks
- radv: add support for keeping GPU hang reports in memory
- radv: export GPU hang reports through VK_EXT_device_fault
- radv: enable deviceFaultVendorBinary if RADV_DEBUG=hang is set
- radv: remove radv_graphics_state_key::dynamic_patch_control_points
- radv: determine the workgroup size for TCS earlier
- radv: set the default workgroup size for VS as LS
- radv: constify radv_device in radv_emit_shader_pointer()
- radv: check active NIR stages before trying to merge shaders on GFX9+
- radv: only merge shader info stages if both stages exist on GFX9+
- radv: rework shader arguments for separate compilation of VS+TCS on GFX9+
- radv: always mark drawid/base_instance used with ESO
- radv: force TCS stage for VS as LS compiled separately on GFX9+
- radv: always emit PGM_RSRC1_HS when emitting the TCS epilog state
- radv: add support for emitting VS+TCS compiled separately on GFX9+
- radv: do not allow to enable VK_EXT_shader_object with LLVM
- radv: add a workaround for mipmaps and minLOD on GFX6-8
- radv/sqtt: fix describing queue submits for RGP
- radv: limit maxIndirectCommandsTokenCount to 512
- radv: remove one indentation level in radv_fill_shader_info_ngg()
- radv: squash GFX10/GFX10.3 NGG restrictions in the same condition
- radv: always set GS as NGG if present on GFX11
- radv: use next_stage to determine the NGG stage
- radv: check for MESA_SHADER_TESS_EVAL in radv_fill_shader_info_ngg()
- radv: determine the ES stage earlier when processing binary config
- radv: determine the workgroup size for GS non-NGG earlier
- radv: set the default workgroup size for VS/TES as ES
- radv: change the user SGPR idx of AC_UD_TES_STATE
- radv: add a new user SGPR for the ESGS ring item size
- radv/nir: lower esgs_vertex_stride for GS compiled separately on GFX9+
- radv: rework shader arguments for separate compilation of VS+GS on GFX9+
- radv: declare streamout buffers for VS+GS compiled separately on GFX9+
- radv: force GS stage for VS as ES compiled separately on GFX9+
- radv: add support for emitting VS+GS compiled separately on GFX9+
- radv/ci: remove VKD3D_CONFIG=dxr11 for navi21/navi31
- radv: remove unused radv_indirect_command_layout::state_offset
- radv: only load 3x32-bit elements when emitting draws with mesh shader
- docs: fix RADV_DEBUG=nonggc description
- radv: add RADV_DEBUG=nongg_gs for GFX10/GFX10.3
- radv: add radv_disable_ngg_gs and enable it for Persona 3 Reload
- radv: fix RGP barrier reason for RP barriers inserted by the runtime
- radv: force GS stage for TES as ES compiled separately on GFX9+
- radv: declare streamout buffers for TES+GS compiled separately on GFX9+
- radv: declare AC_UD_TES_STATE for separate compilation of GS on GFX9+
- radv: bind the vertex input SGPR only for relevant stages
- radv: add support for emitting TES+GS compiled separately on GFX9+
- radv: allow RADV_PERFTEST=shader_object on GFX9/VEGA10
- radv/ci: enable RADV_PERFTEST=shader_object on VEGA10
- radv: cleanup radv_shader_combine_cfg_vs_tcs()
- radv: fix emitting VS prologs for merged shaders compiled separately on GFX10+
- radv: clear RADV_CMD_DIRTY_SHADERS when resetting the shader object state
- radv: clear the custom blend mode when resetting gfx pipeline state
- radv: fix re-emitting DB_RENDER_CONTROL when resetting gfx pipeline state
- radv: make sure to reset the GS copy shader with ESO
- radv: fix selecting shader variants with ESO
- radv: fix setting the rasterized primitive for ESO
- radv: enable GS_FAST_LAUNCH=2 by default for RDNA3 APUs (Phoenix)
- radv: only configure {XYZ_DIM,DRAW_INDEX}_REG for mesh shaders if enabled
- radv: re-enable GS_FAST_LAUNCH=2 by default on GFX11
- radv: stop using conditional rendering internally when preprocessing DGC
- radv: disable conditional rendering if enabled when preprocessing DGC
- radv: pass the ES stage when emitting geometry shader with ESO
- radv: determine the ES stage for merged NGG shaders compiled separately
- radv: prefix radv_vgt_shader_key::streamout with ngg
- radv: set radv_vgt_shader_key::ngg_streamout for ESO
- radv: determine the number of invocations only for VS/TES as NGG
- radv: store the number of outputs for VS/TES as NGG
- radv: use radv_shader_info for computing NGG LDS layout
- radv: fix a compilation warning in radv_bind_graphics_shaders()
- spirv: only consider IO variables when adjusting patch locations for TES
- radv: move mesh_fast_launch_2 to radv_physical_device
- radv: initialize disk cache slightly later when creating a physical device
- radv: introduce a per physical device cache key
5300- radv: introduce a per physical device cache key
5301- radv: fix binary shaders compatibility with ESO
5302- radv: fix indirect dispatches on compute queue with conditional rendering on GFX7
5303- radv: remove the union in radv_shader_object
5304- radv: fix a big memleak with VK_EXT_shader_object
5305- radv: free NIR shaders when creating linked shaders with ESO
5306- radv: simplify binding the GS copy shader with ESO
5307- radv: rename radv_emit_shaders() to radv_emit_graphics_shaders()
5308- radv: simplify emitting VGT_ESGS_RING_ITEMSIZE for ESO
5309- radv: re-emit more states when a shader compiled separately is bound
5310- radv: only enable emulated mesh/task shader invocations on GFX10.3
5311- radv: add support for mesh primitives queries on GFX11
5312- radv: add support for task shader invocations queries on GFX11
5313- radv: remove a TODO about adding mesh/task queries on GFX11
5314- radv: store/reset conditional rendering user info in the helpers
5315- radv: add support for conditional rendering on the compute queue with DGC
5316- radv: remove unused parameter in gfx10_get_ngg_query_info()
5317- radv: do not set gs.has_pipeline_stat_query twice for NGG GS
5318- radv: use so.num_outputs to determine if NGG shaders need XFB queries
5319- radv: determine NGG query info before linking shader info
5320- radv: pass gfx10_ngg_info to gfx10_get_ngg_info()
5321- radv: pass radv_shader_info to gfx10_get_ngg_info()
5322- radv: determine NGG culling info before NGG info
5323- radv: compute NGG scratch LDS base in gfx10_get_ngg_info()
5324- radv: compute the total LDS usage in gfx10_get_ngg_info()
5325- radv: disable VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 with minmax filter on GFX6
5326- radv/ci: enable RADV_PERFTEST=shader_object for vkcts-tahiti-valve
5327- radv: clean up MAX_ALLOWED_TILES_IN_WAVE programming
5328- radv: add missing RADV_DEBUG_NO_NGG_GS to the physical device cache key
5329- radv: fix conditional rendering with direct mesh+task draws and multiview
5330- radv: move conditional rendering for compute in radv_cmd_state
5331- radv: get the pipeline layout info from the push constant token with DGC
5332- radv: add a helper to calculate the compute resource limits
5333- radv: add a function to get compute pipeline metadata for DGC
5334- radv: add support for VK_PIPELINE_CREATE_INDIRECT_BINDABLE_BIT_NV
5335- radv: implement vkGetPipelineIndirectXXX() for DGC
5336- radv: implement vkCmdUpdatePipelineIndirectBufferNV()
5337- radv: implement indirect compute pipeline binds with DGC
5338- radv: handle indirect pipeline binds with scratch and DGC
5339- radv: force shader BOs to be local BOS with DGC indirect compute pipelines
5340- radv: enable deviceGeneratedComputePipelines
5341- radv: fix conditional rendering on compute queue on GFX6
5342- radv: add missing conditional rendering for indirect dispatches on GFX6
5343- radv: add a helper to emit PKT3_COND_EXEC
5344- radv: add a new user SGPR for NGG shaders compiled separately with ESO
5345- radv: lower lds_ngg_{gs_out_vertex_base,_scratch_base} with ESO
5346- radv: add support for emitting NGG shaders with ESO
5347- radv: allow RADV_PERFTEST=shader_object on GFX11
5348- radv: enable radv_zero_vram for RAGE2
5349- radv: preserve streamout_buffers user SGPR for VS/TES + GS compiled separately
5350- radv: always use ace_cs for the gang CS variable
5351- radv: refactor emitting the view index for task shaders
5352- radv: allocate a 32-bit value for the MEC fw bug with indirect mesh+task earlier
5353- radv: stop passing radv_cmd_buffer to draw functions with task shaders
5354- radv/ci: remove RT tests from the VANGOGH skip list
5355- radv/ci: remove dEQP-VK.robustness.* from the VANGOGH skip list
5356- radv: disable NGG in more situations with ESO on GFX10/GFX10.3
5357- radv: implement has_vgt_flush_ngg_legacy_bug for ESO
5358- radv: allow RADV_PERFTEST=shader_object on GFX10/GFX10.3
5359- radv,aco: stop duplicating PS/TCS epilog fields
5360- radv: add a helper to emit PS/TCS epilogs
5361- radv/ci: enable RADV_PERFTEST=shader_object for VKCTS jobs on GFX10+
5362- radv/ci: mark dEQP-VK.shader_object.binding.mesh_swap_task as flake on NAVI21
5363- radv: stop using the custom blend mode for PS epilogs
5364- radv: re-emit RB+ state with PS epilogs only when the col format changes
5365- radv: fix emitting default blend state for PS without epilogs and ESO
5366- radv: allow RADV_PERFTEST=shader_object on all GFX9 GPUs
5367- radvi/ci: enable RADV_PERFTEST=shader_object for RENOIR
5368- util/u_debug: fix parsing of "all" again
5369- ci: use Linux kernel 6.6 for RADV
5370- radv/ci: update list of flakes for VKCTS jobs
5371- zink/ci: update list of flakes for RADV jobs
5372- ac/nir: fix exporting NGG streamout outputs with implicit PrimId from VS/TES
5373- radv: fix determining if PrimId is used for merged shaders compiled separately
5374- radv: determine radv_vgt_shader_key::has_ngg_xxx with the last VGT shader
5375- radv: rework generating vgt_shader_key for pipelines
5376- radv: fix wave32 support with ESO
5377- radv: add helpers to bind the GS copy shader and the RT prolog
5378- radv: fix RADV_PERFTEST=dmashaders with ESO
5379- radv: emit VGT_GS_OUT_PRIM_TYPE as part of the dynamic primitive topology
5380- radv: disable binning correctly on GFX11.5
5381- radv: fix programming of PA_SC_BINNER_CNTL_1.MAX_ALLOC_COUNT on GFX11
5382- radv: program SAMPLE_MASK_TRACKER_WATERMARK optimally for GFX11 APUs
5383- radv: add a GPU hang workaround for legacy tess+GS for GFX10.3
5384- radv: fix occlusion queries with MSAA and no attachments
5385- radv: add radv_force_pstate_peak_gfx11_dgpu and enable it for Helldivers 2
5386- zink/ci: enable RADV_PERFTEST=shader_object for polaris10
5387- radv: add a workaround for null IBO on GFX6
5388- zink/ci: update CI lists
5389- radv: always export MRTZ in FS epilogs with ESO on GFX11
5390- radv: trigger a new PS epilog when the framebuffer is dirty with ESO
5391- zink/ci: allow RADV_PERFTEST=shader_object on NAVI31
5392- radv: invalidate L2 metadata for VK_ACCESS_2_MEMORY_READ_BIT
5393- radv: make sure to disable NGG culling with TES when the FS stage is unknown
5394- zink/ci: enable RADV_PERFTEST=shader_object for NAVI10/VANGOGH
5395- radv/rmv: add missing logging when sparse BOs are destroyed
5396- radv/rmv: add missing logging when events are destroyed
5397- radv/rmv: fix logging of per-queue destroyed BOs
5398- radv/rmv: fix logging sparse residency
5399- radv/winsys: move BO size to radeon_winsys_bo
5400- radv/rmv: remove BO size parameter in radv_rmv_log_bo_allocate()
5401- radv: make some create resources helpers static
5402- radv/rmv: remove unnecessary is_internal parameter to some helpers
5403- radv: add radv_bo_{create,destroy}() helpers
5404- radv/rmv: prevent logging BOs allocated in GDS/OA domains
5405- radv/rmv: log allocated/destroyed BOs in radv_buffer_{create,destroy}()
5406- radv: add radv_bo_virtual_bind() helper
5407- radv: fix conditional rendering with mesh+task and multiview (again)
5408- radv: remove useless RADV_DEBUG=nomemorycache
5409- radv: implement alpha-to-one
5410- radv: advertise alphaToOne
5411- radv: advertise extendedDynamicState3AlphaToOneEnable with ACO
5412- docs: add alpha-to-one features for RADV
5413- radv: rename radv_physical_device variables to pdev everywhere
5414- radv/winsys: rename gpu_info to pci_ids in the null winsys
5415- radv: rename radeon_info variables to gpu_info everywhere
5416- radv: rename radv_physical_device::rad_info to info
5417- radv: remove radv_device::physical_device
5418- radv: remove radv_queue::device
5419- radv: remove radv_cmd_buffer::device
5420- radv: remove radv_device::instance
5421- radv: remove radv_physical_device::instance
5422- radv: declare radv_cmd_update_descriptor_xxx() in radv_descriptor_set.h
5423- radv: declare format related functions in radv_formats.h
5424- radv: pass a radv_physical_device to radv_use_llvm_for_stage()
5425- radv: move radv_device_supports_etc() to radv_physical_device.c
5426- radv: move some VK_DEFINE_NONDISP_HANDLE_CASTS to radv_descriptor_set.h
5427- radv: add radv_sampler.h
5428- radv: add radv_event.h
5429- radv: add radv_buffer_view.h
5430- radv: add radv_buffer.h
5431- radv: add radv_video.h
5432- radv: add radv_image.h
5433- radv: add radv_image_view.h
5434- radv: add radv_query.h
5435- radv: add radv_perfcounter.h
5436- radv: add radv_device_generated_commands.h
5437- radv: enable radv_zero_vram for Red Dead Redemption 2
5438- vulkan/debug_utils: add a helper for reporting address binding
5439- radv: implement VK_EXT_device_address_binding_report
5440- radv: advertise VK_EXT_device_address_binding_report
5441- radv: move radv_prim_vertex_count to si_cmd_buffer.c
5442- radv: move radv_userdata_locations to radv_shader_args.h
5443- radv: move radv_shader_{layout,stage} to radv_shader.h
5444- radv: add radv_device_memory.h
5445- radv: add radv_instance.h
5446- radv: add radv_queue.h
5447- radv: add radv_physical_device.h
5448- radv: add radv_rra.h
5449- radv: add radv_device.h
5450- radv: add radv_pipeline_cache.h
5451- radv: add radv_pipeline.h
5452- radv: add radv_pipeline_compute.h
5453- radv: add radv_pipeline_rt.h
5454- radv: add radv_pipeline_graphics.h
5455- radv: add radv_wsi.h
5456- radv: add radv_sqtt.h
5457- radv: add radv_shader_object.h
5458- radv: add radv_spm.h
5459- radv: add radv_cmd_buffer.h
- radv: add radv_rmv.h
- radv: add radv_cp_reg_shadowing.h
- radv: add radv_printf.h
- radv: move radv_get_tdr_timeout_for_ip() to radv_query.h
- radv: move radv_queue_ring() to radv_queue.c
- radv: add radv_nir_to_llvm.h
- radv: add radv_android.h
- radv: add radv_shader_info.h
- radv: move CP DMA related code to radv_cp_dma.c/h
- radv: move more cmd buffer related code to radv_cmd_buffer.c
- radv: merge radv_write_guardband() with radv_emit_guardband_state()
- radv: merge radv_write_scissors() with radv_emit_scissor()
- radv: move radv_get_viewport_xform() to radv_pipeline_graphics.c
- radv: move radv_create_gfx_config() to radv_device.c
- radv: move radv_emit_{compute,graphics}() to radv_queue.c
- radv: move code related to sample positions to radv_device.c
- radv: rename si_cmd_buffer.c to radv_cs.c
- radv: remove unused radv_printflike()
- radv: remove pre-declarations needed for WSI entrypoints
- radv: remove remaining forward declarations and comments in radv_private.h
- radv: replace RADV_FROM_HANDLE by VK_FROM_HANDLE
- radv: add missing endif comment for some headers
- radv: rename remaining phys_dev occurrences to pdev
- radv: replace radv_minify() by u_minify()
- radv: replace align_{u32,u64}() by align{64}()
- radv: replace align_u32_npot() by ALIGN_NPOT
- radv: replace radv_float_to_{u,s}fixed() by util_{un}signed_fixed()
- util: add util_is_aligned()
- radv: replace radv_is_aligned() by util_is_aligned()
- radv: move RADV_SUPPORT_CALIBRATED_TIMESTAMPS to radv_physical_device.c
- radv: move RADV_API_VERSION to radv_instance.h
- radv: move CLOCK_MONOTONIC_RAW define to radv_physical_device.h
- radv: move RADV_USE_WSI_PLATFORM define to radv_wsi.h
- radv: remove radv_private.h
- radv: make radv_get_vgt_index_size() static
- radv: move radv_get_user_sgpr() to radv_shader.c
- radv: move radv_queue_family_to_ring() to radv_queue.c
- radv: remove old comment in radv_cs.c
- radv: move radv_printf_data to radv_printf.h
- radv: make sure the heap budget is less than or equal to the heap size
- radv: use SPDX-License-Identifier
- radv: enable VK_EXT_shader_object by default
- aco: use SPDX-License-Identifier
- ci: uprev vkd3d-proton to c3b385606a93baed42482d822805e0d9c2f3f603
- docs: mark VK_KHR_maintenance6 as DONE for RADV
- radv: determine if the cache is disabled at device creation time
- radv: add skip_shaders_cache also for compute/rt pipelines
- radv: stop using a graphics pipeline for generating the graphics key
- radv/rt: constify device in radv_init_rt_stage_hashes()
- radv/rt: handle creation feedback like graphics/compute pipelines
- radv/rt: stop passing pCreateInfo to radv_ray_tracing_pipeline_cache_search()
- radv/rmv: fix missing image bind logging for WSI images
- radv: fix missing addr binding report for WSI image binds
- radv: fix addr binding report for disjoint image binds
- radv/rmv: fix image binds logging for disjoint images
- radv: add a helper to set image bindings
- radv: fix missing unbind report when an image is destroyed
- radv: fix missing unbind report when a buffer is destroyed
- radv/rt: remove dead code about intersection shaders in radv_pipeline_get_shader_key()
- radv: add a helper for hashing pipelines
- radv: rework and add a helper for hashing a compute pipeline
- radv: stop ignoring shader stages that don't need to be imported with GPL
- radv: add missing SQTT markers when an indirect indexed draw is used with DGC
- radv/rt: use radv_pipeline_hash_shader_stage()
- radv/rt: stop computing unused hash for the traversal shader
- radv: use canonicalized VA for VM fault reports
- radv: simplify importing pipeline layout with GPL
- radv: return early when PS is NULL in radv_pipeline_init_blend_state()
- radv: simplify checking for PS epilogs in radv_pipeline_init_blend_state()
- radv: remove unused parameter in radv_skip_graphics_pipeline_compile()
- radv: simplify the check for exporting multiview in the last VGT stage
- radv/rt: remove unnecessary pipeline parameter to radv_rt_fill_group_info()
- radv/rt: remove unnecessary pipeline parameter to radv_generate_rt_shaders_key()
- radv/rt: initialize shader group capture/replay in a separate function
- radv/rt: rework handle_from_stages to pass hashes directly
- radv/rt: insert shaders to cache right after they are compiled
- radv/rt: add radv_rt_pipeline_compile()
- radv: clear color attachments without exports before compaction
- ci: uprev CTS to vulkan-cts-1.3.8.0
- radv/ci: add one more flake since CTS 1.3.8.0 for RENOIR
- radv/ci: update lists for TAHITI and Zink/Polaris10
- radV/ci: reduce the parallelism of navi21 to 3
- radv: fix waiting for occlusion queries on GFX6-8
- radv: return per plane requirements for disjoint images
- zink/ci: update CI lists since piglit uprev
- radv/rt: remove unnecessary param to radv_ray_tracing_pipeline_cache_insert()
- radv/rt: move radv_ray_tracing_pipeline::sha1 to radv_pipeline
- radv: use radv_pipeline::sha1 for graphics/compute pipelines
- radv: rework pipeline cache search helpers
- radv: add RADV_DEBUG=psocachestats to report per-pipeline cache hits/misses
- vulkan: pass cmdbuf level to vk_command_buffer_ops::create()
- radv/amdgpu: do not use IB2 for nested command buffers
- radv: track if nested command buffers uses indirect draws
- radv: advertise VK_EXT_nested_command_buffer
- ac,radeonsi: add helpers to compute the number of tess patches/lds size
- radv: rework the number of tess patches computation
- ac: allow to use 64K of LDS for tessellation on GFX9+
- ci: uprev CTS to 1.3.8.2
- radv: fix image format properties with fragment shading rate usage
- radv: remove bogus VkShaderCreateInfoEXT::flags being 0 assert for compute
- radv: allow 3d views with VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT
- radv: mark some formats as unsupported on GFX8/CARRIZO
- radv: set image view descriptors as buffer for non-graphics GPU
- radv: only set ALPHA_IS_ON_MSB if the image has DCC on GFX6-9

Sathishkumar S (5):

- ac/gpu_info: query the number of ip instance
- radeonsi/vcn: avoid hard-coding the number of jpeg instance
- ac/gpu_info: fix regression in vulkan hw decode
- radeonsi/vcn: use num_instances from radeon_info
- ac/gpu_info: update multimedia info

Sean Anderson (2):

- gallium: lima: Don't round height when checking alignment
- Add Xilinx ZynqMP KMSRO entrypoint

Sebastian Wick (4):

- radeonsi: Destroy queues before the aux contexts
- util: Add timespec_sub_saturate to avoid negative time for deadlines
- loader/wayland: Add fallback wl_display_dispatch_queue_timeout
- vulkan/wsi/wayland: Use dispatch_queue_timeout in acquire_next_image

Sergi Blanch Torne (10):

- ci: disable Collabora's farm due to maintance
- Revert "ci: disable Collabora's farm due to maintance"
- ci: disable Collabora's farm due to maintance
- Revert "ci: disable Collabora's farm due to maintance"
- ci: Nightly run expectations update
- ci: disable Collabora's farm due to maintance
- Revert "ci: disable Collabora's farm due to maintance"
- Uprev Piglit to f7ece74a107a2f99b2f494d978c84f8d51faa703
- ci: kernel stored in a different s3 bucket
- ci: identify and label S3 buckets

Shih, Jude (4):

- amd/vpelib: Need a debug flag to support 2tap downscaling
- amd/vpelib: Add VPE prefix on API to avoid naming conflict
- amd/vpelib: Solve the downscaling problem for 2 tap
- amd/vpelib: Solve link error due to missing static for one function

Sil Vilerino (25):

- d3d12: Do not assume multi-subregion support when querying for driver encode support
- d3d12: Implement cap for PIPE_VIDEO_CAP_ENC_INTRA_REFRESH
- d3d12: Clean up H264 video decode interlaced code path
- frontends/va, d3d12: Fix PIPE_VIDEO_SLICE_MODE_MAX_SLICE_SICE -> PIPE_VIDEO_SLICE_MODE_MAX_SLICE_SIZE typo
- d3d12: d3d12_video_encoder_negotiate_current_h264_slices_configuration to use correct mode when intra-refresh is on
- d3d12: Do not use PIPE_BIND_DISPLAY_TARGET for d3d12_video_buffer
- d3d12: AV1 encode - Configure CQP using qp and new qp_inter parameters
- d3d12: H264 encode - Update CQP using current frame type as per VA frontend change
- d3d12: HEVC encode - Update CQP using current frame type as per VA frontend change
- frontend/va: Support media only post proc without compositor using shaders or surfaces
- frontend/va: Use get_resources in VaDeriveImage for media only devices without get_surfaces support
- d3d12: Add partial media, compute, graphics support with CORE and GENERIC feature levels
- d3d12: Refactor graphics functions from context and blit to separate files
- d3d12: Add GetDesc wrapper for ID3D12Heap
- d3d12: Only check D3D12_FEATURE_DATA_PLACED_RESOURCE_SUPPORT_INFO for D3D_FEATURE_LEVEL_1_0_GENERIC
- d3d12: Video Encode - Add driver workaround for rate control reconfiguration
- d3d12: Implement PIPE_VIDEO_CAP_ENC_SURFACE_ALIGNMENT
- d3d12: Bump directx-headers dependency to v613
- d3d12: Support H264 slice L0/L1 active number override
- d3d12: Support HEVC slice L0/L1 active number override
- d3d12: Fix leak of batch->bos on video-only builds
- d3d12: Fix leak dxil_module::serialized_dependency_table
- d3d12: Fix util_blitter_destroy destruction ordering
- vl_win32_screen_create: Take ownership of winsys injected to created d3d12_screen
- d3d12/ci: Add vainfo with appverifier CI check

Simon Ser (2):

- egl/wayland: ensure wl_drm is available before use
- egl/wayland: explain why implicit modifier downgrade is allowed

Stéphane Cerveau (1):

- vulkan/video: hevc: b-frames can be reference or not

Surafel Assefa (1):

- radeonsi: Adds return on failure to get plane info

Sviatoslav Peleshko (6):

- nir: Use alu source components count in nir_alu_srcs_negative_equal
- anv,driconf: Add sampler coordinate precision workaround for AoE 4
- driconf: Apply dual color blending workaround to Dying Light
- anv: Store host-located copy of NULL surface state for faster memcpy
- dri: Flush the context after flush_resource when creating shareable image
- anv: Fix descriptor sampler offsets assignment

Tapani Pälli (35):

- hasvk: remove cmd_buffer_ray_query_globals function decl
- hasvk: remove gfx9 specific code from emit_sample_pattern
- hasvk: remove softpin (GFX_VERx10 >= 90) related code
- hasvk: remove gfx9 specific cs stall from emit_ps_depth_count
- anv: check for wa 16013994831 in emit_so_memcpy_end
- iris: expand pre-hiz data cache flush to gfx >= 125
- anv: expand pre-hiz data cache flush to gfx >= 125
- iris: replace constant cache invalidate with hdc flush
- anv: move \*bits_for_access_flags to genX_cmd_buffer
- anv: use workaround framework for Wa_22018402687
- intel/blorp: add a TODO note about stencil buffer resolve
- intel: refactor urb configuration, add intel_urb_config
- intel/common: provide a helper for urb setup comparison
- blorp/crocus: refactor blorp_emit_urb_config
- iris: implement Wa_16014912113
- anv: implement Wa_16014912113
- blorp: implement Wa_16014912113 callback for drivers
- anv: flush tile cache independent of format with HIZ-CCS flush
- anv: revert cache flushing changes for indirect commands
- intel/blorp: disable use of REP16 independent of format
- iris: make sure DS and TE are sent in pairs on >= gfx125
- iris: make sure aux is disabled for external objects
- anv: make sure aux is disabled for memory objects
- hasvk: make sure aux is disabled for memory objects
- crocus: make sure aux is disabled for memory objects
- anv: use workaround framework for Wa_16013000631
- anv: setup distribution granularity with Wa_14019166699
- iris: refactor function that checks primitive id usage
- iris: setup distribution granularity with Wa_14019166699
- anv: disable fcv optimization on >= gfx125
- intel/blorp: remove unused blorp batch flag
- intel/compiler: add assert for Wa_22017182272
- anv: add dirty tracking for push constant data
- iris: change stream uploader default size to 2MB
- anv: skip gfx push constants alloc optimization on gfx9/11

Tatsuyuki Ishi (8):

- radv: Recompute max_waves after postprocessing RT config
- radv: never set DISABLE_WR_CONFIRM for CP DMA clears and copies
- util: Optimize mesa_hex_to_bytes
- radv: Add radv_spirv_to_nir_options that summarize early gfx states.
- radv: Rename cache_search_nir to cache_lookup_nir_handle.
- radv: Re-sort RADV_PERFTEST env vars in docs
- radv: Implement NIR caching behind RADV_PERFTEST=nircache.
- radv: Remove radv_queue::device again

Teng, Jin Chung (1):

- d3d12: HEVC Encode - Query slice config mode based on user slice setting

Thomas H.P. Andersen (6):

- nvk: promote VK_EXT_index_type_uint8 to KHR
- nvk: promote VK_EXT_line_rasterization to KHR
- nvk: promote load_store_op_none to KHR
- docs: update features.txt for nvk
- nvk: support driconf option force_vk_vendor
- driconf: override vendor id for X4 Foundations on NVK

Thong Thai (2):

- radeonsi/vcn: remove EFC support for renoir
- frontends/va/postproc: do not use efc if image is to be translated

Timothy Arceri (78):

- glsl: add nir version of validate_geometry_shader_emissions()
- glsl: use nir version of geom stream validation
- glsl: remove now unused GLSL IR validate_geometry_shader_emissions()
- glsl: don't tree graft globals
- Revert "ci: Enable GALLIUM_DUMP_CPU=true only in the clang job"
- glsl: add basic params for AMD_gpu_shader_half_float extension
- glsl: add half float support to the parser
- glsl: add explicit half float conversion support
- glsl: update assert to allow for half float support
- glsl: add glsl_type_is_float_16() helper
- glsl: add implicit half float conversions
- glsl: add ubo packing support for half floats
- glsl: skip conversion of half float back to float for GL queries
- glsl: add some new helpers for half float builtin functions
- glsl: add half float angle and trigonometry functions
- glsl: add half float exponential functions
- glsl: add f2f16() helper to ir_builder
- glsl: add half float support for common functions
- glsl: add support for half float packing functions
- glsl: add half float geometric functions
- glsl: add half float matrix functions
- glsl: add half float vector relational functions
- glsl: allow half float varyings
- glsl: add half float interpolation functions
- glsl: add half float derivative functions
- glsl: add half float AMD_shader_trinary_minmax functions
- compiler/types: Add a contains_32bit helper
- gallium: add PIPE_CAP_FP16 for AMD_gpu_shader_half_float
- glsl: add missing error check for half float varying
- nir: allow gather info to handle nir_deref_type_array_wildcard
- glsl: support array wildcards in lower named interface blocks
- glsl: split var copies before lowering named interfaces
- glsl: fix potential crash in expression flattening
- glsl: move some lowering to the compiler
- glsl_to_nir: merge function param handling
- glsl_to_nir: support conversion of struct/array function params
- glsl_to_nir: support conversion of struct/array function returns
- glsl_to_nir: support conversion of opaque function params
- glsl: don't inline functions in glsl ir
- nir: add some nir_parameter fields
- glsl: add missing define to linker_util.h
- glsl: add nir version of function recursion detection
- glsl: move function inlining out of glsl_to_nir()
- glsl: make use of nir recursion detection
- glsl: implement nir version of lower discard flow
- glsl: make use of nir lower discard flow
- glsl: remove now unused glsl ir lower discard pass
- glsl: make an explicitly safe version of visit_exec_list()
- glsl_to_nir: never convert instructions after jump
- glsl: remove unrequired do_lower_jumps() call
- glsl: move invariant builtin validation to the nir linker
- nir: add max_array_access data field
- nir: add implicit_sized_array data field
- glsl: add resize_tes_inputs() to the nir linker
- nir: add variable field from_ssbo_unsized_array
- glsl: don't remove redefined per vertex block
- glsl: add nir implemenation of block validation
- glsl: switch to NIR block validation
- glsl: call new nir resize_tes_inputs() pass
- glsl: remove now unused resize_tes_inputs()
- glsl: remove now unused glsl ir block validation
- glsl: move some linking calls to gl_nir_link_glsl()
- glsl: switch verify_subroutine_associated_funcs() to nir
- nir: add subroutine fields to nir_function
- glsl: move link_assign_subroutine_types() to the nir linker
- glsl: move check_explicit_uniform_locations() to NIR linker
- glsl: move mode_string() to helper
- glsl: add some data members to nir_variable
- glsl: make validate_intrastage_arrays() usable across files
- glsl: move cross_validate_uniforms() to the nir linker
- glsl: use shader info to store gs verts
- glsl: use info from shader when linking
- glsl: move validate_{stage}_shader_executable() to the nir linker
- glsl: remove now unused do_dead_functions()
- glsl: remove FragDepthLayout field
- glsl: remove ActiveStreamMask field
- glsl: remove UsesEndPrimitive field
- glsl: inline _mesa_copy_linked_program_data()

Timur Kristóf (121):

- radv: Correctly select SDMA support for PRIME blit.
- nir: Fix divergence of reductions.
- nir: Fix divergence analysis of load_patch_vertices_in.
- nir: Cleanup divergence analysis for mesh shaders.
- nir: Clean up divergence analysis for TES patch input loads.
- aco: Eliminate SCC copies when possible.
- radv: Lower mesh shader draw ID to zero when they have a task shader.
- radv: Extract input and output stride info to new functions.
- radv: Use mapped driver locations for determining I/O strides.
- aco: Allow passing constant operand to is_overwritten_since.
- radv/llvm: Remove dead code.
- radv: Allow NGG culling with LLVM.
- compiler: Add helper for counting tess level components.
- ac/nir/tess: Always record tess level info and use it at the end.
- ac/nir/tess: Don't record mapped tess level location.
- ac/nir/tess: Split tess factor write into multiple functions.
- ac/nir/tess: Emit tess factor output independently of whether it can be passed by registers.
- ac/nir/tess: Refactor how the end of HS is emitted.
- aco: Use common helper for counting tess level components.
- aco: Use tess factors when TCS jumps to epilog.
- radv: Declare tess_lvl_in/out args for TCS epilogs.
- radv: Always pass tess factors to epilogs in registers.
- radv, aco: Delete now dead TCS epilog code.
- nir: Add two new AMD specific tess intrinsics.
- radeonsi: Implement new intrinsics for monolithic shaders.
- radv: Copy TES primitive mode to TCS info.
- radv: Implement new tess intrinsics.
- radv: Call nir_opt_dead_cf in radv_optimize_nir_algebraic.
- ac/nir/tess: Emit tess factor stores based on new intrinsics.
- radv: Completely delete TCS epilogs.
- radv, aco: Remove the code that jumped to RADV's TCS epilogs.
- ac/llvm, radeonsi: Handle tess_rel_patch_id in common code.
- radeonsi: Put HS output count in TCS offchip layout, not patch data offset.
- radeonsi: Implement dynamic TCS intrinsics for non-monolithic shaders.
- radeonsi: Delete TCS epilogs entirely.
- aco: Delete all TCS epilog code.
- radeonsi: Add number of VS outputs to TCS output layout.
- radeonsi: Remove tess bits from VS state.
- radeonsi: Use one more bit for number of patches in TCS offchip layout.
- ac/nir/tess: Remove dead code that was meant for epilogs.
- radv: Add number of LS and HS outputs to tcs_offchip_layout.
- radv: Change input patch size in TCS offchip layout to match RadeonSI.
- radv: Change number of patches in TCS offchip layout to match RadeonSI.
- radv: Include output patch size in TCS offchip layout.
- radv: Reuse TCS offchip layout between TCS and TES.
- nir/gather_info: Record per-primitive outputs without variables.
- nir: Record per-primitive inputs without variables.
- nir/recompute_io_bases: Sort per-primitive PS inputs last.
- ac/nir: Introduce ac_nir_calc_io_offset_mapped.
- ac/nir/tess: Load tess factors from variable when they are passed in registers.
- ac/nir/tess: Clarify when a TCS output is stored in LDS or VRAM.
- ac/nir/tess: Return undef when loading an unwritten TCS output.
- ac/nir/tess: Map TCS LDS IO locations without gaps.
- ac/nir/tess: Calculate reserved LDS outputs based on IO info.
- ac/nir/tess: Remove superfluous args for reserved TCS outputs.
- ac/nir/tess: Clarify when VS-TCS I/O can use registers.
- radv: Only add extra dword to LS-HS stride when there are LS outputs.
- radv: Pass key structures to gather intrinsic info.
- radv: Extract gather_load_vs_input_info function.
- radv: Slightly refactor gather_intrinsic_store_output_info.
- radv: Record PS input clip/cull mask instead of number.
- radv: Use NIR IO semantics to determine GS output info.
- radv: Add helper for determining per-attribute vertex buffer descriptors.
- radv: Add helper to determine usage of VS prologs.
- radv: Remove unused VS input usage mask.
- radv: Use NIR IO semantics to determine VS input info.
- radv: Use IO semantic location for shader output info.
- aco/optimizer_postRA: Remove a check from SCC no-compare optimization.
- radv: Use NIR IO semantics to determine FS input info.
- radv: Remove I/O variables after nir_lower_io.
- radv: Slightly refactor the determination of max_ps_params.
- radv: Increase maximum allowed PS params for enabling NGG culling.
- radv: Remove unused gfx_level from gfx10_emit_ge_pc_alloc.
- ac/nir/ngg: Don't create dummy output variable for primitive ID.
- ac/nir/ngg: Use IO semantics for determining instance rate inputs.
- ac/nir/ngg: Rename confusing driver_location variable in mesh shader lowering.
- radv: Use NIR IO semantics for VS input location mapping.
- radv: Don't set driver locations for mesh shaders.
- radv: Don't set driver locations for FS outputs.
- radv: Don't set driver locations for last pre-rasterization stage.
- radv: Keep track of TCS outputs that need LDS.
- radv: Remove dead code for creating per-patch IO mask.
- radv: Add radv_gather_unlinked_io_mask to shader info header.
- radv: Always use fixed I/O locations for TCS outputs in VRAM.
- radv: Clean up gathering linked I/O info.
- nir/print: Print per-primitive and explicit strict IO info.
- nir/recompute_io_bases: Fix per-primitive inputs.
- nir/gather_info: Clear per-primitive I/O masks at the beginning.
- nir/lower_io_to_scalar: Support explicit (and per-vertex) FS inputs.
- nir/lower_io_to_scalar: Support per-primitive outputs.
- nir/opt_varyings: Allow optimizing primitive ID for MS -> FS.
- nir/opt_varyings: Support per-primitive I/O.
- nir/opt_varyings: Fix explicit and per-vertex FS inputs.
- nir/opt_varyings: Add early return when producer stage is task.
- nir/opt_varyings: Only propagate constant MS outputs, not other uniforms.
- nir/opt_varyings: Debug print during relocate_slot.
- nir/opt_varyings: Fix relocate_slot so it doesn't mix up 32-bit and 16-bit I/O.
- nir/opt_varyings: Add workaround for RADV mesh shader multiview.
- ac/nir/ngg: Remove support for loading mesh shader outputs.
- ac/nir/ngg: Refactor MS primitive indices for scalarized IO.
- ac/nir/ngg: Slightly refactor mesh shader cull flag stores.
- ac/nir/ngg: Use just one IO semantics variable in MS output store.
- ac/nir/ngg: Refactor update_ms_output_info.
- ac/nir/ngg: Refactor MS output store into two functions.
- ac/nir/ngg: Split 16-bit MS output stores by components.
- ac/nir/ngg: Enable packing 16-bit mesh shader outputs.
- radv: Run DCE before deleting I/O variables.
- radv: Only consider interpolated inputs as 16-bit float.
- radv: Refactor emitting PS input types.
- radv: Remove superfluous bool arg from slot_to_ps_input.
- radv: Allow using high 16 bits of PS input slots.
- radv: Rename per_vertex_shaded_mask to explicit_strict_shaded_mask.
- radv: Rename LDS related variables in get_tcs_num_patches.
- radv: Calculate VRAM tess patch size independently of LDS size.
- ac/nir/tess: Split I/O mapping to two functions.
- ac/nir/tess: Use LDS IO mapping when loading tess levels from LDS.
- ac/nir/ngg: Implement packed 16-bit VS/TES outputs in non-dedicated slots.
- ac/nir/ngg: Implement packed 16-bit GS outputs in non-dedicated slots.
- ac/nir/lower_legacy_vs: Implement packed 16-bit VS/TES outputs in non-dedicated slots.
- ac/nir/lower_legacy_gs: Implement packed 16-bit GS outputs in non-dedicated slots.
- ac/nir/ngg: Fix packing 16-bit MS outputs.

Tomeu Vizoso (20):

- ci: disable Igalia farm
- gallium/util: Fix pipe_buffer_copy
- mesa: Import TensorFlow Lite headers
- teflon: Initial commit
- etnaviv: Update headers from rnndb
- etnaviv: Add a bunch of new params for NPUs
- etnaviv: Don't emit boilerplate for compute only contexts
- etnaviv: Use NN cores to accelerate convolutions
- etnaviv: Use TP cores to accelerate tensor transformations
- teflon: Add table with known supported models to docs
- etnaviv: Don't init the blitter in compute-only contexts
- etnaviv/nn: Implement zero run length encoding of weights
- teflon: Enable convolutions with number of output channels not divisible by 8
- etnaviv/nn: Ensure tile_y is > 0
- etnaviv/nn: Fix calculation of remaining out channels
- etnaviv/nn: Move unused field to its right place in the struct
- etnaviv/nn: Enable image cache
- etnaviv/nn: Don't shortcut ZRL bits calculation
- etnaviv/nn: Keep track of the sign bit when decrementing to zero
- etnaviv/nn: Make parallel jobs disabled by default

Tranquillity Codes (1):

- intel: Skip ioctls for querying device info when hardware is unsupported

Valentine Burley (27):

- tu: Promote VK_EXT_index_type_uint8 to KHR
- tu: Promote VK_EXT_load_store_op_none to KHR
- tu: Promote VK_EXT_line_rasterization to KHR
- docs: Update features.txt for anv, nvk and tu
- nvk: Enable VK_KHR_shader_subgroup_uniform_control_flow
- nvk: Advertise VK_KHR_vertex_attribute_divisor
- nvk: Reorder device features
- tu: Implement VK_KHR_map_memory2
- tu: Advertise VK_KHR_vertex_attribute_divisor
- tu: Reorder device features
- nvk: Fix missing implementation of creating images from swapchains
- nvk: Expose VK_EXT_display_control
- nvk: Expose VK_EXT_surface/swapchain_maintenance1
- nvk: Expose VK_EXT_swapchain_colorspace
- docs/features: Add missing VK_EXT_surface/swapchain_maintenance1 entry
- tu/rmv: Remove tu_rmv_DebugMarkerSetObjectNameEXT
- nvk: Trivially expose three VK_GOOGLE extensions
- tu: Expose VK_KHR_surface_protected_capabilities
- tu: Trivially expose three VK_GOOGLE extensions
- docs: Update features.txt for tu
- docs: Update features.txt and new_features.txt for anv and nvk
- nvk: Add support for version 2 of all descriptor binding commands
- tu: Move tu_BindImageMemory2() to tu_image.cc
- tu: Replace TU_HAS_SURFACE with TU_USE_WSI_PLATFORM
- tu: Fix missing implementation of creating images from swapchains
- tu: Replace TU_FROM_HANDLE with VK_FROM_HANDLE
- drm-shim: Stub syncobj reset ioctl

Vasily Khoruzhick (4):

- lima: ppir: always use vec4 for output register
- lima: ppir: use dummy program if FS has empty body
- lima: gpir: abort compilation if load_uniform instrinsic src isn't const
- lima: update expected CI failures

Vignesh Raman (5):

- ci: Add kmod
- ci: disable Collabora's farm due to maintenance
- Split debian-build-testing job
- ci: Implement support for replaying ANGLE restricted traces
- ci: handle missing dri libraries during listing

Vinson Lee (2):

- intel/disasm: Remove duplicate variable reg_file
- intel/clc: Fix file descriptor leak

Visan, Tiberiu (1):

- amd/vpelib: revert SRGB to 709

Vlad Schiller (2):

- pvr: Implement VK_EXT_memory_budget
- pvr: Implement VK_KHR_index_type_uint8

Yifan Zhang (2):

- amd: Add code to enable gfx11.5.1
- radv: initialize video decoder for GFX11.5.1

Yiwei Zhang (105):

- venus: avoid redundant layout transition for optimal internal layout
- venus: populate oom from ring submit alloc failures
- vulkan/wsi/wayland: fix returns and avoid leaks for failed swapchain
- venus: ensure object id is unique
- venus: fix pipeline layout lifetime
- venus: drop some redundant comment
- venus: fix pipeline derivatives
- venus: fix to respect the final pipeline layout
- venus: allow tls ring submission to utilize the entire ring shmem
- venus: default to enable GPL
- venus: force async pipeline create on threads creating descriptor pools
- venus: use obj handle instead of id in device memory report
- anv: refactor wsi_memory_allocate_info handling
- anv: optimize the implicit fencing support of external memory
- anv: extend implicit fencing support for case requiring implicit write
- vulkan/util: drop redundant code gen from vk_extensions_gen.py
- vulkan/runtime: refactor to use DETECT_OS_ANDROID instead of ANDROID
- v3dv: refactor to use DETECT_OS_ANDROID instead of ANDROID
- venus: refactor to use DETECT_OS_ANDROID instead of ANDROID
- hasvk: refactor to use DETECT_OS_ANDROID instead of ANDROID
- anv: refactor to use DETECT_OS_ANDROID instead of ANDROID
- radv: refactor to use DETECT_OS_ANDROID instead of ANDROID
- turnip: refactor to use DETECT_OS_ANDROID instead of ANDROID
- egl: refactor to use DETECT_OS_ANDROID instead of ANDROID
- gallium: refactor to use DETECT_OS_ANDROID
- util: refactor to use DETECT_OS_ANDROID
- meson: drop -DANDROID
- venus: update tracepoints to align with later optimizations
- venus: fix the cmd stride used for qfb recording
- venus: rewrite fence feedback interception to minimize batches
- venus: refactor to add vn_cached_storage
- venus: use vn_cached_storage for vn_queue_submission allocs
- venus: misc cleanups for queue submission
- venus: simplify feedback types tracking during submission
- venus: massive feedback renamings for consistency and clarity
- venus: refactor to add vn_queue_submission_setup_batch
- venus: simplify to drop the struct vn_feedback_cmds accessor
- venus: refactor semaphore feedback
- venus: add vn_set_temp_cmd helper to initialize feedback batch cmd
- venus: fix to ensure sfb cmds can get recycled
- venus: mandate a few venus capsets long required before 1.0
- venus: sync protocol for VK_KHR_fragment_shading_rate
- venus: add VK_KHR_fragment_shading_rate
- vulkan: fix runtime libraries' dep against generated headers
- venus: fix ffb batch prepare for a corner case and avoid a memcpy UB
- vulkan: remove unused wsi_common_entrypoints include and dep
- vulkan: properly ensure wsi_entrypoints header gen order
- vulkan: remove header files from lib source files
- vulkan: refactor the runtime header gen order dependency
- anv/hasvk: default image_read_without_format to true
- venus: qfb to track cmd handle directly
- venus: combine query record and reset
- venus: massive qfb renamings
- venus: minor cmd count related refactors
- venus: drop vn_get_temp_cmd_ptr
- venus: simplify vn_cmd_reset and apply more code sharing
- venus: refactor query record recycle
- venus: rewrite qfb vn_feedback helpers
- venus: refactor vn_queue_submission_add_query_feedback
- venus: add vn_queue_submission_get_resolved_query_records
- venus: optimize to further batch query records
- venus: roundtrip now belongs to ring
- venus: minor naming cleanups
- venus: ensure shmem is attached to renderer before use for guest vram
- venus: avoid excessive ring notifications
- venus: further reduce idle timeout from 5ms to 1ms
- venus: add enum vn_relax_reason
- venus: avoid constant busy wait for query result waiting
- venus: deprecate unused perf env vars
- venus: decorate cmd enqueue macro internals with compiler hints
6090- venus: add a more relaxed polling strategy
6091- venus: cleanup 2 TODOs from 1.3 support
6092- venus: remove obsolete TODOs
6093- venus: use STACK_ARRAY to simplify modifier query
6094- venus: use STACK_ARRAY to simplify BindBufferMemory2
6095- venus: use STACK_ARRAY to simplify BindImageMemory2
6096- venus: use STACK_ARRAY to simplify render pass creation
6097- venus: use STACK_ARRAY to simplify physical device enumeration
6098- venus: use STACK_ARRAY to simplify set layout creation
6099- venus: use STACK_ARRAY to simplify sync wait
6100- venus: rely on enum vn_descriptor_type for internal trackings
6101- venus: move async_set_allocation check outside helpers
6102- venus: set alloc to skip earlier for reserved and invalid bindings
6103- venus: optimize mutable state restore
6104- venus: misc set alloc and cleanup refactors
6105- venus: drop vn_should_sanitize_descriptor_set_writes
6106- venus: refactor descriptor set update and push
6107- venus: use STACK_ARRAY to simplify descriptor set update and push
6108- venus: use more relaxed profile for TLS ring seqno wait
6109- venus: avoid the redundant template entry
6110- venus: fix to drop an extra ;
6111- venus: simplify push descriptor update with template
6112- venus: optimize set update template data population
6113- venus: simplify need and ignore rules for desc image info
6114- venus: use STACK_ARRAY to simplify set template update and push
6115- venus: clean up legacy descriptor update template bits
6116- venus: fix swapchain image memory bind
6117- venus: fix VkDeviceGroupSubmitInfo::deviceMask for feedback cmds
6118- venus: avoid client allocators for ring internals
6119- venus: fix to destroy all pipeline handles on early error paths
6120- turnip: msm: clean up iova on error path
6121- turnip: msm: fix racy gem close for re-imported dma-buf
6122- turnip: virtio: fix error path in virtio_bo_init
6123- turnip: virtio: fix iova leak upon found already imported dmabuf
- turnip: virtio: fix racy gem close for re-imported dma-buf

Yogesh Mohan Marimuthu (6):

- winsys/amdgpu: sws instead of ws for amdgpu_screen_winsys
- winsys/amdgpu: rws instead of ws for radeon_winsys
- winsys/amdgpu: aws instead of ws for amdgpu_winsys
- winsys/amdgpu: use _destroy_locked() for failure to create winsys
- winsys/amdgpu: remove tab space
- winsys/amdgpu: add more comments for winsys create in header file

Yonggang Luo (37):

- util: Add function util_is_power_of_two_nonzero_uintptr and macro IS_POT_NONZERO
- asahi,panfrost: Use IS_POT_NONZERO to replace util_is_power_of_two_nonzero for different size
- treewide: Use util_is_power_of_two_nonzero{64|_uintptr} when needed
- svga: Cleanup duplicate ALIGN macro defines
- nouveau: Use align64 instead of ALIGN over input layer_size_B
- treewide: Use align64 instead of ALIGN for 64 bit value parameter
- util: Update ALIGN prototype to match align
- compiler/spirv: The spirv shader is binary, should write in binary mode
- compiler/spirv: There is not need unqualify const in function vtn_string_literal
- compiler/spirv: vtn_add_printf_string support for handling OpBitcast
- zink: Update zink-anv-tgl flakes
- treewide: Remove vulkan/runtime vulkan/util prefix in include path
- freedreno/vulkan: Use vk_dynamic_graphics_state_init instead of direct assignment
- vulkan/runtime: Mark vk_default_dynamic_graphics_state to be private
- Revert "meson/vulkan/util: allow venus to drop compiler deps"
- vulkan: allow building venus without libcompiler
- glx: Remove DEBUG code in xfont.c
- panfrost/shared: avoid use gallium helper in pan_minmax_cache.*
- panfrost/meson: remove redundant gallium include from meson files
- treewide: Replace the invalid usage #if DEBUG with #ifdef DEBUG
- util: Cleanup strtod.(h|c) by introduce _mesa_get_locale
- meson: Extract with_mesa_debug and with_mesa_ndebug for latter usage
- meson: Define MESA_DEBUG for latter usage
- treewide: Replace usage of macro DEBUG with MESA_DEBUG when possible
- meson: Remove the non-used -DDEBUG manually
- intel/meson: Remove redundant inc_gallium
- radv: Remove redundant inc_gallium
- radv: Rename src/amd/vulkan/vk_format.h to src/amd/vulkan/radv_formats.h
- vulkan: Move vk_format_is_alpha and vk_format_is_alpha_on_msb into vk_format.h from pvr
- pvr: inline and remove vk_format_get_channel_width
- pvr: Merge imagination/vulkan/vk_format.h into imagination/vulkan/pvr_formats.h
- pvr: Add pvr\_ prefix for vk_format_* functions in pvr_formats.h
- util: Fixes futex_wait on win32
- util: futex_wait use TIME_MONOTONIC on win32 for consistence with other platform
- util: Turn futex_wake parameter to int32_t for consistence across platforms
- broadcom/common: Now "util/box.h" is under src, so remove the FIXME
- nouveau: Fixes error: unused import: \`crate::nvh_classes_cl906f::\*`

Yusuf Khan (4):

- nvk: remove some dead code files
- nvk: fix valve segfault from setting a descriptor set from NULL
- crocus: fix potential null pointer dereference if transfer_mapping fails
- nouveau: Fix crash when destination or source screen fences are null

Zack Rusin (1):

- svga: Fix instanced draw detection

Zan Dobersek (11):

- freedreno: add fd_rd_output facilities for gzip-compressed RD dumps
- tu/msm: fix RD_CHIP_ID size used when dumping RD
- tu: tu_device should clean up its global bo
- vulkan/rmv: enable logging miscellaneous internal resources
- tu: add RMV support
- freedreno/fdl: avoid overflow in layout size computations
- tu: fix memory leaks in tu_shader
- fd: enable prefixing the RD output filename
- tu/autotune: use SAMPLE_COUNT_END_OFFSET when writing the ending sample count
- tu: RB_SAMPLE_COUNT_ADDR is also used on a7xx
- tu/query: improve CP_EVENT_WRITE7::ZPASS_DONE usage

antonino (1):

- zink: plug leak in \`zink_create_quads_emulation_gs`

chyyran (1):

- util/format/fxt1: include "u_format_pack.h" instead of "util/format/u_format_pack.h"

daoxiang.gong (1):

- zink - Fix for minLod and maxLod when mipmap filter is disabled

duncan.hopkins (19):

- compiler/clc: fix compiler issue on MacOS with st_mtim[e] in stat.
- egl: MacOS platform guard around pthread_condattr_setclock()
- egl: Added DRI3 code guards.
- egl: Changed EGLNativeDisplayType size check to make sure it is big enough instead of exactly the same size.
- gallium/dri: Switch xf86drm.h for util/libdrm.h to allow for the no-op shim to be used.
- gallium/dri: Added XCB dependency to frontends/dri/libdrm build. Fix header issues with xcb.h being used.
- util: Updated util/libdrm.h stubs with drmGetMagic()
- dri: guarded DRI code.
- glx: Switched DRI2 functions over to use Apple specific alternatives and extension name.
- meson: relaxed some meson restrictions on MacOS/Apple allowing for wider build support.
- apple: Extended Apple feature support using GLX_USE_APPLE.
- apple: Meson defines GLX_USE_APPLE to allow for Gallium drivers to work on MacOS.
- zink: Fixed header location and compiling issue with [[deprecated]] from newer MoltenVK versions.
- zink: use portability EXT on Apple.
- zink: stopped the use of VkFormatProperties3 if the reported API is less than 1.3 or VK_KHR_format_feature_flags2 not present.
- zink: removed \`MESA_PRIM_QUADS` from the supported \`PIPE_CAP_SUPPORTED_PRIM_MODES`.
- zink: Avoid issues when kopper tries using XCB WSI on Apple.
- zink/apple: added \`moltenvk-dir` search to allow MoltenVK to be sourced from brew.
- zink/apple: update docs to reflect the current status of Zink on macOS.

nyanmisaka (2):

- frontends/va: Report vendor and device ID through VADisplayPCIID
- radeonsi/uvd_enc: update to use correct padding size

qbojj (1):

- vulkan: Fix calculation of flags in vk_graphics_pipeline_state_fill

thfrwn (1):

- mesa: fix off-by-one for newblock allocation in dlist_alloc
6246