xref: /aosp_15_r20/external/mesa3d/docs/relnotes/22.3.0.rst (revision 6104692788411f58d303aa86923a9ff6ecaded22)
1Mesa 22.3.0 Release Notes / 2022-12-02
2======================================
3
4Mesa 22.3.0 is a new development release. People who are concerned
5with stability and reliability should stick with a previous release or
6wait for Mesa 22.3.1.
7
8Mesa 22.3.0 implements the OpenGL 4.6 API, but the version reported by
9glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
10glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
11Some drivers don't support all the features required in OpenGL 4.6. OpenGL
124.6 is **only** available if requested at context creation.
13Compatibility contexts may report a lower version depending on each driver.
14
15Mesa 22.3.0 implements the Vulkan 1.3 API, but the version reported by
16the apiVersion property of the VkPhysicalDeviceProperties struct
17depends on the particular driver being used.
18
19SHA256 checksum
20---------------
21
22::
23
24    644bf936584548c2b88762111ad58b4aa3e4688874200e5a4eb74e53ce301746  mesa-22.3.0.tar.xz
25
26
27New features
28------------
29
30- GL_ARB_shader_clock on llvmpipe
31- VK_KHR_shader_clock on lavapipe
32- Mesa-DB, the new single file cache type
33- VK_EXT_attachment_feedback_loop_layout on RADV, lavapipe
34- VK_KHR_global_priority on RADV
35- GL_KHR_blend_equation_advanced_coherent on zink
36- VK_EXT_load_store_op_none on RADV
37- VK_EXT_mutable_descriptor_type on RADV
38- VK_EXT_shader_atomic_float on lvp
39- VK_EXT_shader_atomic_float2 on lvp
40- GL_NV_shader_atomic_float on llvmpipe
41- VK_EXT_image_robustness on v3dv
42- VK_EXT_extended_dynamic_state3 on lavapipe
43- VK_EXT_extended_dynamic_state3 on RADV
44- VK_EXT_pipeline_robustness on v3dv
45- Mali T620 on panfrost
46- Shader disk cache on Panfrost
47- support for R8G8B8, B8G8R8, R16G16B16 and 64-bit vertex buffer formats on RADV
48- initial GFX11/RDNA3 support on RADV
49- various ray tracing optimizations on RADV
50- extendedDynamicState2PatchControlPoints on RADV (VK_EXT_extended_dynamic_state2 feature)
51- Radeon Raytracing Analyzer integration (using RADV_RRA_* environment variables)
52- None
53
54
55Bug fixes
56---------
57
58- r600,regression: Wrong color on a Minecraft shaderpack on Cayman
59- [iris/crocus] Native CS:GO OpenGL: fire produces black rendering artifacts
60- VDPAU GL interop is broken, causing Black and White video
61- panfrost: application crashes with 22.2.4/22.3.0-rc3
62- etnaviv: compiler assertion fail in glmark2 terrain
63- anv: dEQP-VK.pipeline.monolithic.multisample.variable_rate.8_1_fb_1 failure
64- DG2: Age of Empire IV incorrect rendering
65- r600: Sky not rendered correctly with the NIR backend on Transport Fever 2
66- "intel/compiler: Signed integer range analysis for imul_32x16 generation" regresses several OpenCL tests
67- aco: Missing 8-bit uadd_carry
68- !16168 broke Xephyr -glamor fallback to swrast
69- panfrost/rk3399: performance drop in emulationstation and in RetroArch menu [mesa >= 21.3.8, batocera.linux, bisected]
70- d3d12/VA-API: [-wmissing-prototypes] with MSYS2 MinGW-W64 UCRT64 GCC
71- clover: windows:  library filename has \`-1` suffix and a \`lib` prefix when built with mingw
72- freedreno: !19236 broke anitchamber trace
73- radv, dxvk: Rendering errors in World of Tanks after "Switch to dynamic rendering only"
74- gen9 gt3e/gt4e skus fail dEQP-VK.pipeline.multisample.sample_locations_ext.*
75- Intel - GPU Hang when starting new game in Hellblade: Senua's Sacrifice
76- v3d: Wrong colors (pink) in videos in Firefox (likely YUV->RGB shader issue)
77- anv/zink: cts failures on ICL
78- panfrost t860 glmark-es2 regression
79- panfrost_drm.h build error
80- radv: Artifacts in multiple games with yuzu (regression, bisected)
81- Doom (1993) Ray-Traced has render glitches.
82- radv: Flickering in Spider-Man Remastered (Regression) (Bisected)
83- radv: Hitman 2 using Direct3D 12 has discolored squares on RDNA2 with DCC enabled
84- panfrost/midgard -  on Duckstation PSX emulator: segfault on GLES 3.0 and bad shader compilations on 3.3
85- nouveau: Updating to Mesa*22.2.0 makes videos have corrupted colors in Firefox
86- [radeonsi] OpenGL rendering issue with Unreal Tournament 99 & OldUnreal 4.69 rc4
87- Replace the functionals in util/debug.h with util/u_debug.h
88- gallium: nir_opt_access desynchronizes image access modifiers in NIR with Program->sh.ImageAccess
89- [radv/navi21] No sprites being rendered in Quake 2 RTX
90- Hacknet does not render on Panfrost
91- !19259 broke some piglit arb_tessellation_shader tests with radeonsi if TCS passthrough shader is cached
92- panfrost g52 issue
93- [Intel] Doom Eternal crashes shortly after launching
94- GravityMark segfault when using ray tracing ANV
95- nir: Incorrect idiv lowering
96- intel-clc fails with "Unsupported SPIR-V capability" and SEGFAULT.
97- Navi 14 hangs running "rendercheck -t cacomposite -f a8r8g8b8" since !18170
98- glx=xlib can not be built under ubuntu 22.04
99- radv: incorrect query result for VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT
100- crocus: EGL_ANDROID_native_fence_sync extension is missing
101- freedreno/a6xx: spec@arb_tessellation_shader@execution@tess-instance-id fail
102- r300: wrong primitive positions in CXBX-R emulator with nine and RADEON_DEBUG=notcl
103- EGL re-initialization can fail silently
104- Rusticl: fails to create program if source size is stated, but last char is a null terminator
105- r300: register allocation sometimes generate invalid swizzles
106- Mesa git Upgrade causes cursor detection issues in Wayland, issue was not in 22.3.0-git.20220920.8d66c45.fc36
107- Proposal: Remove rbug, graw, graw tests, xa, xvmc
108- intrinsics.h:112:21: error: use of undeclared identifier 'intel_sub_group_shuffle'
109- tu: broken since dynamic pcp
110- radv: Crash in dEQP-VK.ray_query.misc.dynamic_indexing
111- t860: mediapipe application failed
112- glthread: radeonsi: offset textures in game starsector with glthread enabled
113- Crashing on Windows VM
114- zink: descriptor-handling regression
115- hasvk: noperspective is broken on Haswell hardware (Intel(R) HD Graphics 4400)
116- util/u_atomic.h can not be included in util/u_debug.h
117- radv: dynamic patch control points broken
118- Exanima renders with the wrong colors.
119- Missing tile cache flushes causing corruptions
120- turnip: compute shader hang in the Monster Hunter World
121- anv_private.h:118:10: fatal error: anv_entrypoints.h: No such file or directory
122- freedreno: mis-rendering on gardensofkadesh/homeworld
123- piglit.spec.egl 1_4.egl-ext_egl_image_storage times out
124- nouveau: tegra124: GL_OUT_OF_MEMORY error
125- r600: Shadows incorrecly rendered on Counter-Strike: Global Offensive with the NIR backend
126- r300: regalloc doesn't handle brk correctly
127- Recent WSI commit breaks every Vulkan application on RDNA 2 GPUs
128- 8775f08bf11 makes warcraft 3 crash with free(): double free detected in tcache 2
129- anv: probable ccs issue on icl
130- iris: remap_tess_levels assertion failure in shader-db runs
131- RADV: Mafia & Mafia III :Definitive Edition graphical issues with certain effects
132- Cyberpunk 2077 ground reflectiveness issues using RADV
133- [RADV] Grand Theft Auto V: Shadow rendering bug on Steam Deck
134- rusticl: fails to build on non-Linux: cannot find value \`stderr` in this scope
135- llvmpipe-piglit-rusticl fail at api/clenqueuefillimage
136- Broken build with Clang-16 due to: aco_register_allocation.cpp:382:7: error: no matching function for call to 'printf'
137- src/gallium/drivers/r600/sfn/sfn_instr.cpp:92:33: error: expected primary-expression before 'int'
138- r600: Make NIR the default backend
139- undefined reference to \`intel_ds_begin_xfb'
140- data-race leads to crash in u_trace_context_init()
141- radeonsi/radv: Discrepancy in handling wd_switch_on_eop
142- Shadows rendering issue in Deathloop
143- RADV: Regression with RPCS3 where nothing renders
144- [RADV][bisected] commit 3714c89d0ebf0c541d8f7a7a807ce7c1bfe3cfea at least lead to GPU hang when I run the game "Quake II RTX"
145- eglCreateContext doesn't generate EGL_BAD_MATCH when share_context is on a different EGLDisplay
146- Intel - Homerun Clash misrendering issue
147- r300: dEQP-GLES2.functional.shaders.indexing.tmp_array.float_dynamic_write_dynamic_loop_read_fragment regression
148- turnip/a660: dEQP-VK.spirv_assembly.instruction.terminate_invocation.terminate.subgroup_ballot failure due to "movmsk" reordered with "kill"
149- WSI reports 11 RGB bits is unsupported in X11, using zink/swrast
150- [radeonsi] Graphical corruption in Valve Source-based games
151- RADV: VK_EXT_load_store_op_none support?
152- radv: extendedDynamicState2PatchControlPoints
153- Vega 8 rendering GDM grey blank login screen after recent change
154- r300: possibility of merging ADD and MUL into MAD when effectively only 2 sources are used
155- MESA_DEBUG=incomplete_tex shows unnecessary validation during glClear
156- anv: Performance drop with Vulkan on Wayland
157- lima: crash during dEQP-EGL execution
158- Regression from https://gitlab.freedesktop.org/mesa/mesa/-/commit/8fcb4aa0ebd7b9d0d8f80986fb817afea2fc4a87
159- radv: Enable instruction timing with RGP for RT
160- Updates to 22.2 RCs cause artifacts on nouveau and blank screen on VirtIO
161- nouveau: corruption on NVAC after switching to the NIR backend
162- egl: Remove big-display-lock
163- DG2: vulkancts tests spin forever
164- [regression][bisected] v3dv: dEQP-VK.api.object_management.alloc_callback_fail_multiple.graphics_pipeline fails
165- v3dv: evaluating reducing current value of DEFAULT_MAX_BO_CACHE_SIZE
166- venus-lavapipe flakes
167- amd: LPDDR4 and LPDDR5 hits an assertion failure in ac_memory_ops_per_clock
168- zink: GL_TEXTURE_LOD_BIAS vs mipLodBias, and GTF-GL46.gtf21.GL3Tests.texture_lod_bias.texture_lod_bias_all failure on Turnip
169- ci: radeonsi-raven-skqp also tests RADV but only runs on radeonsi changes
170- Gallium OpenGL megadriver fails to build if LLVM is enabled but not required by any component built
171- panfrost: crash in dEQP test
172- bitpack_helpers.h:34:10: fatal error: valgrind.h: No such file or directory
173- [gen9_iris] piglit causes crash on 4GB-memory machine when running spec/!opengl 1.1/max-texture-size
174- Abort due to "Unknown intrinsic: intrinsic store_deref" (firefox, radeonsi)
175- radv: missing support for some 3-component vertex formats in RDNA2
176- radv: radv_translate_buffer_dataformat overhead
177- nir: Replacing GC with manual memory management
178- turnip: dEQP-VK.subgroups.vote.graphics.subgroupallequal_*int regression on a635
179- regression from !17949
180- r600/sfn: xonotic shows a big performance regression when using the NIR backend with sb disabled
181- r600/sfn: register allocation fails for a number of piglits and deqp tests
182- DG2 corruption in gnome desktop from F36
183- d3d12: Delete custom OpenGLOn12 target, add option to override libgallium_wgl.dll name
184- radeonsi: error allocating sparse buffers.. bug?
185- RADV: vulkaninfo fails to report extended VkShaderStageFlagBits for VK_KHR_ray_tracing_pipeline
186- [radeonsi] Transform feedback with array outputs fails
187- [BISECTED/radeonsi] Flickering triangles with wayfire cube
188- re-evaluate the linker for CI
189- [RADV] (Vega/Navi) Halo: The Master Chief Collection: Texture issue on Halo 3 and Reach
190- ../src/util/u_queue.c:218: util_queue_fence_destroy: Assertion \`fence->signalled' failed for iris when disabled UTIL_FUTEX_SUPPORTED
191- Freedreno turnip mesa cause emulation close after 15-20min gameplay
192- glsl: textureGatherOffset param broken
193- turnip: broken after switching to userspace vma
194- turnip: gfxbench aztec ruins crashes on zink
195- !16863 disabled all meson tests in CI
196- intel unittests failed a lot because recents commits Job Failed #27063524
197- Microsoft CLC runtime crash (at least on 32-bit) since !12510
198- Job Failed #26465985
199- v3dv: android: VTS big regression after 316728a55bc8fe398be1ac2796a22f8c91fb886c
200- radeonsi: driver crashes with kwin-effects-cube
201- [Turnip] Final Fantasy X: Incorrect hair rendering
202- iris: Suboptimal usage of HiZ + CCS for mipmapped buffers
203- Document state of Dozen Vulkan support in features.txt for MesaMatrix site..
204- turnip: gfxbench car chase crash on zink
205- [r300] - osu! - Shader optimizations cause black and glitchy rendering
206- [Bisected] Microsoft CLC: build failure with MSVC and Dshared-llvm=disabled with AMDGPU target available
207- Android.mk: meson enables flags based on the libraries installed on the host system
208- anv: invalid returns from vkCreateDescriptorPool
209- [Bisected][RadeonSI] Blender crashes on startup
210- lavapipe CI times out
211- turnip: depth/stencil regressions
212- radv regression: Hitman 2 rendering issue
213- llvmpipe: assert in KHR-Single-GL45.enhanced_layouts.xfb_all_stages
214- android: vulkan: Disable blocked KHR_performance_query extension (v3dv, turnip, anv)
215- Poor performance/utilization with Splitgate
216- [Bisected][RADV] Rendering issues and performance drop with GFX10 on RPCS3
217- freedreno_gmem.c crashes Firefox Nightly in Khronos
218- freedreno / mesa 22.1.3 crashes Firefox Nightly
219- !17693 broke piglit oes_egl_image_external_essl3
220- VK_PRESENT_MODE_IMMEDIATE_KHR rendering artifacts
221- dzn: Build failure when it's the only driver built - 22.2-devel regression
222- d3d12: Unresolved external symbols from Version.lib
223- a618 vk_full timing out at 60 minutes instead of 120
224
225
226Changes
227-------
228
229Adam Jackson (25):
230
231- egl: Remove some can't-happen supported API checks
232- egl: Move an error check earlier in EGL_BUFFER_AGE_EXT query
233- wsi/x11: Defer clearing image busy flag for non-shm upload
234- egl/kopper: Don't add EGL_SWAP_BEHAVIOR_PRESERVED_BIT configs
235- glx: Use XSaveContext, delete glxhash.c
236- Revert "glx: Use XSaveContext, delete glxhash.c"
237- wsi/x11: Fix the is-visual-supported check
238- nouveau: const cleanup
239- egl/dri2: Respect the arguments to dri2_set_blob_cache_funcs
240- egl/dri2: Fix a typo in a comment
241- egl/dri2: Fix some thinkos in old context release
242- glx/dri*: Unify glx_context subclassing
243- glx/dri: Avoid a weird indirection in driFetchDrawable
244- glx: Remove some excess work from the GLX_FBCONFIG_ID fallback
245- egl: Fix a bunch of maybe-uninitialized warnings
246- egl: Promote _eglLock/UnlockDisplay to internal API
247- egl: Untangle some spaghetti around _eglLookupDisplay
248- egl: Factor out the eglTerminate write-lock pattern
249- egl: Rewrite eglSetBlobCacheFuncsANDROID to use the standard macros
250- mesa/fbo: Don't force both read/draw bindings to zero
251- egl: Formatting fix
252- egl: Factor some common terminate cleanup up to common code
253- egl/dri2: Fix a weird conditional in dri2_make_current
254- egl: Remove a bogus restriction from eglMakeCurrent
255- rusticl: meson devenv support
256
257Adrián Larumbe (5):
258
259- panfrost: Handle Job VA cycles when decoding a dump file
260- panfrost: Remove documentation reference to deprecated parameter
261- panfrost: Add compiler args option for building usermode tools
262- panfrost: Sync panfrost_drm.h from drm-misc-next
263- panfrost: Add userspace crash dump decoder and analyser
264
265Alejandro Piñeiro (11):
266
267- v3dv/bo: reduce DEFAULT_MAX_BO_CACHE_SIZE to 64
268- v3dv/pipeline: use a array instead of individual pointer to stages
269- v3dv/pipeline: keep qpu_insts around if we expect them to be used later
270- v3d: replace all TODO around for FIXME
271- broadcom/compiler: don't call nir_opt_load_store_vectorize on all v3d_optimize_nir calls
272- broadcom/compiler: update how we compute return_words_of_texture_data on non-ssa
273- broadcom/compiler: add more lowerings/optimizations on v3d_optimize_nir
274- v3dv/pipeline: use v3d_optimize_nir
275- broadcom/compiler: pass a strategy struct to vir_compile_init
276- broadcom/compiler: call nir_opt_gcm with a custom strategy
277- v3dv/bo: reset bo and then call gem close
278
279Alejandro Tafalla (1):
280
281- freedreno: Fix graphic glitches on a4xx and a5xx
282
283Alessandro Astone (3):
284
285- Android.mk: Disable lmsensors
286- Android.mk: Filter out --build-id=md5 linker flag
287- Android.mk: Provide a custom entry name to ensure meson checks succeed
288
289Alex Brachet (1):
290
291- nir: Fix qsort comparator function
292
293Alyssa Rosenzweig (414):
294
295- docs/asahi: Document varying interpolation
296- agx: Fix packing of samplers in texture instrs
297- agx: Fix ld_var cf packing
298- asahi: Plumb through lower_clip_fs
299- asahi: Extend counts in BIND packets
300- asahi: Allow large uniform records
301- asahi: Dump all textures&samplers
302- asahi: Fix using multiple textures/samplers
303- asahi: Use a single bind texture/sampler per pipeline
304- asahi: Split vertex/fragment pipeline binds
305- asahi: Correct bind fragment pipeline size
306- asahi: Encode known bits of Linkage in the XML
307- asahi: Fix varying XML
308- asahi: Decode Interpolation packets
309- asahi,agx: Rewrite varying linking
310- agx: Add AGX_MESA_DEBUG=noopt option
311- agx: Model perspective coefficient reg in the IR
312- agx: Rename varying load instructions
313- agx: Use split instead of extract for ldcf
314- agx: Implement noperspective interpolation
315- agx: Lower more ALU operations
316- agx: Align 64-bit register pairs
317- agx: Assert that registers are naturally aligned
318- agx: Extract umul_high implementation
319- agx: Implement nir_op_umul_high
320- agx: Stop using broken idiv lowering
321- agx: Handle type-changing splits
322- agx: Remove p_extract
323- agx: Only emit the used components of gl_FragCoord
324- asahi: Don't advertise ARB_clip_control yet
325- asahi: Identify depth clip mode bit
326- pan/decode: Don't pass around memory handles
327- pan/decode: Remove pandecode_msg
328- pan/decode: Stop passing job index around
329- pan/decode: Stop passing suffixes around
330- pan/decode: Simplify pandecode_fbd
331- pan/decode: Reorder MFBD decoding
332- pan/decode: Unify SFBD/MFBD decoding
333- pan/decode: Defeature disassembler stats
334- pan/decode: Centrally declare pandecode entrypoints
335- pan/decode: Clean up _bifrost_ decode routines
336- pan/mdg: Remove disassembler stats
337- panfrost: Don't segfault on unknown models
338- pan/bi: Implement some extracts and inserts
339- pan/bi: Fuse [US][8|16]_TO_F32 ops
340- pan/bi: Test int8/16 -> float32 opts
341- pan/bi: Optimize bitwise arithmetic of booleans
342- pan/bi: Optimize pattern from nir_lower_idiv
343- pan/bi: Don't use the broken idiv lowering
344- pan/bi: Use nir_opt_idiv_const
345- asahi: Fix warning building for macOS 12.0
346- pan/bi: Don't reorder image loads across stores
347- pan/bi: Don't allow ATEST to take a temporary
348- pan/bi: Handle info.fs.untyped_color_outputs on Valhall
349- panfrost: Handle untyped_color_outputs on Bifrost
350- panfrost: Set blit output variable types correctly
351- panfrost: Don't saturate in Bifrost blend shaders
352- panfrost: Promote blend shader outputs 8->16-bit
353- panfrost: Avoid blend shader when not blending
354- panfrost: Don't compile empty blend shaders
355- panfrost: Invert no_colour to enabled
356- panfrost: Simplify blitter blend shader creation
357- panfrost: Include mask in replace blend shader name
358- panfrost: Assert that blend shaders are nontrivial
359- r600/sfn: Don't use broken idiv lowering
360- gallium: Inline away util_make_fragment_tex_shader_writemask
361- gallium: Inline away util_make_fragment_tex_shader interp_mode
362- gallium: Remove util_make_fragment_tex_shader_xrbias
363- pan/mdg: Replicate swizzles for scalar sources
364- pan/mdg: Reexpress umul_high packing
365- pan/mdg: Use correct idiv lowering
366- pan/mdg: Always write return address to r1.w
367- pan/mdg: Print 3 sources for CSEL
368- panfrost/ci: Update G72 xflakes list
369- gallium: Add u_default_set_debug_callback
370- v3d: Use u_default_set_debug_callback
371- vc4: Use u_default_set_debug_callback
372- softpipe: Use u_default_set_debug_callback
373- lima: Use u_default_set_debug_callback
374- etnaviv: Use u_default_set_debug_callback
375- r300: Use u_default_set_debug_callback
376- panfrost: Route shader-db to debug, not stderr
377- pan/bi: Implement general 8-bit vector construction
378- pan/va: Handle 8-bit lane when lowering constants
379- pan/bi: Implement nir_op_vec8 and nir_op_vec16
380- pan/bi: Assert that vectors are sufficiently small
381- docs/asahi: Document image layouts
382- asahi: Identify compressed render target fields
383- asahi: Rename 'tiled 64x64' to 'twiddled'
384- asahi: Add XML for multisampled textures
385- asahi: Fix bind sizes
386- asahi: Fix ASTC enums
387- ail: Introduce image layout module
388- ail: Rewrite tiled memcpy for correctness
389- asahi: Handle 2-channel sRGB textures
390- asahi: Fix is_format_supported returns
391- asahi: Advertise ETC1
392- asahi: Advertise ASTC formats
393- asahi: Rename our fake twiddled DRM modifier
394- ail: Add unit tests for miptree layouts
395- pan/bi: Fix dual texturing with uniforms
396- pan/bi: Fix out-of-bounds write in va_lower_split_64bit
397- pan/bi: Consider all dests in helper_block_update
398- pan/bi: Model [IF]CMP_{OR,AND,MULTI} ops
399- pan/va: Split out compare instructions
400- pan/bi: Set I->nr_dests, I->nr_srcs
401- pan/bi: Model 3rd source for ATEST
402- pan/bi: Consider nr_srcs/dests in CSE
403- pan/bi: Use nr_{srcs,dests} in bi_foreach_{src,dest}
404- pan/bi: Add validation for nr_srcs/nr_dests
405- pan/bi: Remove trivial dest[0] = bi_null()
406- pan/bi: Clean up destination printing
407- pan/bi: Improve register printing
408- pan/bi: Register allocate BLEND dest on Valhall
409- pan/bi: Don't remove dests in DCE
410- pan/bi: Use bi_emit_split_i32 for trimming vecs
411- pan/bi: Introduce TEXC_DUAL psuedoinstruction
412- pan/bi: Assert destinations are non-NULL
413- pan/bi: Assume destinations are non-NULL
414- pan/bi: Don't read nonexistant destinations
415- pan/bi: Don't read nonexistant sources
416- pan/bi: Add variable dest/src support to builder
417- pan/bi: Use variable src/dest for collect/split
418- pan/bi: Use safe helpers to remove srcs/dests
419- pan/bi: Rebuild some instructions when lowering
420- pan/bi: Rebuild DISCARD when optimizing
421- pan/va: Use builder in va_lower_isel
422- pan/bi: Use builder for MUX -> CSEL opt
423- pan/bi: Use builder for IADD -> IADDC opt
424- pan/bi: Use builder for atomic lowering in scheduler
425- pan/bi: Reduce memory of scheduler access array
426- pan/bi: Dynamically allocate source/dests
427- pan/bi: Don't write registers in optimizer tests
428- pan/bi: Add bi_after_block_logical helper
429- pan/bi: Lift bi_predecessor_index helper
430- pan/bi: Add phi nodes
431- pan/bi: Validate phi ordering
432- pan/bi: Don't lower FAU for phis
433- pan/bi: Remove bogus assert lowering branches
434- pan/bi: Don't optimize if without else
435- pan/bi: Don't reorder phis in pre-RA scheduling
436- pan/bi: Validate before&after pre-RA sched
437- pan/bi: Inline DCE into bi_lower_vector
438- pan/bi: Translate phis from NIR
439- pan/bi: Add SSA-based liveness pass
440- pan/bi: Add brainless out-of-SSA pass
441- pan/bi: Reduce some moves when going out-of-SSA
442- pan/bi: Preserve SSA form from NIR
443- pan/bi: Clean up after converting to SSA
444- pan/bi: Clear reg in squeeze_index
445- pan/bi: Don't use bi_temp_reg
446- pan/bi: Assume SSA in copyprop
447- pan/bi: Assume SSA in mod prop
448- pan/bi: Assume SSA in CSE
449- pan/bi: Assume non-null sources in constant folding
450- pan/bi: Assume SSA in minor passes
451- pan/bi: Assume SSA when scheduling for pressure
452- pan/bi: Assume SSA when translating NIR
453- pan/bi: Assume SSA for helper invocation analysis
454- pan/bi: Move non-SSA liveness into RA
455- pan/bi: Remove NIR registers from the IR
456- pan/bi: Get rid of bi_get_node
457- pan/bi: Inline node_to_index
458- pan/bi: Remove bi_max_temp
459- pan/bi: Simplify bi_get_index prototype
460- pan/bi: Add and use bi_foreach_ssa_src macro
461- pan/bi: Strengenth assert in the validator
462- pan/bi: Remove assert(bi_is_ssa(dest))
463- pan/bi: Add and use bi_replace_src helper
464- pan/bi: Add and use bi_num_successors helper
465- agx: Fix tib access in internal shaders
466- agx: Fix float copyprop of neg(neg) case
467- agx: Add more unit tests for float copyprop
468- agx: Don't hardcode uniform enums
469- agx: Defeature indirect vbufs
470- agx: Plumb through nir_op_txf
471- asahi: Fix Indexed Draw command
472- asahi: Split up unk 2b
473- asahi: Note some more unknowns in the XML
474- asahi: Make ctx->zs a pointer
475- asahi: Export agx_blitter_save
476- asahi: Handle empty fragment shaders
477- asahi: Use util_blitter_clear
478- asahi: Trim garbage at end of set shader
479- asahi: Relax assert in decoder
480- asahi: Label batch flush reasons
481- asahi: Fix depth for cube maps
482- asahi: Fix nonmipmapped array textures
483- asahi: Fix "stride" for tiled textures
484- asahi: Identify and use first level field of texture
485- asahi: Correct SET_SHADER_EXTENDED disambig bit
486- asahi: Add 1D and 1D Array enums
487- asahi: Handle out-of-bounds clear_stencil
488- asahi: Fix depth/stencil buffers
489- asahi: Identify stencil test enable
490- asahi: Split RASTERIZER into constituent words
491- asahi: Identify object type field via PowerVR
492- asahi: Don't use lower_wpos_pntc
493- asahi: Match PPP data structures with PowerVR
494- asahi: Hoist constant PPP state to start of batch
495- asahi: Dirty track everything
496- asahi: Make BO list growable
497- asahi: Annotate VDM/CDM commands as per PVR
498- asahi: Express VDM commands according to PowerVR
499- asahi: Handle Stream Link VDM commands
500- asahi: Allocate new cmdbufs if out of space
501- docs/envvars: Document Asahi variables
502- pan/decode: Fix job cycle detection
503- nir/lower_blend: Avoid emitting unnecessary fsats
504- panfrost: Respect buffer offset for OpenCL
505- panfrost: Honour cso->req_local_mem
506- nir: Add nir_intrinsic_texture_base_agx sysval
507- agx,asahi: Implement nir_intrinsic_load_texture_base_agx
508- agx: Make p_combine take a dynamic src count
509- agx: Implement texture offsets and comparators
510- agx: Implement txd
511- agx: Implement load_global(_constant)
512- agx: Lower txs to a descriptor crawl
513- agx: Don't use nir_find_variable_with_driver_location
514- asahi: Don't crash on <4 channel render targets
515- asahi: Handle blending with MRT
516- u_transfer_helper: Handle Z24X8 for drivers that don't use the interleaved transfer_map
517- u_transfer_helper: Pack Z24S8 to Z24-in-Z32F and S8
518- asahi: Assert cache line alignment on Z/S buffers
519- asahi: Identify ZLS Control word from PowerVR
520- asahi: Identify CDM block types
521- asahi: Decode CDM commands separate from VDM
522- asahi: Shuffle IOGPU structs
523- asahi: Identify IOGPU compute header
524- asahi: Decode IOGPU compute header
525- asahi: Assert that u_transfer_helper is well-behaved
526- asahi: Use the internal format internally
527- asahi: Identify spill buffer histogram
528- asahi: Simplify IOGPU attachment packing
529- asahi: Identify shared memory fields
530- asahi: Consolidate magic numbers for USC controls
531- asahi/genxml: Overflow up to words when packing
532- asahi: Overhaul USC control packing
533- asahi: Identify pixel stride
534- asahi: Identify shared memory layouts
535- panfrost: Evict the BO cache when allocation fails
536- rusticl: Build Panfrost
537- panfrost: Adapt emit_shared_memory for indirect dispatch
538- agx: Convert and clamp array indices in NIR
539- panfrost: Default pipe->clear_texture impl
540- panfrost: Allow compiling MESA_SHADER_KERNEL
541- panfrost: Upload default sampler for txf
542- panfrost: Bump PIPE_CAP_MAX_TEXTURE_ARRAY_LAYERS
543- panfrost: Advertise PIPE_CAP_INT64
544- panfrost: Honour flush-to-zero controls on Valhall
545- panfrost: Assume launch_grid parameters always change
546- pan/va: Fix missing swizzle on CLZ.v2u16
547- pan/bi: Unit test swizzle lowering
548- pan/bi: Lower some 8-bit swizzles
549- pan/bi: Test some 8-bit swizzle lowering
550- pan/bi: Lower swizzles for 8-bit shifts
551- pan/bi: Strip negate when lowering swizzles
552- pan/va: Pack IADD.sat bit
553- pan/va: Fix v4s8 form of R2 opcodes
554- pan/bi: Handle uhadd, urhadd opcodes
555- pan/va: Pack .rhadd bit
556- pan/bi: Add HADD.v4s8.rhadd packing test cases
557- pan/va: Add 8-bit integer max assembler case
558- pan/bi: Lower 8-bit min/max to bcsel+comparison
559- pan/bi: Lower f2i8, f2u8
560- pan/bi: Remove bogus assert for pack_32_2x16
561- pan/bi: Allow selecting from an 8-bit vec8
562- pan/bi: Handle swizzles in unpack_64_2x32_split_{x,y}
563- pan/bi: Lower <32-bit bit_count
564- pan/bi: Lower fisnormal
565- pan/bi: Scalarize phis before the opt loop
566- pan/bi: Call nir_lower_64bit_phis
567- pan/bi: Use .auto for image stores
568- pan/bi: Fix 1D array indexing on Valhall
569- pan/bi: Stub out scoped_barrier
570- pan/bi: Implement unpack_64_2x32
571- agx: Fix bfi_mask packing
572- agx: Emit fewer combines for intrinsics
573- agx: Use 16-bit immediates
574- agx: Validate immediates
575- agx: Test fsat+f2f16 together
576- agx: Ensure that the optimizer sees legitimate SSA
577- agx: Pass mask into ld/st_tile instructions
578- agx: Account for mask when writing registers
579- agx: Remove load_kernel_input path
580- panfrost: Remove load_kernel_input path
581- nir/lower_system_values: Fix cs_local_index_to_id with variable workgroups
582- asahi: Eliminate "Pixel Format" type from GenXML
583- asahi/genxml: Defeature uint/float
584- panfrost,asahi: Remove unused prepare macro
585- panfrost,asahi: Remove unused function
586- asahi,panfrost: Remove exact attribute
587- asahi: Use a header more like Intel's GenXML
588- asahi: Remove no-direct-packing
589- agx: Add iterators for phi/non-phi instructions
590- agx: Set PIPE_SHADER_CAP_INDIRECT_CONST_ADDR
591- agx: Emit smaller combines for nir_op_vec2/3
592- agx: Don't prefix pseudo-ops
593- agx: Improve printing of immediate sources
594- agx: Switch to dynamic allocation of srcs/dests
595- agx: Dynamically size split instruction
596- agx: Rename "combine" to "collect"
597- agx: Emit trap at pack-time, not during isel
598- agx: Print instructions as "dest = src"
599- agx: Print agx_dim appropriately
600- agx: Preload vertex/instance ID only at start
601- agx: Write to r0l with a "nesting" instruction
602- agx: Introduce ra_ctx data structure
603- agx: Pass in max regs as a paramter to RA
604- agx: Reserve live-in regs at the start of block
605- panfrost/ci: Disable t720 jobs
606- nir: Be smarter fusing ffma
607- mesa,gallium: Revert "Make point coord origin a CAP"
608- panfrost/ci: Remove stale fail
609- pan/mdg: Lower PIPE_COMPUTE_CAP_MAX_THREADS_PER_BLOCK on Midgard
610- pan/mdg: Limit work registers for large workgroups
611- pan/mdg: Fix 16-bit alignment with spiller
612- nir: Fix nir_fmax_abs_vec_comp
613- nir: Usher nir_normalize_cubemap_coords into 2022
614- mesa: Fix multiple matrix pops in a row
615- nir/lower_io: Set interpolated_input dest_type
616- u_transfer_helper: Handle Z24S8 with z24_in_z32f but no interleaving
617- asahi: Lower point sprites in driver
618- gallium: Default to PIPE_CAP_MIXED_COLORBUFFER_FORMATS
619- panfrost: Remove NIR_SERIALIZED support
620- asahi: Hold a reference to BOs in a batch
621- nir/opt_preamble: Move load_texture_base_agx
622- agx: Remove command line compiler
623- asahi: Remove AGX_FAKE_DEVICE environment variable
624- asahi: Precompile for shader-db
625- asahi: Route shader-db stats to debug callback
626- asahi: Identify "Uniform high" USC word
627- asahi: Identify more shader-related fields
628- asahi: Remove "padding" field
629- asahi: Make agx_varyings a union
630- asahi: Use USC_UNIFORM_HIGH
631- asahi: Rename LOD clamps tests to fit other packing
632- asahi: Don't use unnecessary test fixture
633- asahi: Add group tests
634- asahi: Support preamble shaders
635- agx: Be less sloppy about high uniforms
636- agx: Check that we don't push too much
637- agx: Make push_count public
638- agx: Handle multiple functions
639- agx: Align preamble shaders
640- agx: Add uniform_store instruction
641- agx: Allow larger indices for ld/st
642- agx: Strengthen assert for packing ld/st instructions
643- agx: Implement scalar load/store_preamble
644- agx: Handle uniforms passed to COLLECT
645- agx: Restrict copyprop of uniforms
646- agx: Handle 64-bit moves
647- agx: Avoid reading high uniforms from device_load
648- agx: Report GPRs to the driver
649- asahi: Set uniform counts accurately
650- asahi: Set GPR count accurately
651- agx: Add agx_nir_opt_preamble pass
652- agx: Use agx_nir_opt_preamble
653- panfrost: Don't use lower_wpos_pntc on Midgard
654- panfrost: Use proper formats for pntc varying
655- etnaviv: Use correct idiv lowering
656- nir/lower_idiv: Remove imprecise_32bit_lowering
657- nir/lower_idiv: Inline convert_instr_precise
658- panfrost: Zero polygon list for fragment-only
659- panfrost: Don't allow VS side effects on midgard
660- pan/bi: Clean up sysval handling a bit
661- pan/mdg: Fix 64-bit address arithmetic
662- panfrost: Use compute-based XFB on Midgard
663- panfrost: Avoid a XFB special case
664- panfrost: Lower MAX_BLOCK_SIZE on Midgard
665- pan/mdg: Don't schedule across memory barrier
666- panfrost,asahi: Support ARB_buffer_storage
667- gallium: Stub support for Asahi + DRM
668- gallium: Only use Asahi's software path on macOS
669- panfrost/ci: Disable trace-based testing
670- panfrost: Enable rendering to 16-bit and 32-bit
671- panfrost: Add lots of perf_debug annotations
672- asahi: Identify counts for compute kernels
673- softpipe: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT
674- i915g: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT
675- etnaviv: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT
676- gallium: Default to PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT
677- nir/opt_algebraic: Fuse c - a * b to FMA
678- pan/bi: Don't lower outputs for compute
679- panfrost: Don't use nir_variable in the compilers
680- pan/mdg: Disassemble the .a32 bit
681- pan/mdg: Don't pair ST_VARY.a32 with other instrs
682- pan/mdg: Use .u32 for flat shading
683- pan/mdg: Use bifrost_nir_lower_store_component
684- panfrost: Don't allocate space for empty varyings
685- panfrost: Don't use nir_variable to link varyings
686- panfrost: Delete set_global_resources
687- panfrost: Move small compute functions to pan_context.c
688- panfrost: Remove bogus assert
689- panfrost: Consolidate all shader compiling code
690- panfrost: Merge pan_assemble.c into pan_shader.c
691- panfrost: Remove unused req_input_mem copy
692- panfrost: Rename structs to panfrost_(un)compiled_shader
693- panfrost: Remove uncompiled_shader->active_variant
694- panfrost: Use u_dynarray for variants
695- panfrost: Make fixed_varying_mask a fragment-only key
696- panfrost: Precompile transform feedback program
697- panfrost: Precompile shaders
698- panfrost: Implement a disk cache
699- docs/new_features: Add Panfrost disk cache
700- panfrost: Don't copy resources if replaced
701- panfrost: Replace resource shadowing flush
702- panfrost: Fix build with Perfetto (again)
703- panfrost: Copy resources when necessary
704- panfrost: Require 64-byte alignment on imports
705- panfrost: Fix reference counting with batch->resources
706- panfrost: Use PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY
707- pan/mdg: Fix out-of-order execution
708- panfrost: Revert "Require 64-byte alignment on imports"
709
710Andri Yngvason (1):
711
712- gallium/vl: Add opaque rgb pixel formats
713
714Anton Bambura (1):
715
716- panfrost: Enable Mali-T620
717
718António Monteiro (2):
719
720- freedreno: get timestamp from os_time_get_nano instead of os_time_get
721- gallium/util: remove network class
722
723Axel Davy (6):
724
725- frontend/nine: Skip invalid swvp calls
726- frontend/nine: Fix buffer tracking out of bounds
727- frontend/nine: Fix ATOC handling
728- frontend/nine: Fix cso restore bug
729- frontend/nine: Fix shader multi-use crash
730- frontend/nine: Fix ff position_t fallback when w = 0
731
732Bas Nieuwenhuizen (56):
733
734- vulkan/wsi: Take max extent into consideration for modifier selection.
735- radv: Remove redundant radv_QueuePresentKHR.
736- radv: Add binding code for 3d sparse images.
737- radv: Add 3d tile shapes for sparse binding.
738- radv: Expose 3d sparse images.
739- amd/common: Don't rely on DCN support checks with modifiers.
740- amd/common: Disable DCC retile modifiers on RDNA1
741- radv: Don't flatten bottom AS exit if statement.
742- radv: Store top of stack in a register.
743- Revert "radv/rt: use derefs for the traversal stack"
744- radv: Use constant for ray traversal exit condition.
745- radv: Use nested ifs for pushing child nodes in traversal loop.
746- radv: Add scratch stack to reduce LDS stack in RT traversal.
747- ac/surface: Fix some warnings.
748- radv: Hardcode root node id.
749- radv: Translate the BVH copy shader to glsl from nir_builder.
750- radv: Remove aabb bounds from instance nodes.
751- radv: Use proper matrices for instance nodes.
752- radv: Use GLSL matrices for instance transforms in BVH.
753- radv: Use deterministic order for dumping acceleration stuctures.
754- radv: Properly initialize all memory in RRA dumps.
755- radv: Do an early check of distance in triangle culling.
756- vulkan/wsi/x11: Fix double free of modifiers.
757- radv: Clean up unused fields in BVH IR.
758- radv: Add a field for the offset of the bvh in the blas.
759- radv: Simplify buffer copy address generation.
760- radv: Start the BVH after the header.
761- radv: Generate parent links in BVH.
762- radv: Only emit parents from parents that actually end up in the tree.
763- radv: Split global & local bvh node variable.
764- radv: Add traversal backtracking with a short stack.
765- radv: Properly annotate all the invalid node usage.
766- radv: Implement pipeline-wide skiptriangles/skipaabbs.
767- radv: Consider inactive internal nodes never in the tree.
768- radv: Constify entry_size.
769- radv: Use a VkPipelineCache handle for meta shaders.
770- radv: Rework modification detection of the meta cache.
771- radv: Mark dEQP-VK.ray_query.misc.dynamic_indexing as crashing in CI.
772- aco: Pre-split result of bvh64_intersect_ray_amd.
773- radv: Improve gfx11 clear colors for DCC.
774- radv: Use correct primgrp size for gfx11.
775- radv: Add BVH IR header.
776- radv: Make the number of internal nodes be written on the GPU.
777- radv: Add PLOC shader
778- radv: Use PLOC for BVH building
779- radv: Speculatively tune RT pipelines for GFX11.
780- radv: Use correct types for loading the rings with LLVM.
781- radv: Handle GSVS ring intrinsic correctly with LLVM.
782- radv: Handle attribute ring intrinsic correctly with LLVM.
783- radv: Fix compute scratch buffer emission.
784- radv: Make the compute scratch waves per SE as well.
785- radv: Fix custom build id with C90 compilation.
786- radv: Use correct init order for ETC2 image views
787- radv: Fix sampler types in ETC2 decode.
788- vulkan: Remove asserts that weren't valid for RADV ETC2 emulation.
789- radv: reserve space for the scissor in vkCmdBeginRendering.
790
791Benjamin Tissoires (1):
792
793- CI: convert to use the new S3 server instead of the legacy minio
794
795Boris Brezillon (1):
796
797- ci/panvk: Skip dEQP-VK.api.object_management.max_concurrent.query_pool
798
799Brian Paul (113):
800
801- st_pbo/compute: fix memset() warning
802- lavapipe: s/u_foreach_bit/u_foreach_bit64/ in handle_pipeline_access()
803- llvmpipe: further bump LP_MAX_TGSI_SHADER_IMAGES to 64
804- llvmpipe: always pass non-zero writemask to assign_reg()
805- util: allow GALLIUM_LOG_FILE=stdout
806- gallivm: move lp_build_nir_aos_context declaration, etc
807- gallivm: change texture/sampler_index params to unsigned
808- llvmpipe: asst. formatting, clean-ups in lp_state_fs.c
809- gallivm: asst. clean-ups in lp_bld_sample_soa.c
810- gallivm: fix nir AOS swizzling issues
811- lavapipe: remove continue statements in emit_state() to be more consistent
812- lavapipe: zero-init sampler objects
813- cso: asst. clean-ups in cso_hash.[ch]
814- cso: use util_bitcount
815- cso: asst. clean-ups in cso_cache.[ch]
816- cso: asst. clean-ups in cso_context.[ch]
817- draw: asst. clean-ups in draw_context.[ch]
818- draw: asst. clean-ups in draw_vs_llvm.c
819- draw: asst. clean-ups in draw_vs_variant.c
820- draw: asst. clean-ups in draw_vs.[ch]
821- draw: remove goto in llvm_pipeline_generic()
822- draw: remove unused draw_has_llvm()
823- draw: asst. clean-ups in draw_pipe.h
824- draw: asst. clean-ups in draw_pt_fetch.c
825- draw: asst. clean-ups in draw_pt_post_vs.c
826- draw: asst. clean-ups in draw_pipe_cull.c
827- draw: asst. clean-ups in draw_pipe_clip.c
828- draw: asst. clean-ups in draw_pipe_flatshade.c
829- draw: asst. clean-ups in draw_pipe_offset.c
830- draw: asst. clean-ups in draw_pipe_pstipple.c
831- draw: asst. clean-ups in draw_pipe_twoside.c
832- draw: asst. clean-ups in draw_pipe_unfilled.c
833- draw: asst. clean-ups in draw_pipe_user_cull.c
834- draw: asst. clean-ups in draw_pipe_util.c
835- draw: asst. clean-ups in draw_pipe_wide_line.c
836- draw: asst. clean-ups in draw_pipe_wide_point.c
837- draw: asst. clean-ups in draw_vertex.[ch]
838- draw: asst. clean-ups in draw_pt_so_emit.c
839- draw: asst. clean-ups in draw_pipe_aapoint.c
840- draw: asst. clean-ups in draw_llvm.c
841- draw: asst. clean-ups in draw_cliptest_tmp.h
842- draw: asst. clean-ups in draw_gs.[ch]
843- draw: asst. clean-ups in draw_prim_assembler.[ch]
844- draw: asst. clean-ups in draw_pipe.c
845- draw: asst. clean-ups in draw_pt_vsplit_tmp.h
846- draw: asst. clean-ups in draw_split_tmp.h
847- draw: asst. clean-ups in draw_pt.c
848- draw: clean-up count assignment in draw_pt_arrays()
849- draw: asst. clean-ups in draw_private.h
850- draw: asst. clean-ups in draw_pt_fetch_shade_pipeline_llvm.c
851- draw: replace double conditional expression with if/else
852- draw: make draw_geometry_shader_run() return void
853- draw: fix some const-correctness in draw_vbo()
854- draw: move vs_slot var in tgsi/llvm_fetch_gs_input()
855- draw: s/start_or_maxelt/start/ in llvm VS code
856- draw: fix vertex id offset bug
857- draw: fix base vertex bug
858- llvmpipe: asst. clean-ups in lp_rast.h
859- llvmpipe: asst. clean-ups in lp_rast_debug.c
860- llvmpipe: asst. clean-ups in lp_rast_priv.h
861- llvmpipe: asst. clean-ups in lp_rast_rect.c
862- llvmpipe: asst. clean-ups in lp_rast.c
863- llvmpipe: asst. clean-ups in lp_rast_linear.c
864- llvmpipe: asst. clean-ups in lp_rast_linear_fallback.c
865- gallium: whitespace fixes in p_context.h, p_screen.h
866- llvmpipe: asst. clean-ups in lp_context.[ch]
867- llvmpipe: asst. clean-ups in lp_surface.c
868- llvmpipe: asst. clean-ups in lp_screen.[ch]
869- llvmpipe: asst. clean-ups in lp_query.c
870- llvmpipe: asst. clean-ups in lp_setup.c
871- llvmpipe: asst. clean-ups in lp_setup_rect.c
872- llvmpipe: asst. clean-ups in lp_setup_vbuf.c
873- llvmpipe: asst. clean-ups in lp_setup_line.c
874- llvmpipe: asst. clean-ups in lp_setup_point.c
875- llvmpipe: remove gotos in do_rect_ccw()
876- llvmpipe: asst. clean-ups in lp_setup_tri.c
877- llvmpipe: asst. clean-ups in lp_bld_alpha.c
878- llvmpipe: asst. clean-ups in lp_bld_blend_aos.c
879- llvmpipe: asst. clean-ups in lp_bld_depth.c
880- llvmpipe: asst. clean-ups in lp_bld_interp.c
881- llvmpipe: asst. clean-ups in lp_debug.h
882- llvmpipe: asst. clean-ups in lp_draw_arrays.c
883- llvmpipe: asst. clean-ups in lp_fence.c
884- llvmpipe: asst. clean-ups in lp_flush.[ch]
885- llvmpipe: asst. clean-ups in lp_linear_interp.c
886- llvmpipe: asst. clean-ups in lp_linear_sampler.c
887- llvmpipe: asst. clean-ups in lp_scene.c
888- llvmpipe: asst. clean-ups in lp_scene.h
889- llvmpipe: asst. clean-ups in lp_state_sampler.c
890- llvmpipe: asst. clean-ups in lp_state_derived.c
891- llvmpipe: asst. clean-ups in lp_state_clip.c
892- llvmpipe: asst. clean-ups in lp_state_cs.c
893- llvmpipe: asst. clean-ups in lp_state_gs.c
894- llvmpipe: asst. clean-ups in lp_state_rasterizer.c
895- llvmpipe: asst. clean-ups in lp_state_so.c
896- llvmpipe: asst. clean-ups in lp_state_surface.c
897- llvmpipe: asst. clean-ups in lp_state_tess.c
898- llvmpipe: asst. clean-ups in lp_state_vertex.c
899- llvmpipe: asst. clean-ups in lp_test_blend.c
900- llvmpipe: asst. clean-ups in lp_test_conv.c
901- llvmpipe: asst. clean-ups in lp_test_format.c
902- llvmpipe: asst. clean-ups in lp_test_main.c
903- llvmpipe: asst. clean-ups in lp_tex_sample.c
904- llvmpipe: asst. clean-ups in lp_texture.[ch]
905- llvmpipe: improve some if/switch code in llvmpipe_get_shader_param()
906- frontend/dri: assorted clean-ups in dri-screen.c
907- frontends/dri: clean-ups in dri_util.c
908- glx: clean-ups in create_context.c
909- glx: clean-ups in drisw_glx.c
910- llvmpipe: s/uint/enum pipe_prim_type/ in lp_setup_context.h
911- llvmpipe: fix comment typo
912- llvmpipe: asst. clean-ups in lp_state_fs.c
913- llvmpipe: check rectangle vertices have equal W components
914
915Caio Oliveira (38):
916
917- intel/compiler: Print more details when fs_visitor::validate() fails
918- intel/compiler: Call get_mesh_urb_handle one level up in call-stack
919- intel/compiler: Get URB handle in emit_task/mesh_intrinsic functions
920- intel/compiler: Make component() work for FIXED_GRF/ARF
921- intel/compiler: Remove INTEL_DEBUG=tcs8
922- intel/compiler: Rename 8_PATCH to MULTI_PATCH
923- intel/compiler: Use fs_reg helpers for TCS icp_handle selection
924- intel/compiler: Use fs_reg helpers for GS icp_handle selection
925- intel/compiler: Create fs_visitor::emit_tcs_barrier()
926- intel/compiler: Add helper for barrier message payload setup for gfx >= 125
927- intel/compiler: Use builder to allocate fs regs for TCS store output
928- intel/compiler: Make a type for Thread Payload and FS variant
929- intel/compiler: Use FS thread payload only for FS
930- intel/compiler: Create struct for TCS thread payload
931- intel/compiler: Store Patch URB output in TCS thread payload struct
932- intel/compiler: Store Primitive ID in TCS thread payload struct
933- intel/compiler: Store start of ICP handles in TCS thread payload struct
934- intel/compiler: Create and use struct for TES thread payload
935- intel/compiler: Create and use struct for VS thread payload
936- intel/compiler: Create and use struct for TASK and MESH thread payloads
937- intel/compiler: Create and use struct for GS thread payload
938- intel/compiler: Store start of ICP handles in GS thread payload struct
939- intel/compiler: Create and use struct for Bindless thread payload
940- intel/compiler: Export brw_get_subgroup_id_param_index()
941- intel/compiler: Create and use struct for CS thread payload
942- intel/compiler: Make thread_payload struct abstract
943- intel/compiler: Add a few more brw_ud* helpers
944- intel/compiler: Use brw_ud* helpers in thread payload code
945- nir/lower_task_shader: Don't fail adding a launch when last instruction is a jump
946- spirv: Pass vtn_block into vtn_emit_branch()
947- spirv: Add workaround to ignore OpReturn after OpEmitMeshTasksEXT
948- spirv: Call vtn_emit_ret_store() only when handling return branch
949- spirv: Make a helper function to check glslang/shaderc
950- spirv: Enable OpEmitMeshTasksEXT workaround for Clay Shader Compiler
951- intel/compiler: Explicitly include build-id when linking intel_clc
952- nir: Don't reorder volatile intrinsics
953- intel/compiler: Fix dynarray usage in intel_clc
954- intel/compiler: Fix missing tie-breaker in brw_nir_analyze_ubo_ranges() ordering code
955
956Caleb Cornett (3):
957
958- wgl: Homogenize read framebuffer reference counting logic
959- wgl: Always release local references in stw_make_current_by_handles
960- wgl: Release the context before deleting it in wglDeleteContext
961
962Chad Versace (43):
963
964- venus: Remove cryptic pointer arithmetic on present attachments
965- venus: Drop vn_present_src_attachment::acquire
966- venus: Track subpass properties
967- venus/pipeline: Refactor vn_fix_graphics_pipeline_create_info
968- venus/pipeline: Add pipeline state bools for ignore rules
969- venus/pipeline: Split ignore rules for pViewportState, pMultisampleState
970- venus/pipeline: Fix ignore rule for pDepthStencilState
971- venus/pipeline: Fix ignore rule for pColorBlendState
972- venus/pipeline: Ignore pViewportState::pViewports
973- venus/pipeline: Ignore pViewportState::pScissors
974- venus/pipeline: Ignore basePipelineHandle
975- venus/pipeline: Fix ignore rules for VK_KHR_dynamic_rendering
976- venus/pipeline: Fix ignore rules for VK_EXT_extended_dynamic_state
977- venus: Dedupe pipeline handle creation
978- venus: Fix failure path on pipeline creation
979- venus: Enable VK_EXT_pipeline_creation_cache_control
980- venus: Sort passthrough extensions
981- vulkan: Make vk_copy_struct_guts() public
982- venus: Simplify vn_GetPhysicalDeviceFeatures2
983- venus: Simplify vn_GetPhysicalDeviceProperties2
984- venus: Enable VK_KHR_shader_integer_dot_product
985- venus: Enable VK_KHR_shader_non_semantic_info
986- venus: Enable VK_KHR_shader_terminate_invocation
987- venus: Enable VK_KHR_zero_initialize_workgroup_memory
988- venus: Enable VK_EXT_subgroup_size_control
989- venus: Enable VK_EXT_texel_buffer_alignment
990- venus: Enable VK_EXT_ycbcr_2plane_444_formats
991- venus: Enable VK_KHR_format_feature_flags2
992- venus: Enable VK_EXT_private_data
993- venus: Enable VK_EXT_tooling_info
994- venus: Document VK_EXT_texture_compression_astc_hdr support
995- venus: Enable VK_EXT_pipeline_creation_feedback
996- venus: Rename some feature/property structs
997- venus: Refactor VN_ADD_TO_PNEXT
998- venus: Add macros VN_SET_CORE_*
999- venus: Fix features/properties for unavailable extensions
1000- venus: Use VkPhysicalDeviceVulkan13{Features,Properties}
1001- venus/queue: Rename some local vars
1002- venus: Change signature of vn_get_fence_feedback_cmd
1003- venus: Add vn_queue_wait_idle_before_present()
1004- venus: Refactor vn_command_buffer_builder temp storage
1005- venus: Enable VK_KHR_synchronization2
1006- venus: Enable Vulkan 1.3
1007
1008Charmaine Lee (6):
1009
1010- svga: fix mksstats build
1011- svga: fix SetConstantBufferOffset command opcode
1012- svga: compare graphics shader stages against COMPUTE
1013- mesa/st: fix reference to nir->info after nir_to_tgsi
1014- svga: support TGSI_SEMANTIC_TEXCOORD in swtnl draw context
1015- svga: fix invalid component access of domain location
1016
1017Chia-I Wu (83):
1018
1019- vulkan: sort ALLOWED_ANDROID_VERSION by api levels
1020- vulkan: fix ALLOWED_ANDROID_VERSION for api level 26
1021- vulkan: fix ALLOWED_ANDROID_VERSION up to api level 29
1022- vulkan: extend ALLOWED_ANDROID_VERSION up to api level 31
1023- turnip: add tu_common.h as the common header
1024- turnip: remove includes that are already in tu_common.h
1025- turnip: add tu_drm.h
1026- turnip: add tu_suballoc.h
1027- turnip: update tu_cs.h
1028- turnip: add tu_query.h
1029- turnip: add tu_image.h
1030- turnip: add tu_formats.h
1031- turnip: update tu_descriptor_set.h
1032- turnip: add tu_shader.h
1033- turnip: add tu_pipeline.h
1034- turnip: add tu_clear_blit.h
1035- turnip: add tu_dynamic_rendering.h
1036- turnip: add tu_lrz.h
1037- turnip: add tu_pass.h
1038- turnip: add tu_wsi.h
1039- turnip: update tu_autotune.h
1040- turnip: add tu_device.h
1041- turnip: add tu_cmd_buffer.h
1042- turnip: add tu_android.h
1043- turnip: update tu_util.h
1044- turnip: move away from tu_private.h
1045- turnip: remove tu_private.h
1046- turnip: remove headers from libtu_files
1047- turnip: use SPDX-License-Identifier
1048- vulkan: disallow VK_KHR_performance_query on android
1049- turnip: fix a use-after-free in autotune
1050- turnip: remove unused member in tu_submission_data
1051- turnip: handle fence wrapping in autotune
1052- freedreno/pps: improve interaction with turnip
1053- turnip: lower the queue priority to 1
1054- turnip: pCreateInfo->pPoolSizes[i] is wordy
1055- turnip: trim down host size of tu_descriptor_pool
1056- turnip: trim down bo size of tu_descriptor_pool
1057- turnip: improve perfetto sync_timestamp
1058- turnip: fix a missing trace_end_gmem_clear
1059- turnip: move trace_start_gmem_store before cond exec
1060- turnip: fix gem_store tracepoint
1061- turnip: tidy up tracepoint header includes
1062- util/u_trace: include the generated header first
1063- util/u_trace: add PERFETTO HeaderScope
1064- turnip: add tu_clone_trace_range helper
1065- turnip: convert tu_perfetto_state to a stack
1066- turnip: clean up tu_perfetto.h
1067- turnip: rename some tracing stages
1068- turnip: add cmd_buffer tracepoint
1069- turnip: improve tracing of secondary cmd buffers
1070- ir3: fix predicate splitting in scheduler
1071- util/perf: move u_perfetto to here
1072- util/perf: add cpu_trace.h
1073- util/perf: fix multiple tracepoints in a scope
1074- util/perfetto: add a simple C wrapper for track events
1075- util/perf: support and prefer perfetto for cpu trace
1076- venus: use MESA_TRACE_*
1077- pps: enable track_event in system.cfg
1078- docs: update perfetto with the latest status
1079- vulkan/wsi: add cpu tracepoints to swapchain functions
1080- vulkan/wsi: add cpu tracepoints for internal waits
1081- turnip: add some cpu tracepionts
1082- freedreno: get cpu timestamp again after FD_TIMESTAMP
1083- util/perf: ignore ut->enabled for iterators
1084- util/perf: remove u_trace::enabled
1085- util/perf: add u_trace_instrument
1086- util/perf: simplify u_trace_instrument
1087- util/perf: use ALWAYS_INLINE for tracepoints
1088- turnip: reduce MAX_STORAGE_BUFFER_RANGE
1089- vulkan: update ALLOWED_ANDROID_VERSION for api level 33
1090- subprojects: uprev perfetto to v29.0
1091- turnip: fix error handling for tu_queue_init
1092- turnip: add tu_physical_device::submitqueue_priority_count
1093- turnip: add support for VK_KHR_global_priority
1094- turnip: use vk_buffer
1095- turnip: use vk_descriptor_set_layout
1096- vulkan: update comments to device enumeration callbacks
1097- turnip: fix kgsl tu_enumerate_devices return code
1098- turnip: advertise VkExternalFenceProperties correctly
1099- venus: fix VkImageFormatListCreateInfo for AHB
1100- freedreno/pps: use 64-bit reads when possible
1101- freedreno/pps: loop countables by references
1102
1103Christian Gmeiner (18):
1104
1105- lima/ci: There is no need to redefine HWCI_TEST_SCRIPT
1106- lima/ci: Make use of .lava-piglit
1107- isaspec: Move isa_decode(..) declaration
1108- ci: include etnaviv support in ARM64 container
1109- ci: switch to mkbootimg.py
1110- ci/bare-metal: introduce BM_MKBOOT_PARAMS
1111- ci/etnaviv: add GC7000 support
1112- etnaviv: Remove possibility to force MSAA
1113- etnaviv: rs: Color resolve works
1114- etnaviv: rs: Directly call etna_try_rs_blit(..)
1115- etnaviv: rs: msaa: Set cache mode for SMALL_MSAA
1116- etnaviv: Update headers from rnndb
1117- etnaviv: blt: Add color resolve support
1118- etnaviv: blt: Directly call etna_try_blt_blit(..)
1119- Revert "etnaviv: completely turn off MSAA"
1120- etnaviv: Support negative float inline immediates
1121- etnaviv: nir: lower extract byte/word
1122- etnaviv: Hide MSAA support behind debug flag
1123
1124Christophe (2):
1125
1126- zink: add profile documentation
1127- Zink: add Zink profiles file
1128
1129Connor Abbott (49):
1130
1131- tu: Decouple vertex input state from shader
1132- tu: Implement VK_EXT_vertex_input_dynamic_state
1133- tu: Fix sysmem depth attachment clear flushing
1134- ir3/spill: Fix extracting from a vector at the end of a block
1135- nir: Free instructions more often
1136- tu/lrz: Fix multiple subpass case with secondaries
1137- tu/lrz: Fix multiple depth attachment case with secondaries
1138- tu: Fix descriptor set size bounds
1139- tu: Don't preload variable-count descriptors
1140- tu: Initial implementation of VK_EXT_inline_uniform_block
1141- tu: Also set SP_FS_MRT_REG for unused attachments
1142- tu: Fix case where vertex input is set but not vertex buffers
1143- tu: Remove unused need_indirect_descriptor_sets
1144- tu: Group pipeline state based on VkPipeline*StateCreateInfo
1145- tu: Rename RB_MSAA_CNTL to RB_BLIT_GMEM_MSAA_CNTL
1146- tu: Always set RB_BLIT_GMEM_MSAA_CNTL when clearing GMEM
1147- tu: Make MSAA emission always dynamic
1148- tu: Split out primtype calculation for tess
1149- ir3: Don't use msaa key for a6xx+
1150- tu: Nuke tu_pipeline_cache
1151- tu: Don't use layer_zero/view_zero in shader key
1152- tu, ir3: Move multi_pos_output to ir3_shader_variant
1153- tu: Don't use output state to compute render components
1154- tu, ir3: Determine dual-src blend from shader for FS outputs
1155- tu: Emit \*_OUTPUT_CNTL1 as part of blend state
1156- tu: Move no_earlyz computation to blend/msaa state
1157- tu: Split up prim order computation
1158- tu: Use vk_pipeline_shader_stage_to_nir
1159- tu: Abstract driver-specific const state
1160- tu: Initial implementation of VK_EXT_graphics_pipeline_library
1161- tu: Fix maxPerStageDescriptorUpdateAfterBindInputAttachments
1162- tu: Set textures_used for input attachments correctly
1163- ir3/analyze_ubo_ranges: Account for reserved consts
1164- Revert "freedreno,ir3: rename Z_CLAMP_ENABLE to Z_CLIP_DISABLE"
1165- freedreno/a6xx, tu: GRAS_CL_CNTL::UNK5 is Z_CLAMP_ENABLE
1166- tu: Fix setting RB_DEPTH_CNTL::Z_CLAMP_ENABLE
1167- ir3: Delete outputs from fixup_regfootprint()
1168- ir3: Move fixup_regfootprint() to ir3_collect_info()
1169- ir3/analyze_ubo_ranges: Account for reserved consts better
1170- ir3: Add missing cat5 encoding to asm parser
1171- tu: Fix param_stride placement
1172- tu: DS primitive stride does not use patch control points
1173- tu: Implement extendedDynamicState2PatchControlPoints
1174- freedreno: Add LABEL flush
1175- tu: Add compute shader instrlen workaround
1176- tu: tu: Clear patchControlPoints dirty state with static patchControlPoints
1177- tu: Use right enum for compute active_shader_stages
1178- tu: Fix binding NULL descriptor sets
1179- ir3: Don't save/restore disasm string pointer
1180
1181Constantine Shablya (3):
1182
1183- nir: add a pass to remove non-uniform access qualifier when the operands are uniform
1184- radv: use nir_opt_uniform_access
1185- anv: use nir_opt_uniform_access
1186
1187Corentin Noël (9):
1188
1189- virgl/ci: Update virglrenderer
1190- ci: disable the freedreno farm.
1191- virgl/ci: Update virglrenderer
1192- gallivm: avoid the use of an uninitialized value
1193- virgl: Add TEXTURE_SHADOW_LOD capability support
1194- virgl: Add nir_lower_tex pass
1195- mesa/main: do not copy the exact size of the string
1196- gallium/hud: use snprintf(..., "%s", ...) instead of strncpy
1197- ci: Remove Wno-error=stringop-truncation from debian-arm64-asan
1198
1199Damian Korczowski (1):
1200
1201- d3d12: Fix placed footprint of subresources
1202
1203Daniel Schürmann (20):
1204
1205- nir/opt_algebraic: a & ~a -> 0
1206- aco: use std::vector::reserve() more often
1207- aco/live_var_analysis: implement faster merging of live_out sets for some cases
1208- aco/optimizer: disallow can_eliminate_and_exec() with s_not
1209- aco/optimizer: do can_eliminate_and_exec() optimization later
1210- aco/optimizer: optimize s_and(exec, s_and(x, y)) more aggressively
1211- aco/optimizer: change inverse_comparison in-place
1212- radv/rt: create separate radv_rt_pipeline struct
1213- radv/rt: inline radv_rt_pipeline_create_() helper into radv_rt_pipeline_create()
1214- aco: simplify operands_offset calculation in create_instruction()
1215- aco: implement custom memory resource
1216- aco: use monotonic_buffer_resource for instructions
1217- aco: implement allocator_traits for monotonic_allocator<T>
1218- aco/opt_value_numbering: use monotonic_allocator for unordered_map
1219- aco/spill: Fix spilling of Phi operands
1220- aco/ra: only rename fixed Operands if the copy-location matches
1221- aco: change thread_local memory resource to pointer
1222- nir: add AMD RT traversal intrinsics
1223- radv/rt: create traversal shader independent from main shader
1224- radv/rt: overwrite hit args with undef in case of a miss
1225
1226Daniel Stone (4):
1227
1228- ci: Use mold for x86-64 and AArch64 builds
1229- CI: Collabora farm down for maintenance
1230- CI: Re-enable Collabora devices
1231- Revert "panfrost/ci: Disable t720 jobs"
1232
1233Danilo Krummrich (10):
1234
1235- nv50/ir/nir: add conversion ops for bit width < 32
1236- nv50/ir/nir: convert to 32 bit for all OP_SET opcodes
1237- nv50/ir/nir: avoid 8/16 bit dest regs for OP_MOV
1238- nv50/ir: add isUnsignedIntType() and isIntType() helpers
1239- nv50/ir/nir: convert 8/16 bit src to 32 bit for {i,u}2f64
1240- nv50/ir/nir: always round towards zero for f2i/f2u
1241- nv50/ir: add intermediate conversion for f2{i,u}{8,16}
1242- nv50/ir: split and cvt 64bit integers for {i,u}2{i,u}{8,16}
1243- nv50/ir: handle S8/S16 integers converting to S64
1244- nv50/ir: handle U8/U16 integers converting to U64
1245
1246Danylo Piliaiev (26):
1247
1248- tu: Flush depth on depth img transition from undef layout
1249- tu: Disable LRZ write when alpha-to-coverage is enabled
1250- freedreno: Disable LRZ write when alpha-to-coverage is enabled
1251- tu: Assert that if draw state is enabled it has valid iova
1252- tu: Switch to userspace iova allocations if kernel supports it
1253- tu: Enable bufferDeviceAddressCaptureReplay
1254- tu: Disable userspace iova allocations
1255- tu: Implement VK_EXT_attachment_feedback_loop_layout
1256- freedreno: PC_SO_STREAM_CNTL_STREAM_ENABLE has per-stream enable bits
1257- tu: Fix streamout with tess_use_shared
1258- tu: Update HS_WAVE_INPUT_SIZE formula
1259- util/u_trace: Add locking for ctx_list
1260- freedreno: Name more \*_DBG_ECO_CNTL regs
1261- freedreno: Add all variable magic regs to device-info tables
1262- tu: Use newly obtained magic reg values
1263- tu: Trivially expose VK_EXT_rasterization_order_attachment_access
1264- ir3: Prevent reordering movmsk with kill
1265- turnip: implement VK_EXT_multi_draw
1266- turnip: Implement VK_EXT_mutable_descriptor_type
1267- tu: Expose Vulkan 1.3
1268- tu: Do not DCE unused output vars used for transform feedback
1269- tu: Lazily init VSC to fix dynamic rendering in secondary cmdbufs
1270- tu: Reset whether there is DS resolve for dynamic subpass
1271- tu: Fix the size of patch control points state
1272- freedreno: Add 'replay' tool which allows to replay cmdstreams
1273- freedreno/fdl: Increase alignment for UBWC images
1274
1275Dave Airlie (85):
1276
1277- gallivm: fix printf hook for cached shaders.
1278- llvmpipe: add shader clock support
1279- lavapipe: enable shader clock
1280- draw: fix up jit type creation for gs/tcs/tes
1281- llvmpipe/gallivm/draw: introduce a buffer type.
1282- zink: add defines for the shader stage templates.
1283- virgl: abstract virgl shader stages from pipe shader stages.
1284- svga: compare shader type against compute not tess eval.
1285- gallium/tgsi: reorder pipe shader type defines.
1286- gallium/iris/crocus: collapse a bunch of conversion functions.
1287- gallium: fixup some inconsistent uses of enum pipe_shader_type.
1288- gallium: use gl shader types as the basis for the gallium ones
1289- gallivm/nir/st: lower image derefs in advance.
1290- lavapipe: lower images to non-derefs in vulkan side
1291- lavapipe: scan shader for info before lowering images.
1292- llvmpipe/ci: update ci results for clover.
1293- draw: don't touch info values that aren't valid.
1294- nir_to_tgsi_info: drop const_buffers_declared
1295- crocus: sync performance monitor code with iris.
1296- vulkan: update rest of the headers to v1.3.225
1297- gallivm: drop unused parameter to lp_build_sample_aos
1298- gallivm/sample: remove unused base parameter from dynamic callbacks.
1299- gallivm: don't indirect image/sampler destroy.
1300- llvmpipe: finish rendering before flushing frontbuffer resources.
1301- radv: move to using common record_result.
1302- turnip: use common command record result.
1303- panvk: use common command buffer record result.
1304- dozen: move to use common command buffer result code.
1305- lavapipe: fix 3d depth stencil image clearing.
1306- gallivm: add atomic 32-bit float support
1307- llvmpipe: export GL_NV_shader_atomic_float
1308- lavapipe: export VK_KHR_shader_atomic_float
1309- docs: add new llvmpipe/lavapipe atomic float extensions
1310- lavapipe: use slow clear path if conditional rendering.
1311- llvmpipe: handle different blocksizes for sampler/image views
1312- lavapipe: take layers into account around render surface creation.
1313- lavapipe: handle view mask layer count properly for begin rendering.
1314- llvmpipe: memset empty images in the key
1315- llvmpipe: bind sampler views/images properly.
1316- gallivm: handle llvm coroutines for llvm > 15
1317- gallivm: fix buffer_type for LLVM 15
1318- lavapipe: add fmin/fmax to image lowering.
1319- gallivm/nir: fix fmin/fmax translation
1320- gallivm/nir: bitcast when non-float ptr type.
1321- llvmpipe/fs: start passing explicit context pointer type.
1322- llvmpipe/fs: convert thread data ptr to opaque ptr friendly apis
1323- llvmpipe/fs: convert linear context to opaque pointers friendly
1324- llvmpipe/cs: convert cs context to opaque friendly api
1325- llvmpipe/cs: convert cs thread data to opaque friendly api
1326- llvmpipe/fs: fix invocations access for opaque ptrs.
1327- llvmpipe/setup: move setup code to explicit pointers.
1328- llvmpipe/cs: move compute code to explicit pointer types
1329- gallivm/struct: add opaque ptr friendly pointer accessors.
1330- llvmpipe/fs: add mask_type for mask_store accesses.
1331- llvmpipe/fs: pass explicit mask_type into interp code.
1332- llvmpipe/fs: port depth code to opaque pointer api
1333- llvmpipe/fs: add sample position type to the interp interface
1334- llvmpipe/fs: pass mask type to alpha to coverage handler
1335- llvmpipe/fs: use explicit api in viewport code.
1336- llvmpipe/fs: handle explicit types around blending and c/zs bufs calcs
1337- llvmpipe/fs: cleanup some remaining mask handling and reuse types
1338- llvmpipe/fs: convert outputs handling to explicit api
1339- gallivm/nir: move to explicit pointer interfaces.
1340- gallivm/nir: fixup atomic path for explicit pointers.
1341- gallivm/sample: move border color load to explicit pointers
1342- gallivm/llvmpipe: hand sample position type in for loading.
1343- llvmpipe/fs/linear: port to explicit pointers.
1344- draw: fix gs/tes explicit pointer handling
1345- draw: handle primid for tess being a different pointer type
1346- gallivm/tgsi: port tgsi to explicit types.
1347- gallium/tgsi: handle temps/outputs array.
1348- vulkan: update beta and video headers to 1.3.230
1349- llvmpipe/tests: port to new pointer interfaces.
1350- gallivm/llvmpipe: add opaque pointers support to sampler
1351- draw/llvmpipe: move texture/sampler/image member load to opaque.
1352- draw/llvmpipe: add way to return pointer types to generic code.
1353- gallivm/sample: use retrieved types to do opaque pointer loads.
1354- gallivm: remove legacy pointer_get apis
1355- gallivm/sample: move some first_level/last_level calcs out
1356- gallivm/nir: drop some unused struct members.
1357- gallivm/sample: refactor multisample offset calcs code.
1358- glsl/types: fix dword slots calc for float16 matricies.
1359- clover/llvm: add kernel arg info for opaque pointers
1360- nir/lower_bool_to_int32: add support for lowering functions.
1361- gallivm: add coro malloc hooks earlier and always.
1362
1363David Heidelberg (71):
1364
1365- ci: compress LAVA rootfs with zstd instead of gzip
1366- ci: replace gzip usage with zstd where posible
1367- ci/bin: ignore Gitlab GQL cache
1368- ci: Turn off the entire Lima farm (devices report out-of-space)
1369- ci: move DXVK instalation outside of x86_test-vk
1370- ci: make shellcheck happy on dxvk script
1371- ci: separate wine setup into own script
1372- ci: Windows runner is experiencing DNS issues; disable Microsoft farm
1373- ci: prepare piglit-traces for WINE and DXVK
1374- ci/lavapipe: implement traces
1375- docs/ci: show how to run traces locally
1376- ci/lavapipe: wine flakes, switch to manual for now
1377- ci: use apt-get in x86_test-vk instead of apt
1378- ci: do not install i386 sub-architecture in VK images
1379- ci: traces: update freedreno and broadcom to brotli compressed traces
1380- ci: GraphQL was already implemented in ci_run_n_monitor.py
1381- ci: implement stress testing jobs in ci_run_n_monitor.py
1382- ci: fix leftover tag in image-tags.yml
1383- ci: rename debian-gallium to debian-build-testing
1384- ci: move lava-test into debian-build-testing
1385- ci: make shellcheck happy about .gitlab-ci/container/ directory
1386- ci: use shellcheck for .gitlab-ci/container/ directory
1387- ci: drop last bits of wine32
1388- ci: bump cross compilation to LLVM 13 where possible
1389- ci: uprev piglit 2022-08-30
1390- ci: set venus on lavapipe to manual due to flakes
1391- ci: print env as other setups do
1392- ci/lava: print set-job-env-vars.sh as other setups do
1393- ci/traces: do not keep a cache, which will be wiped a few steps later
1394- ci/test: collapse printing env
1395- ci/lava: collapse printing env
1396- ci/freedreno: convert traces to new YAML format
1397- ci/virgl: convert traces to new YAML format
1398- ci/llvmpipe: convert traces to new YAML format
1399- ci/lavapipe: convert traces to new YAML format
1400- ci/broadcom: convert traces to new YAML format
1401- ci/panfrost: convert traces to new YAML format
1402- ci/radeonsi: convert traces to new YAML format
1403- ci/intel: convert traces to new YAML format
1404- ci/i915: convert traces to new YAML format
1405- ci/crocus: convert traces to new YAML format
1406- ci: check traces.yml files with yamllint
1407- ci: performance traces: make use of no-perf label
1408- ci: export whole results/ directory for all traces jobs
1409- ci: split ci_run_n_monitor into script and shared parts
1410- ci: introduce update_traces_checksum.py
1411- ci: uprev piglit 2022-09-08
1412- ci: use xargs instead of find -exec
1413- ci: add jq utility
1414- ci/traces: remove first line with YAML version to prevent failure
1415- ci/radeonsi: drop glmark2 terrain trace
1416- ci/panfrost: drop glmark2 terrain trace
1417- ci/intel: drop glmark2 terrain trace
1418- ci/lava: add wine and apitrace into amd64 rootfs container
1419- ci/lava: prepare wineprefix for GL and DXVK
1420- ci/dxvk: fix DXVK 64-bit only wine setup
1421- ci: also handle default wine rootfs (for testing)
1422- ci/traces: prevent showing wine instalation dialog
1423- ci/radeonsi: add traces for radeonsi Zork
1424- ci/amd: move radeonsi in amd directory
1425- ci: uprev DXVK to 1.10.3
1426- ci/traces: Blender demo (Cube Diorama) flakes on Intel APL
1427- ci/freedreno: add Guilherme Gallo into restricted traces access list
1428- ci/panfrost: deduplicate gitlab-ci.yml
1429- ci/panfrost: enable piglit-gl on g52 again and deparalelize
1430- ci/panfrost: Humus Portal trace got fixed, update checksum
1431- ci/panfrost: re-enable traces on t760
1432- ci/linker: update mold to 1.6 and enable on s390x
1433- ci/zink: rename zink job to zink-lvp to better describe it
1434- ci/freedreno: disable antichambers trace
1435- ci/update_traces_checksum.py: check if checksum is in the array, not it's value
1436
1437David Riley (1):
1438
1439- drm-shim: Allow drm-shim to work with glibc fortify.
1440
1441Dawn Han (11):
1442
1443- Update venus-protocol to add extension \`VK_VALVE_mutable_descriptor_type`
1444- Increase enum type in vn_descriptor_set.
1445- Update bitset when init descriptor layout.
1446- Track bitset when create descriptor pool
1447- Enable .VALVE_mutable_descriptor_type in physical_device
1448- venus: support the promoted VK_EXT_mutable_descriptor_type
1449- venus: Update mutable descriptor struct type
1450- venus: extend lifetime of push descriptor set layout
1451- venus: extend VkPipelineLayout lifetime for batched VkCmdPushConstants()
1452- venus: implement vkCmdPushDescriptorSetWithTemplateKHR
1453- venus: enable VK_KHR_push_descriptor
1454
1455Diogo Ivo (1):
1456
1457- nouveau: treat DRM_FORMAT_INVALID as implicit modifier
1458
1459Dmitry Baryshkov (1):
1460
1461- freedreno/registers: update hdmi registers to add more 8x74 regs
1462
1463Dmitry Osipenko (11):
1464
1465- util/disk_cache: Add option to disable compression
1466- util/disk_cache: Enable testing of uncompressed caches
1467- util/disk_cache: Append to disk_cache_load_cache_index() with _foz
1468- util/disk_cache: Move struct cache_entry_file_data to the disk_cache_os.h
1469- util/disk_cache: Make disk_cache_os.h usable by C++ code
1470- util/disk_cache: Add new mesa-db cache type
1471- virgl: Fix unmapping of blob resources
1472- llvmpipe: Align persistent mappings to page size
1473- ci: Bump crosvm version
1474- ci: Update virgl-iris-traces checksums
1475- ci/virgl: Enable virgl-iris-traces
1476
1477Dylan Baker (47):
1478
1479- docs: reset new features
1480- docs: add release notes for 22.1.5
1481- docs: Add sha256 sum for 22.1.5
1482- docs: add release notes for 22.1.4
1483- docs: Add sha256 sum for 22.1.4
1484- docs: update calendar and link releases notes for 22.1.4
1485- docs: update calendar and link releases notes for 22.1.5
1486- docs: update calendar for 22.2.0-rc1
1487- docs: add release notes for 22.2.0
1488- docs: Add sha256 sum for 22.2.0
1489- docs: update calendar and link releases notes for 22.2.0
1490- docs: Add calendar entries for 22.2 release.
1491- intel/genxml: use a single pattern for res
1492- intel/genxml: use a set for lookups
1493- intel/genxml: don't use parens with python assert statement
1494- intel/genxml: add type annotations to gen_sort_tags.py
1495- intel/genxml: use \`a not in {x, y}` instead of \`a != x or a != y`
1496- intel/genxml: remove unused variable in gen_sort_tags.py
1497- intel/genxml: don't overwrite the input of gen_sort_tags in place
1498- intel/genxml: add 'nonzero' to gen_sort_tags
1499- intel/genxml: add a validation mode to gen_sort_tags
1500- intel/genxml: run gen_sort_tags on all of the xml
1501- meson: run genxml sort tests
1502- docs: add release notes for 22.2.1
1503- docs: Add sha256 sum for 22.2.1
1504- docs: update calendar and link releases notes for 22.2.1
1505- intel/genxml: write a space before the '/>` closing tag
1506- intel/genxml: re-process with space before \`/>`
1507- intel/genxml: remove newlines between xml islands
1508- intel/genxml: re-process with extra whitespace removed
1509- intel/genxml: use ElementTree.write and .indent
1510- intel/genxml: reprocess xml with elementree output
1511- intel/genxml: split the processing code up
1512- intel/genxml: use a main() function
1513- intel/genxml: use Path.replace instead of unlink and rename
1514- docs: add release notes for 22.2.2
1515- docs: Add sha256 sum for 22.2.2
1516- docs: update calendar and link releases notes for 22.2.2
1517- docs: Extend calendar entries for 22.2 by 2 releases.
1518- util/indicies: move common static functions to private header
1519- util/indicies: use itertools.product instead of deeply nested loops
1520- util/indicies: write a file with u_indicies_gen.py
1521- util/indicies: simplify some of u_indicies_gen.py
1522- util/indicies: use itertools.product in u_unfilled_gen.py
1523- util/indicies: convert u_unfilled_gen.py to write a file
1524- util/glsl2spirv: fix type error in argument handling
1525- util/glsl2spirv: fix appending extra flags
1526
1527Ella Stanforth (2):
1528
1529- util: fix missing fcntl.h on musl
1530- vulkan: wsi: fix musl build
1531
1532Emma Anholt (108):
1533
1534- ci/bare-metal: Re-open serial and everything after test phase timeout.
1535- zink: Take the tiling path for resources with modifiers.
1536- zink: Replace the "optimal_tiling" flag with a "linear" flag instead.
1537- ci/zink: Re-enable a630 zink-on-turnip traces job using gbm as the backend.
1538- ci/swrast: Add some flakes I've noticed in the IRC channel.
1539- ci: Add testing of the khr-single tests.
1540- ci: Upgrade deqp-runner to 0.15.0.
1541- zink: Make sure that we keep the existing ici pNext chain on inserts.
1542- tu: Move the vkCreateImage format list checks to helper functions.
1543- tu: Treat viewFormatCount==0 as no format list.
1544- tu: Use the format list to decide whether we can do UBWC with MUTABLE.
1545- ci/tu+zink: Re-enable the traces that were asserting about UBWC compatibility.
1546- ir3: Suppress disasm of internal shaders unless IR3_SHADER_DEBUG=internal.
1547- tu: Only emit as many bindless regs as we have seen descriptor sets.
1548- tu: Only emit as many VPC interp/repl regs as will be referenced.
1549- tu: Only emit FS output regs for as many MRTs as we have (but at least 1).
1550- tu: Emit only as many VBs as we've ever seen bound on the command buffer.
1551- ci/tu: Add another cwe_after_bind fail that appeared in the reshuffle.
1552- ci: Move 'never' rules includes above "on_success" rules includes.
1553- ci: disable the freedreno farm.
1554- freedreno/regs: Add a bit of documentation of what SKIP_IB2 does.
1555- turnip: Fix enabling of IB2 skipping.
1556- nir/lower_mediump: Lower FS outputs to 16-bit when the value was upconverted.
1557- nir_to_tgsi: Add support for TG4 with explicit offsets.
1558- nir_to_tgsi: Add support for 64-bit fsign/isign.
1559- nir_to_tgsi: Add support for demote, is_helper_invocation, and subgroup ops.
1560- nir: Make nir_lower_discard_if() handle demotes and terminates, too.
1561- nir_to_tgsi: Use nir_lower_discard_if for demote_if.
1562- nir_to_tgsi: Add support for bindless textures and images.
1563- turnip: Make sure bandwidth config per pixel starts from 0.
1564- freedreno/ir3: Move chip-specific nir compiler options to C code.
1565- freedreno/ir3: Switch to NIR for a3xx/a4xx's vertex id lowering.
1566- gallium,glsl: Delete PIPE_CAP_VERTEXID_NOBASE and lower_vertex_id.
1567- nir: Add a pass to lower mediump temps and shared mem.
1568- spirv: Mark phis as mediump instead of directly lowering them to 16 bit.
1569- turnip: Enable lowering of mediump temps/CS shared to 16-bit.
1570- turnip: Treating non-d/s-write pipelines as not having d/s feedback loops.
1571- mesa: Remove PIPE_CAP_CS_DERIVED_SYSTEM_VALUES_SUPPORTED/lower_cs_derived.
1572- zink: Don't upload shader immediate arrays through UBO 0.
1573- zink: Don't lower indirect derefs of temp arrays.
1574- Revert "ci: disable the freedreno farm."
1575- ci: Add support for Jetson TK1.
1576- nouveau: Fix compiler warnings about silly address checks in ir_print.
1577- ci/turnip: Add missing a618 full-run bypass fails.
1578- turnip: Only emit descriptor loads for active stages in the pipeline.
1579- freedreno/ir3: Consistently lower mediump inputs to 16-bit (when we can).
1580- nir/lower_mediump_vars: Don't lower mediump shared vars with atomic access.
1581- mesa: Lower mediump temps and CS shared when the driver supports FP16+INT16.
1582- turnip: Add some missing LRZ disable debug.
1583- turnip: Ignore dynamic color write enables past our number of attachments.
1584- turnip: Don't look at RB.Z_READ_ENABLE for setting LRZ.Z_TEST_ENABLE.
1585- turnip: Skip rather than invalidate LRZ on gl_FragDepth writes.
1586- turnip: Ignore pDescriptorCounts[] for non-variable-count layouts.
1587- turnip: Keep a host copy of push descriptor sets.
1588- ci/turnip: Use all 9 a618 runners at once for VK testing.
1589- ci/turnip: Bump the full-run a618 runner count to 3.
1590- ci/turnip: Re-enable the compressed cubemap tests.
1591- ci/turnip: Allow running spirv_ids_abuse in full VK runs.
1592- ci/turnip: Drop a couple of spillall skips.
1593- ci/turnip: Move some 15-second-ish test skips to pre-merge skips.
1594- ci/nouveau: Drop BM_POE_TIMEOUT.
1595- ci/nouveau: Rearrange job setup variables.
1596- ci/nouveau: fix up Jetson Nano
1597- ci/nouveau: Update gm20b xfails.
1598- ci: Update kernel to drm-next-2022-08-12-1
1599- turnip: Fix leak of autotune fence BOs.
1600- turnip: Add TU_DEBUG=bos to print stats of BOs live at submit time.
1601- turnip: Fix the "written stencil is unmodified" check.
1602- turnip: Use the simplified stencil write flags for the LRZ-allowed check.
1603- ci/zink+turnip: Add a manual full run of the dEQP CTS.
1604- turnip: Fix busy-waiting on syncobjs with OS_TIMEOUT_INFINITE.
1605- util/dag: Move the callback function pointer to the state.
1606- util/dag: Add a validation function.
1607- freedreno/ir3: Validate our scheduling DAGs after construction.
1608- nir/nir_opt_copy_prop_vars: Don't leak dynarray memory during the pass.
1609- nir/vars_to_ssa: Always do OOB load/store removal.
1610- glsl: Remove lower_output_reads.
1611- glsl: Remove do_set_program_inouts.
1612- glsl: Remove opt_array_splitting.
1613- glsl: remove opt_structure_splitting.
1614- glsl: Remove lower_vec_index_to_swizzle.
1615- ci/iris: Update iris traces checksums.
1616- turnip: Don't use the dynamic color write enable during non-dynamic.
1617- ci/zink: Clear stale xfails for turnip.
1618- zink: Skip border color clamping for compressed formats.
1619- zink: Enable ASTC texture format translation.
1620- zink: Map ETC1 to ETC2 to avoid uncompressing in the frontend.
1621- ci/zink: Add some more flakes for turnip.
1622- zink: Fix RelaxedPrecision decoration of texture samples.
1623- zink: Decorate mediump outputs as RelaxedPrecision.
1624- turnip: Add a perf_debug for feedback-related performance traps.
1625- turnip: Move the ubwc_possible check before mutable formats.
1626- turnip: Add perf_debug for UBWC being disabled due to mutable formats.
1627- turnip: Add perf debug for more UBWC-disable cases that we could support.
1628- zink: Fix dummy CB path decision for VK_EXT_cwe presence.
1629- turnip: Be sure we blit depth, not stencil, for Z32FS8 -> Z32F resolves.
1630- nir/opt_phi_precision: Fix missing swizzles when narrowing phi srcs.
1631- turnip: Enable LRZ testing (not writing) in the presence of discards.
1632- ci/freedreno: Update CivV trace expectation.
1633- ci/freedreno: Update known flakes/timeouts.
1634- turnip: Fix reservation for indirect compute's IR3_DP_SUBGROUP_ID_SHIFT.
1635- zink: No need to use a 2-sample dummy image for bindless without null descs.
1636- zink: Lazily allocate the dummy surfaces.
1637- zink: Enable mesa/st frontend shader caching.
1638- rusticl: Fix the invalid memory migration flags check.
1639- iris: Disable GLSL lower_const_arrays_to_uniforms.
1640- gallium: update docs about PIPE_CAP_PREFER_IMM_ARRAYS_AS_CONSTBUF.
1641- ir3/ra: Make sure we don't pick a preferred reg overflowing the file.
1642
1643Eric Engestrom (56):
1644
1645- bin/gen_release_notes.py: bump advertised vulkan version to 1.3
1646- pvr: fix memleak in error paths
1647- vk/device-select-layer: fix .sType of VkPhysicalDeviceGroupProperties
1648- docs/features: add missing extensions supported by anv
1649- docs/features: add missing extensions supported by lavapipe
1650- docs/features: add missing extensions supported by radv
1651- docs/features: add missing extensions supported by turnip
1652- docs/features: add missing extensions supported by venus
1653- docs/features: start tracking extensions supported by dozen
1654- docs/features: start tracking extensions supported by panvk
1655- docs/features: start tracking extensions supported by powervr
1656- docs/features: drop unsupported VK_KHR_display & VK_KHR_get_display_properties2 for lavapipe
1657- ci: rename GitHub CI workflow from \`ci` to \`macos` to be a bit more explicit about what it covers
1658- meson: drop duplicate \`lib` in lib name
1659- wsi/x11: fix memleak in wsi_x11_connection_create()
1660- aco: drop unused variable
1661- anv: convert assert into unreachable to avoid fallthrough error
1662- meson: replace manual compiler flags with meson arguments
1663- vc4: introduce VC4_DBG() macro to make VC4_DEBUG checks consistent
1664- v3d: introduce V3D_DBG() macro to make V3D_DEBUG checks consistent
1665- wayland: avoid writing interface names as free-form strings
1666- vc4: drop dead qir_POW() helper
1667- v3dv: implement VK_EXT_shader_module_identifier
1668- docs: move rusticl readme into docs/
1669- broadcom: fix dependencies in static_library() calls
1670- docs: add release notes for 22.1.6
1671- docs: update calendar and link releases notes for 22.1.6
1672- docs: add release notes for 22.1.7
1673- docs: update calendar and link releases notes for 22.1.7
1674- ci: unexport local variable (and fix formatting)
1675- ci: bump mold to 1.5
1676- docs/release-calendar: add 22.3 schedule
1677- v3dv: drop error overwrite
1678- vc4: don't lower uadd_carry
1679- vc4: don't use imprecise_32bit_lowering for idiv lowering
1680- vc4: pre-compile shaders to make up for the lack of draw calls in shader-db
1681- vc4: use PIPE_MASK_RGBA name instead of its value 0xf
1682- vc4: send shaderdb output through KHR_debug
1683- vc4: mark piglit copypixels-(draw-)sync as flaky
1684- VERSION: bump to 22.3.0-rc1
1685- .pick_status.json: Update to 1174f376096ed6ceebb0fb2810456f1501a68df7
1686- zink: add missing build_id linker args
1687- vk/runtime: drop incorrect UNUSED annotation
1688- v3dv: avoid freeing already-freed memory
1689- VERSION: bump for 22.3.0-rc2
1690- .pick_status.json: Update to 9bd11f65238ce101bf846f5528f9088630e983f7
1691- ci: avoid triggering vc4 & v3d tests on v3dv-only MRs
1692- VERSION: bump for 22.3.0-rc3
1693- .pick_status.json: Update to f2e535e4fed5bdc13e11a443316a4b77cd5eb81a
1694- v3d(v): account for debug flags when using the cache
1695- .pick_status.json: Mark 56bd81ee21cb42e4a5afa2939514b570a1283754 as denominated
1696- VERSION: bump for 22.3.0-rc4
1697- .pick_status.json: Update to 22be0d09a005b4b955a46b65a919cfd786d6814f
1698- .pick_status.json: Mark 470fbb35efe1935242b346427ec0fa22b40fff2b as denominated
1699- commit_in_branch.py: add support for checking staging branches
1700- docs: add release notes for 22.3.0
1701
1702Erico Nunes (1):
1703
1704- ci: enable lima farm
1705
1706Erik Faye-Lund (226):
1707
1708- docs: fixup link to virgl docs
1709- zink: add have_D32_SFLOAT_S8_UINT boolean
1710- zink: do not use VK_FORMAT_D32_SFLOAT_S8_UINT without checking
1711- zink: remove needless check
1712- glsl/tests: do not perform eol-conversion on windows
1713- zink: type_main -> type_void_func
1714- zink: add spirv_builder_function_call
1715- zink: wrap discard in a function
1716- zink: fix conditions for draw-parameters
1717- zink: fixup indent in caps-check
1718- mesa/main: simplify implmementation of _mesa_is_format_etc2
1719- mesa/main: simplify implementation of _mesa_is_format_astc_2d
1720- mesa/st: simplify st_compressed_format_fallback
1721- mesa/st: avoid double-mapping if both images are the same
1722- mesa/st: do not use memcpy when using compressed fallback
1723- mesa/st: add context-flag for bptc-support
1724- mesa: add format-helper for bptc
1725- mesa: add _mesa_unpack_bptc-function
1726- mesa/st: implement fallback for bptc
1727- mesa/st: enable bptc extension with fallback
1728- zink: allow X32 -> A32
1729- mesa/main: remove always-true define
1730- mesa/main: add fp16-versions of bptc decompression functions
1731- mesa: fall-back to fp16 for bptc-textures
1732- gallium/u_transfer_helper: clean up boolean flags
1733- gallium/u_transfer_helper: add in-place interleave flag
1734- zink: use full transfer-helper api
1735- gallium/u_transfer_helper: hide deinterleave helpers
1736- mesa/st: also override mip filter for integer textures
1737- mesa/st: always force integer filtering to nearest
1738- mesa/st: force nearest-filtering for fp32 textures
1739- zink: request forcing of fp32-samplers to nearest when needed
1740- mesa/st: do not blit when using compressed fallback
1741- zink: clamp miplodbias when creating sampler
1742- zink: add ntv support for demote
1743- zink: add spirv_builder support for is_helper_invocation
1744- zink: add ntv support for is_helper_invocation
1745- zink: expose demote when supported
1746- zink: lower discard/demote depending on derivatives
1747- Revert "Revert "ci: set venus on lavapipe to manual due to flakes""
1748- mesa/st: use _mesa_generate_mipmap for fallback-formats
1749- mesa/st: do not fallback to srgba formats for dxt
1750- util: allow unpacking less than a block from dxt
1751- mesa/st: add context-flags for s3tc
1752- mesa: add format-helpers for s3tc
1753- mesa/main: add _mesa_unpack_s3tc
1754- mesa/st: implement fallback for s3tc
1755- mesa/st: enable s3tc extensions with fallback
1756- v3d: do not pretend to fake rgtc-support
1757- vc4: respect z-offset in tiled blits
1758- vc4: do not attempt to do deep tiled blits
1759- docs/zink: remove bptc from required formats for gl4.2
1760- docs/zink: document rgtc requirement
1761- mesa/st: add context-flag for rgtc
1762- mesa: add format-helper for rgtc
1763- util/format: fix broken indentation
1764- util/format: allow unpacking less than a block from rgtc
1765- util/format: implement rgtc -> r8 / r8g8 unpack
1766- mesa/main: add _mesa_unpack_rgtc
1767- mesa/st: implement fallback for rgtc
1768- mesa/st: do not fall back to uncompressed for rgtc
1769- mesa/st: enable rgtc extension with fallback
1770- panfrost: do not fake rgtc-support
1771- docs: update staus of  mark GL_ARB_texture_compression_bptc
1772- lima: don't store width in resource-level
1773- lima: do not align width/height for non-shared resources
1774- mesa/st: add context-flag for latc
1775- mesa: add format-helper for latc
1776- mesa/main: add support for latc in _mesa_unpack_rgtc
1777- mesa/st: implement fallback for latc
1778- mesa/st: do not fall back to uncompressed for latc
1779- mesa/st: enable latc extensions with fallback
1780- freedreno: do not fake rgtc-support
1781- u_transfer_helper: rip out fake_rgtc code
1782- mesa/st: always use normalized coords for samplers
1783- mesa/st: always use normalized coords for samplers
1784- mesa/st: always use normalized coords for samplers
1785- zink: assert on unnormalized texcoords
1786- docs: add an amber article
1787- zink: emulate latc formats with rgtc
1788- zink: fix luminance clears
1789- zink: fixup indent
1790- zink: refactor swizzle-clamping code
1791- zink: fix luminance/luminance-alpha emulation
1792- zink: correct emulation of luminance-alpha formats
1793- ci/crocus: update ci results
1794- mesa/main: fix broken indent
1795- mesa/main: don't copy signed rgtc via float
1796- docs: fixup broken rst syntax
1797- gallium/u_threaded_context: remove stale comment
1798- gallium: normalized_coords -> unnormalized_coords
1799- zink: fix broken pool-alloc consolidation
1800- zink: fixup dynarray-type
1801- zink: use util_dynarray_clear
1802- mesa/st: rip out point-sprite cap
1803- mesa/main: remove driver-cap for ARB_point_sprite
1804- mesa/st: move static asserts out of translate_prim
1805- mesa/st: remove translate_mode
1806- docs: correct spelling of "implemented"
1807- docs: correct spelling of "attempts"
1808- docs: correct spelling of "bug fixes"
1809- docs: correct spelling of "specified"
1810- docs: correct spelling of "hierarchical"
1811- docs: correct spelling of "implementation"
1812- docs: correct spelling of "to"
1813- docs: correct spelling of "Khronos"
1814- docs: correct spelling of "choosing"
1815- docs: correct spelling of "subtly"
1816- docs: correct spelling of "Raspbian"
1817- docs: correct spelling of "shareable"
1818- docs: correct spelling of "simplification"
1819- docs: correct spelling of "synced"
1820- docs: correct spelling of "maximum"
1821- docs: correct spelling of "hierarchy"
1822- docs: correct spelling of "commutativity"
1823- docs: correct spelling of "compression"
1824- docs: correct spelling of "laid"
1825- docs: correct spelling of "average"
1826- docs: correct spelling of "addition"
1827- docs: correct spelling of "explicitly"
1828- docs: correct spelling of "rasterizer"
1829- docs: fractionary -> fractional
1830- docs: ie. -> i.e.
1831- docs: correct spelling of "correctly"
1832- docs: correct spelling of "traveling"
1833- docs: correct spelling of "freedreno"
1834- zink: rework primitive rasterization type logic
1835- zink: reuse rast_prim for line-rasterization check
1836- docs: correct spelling of "addition"
1837- docs: correct spelling of "passes"
1838- docs: correct spelling of "irrespective"
1839- docs: correct spelling of "semantic"
1840- docs: correct spelling of "specifies"
1841- docs: correct spelling of "Gouraud"
1842- docs: correct spelling of "Copr"
1843- docs: spell out "build directory"
1844- docs: spell out "SHA256 checksum"
1845- docs: spell out "environment variable"
1846- docs: spell out "distribution"
1847- docs: use the official spelling of SHA-1
1848- docs: eg. -> e.g.
1849- docs: XOR'd -> XORed
1850- docs: consistently upper-case VC4, V3D etc
1851- docs: consistently spell DRI in all-caps
1852- docs: consistently spell ANV in allcaps
1853- docs: consistently spell URL in allcaps
1854- docs: consistently spell OK in allcaps
1855- docs: consistently use CMake
1856- docs: consistently spell API all-caps
1857- docs: consistently spell VirGL in camel-case
1858- docs: consistently spell Haswell in camel-case
1859- docs: use consistent casing for dEQP
1860- docs: ip -> IP
1861- docs: irc -> IRC
1862- docs: xorg -> X.Org
1863- docs: pci -> PCI
1864- docs: cpu -> CPU
1865- docs: gpu -> GPU
1866- docs: rgb -> RGB
1867- docs: perfetto -> Perfetto
1868- docs: lavapipe -> Lavapipe
1869- docs: freedreno -> Freedreno
1870- docs: adreno -> Adreno
1871- docs: radeon -> Radeon
1872- docs: nouveau -> Nouveau
1873- docs: llvmpipe -> LLVMpipe
1874- docs: vulkan -> Vulkan
1875- docs: lima -> Lima
1876- docs: ppir -> PPIR
1877- docs: autotools -> Autotools
1878- docs: ArchLinux -> Arch Linux
1879- docs: meson -> Meson
1880- docs: Gitlab -> GitLab
1881- docs: xyzw -> XYZW
1882- docs: hw -> HW
1883- docs: lava -> LAVA
1884- docs: skqp -> SkQP
1885- docs: servo -> Servo
1886- docs: Linux fbdev -> Linux Framebuffer
1887- docs: virtualisation -> virtualization
1888- docs: dependant -> dependent
1889- docs: healthcheck -> health check
1890- docs: vmware -> VMware
1891- docs: spell out "distribution"
1892- docs: spell out "transform and lighting"
1893- docs: de-duplicated -> deduplicated
1894- docs: de-nominate -> denominate
1895- docs: freedreno -> Freedreno
1896- docs: llvmpipe -> LLVMpipe
1897- docs: softpipe -> Softpipe
1898- docs: panfrost -> Panfrost
1899- docs: piglit -> Piglit
1900- docs: ini -> INI
1901- docs: swr -> OpenSWR
1902- docs: yml -> YAML
1903- docs: xml -> XML
1904- docs: Adreno aXXX -> Adreno XXX
1905- docs: remove non-existent directory
1906- docs: do not mention classic swrast
1907- docs: name correct driver
1908- docs: clean up labels
1909- docs: add missing link to mesa3d.org
1910- docs: add links to usenet groups
1911- docs: link to replacement forum
1912- docs: Android.mk -> ndk-build
1913- docs: gitlab -> GitLab
1914- docs: nVidia -> NVIDIA
1915- docs: spell ATI codenames in allcaps
1916- docs: nfs -> NFS
1917- docs: fixup rebase mistake
1918- zink: correct depth-bias enable condition
1919- zink: only set line-width if drawing lines
1920- zink: consider polygon-mode for rast_prim
1921- zink: do not read is_generated unless in tcs shader
1922- zink: put union fields into structs named by the shader-stages
1923- docs/zink: fix and cleanup rst syntax
1924- docs: fixup broken link syntax
1925- docs: do not mention EGL_MESA_drm_display
1926- docs: remove stale envvar-reference
1927- zink: fix json-errors in profile-file
1928- zink: remove needless requirements
1929- zink: fix incorrect requirements
1930- zink: update textureCompressionBC requirement
1931- docs/zink: add missing required device-feature
1932- mesa: treat unsupported queries as dummies
1933- d3d12: fix max-array-layers
1934
1935Fabian Vogt (1):
1936
1937- gallivm: Fix LLVM optimization with the new pass manager
1938
1939Feng Jiang (5):
1940
1941- virgl/vtest: fix memory overwrite problem in virgl_vtest_send_get_caps()
1942- gallium: add key size to the structure pipe_picture_desc
1943- vl: change vl_video_buffer_destroy() to non-static
1944- gallium/util: add a new helper function pipe_buffer_size()
1945- virgl: add support for hardware video acceleration
1946
1947Filip Gawin (5):
1948
1949- r300: add list of deqp gles2 r400 failures
1950- r300: add khr r400 failures
1951- gallium: avoid using float based conditions in loops
1952- r300: don't use smooth line if not requested
1953- r300: update r400 tests
1954
1955Francisco Jerez (4):
1956
1957- intel/fs: Fix horiz_offset() to handle FIXED_GRFs with non-trivial 2D regions.
1958- nir/lower_int64: Enable lowering of 64-bit float to 64-bit integer conversions.
1959- nir/lower_int64: Implement lowering of 64-bit integer to 64-bit float conversions.
1960- nir/lower_int64: Fix float16 to int64 conversions.
1961
1962Frank Binns (13):
1963
1964- pvr: get free list min size from the runtime info
1965- pvr: reset all command buffer state in pvr_cmd_buffer_reset()
1966- pvr: Implement vkResetCommandBuffer
1967- pvr: remove redundant TODO
1968- pvr: remove image pointer from image view struct
1969- pvr: set samplerAnisotropy to false
1970- pvr: cube map tex state packing fixes
1971- pvr: don't setup attachment tex state for images with input attachment bit unset
1972- pvr: finish render job sample count setup
1973- pvr: add required pixel formats
1974- pvr: remove implicit sync support
1975- pvr: setup buffer and image format feature bits
1976- CODEOWNERS: remove rajnesh-kanwal as an Imagination maintainer
1977
1978Friedrich Vock (28):
1979
1980- amd/common: move ac_memory_ops_per_clock into ac_gpu_info.h
1981- amd/common: Remove redundant code for determining memory ops per clock
1982- radv/winsys: Return VK_ERROR_INVALID_EXTERNAL_HANDLE if buffer imports fail
1983- radv: Generalize instance count in BVH states to leaf node count
1984- radv: Add metadata to acceleration structures
1985- radv: Add Radeon Raytracing Analyzer trace dumping utilities
1986- radv: Add Radeon Raytracing Analyzer capturing layer
1987- radv: Enable Radeon Raytracing Analyzer traces
1988- docs: Add documentation about RADV RRA tracing
1989- radv: Use radv_CmdUpdateBuffer instead of radv_update_buffer_cp for geometry info writes
1990- radv: Free geometry infos in radv_CmdBuildAccelerationStructuresKHR
1991- radv: Correct accel struct header size
1992- radv: Rename internal node shader to lbvh_internal
1993- radv: Split CmdBuildAccelerationStructuresKHR into different functions
1994- radv: Add BVH IR types
1995- radv: Add conversion shader for internal nodes
1996- radv: Add conversion shader for leaf nodes
1997- radv: Create pipelines for conversion meta shaders
1998- radv: Build acceleration structures using BVH IR
1999- radv: Rename emulated float helpers
2000- radv: Use a struct for AABBs
2001- radv: Add radv_indirect_unaligned_dispatch
2002- radv/rt: Fix internal converter synchronization
2003- radv/rt: Dispatch internal converter indirectly
2004- radv/rt: Track number of inactive leaf nodes
2005- radv: Add global sync utilities
2006- radv: Add REF as a typename macro to .clang-format
2007- radv: Use spirv1.5 instead of vulkan1.2
2008
2009Georg Lehmann (40):
2010
2011- aco: Check that we don't override exec_val operands during branching sequence optimization.
2012- aco/assembler: Fix v_cmpx with SDWA.
2013- aco: Fix optimizing branching sequence with s_and_saveexec.
2014- aco/assembler: Fix v_cmpx pre GFX10.
2015- aco: Use v_cmpx pre GFX10.
2016- aco: Force tex operand to have the correct sub dword size before packing.
2017- nir: Add nir_ssa_scalar_is_undef.
2018- nir/fold_16bit_tex_image: Add an option to fold image sources.
2019- aco: Implement storage image A16.
2020- aco: Combine 16bit undef and constants instead of using s_pack.
2021- radv: Fold 16bit image sources.
2022- nir: Print selection control for nir_if.
2023- nir/opt_algebraic: Optimize check for single bit.
2024- nir/opt_algebraic: Optimize d3d9 pow with fmulz.
2025- aco: Fix image instructions with lod when 2d_view_of_3d is enabled on GFX9.
2026- ac/llvm: Fix image instructions with lod for 2d on GFX9.
2027- aco: Use plain VOPC for vcmpx when possible.
2028- nir/opt_algebraic: Optimize unpacking of upcasts to 64bit integers.
2029- radv: Enable VK_EXT_load_store_op_none.
2030- aco: Use v_fmaak/v_fmamk if two operands are the same literal.
2031- aco: Unswizzle v_pk_fma_f16 literals to produce more v_pk_fmac_f16.
2032- radv: Fix GLSL BDA struct alignment and use pointer arithmetic SIZEOF.
2033- nir/opt_algebraic: Optimize more (a cmp b ? a : b) to min/max.
2034- aco: Use s_pack_ll for s_bfe operand on GFX9+.
2035- nir: Print nir_selection_control_divergent_always_taken.
2036- nir/opt_algebraic: Mirror optimizations for find_msb_rev.
2037- aco: Implement [ui]find_msb_rev.
2038- ac/llvm: Implement [ui]find_msb_rev.
2039- nir/opt_algebraic: Add an option to lower uclz.
2040- radv,aco: Lower uclz in NIR.
2041- nir/opt_algebraic: Optimize various find_msb_rev patterns.
2042- nir: Fix ifind_msb_rev constant folding.
2043- aco: fmaak/fmamk can't use SDWA.
2044- aco: Don't use opsel for p_insert.
2045- ac/llvm: Implement signed idot on GFX11.
2046- aco: Implement signed idot instructions on GFX11.
2047- radv: Use available dot product instructions on gfx11.
2048- radeonsi: Use available dot product instructions on gfx11.
2049- aco: Use opsel for the third operand.
2050- aco: Use s_pack_ll_b32_b16 for scalar zero extend.
2051
2052Gert Wollny (95):
2053
2054- radeonsi-raven/ci: Move spec@arb_timer_query@timestamp-get to flakes
2055- llvmpipe: Double number of SSBOs
2056- r600/sfn: Schedule shift instruction on R600 in t-slot
2057- r600/sfn: Add GS thread fix just like the TGSI code path
2058- r600/sfn: Sort FS inputs to make interpolated values come first
2059- r600/sfn: Fix color outputs when color0 writes all
2060- r600/sfn: Initialize out buffer when printing op
2061- r600: Force NOPs when loading AR on R600 class hardware
2062- r600/sfn: Handle R600 scratch read
2063- r600: Don't use SB with R600 style scratch reads
2064- r600: Fix SCRATCH OP de-assembly
2065- nir_lower_atomics_to_ssbo: Initialize deref struct
2066- r600/sfn: Handle color0 writes all on R700 like on EG
2067- r600/sfn: Don't tag mem-ring and stream instructions as exports
2068- r600/sfn: Don't schedule GDS instructions early
2069- r600/sfn: Don't scan the whole block for ready instructions
2070- r600/sfn: Use a heuristic to keep SSBO setup and store close
2071- r600: Fix reporting TGSI IR support
2072- r600/sfn: Use a low number for unused target register
2073- virgl: Fix buffer overflow warning:
2074- virgl: Fix ubsan warnings:
2075- virgl: when reading back wait first, then do the transfer
2076- virgl/ci: remove some flakes
2077- r600/sfn: override register ID when it doesn't matter
2078- r600/sfn: ACK all image stores, also at end of loop
2079- r600/sfn: more peephole optimization with conditionals
2080- r600/sfn: Override VPM if access in helpers is requested
2081- r600/sfn: Don't scalarize fdd instructions
2082- virgl: Add some formats that the CTS uses
2083- mesa/glsl: Add support for NV_shader_noperspective_interpolation
2084- r600: set nir option lower_cs_local_index_to_id
2085- virgl: Report CONSTANT_BUFFER_SIZE according to GL_MAX_UNIFORM_BLOCK_SIZE
2086- r600/sfn: Lower tex,txl,txb and txf to backend
2087- r600/sfn: lower txd to backend in nir
2088- r600/sfn: lower txf_ms in nir
2089- r600/sfn: copy-propagate single source texture values
2090- r600/sfn: print tex prepare instructions
2091- r600/sfn: Don't allocate un-used components in texture ops
2092- r600/sfn: Add an easy access to get an instruction as ALU
2093- r600/sfn: Don't assert when setting one value
2094- r600/sfn: Copy propagate into TEX source
2095- r600/sfn: drop some unused code
2096- r600/sfn: copy propagate register load chains
2097- r600/sfn: VS inputs are effectively SSA
2098- r600/sfn: Add a free-channel mask when testing whether a register can switch channel
2099- r600/sfn: fix some channel pinning
2100- r600/sfn: Handle shifts on Cayman
2101- r600/sfn: only use 3 channels on Cayman for trans ops
2102- r600/sfn: Make sure texture lowering is done in the right order
2103- r600/sfn: lower tg4 to backend in NIR
2104- r600: Add slot to ALU disassambly
2105- r600: Make NIR the default shader IR
2106- nir_lower_to_source_mods: Don't sneek in an abs modifier from parent
2107- r600/sfn: Only run 64 bit ops lowering passes when really needed
2108- r600/sfn: Always start a new CF after a KILL instruction
2109- r600/sfn: don't propagate registers into conditional test
2110- r600/sfn: Add peephole optimization for kill instructions
2111- r600/sfn: Delete final lowered nir shader early
2112- r600/sfn: Fix typo
2113- r600/sfn: Use the correct allocator for loop lists
2114- r600/sfn:explicitly initialize the memory pool
2115- r600/sfn: assert on use of abs modifier in op3
2116- r600/sfn: Unroll loops after doing some optimizations
2117- r600/sfn: Always enforce LDS operation order
2118- r600/sfn: Make sure all components are usable when lowering TF inputs
2119- r600/sfn: run cleanup passes after late algebraic opt
2120- nir: move fusing csel and comparisons to opt_late_algebraic
2121- r600/sfn: Unify the handling of resource IDs in instruction
2122- r600/sfn: Handle nir_op_seq and nir_op_sne
2123- r600: Fix printing t-channel in diss-assambly
2124- r600/sfn: Add test for channel changes in TEX source from opt
2125- r600/sfn: Allow copy-prop of group dest into origin
2126- r600/sfn: Trigger TEX CF based on max TEX CF size
2127- r600/sfn: improve scheduling of tex sources
2128- r600/sfn: Increase scheduling priority on uniform reads and non-ssa writes
2129- r600/sfn: elimiate dead registers too
2130- r600: Account for color and clipvertex when evaluating LDS space
2131- r600/sfn: evaluate LDS location for color and clip-vertex too
2132- r600/sfn: Add .clang-format file and apply style
2133- r600: declare for counter locally and fix signed/unsigned warning
2134- r600: Only count ALU registers that are not clause local
2135- r600: Print MOVA_INT dest on Cayman
2136- r600/sfn: Handle load_workgroup_size
2137- r600/sfn: lower uniforms to UBOs
2138- r600/sfn: remove load_uniform handling
2139- etnaviv: Create MSAA surfaces with PE compatible tiling
2140- nir/algeraic_opt: use double options too for lowering ftrunc\@64
2141- r600/sfn: Fix source modifiers for ffract64
2142- r600/sfn: Fix f2u32 and remove backend lowring of f2u64 and f2i64
2143- r600/sfn: Honor shader key w.r.t. atomic counter layout
2144- r600/sfn: Fix location for reading cube array image dimensions
2145- r600/sfn: sort FS color outputs before all other outputs
2146- virgl: don't a use staging when a resources created with the shared flag
2147- virgl: Fix injection of double from const mov instruction
2148- r600/sfn: always use four slots for Cayman trans ops
2149
2150Giancarlo Devich (4):
2151
2152- mesa: Fix stack corruption for PIPE_QUERY_TIMESTAMP
2153- d3d12: Add cache managers for separate buffer usage cases
2154- gallium/pipe: Align allocation size in \`pb_cache_manager_create_buffer`
2155- d3d12: Don't align already-aligned size in \`d3d12_bufmgr_create_buffer`
2156
2157Glenn Kennard (1):
2158
2159- nv30: Fix non-scissored clears after a scissor has been set
2160
2161Guilherme Gallo (29):
2162
2163- ci/lava: Customise sections timeouts via envvars
2164- ci/lava: Add timeout header info for LAVA GL Sections
2165- ci/bin: Add utility to find jobs dependencies
2166- ci/bin: Sort imports from ci_run_n_monitor.py
2167- ci/bin: Make ci_run_n_monitor finds dependencies automatically
2168- ci/bin: Add script to expand jobs manifest
2169- ci/bin: Cache GQL queries
2170- ci/bin: Print job needs DAG in ci_run_n_monitor
2171- ci/skqp: Add an option to run all tests
2172- ci/skqp: Show reports on crashes
2173- ci/skqp: Fix paths in skqp-runner
2174- ci/skqp: Fix Nima-Cpp fetching error
2175- ci/skqp: Build list_gpu_unit_tests and list_gms
2176- ci/skqp: Add support for commenting tests files
2177- ci/skqp: Use SKQP_BIN_DIR instead of hardcoded /skqp dir
2178- ci/skqp: Put generated tests files in artifacts
2179- ci/skqp: Supress irrelevant shellcheck warnings
2180- ci/skqp: Remove .baremetal-skqp-test in favor of .skqp-test
2181- ci/skqp: Add gitlab sections for uncluttering
2182- ci/freedreno: skqp: run with new tests files
2183- ci/radeonsi: skqp: Add fail test files for raven
2184- ci/radeonsi: Add zork jobs and rules
2185- ci/bin: Fix requirements.txt
2186- ci/bin: Remove whitespace from token files
2187- ci: Fix kernel+rootfs.* jobs
2188- ci: Update piglit with s3 support
2189- ci: Update piglit-traces tests expectations
2190- ci: Update ci-fairy in CI, rootfs and containers
2191- freedreno/ci: Skip civilization-v/CivilizationV-trim trace
2192
2193Hans-Kristian Arntzen (3):
2194
2195- vulkan: Update to 1.3.228 headers.
2196- radv: Implement VK_EXT_mutable_descriptor_type.
2197- vk/runtime: Fix narrowing of timeline signal and wait value to u32.
2198
2199Iago Toral Quiroga (87):
2200
2201- broadcom/compiler: don't use imprecise_32bit_lowering for idiv lowering
2202- broadcom/compiler: use nir_opt_idiv_const
2203- broadcom/compiler: simplify code emitted for centroid coordinates
2204- nir/lower_alu: drop unnecessary iand on uadd_carry result
2205- broadcom/compiler: track number of TMU operations in prog data
2206- v3dv: move check_needs_load/store helpers to unversioned code
2207- v3dv: track if a job is compatible with double-buffer mode
2208- v3dv: fix tile state allocation
2209- v3dv: add a v3dv_job_allocate_tile_state helper
2210- v3dv: postpone tile state allocation for render pass jobs
2211- v3dv: tell job_compute_frame_tiling whether we want to use double-buffer mode
2212- v3dv: add a heuristic for double-buffer mode
2213- v3dv: vkCmdClearAttachments no longer generates its own RCL
2214- v3d,v3dv: lower texel buffer aligment requirements
2215- v3dv: implement VK_EXT_texel_buffer_alignment
2216- v3dv: add a helper to destoy pipeline layouts
2217- v3dv: ref/unref pipeline layout objects
2218- v3dv: set maxBufferSize property
2219- v3dv: add a v3dv_image_init helper
2220- v3dv: add a get_image_memory_requirements helper
2221- v3dv: implement vkGetDeviceImageMemoryRequirementsKHR
2222- v3dv: add a get_buffer_memory_requirements helper
2223- v3dv: add a buffer_init helper
2224- v3dv: implement vkGetDeviceBufferMemoryRequirementsKHR
2225- v3dv: implement vkGetDeviceImageSparseMemoryRequirementsKHR
2226- v3dv: expose VK_KHR_maintenance4
2227- nir/lower_variable_initializers: implement non-scoped barrier path
2228- v3dv: implement VK_KHR_zero_initialize_workgroup_memory
2229- v3dv: also check GS stage on image access for binning barriers
2230- v3dv: implement vkCmdPipelineBarrier2
2231- v3dv: port remaining entry points to KHR_synchronization2
2232- v3dv: use VkPipelineStageFlagBits2 when checking semaphore wait stage
2233- v3dv: expose VK_KHR_synchronization2
2234- v3dv: implement VK_KHR_workgroup_memory_explicit_layout
2235- v3dv: switch to common code for command buffer lifecycles
2236- v3dv: expose VK_EXT_tooling_info
2237- v3dv: fix swap_rb and channel_reverse flags in image views
2238- v3dv: expose VK_EXT_border_color_swizzle
2239- v3dv: ref pipeline layout earlier during pipeline init
2240- v3dv: implement VK_EXT_depth_clip_control
2241- v3dv: expose VK_EXT_attachment_feeback_loop_layout
2242- v3dv: fix variable type
2243- v3dv: limit heap size to 4GB
2244- broadcom/simulator: add a helper to get the amount of free heap memory
2245- v3dv: implement VK_EXT_memory_budget
2246- v3dv: expose VK_EXT_primitive_topology_list_restart
2247- v3dv: don't load an attachment for unaligned render area if we are not storing
2248- v3dv: expose VK_EXT_load_store_op_none
2249- v3dv: don't return incompatible driver if GPU is not present
2250- v3dv: fix program id for binning shaders
2251- broadcom/compiler: detect unifa write from signal
2252- broadcom/compiler: check signal writes to magic regs when updating scoreboard
2253- v3dv: optimize ldunif load into unifa write
2254- v3dv: fix VK_EXT_texel_buffer_alignment
2255- broadcom/compiler: increase V3D_MAX_BUFFE_RANGE to 2^30 bytes
2256- broadcom/compiler: fix robust buffer access
2257- broadcom/compiler: handle shared stores with robust buffer access
2258- broadcom/compiler: don't apply robust buffer access to shared variables
2259- broadcom/compiler: rename v3d_nir_lower_robust_buffer_access.c
2260- broadcom/compiler: rename static helpers involved with robust buffer access
2261- broadcom/compiler: add a lowering for robust image access
2262- v3dv: expose VK_EXT_image_robustness
2263- v3dv: use NIR_PASS with v3d_nir_lower_robust_image_access
2264- v3dv: use enabled features from vk_device
2265- broadcom/compiler: trivial code clean-up
2266- vulkan/runtime: include robustness info when hashing a shader stage
2267- v3dv: implement VK_EXT_pipeline_robustness
2268- v3dv: drop layout refs for all allocated sets from a pool on destroy / reset
2269- broadcom/compiler: handle vec2 load/store index
2270- v3dv: refactor events
2271- v3dv: re-enable sync_fd import/export
2272- v3dv: return out of host memory if we fail to create event pipelines
2273- v3dv: split event implementation to a separate file
2274- v3dv: remove unnecessary check for NULL
2275- v3dv: do a better job at cleaning up the device on init failure
2276- v3dv: fix incorrect return type
2277- v3dv: handle allocation failure during pipeline initialization
2278- v3dv: do better cleanup on failure during pipeline cache operation
2279- v3dv: increase limit for active event objects
2280- v3dv: always check VK_ACCESS_2_MEMORY_READ_BIT for read accesses
2281- v3dv: make the helper to emit pipeline barriers public to other files
2282- v3dv: fix event synchronization
2283- v3dv: vkCmdWaitEvents2 takes an array of VkDependencyInfo
2284- v3dv: use vk_alloc instead of malloc
2285- broadcom/compiler: avoid using ldvary sequence to hide latency of branching
2286- v3dv: fix debug dump on BO free
2287- v3dv: ignore imported BOs when tracking BO memory usage
2288
2289Ian Romanick (8):
2290
2291- nir: spirv: Allow 32-bit version of nir_intrinsic_is_sparse_texels_resident
2292- radeonsi: r600: d3d12: st: Use NIR lowering for tg4 offset arrays instead of GLSL lowering
2293- glsl: Remove lower_offset_arrays pass
2294- nir/comparison_pre: See through an inot to apply the optimization
2295- intel/fs: Fix constant propagation into 32x16 integer multiplication
2296- nir/range_analysis: Set higher default maximum for max_workgroup_count
2297- nir/loop_analyze: Fix get_iteration for nir_op_ine
2298- nir/loop_analyze: Fix get_iteration for nir_op_fneu
2299
2300Igor Torrente (3):
2301
2302- vulkan: Add vk_clock_gettime and vk_time_max_deviation
2303- radv: anv: Use the new vk_clock_gettime and vk_time_max_deviation functions
2304- venus: Fix dEQP-VK.pipeline.timestamp.calibrated.host_domain_test failure
2305
2306Ikshwaku Chauhan (1):
2307
2308- Revert "radeon: add EFC support to only VCN2.0 devices"
2309
2310Illia Abernikhin (2):
2311
2312- utils: Move functions from debug.* to u_debug.*
2313- utils: Merge util/debug.* into util/u_debug.* and remove util/debug.*
2314
2315Illia Polishchuk (2):
2316
2317- mesa: skip extra state updates for clear calls
2318- driconf/Intel: Add lower_depth_range_rate option workaround for Homerun Clash misrendering issue
2319
2320Isaac Bosompem (1):
2321
2322- virgl: Set use_staging in resource_from_handle
2323
2324Italo Nicola (6):
2325
2326- etnaviv: skip disk cache initialization on standalone compiler
2327- etnaviv: fix standalone compiler to work without a tgsi backend
2328- clc: add 32-bit target
2329- rusticl: use 32-bit address format for 32-bit devices
2330- rusticl: correctly check global argument size
2331- rusticl: fix MemConstant invalid arg size check
2332
2333Iván Briano (13):
2334
2335- anv: emit scissors when the pipeline changes
2336- anv: pipelineStageCreationFeedbackCount is allowed to be 0
2337- anv: populate rt shader groups if they were found in the cache
2338- anv: Set meshShaderQueries for mesh shader ext features
2339- intel/utrace: create the callback events for xfb trace points
2340- anv/grl: Add a GRL file parser
2341- anv: set READ/WRITE_WITHOUT_FORMAT for buffer views
2342- vulkan_hasvk: set READ/WRITE_WITHOUT_FORMAT for buffer views
2343- anv: compile_upload_rt_shader expects a valid pointer
2344- anv: use DX rules for point rasterization
2345- hasvk: use DX rules for point rasterization
2346- anv: support VK_PIPELINE_CREATE_RAY_TRACING_SKIP_*
2347- anv: enable rayTraversalPrimitiveCulling feature
2348
2349James Park (2):
2350
2351- vulkan: Augment _WIN32 stub comparison
2352- meson,amd: Remove Windows libelf wrap
2353
2354James Zhu (4):
2355
2356- amd/common: some ASICs with gfx9 use compute rings for render
2357- util/format: add util format y8_400_unorm
2358- frontends/va: add support for yuv400 and yuv444
2359- radeonsi/vcn: enable jpeg decode of yuv444 and yuv400
2360
2361Jami Kettunen (1):
2362
2363- freedreno/ir3: Switch to NIR for a5xx's vertex id lowering.
2364
2365Jan Beich (1):
2366
2367- util: unify FreeBSD futex_wait signature with Linux/OpenBSD/Windows
2368
2369Faith Ekstrand (115):
2370
2371- nir: Clean up and improve nir_dedup_inline_samplers
2372- intel/fs_reg_allocate: Improve compressed instruction self-interference
2373- intel/rt: Handle halts in any-hit shaders properly
2374- intel/rt: Handle multiple exits in lower_shader_returns
2375- genxml: Add BVH data structures
2376- anv: Don't require 32-bit addresses for scratch on Gen12.5+
2377- vulkan,anv,dozen: Use VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT
2378- radv: Use both aspects for depth/stencil blit destinations
2379- radv: Use VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT for render loops
2380- radv/meta: Respect render area for MSAA resolves
2381- radv/meta: Fix a stack use-after-free with sample locations
2382- radv/meta: Drop subpass_att from radv_fast_clear_color
2383- radv/meta: Use Begin/EndRendering for resolves
2384- radv/meta: s/dest/dst/ in a few functions
2385- radv/meta: Use Begin/EndRendering for clears
2386- radv: Drop set/restore_subpass
2387- radv/meta: Drop redundant depth_view_can_fast_clear
2388- radv: Add a image_transition helper for separate depth/stencil layouts
2389- radv: Don't assume layout == stencilLayout for stencil-only
2390- util,intel: Pull the bit packing helpers from genxml to a common header
2391- broadcom: Use util/bitpack_helpers.h
2392- util: Add util_bitpack_[su]fixed_clamp helpers
2393- asahi: Use util/bitpack_helpers.h
2394- panfrost: Use util/bitpack_helpers.h
2395- anv: Use nir_intrinsic_load_layer_id for ViewId in fragment shaders
2396- anv: Decouple primitive replication check from the fragment shader
2397- anv: Stop looking at the pipeline in multiview lowering
2398- nir: Track per-view outputs in shader_info
2399- intel/compiler: Store the number of position slots in the VUE map
2400- anv: Drop anv_pipeline::use_primitive_replication
2401- pvr: Only destroy the physcial device if it exists
2402- vulkan/cmd_buffer: add record_state to the common command buffer.
2403- vulkan/cmd_queue: Drop vk_cmd_queue::error
2404- vulkan: Add a concept of recycling an object
2405- vulkan: Re-order arguments to vk_command_pool_init
2406- vulkan: Re-order arguments to vk_command_buffer_init
2407- vulkan: Add a vk_command_buffer_ops struct
2408- vulkan: Add a common vkResetCommandBuffer() implementation
2409- vulkan: Add a common vkAllocateCommandBuffers() implementation
2410- vulkan: Recycle command buffers in vk_command_pool
2411- radv: Move to the common command pool framework
2412- tu: Move to the common command pool framework
2413- lvp: Move to the common command pool framework
2414- vulkan,docs: Add documentation Vulkan command pools
2415- vulkan: Dirty VP_VIEWPORTS/SCISSORS when copying viewports/scissors
2416- radv: Set the window scissor to the render area, not framebuffer
2417- radv: Only copy the render area from VRS to HTILE
2418- radv: Leave image layouts alone when doing HW MSAA resolves
2419- radv: Switch to dynamic rendering only
2420- rusticl: Call nir_scale_fdiv
2421- vulkan/runtime: Add a comon vk_descriptor_update_template
2422- anv: Switch to the common descriptor update template struct
2423- vulkan/runtime: Compact descriptor update templates
2424- panvk: Fix buffer views
2425- spirv: Don't use libclc for wait_group_events
2426- nir: Add a helper for finding a function by name
2427- nir/load_libclc: Don't add generic variants that already exist
2428- vulkan: Record more enabled robustness features
2429- vulkan: Add a helper for gathering pipeline robustness
2430- iris: Use a larger alignment for buffer allocations
2431- iris: Fix more BO alignments
2432- iris: Handle resource offsets in buffer copies
2433- iris: Stop looking at textures_used for samplers
2434- iris: Split max #defines for textures/samplers/images
2435- iris: Support up to 64 images
2436- iris: Support up to 128 textures
2437- intel/fs: Always use integer types for indirect MOVs
2438- intel/fs: SEL_EXEC uses the integer pipe for 64-bit stuff
2439- intel/mi_builder: add a way to reserve a register
2440- intel/mi_builder: Add a helper for incrementing reference counts
2441- anv/formats: Advertise ACCELERATION_STRUCTURE_VERTEX_BUFFER_BIT
2442- anv: Add an anv_address_map helper
2443- anv: Implement VK_KHR_pipeline_library
2444- anv: Set up the memory-backed FIFO buffer
2445- anv: Add extern "C" guards
2446- anv: Add support for compiling OpenCL-style kernels
2447- anv: Add support for OpenCL-style kernel dispatch
2448- anv: Import GRL
2449- anv/grl: Build OpenCL kernels
2450- anv/grl: Add a helper for dispatching our pre-built kernels
2451- intel/grl: Parse GRL files and generate C
2452- anv: Build BVHs on the GPU with GRL
2453- anv: Advertise ray-tracing on DG2
2454- intel/devinfo: DG2 supports ray-tracing
2455- vulkan/wsi: Support configuring swapchain images as part of swapchain init
2456- vulkan/wsi/x11: Configure images via params passed to wsi_swapchain_init()
2457- vulkan/wsi/wayland: Configure images via params passed to wsi_swapchain_init()
2458- vulkan/wsi/display: Configure images via params passed to wsi_swapchain_init()
2459- vulkan/wsi/win32: Configure images via params passed to wsi_swapchain_init()
2460- vulkan/wsi: Always configure images in wsi_swapchain_init()
2461- vulkan/wsi: Don't expose as many image configure helpers
2462- vulkan/wsi: Add a typedef for memory type select callbacks
2463- vulkan/wsi: Add a supports_scanout flag
2464- vulkan: Use altlen when available for array lengths
2465- vulkan: Rename viewport_state::negative_one_to_one
2466- vulkan: Add a dynamic state for vp.depth_clip_negative_one_to_one
2467- vulkan: Add more dynamic rasterizer state
2468- vulkan: Add dynamic state for tessellation domain origin
2469- vulkan: Add more dynamic multisample states
2470- vulkan: Add more dynamic color blend states
2471- vulkan,lavapipe: Use a tri-state enum for depth clip enable
2472- iris: Set SamplerCount in shader packets
2473- nir: Reformat a comment
2474- nir: Use nir_shader_instructions_pass in nir_lower_readonly_images_to_tex
2475- nir: Misc. style fixes to nir_lower_readonly_images_to_tex
2476- nir: Unconditionally call nir_trim_vector in nir_lower_readonly_images_to_tex
2477- nir/types: Add some asserts to glsl_get_struct_field()
2478- nouveau/codegen: Support bindless texture queries
2479- intel/lower_mem_access_bit_sizes: Compute alignments automatically
2480- intel: Don't cross DWORD boundaries with byte scratch load/store
2481- vulkan: Handle VK_SUBPASS_EXTERNAL at the end of a subpass
2482- vulkan: Unconditionally add barriers for missing external subpass deps
2483- vulkan: Add state for extraPrimitiveOverestimationSize
2484- vulkan: Add a common implementation of CmdSetSampleLocationsEnableEXT
2485- vulkan: Add a dummy vk_common_CmdSetColorBlendAdvancedEXT()
2486
2487Jeremy Rand (1):
2488
2489- llvmpipe: bump LP_MAX_THREADS to 32
2490
2491Jesse Natalie (82):
2492
2493- ci/windows: Re-enable Windows runners
2494- dzn: Disable extensions that require multi-view
2495- dzn: Use D3D12 constants instead of naked numbers for device limits
2496- ci/windows: Disable Windows builds while the runner is offline again
2497- ci/windows: Windows runner is back online
2498- ci/windows: Install Graphics-Tools package on the base image
2499- egl/wgl: Don't require a native display to support window surfaces
2500- egl/wgl: Implement QuerySurface
2501- wgl: Split pbuffer creation into two functions
2502- egl/wgl: Implement PBuffer surfaces
2503- egl/wgl: Set YInvert attribute to always true
2504- egl/wgl: Implement bind/release texture buffer
2505- wgl: Allow per-framebuffer swap interval overrides
2506- egl/wgl: Support eglSwapInterval
2507- egl/wgl: Support eglWaitClient
2508- egl/wgl: Support eglWaitNative
2509- egl,wgl: Support eglCreateImageKHR
2510- wgl: Pass smapi explicitly to context creation
2511- wgl: Pass smapi explicitly to framebuffer creation
2512- egl/wgl: Use a per-display st_manager
2513- egl/wgl: Hook up image validate/get in smapi
2514- egl/wgl: Support EGL_MESA_query_driver
2515- egl/wgl: Support sync objects
2516- wgl: Use pfi instead of iPixelFormat more often
2517- egl/wgl: Delete unused variables/code
2518- egl/wgl: Fix some awkward sizeof formatting
2519- meson: Don't include glsl compiler if flex/bison aren't found
2520- microsoft/compiler: Discard shouldn't be marked readnone
2521- dxil_nir_lower_int_cubemaps: When not lowering samplers, don't touch sampler types
2522- microsoft/compiler: Fix PSV struct when numthreads is 0
2523- microsoft/clc: Handle a null clc logger
2524- gallium/windows: Delete OpenGLOn12.dll target
2525- meson: Add an option to specify the WGL gallium megadriver filename
2526- mesa: Expose GL_NV_ES1_1_compatibility
2527- microsoft/compiler: Always emit a shader at the max-supported shader model
2528- microsoft/compiler: Support up to shader model 6.5
2529- microsoft/compiler: Add struct and function defs for SM6.6 handle funcs
2530- microsoft/compiler: Add dynamic create handle helper
2531- microsoft/compiler: Add getters for res bind/props structs
2532- microsoft/compiler: Delete double-assignment of sampler metadata field
2533- microsoft/compiler: Pass lower_bound, upper_bound, space to createhandle
2534- microsoft/compiler: Handle SM6.6 handles
2535- microsoft/compiler: SM6.6 is supported
2536- microsoft/compiler: Support SM6.7
2537- d3d12: Get max supported shader model
2538- dzn: Get max supported shader model
2539- u_atomic: Add a helper for pointer compare-exchange
2540- d3d12: Change displayable format logic
2541- ci/windows: Update build container image with all deps so wraps can be disabled
2542- deps: Update DirectX-Headers req to 1.606.4
2543- d3d12: Move some things from screen late-init to early-init
2544- d3d12: Store the d3d12 module reference on the screen
2545- d3d12: Support device factories in addition to global device creation
2546- d3d12: Attempt to use the Agility SDK
2547- d3d12: Use ID3D12DeviceConfiguration for root signature serialization
2548- d3d12: Release sharing contract in flush_frontbuffer
2549- dzn: Store the d3d12 module reference on the instance
2550- dzn: Support device factories in addition to global device creation
2551- dzn: Use architecture props to return correct device type
2552- dzn: Attempt to use the Agility SDK
2553- dzn: Use ID3D12DeviceConfiguration for root signature serialization
2554- ci/windows: Update vk-gl-cts to pick up new zlib dependency
2555- d3d12: Don't put permanently-resident resources in the residency bo list
2556- d3d12: Don't multiply cube array sizes by 6
2557- d3d12: Set PIPE_CAP_MAX_TEXTURE_UPLOAD_MEMORY_BUDGET
2558- dzn: Set factory flags before creating device
2559- d3d12: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT
2560- include: Add WGL version of GL interop extensions
2561- mesa, dri2: Move interop helpers from dri2 to state tracker
2562- include: Add a flush interop method
2563- mesa: Make fence_sync not static
2564- mesa/st: Split interop object lookup from export
2565- mesa/st: Implement the new flush method
2566- dri: Implement the new flush method
2567- glx: Implement the new flush method
2568- egl/dri2: Implement the new flush method
2569- wgl: Implement interop methods
2570- egl/wgl: Implement interop methods
2571- include: Add driver data to v2 of mesa_glinterop_device_info
2572- gallium: Add screen methods for driver-private data on interop interface
2573- mesa/st: Query driver for interop data
2574- d3d12: Hook up driver-private interop data
2575
2576Jessica Clarke (1):
2577
2578- panfrost/blend: Fix invalid const values leading to NIR validation errors
2579
2580Joan Bruguera (1):
2581
2582- nv50/ir/nir: ignore sampler for TXF/TXQ ops.
2583
2584John Brooks (5):
2585
2586- radv: Add get_addrlib function to radv_radeon_winsys
2587- vulkan: Introduce vk_format_is_block_compressed function
2588- radv: Only apply mipmap view adjustments to block compressed images
2589- radv: Fix corrupted mipmap copies on GFX9+
2590- radv: Fix mipmap views on GFX10+
2591
2592Jonathan (3):
2593
2594- gallium/u_threaded: buffer subdata merging (v2)
2595- gallium/u_threaded: Prepare for buffer sharedness tracking
2596- gallium/u_threaded: buffer sharedness tracking
2597
2598Jonathan Gray (2):
2599
2600- iris: check i915 features after hw gen
2601- util: include sys/time.h for timespec functions
2602
2603Jordan Justen (15):
2604
2605- iris: Drop extra file-descriptor dup in iris_drm_screen_create()
2606- intel/pci_ids: Update ADL-S strings
2607- intel/pci_ids: Add 0x468b ADL-S PCI-id
2608- intel/pci_ids: Update ATS-M device names
2609- intel/pci_ids: Update DG2 device names
2610- intel/pci_ids: Add dg2 0x5698 pci-id
2611- intel/pci_ids: Drop non-upstream dg2 pci-ids
2612- intel/compiler: Use builder to allocate fs regs for gs control data bits
2613- intel/devinfo: Add MTL platforms enums and intel_device_info_is_mtl()
2614- intel/l3: Use L3 full-way allocation setting for gfx12.5 (DG2, MTL)
2615- anv/meson: Use anv_flags and anv_cpp_flags in genX compiles
2616- meson: Deprecate vulkan-rt-drivers intel
2617- meson: Define with_intel_vk_rt based on with_intel_clc
2618- intel/compiler: Broadcast lower code should check 64-bit int support
2619- ci/d3d12: Update quick_shader results with 24 fixes from !19128
2620
2621Jose Maria Casanova Crespo (4):
2622
2623- CI: Igalia farm is down
2624- Revert "CI: Igalia farm is down"
2625- vc4: consolidate shader-db output
2626- v3d: Minor fixes on sand8 blit based on sand30 modifications
2627
2628Joshua Ashton (2):
2629
2630- radv: Remove workarounds for D3D9 feedback loops
2631- freedreno: Disable 8bpp_ubwc on a6xx gen2
2632
2633José Fonseca (1):
2634
2635- util: Add macro util_get_cpu_caps in u_cpu_detect.c for avoid accidentally call it in u_cpu_detect.c
2636
2637José Roberto de Souza (34):
2638
2639- intel: Simply intel_gem_create_context_engines()
2640- anv: Remove duplicated memset() in physical device creation
2641- anv: Remove anv_app_info
2642- anv: Free vmas in case device creation fails in pthread_mutex_init(&device->mutex
2643- anv: Do not duplicate intel_device_info memory in each logical device
2644- anv: Only wait for queue sync if execbuf was properly executed
2645- anv: Do not copy garbage to batch_bo
2646- anv: Nuke anv_execbuf_init()
2647- anv: Group all context operations during device creation
2648- anv: Nuke dead code
2649- anv: Return earlier in anv_gem_get_tiling() when not supported
2650- intel/compiler/fs: Fix compilation of shaders with SHADER_OPCODE_SHUFFLE of float64 type
2651- intel/compiler/fs: Use DF to load constants when has_64bit_int is not supported
2652- vulkan_hasvk: Nuke dead code around I915_ENGINE_CLASS_COMPUTE
2653- anv: Nuke cmd_parser_version
2654- intel/dev: Adjust prefetch_size values for MTL engines
2655- intel: Share code to read render timestamp
2656- intel: Make engine related functions and types not i915 dependent
2657- intel/dev: Split i915 specific parts of intel_get_device_info_from_fd()
2658- anv: Move fetch of i915 physical device parameters
2659- anv: Split i915 specific parts of anv_queue_submit_simple_batch()
2660- anv: Use Vulkan types for priority as much as possible
2661- anv: Split the debug part of anv_queue_exec_locked()
2662- intel: Convert missing i915 engine types to intel
2663- intel: Convert i915 engine type to intel in tools/ common/ and ds/
2664- intel: Add and use intel_engines_class_to_string()
2665- hasvk: Nuke code around local memory
2666- hasvk: Fix build around intel_measure_state_changed() call
2667- iris: Fix enablement of protected contexts
2668- intel/perf: Use intel_device_info functions to compute subslice and eu totals
2669- iris: Nuke pci_id from iris_screen
2670- iris: Drop duplicated errno handling in iris_bo_wait()
2671- iris: Do not export iris_bo_wait()
2672- iris: Set priority for replaced engine context
2673
2674Juan A. Suarez Romero (3):
2675
2676- vc4/simulator: use i915/amd ioctls for BO
2677- vc4: properly restore vc4 debug option
2678- vc4: store tex sampler in proper register
2679
2680Julia Tatz (1):
2681
2682- zink: Advertise PIPE_CAP_NATIVE_FENCE_FD
2683
2684Juston Li (6):
2685
2686- venus: Enable VK_EXT_texture_compression_astc_hdr
2687- venus: sync venus protocol headers for VK_EXT_primitive_topology_list_restart
2688- venus: add support for VK_EXT_primitive_topology_list_restart
2689- venus: sync to latest venus protocol headers
2690- venus: add support for VK_EXT_multi_draw
2691- venus: use buffer cache for vkGetDeviceBufferMemoryRequirements
2692
2693Kai Wasserbäch (22):
2694
2695- chore(deps): clover: raise the minimum LLVM version to 11.0.0
2696- fix(FTBFS): meson: raise C++ standard to C++17
2697- chore(docs): rusticl: improve list of build dependencies
2698- fix: util/format: unused variable ‘desc’ [-Wunused-variable]
2699- fix: nir: unused variable ‘else_block’ [-Wunused-variable]
2700- fix: ac/llvm: unused variable ‘offset’ [-Wunused-variable]
2701- fix: mesa: unused variable ‘ret’ [-Wunused-variable]
2702- fix: mesa/st: variable ‘texobj’ set but not used [-Wunused-but-set-variable]
2703- fix: mesa/st: unused variable ‘src’ [-Wunused-variable]
2704- fix: gallivm: variable ‘type_kind’ set but not used [-Wunused-but-set-variable]
2705- fix: r600/sb: unused variable ‘repdep2’ [-Wunused-variable]
2706- fix: r600/sb: unused variable ‘fop’ [-Wunused-variable]
2707- fix: r600/sb: unused variable ‘r’ [-Wunused-variable]
2708- fix: r600/sfn: variable ‘opinfo’ set but not used [-Wunused-but-set-variable]
2709- fix: r600/sb: warning: unused variable ‘{b,nl}’ [-Wunused-variable]
2710- fix: r600/sfn: unused variable ‘splitpos’ [-Wunused-variable]
2711- fix: r600/sfn: unused variable ‘{splitpos,param}’ [-Wunused-variable]
2712- fix: r600/sfn: variable ‘fail’ set but not used [-Wunused-but-set-variable]
2713- fix: r600/sfn: unused variable ‘spi_sid’ [-Wunused-variable]
2714- fix: iris: unused variable ‘devinfo’ [-Wunused-variable]
2715- fix: zink: unused variable ‘intr’ [-Wunused-variable]
2716- fix: frontends/va: unused variable ‘max_pipe_hevc_slices’ [-Wunused-variable]
2717
2718Karmjit Mahil (30):
2719
2720- pvr: Fix bo mapping on alloc with PVR_BO_ALLOC_FLAG_CPU_ACCESS.
2721- pvr: Compete pvr_calc_fscommon_size_and_tiles_in_flight().
2722- pvr: Fix calculation in rogue_max_compute_shared_registers().
2723- pvr: Change indentation from tab to spaces for xml files.
2724- pvr: Set descriptor dirty flag based on other flags.
2725- pvr: Finish setting up job resolve info.
2726- pvr: Remove unimplemented push descriptor code.
2727- pvr: Add depth_bias_array handling on dbenable.
2728- pvr: Add clear program in pvr_device.
2729- pvr: Add static clear control stream templates in pvr_device.
2730- pvr: Add static clear VDM state in pvr_device.
2731- pvr: Add graphics pipeline barrier handling.
2732- Revert "pvr: Make pvr_cmd_pack() macro clearly internal"
2733- pvr: Implement clear ppp state emission from template.
2734- pvr: Add mid fragment pipeline barrier if needed.
2735- pvr: Handle VK_CULL_MODE_FRONT_AND_BACK.
2736- pvr: Emit cs words for load op on vkCmdBeginRenderPass().
2737- pvr: Add basic skeleton for event sub cmd.
2738- pvr: Remove struct pvr_emit_state and emit header directly.
2739- pvr: Add csb helpers macros to write into raw buffer.
2740- pvr: Make control stream word writing stricter.
2741- pvr: Complete pvr_emit_ppp_state().
2742- pvr: Add assert for texturestate being 0 in when emitting ppp state.
2743- pvr: Add EMIT_MASK in pvr_emit_ppp_state().
2744- pvr: Fix possible seg fault on csb copy.
2745- pvr: Fix overflow before widen warning for rgn headers size calculation.
2746- pvr: Handle pipeline barrier vk_sync.
2747- pvr: Remove outdated comments.
2748- pvr: Add PBE accum format size in bytes.
2749- pvr: Change pvr_get_hw_clear_color() to pack based on accum formats.
2750
2751Karol Herbst (189):
2752
2753- printf: extract clovers printf impl
2754- printf: add some unit tests
2755- nir: serialize printf metadata for CL kernels
2756- nir/lower_images: extract from clover
2757- nir: extract the clc inline sampler dedup pass from clc
2758- microsoft/clc: drop inline sampler reordering
2759- clc: undefine spirv defs to work around LLVMs headers
2760- vtn: silence warning about linkage
2761- gallium: mark the input data as const in pipe_grid_info
2762- util/memstream: insert null terminator on windows
2763- ci: update fails list
2764- ci: bumping all tags to make sure we don't have hidden fails
2765- nvc0: limit max global and alloc size
2766- ci: update CI to reflect clovers LLVM version bump
2767- nv50: fix code heap after pipe_shader_enum change
2768- nouveau: use the contexts pushbuf and client where possible
2769- nouveau/mm: make code thread safe
2770- nouveau/buffer: simplify uses of nouveau_fence_work
2771- nouveau/fence: rework nouveau_fence_emit so we can call it on emitted fences
2772- nouveau: move nouveau_context initialization to common code
2773- nouveau: wrap nouveau_bo_map
2774- nouveau: wrap nouveau_bo_wait
2775- nouveau: wrap all nouveau_pushbuf_space calls
2776- nouveau: wrap nouveau_pushbuf_validate
2777- nouveau: wrap nouveau_pushbuf_refn
2778- nouveau: use PUSH_KICK instead of nouveau_pushbuf_kick
2779- nouveau/fence: make the fence struct part of the fence API
2780- nv50: remove nouveau_fence_signalled prior nouveau_fence_wait
2781- nv50: remove nv50_bufctx_fence call in vbo_kick_notify
2782- nouveau/fence: per context fence
2783- nouveau: per context client and pushbuf
2784- nouveau: make fencing race free
2785- nvc0: make state handling race free
2786- nv50: race free state tracking
2787- nv50/ir: fix OP_UNION resolving when used for vector values
2788- nv50: properly flush the TSC cache on 3D
2789- rusticl: added
2790- rusticl: finish implementing clBuildProgram
2791- rusticl/util: add static_assert macro
2792- rusticl: implement clFinish and clFlush
2793- rusticl: implement clEnqueueReadBuffer
2794- rusticl/mem: support ops on subbuffers
2795- rusticl/kernel: basic implementation
2796- rusticl: nir bindings
2797- rusticl: translate spirv to nir and first steps to kernel arg handling
2798- rusticl/kernel: more clGetKernelWorkGroupInfo props
2799- rusticl/kernel: clGetKernelWorkGroupInfo allows a NULL device
2800- rusticl/mem: use helper context for COPY_HOST_PTR buffers
2801- rusticl/event: wrong but non crashing impl of clWaitForEvents
2802- rusticl: hack for CL 3.0
2803- rusticl/mesa: add fencing support
2804- rusticl/event: proper eventing support
2805- rusticl/kernel: implement clCreateKernelsInProgram
2806- rusticl/kernel: implement clEnqueueTask
2807- rusticl/program: undefine __IMAGE_SUPPORT__ if images are unsupported
2808- rusticl/queue: fix clReleaseCommandQueue
2809- rusticl/mem: implement clCopyBuffer
2810- rusticl/mem: implement clFillBuffer
2811- rusticl/event: implement marker and barrier
2812- rusticl/mem: add clEnqueueMigrateMemObjects stub
2813- rusticl/program: support compiling libraries
2814- rusticl: implement clUnloadPlatformCompiler
2815- rusticl/api: add param to query which contains application provided values
2816- rusticl/program: implement clCreateProgramWithBinary
2817- rusticl/event: add fake impl of clGetEventProfilingInfo
2818- rusticl/kernel: implement CL_KERNEL_COMPILE_WORK_GROUP_SIZE
2819- rusticl/mem: finish clEnqueueMigrateMemObjects
2820- rusticl/kernel: run driver requested lowering passes
2821- rusticl/kernel: add support for offsets
2822- rusticl/kernel: add missing preprocessor definitions
2823- rusticl: advertize CL 1.1 and CL 1.2 extensions
2824- rusticl/context: implement clSetContextDestructorCallback
2825- rusticl/kernel: implement clCloneKernel
2826- rusticl/device: set required double fp config values
2827- rusticl/device: advertize atomic caps
2828- rusticl: claim support for generic pointers
2829- rusticl/program: add il stubs
2830- rusticl: add svm func stubs
2831- rusticl/icd: add more func pointers
2832- rusticl/kernel: report SIMD width as work group sizes
2833- rusticl/kernel: sweep nir to reduce peak memory usage
2834- rusticl/device: expose cles_khr_int64 for compatible embedded devices
2835- rusticl/mem implement the memory \*WithProperties API
2836- rusticl/pipe: add clGetPipeInfo stub end return CL_INVALID_MEM_OBJECT
2837- rusticl/program: add stubs for program ctors and dtors
2838- rusticl/kernel: add stub for clGetKernelSubGroupInfo
2839- rusticl/event: timeout wait on condvar to abort on reaped worker threads
2840- rusticl: add support for printf
2841- rusticl/spirv: add print method
2842- rusticl/mesa: support mapping textures
2843- rusticl/mem: support read/write/copy ops for images
2844- rusticl/memory: rework mapping tracking
2845- rusticl/mem: implement maping images
2846- rusticl/mem: implement fill image
2847- rusticl/mem: implement copy image
2848- rusticl/mem: implement clCreateImage2D and 3D
2849- rusticl/mem: implement copies between buffers and images
2850- rusticl/mesa/context: un Arc the PipeContext
2851- rusticl/mem: implement clCreateSamplerWithProperties
2852- rusticl/kernel: support for images
2853- rusticl/kernel: inline samplers
2854- rusticl/kernel: lower memcpy
2855- rusticl/device: add retain/release callbacks for luxmark v4
2856- rusticl/device: implement cl_khr_3d_image_writes
2857- rusticl/kernel: set CL_DEVICE_PREFERRED_WORK_GROUP_SIZE_MULTIPLE
2858- rusticl/kernel: run some more opt passes
2859- rusticl: get rid of a few clones
2860- rusticl/memory: don't map more than necessary for copy operations
2861- rusticl/device: fix api consistency_device_and_host_timer test
2862- rusticl/queue: implement missing CL 3.0 bits
2863- rusticl/device: fix compiler features_macro
2864- rusticl/mem: add user_ptr fallback shadow buffer
2865- rusticl/mesa: make resource_from_user_memory an optinal feature
2866- rusticl/kernel: implement image_format and image_order
2867- rusticl/mem: fix IMAGE1D_BUFFER
2868- rusticl/mem: only write pitch when required
2869- rusticl: port to Rust 2018
2870- rusticl: kernel caching
2871- rusticl/kernel: prepare for nir caching
2872- rusticl/kernel: cache the nir as well
2873- rusticl: fix compiler features_macro
2874- rusticl/kernel: optimize local size
2875- rusticl: the CTS is a piece of shit
2876- rusticl/kernel: implement CL_KERNEL_ATTRIBUTES
2877- rusticl/program: some boilerplate code for SPIR-V support
2878- rusticl/memory: kernel read_write images prep work
2879- rusticl: disable fp64 support
2880- rusticl/device: allow overwriting the device_type via env
2881- rusticl/program: parse quoted paths in args
2882- rusticl/kernel: fix local buffers
2883- rusticl/device: report mesas version for CL_DRIVER_VERSION
2884- rusticl/icd: implement clGetExtensionFunctionAddressForPlatform
2885- rusticl/icd: add some way of debugging CL function calls
2886- rusticl/kernel: use real references for arguments
2887- rusticl: proper PIPE_MAP flags for internal maps
2888- rusticl: call glsl_type_singleton_init_or_ref
2889- rusticl/program: some more API validation
2890- ci: rusticl with llvmpipe
2891- rusticl: add README file
2892- rusticl/device: print error when libclc fails to load
2893- rusticl/mem: return errors for OOB accesses
2894- rusticl/device: fix custom device detection
2895- rusticl/mem: fix image OOB checks
2896- rusticl/mem: align fill pattern buffer to 4 bytes
2897- rusticl: limit global mem to 2GB
2898- rusticl/kernel: assign locations before passing the nir to drivers
2899- ac/llvm: support non 32 bit sized workgroup ids
2900- ac/llvm: fix load_const for vectors with more than 4 elements
2901- ac/llvm: fix load/store_shared for vectors with more than 4 elements
2902- ac/llvm: fix load_ubo for vectors with more than 4 elements
2903- radeonsi: stop vectorizing unpack_32_2x16_split
2904- iris: use images_used instead of num_images
2905- iris: speed up walking global bindings
2906- iris: bump IRIS_MAX_GLOBAL_BINDINGS to 128
2907- rusticl/kernel: preserve fp16 denorms to fix vload/vstore_half
2908- rusticl: add support for coherent resources
2909- rusticl/mesa: add bx() method to PipeTransfer
2910- rusticl: rework resource mappings a little
2911- rusticl: add helper ctx wrapper for coherent and direct mapping
2912- rusticl/mem: rewrite the (un)mapping code
2913- lp: claim being UMA
2914- rusticl/mem: propper CL_MEM_ALLOC_HOST_PTR support
2915- radeonsi: fail creating textures from user memory
2916- nir/lower_cl_images: set binding
2917- radeonsi: lower hadd and fisnormal
2918- rusticl: remove blit code
2919- rusticl: add RUSTICL_ENABLE to have a global way of enabling devices
2920- docs: Add documentation for Rusticl's env variables
2921- ci/llvmpipe: update to new RUSTICL_ENABLE var
2922- zink: handle more opcodes for CL
2923- zink: handle f2f16_rtz
2924- zink: fix spirv_builder_spec_const_uint
2925- aux/trace: add set_global_binding
2926- rusticl: force BIND_LINEAR on staging resources
2927- rusticl/mem: can only map staging textures directly
2928- rusticl: add small bitset impl
2929- rusticl/nir: add reads_sysval wrapper
2930- rusticl/kernel: lower system values before gathering info
2931- rusticl/kernel: add work_dim lowering
2932- zink: remove leftover work_dim lowering code
2933- rusticl/kernel: fix more 32 bit problems
2934- nir/algebraic: add 8 and 64 bit urol and uror lowering
2935- nir/algebraic: support CL vector accessors
2936- nir/algebraic: generalize vector_cmp lowering
2937- nir/algebraic: add vec8/16 cmp lowering
2938- iris: invalidate sysvals if grid dimension changes
2939- glsl: fix buffer texture type
2940- rusticl/device: put space at the end of CL_DEVICE_VERSION
2941- nir/lower_int64: fix shift lowering
2942
2943Kenneth Graunke (53):
2944
2945- intel/compiler: Use named NIR intrinsic const index accessors
2946- iris: Pass devinfo to iris_resource_level_has_hiz()
2947- iris: Enable HiZ for non-8x4 aligned miplevels on Icelake and later
2948- iris: Always retain ISL_AUX_USAGE_HIZ_CCS_WT in texture aux usage
2949- iris: Clean up iris_sample_with_depth_aux()
2950- intel/compiler: Drop variable group size lowering
2951- iris: Fix PIPE_CAP_UMA
2952- iris: Use linear for exported resources if we can't convey tiling
2953- intel/compiler: Change dg2_plus check to devinfo->verx10 >= 125
2954- iris: Delete unused iris_screen::aperture_bytes field
2955- iris: Don't print out XXX messages if resource creation fails
2956- iris: Fall back if iris_map_copy_region can't create a staging resource
2957- iris: don't create staging resources larger than half the aperture
2958- crocus: Don't print out XXX messages if resource creation fails
2959- crocus: Fall back if iris_map_copy_region can't create a staging resource
2960- crocus: Fix memory leaks on iris_resource_create failure paths
2961- anv: Fail to create a device on ver < 9
2962- anv: Drop checks for version 8 or 9
2963- anv: Delete image param support.
2964- anv: Delete batch buffer growing code.
2965- anv: Delete shader constants UBO from descriptor sets
2966- anv/tests: Don't use relocations in a test case
2967- anv: Delete "back" allocation from state pool
2968- anv: Delete relocation support from anv_block_pool
2969- anv: Delete "back" allocation from anv_block_pool
2970- anv: Drop state pool relocation munging
2971- anv: Delete use_relocations flag
2972- anv: Delete softpin checks
2973- anv: Delete anv_reloc_list_add()
2974- anv: Delete wrapper BOs for relocations
2975- anv: Delete relocation support from batch submission
2976- anv: Delete has_a64_buffer_access flag
2977- anv: Delete has_bindless_images and has_bindless_samples flags
2978- anv: Make a helper function for pinning a state pool's BOs
2979- anv: Drop offset from anv_reloc_list_append
2980- anv: Inline write_reloc into the only remaining caller
2981- anv: Remove anv_batch_emit_reloc and just open-code it
2982- intel/compiler: Use subgroup invocation for ICP handle loads
2983- iris: Ignore aux for copy_region source if there's no unresolved color
2984- intel/compiler: Use an existing URB write to end TCS threads when viable
2985- st/mesa: Optionally call nir_vectorize_tess_levels()
2986- intel/compiler: Vectorize gl_TessLevelInner/Outer[] writes
2987- intel/genxml: Add XY_FAST_COLOR_BLT
2988- blorp: Make blitter_supports_aux accessible from multiple files.
2989- blorp: Implement blitter clears via XY_FAST_COLOR_BLT
2990- blorp: Fix typo in blorp_xy_block_copy_blt
2991- Revert "intel/compiler: Vectorize gl_TessLevelInner/Outer[] writes"
2992- iris: Add miplevel parameters to iris_resource_texture_aux_usage
2993- intel/compiler: Vectorize gl_TessLevelInner/Outer[] writes [v2]
2994- iris: Use nir_intrinsic_load_global_constant for large constants
2995- intel/compiler: Run nir_opt_large_constants before scalarizing consts
2996- st/mesa: Let nir_opt_access() infer non-readable
2997- nir: Drop infer_non_readable option for nir_opt_access()
2998
2999Konrad Kleine (1):
3000
3001- Update docs/drivers/llvmpipe.rst
3002
3003Konstantin Kharlamov (5):
3004
3005- loader/dri3: simplify check for reply
3006- loader/dri3: remove a no-op free call
3007- meson: remove source_root() call in nir compiler path
3008- meson: remove source_root() call in main meson.build
3009- .mailmap: change spelling for Constantine Kharlamov
3010
3011Konstantin Seurer (63):
3012
3013- radv/nir_lower_abi: Use instructions_pass
3014- radv: Move accel struct structs to bvh/bvh.h
3015- radv: Move radv_acceleration_structure
3016- radv: Remove acceleration structure host builds
3017- radv: Always emulate fmin/fmax
3018- radv: Remove accel_struct_build
3019- radv: Remove unused push constant structs
3020- radv: Add create_build_pipeline_spv helper
3021- radv: Add the basics for GLSL bvh kerrnels
3022- radv: Add a GLSL morton kernel implementation
3023- radv: Switch to the GLSL morton implementation
3024- radv: Add a GLSL internal kernel implementation
3025- radv: Switch to the GLSL internal implementation
3026- radv: Add a GLSL leaf kernel implementation
3027- radv: Switch to the GLSL leaf implementation
3028- lavapipe: Remove st_shader_stage_to_ptarget
3029- radv: Fix stack size calculation with stage ids
3030- radv: Inline bvh build headers
3031- radv/ci: Remove host build related fails
3032- lavapipe: Set ss_dirty in emit_state
3033- radv: Advertise subgroup ops for rt stages
3034- vulkan: Add common physical device management
3035- radv: Use the common physical device enumeration
3036- anv: Use the common physical device enumeration
3037- turnip: Use the common physical device enumeration
3038- lvp: Use the common physical device enumeration
3039- panvk: Use the common physical device enumeration
3040- v3dv: Use the common physical device enumeration
3041- radv: Proper handling for inactive instance nodes
3042- radv: Deduplicate push constant structs
3043- radv/rra: Map accel struct VAs to handles
3044- radv/rra: Remove redundant bounds validation
3045- radv: Make the radv_buffer_get_va parameter const
3046- radv/rra: Replace aliasing assert with a warning
3047- radv: Explicitly store the VA of accel structs
3048- radv: Cleanup radv_GetInstanceProcAddr
3049- radv: Add a common traversal build helper
3050- radv/rq: Use the common traversal helper
3051- radv/rtpso: Use the common traversal helper
3052- radv: Use scalar layout for BDA references
3053- radv: Add and use AS and scratch layout structs
3054- radv: Fix incorrect build info indexing
3055- radv: Use cache_uuid for accel struct compatibility
3056- radv: Remove main_loop_case_visited
3057- radv/rra: Transcode nodes recursively
3058- radv: Rename node_internal to node_box32
3059- radv: Add radv_bvh_node_box16
3060- radv: Use half floats for box16 coords
3061- radv/rra: Handle box16 nodes
3062- radv/rra: Calculate bvh size requirements recursively
3063- radv: Remove create_accel_build_shader
3064- radv/rra: Fix dumps in the case of aliasing
3065- radv/rra: Validate before gathering bvh info
3066- radv/rra: Add basic header validation
3067- radv/rra: Continue dumping accel structs if validation fails
3068- radv/rra: Use the accel struct type for header validation
3069- radv/rt: Fix setting tmax for opaque AABBs
3070- radv/rt: Load instance id and custom index on demand
3071- radv/rt: Set vars.arg in the traversal shader
3072- radv/rt: Restore prev barycentrics when rejecting hits
3073- radv/rra: Fix copying accel structs that were not built yet
3074- radv/ray_queries: Fix AABB handling
3075- radv/rt: Check space before emitting descriptors
3076
3077Kostiantyn Lazukin (1):
3078
3079- vulkan/cmd_queue: Do not generate unreachable vk_free_* calls.
3080
3081Kuixi Ren (1):
3082
3083- radeonsi/vcn: Add ability to encode with ltr
3084
3085Laurent Bigonville (1):
3086
3087- Try to fix FTBFS on kfreebsd architecture
3088
3089Leandro Ribeiro (9):
3090
3091- vulkan/wsi/wayland: move some structs to beginning of code
3092- vulkan/wsi/wayland: remove unnecessary spaces in struct fields
3093- vulkan/wsi/wayland: introduce struct wsi_wl_surface
3094- vulkan/wsi/wayland: take ownership of wsi_wl_surface when creating chain
3095- vulkan/wsi/wayland: move wl_surface and wl_display from chain to struct wsi_wl_surface
3096- vulkan/wsi/wayland: remove refcount from struct wsi_wl_display
3097- vulkan/wsi/wayland: add default dma-buf feedback support
3098- vulkan/wsi/wayland: add per-surface dma-buf feedback support
3099- vulkan/wsi/wayland: avoid useless re-allocations when receiving per-surface dma-buf feedback
3100
3101Leo Liu (2):
3102
3103- meson: add with_gallium_virgl to allow it as VA backend driver
3104- frontends/va: fix build error for vaSyncBuffer with older VA
3105
3106LingMan (10):
3107
3108- rusticl: Fix compilation if stdout and/or stderr aren't symbols
3109- rusticl/bindgen: Use \`allowlist-\*\` instead of \`whitelist-\*\` switches
3110- docs/rusticl: Document minimum required bindgen version
3111- rusticl/api: Use iterators in create_program_with_source
3112- rusticl/api: Drop UTF-8 conversion of input source
3113- rusticl/api: Fix creating a program if a nul byte is within the specified string length
3114- rusticl/api: Factor pushing to the Vec out of the if
3115- rusticl/api: Shrink unsafe block
3116- rusticl/api: Don't check the program source for nul bytes needlessly
3117- rusticl/api: Interpret \`lengths\` as a slice of Option<NonZeroUsize>
3118
3119Lionel Landwerlin (152):
3120
3121- anv: fixup PIPE_CONTROL restriction on gfx8
3122- intel/ds: track untyped dataport flushes
3123- intel/fs: store num of resume shaders in prog_data
3124- intel/fs: fixup simd selection with shader calls
3125- intel/compiler: document units of brw_ubo_range fields
3126- intel/nir: specify synchronous value for tracing op
3127- anv: don't return incorrect error code for vkCreateDescriptorPool
3128- intel/drm-shim: update shim to support DG2
3129- anv: fix GetPipelineExecutableStatistics for ray tracing pipelines
3130- anv: fixup assertions on lowered storage formats
3131- anv: fix assert in memory budget code when extension is not supported
3132- anv: prevent trying to mmap non host visible memory
3133- anv: remove the LOCAL_MEM allocation bit
3134- intel/fs: fix load_scratch intrinsic
3135- intel/fs: fixup scratch load/store handling on Gfx12.5+
3136- intel/fs: switch compute push constant loads to LSC
3137- intel/fs: remove unused opcode
3138- intel/fs: fixup SEND validation check on overlapping src0/src1
3139- intel/fs: switch register allocation spilling to use LSC on Gfx12.5+
3140- intel/fs: port block a64/surface messages to use LSC
3141- intel/fs: bump max SIMD size for A64 atomics with LSC
3142- intel: add a hasvk vulkan driver
3143- hasvk: remove entrypoints for gfx9+
3144- hasvk: stop advertising Vk 1.3 on non-softpin
3145- hasvk: remove acceleration structure code
3146- hasvk: remove ray tracing code
3147- hasvk: remove mesh code
3148- anv: silence fedora build warning
3149- anv: remove support for gfx7/8
3150- anv: remove unused gfx7 code
3151- anv/tests: remove back allocation tests
3152- anv/hasvk: tweak loading failure messages
3153- anv: remove non present field in upstream spec
3154- vulkan: fixup 1.3.226 update
3155- anv: enable EXT_depth_clamp_zero_one
3156- hasvk: expose VK_EXT_depth_clamp_zero_one
3157- anv: add a new NO_LOCAL_MEM allocation flag
3158- anv: don't export flush_compute_state
3159- anv: don't export gfx state flushing helper
3160- anv: combine flushes in Draw/DrawIndexed/DrawIndirectByteCountEXT
3161- anv: add missing tracepoint
3162- blorp: defined operations for debug purposes
3163- intel/utrace: make blorp tracepoints more readable
3164- anv: trace xfb queries
3165- anv: clear descriptorsets if AllocateDescriptorSets fails
3166- anv: add support for EXT_mutable_descriptor_type
3167- anv: add missing wokraround for texture cache invalidate
3168- intel/nir/rt: fixup generate hit
3169- intel/nir/rt: spill/fill the entire ray query data
3170- intel/nir/rt: remove ray query mem hit writes at initialization
3171- intel/nir/rt: fix ray query proceed level
3172- intel/nir/rt: change scratch check validation
3173- intel/nir/rt: store ray query state in scratch
3174- intel/fs: fixup a64 messages
3175- intel/nir/rt: fixup alignment of memcpy iterations
3176- ci: add python3-ply to debian/fedora images
3177- ci: disable intel-clc on fedora
3178- ci: build our own version of the LLVM SPIRV translator
3179- ci/debian: don't use libclc from the system
3180- ci: bump llvm to 13 for some builders
3181- ci: disable intel-clc on debian-vulkan
3182- anv: remove HDC flush from invalidate bits
3183- meson: bump required llvm-spirv version with intel-clc
3184- intel/devinfo: Rename & implement num_dual_subslices
3185- intel/rt: fix procedural primitive ID access
3186- intel/nir: reuse rt helper
3187- intel/fs: take a builder arg for resolve_source_modifiers()
3188- intel/nir: fixup preserved metadata in rayquery lowering
3189- intel/nir: fix potential invalid function impl ptr usage
3190- intel/nir: disable assert on async stack id
3191- intel/fs: disable split_array_vars on opencl kernels
3192- genxml: add missing no duplicate anyhit flag
3193- intel/mi_builder: allow half GP registers for dereferencing
3194- anv: disable SIMD16 for RT shaders
3195- anv: add new command buffer space allocation
3196- anv: setup scratch space correctly for RT shaders
3197- anv: bump client visible address heap to 32GiB
3198- anv: use the right dispatch size for tracing shaders
3199- anv: flag BO for write combine when CPU visible and potentially in lmem
3200- iris: better error message with GuC loading failures
3201- intel: add INTEL_DEBUG=capture-all to capture everything upon hang
3202- pps: enable track_event in intel.cfg
3203- anv: add grl build dependency on entrypoints
3204- nir: fix NIR_DEBUG=validate_ssa_dominance
3205- vulkan/runtime: don't lookup the pipeline disk cache if disabled
3206- anv: remove unused code
3207- anv: comment out the Gfx8/9 VB cache key workaround for newer Gens
3208- anv: limit calls into cmd_buffer_flush_dynamic_state
3209- anv: optimize 3DSTATE_VF emission
3210- anv: remove multiple push descriptors
3211- isl: avoid gfx version switch cases on the hot path
3212- anv: limit push constant reemission
3213- anv: track descriptor set layout flags
3214- anv: initialization pipeline layout to 0s
3215- anv: add analysis for push descriptor uses and store it in shader cache
3216- anv: reduce BT emissions & surface state writes with push descriptors
3217- anv: add a layer for hitman3
3218- nir/lower_shader_calls: rematerialize more trivial values
3219- nir/lower_shader_calls: lower scratch access to format internally
3220- nir/lower_shader_calls: avoid respilling values
3221- nir/lower_shader_calls: rematerialize values in more complex cases
3222- nir/lower_shader_calls: add NIR_PASS_V internally
3223- nir/lower_shader_calls: cleanup shaders a bit more post split
3224- nir/lower_shader_calls: add an option structure for future optimizations
3225- nir/lower_shader_calls: add a pass to trim scratch values
3226- nir/lower_shader_calls: add a pass to sort/pack values on the stack
3227- nir/lower_shader_calls: move scratch loads closer to where they're needed
3228- nir/lower_shader_calls: run opt_cse after lower stack intrinsics
3229- radv: tweak lower_shader_calls parameters
3230- anv: enable localized loads for lower_shader_calls
3231- anv: disable mesh in memcpy
3232- anv: init major/minor before WSI
3233- vulkan/wsi/wl: correctly find whether the compositor uses the same GPU
3234- nir/divergence_analysis: add some missing RT intrinsics
3235- nir/divergence_analysis: add missing desc_set_address_intel
3236- intel/clc: assert when libclc shader is not found
3237- drm-uapi: bump headers
3238- isl: add new MOCS field for protected buffers
3239- intel/common: add detection of protected context support
3240- dri: rename PROTECTED_CONTENT in PROTECTED_SURFACE
3241- gallium: rename PROTECTED_CONTENT cap into PROTECTED_SURFACE
3242- egl: Add EGL_EXT_protected_content support
3243- st/gallium: plumb protected context creation
3244- iris: handle protected BO creation
3245- iris: Emit protection & session ID on protected command buffers
3246- iris: enable protected contexts
3247- intel/fs: require UNDEFs register offsets to be aligned to REG_SIZE
3248- intel/fs: make split_virtual_grfs deal with partial undefs
3249- intel/fs: reduce liveness of variables in lowering passes
3250- intel/fs: use fs implementation of dump_instructions
3251- intel/compiler: don't allocate compaction arrays on the stack
3252- anv: remove shader fp64 inspection after parsing
3253- anv: Reduce RHWO optimization (Wa_1508744258)
3254- anv: fix missing VkPhysicalDeviceExtendedDynamicState3PropertiesEXT handling
3255- anv: fixup invalid enum for nir environment
3256- anv: split internal surface states from descriptors
3257- anv: bump pool bucket max allocation size
3258- anv: get rid of ilog2_round_up
3259- intel/perf: allocate cleared counter infos
3260- intel/perf: fix B/C counters accumulation in non query mode
3261- nir/lower_explicit_io: fix metadata preserve
3262- nir/lower_shader_calls: update metadata before validation
3263- nir/lower_shader_calls: wrap only jumps rather than entire code blocks
3264- Revert "nir/lower_shader_calls: put inserted instructions into a dummy block"
3265- intel/fs: put scratch surface in the surface state heap
3266- anv: fix 3d state initialization
3267- anv: ensure CPS is initialized when KHR_fragment_shading_rate is disabled
3268- anv: fixup context initialization on DG2
3269- intel/fs: improve Wa_22013689345 workaround
3270- blorp: support negative offsets in addresses
3271- anv: generate correct addresses for state pool offsets
3272- nir/divergence: add missing btd_shader_type_intel
3273
3274Liviu Prodea (5):
3275
3276- d3d12/dzn/spirv2dxil: Require version library
3277- Microsoft clc: strip lib prefix Otherwise OpenCLon12 ICD can't load it
3278- meson: Microsoft / maybe Intel CLC need the all-targets workaround just like clover
3279- meson: Only draw with llvm depends on native directly Tests, softpipe or AMD drivers don't depend on it directly
3280- clc/clover: Link clang statically when shared-llvm is disabled
3281
3282Lucas Stach (38):
3283
3284- etnaviv: properly check number of layers in surface creation
3285- etnaviv: fix TS buffer allocation for 3D textures
3286- etnaviv: allow 3D textures with TS in transfer
3287- etnaviv: don't expose array and 3D texture support on pre-halti GPUs
3288- etnaviv: expose ARB_draw_instanced
3289- etnaviv: mark instanced draw extensions as supported in docs/features.txt
3290- etnaviv: move checking for MC2.0 for TS into screen init
3291- etnaviv: use linear PE rendering only on properly aligned surfaces
3292- etnaviv: add debug option to disable linear PE feature
3293- etnaviv: fill alpha channel for DXT1_RGB textures
3294- etnaviv: don't use TS with multi-layer resources
3295- etnaviv: texture-desc: sample 1D textures using 2D mode
3296- etnaviv: texture-desc: drop superfluous condition
3297- etnaviv: rewrite sampler TS check
3298- etnaviv: slim down etna_shader_key
3299- etnaviv: pass shader key by reference
3300- etnaviv: properly reference flush_resources
3301- etnaviv: move etna_layout_multiple into etnaviv_resource.c
3302- etnaviv: assert valid layout in etna_layout_multiple
3303- etnaviv: compute linear resource Y alignment in etna_layout_multiple
3304- etnaviv: handle compressed texture formats in etna_layout_multiple
3305- etnaviv: increase alignment for MSAA resources
3306- etnaviv: rs: fix MSAA alignment adjustment
3307- etnaviv: rs: try to find exact format match first
3308- etnaviv: properly size TS buffer for MSAA resources
3309- etnaviv: set LOGIC_OP_UNK24 for MSAA rendering on SMALL_MSAA GPUs
3310- etnaviv: disable PE_COLOR_FORMAT_OVERWRITE with MSAA
3311- etnaviv: blt: scale operations by MSAA mode
3312- etnaviv: blt: try to find exact format match first
3313- etnaviv: update headers from rnndb
3314- etnaviv: fix late Z with MSAA active
3315- etnaviv: fix shader register control with MSAA
3316- etnaviv: switch to late Z when linear PE is used
3317- etnaviv: fix tile status interaction with write mappings
3318- etnaviv: always use RS align when GPU has TEXTURE_HALIGN feature
3319- etnaviv: rs: fix blits with insufficient alignment for dual pipe operation
3320- etnaviv: blt: use correct TS offset in clear operations
3321- etnaviv: fix wrong surface TS clear size
3322
3323Luis Felipe Strano Moraes (8):
3324
3325- anv: fix FTBFS on grl due to changes in clang 15
3326- anv: reword info flag in intel_clc's getopt to avoid clash
3327- anv: add missing separator to help for intel_clc
3328- anv: fixing typo on description of output flag for intel_clc
3329- anv: added proper handling for input argument in intel_clc
3330- anv: adding parsetab.py to the .gitignore for grl
3331- meson: simplified meson for enabling ray-tracing on Intel
3332- meson: only enable intel-clc for x86_64 builds
3333
3334M Henning (1):
3335
3336- nv/nir: Set ssbo CacheMode from intrinsic access
3337
3338Marcin Ślusarz (53):
3339
3340- intel/compiler: fix mesh urb write regression
3341- nir/lower_task_shader: print shader after each step
3342- intel/compiler: add support for non-zero base in [load|store]_shared intrins
3343- Revert "nir/lower_task_shader: don't use base index for shared memory intrinsics"
3344- anv: disable task redistribution
3345- spirv, compiler: add "bool nv" to shader_info.mesh
3346- intel/compiler: implement EXT_mesh_shader
3347- anv: check EXT_mesh_shader whenever NV_mesh_shader is checked
3348- anv: replace VK_SHADER_STAGE_[TASK|MESH]_BIT_NV with VK_SHADER_STAGE_[TASK|MESH]_BIT_EXT
3349- anv: implement EXT_mesh_shader
3350- anv: implement draw calls for EXT_mesh_shader
3351- anv: enable EXT_mesh_shader
3352- anv: fix emission of primitive replication packet for mesh stage
3353- nir: add uses_wide_subgroup_intrinsics to task/mesh shader_info
3354- anv: add support for anv_assume_full_subgroups to task & mesh stages
3355- anv: small cleanup of anv_graphics_pipeline_compile
3356- nir, anv, hasvk, radv: pull uses_wide_subgroup_intrinsics into shader_info
3357- intel/compiler: remove second shading rate lowering for mesh
3358- intel/compiler: print shader after successful brw_nir_lower_shading_rate_output
3359- intel/compiler: use nir_lower_task_shader pass
3360- nir/lower_task_shader: lower small stores & loads to shared when requested
3361- intel/compiler/task: use shared memory for small task payload loads & stores
3362- intel/compiler: refactor brw_nir_lower_mem_access_bit_sizes
3363- intel/compiler: add support for 8/16 bits task payload loads
3364- radv: use nir_shader_instructions_pass in radv_nir_lower_ycbcr_textures
3365- iris: remove invalid nir_metadata_preserve from iris_fix_edge_flags
3366- crocus: remove invalid nir_metadata_preserve from crocus_fix_edge_flags
3367- glsl: use nir_shader_instructions_pass in gl_nir_lower_atomics
3368- glsl: use nir_shader_instructions_pass in gl_nir_lower_images
3369- glsl: use nir_shader_instructions_pass in gl_nir_lower_samplers_as_deref
3370- nir: use nir_shader_instructions_pass in nir_lower_alu
3371- nir: use nir_shader_instructions_pass in nir_lower_bool_to_bitsize
3372- nir: use nir_shader_instructions_pass in nir_lower_bool_to_float
3373- nir: use nir_shader_instructions_pass in nir_lower_bool_to_int32
3374- nir: use nir_shader_instructions_pass in nir_lower_64bit_phis
3375- nir: use nir_shader_instructions_pass in nir_lower_clamp_color_outputs
3376- nir: use nir_shader_instructions_pass in nir_lower_clip_disable
3377- nir: use nir_shader_instructions_pass in nir_lower_clip_halfz
3378- nir: use nir_shader_instructions_pass in nir_lower_drawpixels
3379- nir: use nir_shader_instructions_pass in nir_lower_fb_read
3380- nir: use nir_shader_instructions_pass in nir_lower_frexp
3381- nir: use nir_metadata_none instead of its value
3382- nir: use nir_shader_instructions_pass in nir_lower_interpolation
3383- nir: use nir_shader_instructions_pass in nir_lower_samplers
3384- nir: use nir_shader_instructions_pass in nir_split_var_copies
3385- nir: use nir_shader_instructions_pass in nir_split_per_member_structs
3386- intel/compiler: fix loading of draw_id from task & mesh payload
3387- anv: add support for mesh shading in INTEL_MEASURE
3388- intel/ds: add new category/stage for draw mesh events
3389- anv: add mesh shading tracepoints
3390- intel/genxml: fix width of 3DSTATE_TASK_CONTROL.MaximumNumberofThreadGroups
3391- anv: set 3DSTATE_[MESH|TASK]_CONTROL.MaximumNumberofThreadGroups
3392- anv: program 3DSTATE_MESH_DISTRIB with the recommended values
3393
3394Marek Olšák (135):
3395
3396- nir: add nir_intrinsic_image_samples_identical
3397- nir: add nir_intrinsic_image_descriptor_amd
3398- nir: add nir_texop_descriptor_amd
3399- nir: add shader_info::uses_resource_info_query for txs, levels, samples, etc.
3400- ac/llvm: implement nir_intrinsic_image_deref_samples_identical
3401- ac/llvm: implement nir_intrinsic_image_deref_descriptor_amd
3402- ac/llvm: implement nir_texop_descriptor_amd
3403- ac/nir: add ac_nir_lower_resinfo
3404- radeonsi,radv: run ac_nir_lower_resinfo
3405- ac/llvm: remove all resinfo code now that it's lowered
3406- radeonsi: implement a non-scaled compute blit+resolve and use it on gfx11
3407- radeonsi: don't assume that TC_ACTION_ENA invalidates L1 cache on gfx9
3408- radeonsi/ci: update failing tests on navi21
3409- radeonsi: fix a regression due to reordering PIPE_SHADER_*
3410- ac/llvm: handle external textures in ac_nir_lower_resinfo
3411- radeonsi: merge both fail paths in si_set_vb_descriptor
3412- radeonsi: add vertex buffers into the BO list in set_vertex_buffers
3413- radeonsi: remove vb_descriptors_gpu_list only used for debugging
3414- radeonsi: remove temporary si_context::vb_descriptor_user_sgprs
3415- radeonsi: use si_cp_dma_prefetch_inline for prefetching VBO descriptors
3416- radeonsi: use do..while loops and other cosmetic changes in display list path
3417- ttn: set the correct sampler declaration type in the presense of txs and lod
3418- gallivm: fix TXQ crash with MSAA samplers
3419- gallivm: initialize texture_unit_offset in emit_size_query
3420- gallium/u_blitter: remove unused code for integer MSAA resolve
3421- gallium/u_blitter: fix the has_txf support condition
3422- gallium/u_blitter: treat Z scaling as scaled blits
3423- gallium/u_blitter: clean up IMMs in util_make_fs_blit_msaa_gen
3424- gallium/u_blitter: make nearest filtering emulation using TXF conformant
3425- gallium/u_blitter: make the bilinear filter for MSAA resolving conformant
3426- glthread: unbind framebuffers in glDeleteFramebuffers
3427- glthread: call _mesa_glthread_DeleteBuffers unconditionally
3428- glthread: track GL_READ_FRAMEBUFFER bindings too
3429- glthread: track glBindFramebufferEXT, not just glBindFramebuffer
3430- glthread: add missing code for GL_ARB_sparse_texture
3431- util/xmlconfig: allow drivers to override option values
3432- radeonsi: rename stop_exec_on_failure -> allow_context_lost
3433- radeonsi: allow lost context with aux_contexts
3434- winsys/amdgpu: terminate process on CS rejection when unrobust context is lost
3435- winsys/amdgpu: flatten huge if and reorder code in amdgpu_cs_submit_ib
3436- winsys/amdgpu: change num_rejected_cs to a bool flag
3437- radeonsi/ci: make the running script easy to use
3438- Revert "mesa: implement a display list / glBitmap texture atlas"
3439- mesa: create glBitmap textures while creating display lists
3440- st/mesa: fix potential use-after-free in draw_bitmap_quad
3441- ci: update pass/fail results for spec\@!opengl 1.0\@gl-1.0-dlist-bitmap
3442- winsys/amdgpu: use cached GTT for command buffers and don't set the 32BIT flag
3443- ci: disable the freedreno farm.
3444- radeonsi/ci: add skips of tests not built on Linux
3445- radeonsi: don't pass num_patches via derived_tess_state, pass it via si_context
3446- radeonsi: make the primitive type constant with tessellation
3447- radeonsi: move \*rs to its only use in si_draw
3448- radeonsi: remove the prim_restart_tri_strips_only option
3449- radeonsi: move fixing ngg_culling into si_update_shaders
3450- radeonsi: unify the logic that sets rast_prim
3451- radeonsi: precompute GS_OUT_PRIM in advance
3452- radeonsi: set GS_STATE_OUTPRIM and PROVOKING_VTX_INDEX only when they change
3453- radeonsi: cosmetic changes in si_emit_rasterizer_prim_state
3454- radeonsi: remove 1 draw packet order codepath, keep the first one
3455- radeonsi: move patch_vertices-related tessellation updates out of si_draw
3456- radeonsi: move set_patch_vertices into si_state_shaders.cpp
3457- ac/gpu_info: handle LPDDR4 and 5 in ac_memory_ops_per_clock
3458- ac/surface: disallow 256KB swizzle modes on gfx11 APUs
3459- radeonsi: set VS_OUT_MISC_SIDE_BUS_ENA=1 for clip distance exports on gfx10.3
3460- mesa: fix glDrawRangeElementsBaseVertex should be compiled into display list
3461- glthread: fix draws not compiled into a display list should generate an error
3462- glthread: generate errors for glGet functions between glBegin/End
3463- glthread: always sync for glShaderSource because invalid params can crash
3464- glthread: fix glGetIntegerv(GL_CLIENT_ACTIVE_TEXTURE)
3465- glthread: work around GL_INVALID_OPERATION with OpenGL ES 1.x draws
3466- frontend/dri: make the backgroundCallable extension optional
3467- frontend/dri: sync glthread when calling from the app side
3468- radeonsi: simplify radeonsi_zerovram implementation
3469- driconf: disable glthread for DeusEx:MD and F1 2015 due to a perf drop
3470- radeonsi/ci: add glx\@glx-swap-event_async to CI failures for glthread
3471- radeonsi: don't flush asynchronously for fence_server_signal
3472- glthread: execute glSignalSemaphoreEXT synchronously
3473- radeonsi: enable glthread by default
3474- gallium/u_threaded: add an option for unsychronized create_fence_fd
3475- mesa: replace ALLOC_PRIMS with persistently-allocated context arrays
3476- mesa: remove _mesa_draw_gallium_fallback by adapting st_feedback_draw_vbo
3477- mesa: remove unused code using _mesa_prim
3478- mesa: clean up st_indirect_draw_vbo interface and callers
3479- mesa: trivial simplifications in _mesa_reference_buffer_object_
3480- mesa: inline _mesa_reference_buffer_object
3481- mesa: inline _mesa_handle_bind_buffer_gen and correct things
3482- mesa: restructure bind_buffer_object for better performance
3483- mesa: remove unused gl_buffer_object::Data and Written
3484- mesa: reorder and pack gl_buffer_object
3485- vbo: don't call _glapi_set_dispatch for GL_COMPILE_AND_EXECUTE loopback
3486- mesa: don't restore the dispatch at the end of glCallList
3487- mesa: remove unused function _mesa_get_dispatch
3488- mesa: initialize OutsideBeginEnd directly instead of through Exec
3489- glapi: rename 'vtxfmt' to 'beginend' to make it clear
3490- api_hw_select_init_h.py: simplify the code
3491- mesa: remove deprecated comments
3492- mesa: rename GL dispatch initialization functions to make the intent clear
3493- glthread: use GLenum16 for enums, but clamp it to 0xffff to get correct errors
3494- glthread: use 8-bit GLenum for prim modes where it results in better packing
3495- glthread,hud: draw per-frame values and expose the number of batches per frame
3496- glthread: skip glMultMatrixf if it's identity
3497- mesa: use memcmp instead of floating-point comparisons in glMultMatrixf
3498- mesa: make glPopMatrix a no-op if the matrix hasn't changed
3499- radeonsi: rename si_create_multi_fence -> si_alloc_fence
3500- winsys/amdgpu: fix (enable) preemption for chained IBs
3501- radeonsi: fix max_dw computation for CS preambles
3502- gl_marshal.py: remove/simplify parameters
3503- gl_marshal.py: move the unmarshal table into a separately generated file
3504- gl_marshal.py: rework how the marshal dispatch table is initialized
3505- gl_marshal.py: remove the -O1 hack and manual SET_* inlining
3506- gl_marshal.py: inline functions for readability
3507- gl_marshal.py: simplify print_sync_call and rename to print_call
3508- glthread: use a constant expression instead of cmd_size in custom functions
3509- glthread: add more DrawArrays/Elements variants with fewer fields
3510- glapi: remove EXT and ARB suffixes from Draw functions
3511- glthread: demystify Draw function names
3512- glthread: rewrite CallList merging and do it in the app thread
3513- gallium/u_threaded: don't call simplify_draw_info redundantly
3514- gl_marshal.py: C style fixups
3515- glthread: merge and collapse glBindBuffer calls that unbind and then bind
3516- glthread remove the unused \*last pointer from unmarshal functions
3517- glthread: don't sync for glIsEnabled(GL_BLEND, GL_LIGHTING, GL_POLYGON_STIPPLE)
3518- cso: start without u_vbuf by default if it's not always used
3519- cso: constify some parameters to remove typecasts
3520- cso: fix broken optimization for sampler state lookups
3521- cso: don't destroy CSOs that are saved
3522- cso: inline more functions because some parameters like key_size are literals
3523- cso: make memcmp and hash computation use a literal key size for blend state
3524- driconf: add a workaround for a cursor issue with kwin_wayland
3525- driconf: add a workaround for Cossacks 3
3526- nir: add nir_intrinsic_optimization_barrier_vgpr_amd for LLVM
3527- radeonsi: force the MSAA resolve shader to use 1 clause for MSAA loads
3528- radeonsi/gfx11: fix compute scratch buffer - WAVES is always per SE
3529- st/mesa: suppress async glthread flushing for GLX_EXT_texture_from_pixmap
3530- st/vdpau: fix interop with GL
3531
3532Marek Vasut (1):
3533
3534- etnaviv: Use old set of state registers for PE configuration on GC880
3535
3536Mario Kleiner (1):
3537
3538- vulkan/wsi/display: Reset connector state in vkReleaseDisplay().
3539
3540Mark Collins (8):
3541
3542- tu: Implement VK_EXT_non_seamless_cube_map
3543- tu: Allocate private memory per-device instead of per-pipeline
3544- tu: Clamp priority in DRM submitqueue creation
3545- tu: Expose VK_EXT_tooling_info using common implementation
3546- tu: Retain allocated CSes in tu_autotune_on_submit
3547- tu: Optimize hash_renderpass_instance by removing XXH64_update
3548- tu: Only write \`A6XX_PC_PRIMITIVE_CNTL_0` if changed
3549- tu: Clean up variable usage in \`tu6_draw_common`
3550
3551Mark Janes (6):
3552
3553- Revert "anv: Do not copy garbage to batch_bo"
3554- anv: Track BOs that need a write-combined mapping
3555- anv: Use WC mapped local memory for block pool BO
3556- anv: Allocate buffers with write-combined local memory
3557- anv: compile anv_acceleration_structure.c
3558- anv: work around improper buffer usage in hitman3
3559
3560Martin Krastev (1):
3561
3562- meson: add svga gallium driver to 'auto' group on arm/aarch64
3563
3564Martin Roukala (né Peres) (20):
3565
3566- radv/ci: document a recent regression
3567- radv/ci: move some tests from the vega10 fail to its flake list
3568- radv/ci: add more tests to the flake list of vega10
3569- radv/ci: skip host_write_transfer_src.1048576 on renoir
3570- radv/ci: bump the console activity timeout of VanGogh to 3 minutes
3571- radv/ci: uprev boot2container
3572- radv/ci: disable the POLARIS10 jobs
3573- radv/ci: disable kabini's vkd3d job
3574- radv/ci: run vkcts on the two steam decks in parallel
3575- radv/ci: document an unstable test
3576- radv/ci: move some tests from the renoir fail to its flake list
3577- ci/b2c: add support for the new format of CI_RUNNER_TAGS
3578- radv/ci: bump the job count on navi21
3579- radv/ci: run two jobs in parallel to reduce the execution time
3580- zink: update the RADV expectations
3581- zink: mark a test as a flake in RADV expectations
3582- zink/ci: mark another test as fixed after the frontend caching series
3583- zink/ci: add another subtest to the list of known failures for radv
3584- Revert "glx: Fix drawable refcounting for naked Windows"
3585- Revert "glx/dri: Fix DRI drawable release at MakeCurrent time"
3586
3587Matt Coster (3):
3588
3589- util: Include stddef.h in util/macros.h
3590- pvr: Add pvr_csb_unpack().
3591- pvr: Fix assignment to ISP back_a.sref
3592
3593Matt Turner (4):
3594
3595- intel/tools: Also look for 'batch' tag
3596- freedreno/ir3: Don't use the broken idiv lowering
3597- freedreno/ir3: Use nir_opt_idiv_const
3598- intel/dev: Set display_ver = 13 on all ADL/RPL/DG2
3599
3600Mauro Rossi (5):
3601
3602- util: glsl2spirv.py: ensure '#endif' is printed in new line
3603- hasvk: fix android build and reported API version
3604- Android.mk: Fix gnu++14 related build failures
3605- r600/sfn: allow building with clang 6 (Android 9)
3606- AOSP: Add intel_hasvk vulkan library suffix
3607
3608Max Kellermann (2):
3609
3610- gallium/u_threaded: add missing reference counts for draw_multi slots
3611- gallium/u_threaded: fix offset calculation for draw_multi slots
3612
3613Michael Skorokhodov (1):
3614
3615- egl: Return EGL_BAD_MATCH for invalid share_list
3616
3617Michael Tang (3):
3618
3619- microsoft/compiler: Add dxil_logger type and parameter to nir_to_dxil
3620- nir: add nir_instr_as_str
3621- microsoft/compiler: Change calls to NIR_INSTR_UNSUPPORTED to log to dxil_logger instead
3622
3623Michel Dänzer (6):
3624
3625- gallivm: Fix LLVMAtomicRMWBinOpFMax spelling
3626- lavapipe: Fix float32_atomic_min_max spelling
3627- winsys/amdgpu: Set RADEON_FLAG_32BIT again
3628- radeonsi/ci: Test piglit quick_shader profile as well
3629- Revert "egl/glx: add fallback for zink loading"
3630- vulkan/wsi/wayland: Set num_modifier_lists = 0 if num_drm_modifiers == 0
3631
3632Mihai Preda (21):
3633
3634- gallivm: LLVM-15 opaque pointers: disable LLVMGetElementType(ptr_type)
3635- gallivm: drop unused "emit_load" in lp_llvm_buffer_member()
3636- gallium: drop unused macros in draw_llvm.h
3637- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_nir_aos.c
3638- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_nir_soa.c
3639- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_jit_types.c
3640- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_format_s3tc.c
3641- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_struct.c
3642- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_format_soa.c
3643- gallium/draw: fix LLVM opaque pointer warnings in store_clip()
3644- gallium/draw: fix LLVM opaque pointer warnings in generate_clipmask()
3645- gallium/draw: fix LLVM opaque pointer warnings in draw_gs_llvm_fetch_input()
3646- gallium/draw: fix LLVM opaque pointer warnings in draw_gs_llvm_end_primitive()
3647- gallium/draw: fix LLVM opaque pointer warnings in draw_gs_llvm_epilogue()
3648- gallium/draw: fix LLVM opaque pointer warnings in draw_tcs_llvm_emit_fetch_input()
3649- gallium/draw: fix LLVM opaque pointer warnings in draw_tcs_llvm_emit_fetch_output()
3650- gallium/draw: fix LLVM opaque pointer warnings in draw_tcs_llvm_emit_store_output()
3651- gallium/draw: fix LLVM opaque pointer warnings in store_aos()
3652- gallivm: LLVM opaque pointer fixes in lp_bld_sample.c
3653- gallivm: LLVM opaque pointers: add lp_build_array_get[_ptr]2 in lp_bld_struct.c
3654- gallivm: LLVM opaque pointers: use lp_build_array_get_ptr2() in lp_bld_nir_soa.c
3655
3656Mike Blumenkrantz (443):
3657
3658- zink: handle mutable swapchain images with dmabuf
3659- zink: store VkFormatFeatureFlags on creation
3660- zink: use stored format features for blit functionality
3661- zink: use modifier feature flags during surface creation when necessary
3662- zink: init cache_put program fence on program creation
3663- zink: always set vertex dynamic states
3664- zink: hook up pipeline_library extensions
3665- zink: change zink_create_gfx_program signature
3666- zink: break out program primtype->idx conversion function
3667- zink: reorganize zink_gfx_pipeline_state a little
3668- zink: set program pipeline array idx earlier in zink_get_gfx_pipeline
3669- zink: add a graphics pipeline library implementation
3670- tgsi_to_nir: fix clipdistance store writemask
3671- tgsi_to_nir: handle compact arrays for clipdistance
3672- zink: force flush clears on fb change if fb geometry changes
3673- zink: force a new framebuffer for clear_depth_stencil if the clear region is big
3674- zink: use maintenance4
3675- zink: allow sw loading with D3D_ALWAYS_SOFTWARE for nine
3676- zink: drop mode_changed check from linewidth/depthbias draw updates
3677- zink: handle unscaled depth bias from nine
3678- zink: handle !half_pixel_center
3679- zink: fix return for PIPE_CAP_DEPTH_CLIP_DISABLE
3680- zink: explicitly set nir cursor in rewrite_and_discard_read
3681- zink: handle invalid gl_TexCoord reads
3682- zink: rewrite all undefined shader reads as 0001 instead of undef
3683- zink: export PIPE_CAP_MULTISAMPLE_Z_RESOLVE
3684- zink: support PIPE_QUERY_TIMESTAMP_DISJOINT
3685- nine: check return on resource_get_handle
3686- zink: add u_queue.h to kopper header
3687- zink: demote dmabuf tiling to linear if modifiers aren't supported
3688- zink: use modifier_aspect to check for modifier plane in zink_resource_get_param
3689- zink: don't add modifiers if EXT_image_drm_format_modifier isn't present
3690- zink: add all format modifiers when adding for dmabuf export
3691- zink: don't fixup sparse texops
3692- zink: remove an old RGBA4 nvidia workaround
3693- vulkan: Update the XML and headers to 1.3.224
3694- radv: fix return type for meta resolve shaders
3695- nir/validate: clamp unsized tex dests to 32bit
3696- nir/validate: add some (light) validation for sampler type matching
3697- radv: remove stupid conditional from draw path
3698- radv: force inlining on another draw function
3699- mesa: fix blending when using luminance/intensity emulation
3700- mesa: require render target bind for A/L/I in format selection
3701- mesa/st: add ignore_srgb_decode param to st_convert_sampler
3702- mesa/st: plumb glsl130_or_later through sampler creation
3703- mesa/st: use plumbed value for srgb_skip_decode in sampler conversion
3704- gallium: convert PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE to enum
3705- gallium: add PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_ALPHA_NOT_W
3706- util/format: handle SRGB in util_format_luminance_to_red()
3707- zink: stop modifying samplerview swizzle on base struct
3708- zink: add some format functions for emulating alpha formats
3709- zink: reject native blits for emulated alpha formats
3710- zink: handle emulated alpha format clears
3711- zink: support emulating alpha formats using new border color quirk
3712- zink: add a driver workaround for broken L4A4 on nvidia
3713- mesa: track format swizzles on teximage objects
3714- mesa/st: use pre-calculated format swizzle for samplerviews
3715- mesa: track gl_texture_object swizzle internally
3716- mesa/st: use tracked samplerview swizzle values
3717- mesa/st: simplify sampler border color swizzling
3718- zink: remove cached descriptor mode
3719- zink: remove checks for lazy descriptor mode + templates ext
3720- zink: remove descriptor vtable
3721- zink: delete a lot of descriptor code
3722- zink: move lazy descriptor functions into zink_descriptors.c
3723- zink: delete unused struct types
3724- zink: move all types/enums into zink_types.h
3725- zink: move lazy descriptor data types into zink_types.h
3726- zink: delete some unused descriptor struct members
3727- zink: remove descriptor dummy_set
3728- zink: merge context descriptor data structs
3729- zink: merge batch descriptor data structs
3730- zink: merge context descriptor data onto context struct
3731- zink: merge program descriptor data onto program struct
3732- zink: merge batch descriptor data onto batch state
3733- zink: reorder some descriptor structs
3734- zink: rename _lazy descriptor stuff
3735- zink: use statically sized array for descriptor allocation
3736- zink: ZINK_SHADER_COUNT -> ZINK_GFX_SHADER_COUNT
3737- zink: explicitly define ZINK_GFX_SHADER_COUNT
3738- zink: PIPE_SHADER_TYPES -> MESA_SHADER_STAGES
3739- zink: remove all pipe_shader_type usage
3740- zink: delete zink_shader_stage()
3741- zink: fix program cache comparisons
3742- zink: require EXT_non_seamless_cube_map for GPL support
3743- zink: (correctly) require extendedDynamicState2PatchControlPoints for GPL
3744- zink: use a maybe more accurate wild guess for pcp-less gpl
3745- zink: require VK_KHR_descriptor_update_template
3746- zink: add an id member for zink_descriptor_pool_key
3747- zink: use a dynarray instead of hash table for tracking pools
3748- zink: use a single allocation for zink_descriptor_layout_key
3749- zink: rework descriptor pool overflow
3750- nir: add uses_bindless flag for shader_info
3751- zink: bail out of dmabuf resource creation when srgb explosion is expected
3752- zink: iterate over all modifiers and nuke the ones that aren't supported
3753- zink: try to reuse swapchain modifier for dmabuf export
3754- ci: remove broken device-select-layer from build
3755- vk/render_pass: don't deref null resolve attachments
3756- Revert "zink: try to reuse swapchain modifier for dmabuf export"
3757- Revert "zink: add all format modifiers when adding for dmabuf export"
3758- zink: deduplicate some program creation code
3759- zink: deduplicate some program destroy code
3760- zink: move gfx program update/creation into zink_program.c
3761- zink: remove cpu conditional render hack
3762- zink: add c++ guards to zink_pipeline.h
3763- zink: move pipeline cache structs to zink_program.h
3764- zink: move gfx pipeline recalc to c++
3765- zink: move pipeline state comparison to c++ template
3766- zink: use template to eliminate more instructions in pipeline state comparison
3767- zink: remove some gfx pipeline state members
3768- zink: fix shader key struct packing
3769- zink: rework bit iteration in update_gfx_shader_modules to use for loop
3770- zink: simplify shader key comparison by splitting out non-generated tcs
3771- zink: plumb some screen info through gfx shader creation
3772- zink: plumb screen info through shader key comparison
3773- zink: split out nonseamless/inline shader module info calculation
3774- zink: split out gfx shader module creation from cache searching
3775- zink: split out first-time shader module creation
3776- zink: use a list for free batch states
3777- zink: add templates for pipeline stage comparison
3778- zink: use stage comparison template to ignore vertices_per_patch
3779- zink: cache dynamic render vs renderpass pipelines separately
3780- zink: make program cache values more intelligible
3781- zink: change u_foreach_bit to regular for loop in zink_gfx_program_update
3782- zink: use dynarray instead of list for internal shader cache
3783- zink: pass shader stage as param for gfx module update
3784- zink: constify shader module variable
3785- zink: make zink_bo.h compile in c++
3786- zink: add more c++ guards to headers
3787- zink: remove c++ guard from zink_resource.h
3788- zink: move zink_batch_resource_usage_set to be static inline
3789- zink: add is_buffer param to zink_batch_resource_usage_set
3790- zink: use PIPE_CAP_RGB_OVERRIDE_DST_ALPHA_BLEND
3791- zink: merge all the batch state sets onto the struct
3792- zink: make batch internal api static again
3793- zink: split resource tracking into two sets for buffer/texture
3794- zink: track which stages in gfx programs are using inlined uniforms
3795- zink: add gfx pipeline lookup shortcut
3796- st_pbo/compute: handle download failures with fallback
3797- st_pbo/compute: fix 1D_ARRAY offsets
3798- st_pbo/compute: fix 1D coord dimension by pre-trimming vectors
3799- st_pbo/compute: pre-clamp shader geometry to 16bit
3800- st_pbo/compute: pre-clamp loaded geometry based on coord components
3801- zink: handle nir_intrinsic_sparse_residency_code_and mechanics
3802- zink: ignore nir_texop_lod for tex dest matching
3803- vulkan: Update the XML and headers to 1.3.225
3804- zink: support PIPE_CAP_FBFETCH_COHERENT
3805- tu: fix invalid free on alloc failure
3806- lavapipe: stop tracking descriptor image layout
3807- lavapipe: use cso for compute samplers too
3808- lavapipe: remove lvp_sampler::state
3809- lavapipe: replace lvp_sampler internals with pipe_sampler_state
3810- lavapipe: use pipe_sampler_state directly in descriptor info
3811- lavapipe: delete lvp_buffer::offset
3812- lavapipe: use gallium buffer descriptor types directly
3813- lavapipe: apply VK_WHOLE_SIZE to bufferviews at creation
3814- lavapipe: create gallium descriptor image/sampler view types for views
3815- lavapipe: delete some code
3816- zink: use screen param for zink_descriptor_program_deinit
3817- zink: allow programs to solely manage descriptor deinit
3818- zink: create compute programs from compute shaders directly
3819- zink: initialize cache_get_thread unconditionally
3820- zink: add use_local_size flag to compute programs
3821- zink: don't call util_queue_fence_init in zink_screen_get_pipeline_cache()
3822- zink: add param to allow cache_get jobs to run directly
3823- zink: add a pipeline shortcut for basic compute programs
3824- zink: merge compute program hash table onto program struct
3825- zink: don't add compute base pipeline to hash table
3826- zink: remove use_local_size from compute pipeline state
3827- zink: stop zeroing local size if current compute doesn't use it
3828- zink: use different key comparison for local_size-using compute pipelines
3829- zink: track whether shader has cube samplers
3830- zink: move compute pipeline cache update to caller
3831- zink: add a param to allow zink_screen_update_pipeline_cache to run directly
3832- zink: move compute descriptor binds down in execution
3833- zink: add precompilation for compute shaders
3834- zink: add partial async shader compile support
3835- zink: add locking for descriptor layout caches
3836- zink: add a context pointer to zink_program struct
3837- zink: init fbfetch/bindless ctx stuff on shader create
3838- zink: generate sha1 for compute programs
3839- zink: add function to check whether a shader has cube samplers
3840- zink: defer all compute shader creation
3841- zink: don't emit entrypoints for function temp variables
3842- zink: check the variable mode before taking samplemask path in ntv
3843- zink: explicitly use unsigned types for bit shifts
3844- zink: store VkImageViewUsageCreateInfo for surface creation
3845- zink: only add srgb mutable for images with modifiers
3846- zink: avoid needing mutable for z/s sampling
3847- zink: don't set MUTABLE by default
3848- zink: don't emit illegal interpolation
3849- vulkan: initialize a variable
3850- llvmpipe: don't assume pipe_context is always available in flush_frontbuffer
3851- mesa: add GLSL_SOURCE
3852- zink: add spirv builder function for demote
3853- zink: hook up demote extension
3854- zink: fix sharedmem ops with bit_size!=32
3855- zink: flag all assigned output slots as mapped
3856- zink: add a ci skip for anv
3857- mesa/st: split out builtin shader finish
3858- st_pbo/compute: use split shader finish funcs
3859- mesa/st: move compute pbo shutdown to compute pbo file
3860- gallium: add pipe_screen::driver_thread_add_job
3861- st_pbo/compute: use new shader interface to perform async shader creation
3862- mesa/st: add specialized pbo download shaders
3863- mesa/st: add MESA_COMPUTE_PBO env var
3864- zink: implement async nir creation
3865- lavapipe: VK_EXT_attachment_feedback_loop_layout
3866- lavapipe: ARM/EXT_rasterization_order_attachment_access
3867- docs: add more features
3868- radv: avoid bottlenecking on sequential sparse buffer binds
3869- zink: handle split acquire/present
3870- zink: uncap ssbo size
3871- zink: remove special-casing for 64bit runtime array emission
3872- zink: always set var used by get_ssbo_size to the 32bit var
3873- zink: simplify ntv shader descriptor emission
3874- zink: emit Aliased decoration for aliased bo descriptors
3875- zink: fix atomic ssbo indexing with non-32bit values
3876- zink: handle 64bit float atomics
3877- zink: export PIPE_CAP_IMAGE_ATOMIC_FLOAT_ADD
3878- zink: export PIPE_CAP_SHADER_ATOMIC_INT64
3879- zink: don't add void clears if a full clear already exists
3880- zink: split up get_clear_data()
3881- zink: make void clears more robust
3882- zink: rewrite clears on fb bind if only the format has changed
3883- zink: add have_vulkan13 to device info
3884- zink: add functions for using '2' variants of pipeline barriers
3885- zink: add screen interfaces for pipeline barriers
3886- zink: use screen interfaces for pipeline barriers
3887- zink: fix/relax resolve geometry check
3888- zink: defer acquire semaphore destruction
3889- zink: fix atomic ssbo fadd offsets
3890- st/bitmap: use normalized coords in samplers with lowered rects
3891- st/drawpixels: use normalized coords in samplers with lowered rects
3892- zink: disable GPL when sample shading is in use
3893- anv: force inline more pipe flush functions
3894- ci: add a panfrost flake
3895- ci: add nouveau flakes
3896- egl/glx: add fallback for zink loading
3897- zink: don't call CmdBindVertexBuffers2EXT with no attributes
3898- zink: don't always set VK_DYNAMIC_STATE_VERTEX_INPUT_BINDING_STRIDE_EXT with gpl
3899- zink: clamp z/s formats to aspected format for border colors
3900- zink: move color channel clamping to zink_format.c
3901- zink: add border color clamping when customBorderColorWithoutFormat isn't supported
3902- zink: add srgb border color clamping
3903- zink: add stencil border color clamping
3904- zink: check format compatibility up front when creating framebuffer surface
3905- zink: rework flush_frontbuffer to always flush
3906- zink: defer more semaphore destruction
3907- zink: rework batch tracking for resources
3908- zink bump batch state limit a bit
3909- zink: split out bind_sampler_states hook for drivers without EXT_non_seamless_cube_map
3910- zink: remove batch tracking/usage from view types
3911- zink: remove batch usage from sampler states
3912- zink: delete more sw conditional render handling
3913- zink: delete some descriptor caching cruft
3914- glsl: fix gl_CullDistance xfb linking
3915- zink: handle culldistance xfb outputs like clipdistance
3916- zink: track a mask of gfx stages remaining in gfx programs
3917- zink: track the owner context for gfx programs
3918- zink: use screen params for program reference functions
3919- zink: don't try to delete programs from the cache if stages have been modified
3920- zink: add locking for program caches
3921- zink: use program's context for cache deletion
3922- zink: use screen param for zink_shader_free()
3923- zink: PIPE_CAP_SHAREABLE_SHADERS
3924- radv: ALWAYS_INLINE radv_after_draw
3925- radv: ALWAYS_INLINE radv_flush_descriptors
3926- radv: ALWAYS_INLINE radv_is_streamout_enabled()
3927- radv: store pointer to sgprs for last vertex stage
3928- radv: use direct access to last_vgt_api_stage_locs for sgpr emission
3929- zink: add a bool flag for decomposed vertex elements
3930- zink: check on init whether decomposed attrs are needed
3931- zink: wrap zink_shader_key_base access for nonseamless cubemap use
3932- zink: remove a bunch of casts from shader key inlines
3933- zink: handle inline uniforms during first-time shader generation
3934- zink: copy the non-dirty shader stages when updating gfx program
3935- zink: move to AoS for gfx program shader modules
3936- zink: ALWAYS_INLINE bind_gfx_stage
3937- zink: add some padding to small shader keys for byte-alignment
3938- zink: add 'optimal_keys' handling for shader keys
3939- zink: use optimal_keys to massively reduce size of pipeline cache comparisons
3940- zink: split dirty_shader_stages for gfx and compute
3941- zink: simplify bit tracking in zink_gfx_program_update()
3942- zink: be even more granular with optimal_key program updates
3943- zink: unspaghettify some program update code
3944- lavapipe: set writemask for compute shader buffers
3945- lavapipe: propagate shader access info across libraries
3946- zink: copy optimal key always
3947- zink: add asserts to verify optimal key state
3948- zink: fix some batch tracking leaks
3949- zink: free vertex_state_cache on shutdown
3950- zink: always unset var->data.explicit_xfb_buffer
3951- zink: fix sparse queue creation
3952- zink: fix handling for ssbos that are just runtime arrays
3953- zink: name bo variables using bitsize
3954- zink: avoid overflow when clamping bufferviews
3955- zink: allow reordered clear_buffer calls
3956- zink: remove color_write_missing driver workaround
3957- zink: only emit runtime array for ssbos if present
3958- mesa: set normalized_coords for bindless texture buffer samplers
3959- zink: enable VK_ATTACHMENT_LOAD_OP_DONT_CARE for zs in renderpasses
3960- zink: remove zink_context::new_swapchain
3961- zink: rename fb_binds -> fb_bind_count
3962- zink: reorder zink_resource a little
3963- zink: add a mask of fb attachment idx for resources
3964- zink: use feedback loop layout to correctly handle implicit feedback loops
3965- zink: always set VK_PIPELINE_CREATE_COLOR_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT sometimes
3966- nir/opt_undef: add a pass to clean up 64bit undefs
3967- zink: don't flatten 64bit arrays during rewrite
3968- zink: always run optimize_nir after lower_64bit_vars
3969- zink: split up lower_64bit_vars pass
3970- zink: also lower 64bit function temps
3971- aux/trace: dump line_rectangular member of rasterizer state
3972- aux/trace: dump depth_clamp for rasterizer state
3973- zink: always unflag unordered_write when binding image descriptors
3974- zink: set layouts before possibly reordering image copies
3975- zink: move zink_screen_get_pipeline_cache() calls out to callers
3976- zink: rework ZINK_PIPELINE_LIBRARY_FORCE into ZINK_DEBUG flag
3977- zink: move create_pipeline_lib to zink_program.c
3978- zink: hook up VK_EXT_depth_clamp_zero_one
3979- zink: pass actual screen->threaded to zink_tc_context_unwrap()
3980- vulkan: Update the XML and headers to 1.3.230
3981- lavapipe: dynamic state3
3982- zink: add a draw barrier when unbinding a sampler-bound fb surface
3983- zink: replace mixed_zs with zs feedback loops
3984- glthread: use signed vertex buffer offsets when available, don't require them
3985- glthread: handle DeleteBuffers(n=-1) gracefully
3986- zink: fix/improve handling for multi-component bitfield ops
3987- zink: export PIPE_CAP_ALLOW_MAPPED_BUFFERS_DURING_EXECUTION
3988- zink: export PIPE_CAP_MAP_UNSYNCHRONIZED_THREAD_SAFE
3989- pipe-loader: fix zink driinfo header path
3990- zink: enable glthread by default
3991- zink: call flush_resource when presenting garbage
3992- lavapipe: improve blend handling on pipeline binds
3993- zink: use dynamic state3 rasterization states when available
3994- zink: use more dynamic state3 when available
3995- zink: add tgl ci skip
3996- zink: don't access prog->shaders array during pipeline compile
3997- zink: compress zink_gfx_output_key members
3998- zink: further compress zink_gfx_output_key lookups
3999- zink: set VK_PIPELINE_CREATE_RETAIN_LINK_TIME_OPTIMIZATION_INFO_BIT_EXT
4000- zink: require optimal_keys for GPL
4001- zink: add param to disable optimization when combining pipeline libraries
4002- zink: move binding_map to hw_state part of vertex state
4003- zink: reorder zink_destroy_gfx_program()
4004- zink: don't deref prog->shaders during pipeline construction
4005- zink: use an extra pipeline state bit to track coherent fbfetch usage for gpl outputs
4006- zink: reenable GPL
4007- zink: define zink_pipeline_dynamic_state3 as zink_rasterizer_hw_state
4008- zink: set depth clamp
4009- remove xvmc
4010- delete rbug
4011- delete graw tests
4012- vk/graphics_state: don't set default sample locations if rast samples is dynamic
4013- zink: allow creating full pipelines unoptimized
4014- zink: gfx_pipeline_cache_entry -> zink_gfx_pipeline_cache_entry
4015- zink: change zink_gfx_program::last_pipeline -> struct zink_gfx_pipeline_cache_entry
4016- zink: implement async gfx precompile
4017- zink: add ZINK_DEBUG=shaderdb
4018- zink: move descriptor layout/pool stuff to screen object
4019- zink: add a safety assert in descriptor updates
4020- zink: add some bda handling
4021- zink: handle global and scratch vars
4022- zink: move descriptor function up in file
4023- zink: handle oom better during descriptor updating
4024- zink: fix pool overflow handling on batch reset
4025- zink: rework/improve descriptor pool overflow handling on batch reset
4026- zink: delete view objects when unsetting resource usage in batch reset
4027- zink: prevent ballooning of view object memory
4028- lavapipe: store compiler options to physical device
4029- zink: check core feature for pipeline cache control
4030- zink: don't add other usage bits for transient images
4031- zink: clamp line_stipple_factor to 1 if stipple is disabled
4032- zink: unset rp_changed after initializing renderpass attachments
4033- zink: disable fbfetch when flushing clears
4034- glthread: fix buffer allocation size with non-signed buffer offset path
4035- zink: enable tc cpu storage on some buffers
4036- zink: clamp feedback loop layout to GENERAL if the ext isn't supported
4037- zink: delete some big struct members from zink_context
4038- zink: reorder context init to handle COMPUTE_ONLY
4039- zink: clamp shader bindings for compute descriptors
4040- zink: add ci flake
4041- zink: move spec constant emission to the types/consts block
4042- zink: set default spec constant value to 1
4043- zink: add docs for zink_batch.c
4044- zink: delete/cleanup some descriptor #defines
4045- zink: delete unused descriptor struct field
4046- zink: start on some type docs
4047- zink: fix duplicate VK_DYNAMIC_STATE_LINE_STIPPLE_EXT setting
4048- zink: fix ds3 line stipple enable handling
4049- driconf: add mesa_glthread=false for gfxbench
4050- lavapipe: store binding offset for buffers
4051- lavapipe: fix BufferDeviceAddress usage
4052- zink: fix gpl compile optimization
4053- zink: decrement ZINK_FBFETCH_BINDING
4054- zink: don't flag ubo0 as used in shaders with indirect ubo access
4055- zink: allow direct memory mapping for any COHERENT+CACHED buffer
4056- zink: allocate/place memory using memoryTypeIndex directly
4057- zink: support multiple heaps per memory type
4058- zink: delete unused zink descriptor mode
4059- zink: rename some variables in descriptor code
4060- zink: ZINK_DESCRIPTOR_TYPES -> ZINK_DESCRIPTOR_BASE_TYPES
4061- zink: add more zink_descriptor_type enum values to improve readability
4062- zink: delete unused descriptor function
4063- zink: stop passing pointer to changed_sets to populate_sets()
4064- zink: add some descriptor docs
4065- zink: add some docs for c++ pipeline state functions
4066- zink: simplify some program descriptor helpers
4067- zink: split out uniform decriptor bindings in shader data
4068- zink: flag push state changed in batch changed conditional
4069- zink: rescope/simplify some push descriptor code
4070- zink: elide a buffer samplerview update conditional
4071- zink: handle broken resource mapping deadlocks
4072- zink: simplify swapchain imageview handling
4073- zink: delete stale comment for zink_surface
4074- zink: simplify conditional for surface rebind no-ops
4075- zink: use zink_resource_object::views to defer deferred storage view deletion
4076- zink: delete unused zink_surface member
4077- zink: add some breadcrumbs for VK_EXT_multisampled_render_to_single_sampled
4078- zink: add docs for zink_surface
4079- zink: match bitsizes in bo rewriting
4080- zink: pass KERNEL shaders through successfully
4081- zink: rework sampler emission
4082- zink: pass image type to image emission
4083- zink: add a nir pass for CL image typing and sampler tracking
4084- zink: simplify image deref handling
4085- zink: add some spirv builder handling for sampled image ops
4086- zink: emit explicit samplers in ntv based on sampler_mask
4087- zink: always take the compact descriptor size path with kernel shaders
4088- zink: add handling on the vk side for discrete sampler descriptors
4089- zink: add handling for CL-style discrete shader samplers
4090- zink: rip out unused kernel push constant
4091- zink: fix weird formatting
4092- zink: don't force attachment usage for images when unsupported
4093- zink: always add MUTABLE with EXTENDED_USAGE for image creation
4094- zink: don't double-deref bindless texture arrays in shaders
4095- util/tc: split out flush and deferred flush calls
4096- util/tc: split out dsa and fs state cso handling
4097- util/tc: implement renderpass tracking
4098- zink: add a context flag to indicate when blitter is running
4099- zink: use tc renderpass tracking to optimize renderpasses
4100- zink: enable renderpass optimizing for turnip jobs
4101
4102Ming Li (1):
4103
4104- docs: Add d3d10umd and lavapipe to the doc for the gallium frontends.
4105
4106Mykhailo Skorokhodov (8):
4107
4108- util: Add glsl2spirv.py script
4109- nir: Make lower_double_ops recognize SPIR-V mangling
4110- nir: Add assert in nir_lower_doubles
4111- glsl/meson: Add variable to export float64.glsl
4112- anv/meson: Add float64_spv_h custom target
4113- anv: Add softtp64 workaround
4114- drirc: Add fp64_workaround_enabled option
4115- drirc: Apply fp64_workaround_enabled to DOOM Eternal
4116
4117Nanley Chery (8):
4118
4119- mesa/st: Set PIPE_MAP_DISCARD_RANGE in fallback_copy_image
4120- mesa/st: Add and use st_texture_image_insert_transfer
4121- mesa/st: Add and use st_texture_image_transfer::box
4122- mesa/st: Don't map compressed texture fallbacks for reads
4123- mesa/st: Fix indentation in st_MapTextureImage
4124- mesa/st: Drop st_texture_image_transfer::map
4125- iris: Enable INTEL_MEASURE for compute dispatches on XeHP
4126- iris: Reduce use of RHWO optimization (Wa_1508744258)
4127
4128Nicholas Bishop (2):
4129
4130- crocus: Advertise PIPE_CAP_NATIVE_FENCE_FD
4131- crocus: fix off-by-one error when clearing stale syncobjs
4132
4133Oleksii Bozhenko (2):
4134
4135- vulkan/wsi: Pass wsi_image_create_info into anv_GetPhysicalDeviceImageFormatProperties2
4136- anv: Allow aliasing with modifiers for WSI images
4137
4138Omar Akkila (5):
4139
4140- ci: Merge common builds in debian test jobs
4141- ci: Move crosvm to the base test image
4142- ci: Build crosvm with opaque fd mapping support
4143- ci: Enable virtio-experimental in testing
4144- ci: Add vulkan dEQP tests for venus
4145
4146Paul Gofman (1):
4147
4148- vbo/dlist: keep buffers used in loopback_vertex_list() mapped.
4149
4150Pavel Ondračka (22):
4151
4152- r300: fix variables detection for paired ALU and TEX instructions in different branches
4153- r300: allow constant swizzles with inline constant
4154- r300: fix negate mask computation when merging movs
4155- r300: check for identical saturate mode when merging MOVs
4156- r300: run dataflow optimizations in separate loops
4157- r300: generalize the merge_movs pass
4158- r300: don't merge w channel in fragment shaders
4159- r300: merge MOVs into ADD using the 0 swizzle
4160- r300: merge MOVs with MULs or ADDs in merge channels
4161- r300: Set more shadow sampler lowering in precompiles.
4162- r300: allow presubtract when both ADD sources are negative
4163- r300: run copy propagate once more after merging channels
4164- r300: merge together MOV and MAD instructions
4165- r300: add special path for merging movs with the same source
4166- r300: add some synchronization for KIL
4167- r300: fix register rewrite when converting rbg instructions to alpha
4168- r300: reduce CPU overhead in IF transformation pass
4169- Revert "r300: be less agresive with copy propagate in loops"
4170- r300: fix reader detection with breaks
4171- r300: be more careful when pair merging with presubtract
4172- r300: fix generation of invalid swizzles in regalloc
4173- r300: respect buffer offset in r300_set_constant_buffer
4174
4175Pedro J. Estébanez (5):
4176
4177- nir2dxil: Lower texture projections
4178- nir2dxil: Use native helper lane intrinsic on SM >= 6.6
4179- microsoft/spirv2dxil: Refactor to make SM and validator versions public
4180- microsoft/spirv2dxil: Change \`spirv_to_dxil()` to receive SM and validator versions
4181- microsoft/spirv2dxil: Support pack_32_2x16 and pack_32_4x8
4182
4183Pierre Moreau (11):
4184
4185- nv50,nvc0: Do not resize global residents if unnecessary
4186- nv50: Rename fixups to relocs
4187- nv50: Rename interps to fixups
4188- nv50: Report actual VRAM size
4189- nv50: Disallow allocating more than VRAM size
4190- nv50: Mark RESOURCE_FROM_USER_MEMORY_COMPUTE_ONLY as unsupported
4191- nv50/ir: Handle non-32-bit values when cst folding SPLIT
4192- nv50/peephole: Disallow combining sub 4-byte ld/st for now
4193- nv50/ra: Fix the offset computation for compounds
4194- nv50/ir: Avoid generating splits of splits
4195- nv50/nir: A group barrier is CTA-level not global-level
4196
4197Pierre-Eric Pelloux-Prayer (72):
4198
4199- loader: don't return empty string in loader_get_dri_config_device_id
4200- loader: allow DRI_PRIME=vendor_id:device_id syntax
4201- vulkan/device_select: print the dri_prime warning only if needed
4202- vulkan/device_select: allow DRI_PRIME=vendor_id:device_id
4203- docs: document DRI_PRIME
4204- gallium/dri2: make protected buffer checks opt-in
4205- amdgpu/bo: update uses_secure_bos when importing buffers
4206- radeonsi: allocate BIND_RENDER_TARGET as tmz instead of SCANOUT
4207- radeonsi: print radeon_bo_flag when AMD_DEBUG=vm is used
4208- nir: add a nir_opt_if_options enum
4209- radeonsi/tests: remove left-over debug print
4210- radeonsi/tests: add testing of the khr-single tests
4211- radeonsi/tests: update raven expected results
4212- radeonsi/tests: update navi21 expected results
4213- radeonsi/tests: cosmetic changes
4214- radeonsi: prevent u_blitter recursion in si_update_ps_colorbuf0_slot
4215- radeonsi: use LOAD_CONTEXT_REG_INDEX for VGT_STRMOUT_DRAW_OPAQUE
4216- ac/llvm: add a return value to ac_nir_translate
4217- radeonsi: deal with ac_nir_translate failures
4218- radv: deal with ac_nir_translate failures
4219- radeonsi: use nir_opt_large_constants earlier
4220- radeonsi/sqtt: set stable pstate if possible
4221- mesa: avoid reading back textures from VRAM
4222- mesa: simplify _mesa_texstore_z24_s8 and s8_z24
4223- mesa: remove fallback for GL_DEPTH_STENCIL
4224- radeonsi: invalidate L2 when using dcc stores
4225- egl/wayland: fix glthread crashes
4226- winsys/amdgpu: init 'r' before using it
4227- radeonsi/gfx11: don't set VERTS_PER_SUBGRP to 0
4228- tc: do a GPU->CPU copy to initialize cpu_storage
4229- tc: don't use CPU storage for glBufferData
4230- ac/llvm: remove unused vars hash table
4231- radeonsi: remove USE_LDS_SYMBOLS
4232- radeonsi: use nir_lower_explicit_io to get rid of mem_shared deref
4233- ac/llvm: drop visit_deref
4234- ac/llvm: simplify get_memory_ptr
4235- ac/llvm: add helpers to get pointer types of ac_arg
4236- ac/llvm: remove LLVMBuildGEP usages
4237- ac/llvm: add a ac_llvm_pointer type
4238- ac/llvm: port functions to use ac_llvm_pointer
4239- ac/llvm: switch ac_build_gep_ptr to LLVMBuildGEP2
4240- ac/llvm: switch ac_build_pointer_add to LLVMBuildGEP2
4241- ac/llvm: add type param to ac_build_load_invariant
4242- ac/llvm: add xxxx2 functions to iteratively port to opaque pointers
4243- ac/llvm: store lds as ac_llvm_pointer
4244- ac/llvm: store constant_data and scratch as ac_llvm_pointer
4245- ac/llvm: remove unneeded casts
4246- radv/llvm: use ac_build_gep0_type to get args types
4247- radeonsi: use ac_get_arg_pointee_type to get the right type
4248- radeonsi: use LLVMBuildGEP2 in si_build_gep_i8_var
4249- radeonsi: make ngg_gs_get_vertex_storage return a ac_llvm_pointer
4250- radeonsi: port the remaining code to opaque pointers
4251- ac/llvm: remove gep_2 and others temporary functions
4252- radv/llvm: replace LLVMBuildLoad by LLVMBuildLoad2
4253- ac/llvm: add ac_get_ptr_arg
4254- ac/llvm: pass ac_llvm_ptr instead of separate type/ptr
4255- Revert "ac: use LLVMContextSetOpaquePointers if available"
4256- radeonsi: zero init using the correct components count
4257- radeonsi/sqtt: notify sqtt when we skip a flush
4258- radeonsi/sqtt: re-export shaders in a single bo
4259- radeonsi/sqtt: allow to disable spm counters
4260- radeonsi/tests: handle flakes file for all test suite
4261- radeonsi/tests: simplify the script
4262- radeonsi/tests: update expected results for navi21
4263- radeonsi/tests: update expected results for navi10
4264- radeonsi/tests: update expected results for polaris11
4265- radeonsi: simplify si_prefetch_shaders
4266- radeonsi/sqtt: simplify condition to determine if sqtt is on
4267- radeonsi: store the shader gpu adress in si_shader
4268- radeonsi: handle sqtt pipeline in shader prefetch
4269- radeonsi/gfx11: enable sdma copy DRI_PRIME
4270- radeonsi: update db_eqaa even if msaa is disabled
4271
4272PixelyIon (1):
4273
4274- tu: Make KGSL wait IOCTLs interrupt-safe
4275
4276Qiang Yu (62):
4277
4278- nir/lower_gs_intrinsics: fix primitive count for points
4279- radeonsi: fix tcs_out_lds_offsets arg alignment
4280- winsys/amdgpu: fix non-page-aligned sparse buffer creation
4281- nir,ac/llvm: add nir_intrinsic_load_half_line_width_amd
4282- nir,ac/llvm: add nir_intrinsic_load_viewport_xy_scale_and_offset
4283- ac/nir/ngg,radv: use nir_load_viewport_xy_scale_and_offset
4284- ac/nir/cull: support caller react when primitive is rejected
4285- ac/nir/ngg: support line culling
4286- ac/nir/ngg: fix and simplify gs store output lower
4287- ac/nir/ngg: use same driver location for gs output
4288- ac/nir/ngg: save and restore output bit size for gs
4289- ac/nir/ngg: save and restore position output base for nogs
4290- ac/nir/ngg: add gs culling
4291- ac/nir/ngg: support component position store
4292- ac/nir/ngg: support clipdist culling
4293- ac/llvm: cast tes_u/v_replaced to float
4294- radeonsi: implement nir_intrinsic_load_half_line_width_amd
4295- radeonsi: implement nir_intrinsic_load_viewport_xy_scale_and_offset
4296- radeonsi: implement cull nir intrinsics
4297- ac/llvm: move culling replaced registers to abi
4298- radeonsi: use replaced values when present
4299- radeonsi: vs load input re-calculate vertex index after culling
4300- radeonsi: add and implement load_user_clip_plane abi
4301- nir: fix nir_xfb_info buffer_to_stream length
4302- nir: add nir_intrinsic_load_num_vertices_per_primitive_amd
4303- nir: add nir_intrinsic_load_streamout_buffer_amd
4304- nir,ac/llvm: add nir_intrinsic_load_ordered_id_amd
4305- nir: add nir_intrinsic_ordered_xfb_counter_add_amd
4306- ac/llvm: implement nir_intrinsic_ordered_xfb_counter_add_amd
4307- ac/nir/ngg: cleanup prim id to prepare for streamout
4308- ac/nir/ngg: nogs support streamout
4309- ac/nir/ngg: ngg_gs_load_out_vtx_primflag support stream
4310- ac/nir/ngg: support multi stream per output slot for gs
4311- ac/nir/ngg: support gs streamout
4312- radeonsi: implement nir_intrinsic_load_num_vertices_per_primitive_amd
4313- radeonsi: implement nir_intrinsic_load_streamout_buffer_amd
4314- radeonsi: print out remove_streamout shader key
4315- nir,ac/llvm,radv: add nir_intrinsic_load_provoking_vtx_in_prim_amd
4316- ac/nir/ngg: use nir_load_provoking_vtx_in_prim_amd in ngg lower
4317- radv: count gen_prims_queries_enabled
4318- radv: split active_pipeline_gds_queries
4319- nir,ac/nir/ngg,radv: split shader_query_enabled_amd
4320- ac/nir/ngg,radv: use different counters for shader queries
4321- nir,ac,radv: add primitive count add intrinsics
4322- nir: add nir_load_prim_xfb_query_enabled_amd
4323- ac/nir/ngg: add query param to ac_nir_lower_ngg_gs
4324- ac/nir/ngg: add streamout emitted primitive query
4325- radeonsi: implement llvm abi atomic_add_prim_count
4326- radeonsi: implement nir shader query enabled intrinsics
4327- ac/llvm: get back intrinsics used by NGG
4328- ac/nir/ngg: support user edge flags for ngg lower
4329- ac/nir/ngg: pass primitive_id_location as param for nogs lower
4330- nir: add two amd ngg lds base load intrinsics
4331- ac/nir/ngg,radv: move LDS layout calculation out of nir ngg lowering
4332- ac/nir/ngg: allow passthrough with vs primitive id output
4333- ac/nir/ngg: save and restore no_varying/no_sysval_output
4334- ac/nir/ngg,ac/llvm,aco: save nogs ngg culling one lds dword
4335- ac/nir/ngg: add one odd dword to nogs culling pervertex lds
4336- ac/nir/ngg: fix nogs culling with nuw add
4337- ac/nir/ngg: remove nuw for negative value add
4338- ac/llvm: fix gfx11 fs input load for 16bit varying
4339- ac/llvm: fix 16bit varying llvm compile error
4340
4341Rajnesh Kanwal (48):
4342
4343- pvr: Implement vkResetDescriptorPool API.
4344- pvr: Fix clang-format issue.
4345- pvr: Implement vkGetPhysicalDeviceImageFormatProperties2 API.
4346- pvr: Implement vkCmdDrawIndirect API.
4347- pvr: Implement vkCmdDrawIndexedIndirect API.
4348- pvr: Fix multiple file descriptor leaks.
4349- pvr: Implement vkCreateEvent and vkDestroyEvent APIs.
4350- pvr: Implement vkCmdSetEvent2 API.
4351- pvr: Implement vkCmdResetEvent2 API.
4352- pvr: Implement vkCmdWaitEvents2 API.
4353- pvr: Implement vkSetEvent API.
4354- pvr: Implement vkResetEvent API.
4355- pvr: General reformatting and renaming some fields.
4356- pvr: Replace pvr_renderpass_surface_initop with VkAttachmentLoadOp.
4357- pvr: Add support for generating render pass hw setup data.
4358- pvr: Use ~0 for masks instead of ~0U.
4359- pvr: Add missing output register write in pvr_CreateRenderPass2.
4360- pvr: Rename userpass_spawn to isp_userpass.
4361- pvr: Add const to structs that are not supposed to be modified.
4362- pvr: Add attachment resolve support.
4363- pvr: Add support to create subpass load ops.
4364- pvr: Pass load-op struct instead of idx to support subpass load-ops.
4365- pvr: Implement vkCmdNextSubpass API.
4366- pvr: implement vkcmddispatchindirect api.
4367- pvr: Handle indirect buffer address in pvr_setup_descriptor_mappings.
4368- pvr: Convert attachment indexes to unsigned.
4369- pvr: Remove redundant instruction buffer check.
4370- pvr: Remove redundant assignment error.instruction.
4371- pvr: Remove framebuffer NULL check from pvr_CmdBeginRenderPass2.
4372- pvr: Remove dead code from pvr_perform_start_of_render_clears.
4373- pvr: Zero init pbe_cs_words to avoid passing on garbage.
4374- pvr: Fix Unintentional integer overflow (OVERFLOW_BEFORE_WIDEN) errors.
4375- pvr: Call VALGRIND_FREELIKE_BLOCK before unmapping to avoid use after free.
4376- pvr: Add assert to check for non-NULL binding_desc.
4377- pvr: Implement vkGetEventStatus API.
4378- pvr: Add pvr_csb_emit_link function to support secondary buffer links.
4379- pvr: Fix check in pvr_cmd_uses_deferred_cs_cmds.
4380- pvr: Add support for PVR_CMD_STREAM_TYPE_GRAPHICS_DEFERRED stream.
4381- pvr: Create deferred control stream for secondary command buffers.
4382- pvr: Reserve space for vdm state for deferred secondary buffer.
4383- pvr: Add support to copy a control stream to another control stream.
4384- pvr: Implement vkCmdExecuteCommands API.
4385- pvr: Update buffer type macro names for consistency.
4386- pvr: Fix heap type of availability_buffer allocation.
4387- pvr: Fix allocation size passed in pvr_cmd_buffer_alloc_mem.
4388- pvr: Remove double error reporting.
4389- pvr: Split pds compute shader create and upload code for reuse.
4390- pvr: Add vulkan shader factory headers for Query and clear APIs.
4391
4392Renato Pereyra (1):
4393
4394- venus: Increase vn_relax sleep time
4395
4396Rhys Perry (117):
4397
4398- aco: fix LdsBranchVmemWARHazard with 2+ branch chains
4399- aco: set has_VMEM,has_DS=false after a branch
4400- aco: only add vscnt wait when visiting VMEM/DS
4401- aco: improve VcmpxPermlaneHazard workaround
4402- aco: fix hash statistic
4403- docs/radv: advertise VK_KHR_global_priority
4404- aco: fix consecutive exec writes when finding exec_copy instruction
4405- aco: rename is_cmp to is_fp_cmp
4406- aco: fix assembly of vopc_sdwa writing exec
4407- aco: fix re-write of uses of exec_val's lo/hi half
4408- aco: test branch opcode if removing it in try_optimize_branching_sequence
4409- aco: remove val_and_copy_adjacent
4410- aco: improve vcc check for instructions between exec_val and exec_copy
4411- aco: test for one and_savexec opcode in try_optimize_branching_sequence
4412- aco: fix long-jump version of discard early exit
4413- radv: remove claimed support for sRGB vertex buffer formats
4414- radv: fix 16-bit support in radv_lower_vs_input
4415- aco: fix 16-bit VS inputs
4416- radv/llvm: fix packed VS inputs on GFX6/GFX10+
4417- aco: don't expand vec3 VS input load to vec4 on GFX6
4418- aco: allow direct_fetch=true for vec4 VS input loads
4419- nir/print: support nir_texop_descriptor_amd
4420- aco: add SCC clobber in build_cube_select
4421- nir/lower_tex: ignore width of cube textures
4422- radv: enable ac_nir_lower_resinfo for ACO
4423- aco: remove dead code for querying image size/samples/levels
4424- util/ralloc: add HEADER_ALIGN macro
4425- util: add freelist allocator with mark/sweep
4426- nir/serialize: remove unused parameter from read_src()
4427- Revert "nir: Drop the unused instr arg for src/dest copy functions."
4428- nir: adjust nir_src_copy signature to take a nir_instr *
4429- nir: use a GC context for instructions
4430- util/format,vulkan/format: update p_format for Vulkan vertex buffers
4431- ac: add ac_vtx_format_info
4432- radv: add radv_is_vertex_buffer_format_supported
4433- radv,aco: use pipe_format for static vertex input state
4434- radv,aco: use pipe_format for dynamic vertex input state
4435- radv: remove radv_translate_vertex_format
4436- radv: enable R8G8B8, B8G8R8 and R16G16B16 vertex formats
4437- radv,aco: implement 64-bit vertex inputs
4438- radv: enable 64-bit vertex formats
4439- aco/ra: handle empty def_reg interval in get_regs_for_copies
4440- aco/ra: remove bounds parameter from get_regs_for_copies()
4441- aco/ra: rework fixed operands
4442- nir/algebraic: optimize bits=umin(bits, 32-(offset&0x1f))
4443- nir/algebraic: optimize fabs(bcsel(b, fneg(a), a))
4444- radv: disable EXT_vertex_input_dynamic_state when using DGC
4445- radv: shrink zero-initialization in vkCmdSetVertexInputEXT
4446- radv: use nir_ubfe_imm
4447- radv: remove unnecessary .align_mul=4
4448- radv: fix dynamic RT stack size with VGPR spilling
4449- aco: DCE ra_ctx::defs_done
4450- aco: rename Interp_instruction to VINTRP_instruction
4451- aco: add reg() helper to assembler
4452- aco: fix assembly of MUBUF-to-LDS loads
4453- aco: add GFX11 opcode numbers
4454- aco/gfx11: don't use more than 1 NSA dword
4455- aco: update assembler for GFX11
4456- aco: limit GFX11 to 128 VGPRs for now
4457- aco: add LDSDIR instruction format
4458- aco: add VINTERP instruction format
4459- aco: omit read-only memory_sync_info when printing
4460- aco/tests: add GFX11 assembly tests
4461- aco: mostly implement FS input loads on GFX11
4462- aco: fix VMEMtoScalarWriteHazard s_waitcnt mitigation
4463- aco: improve VMEMtoScalarWriteHazard s_waitcnt mitigation
4464- aco: use some helpers in GFX10 hazard workarounds
4465- aco: improve printing of sgpr_null
4466- aco: improve printing of s_waitcnt_depctr
4467- aco: add VMEMtoScalarWriteHazard tests
4468- aco/gfx11: swap ds_cmpst_* data operands
4469- aco: improve wait_imm unpack
4470- aco/gfx11: fix s_waitcnt printing
4471- aco: update sendmsg enum from LLVM
4472- aco/gfx11: deallocate VGPRs at the end of the shader
4473- aco/gfx11: update form_hard_clauses
4474- aco: limit hard clauses to 63 instructions
4475- aco: fix assembler.gfx11.vinterp test
4476- aco: add search_backwards helper
4477- aco/gfx11: workaround VcmpxPermlaneHazard
4478- aco/gfx11: workaround LdsDirectVALUHazard
4479- aco/gfx11: workaround LdsDirectVMEMHazard
4480- aco/gfx11: workaround VALUTransUseHazard
4481- aco/gfx11: workaround VALUPartialForwardingHazard
4482- aco/gfx11: workaround VALUMaskWriteHazard
4483- aco: add ACO_DEBUG=force-waitdeps
4484- nir/algebraic: optimize b<<a + c<<a
4485- ac/llvm: fix mixing non-uniform/uniform sampler/texture descriptors
4486- ac/llvm: ignore g16 if the image opcode doesn't use derivatives
4487- aco/gfx11: optimize LS/HS load_local_invocation_index
4488- aco: swap v_perm_b32 operands
4489- radv,nir: add intrinsics for streamout and GS copy shaders
4490- ac/nir: add legacy streamout and GS copy shader helpers
4491- ac/nir: micro-optimize boolean expression
4492- ac/nir: add ac_nir_lower_ngg_options
4493- aco: add storage_gds
4494- aco: insert waitcnt before/after ds_ordered_count
4495- nir,ac/nir,aco,radv: replace has_input_*_amd with more general intrinsics
4496- aco: don't split swizzled store_buffer_amd on GFX9+
4497- ac/nir: lower gfx11 vertex parameter exports
4498- radv/gfx11: don't create fmask copy/expand pipelines
4499- radv/gfx11: don't create texop_samples_identical
4500- radv,aco: don't use lower_to_fragment_fetch_amd on GFX11+
4501- aco: fix typo in branch lowering
4502- aco/gfx11: perform FS input loads in WQM
4503- aco/gfx11: fix FS input loads in quad-divergent control flow
4504- radv/llvm: fix GS shaders on GFX8/9
4505- docs: update new_features.txt for non-extension additions to RADV
4506- radv: increase gfx1100/gfx1101 physical vgprs
4507- ac/gpu_info: increase gfx1100/gfx1101 physical vgprs
4508- aco/gfx11: increase gfx1100/gfx1101 physical vgprs
4509- radeonsi: increase gfx1100/gfx1101 physical vgprs
4510- nir/lower_bit_size: lower uadd_carry
4511- radv: lower 8/16-bit uadd_carry/usub_borrow
4512- radv: disable MRT compaction with dual-source blending
4513- aco: ensure MRT0 is written with dual source blending
4514- ac/nir: mask shift operands
4515
4516Riteo (1):
4517
4518- vulkan/device_select_wayland: fix a memory leak with DRM device handling
4519
4520Rob Clark (106):
4521
4522- gbm: Add USE_FRONT_RENDERING flag
4523- freedreno: Disable UBWC for frontbuffer rendering
4524- freedreno/drm: Fix potential bo cache vs export crash
4525- freedreno/gmem: Fix col0 calc
4526- freedreno: Drop fixed upper bound on # of tiles
4527- freedreno/drm: Add FD_VA_SIZE param
4528- freedreno: Cap reported video memory by VA size
4529- freedreno/drm: Avoid lock in fd_pipe_del()
4530- egl: atrace support
4531- freedreno/drm/virtio: Add some atrace
4532- util: Fix vs2019 build error
4533- egl: Drop _eglSetFuncName() return
4534- egl: Drop unused _EGL_FUNC_START() arg
4535- egl: Remove redundant fxn param
4536- egl: Convert to simple_mtx_t
4537- egl: Add some locking asserts
4538- egl/dri2: Indenting fix
4539- egl/dri2: Error path cleanups
4540- llvmpipe: Add some missing locking
4541- freedreno: Extract helper to check for batch dependency
4542- freedreno: Simplify add_dep logic
4543- freedreno: Avoid deferred-flush dependency loops
4544- Revert "ci: disable the freedreno farm."
4545- freedreno/a6xx: Random indent fixes
4546- freedreno/drm: Let backend hint about upload vs map
4547- freedreno/drm: Add offset param for fd_bo_upload()
4548- freedreno: Add support for upload transfers
4549- Revert "ci: disable the freedreno farm."
4550- egl/wgl: Make ref_count atomic
4551- egl/dri2: Make ref_count atomic
4552- egl/dri2: Add display lock
4553- egl: Make RefCount atomic
4554- egl: Introduce rwlock to protect eglTerminate()
4555- egl: Relax locking
4556- freedreno: Misc indent fix
4557- freedreno: nr_rings -> nr_priorities
4558- freedreno: Remap high/norm/low priorities
4559- freedreno/virtio: Don't upload if we have valid range
4560- freedreno/drm/virtio: Don't prefer upload for mapped buffers
4561- freedreno/drm/virtio: Handle read after upload
4562- freedreno: We really don't need aligned vbo's
4563- freedreno: Update github wiki links
4564- freedreno/ir3: GC unused macro
4565- freedreno/drm: Inline fd_bo_get_iova()
4566- freedreno/drm: Add fd_ringbuffer_attach_bo()
4567- freedreno/a6xx: Drop "hardpin" support
4568- freedreno/drm: Simplify emit_reloc_common
4569- freedreno/a6xx: Skip IBO state when unused
4570- freedreno/a6xx: Pre-calculate user const state size
4571- freedreno: Drop unused arg
4572- freedreno/a6xx: Move user const upload to bind
4573- freedreno/a6xx: Simplify fd6_build_user_consts()
4574- freedreno: Handle xfb invalidation
4575- freedreno: Use TC cpu-storage to shadow buffers
4576- freedreno/drm: Don't call kernel with no ops
4577- freedreno: Add perf-debug trace
4578- isaspec: Fix out of date comment
4579- gallium/u_threaded: Add some atrace/perfetto
4580- gallium/u_threaded: Add optional call-id tracing
4581- freedreno: Fix fence unref race
4582- docs: Update freedreno features
4583- mesa/st: ARB_vertex_attrib_64bit depend on glsl>=410
4584- freedreno/a6xx: Remove unused param
4585- freedreno/ir3: Fix clipvertex with GS+tess
4586- freedreno/ir3: Lower all the 64b
4587- freedreno/a6xx: Fix MAX_GEOMETRY_OUTPUT_VERTICES cap
4588- freedreno/a6xx: Fix primitives-generated query
4589- freedreno/a6xx: Update caps
4590- freedreno/a6xx: Drop max_scissor tracking
4591- Revert "egl: Factor some common terminate cleanup up to common code"
4592- nir: Add helper to create passthrough TCS shader
4593- radeonsi: Use nir passthrough TCS helper
4594- freedreno: Dirty the TCS state if patch_vertices change
4595- freedreno/ir3: Passthrough TCS support
4596- freedreno: Add tess_state support
4597- freedreno: Passthrough TCS support
4598- v3d/ci: Add a flake
4599- Revert "mesa/st: ARB_vertex_attrib_64bit depend on glsl>=410"
4600- freedreno: Don't advertise TGSI support for GS/tess
4601- freedreno/a6xx: Move layer_zero handling to interp state
4602- freedreno/ir3: Drop unused view_zero/layer_zero lowering
4603- freedreno/a6xx: Fix buffer size clamping
4604- freedreno/a6xx: Support AMD_vertex_shader_layer
4605- freedreno: support multiple viewports
4606- freedreno: Massage scissor state at bind time
4607- freedreno: Move guardband calc to bind time
4608- freedreno/a6xx: Support GL_ARB_viewport_array + gl43
4609- freedreno/a6xx: Enable GL_ARB_texture_mirror_clamp_to_edge
4610- freedreno: Enable GL_ARB_enhanced_layouts
4611- freedreno/ci: Small bit of xfails cleanup
4612- docs: Update features.txt
4613- nir: Add way to create passthrough TCS without VS nir
4614- freedreno: Add ARB_gl_spirv support
4615- freedreno/a6xx: Use box to pass 2d clear params
4616- freedreno/a6xx: Implement ARB_clear_texture
4617- freedreno/a3xx+: Enable ARB_shader_texture_image_samples
4618- freedreno/a3xx+: Enable ARB_derivative_control
4619- freedreno/ir3: Unconditionally lower subgroup ops
4620- freedreno/a6xx: Enable ARB_shader_group_vote
4621- freedreno/a6xx: Fix occlusion queries
4622- freedreno/a6xx: Remove unused field
4623- freedreno/a5xx+a6xx: Add base class for query samples
4624- freedreno/batch: Add a global epilogue
4625- freedreno: Core ARB_query_buffer_object support
4626- freedreno/a6xx: ARB_query_buffer_object support
4627- freedreno/a6xx: Mark gl45 supported
4628
4629Rohan Garg (3):
4630
4631- anv, iris: Disable pre fetching the binding table entries on DG2
4632- intel/compiler: Support 16 bit float ops
4633- anv: Enable 16 bit float ops on devices that have a LSC
4634
4635Roland Scheidegger (4):
4636
4637- lavapipe: initialize index_bias to zero for non-indexed draws
4638- gallivm: always do per-pixel LOD for cube maps
4639- llvmpipe: fix front/back face system value in the fs
4640- gallivm: remove dead cube map lod calculation code
4641
4642Roman Gilg (2):
4643
4644- loader/dri3: Simplify buffer age query
4645- loader/dri3: Add DRI performance option to wait for next buffer on swap
4646
4647Roman Stratiienko (6):
4648
4649- v3dv: Enable sync_fd importing/exporting on Android
4650- v3dv: Limit API version to v1.0 for Android
4651- v3dv: Avoid swapchain_info handling on Android
4652- lima: Ensure jobs initialized before calling lima_job_fini()
4653- Android: Use libgbm_mesa name for SDK30+
4654- meson: Enable system_has_kms_drm for android
4655
4656Ruijing Dong (30):
4657
4658- frontends/va: improve enc quality interface change
4659- radeonsi/vcn: add enc quality bits interface
4660- radeonsi/vcn: dpb change for supporting pre-encoding
4661- radeonsi/vcn: support encoding preset modes
4662- radeonsi/vcn: support VBAQ modes
4663- radeonsi/vcn: remove rate control double begin IBs
4664- radeonsi/vcn: add AUD syntax to h264 encoding
4665- frontend/va: remove some unnecessary code
4666- frontends/va: change getEncParamPreset location
4667- frontends/va: add max_frame_size into rate control
4668- frontends/va: add HRD, filler data enable and etc
4669- frontends/va: enable sao in hevc encoding
4670- radeon/vcn: process invalid frame rate in encoding
4671- radeonsi/vcn: support async
4672- radeonsi/vce: using fixed value for vce 52
4673- gallium/pipe/video: group avc encoder seq structure
4674- radeonsi/vce: apply avc seq parameters
4675- frontends/omx: apply avc seq parameters
4676- frontends/va: apply avc seq parameters
4677- radeonsi/vcn: use avc seq paramters
4678- frontends/va: add aspect ratio and vui timing info
4679- radeonsi/vcn: support aspect ratio and vui timing info
4680- d3d12: Updating refactored fields in pipe_h264_enc_picture_desc
4681- frontends/va: fix av1 decoding image distortion issue
4682- gallium/pipe/video: add data struct for dbk in avc encoding.
4683- frontends/va: add h264 dbk enc parameters
4684- radeonsi/vcn: enable dbk in avc encoding
4685- radeonsi/vcn: enable multi-slice encoding
4686- frontends/va: fixed an av1 decoding image corruption issue.
4687- frontends/va: fixed an av1 dec image corruption.
4688
4689Ryan Houdek (1):
4690
4691- vulkan/wsi: Add dep_libudev to idep dependencies
4692
4693Sagar Ghuge (5):
4694
4695- intel/isl: Setting L1 caching policy to Write-back mode
4696- anv: Specify Untyped L1 cache policy for stateless accesses
4697- iris: Specify Untyped L1 cache policy for stateless accesses
4698- anv: Handle bits to flush data-port's Untyped L1 data cache
4699- iris: Handle new untyped dataport cache flush PIPE_CONTROL field
4700
4701Sajeesh Sidharthan (1):
4702
4703- radeonsi/vcn: set current pic index correctly
4704
4705Sami Kyöstilä (1):
4706
4707- intel/ds: Update to Perfetto API v28.0
4708
4709Samuel Pitoiset (329):
4710
4711- vulkan: add support for VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT
4712- radv: implement VK_EXT_attachment_feedback_loop_layout
4713- radv: ignore out-of-order rasterization if stencil write mask is dynamic
4714- radv: rework shaders ref counting
4715- radv: use ref counting for VS prologs and PS epilogs
4716- radv: remove RADV_META_SAVE_SAMPLE_LOCATIONS
4717- radv: simplify saving/restoring all dynamic states
4718- radv: remove unused states parameter from some radv_emit_XXX() helpers
4719- radv: simplify radv_bind_dynamic_state() slightly
4720- radv: fix gathering XFB info if there is dead outputs
4721- radv: respect the render area for depth/stencil resolves
4722- radv: fix cleaning the meta query state if an error occured
4723- radv: properly ignore PSO states that are dynamic
4724- radv: remove one useless check about line width and dynamic state
4725- radv: simplify dynamic states uses when determining NGG culling settings
4726- radv: do not set registers for PSO states that are dynamic
4727- radv: remove radv_invariant_geom workarounds for DX games
4728- radv: explain why radv_invariant_geom is still needed for SotTR DX
4729- radv: remove unecessary radv_pipeline::uses_dynamic_stride
4730- radv: copy line stipple factor/pattern only if the state isn't dynamic
4731- radv: copy the sample locations only if the state isn't dynamic
4732- zink: stop skipping bunch of GLESx tests
4733- vulkan: initialize discardRectangleCount even if the state is dynamic
4734- vulkan: fix initializing the primitive topology when it's dynamic
4735- vulkan: add attachment sample count info AMD support
4736- radv: store the provoking vertex mode in the graphics pipeline info
4737- radv: simplify the depth clip enable/disable logic
4738- radv: do not translate polygon mode during gfx info initialization
4739- radv: do not translate logic op during gfx info initialization
4740- radv: do not translate blend op/factor during gfx info initialization
4741- radv: do not translate primitive topology during gfx info initialization
4742- radv: slightly change the color write enable mask
4743- radv: call vk_graphics_pipeline_state_fill() when initializing gfx pipelines
4744- radv: convert radv_tessellation_info to vk_tessellation_state
4745- radv: convert radv_viewport_info to vk_viewpoert_state
4746- radv: convert radv_discard_rectangle_info to vk_discard_rectangles_state
4747- radv: convert radv_rasterization_info to vk_rasterization_state
4748- radv: convert radv_depth_stencil_info to vk_depth_stencil_state
4749- radv: convert radv_fragment_shading_rate_info to vk_fragment_shading_rate_state
4750- radv: convert radv_input_assembly_info to vk_input_assembly_state
4751- radv: convert radv_multisample_info to vk_multisample_state
4752- radv: convert radv_color_blend_info to vk_color_blend_state
4753- radv: convert radv_rendering_info to vk_render_pass_state
4754- radv: convert radv_vertex_input_info to vk_vertex_input_state
4755- radv: remove radv_graphics_pipeline_info completely
4756- radv: advertise VK_KHR_global_priority
4757- radv: remove unused pCreateInfo from radv_pipeline_init_blend_state()
4758- radv: add radv_graphics_pipeline_import_info() helper
4759- radv: introduce a new pipeline type for graphics libs
4760- radv: store vk_graphics_pipeline_state to radv_graphics_pipeline
4761- radv: prevent accessing rasterization state when it's NULL
4762- radv: add a very initial implementation of VK_EXT_graphics_pipeline_library
4763- radv: do not compress DCC in presence of render loops on GFX10+
4764- radv: remove useless check when creating the noop FS
4765- radv: copy linked I/O info for merged stages in radv_fill_shader_info
4766- radv: copy the whole radv_shader_info struct for merged stages
4767- radv: add a radv_postprocess_nir() helper
4768- radv: move lowering FS intrinsics to radv_postprocess_nir()
4769- radv: move dumping NIR shaders to the previous similar loop
4770- radv: remove unused parameter in radv_lower_io_to_mem()
4771- radv: use NIR_PASS with radv_export_implicit_primitive_id
4772- radv: remove useless parameter from radv_link_shaders()
4773- radv: add radv_remove_point_size() linking helper
4774- radv: rework linking shaders in NIR
4775- radv: move assigning driver locations to the new linking helpers
4776- radv: move lowering patch vertices to radv_pipeline_link_tcs()
4777- radv: move lowering the view index to radv_pipeline_link_shaders()
4778- radv: remove redundant assignment of tcs.tcs_vertices_out
4779- radv: remove unused tcs_vertices_out assignment for VS
4780- radv: remove unused num_tess_patches assignment for VS
4781- radv: merge gather_tess_info() with radv_fill_shader_info()
4782- radv: stop setting CB_COLOR_CONTROL.ROP3 from the pipeline
4783- radv: stop clearing bitfields for registers that are emitted dynamically
4784- radv: cleanup dynamic states in radv_emit_graphics_pipeline()
4785- radv: stop emitting RMW context registers for updating sample locations
4786- radv: re-emit viewports if negative one to one or depth clamp mode changed
4787- radv: remove bogus assertion about independent set layouts with GPL
4788- radv: fix missing initialization of the pipeline layout when creating a lib
4789- radv: destroy the pipeline layout if creating a library failed
4790- aco: fix wrong size for 1D images and A16 on GFX9
4791- ac: constify ac_compute_cs_workgroup_size()
4792- radv: stop duplicating radv_es_output_info
4793- radv: use esgs_itemsize when calling ac_nir_lower_es_outputs_to_mem
4794- radv: compute the ESGS itemsize outside of radv_nir_shader_info_pass()
4795- radv: remove dead code about task ring when binding a compute pipeline
4796- radv: replace cs.uses_task_rings by ms.has_task
4797- radv/llvm: remove unused parameter in handle_vs_outputs_post()
4798- radv: stop duplicating radv_vs_output_info
4799- radv: fill radv_vs_output_info unconditionally for vertex related stages
4800- radv: remove redundant VS output parameter assignments
4801- radv: add a helper that links shader info between stages
4802- radv: link primitive ID/clip distance shader info from the new helper
4803- radv: stop gathering info for FS before other stages
4804- radv: move filling cs.block_size
4805- radv: add gather_shader_info_fs() helper
4806- radv: add gather_shader_info_mesh() helper
4807- radv: add gather_shader_info_gs() helper
4808- radv: add gather_shader_info_tes() helper
4809- radv: add gather_shader_info_tcs() helper
4810- radv: add gather_shader_info_vs() helper
4811- radv: add gather_shader_info_cs() helper
4812- radv: add gather_shader_info_task() helper
4813- radv: move more CS info to gather_shader_info_cs()
4814- radv: move more MS info to gather_shader_info_ms()
4815- radv: move determining NGG shader info to radv_fill_shader_info()
4816- radv: store binning settings into the physical device
4817- radv: make sure to emit BREAK_BATCH when color write enable is dynamic
4818- radv: remove useless parameter in si_write_scissors()
4819- radv: split emitting guardband into a separate helper
4820- radv: rework dirtying scissors when the line width changes
4821- radv: emit the guardband state separately from the scissor state
4822- radv: allow null descriptor set layouts in pipeline layouts
4823- radv: determine if the topology uses adjacency from the shaders on GFX9+
4824- radv: fix bogus assertion with RADV_FORCE_VRS
4825- aco: remove unused isel_context::tcs_num_patches
4826- radv: stop setting as_ls for the TCS stage
4827- radv: stop setting as_es for the GS stage
4828- radv: stop setting num_tess_patches for the GS stage
4829- radv: stop setting esgs_itemsize to the GS stage
4830- radv: cleanup helpers that compute NGG info and GS info on GFX9+
4831- radv: move computing wave_size/ballot_bit_size to the shader info pass
4832- radv: move computing NGG info and GS info to radv_nir_shader_info_link()
4833- radv: refactor radv_nir_shader_info_link()
4834- radv: move determining other NGG settings in radv_link_shaders_info()
4835- radv: simplify determining if NGG passthrough can be enabled
4836- radv: cleanup computing the workgroup size for all stages
4837- radv: add and use a helper that merges shader info for merged stages on GFX9+
4838- radv: remove unused parameters in radv_fill_shader_info()
4839- radv: determine as_ls/as_es in radv_link_shaders_info()
4840- radv: constify radv_shader_info where it's missing
4841- radeonsi/ci: trigger radeonsi-raven-skqp for any RADV changes
4842- radv: gather MRTs that are written by the fragment shader
4843- radv: compact MRTs to save PS export memory space
4844- radv: only expose sparseResidencyImage3D on GFX9+
4845- radv: emit SQTT markers for RT related commands
4846- radv: capture RT pipelines from the SQTT layer
4847- radv: fix reporting RT shaders in RGP
4848- radv: do not lower color exports for FS that need an epilog
4849- radv: add radv_remove_color_exports() helper
4850- radv: do not try to remove color exports for FS that need an epilog
4851- radv: keep track of the code size for VS prologs and PS epilogs
4852- radv: create a PS epilog from a library without the main FS
4853- radv: add support for emitting and prefetching PS epilogs
4854- radv: import PS epilog from libraries if present
4855- radv: fix hw remapping of MRT holes with color attachments without export
4856- radv: fix pipelineStageCreationFeedbackCount when it's 0
4857- radv: add a pointer to radv_shader_binary in radv_shader
4858- radv/ci: re-enable dEQP-VK.renderpass2.depth_stencil_resolve.*_samplemask
4859- radv: store the VS prologs/PS epilogs VA at upload time
4860- radv: split upload_shader_part() in two parts
4861- radv: store the binary to radv_shader_part
4862- radv: upload the PS epilog in the existing pipeline BO
4863- radv: reword a comment about dynamic states when rasterization is disabled
4864- radv: emit the LDS size for TCS directly from the pipeline on GFX9+
4865- radv: add radv_pipeline_key::dynamic_patch_control_points
4866- radv: skip computing some tess info when patch control points is dynamic
4867- radv: set workgroup_size to 256 when patch control points is dynamic
4868- aco: prevent a division by zero when patch control points is dynamic
4869- radv: add shader arguments for dynamic patch control points
4870- radv: add ABI lowering support for dynamic patch control points
4871- radv: pass the number of patch control points to si_get_ia_multi_vgt_param()
4872- radv: move emitting PRIMGROUP_SIZE for <= GFX9 from the cmdbuf
4873- radv: move emitting GE_CNTL for non-NGG pipelines from the cmdbuf
4874- radv: implement dynamic patch control points
4875- radv: advertise extendedDynamicState2PatchControlPoints
4876- radv: stop setting redundant viewport/scissor for internal operations
4877- radv: do not bind NULL graphics pipeline when restoring the meta state
4878- radv: reset the compute pipeline when the saved one was NULL
4879- radv: stop dirtying the graphics pipeline when restoring it
4880- radv: stop checking for NULL pipelines in radv_CmdBindPipeline()
4881- radv/ci: cleanup lists of failures/flakes
4882- radv: disable VK_EXT_graphics_pipeline_library with LLVM
4883- radv: rename radv_pipeline_key::vs::dynamic_vs_input to has_prolog
4884- radv: prepare the VS input state for prologs created with GPL
4885- radv: bind the VS input state for prologs created with GPL
4886- radv: enable the VS prologs cache if graphicsPipelineLibrary is enabled
4887- radv: do not remove PSIZ for VS when the topology is unknown
4888- Revert "radv: upload the PS epilog in the existing pipeline BO"
4889- nir: add nir_intrinsic_load_sample_positions_amd
4890- radv: add nir_intrinsic_load_sample_positions_amd in the ABI
4891- radv: run nir_opt_cse before lowering FS intrinsics
4892- radv,aco: lower barycentric_at_sample in NIR
4893- radv: do not link shaders when the next stage is unknown
4894- radv: allow to build the main FS in a graphics pipeline library
4895- radv,aco: do not compact MRTs if the pipeline uses a PS epilog
4896- radv: rename radv_thread_trace_set_pstate() to radv_device_set_pstate()
4897- radv: acquire pstate on-demand when capturing with RGP
4898- radv: constify radv_lookup_user_sgpr()
4899- radv: add radv_pipeline_key::dynamic_rasterization_samples
4900- radv: declare shader arguments for the number of samples for FS
4901- nir: add nir_load_rasterization_samples_amd
4902- radv: lower nir_load_rasterization_samples_amd in ABI
4903- nir,radv: pass the number of samples to load_sample_positions_amd
4904- radv: add barycentric_at_sample lowering when the number of samples is dynamic
4905- radv: emit the rasterization samples through an user SGPR if needed
4906- radv: make sure to initialize wd_switch_on_eop before checking its value
4907- radv: pass a VkSampler to write_sampler_descriptor()
4908- radv: remove unnecessary radv_buffer_view::vk_format
4909- radv: rework and rename radv_make_buffer_descriptor()
4910- zink/ci: skip ext_external_objects from Piglit with RADV
4911- zink/ci: skip arb_shader_clock with RADV
4912- zink/ci: skip GLX tests with RADV
4913- zink/ci: update list of expected failures with RADV on NAVI10
4914- zink/ci: update list of flakes for Piglit with RADV on NAVI10
4915- radv/ci: add piglit testing with Zink on NAVI10
4916- radv: do not free the GS copy shader binary if created from a library
4917- radv: import the GS copy shader from a library if present
4918- radv: determine the last VGT api stage from the active_stages bitfield
4919- radv: allow to build the pre-rasterization stages in a library
4920- radv: remove useless gfx10_ngg_info::enable_vertex_grouping
4921- radv: determine the last VGT api stage also for GPL
4922- radv: use the maximum number of vertices per primitives for NGG with GPL
4923- radv: enable NGG culling unconditionally for GPL but disable it dynamically
4924- radv: rework multiview lowering in NIR slightly
4925- radv: fix handling ViewportIndex with GPL
4926- radv: fix handling primitive ID and clip/cull distances with GPL
4927- radv: fix handling multiview with GPL
4928- radv: fix emitting RBPLUS state when MRTs are compacted
4929- radv: remove NULL checks when binding a graphics pipeline
4930- radv: fix initializing the attribute mask for VS input state and GPL
4931- radv: fix initializing the blend state for NULL render pass with GPL
4932- radv: ignore shader stages that don't need to be imported with GPL
4933- radv: add a new drirc option to enable a unified heap on APUs
4934- radv: enable the unified heap on APUs for Red Dead Redemption 2
4935- radv: fix and rework shaders upload with GPL
4936- radv: always re-emit patch control points when a pipeline with tess is bound
4937- radv: move the disk cache to the vk_physical_device struct
4938- radv: use SHA1_DIGEST_LENGTH in the pipeline cache
4939- radv: do not create a noop FS when the FS is imported from a library
4940- radv: discard the PS epilog when the pipeline doesn't use a fragment shader
4941- ac/llvm: add missing nir_intrinsic_bindless_image_atomic_{fmin,fmax}
4942- radv: set SAMPLE_MASK_TRACKER_WATERMARK to 15 on GFX11
4943- radv: use correct VGT_TESS_DISTRIBUTION settings on GFX11
4944- radv: do not use memory for waiting for cache flushes on GFX11
4945- radv: drop the ES vertex count requirement on GFX11
4946- radv: add RADEON_FLAG_DISCARDABLE
4947- radv: create and configure the ring BO for attributes
4948- radv: declare shader arguments for attributes via memory
4949- nir: add nir_intrinsic_load_ring_attr_{offset}_amd
4950- radv: add lowering for nir_intrinsic_load_ring_attr_{offset}_amd
4951- radv: allocate more space for pipeline statistics query on GFX11
4952- ac: add radeon_info::has_vrs_ds_export_bug
4953- radv: advertise fragmentShadingRateWithShaderDepthStencilWrites on NAVI23+
4954- aco: fix tcs_wave_id unpacking on GFX11
4955- radv: declare new dynamic states
4956- radv: add support for dynamic polygon mode
4957- radv: add support for dynamic tessellation domain origin
4958- radv: add support for dynamic logic op enable
4959- radv: add support for dynamic line stipple enable
4960- radv: add support for dynamic alpha to coverage enable
4961- radv: add support for dynamic sample mask
4962- radv: add support for dynamic depth clip enable
4963- radv: add support for dynamic conservative rasterization mode
4964- radv: add support for dynamic depth clip negative one to one
4965- radv: add support for dynamic provoking vertex mode
4966- radv: add support for dynamic depth clamp enable
4967- radv: advertise VK_EXT_extended_dynamic_state3
4968- radv: disable VK_AMD_texture_gather_bias_lod on GFX11
4969- radv: disable AMD_shader_explicit_vertex_parameter on GFX11
4970- radv/llvm: do not set COMPR for exports on GFX11
4971- ac/nir,radv/llvm: add support for the attribute ring
4972- aco,radv/llvm: do not export parameters on GFX11
4973- ac: fix has_vrs_ds_export_bug for VanGogh
4974- radv: disable {shader,sparse}ImageFloat32AtomicMinMax on GFX11
4975- radv: disable shaderBufferFloat64AtomicMinMax on GFX11
4976- radv: disable dual source blending in more situations
4977- ac/llvm: add support for device clock on GFX11
4978- aco: add support for s_sendmsg_rtn_b{32,64}
4979- aco: split the sendmsg enumeration into sendmsg_rtn
4980- aco: add support for device clock on GFX11
4981- ac/nir/ngg: fix emitting streamout output by using packed location
4982- radv: fix suspending/resuming pipeline statistics queries with GDS
4983- radv: fix VRS limit when attachmentFragmentShadingRate is disabled
4984- radv/llvm: fix dual source blending on GFX11
4985- radv: move nir_opt_idiv_const/nir_lower_idiv after NGG lowering
4986- ac/nir/ngg: fix emitting streamout output by using packed location
4987- radv: implement transform feedback queries with NGG streamout
4988- Revert "radv: add a pointer to radv_shader_binary in radv_shader"
4989- aco: create a new builder variant for ds_add_rtn
4990- aco: implement NIR intrinsics for NGG streamout
4991- aco: remove invalid assertions for NGG streamout
4992- radv/llvm: prevent emitting streamout outputs for NGG
4993- radv: call nir_io_add_intrinsic_xfb_info() after IO lowering
4994- radv: lower NIR intrinsics for NGG streamout
4995- radv: do not unconditionally disable NGG streamout lowering in NIR
4996- radv: reword an incorrect comment about primitives generated query
4997- radv: use 32-bit GDS counters for primitives generated query with NGG
4998- radv: suspend/resume primitives generated query separately for legacy/NGG
4999- radv: fix primitives generated query with NGG only
5000- radv: set the correct buffer size for NGG streamout
5001- radv: enable all possible features with NGG streamout
5002- radv: allow to enable NGG streamout with RADV_PERFTEST=ngg_streamout
5003- radv: enable VK_EXT_transform_feedback with NGG streamout on GFX11
5004- ac/nir: stop using nir_gather_xfb_info_from_intrinsics
5005- radv: assign IO var locations for VS/TES with NGG streamout
5006- radv/amdgpu: add a kernel GDS management workaround for -ENOMEM
5007- radv: use defines instead of magic values for GDS counters offset
5008- radv: move GDS counters after reserved GDS offsets for streamout
5009- radv: fix fallback for extreme geometry with tessellation on GFX11
5010- radv: fix setting MIN_LOD for texture descriptors on GFX11
5011- radv: make the GDS/GDS OA buffer objects resident
5012- radv: invalidate L2 instead of only writeback L2 when using DCC stores
5013- radv: re-emit the guardband state when restoring meta operations
5014- ac/nir,radv: rework and fix NGG queries enables for VS/TES
5015- ac/nir: do not convert GS outputs to the expected variable size on GFX11
5016- radv: re-emit NGG culling settings when conservative rast mode is dynamic
5017- radv: enable lowering of subgroup shuffle in NIR on GFX11+
5018- aco: fix p_interp_gfx11 to not overwrite SCC
5019- aco: fix missing SCC for p_interp_gfx11 in emit_interp_mov_instr()
5020- aco: add p_dual_src_export_gfx11 for dual source blending on GFX11
5021- aco: fix dual source blending on GFX11
5022- aco: fix FS inputs loads in WQM with 16-bit
5023- radv: suspend/resume XFB queries with NGG for meta operations
5024- zink: require extendedDynamicState3ColorBlendEquation for full ds3
5025- radv: stop overallocating LDS for VS/TES when NGG streamout is enabled
5026- radv: fix computing the pervertex LDS size with NGG streamout
5027- radv: enable NGG XFB queries only if streamout is enabled
5028- radv: re-mit streamout buffers to unbind them when NGG streamout is disabled
5029- radv: fix NGG streamout when it's never enabled in runtime
5030- radv: fix possible hangs with NGG streamout and secondary cmdbuf
5031- aco: fix emitting DEALLOC_VGPRS in the discard block
5032- radv: re-emit dynamic depth clamp enable if depth clip enable changed
5033- radv: make sure to mark DCC as compressed on GFX11
5034- radv: introduce RADV_DEBUG=nofmask
5035- radv: set missing SPI_SHADER_PGM_xxx registers on GFX11
5036- radv: set INTERPOLATE_COMP_Z to 0 on GFX11
5037- radv: do not enable DCC for MSAA images without FMASK
5038- radv: do not enable NGG culling on GFX11
5039- radv: disable VRS entirely on GFX11
5040
5041Sarah Walker (3):
5042
5043- pvr: Update FWIF compute register structure
5044- pvr: Remove STRIP_RENDERING members from FWIF
5045- pvr: Update FWIF 3d and compute register structures
5046
5047Sathishkumar S (2):
5048
5049- frontends/va: support yuv 400/444 rt_formats in vaconfig
5050- frontends/va: reallocate surface for yuv400/yuv444 picture
5051
5052Sergei Chernyadyev (1):
5053
5054- radv: fix setting results for initialization failures in thread trace and trap handler
5055
5056Shuicheng Lin (1):
5057
5058- iris: return failure if iris_resource_configure_main fail
5059
5060Sil Vilerino (76):
5061
5062- gallium/va: vaDeriveImage to check PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP
5063- d3d12: Implement cap PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP
5064- frontends/va: Add HEVC decode slice descriptors
5065- frontends/va: Add HEVC decode args: IntraPicFlag, no_pic_reordering_flag, no_bipred_flag
5066- frontends/vdpau: Fill HEVC Decode param IntraPicFlag
5067- frontends/omx: Fill HEVC Decode param IntraPicFlag
5068- frontends/va: Mark IsLongTerm in HEVC decode args
5069- frontends/va: Add HEVC Encode support multi slice and extend pipe args
5070- frontends/va: Extend single to multiple L0-L1 references for HEVC Encode
5071- frontends/va: Support HEVC caps regarding features, block sizes, prediction direction
5072- d3d12/va: Name convention rename PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP to PIPE_VIDEO_CAP_SUPPORTS_CONTIGUOUS_PLANES_MAP
5073- gallium/vl: Allow vl_zscan.h to be included from C++
5074- gallium/vl: Rename s_addr variable in vl_idct.c as it conflicts with windows existing inaddr.h keyword definition
5075- d3d12: Add HEVC Decode/Encode
5076- d3d12: Avoid extra allocation, copies when generating DXVA_Slice_Hxxx_Short arrays
5077- d3d12: Avoid local allocations for D3D12_RESOURCE_BARRIER on hot paths
5078- d3d12: Avoid heap allocations on hot path d3d12_video_decoder_dxva_picparams_from_pipe_picparams_hevc
5079- d3d12: Fill feedback in d3d12_video_encoder_encode_bitstream so vaSyncSurface properly populates buf->coded_size
5080- d3d12: Fix leak in d3d12_video_proc when re-creating ID3D12VideoProcessor
5081- d3d12: Fix winsys displaytarget leak in d3d12_resource
5082- d3d12: Fix leak in d3d12_resource_from_resource and usage in d3d12 video dec, enc
5083- d3d12: Add support for importing d3d12_video_buffer from handle
5084- d3d12: Allow formats other than NV12 in d3d12_video_buffer
5085- d3d12: Allow video processing for formats other than NV12
5086- d3d12: Add VPBlit processor check for D3D12_FEATURE_VIDEO_PROCESS_MAX_INPUT_STREAMS
5087- d3d12: d3d12_video_buffer_create_impl make resident after checking for resource creation
5088- frontends/va: Implement vaSyncBuffer
5089- d3d12: Update HEVC Encode GOP on I frames too
5090- d3d12: Video process - Remove unnecessary batches flush
5091- d3d12: Add initialization values for d3d12_video_encoder
5092- d3d12: Report PIPE_VIDEO_CAP_REQUIRES_FLUSH_ON_END_FRAME
5093- d3d12: Add support for PIPE_VIDEO_CAP_ENC_SUPPORTS_ASYNC_OPERATION
5094- d3d12: Change type of m_FenceValue to uint64_t in d3d12_video_encoder
5095- d3d12: Export some util functions from d3d12_fence for d3d12 video
5096- d3d12: Keep track of feedbacks from previous encode executions
5097- d3d12: Change d3d12_video_encoder to not automatically flush
5098- d3d12: Move encode configuration and capabilities into separate structs
5099- d3d12: Encoder to use independent command allocators per in flight resource set
5100- d3d12: d3d12_video_encoder add reset() to prevent leaks on re-allocation
5101- d3d12: Have d3d12_video_encoder keep separate references for enc, heap and dpb allocations for in flight resources
5102- d3d12: Make d3d12 encode operations async and do not block waiting
5103- d3d12: Add output buffer to inflight resourceset
5104- d3d12: Fix redundant/inconsistent initializations for d3d12_video_encoder
5105- d3d12: Enhance get_feedback not enough buffers message
5106- d3d12: Extract into common variable for metadata slot calculation
5107- d3d12: Make get_feedback return correctly on error when writing \*size=0
5108- d3d12: Relax slice caps requirements for video encode
5109- d3d12: Fix out of bounds index check for texture array DPB
5110- d3d12: Add debug verbosity for output reconstructed picture
5111- d3d12: Fix texture array DPB encode path
5112- d3d12: Fix usage of %ld with PRIu64 instead for cross-platform build
5113- d3d12: Replace size_t with uint64_t in d3d12_video_encoder definitions for cross-plat compatibility
5114- frontends/va: Add more params from VADecPictureParameterBufferAV1 and VASliceParameterBufferAV1 to pipe_av1_picture_desc
5115- frontends/va: Support AV1 Decode with multiple tiles (num_elements > 1) in a single VASliceParameterBufferAV1 buffer
5116- frontends/va: Use is_video_format_supported in vlVaVidEngineBlit instead of format list
5117- frontends/va: Fix AV1 decode qm_* parameters reading from VADecPictureParameterBufferAV1
5118- d3d12: Add support for AV1 Video Decode
5119- d3d12: Support more video formats
5120- d3d12: Fix aliased usage of m_SliceControlBuffer in d3d12_video_decoder_prepare_dxva_slices_control_XXXX
5121- d3d12: Style variable fix in d3d12_video_decoder_prepare_dxva_slices_control_av1
5122- frontends/va: Add VP9 decode multi slice information
5123- frontends/va: Add VP9 decode ref/mode_deltas information
5124- d3d12: Add VP9 Decode support
5125- d3d12: Fix get_index7bits - Reuse previously freed indices
5126- d3d12: HEVC Encode workaround for edge case in caps reporting not contemplated by upper layer interface
5127- d3d12: Fix HEVC wrong caps detection due to bad parenthesis in condition
5128- d3d12: HEVC Set D3D12_VIDEO_ENCODER_CODEC_CONFIGURATION_HEVC_FLAG_USE_ASYMETRIC_MOTION_PARTITION when required by caps
5129- gallium/vl: Add vl_winsys_win32 support
5130- frontends/va: Add windows VA frontend support via vl_winsys_win32 and libva-win32
5131- d3d12: resource_from_handle to validate importing resource ptr from same d3d12 device
5132- CI: Add gallium-va and video-codecs in windows-vs2019 and debian-mingw32-x86_64
5133- d3d12: Create d3d12_resource objects for video usage keeping their DXGI\_..._UNORM formats
5134- d3d12: HEVC Decode - Fix slice size and offsets translation from VA to DXVA
5135- frontend/va: Fix WIN32 VA_DRIVER_INIT_FUNC declaration. Remove declspec as it uses .def file
5136- ci: Add va frontend to windows-build-rules
5137- d3d12: Video Screen - Do not crash if HEVC not supported, return no support instead
5138
5139Simon Fels (1):
5140
5141- venus: drop duplicated YCBCR_2_PLANE_444_FORMATS_FEATURES_EXT
5142
5143Simon Ser (2):
5144
5145- vulkan/wsi/x11: detect Xwayland via the XWAYLAND extension
5146- egl: fix error string for dri2_create_image_dma_buf()
5147
5148Simon Zeni (10):
5149
5150- mesa: fix EXT_EGL_image_storage target validation
5151- gallium: track if st_egl_image was created by a dmabuf
5152- mesa: simplify st_egl_image binding process for texture storage
5153- mesa/st: check egl image and texture target match before binding
5154- mesa: fix GL_INVALID_OPERATION in glEGLImageTargetTexStorageEXT
5155- mesa: unlock texture on error path in glEGLImageTargetTexStorageEXT
5156- egl: fix Xlib include in egldisplay.h
5157- egl: set API symbols public
5158- EGL: sync headers with Khronos
5159- meson: remove unused EGL_NO_X11 definition
5160
5161Sonny Jiang (1):
5162
5163- radeonsi/vcn: add decode support for gfx1101 and gfx1103
5164
5165SoroushIMG (22):
5166
5167- zink: Fix incorrect emission of SPIR-V shift ops
5168- zink: track min_samples state for per sample shading
5169- zink: lower 64bit vars for drivers without float64
5170- zink: optimise lower_64bit_vars for when int64 is supported
5171- zink: lower 64bit pack when float64 not supported
5172- zink: fix nir_op_unpack_64_2x32 emission
5173- zink: fix spirv temp func variable class emission
5174- nir: track whether a loop contains soft fp64 ops
5175- nir: allow to fine tune unrolling for loops with soft fp64 ops
5176- zink: enable loop unrolling for loops that have soft fp64
5177- zink: enable native ETC2 if vk driver supports it
5178- zink: fix invalid Offset set for variables which do not need an offset
5179- zink: stop enabling minmax filtering when not supported
5180- zink: fix isNan mismatch between NIR and SPIR-V
5181- lavapipe: stop allocating 0 size const buffer
5182- zink: cleanup pushconst interface between driver/compiler
5183- zink: use unified pushconst layour for passthorugh tcs
5184- zink: cleanup zink_pipeline_layout_create
5185- zink: add pushconst only pipeline layout
5186- zink: add new framebuffer_is_layered state
5187- zink: clamp gl_Layer output to 0, if framebuffer is not layered
5188- zink: limit gl_Layer clamping to drivers that need it
5189
5190Sunil Khatri (1):
5191
5192- winsys/amdgpu: clamp up the alignment if zero
5193
5194Sviatoslav Peleshko (2):
5195
5196- iris: Always initialize shader compilation queue ready fence
5197- driconf/intel: Use fake vendor to WA bad detection in Source engine games
5198
5199Tapani Pälli (40):
5200
5201- glsl: allow image*Shadow keywords on ES and GLSL >= 420
5202- glsl: add check for too large atomic counter buffer offset
5203- iris: implement Wa_14015946265 for DG2
5204- anv: implement Wa_14015946265 for DG2
5205- intel/compiler: implement Wa_14014595444 for DG2
5206- intel/compiler: fix assert from ver to verx10
5207- intel/genxml: add VFG_PREEMPTION_CHICKEN_BITS register
5208- anv: disable preemption on VFG, Wa_14015207028 for DG2
5209- iris: disable preemption on VFG, Wa_14015207028 for DG2
5210- anv: remove vk_sample_locations_state from emit_multisample
5211- intel/common: clamp sample location coordinate range
5212- anv: remove primitive_topology from 3DPRIMITIVE calls
5213- mesa/st: fix a set_sampler_views call parameter order
5214- anv: implement Wa_14016118574
5215- iris: implement Wa_14016118574
5216- anv: limit to default fast clear color when image used for transfer
5217- intel: revert preemption disable via VFG changes
5218- intel/fs: mark debug variables with ASSERTED
5219- anv: mark debug variables with ASSERTED
5220- mesa: mark debug variables with ASSERTED
5221- anv: dynamic state for polygon mode
5222- anv: dynamic state for tessellation domain origin
5223- anv: dynamic rasterization stream
5224- anv: dynamic line stipple enable
5225- anv: dynamic line rasterization mode
5226- anv: dynamic state for logic op enable
5227- anv: dynamic alpha to one enable
5228- anv: dynamic states for depth clip and clamp
5229- anv: dynamic provoking vertex mode
5230- anv: dynamic color write mask
5231- anv: dynamic color blend equation
5232- anv: dynamic multisample sample mask
5233- anv: toggle on EXT_extended_dynamic_state3
5234- anv: move emit_shading_rate to gfx8_cmd_buffer
5235- anv: fill AlphaToCoverageEnable lazily from state
5236- intel/compiler: run nir_opt_idiv_const before nir_lower_idiv
5237- anv: remove some unused functions
5238- hasvk: remove some unused functions
5239- iris: setup stage bitmask for Wa_22011440098
5240- anv: setup stage bitmask for Wa_22011440098
5241
5242Tatsuyuki Ishi (4):
5243
5244- radv: Implement radv_flush_before_query_copy to workaround UE Vulkan bugs.
5245- radv: Remove unused functions.
5246- radv: Remove unused radv_deferred_queue_submission.
5247- radv,lvp,anv: Delete pre-common dispatch leftover.
5248
5249Thomas Debesse (7):
5250
5251- r600: set clear_buffer = u_default_clear_buffer
5252- r600: info.stage MESA_SHADER_KERNEL as MESA_SHADER_COMPUTE
5253- nv50: call nir_lower_flrp
5254- clover: implement CLOVER_DEVICE_TYPE like RUSTICL_DEVICE_TYPE
5255- Revert "gallium/clover: pass -no-opaque-pointers to Clang", opaque pointers are now implemented
5256- gallium/clover: pass -opaque-pointers to Clang on LLVM 15 and 16
5257- crocus: set clear_buffer = u_default_clear_buffer
5258
5259Thomas H.P. Andersen (15):
5260
5261- nvc0: fix a warning -Wconstant-conversion
5262- radeonsi: avoid a use-after-free
5263- d3d12: fix warnings for missing-braces on clang
5264- r600: fix warnings for missing-braces on clang
5265- ci: remove missing-braces from no-error
5266- util: avoid deprecated builtin has_trivial_destructor
5267- spirv: avoid allocating memory twice
5268- hasvk: Switch to the common descriptor update template struct
5269- panvk: Implement VK_KHR_descriptor_update_template
5270- spirv: mark function unused
5271- llvmpipe: mark function unused
5272- llvmpipe: drop unused function
5273- panfrost: avoid warning about unused function
5274- vulkan: avoid warning about unused function
5275- vulkan: drop const for descriptor set
5276
5277Timothy Arceri (24):
5278
5279- Revert "nir: Preserve offsets in lower_io_to_scalar_early"
5280- glsl: dont lower precision for textureGatherOffsets
5281- glsl: make packed varying helper needs_lowering() external
5282- glsl: fix location for array subscript
5283- nir: update nir_is_supported_terminator_condition()
5284- nir: support loop unrolling with inot conditions
5285- glsl: remove GLSL IR inverse comparison optimisations
5286- nir/loop_analyze: delay instruction cost calculation
5287- nir/loop_analyze: remove cost of redundant selects
5288- util/conf: enable init to zero workaround for Exanima
5289- util/radeonsi: enable zerovram workaround for Exanima
5290- radv: add radv_zero_vram workarounds for OpenGL games
5291- glthread: fix matrix stack depth tracking
5292- mesa: add EXT_debug_label support
5293- glthread: leave dlist dispatch in place for Begin/End
5294- glsl/glsl_to_nir: remove unreachable code
5295- glsl: move rule inside lower_packing_builtins()
5296- glsl: drop sub to add neg lowering in GLSL IR
5297- glsl: always do {CARRY,BORROW}_TO_ARITH lowering
5298- glsl: move lower instructions logic inside that pass
5299- st/glsl_to_ir: remove stale comment
5300- st/glsl_to_ir: remove stale include
5301- mesa: fix typo from adding glGetObjectLabelEXT
5302- nir: fix typo in lower_double options handling
5303
5304Timur Kristóf (64):
5305
5306- ac/nir/cull: Move some code from cull_bbox into helper functions.
5307- ac/nir/cull: Move the contents of cull_bbox into ac_nir_cull_triangle.
5308- ac/nir/cull: Change if condition for bounding box culling.
5309- ac/nir/ngg: Move LDS store of accepted flag into the inner branch.
5310- ac/nir/cull: Make cull functions more consistent.
5311- aco: Optimize branching sequence during SSA elimination.
5312- aco: Remove branch instruction when exec is constant non-zero.
5313- ac/nir/cull: Tweak phi for cull_small_primitive branch.
5314- ac/nir/cull: Fix typo in bounding box culling.
5315- aco: Add faster code path to store_lds for consecutive write mask.
5316- aco: Fix invalidated reference in branching sequence optimization.
5317- aco: Check for instructions that inhibit the branching sequence optimization.
5318- aco/optimizer_postRA: Don't try to optimize dead instructions.
5319- aco: Support s_cselect_b64 in SCC no-compare optimization.
5320- aco: Improve SCC nocompare optimization when SCC is clobbered.
5321- aco: Fix p_init_scratch for task shaders.
5322- vulkan, spirv: Update to Vulkan 1.3.226 and latest SPIR-V headers.
5323- spirv: Support EXT_mesh_shader indices and mark them per-primitive.
5324- spirv: Add mesh_shading capability for EXT_mesh_shader.
5325- spirv: Support EXT_mesh_shader mesh/task stages.
5326- spirv: Support EXT_mesh_shader SetMeshOutputsEXT.
5327- spirv: Support the CullPrimitiveEXT mesh shader built-in.
5328- spirv: Support TaskPayloadWorkgroupEXT storage class.
5329- spirv, nir: Handle EmitMeshTasksEXT opcode.
5330- ac/nir/ngg: Add EXT_mesh_shader primitive indices.
5331- ac/nir/ngg: Add EXT_mesh_shader CullPrimitiveEXT output.
5332- ac/nir/ngg: Add EXT_mesh_shader vertex/primitive count.
5333- radv: Implement EXT_mesh_shader draw calls.
5334- radv: Enable EXT_mesh_shader on RDNA2 with RADV_PERFTEST=ext_ms
5335- nir/lower_system_values: Add shortcut for 1D workgroups.
5336- nir/gather_info: Clear cross-invocation output mask.
5337- radv: Replace NV_mesh_shader macros with EXT.
5338- radv: Enable subgroup feature bits for mesh and task shaders.
5339- radv: Change max preferred task workgroup invocations to 64.
5340- radv: Remove dead shader temps after linking.
5341- ac/nir/ngg: Don't use LDS for same-invocation indices and cull outputs.
5342- ac/nir/ngg: Don't initialize same-invocation mesh shader outputs.
5343- aco/tests: Add post-RA optimizer testcase for partially overwritten VCC.
5344- aco/tests: Add post-RA DPP test cases with control flow.
5345- aco/tests: Add post-RA SCC no-compare tests cases with control flow.
5346- aco/optimizer_postRA: Mark a register overwritten when predecessors disagree.
5347- aco/optimizer_postRA: Don't assume all operand registers were written by same instr.
5348- aco/optimizer_postRA: Fix logical control flow handling.
5349- aco/optimizer_postRA: Clarify terminology.
5350- aco: Change inverse-comparison optimization to work with s_not
5351- spirv: Treat EmitMeshTasksEXT as a terminating instruction.
5352- radv: Use a fallback for marketing name when libdrm doesn't know it.
5353- ac/nir/ngg: Fix cross-invocation indices and cull outputs.
5354- aco: Fix build error with std::max on GCC 12
5355- nir: Document the flatten/dont_flatten selection control options.
5356- nir: Add selection control enum for always taken divergent branches.
5357- aco: Allow explicitly removing jumps on GFX10+ when beneficial.
5358- ac/nir/ngg: Remove jumps from some branches where we know LGKMCNT==0.
5359- ac/nir/tess: Remove jump from tess factor writes.
5360- nir, ac, aco: Add ACCESS intrinsic index to load/store_buffer_amd.
5361- aco: Cleanup load_vmem_mubuf and store_vmem_mubuf functions.
5362- nir, ac, aco: Add index src to load_buffer_amd/store_buffer_amd.
5363- aco: Optimize MUBUF 0 offset when idxen is also being used.
5364- aco/optimizer_postRA: Use unique_ptr + array for instruction indices.
5365- aco/optimizer_postRA: Speed up reset_block() with predecessors.
5366- aco/optimizer_postRA: Properly handle vccz/execz/scc in reset_block.
5367- aco/optimizer_postRA: Delete dead instructions more efficiently.
5368- aco: Move is_dead to aco_ir.h to allow it to get inlined.
5369- aco: Add ACO_DEBUG=novalidateir option.
5370
5371Tomeu Vizoso (16):
5372
5373- radeonsi/ci: Move libva and piglit jobs to Raven Ridge boards
5374- radv/ci: Put one more board to run the CTS on Stoney Ridge
5375- iris/ci: Test on Jasper Lake Chromebooks
5376- iris/ci: Set FDO_CI_CONCURRENT for all Chromebook jobs
5377- ci: Report failures to GitLab for LAVA jobs
5378- ci: Make sure that the Crosvm control socket has been removed
5379- Revert "ci: set venus on lavapipe to manual due to flakes"
5380- ci: Crosvm won't remove the control socket file on stop
5381- Revert "Revert "Revert "ci: set venus on lavapipe to manual due to flakes"""
5382- ci: Uprev ci-templates
5383- ci: Use --no-install-recommends to avoid problems with --no-remove
5384- ci: Install sysvinit-core without --no-remove
5385- ci: Stop explicitly passing env vars to FDO_DISTRIBUTION_EXEC command
5386- gallium: Add PIPE_CAP_QUERY_TIMESTAMP_BITS
5387- ci: Run RadeonSI jobs on changes to src/amd/ci
5388- radeonsi,radv/ci: Increase coverage
5389
5390Victor Hermann Chiletto (1):
5391
5392- radv: refactor descriptor set layout tracking
5393
5394Vinson Lee (10):
5395
5396- bin/perf-annotate-jit.py: Update to Python 3.
5397- zink: Remove duplicate variable zero.
5398- pan/bi: Fix memory leaks.
5399- egl/dri2: Fix missing return with dri2_egl_error_unlock.
5400- radv: Use count_tes_user_sgprs return value.
5401- d3d12: Initialize d3d12_video_encoder_references_manager_hevc::m_current_frame_idx.
5402- radv: Fix file descriptor leak.
5403- zink: Remove duplicate variable zm.
5404- r600/sfn: Initialize TestInstrFromString member m_instr_factory.
5405- v3d: Fix initializer-overrides warning.
5406
5407Väinö Mäkelä (1):
5408
5409- hasvk: Correctly set NonPerspectiveBarycentricEnable on gfx7
5410
5411Xin LI (1):
5412
5413- meson: Improve detection of qsort_r().
5414
5415Yiwei Zhang (52):
5416
5417- venus: increase busy wait order in vn_relax
5418- venus: double the abort timeout to allow long shader compiles
5419- venus: fix external memory ext filtering
5420- venus: require necessary extensions for common wsi support
5421- venus: avoid feedback for external fence
5422- venus: use a separate sync fence for Android wsi
5423- venus: lazily create queue wait fence and make it non-external
5424- venus: avoid pre-allocating the feedback pool
5425- venus: sync to latest venus protocol headers for syncFdFencing
5426- venus: stub out renderer sync_fd fencing features
5427- venus: put android wsi on the sub-optimal path
5428- venus: fix vn_GetFenceFdKHR
5429- venus: fix vn_GetSemaphoreFdKHR
5430- venus: query renderer sync_fd props to fill the feature stubs
5431- venus: re-implement sync_fd external fence
5432- venus: re-implement sync_fd external sempahore
5433- venus: avoid scrubing wsi/external sempahores
5434- venus: allow no external memory handle when renderers lacks support
5435- venus/ci: mark a test as flake
5436- venus: some clang format fixes
5437- venus: enable zink required extensions on Android
5438- venus: enable KHR_driver_properties on Android
5439- venus: double the abort timeout
5440- loader: use os_get_option for driver override
5441- zink: fix core support on Android
5442- zink: fix zink_create_fence_fd to properly import
5443- zink: fix in-fence lifecycle
5444- zink: implement fence_get_fd required by EGL android platform
5445- venus: clean up vn_QueueSubmit
5446- venus: force synchronous submission for external signal semaphore
5447- venus: fix an oob in descriptor pool state restoration
5448- venus: vn_ResetDescriptorPool to reset mutable type states
5449- venus: zero out the pipeline handles
5450- venus: avoid fixing pipeline if not derivative
5451- venus: ignore pInheritanceInfo if not secondary command buffer
5452- venus: ensure to clear VkPipelineCreationFeedback::flags if needed
5453- mesa/st: emit string marker based on PIPE_CAP_STRING_MARKER
5454- meson: libelf is optional
5455- venus/ci: remove redundant crosvm patch
5456- venus: move vendor ext to bottom and sort the ext list
5457- docs: update venus extension support
5458- docs: move VK_EXT_extended_dynamic_state3 out from 1.3
5459- ci/venus: enable render server for venus testing
5460- venus: avoid accessing local var in VN_ADD_EXT_TO_PNEXT_OF
5461- venus: remove redundant codes
5462- venus: handle VkAndroidHardwareBufferFormatProperties2ANDROID
5463- venus: sync to latest venus protocol headers
5464- venus: add VK_EXT_primitives_generated_query support
5465- venus: add VK_EXT_depth_clip_control support
5466- docs: update to latest venus driver support
5467- venus: handle VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT
5468- venus: fix android wsi with global fencing disabled
5469
5470Yonggang Luo (218):
5471
5472- meson: Place all pre_args about HAVE_DRI* into single place
5473- meson: Cleanup usage dri_drivers as it's a dead option
5474- d3d12: Fixes compile error with mingw/gcc-x64 when static linkage to runtime library
5475- meson: clc compiler test only works on win32
5476- microsoft/clc: Fixes compiling errors with clang/mingw64 in clc/clc_compiler_test.cpp
5477- util: open_memstream also support on MacOS
5478- c11: Fixes the declaration of timespec_get in time.h for macOS
5479- egl: Remove eglextchromium.h and import eglext_angle.h
5480- egl: wire up ANGLE_sync_control_rate
5481- egl/x11: Split declaration of dri2_format_for_depth into header platform_x11.h
5482- egl/x11: implement ANGLE_sync_control_rate
5483- ci/windows: Setup proper MESA_IMAGE_TAG for Windows build/test image
5484- c11: It's should detect PTHREAD_DESTRUCTOR_ITERATIONS when define TSS_DTOR_ITERATIONS
5485- c11: call impl_tss_dtor_invoke with tls callback for win32
5486- c11: Implement and use of thrd_current properly on win32
5487- util: Implement util_thread_get_time_nano on win32
5488- util: Fixes typo in function os_get_total_physical_memory
5489- util: Implement os_get_available_system_memory on Win32
5490- util: Use \`DETECT_OS_WINDOWS` instead \`WIN32` in u_process.c
5491- util: Enable tests that disabled for msvc/mingw
5492- zink: Use DETECT_OS_WINDOWS instead of 'WIN32'
5493- meson: Enable graw tests on mingw
5494- util: Add os_create_anonymous_file support on win32
5495- nine: Fixes compile of nine_pipe.h with clang ../../src/gallium/frontends/nine/nine_pipe.h:377:9: error: unannotated fall-through between switch labels [-Werror,-Wimplicit-fallthrough]         default:         ^ ../../src/gallium/frontends/nine/nine_pipe.h:377:9: note: insert 'break;' to avoid fall-through         default:
5496- panfrost: Do no use designated initializer for union
5497- intel: Fixes compile error of aubinator_viewer.cpp with gcc
5498- util: Fixes memory leak in create_cache_item_header_and_blob
5499- util: Fixes memory leak in u_printf_test.cpp
5500- util: Fixes invalid assumption that return non null by function util_format_fetch_rgba_func
5501- util: Disable usage of __attribute__((__const__)) when the compiler is clang
5502- ci: Improve MacOS github ci
5503- ci: Enable llvmpipe on MacOS
5504- c11: Remove mtx_try from mtx enums as it's not a part of c11 standard
5505- c11: #include <threads.h> when the os/platform provide it
5506- anv: Fixes struct anv_device::info is not initialized with struct anv_physical_device
5507- meson: Use different STRACEDIR folder for intel_devinfo_override_test
5508- meson: add enable-glcpp-tests option
5509- ci: disable glcpp tests for now
5510- Revert "ci: disable unit tests"
5511- util: Use literal instead call to list_inithead in u_queue.c
5512- mapi: Remove unused ENTRY_CURRENT_TABLE_GET
5513- mapi: Remove usage of _glapi_check_multithread and _glapi_destroy_multithread
5514- mapi: Remove reference to unused _gl_DispatchTSD
5515- mapi: Move shared _glapi_set_context and _glapi_set_dispatch into u_current.c
5516- mapi: Move shared stub into u_current.c
5517- mapi: Access _glapi_tls_Dispatch directly without need of macro
5518- docs: Remove document about USE_ELF_TLS
5519- mapi: Improve comment about _glapi_tls_Dispatch and _glapi_tls_Context
5520- aco: Use unreachable instead assert(false)
5521- amd/compiler: Fixes warning [-Wunused-variable] in test_to_hw_instr.cpp
5522- amd/compiler: Fixes warning [-Wunused-variable] in test_optimizer_postRA.cpp
5523- nir: Fixes [-Wdeprecated-declarations] in serialize_tests.cpp
5524- util: Fix [-Wdeprecated-declarations] in mesa-sha1_test.cpp
5525- glsl: Fixes [-Wdeprecated-declarations] in list_iterators.cpp
5526- intel/compiler: Fixes [-Wdeprecated-declarations] in test_eu_compact.cpp
5527- intel/compiler: Fixes [-Wdeprecated-declarations] in test_eu_validate.cpp
5528- virgl: Fixes [-Wdeprecated-declarations] in virgl_staging_mgr_test.cpp
5529- osmesa: Fixes [-Wdeprecated-declarations] in test-render.cpp
5530- ci: Remove -Wno-error=deprecated-declarations cpp option from debian-clang
5531- util: Remove unused rtasm_cpu_has_sse2
5532- util: Rename util_cpu_detect to _util_cpu_detect_local
5533- util: Improve has_sse has_sse2 has_sse3 has_sse4_1 has_avx disable handling
5534- util: Handling GALLIUM_NOSSE in u_cpu_detect.c
5535- util: Handling LP_FORCE_SSE2 in u_cpu_detect.c
5536- llvmpipe: Remove the hack for LLVM4 in lp_bld_init.c
5537- gallium: Fixes memory leak of translate_test.c
5538- util: Use environment variable GALLIUM_OVERRIDE_CPU_CAPS to remove usage of util_cpu_caps in translate_test.c
5539- util: Move detect_done out of struct util_cpu_caps_t
5540- util: Add api util_call_once_with_context
5541- util: Add extern "C" guard in simple_mtx.h
5542- util: Remove the need of _MTX_INITIALIZER_NP in simple_mtx.h
5543- meson/panfrost: Add dep_valgrind for libpanfrost_pixel_format to fixes the compiling error:
5544- llvmpipe: Fixes warning [-Wimplicit-fallthrough] in lp_bld_swizzle.c
5545- llvmpipe: Fixes warning [-Wimplicit-fallthrough] in lp_bld_sample_soa.c
5546- radeonsi: Fixes warning [-Wimplicit-fallthrough] in si_descriptors.c
5547- freedreno: Fixes warning [-Wimplicit-fallthrough] in freedreno_util.h
5548- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd4_texture.h
5549- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd2_texture.c
5550- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd3_texture.c
5551- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd5_texture.h
5552- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd5_emit.c
5553- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd6_emit.c
5554- vdpau: Fixes warning [-Wimplicit-fallthrough] in vdpau_private.h
5555- v3dv: Fixes warning [-Wimplicit-fallthrough] in v3dvx_pipeline.c
5556- v3dv: Remove redundant unreachable in v3dvx_pipeline.c
5557- st: Fixes warning [-Wimplicit-fallthrough] in st_texture.c
5558- llvmpipe: Remove llvm version guard LLVM_VERSION_MAJOR <=> 4
5559- meson: bump the minimal required llvm version to 5.0.0
5560- util: Refactoring check_cpu_caps_override out
5561- util: Add ssse3 and avx cpu caps override option
5562- util: Move the detect of max_vector_bits into u_cpu_detect.c
5563- util: add support for detecting avx512 vector bit size
5564- llvmpipe: Always using util_get_cpu_caps to get cpu caps for llvm on x86
5565- llvmpipe: Enable avx512 support in llvm
5566- docs: document LP_NATIVE_VECTOR_WIDTH GALLIUM_OVERRIDE_CPU_CAPS GALLIUM_NOSSE LP_FORCE_SSE2
5567- ci: EXTRA_OPTIONS -> EXTRA_OPTION for debian-clang
5568- ci: Enable GALLIUM_DUMP_CPU=true only in the clang job
5569- ci: Enable all possible meson build options for clang
5570- ci: Add debian-clang-release build pipeline
5571- ci/fedora: Re-enable vulkan-layers=device-select,overlay
5572- gallium/hud: Fixes compile error in hud/hud_context.c:
5573- util: Fixes comment about debug_get_option
5574- mesa: direct use of mtx_recursive is not c11 conformance
5575- radeonsi: direct use of mtx_recursive is not c11 conformance
5576- svga: direct use of mtx_recursive is not c11 conformance
5577- c11: Update the values of enum mtx_plain, mtx_recursive, mtx_timed to make sure mtx_recursive != mtx_plain | mtx_recursive
5578- drm-shim: drop gnu99 override
5579- clover: Fixes use of designated initializers requires in c++ that doesn't support by MSVC
5580- radv: Remove the redundant #include <gelf.h> and #include <libelf.h> in ac_binary.c
5581- aco: Convert to use u8 literal for Unicode character to fixes msvc warning
5582- util: Add util_call_once for optimize call to util_call_once_with_context out for hot path
5583- util: getting get_tracefile in u_trace.c to be thread safe
5584- util: Remove the macro DEBUG_GET_ONCE_FILE_OPTION
5585- radv/r600/clover: Getting libelf to be optional
5586- radv: Getting radeon_icd to be generated properly on win32
5587- dzn: rename module_dir to icd_lib_path for consistence
5588- lavapipe/meson: rename module_dir to icd_lib_path and swap two lines for consistence
5589- Update 22.2.0.rst
5590- ci: debian-clang: drop -Wno-error=unused-function
5591- r600/sfn: Fixes 'numeric_limits' is not a member of 'std' error in sfn/sfn_instr.cpp
5592- aco: Fixes compiling error about char8_t with c++20
5593- ci/docs: Stick to specific version of python packages
5594- util: Remove the include of windows.h when compiling with MSVC
5595- util: Implement atomic operations consistently across compilers and testing for it
5596- util/mesa/wide: Rename _SIMPLE_MTX_INITIALIZER_NP to SIMPLE_MTX_INITIALIZER
5597- util: Turn -DWINDOWS_NO_FUTEX to be pre_args
5598- ci/windows: Enable gles1 for msvc
5599- ci/windows: Remove -Dlibelf:warning_level=1 as libelf subproject are already removed
5600- ci/windows: Getting the default supported windows version to be 7 when using mingw
5601- gallium: delete graw
5602- nir: No need redefine snprintf anymore in nir.h
5603- d3d10ump: No need define snprintf anymore in InputAssembly.cpp
5604- util: Move the implementation of futex_wake and futex_wait from futex.h to futex.c
5605- docs: Remove graw related words
5606- docs: Remove rbug related documents
5607- nouveau: Remove XVMC_VL environment variable usage
5608- docs: Remove document about XvMC
5609- meson: add dep_valgrind dependency for 'dxil_compiler'
5610- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_stack.c
5611- util: Call function mtx_init on exit_mutex to replace the usage of _MTX_INITIALIZER_NP in u_queue.c
5612- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_symbol.c
5613- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_stack_android.cpp
5614- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_memory.c
5615- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_M in u_debug_refcnt.c
5616- mapi: Use util_call_once to init exec_mem and mutex instead _MTX_INITIALIZER_NP
5617- mapi: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in stub.c
5618- gallium/auxiliary: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER
5619- egl/wgl: Use util_call_once_data to replace usage of mtx_t glFlushMutex
5620- egl/dri2: Use util_call_once_data to replace usage of mtx_t glFlushMutex
5621- mesa: Use util_call_once_data instead of mtx_t init_once_lock
5622- util: #include "c11/threads.h" when necessary in simple_mtx.h
5623- util: #include "util/macros.h" in u_cpu_detect.h for ATTRIBUTE_CONST
5624- c11: Remove the code for Windows XP
5625- meson/mesa/wide: define WIN32_LEAN_AND_MEAN with pre_args
5626- util: Remove the guard WIN32_WINNT >= 0x0600 in u_thread.h
5627- c11: Getting the macro guard of HAVE_PTHREAD in c11/threads.h to be consistence with util/u_uthread.h
5628- util: #include <windows.h> in disk_cache_os.c
5629- llvmpipe: #include <windows.h> in lp_rast.c
5630- util: Remove include of windows.h in memstream.h
5631- util: Remove include of windows.h in rwlock.h
5632- util: Move the function implementation from u_thread.h to u_thread.c
5633- glx: Guard usage of infer_zink && !explicit_zink in glxext.c
5634- glx: Trim trailing spaces in in glx/glxext.c
5635- glx: Remove redundant #ifndef GLX_USE_APPLEGL in glx/glxext.c
5636- glx: Adopt guard of glx_direct and glx_accel in #if defined(GLX_DIRECT_RENDERING) && !defined(GLX_USE_APPLEGL)
5637- glx: Adot #ifdef GLX_USE_WINDOWSGL in #if defined(GLX_DIRECT_RENDERING) && !defined(GLX_USE_APPLEGL)
5638- st: Remove st_api_type because ST_API_OPENVG is not an option anymore
5639- ci: libxvmc-dev is not a dependence anymore
5640- meson: upgrade zlib wrap
5641- ci: upgrade zlib to 1.2.13 in mesa_deps_build.ps1
5642- ci: Add Get-Date in each package build in mesa_deps_build.ps1
5643- ci: Move building libva before clone llvm-project in mesa_deps_build.ps1
5644- ci: Install the clang-cl compiler within mesa_deps_vs2019.ps1
5645- ci: Trigger building visual studio docker image
5646- meson: with_glx never assigned to 'gallium-xlib', so need add with_glx == 'xlib' for dep_glproto
5647- meson: with_glx never assigned to 'gallium-xlib', remove 'gallium-xlib'
5648- xlib: Remove macro guard with GLX_INDIRECT_RENDERING in xlib.c
5649- ci/macos: Building both dri and xlib version of glx
5650- hgl: Remove non-nused hgl_create_st_api
5651- st: move function st_visual_have_buffers into st_manager.c
5652- st: OpenVG are removed, remove st_api::profile_mask as it's constant
5653- st: OpenVG are removed, remove st_api::feature_mask as it's constant
5654- st: Remove st_api::destroy
5655- st: Turn st_api::destroy_drawable to be direct accessed function st_api_destroy_drawable
5656- st: Turn st_api::get_current to be direct accessed function st_api_get_current
5657- st: Turn st_api::make_current to be direct accessed function st_api_make_current
5658- st: Turn st_api::create_context to be direct accessed function st_api_create_context
5659- st: Turn st_api::query_versions to be direct accessed function st_api_query_versions
5660- xlib: Implement xmesa_get_name by return literal "Mesa " PACKAGE_VERSION directly
5661- st: Remove struct st_api,  st_gl_api_create and st_gl_api.h
5662- aco: Do not define NOMINMAX as it's already defined in pre_args now
5663- util: Remove non used UTIL_MAX_L3_CACHES in u_thread.h
5664- util: Merge duplicated function prototype in u_thread.h that named with util_barrier*
5665- util: Move UTIL_MAX_CPUS from u_thread.h to u_cpu_detect.h
5666- util: #include "c11/threads.h" instead of #include "util/u_cpu_detect.h"  u_format.c
5667- xlib: Fixes compiling on linux
5668- xlib: Remove the linkage hack in lib.c by use meson link_whole option
5669- gallium: Remove mgl* prefix in linker script osmesa.sym and libgl-xlib.sym
5670- xlib: Indent with space and trim trailing spaces of xlib files
5671- ci/macOS: Getting the installed binary to be artifacts
5672- ci: Getting debian-clang-release to building with glx=xlib
5673- xlib: Getting xmesa_strict_invalidate to be function
5674- xlib: Remove the usage of global constructor in xlib.c, so the xm_public.h are removed
5675- docs:  -Ddri-drivers= option is deprecated, do not use it anymore
5676- docs: There is only glx=xlib option now
5677- docs: There is no more swrast driver
5678- android: -Ddri-drivers= is not needed anymore
5679- glx: Fixes apple/apple_visual.c that include non-exist util/debug.h
5680- ci: Fixes macos.yml
5681- radv: Fixes prototypes
5682- meson: Refactoring shared gen_vs_module_defs_normal_command out
5683- meson: fixes mingw-clang32 building
5684- clover: Rename \*OpenCL.def to \*OpenCL.def.in
5685- clover: Fixes building with mingw-x86
5686- meson: Fixes name_prefix for clover on mingw
5687- clover: empty soversion when on win32
5688- util: Test __PPC64__ for getting PIPE_ARCH_PPC_64 respond to __PPC64__ take effect
5689- gallivm: use valid LLVM_VERSION_MAJOR instead of LLVM_VERSION
5690
5691Yusuf Khan (7):
5692
5693- nouveau: implement and enable PIPE_CAP_QUERY_MEMORY_INFO
5694- tu: remove tu_syncobj_to_fd
5695- nouveau: get timestamp from os_time_get_nano instead of os_time_get
5696- nouveau: put nv04 push macros in nouveau_winsys
5697- nv50/ir: nir_op_b2i8 and nir_op_b2i16
5698- gallium/util: add a helper for get_timestamp
5699- nouveau: enable PIPE_CAP_UMA when appropriate
5700
5701jenneron (1):
5702
5703- docs/panfrost: Document Mali-T620 support
5704
5705noasakurajin (1):
5706
5707- disable zinks shader cache when the needed functions do not exist
5708
5709nyanmisaka (2):
5710
5711- gallium: Do not include offscreen winsys if glx is disabled
5712- radeonsi: Inherit the old modifier when reallocating the texture.
5713
5714semjon00 (4):
5715
5716- hasvk: don't export flush_compute_state
5717- hasvk: don't export gfx state flushing helper
5718- hasvk: combine flushes in Draw/DrawIndexed/DrawIndirectByteCountEXT
5719- hasvk: force inline more pipe flush functions
5720
5721sjfricke (2):
5722
5723- isl: fix bug where sb.MOCS is not being set
5724- anv: fix assert to build with shader cache disabled
5725