1Mesa 22.3.0 Release Notes / 2022-12-02 2====================================== 3 4Mesa 22.3.0 is a new development release. People who are concerned 5with stability and reliability should stick with a previous release or 6wait for Mesa 22.3.1. 7 8Mesa 22.3.0 implements the OpenGL 4.6 API, but the version reported by 9glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / 10glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. 11Some drivers don't support all the features required in OpenGL 4.6. OpenGL 124.6 is **only** available if requested at context creation. 13Compatibility contexts may report a lower version depending on each driver. 14 15Mesa 22.3.0 implements the Vulkan 1.3 API, but the version reported by 16the apiVersion property of the VkPhysicalDeviceProperties struct 17depends on the particular driver being used. 18 19SHA256 checksum 20--------------- 21 22:: 23 24 644bf936584548c2b88762111ad58b4aa3e4688874200e5a4eb74e53ce301746 mesa-22.3.0.tar.xz 25 26 27New features 28------------ 29 30- GL_ARB_shader_clock on llvmpipe 31- VK_KHR_shader_clock on lavapipe 32- Mesa-DB, the new single file cache type 33- VK_EXT_attachment_feedback_loop_layout on RADV, lavapipe 34- VK_KHR_global_priority on RADV 35- GL_KHR_blend_equation_advanced_coherent on zink 36- VK_EXT_load_store_op_none on RADV 37- VK_EXT_mutable_descriptor_type on RADV 38- VK_EXT_shader_atomic_float on lvp 39- VK_EXT_shader_atomic_float2 on lvp 40- GL_NV_shader_atomic_float on llvmpipe 41- VK_EXT_image_robustness on v3dv 42- VK_EXT_extended_dynamic_state3 on lavapipe 43- VK_EXT_extended_dynamic_state3 on RADV 44- VK_EXT_pipeline_robustness on v3dv 45- Mali T620 on panfrost 46- Shader disk cache on Panfrost 47- support for R8G8B8, B8G8R8, R16G16B16 and 64-bit vertex buffer formats on RADV 48- initial GFX11/RDNA3 support on RADV 49- various ray tracing optimizations on RADV 50- extendedDynamicState2PatchControlPoints on RADV (VK_EXT_extended_dynamic_state2 feature) 51- Radeon Raytracing Analyzer integration (using RADV_RRA_* environment variables) 52- None 53 54 55Bug fixes 56--------- 57 58- r600,regression: Wrong color on a Minecraft shaderpack on Cayman 59- [iris/crocus] Native CS:GO OpenGL: fire produces black rendering artifacts 60- VDPAU GL interop is broken, causing Black and White video 61- panfrost: application crashes with 22.2.4/22.3.0-rc3 62- etnaviv: compiler assertion fail in glmark2 terrain 63- anv: dEQP-VK.pipeline.monolithic.multisample.variable_rate.8_1_fb_1 failure 64- DG2: Age of Empire IV incorrect rendering 65- r600: Sky not rendered correctly with the NIR backend on Transport Fever 2 66- "intel/compiler: Signed integer range analysis for imul_32x16 generation" regresses several OpenCL tests 67- aco: Missing 8-bit uadd_carry 68- !16168 broke Xephyr -glamor fallback to swrast 69- panfrost/rk3399: performance drop in emulationstation and in RetroArch menu [mesa >= 21.3.8, batocera.linux, bisected] 70- d3d12/VA-API: [-wmissing-prototypes] with MSYS2 MinGW-W64 UCRT64 GCC 71- clover: windows: library filename has \`-1` suffix and a \`lib` prefix when built with mingw 72- freedreno: !19236 broke anitchamber trace 73- radv, dxvk: Rendering errors in World of Tanks after "Switch to dynamic rendering only" 74- gen9 gt3e/gt4e skus fail dEQP-VK.pipeline.multisample.sample_locations_ext.* 75- Intel - GPU Hang when starting new game in Hellblade: Senua's Sacrifice 76- v3d: Wrong colors (pink) in videos in Firefox (likely YUV->RGB shader issue) 77- anv/zink: cts failures on ICL 78- panfrost t860 glmark-es2 regression 79- panfrost_drm.h build error 80- radv: Artifacts in multiple games with yuzu (regression, bisected) 81- Doom (1993) Ray-Traced has render glitches. 82- radv: Flickering in Spider-Man Remastered (Regression) (Bisected) 83- radv: Hitman 2 using Direct3D 12 has discolored squares on RDNA2 with DCC enabled 84- panfrost/midgard - on Duckstation PSX emulator: segfault on GLES 3.0 and bad shader compilations on 3.3 85- nouveau: Updating to Mesa*22.2.0 makes videos have corrupted colors in Firefox 86- [radeonsi] OpenGL rendering issue with Unreal Tournament 99 & OldUnreal 4.69 rc4 87- Replace the functionals in util/debug.h with util/u_debug.h 88- gallium: nir_opt_access desynchronizes image access modifiers in NIR with Program->sh.ImageAccess 89- [radv/navi21] No sprites being rendered in Quake 2 RTX 90- Hacknet does not render on Panfrost 91- !19259 broke some piglit arb_tessellation_shader tests with radeonsi if TCS passthrough shader is cached 92- panfrost g52 issue 93- [Intel] Doom Eternal crashes shortly after launching 94- GravityMark segfault when using ray tracing ANV 95- nir: Incorrect idiv lowering 96- intel-clc fails with "Unsupported SPIR-V capability" and SEGFAULT. 97- Navi 14 hangs running "rendercheck -t cacomposite -f a8r8g8b8" since !18170 98- glx=xlib can not be built under ubuntu 22.04 99- radv: incorrect query result for VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT 100- crocus: EGL_ANDROID_native_fence_sync extension is missing 101- freedreno/a6xx: spec@arb_tessellation_shader@execution@tess-instance-id fail 102- r300: wrong primitive positions in CXBX-R emulator with nine and RADEON_DEBUG=notcl 103- EGL re-initialization can fail silently 104- Rusticl: fails to create program if source size is stated, but last char is a null terminator 105- r300: register allocation sometimes generate invalid swizzles 106- Mesa git Upgrade causes cursor detection issues in Wayland, issue was not in 22.3.0-git.20220920.8d66c45.fc36 107- Proposal: Remove rbug, graw, graw tests, xa, xvmc 108- intrinsics.h:112:21: error: use of undeclared identifier 'intel_sub_group_shuffle' 109- tu: broken since dynamic pcp 110- radv: Crash in dEQP-VK.ray_query.misc.dynamic_indexing 111- t860: mediapipe application failed 112- glthread: radeonsi: offset textures in game starsector with glthread enabled 113- Crashing on Windows VM 114- zink: descriptor-handling regression 115- hasvk: noperspective is broken on Haswell hardware (Intel(R) HD Graphics 4400) 116- util/u_atomic.h can not be included in util/u_debug.h 117- radv: dynamic patch control points broken 118- Exanima renders with the wrong colors. 119- Missing tile cache flushes causing corruptions 120- turnip: compute shader hang in the Monster Hunter World 121- anv_private.h:118:10: fatal error: anv_entrypoints.h: No such file or directory 122- freedreno: mis-rendering on gardensofkadesh/homeworld 123- piglit.spec.egl 1_4.egl-ext_egl_image_storage times out 124- nouveau: tegra124: GL_OUT_OF_MEMORY error 125- r600: Shadows incorrecly rendered on Counter-Strike: Global Offensive with the NIR backend 126- r300: regalloc doesn't handle brk correctly 127- Recent WSI commit breaks every Vulkan application on RDNA 2 GPUs 128- 8775f08bf11 makes warcraft 3 crash with free(): double free detected in tcache 2 129- anv: probable ccs issue on icl 130- iris: remap_tess_levels assertion failure in shader-db runs 131- RADV: Mafia & Mafia III :Definitive Edition graphical issues with certain effects 132- Cyberpunk 2077 ground reflectiveness issues using RADV 133- [RADV] Grand Theft Auto V: Shadow rendering bug on Steam Deck 134- rusticl: fails to build on non-Linux: cannot find value \`stderr` in this scope 135- llvmpipe-piglit-rusticl fail at api/clenqueuefillimage 136- Broken build with Clang-16 due to: aco_register_allocation.cpp:382:7: error: no matching function for call to 'printf' 137- src/gallium/drivers/r600/sfn/sfn_instr.cpp:92:33: error: expected primary-expression before 'int' 138- r600: Make NIR the default backend 139- undefined reference to \`intel_ds_begin_xfb' 140- data-race leads to crash in u_trace_context_init() 141- radeonsi/radv: Discrepancy in handling wd_switch_on_eop 142- Shadows rendering issue in Deathloop 143- RADV: Regression with RPCS3 where nothing renders 144- [RADV][bisected] commit 3714c89d0ebf0c541d8f7a7a807ce7c1bfe3cfea at least lead to GPU hang when I run the game "Quake II RTX" 145- eglCreateContext doesn't generate EGL_BAD_MATCH when share_context is on a different EGLDisplay 146- Intel - Homerun Clash misrendering issue 147- r300: dEQP-GLES2.functional.shaders.indexing.tmp_array.float_dynamic_write_dynamic_loop_read_fragment regression 148- turnip/a660: dEQP-VK.spirv_assembly.instruction.terminate_invocation.terminate.subgroup_ballot failure due to "movmsk" reordered with "kill" 149- WSI reports 11 RGB bits is unsupported in X11, using zink/swrast 150- [radeonsi] Graphical corruption in Valve Source-based games 151- RADV: VK_EXT_load_store_op_none support? 152- radv: extendedDynamicState2PatchControlPoints 153- Vega 8 rendering GDM grey blank login screen after recent change 154- r300: possibility of merging ADD and MUL into MAD when effectively only 2 sources are used 155- MESA_DEBUG=incomplete_tex shows unnecessary validation during glClear 156- anv: Performance drop with Vulkan on Wayland 157- lima: crash during dEQP-EGL execution 158- Regression from https://gitlab.freedesktop.org/mesa/mesa/-/commit/8fcb4aa0ebd7b9d0d8f80986fb817afea2fc4a87 159- radv: Enable instruction timing with RGP for RT 160- Updates to 22.2 RCs cause artifacts on nouveau and blank screen on VirtIO 161- nouveau: corruption on NVAC after switching to the NIR backend 162- egl: Remove big-display-lock 163- DG2: vulkancts tests spin forever 164- [regression][bisected] v3dv: dEQP-VK.api.object_management.alloc_callback_fail_multiple.graphics_pipeline fails 165- v3dv: evaluating reducing current value of DEFAULT_MAX_BO_CACHE_SIZE 166- venus-lavapipe flakes 167- amd: LPDDR4 and LPDDR5 hits an assertion failure in ac_memory_ops_per_clock 168- zink: GL_TEXTURE_LOD_BIAS vs mipLodBias, and GTF-GL46.gtf21.GL3Tests.texture_lod_bias.texture_lod_bias_all failure on Turnip 169- ci: radeonsi-raven-skqp also tests RADV but only runs on radeonsi changes 170- Gallium OpenGL megadriver fails to build if LLVM is enabled but not required by any component built 171- panfrost: crash in dEQP test 172- bitpack_helpers.h:34:10: fatal error: valgrind.h: No such file or directory 173- [gen9_iris] piglit causes crash on 4GB-memory machine when running spec/!opengl 1.1/max-texture-size 174- Abort due to "Unknown intrinsic: intrinsic store_deref" (firefox, radeonsi) 175- radv: missing support for some 3-component vertex formats in RDNA2 176- radv: radv_translate_buffer_dataformat overhead 177- nir: Replacing GC with manual memory management 178- turnip: dEQP-VK.subgroups.vote.graphics.subgroupallequal_*int regression on a635 179- regression from !17949 180- r600/sfn: xonotic shows a big performance regression when using the NIR backend with sb disabled 181- r600/sfn: register allocation fails for a number of piglits and deqp tests 182- DG2 corruption in gnome desktop from F36 183- d3d12: Delete custom OpenGLOn12 target, add option to override libgallium_wgl.dll name 184- radeonsi: error allocating sparse buffers.. bug? 185- RADV: vulkaninfo fails to report extended VkShaderStageFlagBits for VK_KHR_ray_tracing_pipeline 186- [radeonsi] Transform feedback with array outputs fails 187- [BISECTED/radeonsi] Flickering triangles with wayfire cube 188- re-evaluate the linker for CI 189- [RADV] (Vega/Navi) Halo: The Master Chief Collection: Texture issue on Halo 3 and Reach 190- ../src/util/u_queue.c:218: util_queue_fence_destroy: Assertion \`fence->signalled' failed for iris when disabled UTIL_FUTEX_SUPPORTED 191- Freedreno turnip mesa cause emulation close after 15-20min gameplay 192- glsl: textureGatherOffset param broken 193- turnip: broken after switching to userspace vma 194- turnip: gfxbench aztec ruins crashes on zink 195- !16863 disabled all meson tests in CI 196- intel unittests failed a lot because recents commits Job Failed #27063524 197- Microsoft CLC runtime crash (at least on 32-bit) since !12510 198- Job Failed #26465985 199- v3dv: android: VTS big regression after 316728a55bc8fe398be1ac2796a22f8c91fb886c 200- radeonsi: driver crashes with kwin-effects-cube 201- [Turnip] Final Fantasy X: Incorrect hair rendering 202- iris: Suboptimal usage of HiZ + CCS for mipmapped buffers 203- Document state of Dozen Vulkan support in features.txt for MesaMatrix site.. 204- turnip: gfxbench car chase crash on zink 205- [r300] - osu! - Shader optimizations cause black and glitchy rendering 206- [Bisected] Microsoft CLC: build failure with MSVC and Dshared-llvm=disabled with AMDGPU target available 207- Android.mk: meson enables flags based on the libraries installed on the host system 208- anv: invalid returns from vkCreateDescriptorPool 209- [Bisected][RadeonSI] Blender crashes on startup 210- lavapipe CI times out 211- turnip: depth/stencil regressions 212- radv regression: Hitman 2 rendering issue 213- llvmpipe: assert in KHR-Single-GL45.enhanced_layouts.xfb_all_stages 214- android: vulkan: Disable blocked KHR_performance_query extension (v3dv, turnip, anv) 215- Poor performance/utilization with Splitgate 216- [Bisected][RADV] Rendering issues and performance drop with GFX10 on RPCS3 217- freedreno_gmem.c crashes Firefox Nightly in Khronos 218- freedreno / mesa 22.1.3 crashes Firefox Nightly 219- !17693 broke piglit oes_egl_image_external_essl3 220- VK_PRESENT_MODE_IMMEDIATE_KHR rendering artifacts 221- dzn: Build failure when it's the only driver built - 22.2-devel regression 222- d3d12: Unresolved external symbols from Version.lib 223- a618 vk_full timing out at 60 minutes instead of 120 224 225 226Changes 227------- 228 229Adam Jackson (25): 230 231- egl: Remove some can't-happen supported API checks 232- egl: Move an error check earlier in EGL_BUFFER_AGE_EXT query 233- wsi/x11: Defer clearing image busy flag for non-shm upload 234- egl/kopper: Don't add EGL_SWAP_BEHAVIOR_PRESERVED_BIT configs 235- glx: Use XSaveContext, delete glxhash.c 236- Revert "glx: Use XSaveContext, delete glxhash.c" 237- wsi/x11: Fix the is-visual-supported check 238- nouveau: const cleanup 239- egl/dri2: Respect the arguments to dri2_set_blob_cache_funcs 240- egl/dri2: Fix a typo in a comment 241- egl/dri2: Fix some thinkos in old context release 242- glx/dri*: Unify glx_context subclassing 243- glx/dri: Avoid a weird indirection in driFetchDrawable 244- glx: Remove some excess work from the GLX_FBCONFIG_ID fallback 245- egl: Fix a bunch of maybe-uninitialized warnings 246- egl: Promote _eglLock/UnlockDisplay to internal API 247- egl: Untangle some spaghetti around _eglLookupDisplay 248- egl: Factor out the eglTerminate write-lock pattern 249- egl: Rewrite eglSetBlobCacheFuncsANDROID to use the standard macros 250- mesa/fbo: Don't force both read/draw bindings to zero 251- egl: Formatting fix 252- egl: Factor some common terminate cleanup up to common code 253- egl/dri2: Fix a weird conditional in dri2_make_current 254- egl: Remove a bogus restriction from eglMakeCurrent 255- rusticl: meson devenv support 256 257Adrián Larumbe (5): 258 259- panfrost: Handle Job VA cycles when decoding a dump file 260- panfrost: Remove documentation reference to deprecated parameter 261- panfrost: Add compiler args option for building usermode tools 262- panfrost: Sync panfrost_drm.h from drm-misc-next 263- panfrost: Add userspace crash dump decoder and analyser 264 265Alejandro Piñeiro (11): 266 267- v3dv/bo: reduce DEFAULT_MAX_BO_CACHE_SIZE to 64 268- v3dv/pipeline: use a array instead of individual pointer to stages 269- v3dv/pipeline: keep qpu_insts around if we expect them to be used later 270- v3d: replace all TODO around for FIXME 271- broadcom/compiler: don't call nir_opt_load_store_vectorize on all v3d_optimize_nir calls 272- broadcom/compiler: update how we compute return_words_of_texture_data on non-ssa 273- broadcom/compiler: add more lowerings/optimizations on v3d_optimize_nir 274- v3dv/pipeline: use v3d_optimize_nir 275- broadcom/compiler: pass a strategy struct to vir_compile_init 276- broadcom/compiler: call nir_opt_gcm with a custom strategy 277- v3dv/bo: reset bo and then call gem close 278 279Alejandro Tafalla (1): 280 281- freedreno: Fix graphic glitches on a4xx and a5xx 282 283Alessandro Astone (3): 284 285- Android.mk: Disable lmsensors 286- Android.mk: Filter out --build-id=md5 linker flag 287- Android.mk: Provide a custom entry name to ensure meson checks succeed 288 289Alex Brachet (1): 290 291- nir: Fix qsort comparator function 292 293Alyssa Rosenzweig (414): 294 295- docs/asahi: Document varying interpolation 296- agx: Fix packing of samplers in texture instrs 297- agx: Fix ld_var cf packing 298- asahi: Plumb through lower_clip_fs 299- asahi: Extend counts in BIND packets 300- asahi: Allow large uniform records 301- asahi: Dump all textures&samplers 302- asahi: Fix using multiple textures/samplers 303- asahi: Use a single bind texture/sampler per pipeline 304- asahi: Split vertex/fragment pipeline binds 305- asahi: Correct bind fragment pipeline size 306- asahi: Encode known bits of Linkage in the XML 307- asahi: Fix varying XML 308- asahi: Decode Interpolation packets 309- asahi,agx: Rewrite varying linking 310- agx: Add AGX_MESA_DEBUG=noopt option 311- agx: Model perspective coefficient reg in the IR 312- agx: Rename varying load instructions 313- agx: Use split instead of extract for ldcf 314- agx: Implement noperspective interpolation 315- agx: Lower more ALU operations 316- agx: Align 64-bit register pairs 317- agx: Assert that registers are naturally aligned 318- agx: Extract umul_high implementation 319- agx: Implement nir_op_umul_high 320- agx: Stop using broken idiv lowering 321- agx: Handle type-changing splits 322- agx: Remove p_extract 323- agx: Only emit the used components of gl_FragCoord 324- asahi: Don't advertise ARB_clip_control yet 325- asahi: Identify depth clip mode bit 326- pan/decode: Don't pass around memory handles 327- pan/decode: Remove pandecode_msg 328- pan/decode: Stop passing job index around 329- pan/decode: Stop passing suffixes around 330- pan/decode: Simplify pandecode_fbd 331- pan/decode: Reorder MFBD decoding 332- pan/decode: Unify SFBD/MFBD decoding 333- pan/decode: Defeature disassembler stats 334- pan/decode: Centrally declare pandecode entrypoints 335- pan/decode: Clean up _bifrost_ decode routines 336- pan/mdg: Remove disassembler stats 337- panfrost: Don't segfault on unknown models 338- pan/bi: Implement some extracts and inserts 339- pan/bi: Fuse [US][8|16]_TO_F32 ops 340- pan/bi: Test int8/16 -> float32 opts 341- pan/bi: Optimize bitwise arithmetic of booleans 342- pan/bi: Optimize pattern from nir_lower_idiv 343- pan/bi: Don't use the broken idiv lowering 344- pan/bi: Use nir_opt_idiv_const 345- asahi: Fix warning building for macOS 12.0 346- pan/bi: Don't reorder image loads across stores 347- pan/bi: Don't allow ATEST to take a temporary 348- pan/bi: Handle info.fs.untyped_color_outputs on Valhall 349- panfrost: Handle untyped_color_outputs on Bifrost 350- panfrost: Set blit output variable types correctly 351- panfrost: Don't saturate in Bifrost blend shaders 352- panfrost: Promote blend shader outputs 8->16-bit 353- panfrost: Avoid blend shader when not blending 354- panfrost: Don't compile empty blend shaders 355- panfrost: Invert no_colour to enabled 356- panfrost: Simplify blitter blend shader creation 357- panfrost: Include mask in replace blend shader name 358- panfrost: Assert that blend shaders are nontrivial 359- r600/sfn: Don't use broken idiv lowering 360- gallium: Inline away util_make_fragment_tex_shader_writemask 361- gallium: Inline away util_make_fragment_tex_shader interp_mode 362- gallium: Remove util_make_fragment_tex_shader_xrbias 363- pan/mdg: Replicate swizzles for scalar sources 364- pan/mdg: Reexpress umul_high packing 365- pan/mdg: Use correct idiv lowering 366- pan/mdg: Always write return address to r1.w 367- pan/mdg: Print 3 sources for CSEL 368- panfrost/ci: Update G72 xflakes list 369- gallium: Add u_default_set_debug_callback 370- v3d: Use u_default_set_debug_callback 371- vc4: Use u_default_set_debug_callback 372- softpipe: Use u_default_set_debug_callback 373- lima: Use u_default_set_debug_callback 374- etnaviv: Use u_default_set_debug_callback 375- r300: Use u_default_set_debug_callback 376- panfrost: Route shader-db to debug, not stderr 377- pan/bi: Implement general 8-bit vector construction 378- pan/va: Handle 8-bit lane when lowering constants 379- pan/bi: Implement nir_op_vec8 and nir_op_vec16 380- pan/bi: Assert that vectors are sufficiently small 381- docs/asahi: Document image layouts 382- asahi: Identify compressed render target fields 383- asahi: Rename 'tiled 64x64' to 'twiddled' 384- asahi: Add XML for multisampled textures 385- asahi: Fix bind sizes 386- asahi: Fix ASTC enums 387- ail: Introduce image layout module 388- ail: Rewrite tiled memcpy for correctness 389- asahi: Handle 2-channel sRGB textures 390- asahi: Fix is_format_supported returns 391- asahi: Advertise ETC1 392- asahi: Advertise ASTC formats 393- asahi: Rename our fake twiddled DRM modifier 394- ail: Add unit tests for miptree layouts 395- pan/bi: Fix dual texturing with uniforms 396- pan/bi: Fix out-of-bounds write in va_lower_split_64bit 397- pan/bi: Consider all dests in helper_block_update 398- pan/bi: Model [IF]CMP_{OR,AND,MULTI} ops 399- pan/va: Split out compare instructions 400- pan/bi: Set I->nr_dests, I->nr_srcs 401- pan/bi: Model 3rd source for ATEST 402- pan/bi: Consider nr_srcs/dests in CSE 403- pan/bi: Use nr_{srcs,dests} in bi_foreach_{src,dest} 404- pan/bi: Add validation for nr_srcs/nr_dests 405- pan/bi: Remove trivial dest[0] = bi_null() 406- pan/bi: Clean up destination printing 407- pan/bi: Improve register printing 408- pan/bi: Register allocate BLEND dest on Valhall 409- pan/bi: Don't remove dests in DCE 410- pan/bi: Use bi_emit_split_i32 for trimming vecs 411- pan/bi: Introduce TEXC_DUAL psuedoinstruction 412- pan/bi: Assert destinations are non-NULL 413- pan/bi: Assume destinations are non-NULL 414- pan/bi: Don't read nonexistant destinations 415- pan/bi: Don't read nonexistant sources 416- pan/bi: Add variable dest/src support to builder 417- pan/bi: Use variable src/dest for collect/split 418- pan/bi: Use safe helpers to remove srcs/dests 419- pan/bi: Rebuild some instructions when lowering 420- pan/bi: Rebuild DISCARD when optimizing 421- pan/va: Use builder in va_lower_isel 422- pan/bi: Use builder for MUX -> CSEL opt 423- pan/bi: Use builder for IADD -> IADDC opt 424- pan/bi: Use builder for atomic lowering in scheduler 425- pan/bi: Reduce memory of scheduler access array 426- pan/bi: Dynamically allocate source/dests 427- pan/bi: Don't write registers in optimizer tests 428- pan/bi: Add bi_after_block_logical helper 429- pan/bi: Lift bi_predecessor_index helper 430- pan/bi: Add phi nodes 431- pan/bi: Validate phi ordering 432- pan/bi: Don't lower FAU for phis 433- pan/bi: Remove bogus assert lowering branches 434- pan/bi: Don't optimize if without else 435- pan/bi: Don't reorder phis in pre-RA scheduling 436- pan/bi: Validate before&after pre-RA sched 437- pan/bi: Inline DCE into bi_lower_vector 438- pan/bi: Translate phis from NIR 439- pan/bi: Add SSA-based liveness pass 440- pan/bi: Add brainless out-of-SSA pass 441- pan/bi: Reduce some moves when going out-of-SSA 442- pan/bi: Preserve SSA form from NIR 443- pan/bi: Clean up after converting to SSA 444- pan/bi: Clear reg in squeeze_index 445- pan/bi: Don't use bi_temp_reg 446- pan/bi: Assume SSA in copyprop 447- pan/bi: Assume SSA in mod prop 448- pan/bi: Assume SSA in CSE 449- pan/bi: Assume non-null sources in constant folding 450- pan/bi: Assume SSA in minor passes 451- pan/bi: Assume SSA when scheduling for pressure 452- pan/bi: Assume SSA when translating NIR 453- pan/bi: Assume SSA for helper invocation analysis 454- pan/bi: Move non-SSA liveness into RA 455- pan/bi: Remove NIR registers from the IR 456- pan/bi: Get rid of bi_get_node 457- pan/bi: Inline node_to_index 458- pan/bi: Remove bi_max_temp 459- pan/bi: Simplify bi_get_index prototype 460- pan/bi: Add and use bi_foreach_ssa_src macro 461- pan/bi: Strengenth assert in the validator 462- pan/bi: Remove assert(bi_is_ssa(dest)) 463- pan/bi: Add and use bi_replace_src helper 464- pan/bi: Add and use bi_num_successors helper 465- agx: Fix tib access in internal shaders 466- agx: Fix float copyprop of neg(neg) case 467- agx: Add more unit tests for float copyprop 468- agx: Don't hardcode uniform enums 469- agx: Defeature indirect vbufs 470- agx: Plumb through nir_op_txf 471- asahi: Fix Indexed Draw command 472- asahi: Split up unk 2b 473- asahi: Note some more unknowns in the XML 474- asahi: Make ctx->zs a pointer 475- asahi: Export agx_blitter_save 476- asahi: Handle empty fragment shaders 477- asahi: Use util_blitter_clear 478- asahi: Trim garbage at end of set shader 479- asahi: Relax assert in decoder 480- asahi: Label batch flush reasons 481- asahi: Fix depth for cube maps 482- asahi: Fix nonmipmapped array textures 483- asahi: Fix "stride" for tiled textures 484- asahi: Identify and use first level field of texture 485- asahi: Correct SET_SHADER_EXTENDED disambig bit 486- asahi: Add 1D and 1D Array enums 487- asahi: Handle out-of-bounds clear_stencil 488- asahi: Fix depth/stencil buffers 489- asahi: Identify stencil test enable 490- asahi: Split RASTERIZER into constituent words 491- asahi: Identify object type field via PowerVR 492- asahi: Don't use lower_wpos_pntc 493- asahi: Match PPP data structures with PowerVR 494- asahi: Hoist constant PPP state to start of batch 495- asahi: Dirty track everything 496- asahi: Make BO list growable 497- asahi: Annotate VDM/CDM commands as per PVR 498- asahi: Express VDM commands according to PowerVR 499- asahi: Handle Stream Link VDM commands 500- asahi: Allocate new cmdbufs if out of space 501- docs/envvars: Document Asahi variables 502- pan/decode: Fix job cycle detection 503- nir/lower_blend: Avoid emitting unnecessary fsats 504- panfrost: Respect buffer offset for OpenCL 505- panfrost: Honour cso->req_local_mem 506- nir: Add nir_intrinsic_texture_base_agx sysval 507- agx,asahi: Implement nir_intrinsic_load_texture_base_agx 508- agx: Make p_combine take a dynamic src count 509- agx: Implement texture offsets and comparators 510- agx: Implement txd 511- agx: Implement load_global(_constant) 512- agx: Lower txs to a descriptor crawl 513- agx: Don't use nir_find_variable_with_driver_location 514- asahi: Don't crash on <4 channel render targets 515- asahi: Handle blending with MRT 516- u_transfer_helper: Handle Z24X8 for drivers that don't use the interleaved transfer_map 517- u_transfer_helper: Pack Z24S8 to Z24-in-Z32F and S8 518- asahi: Assert cache line alignment on Z/S buffers 519- asahi: Identify ZLS Control word from PowerVR 520- asahi: Identify CDM block types 521- asahi: Decode CDM commands separate from VDM 522- asahi: Shuffle IOGPU structs 523- asahi: Identify IOGPU compute header 524- asahi: Decode IOGPU compute header 525- asahi: Assert that u_transfer_helper is well-behaved 526- asahi: Use the internal format internally 527- asahi: Identify spill buffer histogram 528- asahi: Simplify IOGPU attachment packing 529- asahi: Identify shared memory fields 530- asahi: Consolidate magic numbers for USC controls 531- asahi/genxml: Overflow up to words when packing 532- asahi: Overhaul USC control packing 533- asahi: Identify pixel stride 534- asahi: Identify shared memory layouts 535- panfrost: Evict the BO cache when allocation fails 536- rusticl: Build Panfrost 537- panfrost: Adapt emit_shared_memory for indirect dispatch 538- agx: Convert and clamp array indices in NIR 539- panfrost: Default pipe->clear_texture impl 540- panfrost: Allow compiling MESA_SHADER_KERNEL 541- panfrost: Upload default sampler for txf 542- panfrost: Bump PIPE_CAP_MAX_TEXTURE_ARRAY_LAYERS 543- panfrost: Advertise PIPE_CAP_INT64 544- panfrost: Honour flush-to-zero controls on Valhall 545- panfrost: Assume launch_grid parameters always change 546- pan/va: Fix missing swizzle on CLZ.v2u16 547- pan/bi: Unit test swizzle lowering 548- pan/bi: Lower some 8-bit swizzles 549- pan/bi: Test some 8-bit swizzle lowering 550- pan/bi: Lower swizzles for 8-bit shifts 551- pan/bi: Strip negate when lowering swizzles 552- pan/va: Pack IADD.sat bit 553- pan/va: Fix v4s8 form of R2 opcodes 554- pan/bi: Handle uhadd, urhadd opcodes 555- pan/va: Pack .rhadd bit 556- pan/bi: Add HADD.v4s8.rhadd packing test cases 557- pan/va: Add 8-bit integer max assembler case 558- pan/bi: Lower 8-bit min/max to bcsel+comparison 559- pan/bi: Lower f2i8, f2u8 560- pan/bi: Remove bogus assert for pack_32_2x16 561- pan/bi: Allow selecting from an 8-bit vec8 562- pan/bi: Handle swizzles in unpack_64_2x32_split_{x,y} 563- pan/bi: Lower <32-bit bit_count 564- pan/bi: Lower fisnormal 565- pan/bi: Scalarize phis before the opt loop 566- pan/bi: Call nir_lower_64bit_phis 567- pan/bi: Use .auto for image stores 568- pan/bi: Fix 1D array indexing on Valhall 569- pan/bi: Stub out scoped_barrier 570- pan/bi: Implement unpack_64_2x32 571- agx: Fix bfi_mask packing 572- agx: Emit fewer combines for intrinsics 573- agx: Use 16-bit immediates 574- agx: Validate immediates 575- agx: Test fsat+f2f16 together 576- agx: Ensure that the optimizer sees legitimate SSA 577- agx: Pass mask into ld/st_tile instructions 578- agx: Account for mask when writing registers 579- agx: Remove load_kernel_input path 580- panfrost: Remove load_kernel_input path 581- nir/lower_system_values: Fix cs_local_index_to_id with variable workgroups 582- asahi: Eliminate "Pixel Format" type from GenXML 583- asahi/genxml: Defeature uint/float 584- panfrost,asahi: Remove unused prepare macro 585- panfrost,asahi: Remove unused function 586- asahi,panfrost: Remove exact attribute 587- asahi: Use a header more like Intel's GenXML 588- asahi: Remove no-direct-packing 589- agx: Add iterators for phi/non-phi instructions 590- agx: Set PIPE_SHADER_CAP_INDIRECT_CONST_ADDR 591- agx: Emit smaller combines for nir_op_vec2/3 592- agx: Don't prefix pseudo-ops 593- agx: Improve printing of immediate sources 594- agx: Switch to dynamic allocation of srcs/dests 595- agx: Dynamically size split instruction 596- agx: Rename "combine" to "collect" 597- agx: Emit trap at pack-time, not during isel 598- agx: Print instructions as "dest = src" 599- agx: Print agx_dim appropriately 600- agx: Preload vertex/instance ID only at start 601- agx: Write to r0l with a "nesting" instruction 602- agx: Introduce ra_ctx data structure 603- agx: Pass in max regs as a paramter to RA 604- agx: Reserve live-in regs at the start of block 605- panfrost/ci: Disable t720 jobs 606- nir: Be smarter fusing ffma 607- mesa,gallium: Revert "Make point coord origin a CAP" 608- panfrost/ci: Remove stale fail 609- pan/mdg: Lower PIPE_COMPUTE_CAP_MAX_THREADS_PER_BLOCK on Midgard 610- pan/mdg: Limit work registers for large workgroups 611- pan/mdg: Fix 16-bit alignment with spiller 612- nir: Fix nir_fmax_abs_vec_comp 613- nir: Usher nir_normalize_cubemap_coords into 2022 614- mesa: Fix multiple matrix pops in a row 615- nir/lower_io: Set interpolated_input dest_type 616- u_transfer_helper: Handle Z24S8 with z24_in_z32f but no interleaving 617- asahi: Lower point sprites in driver 618- gallium: Default to PIPE_CAP_MIXED_COLORBUFFER_FORMATS 619- panfrost: Remove NIR_SERIALIZED support 620- asahi: Hold a reference to BOs in a batch 621- nir/opt_preamble: Move load_texture_base_agx 622- agx: Remove command line compiler 623- asahi: Remove AGX_FAKE_DEVICE environment variable 624- asahi: Precompile for shader-db 625- asahi: Route shader-db stats to debug callback 626- asahi: Identify "Uniform high" USC word 627- asahi: Identify more shader-related fields 628- asahi: Remove "padding" field 629- asahi: Make agx_varyings a union 630- asahi: Use USC_UNIFORM_HIGH 631- asahi: Rename LOD clamps tests to fit other packing 632- asahi: Don't use unnecessary test fixture 633- asahi: Add group tests 634- asahi: Support preamble shaders 635- agx: Be less sloppy about high uniforms 636- agx: Check that we don't push too much 637- agx: Make push_count public 638- agx: Handle multiple functions 639- agx: Align preamble shaders 640- agx: Add uniform_store instruction 641- agx: Allow larger indices for ld/st 642- agx: Strengthen assert for packing ld/st instructions 643- agx: Implement scalar load/store_preamble 644- agx: Handle uniforms passed to COLLECT 645- agx: Restrict copyprop of uniforms 646- agx: Handle 64-bit moves 647- agx: Avoid reading high uniforms from device_load 648- agx: Report GPRs to the driver 649- asahi: Set uniform counts accurately 650- asahi: Set GPR count accurately 651- agx: Add agx_nir_opt_preamble pass 652- agx: Use agx_nir_opt_preamble 653- panfrost: Don't use lower_wpos_pntc on Midgard 654- panfrost: Use proper formats for pntc varying 655- etnaviv: Use correct idiv lowering 656- nir/lower_idiv: Remove imprecise_32bit_lowering 657- nir/lower_idiv: Inline convert_instr_precise 658- panfrost: Zero polygon list for fragment-only 659- panfrost: Don't allow VS side effects on midgard 660- pan/bi: Clean up sysval handling a bit 661- pan/mdg: Fix 64-bit address arithmetic 662- panfrost: Use compute-based XFB on Midgard 663- panfrost: Avoid a XFB special case 664- panfrost: Lower MAX_BLOCK_SIZE on Midgard 665- pan/mdg: Don't schedule across memory barrier 666- panfrost,asahi: Support ARB_buffer_storage 667- gallium: Stub support for Asahi + DRM 668- gallium: Only use Asahi's software path on macOS 669- panfrost/ci: Disable trace-based testing 670- panfrost: Enable rendering to 16-bit and 32-bit 671- panfrost: Add lots of perf_debug annotations 672- asahi: Identify counts for compute kernels 673- softpipe: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT 674- i915g: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT 675- etnaviv: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT 676- gallium: Default to PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT 677- nir/opt_algebraic: Fuse c - a * b to FMA 678- pan/bi: Don't lower outputs for compute 679- panfrost: Don't use nir_variable in the compilers 680- pan/mdg: Disassemble the .a32 bit 681- pan/mdg: Don't pair ST_VARY.a32 with other instrs 682- pan/mdg: Use .u32 for flat shading 683- pan/mdg: Use bifrost_nir_lower_store_component 684- panfrost: Don't allocate space for empty varyings 685- panfrost: Don't use nir_variable to link varyings 686- panfrost: Delete set_global_resources 687- panfrost: Move small compute functions to pan_context.c 688- panfrost: Remove bogus assert 689- panfrost: Consolidate all shader compiling code 690- panfrost: Merge pan_assemble.c into pan_shader.c 691- panfrost: Remove unused req_input_mem copy 692- panfrost: Rename structs to panfrost_(un)compiled_shader 693- panfrost: Remove uncompiled_shader->active_variant 694- panfrost: Use u_dynarray for variants 695- panfrost: Make fixed_varying_mask a fragment-only key 696- panfrost: Precompile transform feedback program 697- panfrost: Precompile shaders 698- panfrost: Implement a disk cache 699- docs/new_features: Add Panfrost disk cache 700- panfrost: Don't copy resources if replaced 701- panfrost: Replace resource shadowing flush 702- panfrost: Fix build with Perfetto (again) 703- panfrost: Copy resources when necessary 704- panfrost: Require 64-byte alignment on imports 705- panfrost: Fix reference counting with batch->resources 706- panfrost: Use PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY 707- pan/mdg: Fix out-of-order execution 708- panfrost: Revert "Require 64-byte alignment on imports" 709 710Andri Yngvason (1): 711 712- gallium/vl: Add opaque rgb pixel formats 713 714Anton Bambura (1): 715 716- panfrost: Enable Mali-T620 717 718António Monteiro (2): 719 720- freedreno: get timestamp from os_time_get_nano instead of os_time_get 721- gallium/util: remove network class 722 723Axel Davy (6): 724 725- frontend/nine: Skip invalid swvp calls 726- frontend/nine: Fix buffer tracking out of bounds 727- frontend/nine: Fix ATOC handling 728- frontend/nine: Fix cso restore bug 729- frontend/nine: Fix shader multi-use crash 730- frontend/nine: Fix ff position_t fallback when w = 0 731 732Bas Nieuwenhuizen (56): 733 734- vulkan/wsi: Take max extent into consideration for modifier selection. 735- radv: Remove redundant radv_QueuePresentKHR. 736- radv: Add binding code for 3d sparse images. 737- radv: Add 3d tile shapes for sparse binding. 738- radv: Expose 3d sparse images. 739- amd/common: Don't rely on DCN support checks with modifiers. 740- amd/common: Disable DCC retile modifiers on RDNA1 741- radv: Don't flatten bottom AS exit if statement. 742- radv: Store top of stack in a register. 743- Revert "radv/rt: use derefs for the traversal stack" 744- radv: Use constant for ray traversal exit condition. 745- radv: Use nested ifs for pushing child nodes in traversal loop. 746- radv: Add scratch stack to reduce LDS stack in RT traversal. 747- ac/surface: Fix some warnings. 748- radv: Hardcode root node id. 749- radv: Translate the BVH copy shader to glsl from nir_builder. 750- radv: Remove aabb bounds from instance nodes. 751- radv: Use proper matrices for instance nodes. 752- radv: Use GLSL matrices for instance transforms in BVH. 753- radv: Use deterministic order for dumping acceleration stuctures. 754- radv: Properly initialize all memory in RRA dumps. 755- radv: Do an early check of distance in triangle culling. 756- vulkan/wsi/x11: Fix double free of modifiers. 757- radv: Clean up unused fields in BVH IR. 758- radv: Add a field for the offset of the bvh in the blas. 759- radv: Simplify buffer copy address generation. 760- radv: Start the BVH after the header. 761- radv: Generate parent links in BVH. 762- radv: Only emit parents from parents that actually end up in the tree. 763- radv: Split global & local bvh node variable. 764- radv: Add traversal backtracking with a short stack. 765- radv: Properly annotate all the invalid node usage. 766- radv: Implement pipeline-wide skiptriangles/skipaabbs. 767- radv: Consider inactive internal nodes never in the tree. 768- radv: Constify entry_size. 769- radv: Use a VkPipelineCache handle for meta shaders. 770- radv: Rework modification detection of the meta cache. 771- radv: Mark dEQP-VK.ray_query.misc.dynamic_indexing as crashing in CI. 772- aco: Pre-split result of bvh64_intersect_ray_amd. 773- radv: Improve gfx11 clear colors for DCC. 774- radv: Use correct primgrp size for gfx11. 775- radv: Add BVH IR header. 776- radv: Make the number of internal nodes be written on the GPU. 777- radv: Add PLOC shader 778- radv: Use PLOC for BVH building 779- radv: Speculatively tune RT pipelines for GFX11. 780- radv: Use correct types for loading the rings with LLVM. 781- radv: Handle GSVS ring intrinsic correctly with LLVM. 782- radv: Handle attribute ring intrinsic correctly with LLVM. 783- radv: Fix compute scratch buffer emission. 784- radv: Make the compute scratch waves per SE as well. 785- radv: Fix custom build id with C90 compilation. 786- radv: Use correct init order for ETC2 image views 787- radv: Fix sampler types in ETC2 decode. 788- vulkan: Remove asserts that weren't valid for RADV ETC2 emulation. 789- radv: reserve space for the scissor in vkCmdBeginRendering. 790 791Benjamin Tissoires (1): 792 793- CI: convert to use the new S3 server instead of the legacy minio 794 795Boris Brezillon (1): 796 797- ci/panvk: Skip dEQP-VK.api.object_management.max_concurrent.query_pool 798 799Brian Paul (113): 800 801- st_pbo/compute: fix memset() warning 802- lavapipe: s/u_foreach_bit/u_foreach_bit64/ in handle_pipeline_access() 803- llvmpipe: further bump LP_MAX_TGSI_SHADER_IMAGES to 64 804- llvmpipe: always pass non-zero writemask to assign_reg() 805- util: allow GALLIUM_LOG_FILE=stdout 806- gallivm: move lp_build_nir_aos_context declaration, etc 807- gallivm: change texture/sampler_index params to unsigned 808- llvmpipe: asst. formatting, clean-ups in lp_state_fs.c 809- gallivm: asst. clean-ups in lp_bld_sample_soa.c 810- gallivm: fix nir AOS swizzling issues 811- lavapipe: remove continue statements in emit_state() to be more consistent 812- lavapipe: zero-init sampler objects 813- cso: asst. clean-ups in cso_hash.[ch] 814- cso: use util_bitcount 815- cso: asst. clean-ups in cso_cache.[ch] 816- cso: asst. clean-ups in cso_context.[ch] 817- draw: asst. clean-ups in draw_context.[ch] 818- draw: asst. clean-ups in draw_vs_llvm.c 819- draw: asst. clean-ups in draw_vs_variant.c 820- draw: asst. clean-ups in draw_vs.[ch] 821- draw: remove goto in llvm_pipeline_generic() 822- draw: remove unused draw_has_llvm() 823- draw: asst. clean-ups in draw_pipe.h 824- draw: asst. clean-ups in draw_pt_fetch.c 825- draw: asst. clean-ups in draw_pt_post_vs.c 826- draw: asst. clean-ups in draw_pipe_cull.c 827- draw: asst. clean-ups in draw_pipe_clip.c 828- draw: asst. clean-ups in draw_pipe_flatshade.c 829- draw: asst. clean-ups in draw_pipe_offset.c 830- draw: asst. clean-ups in draw_pipe_pstipple.c 831- draw: asst. clean-ups in draw_pipe_twoside.c 832- draw: asst. clean-ups in draw_pipe_unfilled.c 833- draw: asst. clean-ups in draw_pipe_user_cull.c 834- draw: asst. clean-ups in draw_pipe_util.c 835- draw: asst. clean-ups in draw_pipe_wide_line.c 836- draw: asst. clean-ups in draw_pipe_wide_point.c 837- draw: asst. clean-ups in draw_vertex.[ch] 838- draw: asst. clean-ups in draw_pt_so_emit.c 839- draw: asst. clean-ups in draw_pipe_aapoint.c 840- draw: asst. clean-ups in draw_llvm.c 841- draw: asst. clean-ups in draw_cliptest_tmp.h 842- draw: asst. clean-ups in draw_gs.[ch] 843- draw: asst. clean-ups in draw_prim_assembler.[ch] 844- draw: asst. clean-ups in draw_pipe.c 845- draw: asst. clean-ups in draw_pt_vsplit_tmp.h 846- draw: asst. clean-ups in draw_split_tmp.h 847- draw: asst. clean-ups in draw_pt.c 848- draw: clean-up count assignment in draw_pt_arrays() 849- draw: asst. clean-ups in draw_private.h 850- draw: asst. clean-ups in draw_pt_fetch_shade_pipeline_llvm.c 851- draw: replace double conditional expression with if/else 852- draw: make draw_geometry_shader_run() return void 853- draw: fix some const-correctness in draw_vbo() 854- draw: move vs_slot var in tgsi/llvm_fetch_gs_input() 855- draw: s/start_or_maxelt/start/ in llvm VS code 856- draw: fix vertex id offset bug 857- draw: fix base vertex bug 858- llvmpipe: asst. clean-ups in lp_rast.h 859- llvmpipe: asst. clean-ups in lp_rast_debug.c 860- llvmpipe: asst. clean-ups in lp_rast_priv.h 861- llvmpipe: asst. clean-ups in lp_rast_rect.c 862- llvmpipe: asst. clean-ups in lp_rast.c 863- llvmpipe: asst. clean-ups in lp_rast_linear.c 864- llvmpipe: asst. clean-ups in lp_rast_linear_fallback.c 865- gallium: whitespace fixes in p_context.h, p_screen.h 866- llvmpipe: asst. clean-ups in lp_context.[ch] 867- llvmpipe: asst. clean-ups in lp_surface.c 868- llvmpipe: asst. clean-ups in lp_screen.[ch] 869- llvmpipe: asst. clean-ups in lp_query.c 870- llvmpipe: asst. clean-ups in lp_setup.c 871- llvmpipe: asst. clean-ups in lp_setup_rect.c 872- llvmpipe: asst. clean-ups in lp_setup_vbuf.c 873- llvmpipe: asst. clean-ups in lp_setup_line.c 874- llvmpipe: asst. clean-ups in lp_setup_point.c 875- llvmpipe: remove gotos in do_rect_ccw() 876- llvmpipe: asst. clean-ups in lp_setup_tri.c 877- llvmpipe: asst. clean-ups in lp_bld_alpha.c 878- llvmpipe: asst. clean-ups in lp_bld_blend_aos.c 879- llvmpipe: asst. clean-ups in lp_bld_depth.c 880- llvmpipe: asst. clean-ups in lp_bld_interp.c 881- llvmpipe: asst. clean-ups in lp_debug.h 882- llvmpipe: asst. clean-ups in lp_draw_arrays.c 883- llvmpipe: asst. clean-ups in lp_fence.c 884- llvmpipe: asst. clean-ups in lp_flush.[ch] 885- llvmpipe: asst. clean-ups in lp_linear_interp.c 886- llvmpipe: asst. clean-ups in lp_linear_sampler.c 887- llvmpipe: asst. clean-ups in lp_scene.c 888- llvmpipe: asst. clean-ups in lp_scene.h 889- llvmpipe: asst. clean-ups in lp_state_sampler.c 890- llvmpipe: asst. clean-ups in lp_state_derived.c 891- llvmpipe: asst. clean-ups in lp_state_clip.c 892- llvmpipe: asst. clean-ups in lp_state_cs.c 893- llvmpipe: asst. clean-ups in lp_state_gs.c 894- llvmpipe: asst. clean-ups in lp_state_rasterizer.c 895- llvmpipe: asst. clean-ups in lp_state_so.c 896- llvmpipe: asst. clean-ups in lp_state_surface.c 897- llvmpipe: asst. clean-ups in lp_state_tess.c 898- llvmpipe: asst. clean-ups in lp_state_vertex.c 899- llvmpipe: asst. clean-ups in lp_test_blend.c 900- llvmpipe: asst. clean-ups in lp_test_conv.c 901- llvmpipe: asst. clean-ups in lp_test_format.c 902- llvmpipe: asst. clean-ups in lp_test_main.c 903- llvmpipe: asst. clean-ups in lp_tex_sample.c 904- llvmpipe: asst. clean-ups in lp_texture.[ch] 905- llvmpipe: improve some if/switch code in llvmpipe_get_shader_param() 906- frontend/dri: assorted clean-ups in dri-screen.c 907- frontends/dri: clean-ups in dri_util.c 908- glx: clean-ups in create_context.c 909- glx: clean-ups in drisw_glx.c 910- llvmpipe: s/uint/enum pipe_prim_type/ in lp_setup_context.h 911- llvmpipe: fix comment typo 912- llvmpipe: asst. clean-ups in lp_state_fs.c 913- llvmpipe: check rectangle vertices have equal W components 914 915Caio Oliveira (38): 916 917- intel/compiler: Print more details when fs_visitor::validate() fails 918- intel/compiler: Call get_mesh_urb_handle one level up in call-stack 919- intel/compiler: Get URB handle in emit_task/mesh_intrinsic functions 920- intel/compiler: Make component() work for FIXED_GRF/ARF 921- intel/compiler: Remove INTEL_DEBUG=tcs8 922- intel/compiler: Rename 8_PATCH to MULTI_PATCH 923- intel/compiler: Use fs_reg helpers for TCS icp_handle selection 924- intel/compiler: Use fs_reg helpers for GS icp_handle selection 925- intel/compiler: Create fs_visitor::emit_tcs_barrier() 926- intel/compiler: Add helper for barrier message payload setup for gfx >= 125 927- intel/compiler: Use builder to allocate fs regs for TCS store output 928- intel/compiler: Make a type for Thread Payload and FS variant 929- intel/compiler: Use FS thread payload only for FS 930- intel/compiler: Create struct for TCS thread payload 931- intel/compiler: Store Patch URB output in TCS thread payload struct 932- intel/compiler: Store Primitive ID in TCS thread payload struct 933- intel/compiler: Store start of ICP handles in TCS thread payload struct 934- intel/compiler: Create and use struct for TES thread payload 935- intel/compiler: Create and use struct for VS thread payload 936- intel/compiler: Create and use struct for TASK and MESH thread payloads 937- intel/compiler: Create and use struct for GS thread payload 938- intel/compiler: Store start of ICP handles in GS thread payload struct 939- intel/compiler: Create and use struct for Bindless thread payload 940- intel/compiler: Export brw_get_subgroup_id_param_index() 941- intel/compiler: Create and use struct for CS thread payload 942- intel/compiler: Make thread_payload struct abstract 943- intel/compiler: Add a few more brw_ud* helpers 944- intel/compiler: Use brw_ud* helpers in thread payload code 945- nir/lower_task_shader: Don't fail adding a launch when last instruction is a jump 946- spirv: Pass vtn_block into vtn_emit_branch() 947- spirv: Add workaround to ignore OpReturn after OpEmitMeshTasksEXT 948- spirv: Call vtn_emit_ret_store() only when handling return branch 949- spirv: Make a helper function to check glslang/shaderc 950- spirv: Enable OpEmitMeshTasksEXT workaround for Clay Shader Compiler 951- intel/compiler: Explicitly include build-id when linking intel_clc 952- nir: Don't reorder volatile intrinsics 953- intel/compiler: Fix dynarray usage in intel_clc 954- intel/compiler: Fix missing tie-breaker in brw_nir_analyze_ubo_ranges() ordering code 955 956Caleb Cornett (3): 957 958- wgl: Homogenize read framebuffer reference counting logic 959- wgl: Always release local references in stw_make_current_by_handles 960- wgl: Release the context before deleting it in wglDeleteContext 961 962Chad Versace (43): 963 964- venus: Remove cryptic pointer arithmetic on present attachments 965- venus: Drop vn_present_src_attachment::acquire 966- venus: Track subpass properties 967- venus/pipeline: Refactor vn_fix_graphics_pipeline_create_info 968- venus/pipeline: Add pipeline state bools for ignore rules 969- venus/pipeline: Split ignore rules for pViewportState, pMultisampleState 970- venus/pipeline: Fix ignore rule for pDepthStencilState 971- venus/pipeline: Fix ignore rule for pColorBlendState 972- venus/pipeline: Ignore pViewportState::pViewports 973- venus/pipeline: Ignore pViewportState::pScissors 974- venus/pipeline: Ignore basePipelineHandle 975- venus/pipeline: Fix ignore rules for VK_KHR_dynamic_rendering 976- venus/pipeline: Fix ignore rules for VK_EXT_extended_dynamic_state 977- venus: Dedupe pipeline handle creation 978- venus: Fix failure path on pipeline creation 979- venus: Enable VK_EXT_pipeline_creation_cache_control 980- venus: Sort passthrough extensions 981- vulkan: Make vk_copy_struct_guts() public 982- venus: Simplify vn_GetPhysicalDeviceFeatures2 983- venus: Simplify vn_GetPhysicalDeviceProperties2 984- venus: Enable VK_KHR_shader_integer_dot_product 985- venus: Enable VK_KHR_shader_non_semantic_info 986- venus: Enable VK_KHR_shader_terminate_invocation 987- venus: Enable VK_KHR_zero_initialize_workgroup_memory 988- venus: Enable VK_EXT_subgroup_size_control 989- venus: Enable VK_EXT_texel_buffer_alignment 990- venus: Enable VK_EXT_ycbcr_2plane_444_formats 991- venus: Enable VK_KHR_format_feature_flags2 992- venus: Enable VK_EXT_private_data 993- venus: Enable VK_EXT_tooling_info 994- venus: Document VK_EXT_texture_compression_astc_hdr support 995- venus: Enable VK_EXT_pipeline_creation_feedback 996- venus: Rename some feature/property structs 997- venus: Refactor VN_ADD_TO_PNEXT 998- venus: Add macros VN_SET_CORE_* 999- venus: Fix features/properties for unavailable extensions 1000- venus: Use VkPhysicalDeviceVulkan13{Features,Properties} 1001- venus/queue: Rename some local vars 1002- venus: Change signature of vn_get_fence_feedback_cmd 1003- venus: Add vn_queue_wait_idle_before_present() 1004- venus: Refactor vn_command_buffer_builder temp storage 1005- venus: Enable VK_KHR_synchronization2 1006- venus: Enable Vulkan 1.3 1007 1008Charmaine Lee (6): 1009 1010- svga: fix mksstats build 1011- svga: fix SetConstantBufferOffset command opcode 1012- svga: compare graphics shader stages against COMPUTE 1013- mesa/st: fix reference to nir->info after nir_to_tgsi 1014- svga: support TGSI_SEMANTIC_TEXCOORD in swtnl draw context 1015- svga: fix invalid component access of domain location 1016 1017Chia-I Wu (83): 1018 1019- vulkan: sort ALLOWED_ANDROID_VERSION by api levels 1020- vulkan: fix ALLOWED_ANDROID_VERSION for api level 26 1021- vulkan: fix ALLOWED_ANDROID_VERSION up to api level 29 1022- vulkan: extend ALLOWED_ANDROID_VERSION up to api level 31 1023- turnip: add tu_common.h as the common header 1024- turnip: remove includes that are already in tu_common.h 1025- turnip: add tu_drm.h 1026- turnip: add tu_suballoc.h 1027- turnip: update tu_cs.h 1028- turnip: add tu_query.h 1029- turnip: add tu_image.h 1030- turnip: add tu_formats.h 1031- turnip: update tu_descriptor_set.h 1032- turnip: add tu_shader.h 1033- turnip: add tu_pipeline.h 1034- turnip: add tu_clear_blit.h 1035- turnip: add tu_dynamic_rendering.h 1036- turnip: add tu_lrz.h 1037- turnip: add tu_pass.h 1038- turnip: add tu_wsi.h 1039- turnip: update tu_autotune.h 1040- turnip: add tu_device.h 1041- turnip: add tu_cmd_buffer.h 1042- turnip: add tu_android.h 1043- turnip: update tu_util.h 1044- turnip: move away from tu_private.h 1045- turnip: remove tu_private.h 1046- turnip: remove headers from libtu_files 1047- turnip: use SPDX-License-Identifier 1048- vulkan: disallow VK_KHR_performance_query on android 1049- turnip: fix a use-after-free in autotune 1050- turnip: remove unused member in tu_submission_data 1051- turnip: handle fence wrapping in autotune 1052- freedreno/pps: improve interaction with turnip 1053- turnip: lower the queue priority to 1 1054- turnip: pCreateInfo->pPoolSizes[i] is wordy 1055- turnip: trim down host size of tu_descriptor_pool 1056- turnip: trim down bo size of tu_descriptor_pool 1057- turnip: improve perfetto sync_timestamp 1058- turnip: fix a missing trace_end_gmem_clear 1059- turnip: move trace_start_gmem_store before cond exec 1060- turnip: fix gem_store tracepoint 1061- turnip: tidy up tracepoint header includes 1062- util/u_trace: include the generated header first 1063- util/u_trace: add PERFETTO HeaderScope 1064- turnip: add tu_clone_trace_range helper 1065- turnip: convert tu_perfetto_state to a stack 1066- turnip: clean up tu_perfetto.h 1067- turnip: rename some tracing stages 1068- turnip: add cmd_buffer tracepoint 1069- turnip: improve tracing of secondary cmd buffers 1070- ir3: fix predicate splitting in scheduler 1071- util/perf: move u_perfetto to here 1072- util/perf: add cpu_trace.h 1073- util/perf: fix multiple tracepoints in a scope 1074- util/perfetto: add a simple C wrapper for track events 1075- util/perf: support and prefer perfetto for cpu trace 1076- venus: use MESA_TRACE_* 1077- pps: enable track_event in system.cfg 1078- docs: update perfetto with the latest status 1079- vulkan/wsi: add cpu tracepoints to swapchain functions 1080- vulkan/wsi: add cpu tracepoints for internal waits 1081- turnip: add some cpu tracepionts 1082- freedreno: get cpu timestamp again after FD_TIMESTAMP 1083- util/perf: ignore ut->enabled for iterators 1084- util/perf: remove u_trace::enabled 1085- util/perf: add u_trace_instrument 1086- util/perf: simplify u_trace_instrument 1087- util/perf: use ALWAYS_INLINE for tracepoints 1088- turnip: reduce MAX_STORAGE_BUFFER_RANGE 1089- vulkan: update ALLOWED_ANDROID_VERSION for api level 33 1090- subprojects: uprev perfetto to v29.0 1091- turnip: fix error handling for tu_queue_init 1092- turnip: add tu_physical_device::submitqueue_priority_count 1093- turnip: add support for VK_KHR_global_priority 1094- turnip: use vk_buffer 1095- turnip: use vk_descriptor_set_layout 1096- vulkan: update comments to device enumeration callbacks 1097- turnip: fix kgsl tu_enumerate_devices return code 1098- turnip: advertise VkExternalFenceProperties correctly 1099- venus: fix VkImageFormatListCreateInfo for AHB 1100- freedreno/pps: use 64-bit reads when possible 1101- freedreno/pps: loop countables by references 1102 1103Christian Gmeiner (18): 1104 1105- lima/ci: There is no need to redefine HWCI_TEST_SCRIPT 1106- lima/ci: Make use of .lava-piglit 1107- isaspec: Move isa_decode(..) declaration 1108- ci: include etnaviv support in ARM64 container 1109- ci: switch to mkbootimg.py 1110- ci/bare-metal: introduce BM_MKBOOT_PARAMS 1111- ci/etnaviv: add GC7000 support 1112- etnaviv: Remove possibility to force MSAA 1113- etnaviv: rs: Color resolve works 1114- etnaviv: rs: Directly call etna_try_rs_blit(..) 1115- etnaviv: rs: msaa: Set cache mode for SMALL_MSAA 1116- etnaviv: Update headers from rnndb 1117- etnaviv: blt: Add color resolve support 1118- etnaviv: blt: Directly call etna_try_blt_blit(..) 1119- Revert "etnaviv: completely turn off MSAA" 1120- etnaviv: Support negative float inline immediates 1121- etnaviv: nir: lower extract byte/word 1122- etnaviv: Hide MSAA support behind debug flag 1123 1124Christophe (2): 1125 1126- zink: add profile documentation 1127- Zink: add Zink profiles file 1128 1129Connor Abbott (49): 1130 1131- tu: Decouple vertex input state from shader 1132- tu: Implement VK_EXT_vertex_input_dynamic_state 1133- tu: Fix sysmem depth attachment clear flushing 1134- ir3/spill: Fix extracting from a vector at the end of a block 1135- nir: Free instructions more often 1136- tu/lrz: Fix multiple subpass case with secondaries 1137- tu/lrz: Fix multiple depth attachment case with secondaries 1138- tu: Fix descriptor set size bounds 1139- tu: Don't preload variable-count descriptors 1140- tu: Initial implementation of VK_EXT_inline_uniform_block 1141- tu: Also set SP_FS_MRT_REG for unused attachments 1142- tu: Fix case where vertex input is set but not vertex buffers 1143- tu: Remove unused need_indirect_descriptor_sets 1144- tu: Group pipeline state based on VkPipeline*StateCreateInfo 1145- tu: Rename RB_MSAA_CNTL to RB_BLIT_GMEM_MSAA_CNTL 1146- tu: Always set RB_BLIT_GMEM_MSAA_CNTL when clearing GMEM 1147- tu: Make MSAA emission always dynamic 1148- tu: Split out primtype calculation for tess 1149- ir3: Don't use msaa key for a6xx+ 1150- tu: Nuke tu_pipeline_cache 1151- tu: Don't use layer_zero/view_zero in shader key 1152- tu, ir3: Move multi_pos_output to ir3_shader_variant 1153- tu: Don't use output state to compute render components 1154- tu, ir3: Determine dual-src blend from shader for FS outputs 1155- tu: Emit \*_OUTPUT_CNTL1 as part of blend state 1156- tu: Move no_earlyz computation to blend/msaa state 1157- tu: Split up prim order computation 1158- tu: Use vk_pipeline_shader_stage_to_nir 1159- tu: Abstract driver-specific const state 1160- tu: Initial implementation of VK_EXT_graphics_pipeline_library 1161- tu: Fix maxPerStageDescriptorUpdateAfterBindInputAttachments 1162- tu: Set textures_used for input attachments correctly 1163- ir3/analyze_ubo_ranges: Account for reserved consts 1164- Revert "freedreno,ir3: rename Z_CLAMP_ENABLE to Z_CLIP_DISABLE" 1165- freedreno/a6xx, tu: GRAS_CL_CNTL::UNK5 is Z_CLAMP_ENABLE 1166- tu: Fix setting RB_DEPTH_CNTL::Z_CLAMP_ENABLE 1167- ir3: Delete outputs from fixup_regfootprint() 1168- ir3: Move fixup_regfootprint() to ir3_collect_info() 1169- ir3/analyze_ubo_ranges: Account for reserved consts better 1170- ir3: Add missing cat5 encoding to asm parser 1171- tu: Fix param_stride placement 1172- tu: DS primitive stride does not use patch control points 1173- tu: Implement extendedDynamicState2PatchControlPoints 1174- freedreno: Add LABEL flush 1175- tu: Add compute shader instrlen workaround 1176- tu: tu: Clear patchControlPoints dirty state with static patchControlPoints 1177- tu: Use right enum for compute active_shader_stages 1178- tu: Fix binding NULL descriptor sets 1179- ir3: Don't save/restore disasm string pointer 1180 1181Constantine Shablya (3): 1182 1183- nir: add a pass to remove non-uniform access qualifier when the operands are uniform 1184- radv: use nir_opt_uniform_access 1185- anv: use nir_opt_uniform_access 1186 1187Corentin Noël (9): 1188 1189- virgl/ci: Update virglrenderer 1190- ci: disable the freedreno farm. 1191- virgl/ci: Update virglrenderer 1192- gallivm: avoid the use of an uninitialized value 1193- virgl: Add TEXTURE_SHADOW_LOD capability support 1194- virgl: Add nir_lower_tex pass 1195- mesa/main: do not copy the exact size of the string 1196- gallium/hud: use snprintf(..., "%s", ...) instead of strncpy 1197- ci: Remove Wno-error=stringop-truncation from debian-arm64-asan 1198 1199Damian Korczowski (1): 1200 1201- d3d12: Fix placed footprint of subresources 1202 1203Daniel Schürmann (20): 1204 1205- nir/opt_algebraic: a & ~a -> 0 1206- aco: use std::vector::reserve() more often 1207- aco/live_var_analysis: implement faster merging of live_out sets for some cases 1208- aco/optimizer: disallow can_eliminate_and_exec() with s_not 1209- aco/optimizer: do can_eliminate_and_exec() optimization later 1210- aco/optimizer: optimize s_and(exec, s_and(x, y)) more aggressively 1211- aco/optimizer: change inverse_comparison in-place 1212- radv/rt: create separate radv_rt_pipeline struct 1213- radv/rt: inline radv_rt_pipeline_create_() helper into radv_rt_pipeline_create() 1214- aco: simplify operands_offset calculation in create_instruction() 1215- aco: implement custom memory resource 1216- aco: use monotonic_buffer_resource for instructions 1217- aco: implement allocator_traits for monotonic_allocator<T> 1218- aco/opt_value_numbering: use monotonic_allocator for unordered_map 1219- aco/spill: Fix spilling of Phi operands 1220- aco/ra: only rename fixed Operands if the copy-location matches 1221- aco: change thread_local memory resource to pointer 1222- nir: add AMD RT traversal intrinsics 1223- radv/rt: create traversal shader independent from main shader 1224- radv/rt: overwrite hit args with undef in case of a miss 1225 1226Daniel Stone (4): 1227 1228- ci: Use mold for x86-64 and AArch64 builds 1229- CI: Collabora farm down for maintenance 1230- CI: Re-enable Collabora devices 1231- Revert "panfrost/ci: Disable t720 jobs" 1232 1233Danilo Krummrich (10): 1234 1235- nv50/ir/nir: add conversion ops for bit width < 32 1236- nv50/ir/nir: convert to 32 bit for all OP_SET opcodes 1237- nv50/ir/nir: avoid 8/16 bit dest regs for OP_MOV 1238- nv50/ir: add isUnsignedIntType() and isIntType() helpers 1239- nv50/ir/nir: convert 8/16 bit src to 32 bit for {i,u}2f64 1240- nv50/ir/nir: always round towards zero for f2i/f2u 1241- nv50/ir: add intermediate conversion for f2{i,u}{8,16} 1242- nv50/ir: split and cvt 64bit integers for {i,u}2{i,u}{8,16} 1243- nv50/ir: handle S8/S16 integers converting to S64 1244- nv50/ir: handle U8/U16 integers converting to U64 1245 1246Danylo Piliaiev (26): 1247 1248- tu: Flush depth on depth img transition from undef layout 1249- tu: Disable LRZ write when alpha-to-coverage is enabled 1250- freedreno: Disable LRZ write when alpha-to-coverage is enabled 1251- tu: Assert that if draw state is enabled it has valid iova 1252- tu: Switch to userspace iova allocations if kernel supports it 1253- tu: Enable bufferDeviceAddressCaptureReplay 1254- tu: Disable userspace iova allocations 1255- tu: Implement VK_EXT_attachment_feedback_loop_layout 1256- freedreno: PC_SO_STREAM_CNTL_STREAM_ENABLE has per-stream enable bits 1257- tu: Fix streamout with tess_use_shared 1258- tu: Update HS_WAVE_INPUT_SIZE formula 1259- util/u_trace: Add locking for ctx_list 1260- freedreno: Name more \*_DBG_ECO_CNTL regs 1261- freedreno: Add all variable magic regs to device-info tables 1262- tu: Use newly obtained magic reg values 1263- tu: Trivially expose VK_EXT_rasterization_order_attachment_access 1264- ir3: Prevent reordering movmsk with kill 1265- turnip: implement VK_EXT_multi_draw 1266- turnip: Implement VK_EXT_mutable_descriptor_type 1267- tu: Expose Vulkan 1.3 1268- tu: Do not DCE unused output vars used for transform feedback 1269- tu: Lazily init VSC to fix dynamic rendering in secondary cmdbufs 1270- tu: Reset whether there is DS resolve for dynamic subpass 1271- tu: Fix the size of patch control points state 1272- freedreno: Add 'replay' tool which allows to replay cmdstreams 1273- freedreno/fdl: Increase alignment for UBWC images 1274 1275Dave Airlie (85): 1276 1277- gallivm: fix printf hook for cached shaders. 1278- llvmpipe: add shader clock support 1279- lavapipe: enable shader clock 1280- draw: fix up jit type creation for gs/tcs/tes 1281- llvmpipe/gallivm/draw: introduce a buffer type. 1282- zink: add defines for the shader stage templates. 1283- virgl: abstract virgl shader stages from pipe shader stages. 1284- svga: compare shader type against compute not tess eval. 1285- gallium/tgsi: reorder pipe shader type defines. 1286- gallium/iris/crocus: collapse a bunch of conversion functions. 1287- gallium: fixup some inconsistent uses of enum pipe_shader_type. 1288- gallium: use gl shader types as the basis for the gallium ones 1289- gallivm/nir/st: lower image derefs in advance. 1290- lavapipe: lower images to non-derefs in vulkan side 1291- lavapipe: scan shader for info before lowering images. 1292- llvmpipe/ci: update ci results for clover. 1293- draw: don't touch info values that aren't valid. 1294- nir_to_tgsi_info: drop const_buffers_declared 1295- crocus: sync performance monitor code with iris. 1296- vulkan: update rest of the headers to v1.3.225 1297- gallivm: drop unused parameter to lp_build_sample_aos 1298- gallivm/sample: remove unused base parameter from dynamic callbacks. 1299- gallivm: don't indirect image/sampler destroy. 1300- llvmpipe: finish rendering before flushing frontbuffer resources. 1301- radv: move to using common record_result. 1302- turnip: use common command record result. 1303- panvk: use common command buffer record result. 1304- dozen: move to use common command buffer result code. 1305- lavapipe: fix 3d depth stencil image clearing. 1306- gallivm: add atomic 32-bit float support 1307- llvmpipe: export GL_NV_shader_atomic_float 1308- lavapipe: export VK_KHR_shader_atomic_float 1309- docs: add new llvmpipe/lavapipe atomic float extensions 1310- lavapipe: use slow clear path if conditional rendering. 1311- llvmpipe: handle different blocksizes for sampler/image views 1312- lavapipe: take layers into account around render surface creation. 1313- lavapipe: handle view mask layer count properly for begin rendering. 1314- llvmpipe: memset empty images in the key 1315- llvmpipe: bind sampler views/images properly. 1316- gallivm: handle llvm coroutines for llvm > 15 1317- gallivm: fix buffer_type for LLVM 15 1318- lavapipe: add fmin/fmax to image lowering. 1319- gallivm/nir: fix fmin/fmax translation 1320- gallivm/nir: bitcast when non-float ptr type. 1321- llvmpipe/fs: start passing explicit context pointer type. 1322- llvmpipe/fs: convert thread data ptr to opaque ptr friendly apis 1323- llvmpipe/fs: convert linear context to opaque pointers friendly 1324- llvmpipe/cs: convert cs context to opaque friendly api 1325- llvmpipe/cs: convert cs thread data to opaque friendly api 1326- llvmpipe/fs: fix invocations access for opaque ptrs. 1327- llvmpipe/setup: move setup code to explicit pointers. 1328- llvmpipe/cs: move compute code to explicit pointer types 1329- gallivm/struct: add opaque ptr friendly pointer accessors. 1330- llvmpipe/fs: add mask_type for mask_store accesses. 1331- llvmpipe/fs: pass explicit mask_type into interp code. 1332- llvmpipe/fs: port depth code to opaque pointer api 1333- llvmpipe/fs: add sample position type to the interp interface 1334- llvmpipe/fs: pass mask type to alpha to coverage handler 1335- llvmpipe/fs: use explicit api in viewport code. 1336- llvmpipe/fs: handle explicit types around blending and c/zs bufs calcs 1337- llvmpipe/fs: cleanup some remaining mask handling and reuse types 1338- llvmpipe/fs: convert outputs handling to explicit api 1339- gallivm/nir: move to explicit pointer interfaces. 1340- gallivm/nir: fixup atomic path for explicit pointers. 1341- gallivm/sample: move border color load to explicit pointers 1342- gallivm/llvmpipe: hand sample position type in for loading. 1343- llvmpipe/fs/linear: port to explicit pointers. 1344- draw: fix gs/tes explicit pointer handling 1345- draw: handle primid for tess being a different pointer type 1346- gallivm/tgsi: port tgsi to explicit types. 1347- gallium/tgsi: handle temps/outputs array. 1348- vulkan: update beta and video headers to 1.3.230 1349- llvmpipe/tests: port to new pointer interfaces. 1350- gallivm/llvmpipe: add opaque pointers support to sampler 1351- draw/llvmpipe: move texture/sampler/image member load to opaque. 1352- draw/llvmpipe: add way to return pointer types to generic code. 1353- gallivm/sample: use retrieved types to do opaque pointer loads. 1354- gallivm: remove legacy pointer_get apis 1355- gallivm/sample: move some first_level/last_level calcs out 1356- gallivm/nir: drop some unused struct members. 1357- gallivm/sample: refactor multisample offset calcs code. 1358- glsl/types: fix dword slots calc for float16 matricies. 1359- clover/llvm: add kernel arg info for opaque pointers 1360- nir/lower_bool_to_int32: add support for lowering functions. 1361- gallivm: add coro malloc hooks earlier and always. 1362 1363David Heidelberg (71): 1364 1365- ci: compress LAVA rootfs with zstd instead of gzip 1366- ci: replace gzip usage with zstd where posible 1367- ci/bin: ignore Gitlab GQL cache 1368- ci: Turn off the entire Lima farm (devices report out-of-space) 1369- ci: move DXVK instalation outside of x86_test-vk 1370- ci: make shellcheck happy on dxvk script 1371- ci: separate wine setup into own script 1372- ci: Windows runner is experiencing DNS issues; disable Microsoft farm 1373- ci: prepare piglit-traces for WINE and DXVK 1374- ci/lavapipe: implement traces 1375- docs/ci: show how to run traces locally 1376- ci/lavapipe: wine flakes, switch to manual for now 1377- ci: use apt-get in x86_test-vk instead of apt 1378- ci: do not install i386 sub-architecture in VK images 1379- ci: traces: update freedreno and broadcom to brotli compressed traces 1380- ci: GraphQL was already implemented in ci_run_n_monitor.py 1381- ci: implement stress testing jobs in ci_run_n_monitor.py 1382- ci: fix leftover tag in image-tags.yml 1383- ci: rename debian-gallium to debian-build-testing 1384- ci: move lava-test into debian-build-testing 1385- ci: make shellcheck happy about .gitlab-ci/container/ directory 1386- ci: use shellcheck for .gitlab-ci/container/ directory 1387- ci: drop last bits of wine32 1388- ci: bump cross compilation to LLVM 13 where possible 1389- ci: uprev piglit 2022-08-30 1390- ci: set venus on lavapipe to manual due to flakes 1391- ci: print env as other setups do 1392- ci/lava: print set-job-env-vars.sh as other setups do 1393- ci/traces: do not keep a cache, which will be wiped a few steps later 1394- ci/test: collapse printing env 1395- ci/lava: collapse printing env 1396- ci/freedreno: convert traces to new YAML format 1397- ci/virgl: convert traces to new YAML format 1398- ci/llvmpipe: convert traces to new YAML format 1399- ci/lavapipe: convert traces to new YAML format 1400- ci/broadcom: convert traces to new YAML format 1401- ci/panfrost: convert traces to new YAML format 1402- ci/radeonsi: convert traces to new YAML format 1403- ci/intel: convert traces to new YAML format 1404- ci/i915: convert traces to new YAML format 1405- ci/crocus: convert traces to new YAML format 1406- ci: check traces.yml files with yamllint 1407- ci: performance traces: make use of no-perf label 1408- ci: export whole results/ directory for all traces jobs 1409- ci: split ci_run_n_monitor into script and shared parts 1410- ci: introduce update_traces_checksum.py 1411- ci: uprev piglit 2022-09-08 1412- ci: use xargs instead of find -exec 1413- ci: add jq utility 1414- ci/traces: remove first line with YAML version to prevent failure 1415- ci/radeonsi: drop glmark2 terrain trace 1416- ci/panfrost: drop glmark2 terrain trace 1417- ci/intel: drop glmark2 terrain trace 1418- ci/lava: add wine and apitrace into amd64 rootfs container 1419- ci/lava: prepare wineprefix for GL and DXVK 1420- ci/dxvk: fix DXVK 64-bit only wine setup 1421- ci: also handle default wine rootfs (for testing) 1422- ci/traces: prevent showing wine instalation dialog 1423- ci/radeonsi: add traces for radeonsi Zork 1424- ci/amd: move radeonsi in amd directory 1425- ci: uprev DXVK to 1.10.3 1426- ci/traces: Blender demo (Cube Diorama) flakes on Intel APL 1427- ci/freedreno: add Guilherme Gallo into restricted traces access list 1428- ci/panfrost: deduplicate gitlab-ci.yml 1429- ci/panfrost: enable piglit-gl on g52 again and deparalelize 1430- ci/panfrost: Humus Portal trace got fixed, update checksum 1431- ci/panfrost: re-enable traces on t760 1432- ci/linker: update mold to 1.6 and enable on s390x 1433- ci/zink: rename zink job to zink-lvp to better describe it 1434- ci/freedreno: disable antichambers trace 1435- ci/update_traces_checksum.py: check if checksum is in the array, not it's value 1436 1437David Riley (1): 1438 1439- drm-shim: Allow drm-shim to work with glibc fortify. 1440 1441Dawn Han (11): 1442 1443- Update venus-protocol to add extension \`VK_VALVE_mutable_descriptor_type` 1444- Increase enum type in vn_descriptor_set. 1445- Update bitset when init descriptor layout. 1446- Track bitset when create descriptor pool 1447- Enable .VALVE_mutable_descriptor_type in physical_device 1448- venus: support the promoted VK_EXT_mutable_descriptor_type 1449- venus: Update mutable descriptor struct type 1450- venus: extend lifetime of push descriptor set layout 1451- venus: extend VkPipelineLayout lifetime for batched VkCmdPushConstants() 1452- venus: implement vkCmdPushDescriptorSetWithTemplateKHR 1453- venus: enable VK_KHR_push_descriptor 1454 1455Diogo Ivo (1): 1456 1457- nouveau: treat DRM_FORMAT_INVALID as implicit modifier 1458 1459Dmitry Baryshkov (1): 1460 1461- freedreno/registers: update hdmi registers to add more 8x74 regs 1462 1463Dmitry Osipenko (11): 1464 1465- util/disk_cache: Add option to disable compression 1466- util/disk_cache: Enable testing of uncompressed caches 1467- util/disk_cache: Append to disk_cache_load_cache_index() with _foz 1468- util/disk_cache: Move struct cache_entry_file_data to the disk_cache_os.h 1469- util/disk_cache: Make disk_cache_os.h usable by C++ code 1470- util/disk_cache: Add new mesa-db cache type 1471- virgl: Fix unmapping of blob resources 1472- llvmpipe: Align persistent mappings to page size 1473- ci: Bump crosvm version 1474- ci: Update virgl-iris-traces checksums 1475- ci/virgl: Enable virgl-iris-traces 1476 1477Dylan Baker (47): 1478 1479- docs: reset new features 1480- docs: add release notes for 22.1.5 1481- docs: Add sha256 sum for 22.1.5 1482- docs: add release notes for 22.1.4 1483- docs: Add sha256 sum for 22.1.4 1484- docs: update calendar and link releases notes for 22.1.4 1485- docs: update calendar and link releases notes for 22.1.5 1486- docs: update calendar for 22.2.0-rc1 1487- docs: add release notes for 22.2.0 1488- docs: Add sha256 sum for 22.2.0 1489- docs: update calendar and link releases notes for 22.2.0 1490- docs: Add calendar entries for 22.2 release. 1491- intel/genxml: use a single pattern for res 1492- intel/genxml: use a set for lookups 1493- intel/genxml: don't use parens with python assert statement 1494- intel/genxml: add type annotations to gen_sort_tags.py 1495- intel/genxml: use \`a not in {x, y}` instead of \`a != x or a != y` 1496- intel/genxml: remove unused variable in gen_sort_tags.py 1497- intel/genxml: don't overwrite the input of gen_sort_tags in place 1498- intel/genxml: add 'nonzero' to gen_sort_tags 1499- intel/genxml: add a validation mode to gen_sort_tags 1500- intel/genxml: run gen_sort_tags on all of the xml 1501- meson: run genxml sort tests 1502- docs: add release notes for 22.2.1 1503- docs: Add sha256 sum for 22.2.1 1504- docs: update calendar and link releases notes for 22.2.1 1505- intel/genxml: write a space before the '/>` closing tag 1506- intel/genxml: re-process with space before \`/>` 1507- intel/genxml: remove newlines between xml islands 1508- intel/genxml: re-process with extra whitespace removed 1509- intel/genxml: use ElementTree.write and .indent 1510- intel/genxml: reprocess xml with elementree output 1511- intel/genxml: split the processing code up 1512- intel/genxml: use a main() function 1513- intel/genxml: use Path.replace instead of unlink and rename 1514- docs: add release notes for 22.2.2 1515- docs: Add sha256 sum for 22.2.2 1516- docs: update calendar and link releases notes for 22.2.2 1517- docs: Extend calendar entries for 22.2 by 2 releases. 1518- util/indicies: move common static functions to private header 1519- util/indicies: use itertools.product instead of deeply nested loops 1520- util/indicies: write a file with u_indicies_gen.py 1521- util/indicies: simplify some of u_indicies_gen.py 1522- util/indicies: use itertools.product in u_unfilled_gen.py 1523- util/indicies: convert u_unfilled_gen.py to write a file 1524- util/glsl2spirv: fix type error in argument handling 1525- util/glsl2spirv: fix appending extra flags 1526 1527Ella Stanforth (2): 1528 1529- util: fix missing fcntl.h on musl 1530- vulkan: wsi: fix musl build 1531 1532Emma Anholt (108): 1533 1534- ci/bare-metal: Re-open serial and everything after test phase timeout. 1535- zink: Take the tiling path for resources with modifiers. 1536- zink: Replace the "optimal_tiling" flag with a "linear" flag instead. 1537- ci/zink: Re-enable a630 zink-on-turnip traces job using gbm as the backend. 1538- ci/swrast: Add some flakes I've noticed in the IRC channel. 1539- ci: Add testing of the khr-single tests. 1540- ci: Upgrade deqp-runner to 0.15.0. 1541- zink: Make sure that we keep the existing ici pNext chain on inserts. 1542- tu: Move the vkCreateImage format list checks to helper functions. 1543- tu: Treat viewFormatCount==0 as no format list. 1544- tu: Use the format list to decide whether we can do UBWC with MUTABLE. 1545- ci/tu+zink: Re-enable the traces that were asserting about UBWC compatibility. 1546- ir3: Suppress disasm of internal shaders unless IR3_SHADER_DEBUG=internal. 1547- tu: Only emit as many bindless regs as we have seen descriptor sets. 1548- tu: Only emit as many VPC interp/repl regs as will be referenced. 1549- tu: Only emit FS output regs for as many MRTs as we have (but at least 1). 1550- tu: Emit only as many VBs as we've ever seen bound on the command buffer. 1551- ci/tu: Add another cwe_after_bind fail that appeared in the reshuffle. 1552- ci: Move 'never' rules includes above "on_success" rules includes. 1553- ci: disable the freedreno farm. 1554- freedreno/regs: Add a bit of documentation of what SKIP_IB2 does. 1555- turnip: Fix enabling of IB2 skipping. 1556- nir/lower_mediump: Lower FS outputs to 16-bit when the value was upconverted. 1557- nir_to_tgsi: Add support for TG4 with explicit offsets. 1558- nir_to_tgsi: Add support for 64-bit fsign/isign. 1559- nir_to_tgsi: Add support for demote, is_helper_invocation, and subgroup ops. 1560- nir: Make nir_lower_discard_if() handle demotes and terminates, too. 1561- nir_to_tgsi: Use nir_lower_discard_if for demote_if. 1562- nir_to_tgsi: Add support for bindless textures and images. 1563- turnip: Make sure bandwidth config per pixel starts from 0. 1564- freedreno/ir3: Move chip-specific nir compiler options to C code. 1565- freedreno/ir3: Switch to NIR for a3xx/a4xx's vertex id lowering. 1566- gallium,glsl: Delete PIPE_CAP_VERTEXID_NOBASE and lower_vertex_id. 1567- nir: Add a pass to lower mediump temps and shared mem. 1568- spirv: Mark phis as mediump instead of directly lowering them to 16 bit. 1569- turnip: Enable lowering of mediump temps/CS shared to 16-bit. 1570- turnip: Treating non-d/s-write pipelines as not having d/s feedback loops. 1571- mesa: Remove PIPE_CAP_CS_DERIVED_SYSTEM_VALUES_SUPPORTED/lower_cs_derived. 1572- zink: Don't upload shader immediate arrays through UBO 0. 1573- zink: Don't lower indirect derefs of temp arrays. 1574- Revert "ci: disable the freedreno farm." 1575- ci: Add support for Jetson TK1. 1576- nouveau: Fix compiler warnings about silly address checks in ir_print. 1577- ci/turnip: Add missing a618 full-run bypass fails. 1578- turnip: Only emit descriptor loads for active stages in the pipeline. 1579- freedreno/ir3: Consistently lower mediump inputs to 16-bit (when we can). 1580- nir/lower_mediump_vars: Don't lower mediump shared vars with atomic access. 1581- mesa: Lower mediump temps and CS shared when the driver supports FP16+INT16. 1582- turnip: Add some missing LRZ disable debug. 1583- turnip: Ignore dynamic color write enables past our number of attachments. 1584- turnip: Don't look at RB.Z_READ_ENABLE for setting LRZ.Z_TEST_ENABLE. 1585- turnip: Skip rather than invalidate LRZ on gl_FragDepth writes. 1586- turnip: Ignore pDescriptorCounts[] for non-variable-count layouts. 1587- turnip: Keep a host copy of push descriptor sets. 1588- ci/turnip: Use all 9 a618 runners at once for VK testing. 1589- ci/turnip: Bump the full-run a618 runner count to 3. 1590- ci/turnip: Re-enable the compressed cubemap tests. 1591- ci/turnip: Allow running spirv_ids_abuse in full VK runs. 1592- ci/turnip: Drop a couple of spillall skips. 1593- ci/turnip: Move some 15-second-ish test skips to pre-merge skips. 1594- ci/nouveau: Drop BM_POE_TIMEOUT. 1595- ci/nouveau: Rearrange job setup variables. 1596- ci/nouveau: fix up Jetson Nano 1597- ci/nouveau: Update gm20b xfails. 1598- ci: Update kernel to drm-next-2022-08-12-1 1599- turnip: Fix leak of autotune fence BOs. 1600- turnip: Add TU_DEBUG=bos to print stats of BOs live at submit time. 1601- turnip: Fix the "written stencil is unmodified" check. 1602- turnip: Use the simplified stencil write flags for the LRZ-allowed check. 1603- ci/zink+turnip: Add a manual full run of the dEQP CTS. 1604- turnip: Fix busy-waiting on syncobjs with OS_TIMEOUT_INFINITE. 1605- util/dag: Move the callback function pointer to the state. 1606- util/dag: Add a validation function. 1607- freedreno/ir3: Validate our scheduling DAGs after construction. 1608- nir/nir_opt_copy_prop_vars: Don't leak dynarray memory during the pass. 1609- nir/vars_to_ssa: Always do OOB load/store removal. 1610- glsl: Remove lower_output_reads. 1611- glsl: Remove do_set_program_inouts. 1612- glsl: Remove opt_array_splitting. 1613- glsl: remove opt_structure_splitting. 1614- glsl: Remove lower_vec_index_to_swizzle. 1615- ci/iris: Update iris traces checksums. 1616- turnip: Don't use the dynamic color write enable during non-dynamic. 1617- ci/zink: Clear stale xfails for turnip. 1618- zink: Skip border color clamping for compressed formats. 1619- zink: Enable ASTC texture format translation. 1620- zink: Map ETC1 to ETC2 to avoid uncompressing in the frontend. 1621- ci/zink: Add some more flakes for turnip. 1622- zink: Fix RelaxedPrecision decoration of texture samples. 1623- zink: Decorate mediump outputs as RelaxedPrecision. 1624- turnip: Add a perf_debug for feedback-related performance traps. 1625- turnip: Move the ubwc_possible check before mutable formats. 1626- turnip: Add perf_debug for UBWC being disabled due to mutable formats. 1627- turnip: Add perf debug for more UBWC-disable cases that we could support. 1628- zink: Fix dummy CB path decision for VK_EXT_cwe presence. 1629- turnip: Be sure we blit depth, not stencil, for Z32FS8 -> Z32F resolves. 1630- nir/opt_phi_precision: Fix missing swizzles when narrowing phi srcs. 1631- turnip: Enable LRZ testing (not writing) in the presence of discards. 1632- ci/freedreno: Update CivV trace expectation. 1633- ci/freedreno: Update known flakes/timeouts. 1634- turnip: Fix reservation for indirect compute's IR3_DP_SUBGROUP_ID_SHIFT. 1635- zink: No need to use a 2-sample dummy image for bindless without null descs. 1636- zink: Lazily allocate the dummy surfaces. 1637- zink: Enable mesa/st frontend shader caching. 1638- rusticl: Fix the invalid memory migration flags check. 1639- iris: Disable GLSL lower_const_arrays_to_uniforms. 1640- gallium: update docs about PIPE_CAP_PREFER_IMM_ARRAYS_AS_CONSTBUF. 1641- ir3/ra: Make sure we don't pick a preferred reg overflowing the file. 1642 1643Eric Engestrom (56): 1644 1645- bin/gen_release_notes.py: bump advertised vulkan version to 1.3 1646- pvr: fix memleak in error paths 1647- vk/device-select-layer: fix .sType of VkPhysicalDeviceGroupProperties 1648- docs/features: add missing extensions supported by anv 1649- docs/features: add missing extensions supported by lavapipe 1650- docs/features: add missing extensions supported by radv 1651- docs/features: add missing extensions supported by turnip 1652- docs/features: add missing extensions supported by venus 1653- docs/features: start tracking extensions supported by dozen 1654- docs/features: start tracking extensions supported by panvk 1655- docs/features: start tracking extensions supported by powervr 1656- docs/features: drop unsupported VK_KHR_display & VK_KHR_get_display_properties2 for lavapipe 1657- ci: rename GitHub CI workflow from \`ci` to \`macos` to be a bit more explicit about what it covers 1658- meson: drop duplicate \`lib` in lib name 1659- wsi/x11: fix memleak in wsi_x11_connection_create() 1660- aco: drop unused variable 1661- anv: convert assert into unreachable to avoid fallthrough error 1662- meson: replace manual compiler flags with meson arguments 1663- vc4: introduce VC4_DBG() macro to make VC4_DEBUG checks consistent 1664- v3d: introduce V3D_DBG() macro to make V3D_DEBUG checks consistent 1665- wayland: avoid writing interface names as free-form strings 1666- vc4: drop dead qir_POW() helper 1667- v3dv: implement VK_EXT_shader_module_identifier 1668- docs: move rusticl readme into docs/ 1669- broadcom: fix dependencies in static_library() calls 1670- docs: add release notes for 22.1.6 1671- docs: update calendar and link releases notes for 22.1.6 1672- docs: add release notes for 22.1.7 1673- docs: update calendar and link releases notes for 22.1.7 1674- ci: unexport local variable (and fix formatting) 1675- ci: bump mold to 1.5 1676- docs/release-calendar: add 22.3 schedule 1677- v3dv: drop error overwrite 1678- vc4: don't lower uadd_carry 1679- vc4: don't use imprecise_32bit_lowering for idiv lowering 1680- vc4: pre-compile shaders to make up for the lack of draw calls in shader-db 1681- vc4: use PIPE_MASK_RGBA name instead of its value 0xf 1682- vc4: send shaderdb output through KHR_debug 1683- vc4: mark piglit copypixels-(draw-)sync as flaky 1684- VERSION: bump to 22.3.0-rc1 1685- .pick_status.json: Update to 1174f376096ed6ceebb0fb2810456f1501a68df7 1686- zink: add missing build_id linker args 1687- vk/runtime: drop incorrect UNUSED annotation 1688- v3dv: avoid freeing already-freed memory 1689- VERSION: bump for 22.3.0-rc2 1690- .pick_status.json: Update to 9bd11f65238ce101bf846f5528f9088630e983f7 1691- ci: avoid triggering vc4 & v3d tests on v3dv-only MRs 1692- VERSION: bump for 22.3.0-rc3 1693- .pick_status.json: Update to f2e535e4fed5bdc13e11a443316a4b77cd5eb81a 1694- v3d(v): account for debug flags when using the cache 1695- .pick_status.json: Mark 56bd81ee21cb42e4a5afa2939514b570a1283754 as denominated 1696- VERSION: bump for 22.3.0-rc4 1697- .pick_status.json: Update to 22be0d09a005b4b955a46b65a919cfd786d6814f 1698- .pick_status.json: Mark 470fbb35efe1935242b346427ec0fa22b40fff2b as denominated 1699- commit_in_branch.py: add support for checking staging branches 1700- docs: add release notes for 22.3.0 1701 1702Erico Nunes (1): 1703 1704- ci: enable lima farm 1705 1706Erik Faye-Lund (226): 1707 1708- docs: fixup link to virgl docs 1709- zink: add have_D32_SFLOAT_S8_UINT boolean 1710- zink: do not use VK_FORMAT_D32_SFLOAT_S8_UINT without checking 1711- zink: remove needless check 1712- glsl/tests: do not perform eol-conversion on windows 1713- zink: type_main -> type_void_func 1714- zink: add spirv_builder_function_call 1715- zink: wrap discard in a function 1716- zink: fix conditions for draw-parameters 1717- zink: fixup indent in caps-check 1718- mesa/main: simplify implmementation of _mesa_is_format_etc2 1719- mesa/main: simplify implementation of _mesa_is_format_astc_2d 1720- mesa/st: simplify st_compressed_format_fallback 1721- mesa/st: avoid double-mapping if both images are the same 1722- mesa/st: do not use memcpy when using compressed fallback 1723- mesa/st: add context-flag for bptc-support 1724- mesa: add format-helper for bptc 1725- mesa: add _mesa_unpack_bptc-function 1726- mesa/st: implement fallback for bptc 1727- mesa/st: enable bptc extension with fallback 1728- zink: allow X32 -> A32 1729- mesa/main: remove always-true define 1730- mesa/main: add fp16-versions of bptc decompression functions 1731- mesa: fall-back to fp16 for bptc-textures 1732- gallium/u_transfer_helper: clean up boolean flags 1733- gallium/u_transfer_helper: add in-place interleave flag 1734- zink: use full transfer-helper api 1735- gallium/u_transfer_helper: hide deinterleave helpers 1736- mesa/st: also override mip filter for integer textures 1737- mesa/st: always force integer filtering to nearest 1738- mesa/st: force nearest-filtering for fp32 textures 1739- zink: request forcing of fp32-samplers to nearest when needed 1740- mesa/st: do not blit when using compressed fallback 1741- zink: clamp miplodbias when creating sampler 1742- zink: add ntv support for demote 1743- zink: add spirv_builder support for is_helper_invocation 1744- zink: add ntv support for is_helper_invocation 1745- zink: expose demote when supported 1746- zink: lower discard/demote depending on derivatives 1747- Revert "Revert "ci: set venus on lavapipe to manual due to flakes"" 1748- mesa/st: use _mesa_generate_mipmap for fallback-formats 1749- mesa/st: do not fallback to srgba formats for dxt 1750- util: allow unpacking less than a block from dxt 1751- mesa/st: add context-flags for s3tc 1752- mesa: add format-helpers for s3tc 1753- mesa/main: add _mesa_unpack_s3tc 1754- mesa/st: implement fallback for s3tc 1755- mesa/st: enable s3tc extensions with fallback 1756- v3d: do not pretend to fake rgtc-support 1757- vc4: respect z-offset in tiled blits 1758- vc4: do not attempt to do deep tiled blits 1759- docs/zink: remove bptc from required formats for gl4.2 1760- docs/zink: document rgtc requirement 1761- mesa/st: add context-flag for rgtc 1762- mesa: add format-helper for rgtc 1763- util/format: fix broken indentation 1764- util/format: allow unpacking less than a block from rgtc 1765- util/format: implement rgtc -> r8 / r8g8 unpack 1766- mesa/main: add _mesa_unpack_rgtc 1767- mesa/st: implement fallback for rgtc 1768- mesa/st: do not fall back to uncompressed for rgtc 1769- mesa/st: enable rgtc extension with fallback 1770- panfrost: do not fake rgtc-support 1771- docs: update staus of mark GL_ARB_texture_compression_bptc 1772- lima: don't store width in resource-level 1773- lima: do not align width/height for non-shared resources 1774- mesa/st: add context-flag for latc 1775- mesa: add format-helper for latc 1776- mesa/main: add support for latc in _mesa_unpack_rgtc 1777- mesa/st: implement fallback for latc 1778- mesa/st: do not fall back to uncompressed for latc 1779- mesa/st: enable latc extensions with fallback 1780- freedreno: do not fake rgtc-support 1781- u_transfer_helper: rip out fake_rgtc code 1782- mesa/st: always use normalized coords for samplers 1783- mesa/st: always use normalized coords for samplers 1784- mesa/st: always use normalized coords for samplers 1785- zink: assert on unnormalized texcoords 1786- docs: add an amber article 1787- zink: emulate latc formats with rgtc 1788- zink: fix luminance clears 1789- zink: fixup indent 1790- zink: refactor swizzle-clamping code 1791- zink: fix luminance/luminance-alpha emulation 1792- zink: correct emulation of luminance-alpha formats 1793- ci/crocus: update ci results 1794- mesa/main: fix broken indent 1795- mesa/main: don't copy signed rgtc via float 1796- docs: fixup broken rst syntax 1797- gallium/u_threaded_context: remove stale comment 1798- gallium: normalized_coords -> unnormalized_coords 1799- zink: fix broken pool-alloc consolidation 1800- zink: fixup dynarray-type 1801- zink: use util_dynarray_clear 1802- mesa/st: rip out point-sprite cap 1803- mesa/main: remove driver-cap for ARB_point_sprite 1804- mesa/st: move static asserts out of translate_prim 1805- mesa/st: remove translate_mode 1806- docs: correct spelling of "implemented" 1807- docs: correct spelling of "attempts" 1808- docs: correct spelling of "bug fixes" 1809- docs: correct spelling of "specified" 1810- docs: correct spelling of "hierarchical" 1811- docs: correct spelling of "implementation" 1812- docs: correct spelling of "to" 1813- docs: correct spelling of "Khronos" 1814- docs: correct spelling of "choosing" 1815- docs: correct spelling of "subtly" 1816- docs: correct spelling of "Raspbian" 1817- docs: correct spelling of "shareable" 1818- docs: correct spelling of "simplification" 1819- docs: correct spelling of "synced" 1820- docs: correct spelling of "maximum" 1821- docs: correct spelling of "hierarchy" 1822- docs: correct spelling of "commutativity" 1823- docs: correct spelling of "compression" 1824- docs: correct spelling of "laid" 1825- docs: correct spelling of "average" 1826- docs: correct spelling of "addition" 1827- docs: correct spelling of "explicitly" 1828- docs: correct spelling of "rasterizer" 1829- docs: fractionary -> fractional 1830- docs: ie. -> i.e. 1831- docs: correct spelling of "correctly" 1832- docs: correct spelling of "traveling" 1833- docs: correct spelling of "freedreno" 1834- zink: rework primitive rasterization type logic 1835- zink: reuse rast_prim for line-rasterization check 1836- docs: correct spelling of "addition" 1837- docs: correct spelling of "passes" 1838- docs: correct spelling of "irrespective" 1839- docs: correct spelling of "semantic" 1840- docs: correct spelling of "specifies" 1841- docs: correct spelling of "Gouraud" 1842- docs: correct spelling of "Copr" 1843- docs: spell out "build directory" 1844- docs: spell out "SHA256 checksum" 1845- docs: spell out "environment variable" 1846- docs: spell out "distribution" 1847- docs: use the official spelling of SHA-1 1848- docs: eg. -> e.g. 1849- docs: XOR'd -> XORed 1850- docs: consistently upper-case VC4, V3D etc 1851- docs: consistently spell DRI in all-caps 1852- docs: consistently spell ANV in allcaps 1853- docs: consistently spell URL in allcaps 1854- docs: consistently spell OK in allcaps 1855- docs: consistently use CMake 1856- docs: consistently spell API all-caps 1857- docs: consistently spell VirGL in camel-case 1858- docs: consistently spell Haswell in camel-case 1859- docs: use consistent casing for dEQP 1860- docs: ip -> IP 1861- docs: irc -> IRC 1862- docs: xorg -> X.Org 1863- docs: pci -> PCI 1864- docs: cpu -> CPU 1865- docs: gpu -> GPU 1866- docs: rgb -> RGB 1867- docs: perfetto -> Perfetto 1868- docs: lavapipe -> Lavapipe 1869- docs: freedreno -> Freedreno 1870- docs: adreno -> Adreno 1871- docs: radeon -> Radeon 1872- docs: nouveau -> Nouveau 1873- docs: llvmpipe -> LLVMpipe 1874- docs: vulkan -> Vulkan 1875- docs: lima -> Lima 1876- docs: ppir -> PPIR 1877- docs: autotools -> Autotools 1878- docs: ArchLinux -> Arch Linux 1879- docs: meson -> Meson 1880- docs: Gitlab -> GitLab 1881- docs: xyzw -> XYZW 1882- docs: hw -> HW 1883- docs: lava -> LAVA 1884- docs: skqp -> SkQP 1885- docs: servo -> Servo 1886- docs: Linux fbdev -> Linux Framebuffer 1887- docs: virtualisation -> virtualization 1888- docs: dependant -> dependent 1889- docs: healthcheck -> health check 1890- docs: vmware -> VMware 1891- docs: spell out "distribution" 1892- docs: spell out "transform and lighting" 1893- docs: de-duplicated -> deduplicated 1894- docs: de-nominate -> denominate 1895- docs: freedreno -> Freedreno 1896- docs: llvmpipe -> LLVMpipe 1897- docs: softpipe -> Softpipe 1898- docs: panfrost -> Panfrost 1899- docs: piglit -> Piglit 1900- docs: ini -> INI 1901- docs: swr -> OpenSWR 1902- docs: yml -> YAML 1903- docs: xml -> XML 1904- docs: Adreno aXXX -> Adreno XXX 1905- docs: remove non-existent directory 1906- docs: do not mention classic swrast 1907- docs: name correct driver 1908- docs: clean up labels 1909- docs: add missing link to mesa3d.org 1910- docs: add links to usenet groups 1911- docs: link to replacement forum 1912- docs: Android.mk -> ndk-build 1913- docs: gitlab -> GitLab 1914- docs: nVidia -> NVIDIA 1915- docs: spell ATI codenames in allcaps 1916- docs: nfs -> NFS 1917- docs: fixup rebase mistake 1918- zink: correct depth-bias enable condition 1919- zink: only set line-width if drawing lines 1920- zink: consider polygon-mode for rast_prim 1921- zink: do not read is_generated unless in tcs shader 1922- zink: put union fields into structs named by the shader-stages 1923- docs/zink: fix and cleanup rst syntax 1924- docs: fixup broken link syntax 1925- docs: do not mention EGL_MESA_drm_display 1926- docs: remove stale envvar-reference 1927- zink: fix json-errors in profile-file 1928- zink: remove needless requirements 1929- zink: fix incorrect requirements 1930- zink: update textureCompressionBC requirement 1931- docs/zink: add missing required device-feature 1932- mesa: treat unsupported queries as dummies 1933- d3d12: fix max-array-layers 1934 1935Fabian Vogt (1): 1936 1937- gallivm: Fix LLVM optimization with the new pass manager 1938 1939Feng Jiang (5): 1940 1941- virgl/vtest: fix memory overwrite problem in virgl_vtest_send_get_caps() 1942- gallium: add key size to the structure pipe_picture_desc 1943- vl: change vl_video_buffer_destroy() to non-static 1944- gallium/util: add a new helper function pipe_buffer_size() 1945- virgl: add support for hardware video acceleration 1946 1947Filip Gawin (5): 1948 1949- r300: add list of deqp gles2 r400 failures 1950- r300: add khr r400 failures 1951- gallium: avoid using float based conditions in loops 1952- r300: don't use smooth line if not requested 1953- r300: update r400 tests 1954 1955Francisco Jerez (4): 1956 1957- intel/fs: Fix horiz_offset() to handle FIXED_GRFs with non-trivial 2D regions. 1958- nir/lower_int64: Enable lowering of 64-bit float to 64-bit integer conversions. 1959- nir/lower_int64: Implement lowering of 64-bit integer to 64-bit float conversions. 1960- nir/lower_int64: Fix float16 to int64 conversions. 1961 1962Frank Binns (13): 1963 1964- pvr: get free list min size from the runtime info 1965- pvr: reset all command buffer state in pvr_cmd_buffer_reset() 1966- pvr: Implement vkResetCommandBuffer 1967- pvr: remove redundant TODO 1968- pvr: remove image pointer from image view struct 1969- pvr: set samplerAnisotropy to false 1970- pvr: cube map tex state packing fixes 1971- pvr: don't setup attachment tex state for images with input attachment bit unset 1972- pvr: finish render job sample count setup 1973- pvr: add required pixel formats 1974- pvr: remove implicit sync support 1975- pvr: setup buffer and image format feature bits 1976- CODEOWNERS: remove rajnesh-kanwal as an Imagination maintainer 1977 1978Friedrich Vock (28): 1979 1980- amd/common: move ac_memory_ops_per_clock into ac_gpu_info.h 1981- amd/common: Remove redundant code for determining memory ops per clock 1982- radv/winsys: Return VK_ERROR_INVALID_EXTERNAL_HANDLE if buffer imports fail 1983- radv: Generalize instance count in BVH states to leaf node count 1984- radv: Add metadata to acceleration structures 1985- radv: Add Radeon Raytracing Analyzer trace dumping utilities 1986- radv: Add Radeon Raytracing Analyzer capturing layer 1987- radv: Enable Radeon Raytracing Analyzer traces 1988- docs: Add documentation about RADV RRA tracing 1989- radv: Use radv_CmdUpdateBuffer instead of radv_update_buffer_cp for geometry info writes 1990- radv: Free geometry infos in radv_CmdBuildAccelerationStructuresKHR 1991- radv: Correct accel struct header size 1992- radv: Rename internal node shader to lbvh_internal 1993- radv: Split CmdBuildAccelerationStructuresKHR into different functions 1994- radv: Add BVH IR types 1995- radv: Add conversion shader for internal nodes 1996- radv: Add conversion shader for leaf nodes 1997- radv: Create pipelines for conversion meta shaders 1998- radv: Build acceleration structures using BVH IR 1999- radv: Rename emulated float helpers 2000- radv: Use a struct for AABBs 2001- radv: Add radv_indirect_unaligned_dispatch 2002- radv/rt: Fix internal converter synchronization 2003- radv/rt: Dispatch internal converter indirectly 2004- radv/rt: Track number of inactive leaf nodes 2005- radv: Add global sync utilities 2006- radv: Add REF as a typename macro to .clang-format 2007- radv: Use spirv1.5 instead of vulkan1.2 2008 2009Georg Lehmann (40): 2010 2011- aco: Check that we don't override exec_val operands during branching sequence optimization. 2012- aco/assembler: Fix v_cmpx with SDWA. 2013- aco: Fix optimizing branching sequence with s_and_saveexec. 2014- aco/assembler: Fix v_cmpx pre GFX10. 2015- aco: Use v_cmpx pre GFX10. 2016- aco: Force tex operand to have the correct sub dword size before packing. 2017- nir: Add nir_ssa_scalar_is_undef. 2018- nir/fold_16bit_tex_image: Add an option to fold image sources. 2019- aco: Implement storage image A16. 2020- aco: Combine 16bit undef and constants instead of using s_pack. 2021- radv: Fold 16bit image sources. 2022- nir: Print selection control for nir_if. 2023- nir/opt_algebraic: Optimize check for single bit. 2024- nir/opt_algebraic: Optimize d3d9 pow with fmulz. 2025- aco: Fix image instructions with lod when 2d_view_of_3d is enabled on GFX9. 2026- ac/llvm: Fix image instructions with lod for 2d on GFX9. 2027- aco: Use plain VOPC for vcmpx when possible. 2028- nir/opt_algebraic: Optimize unpacking of upcasts to 64bit integers. 2029- radv: Enable VK_EXT_load_store_op_none. 2030- aco: Use v_fmaak/v_fmamk if two operands are the same literal. 2031- aco: Unswizzle v_pk_fma_f16 literals to produce more v_pk_fmac_f16. 2032- radv: Fix GLSL BDA struct alignment and use pointer arithmetic SIZEOF. 2033- nir/opt_algebraic: Optimize more (a cmp b ? a : b) to min/max. 2034- aco: Use s_pack_ll for s_bfe operand on GFX9+. 2035- nir: Print nir_selection_control_divergent_always_taken. 2036- nir/opt_algebraic: Mirror optimizations for find_msb_rev. 2037- aco: Implement [ui]find_msb_rev. 2038- ac/llvm: Implement [ui]find_msb_rev. 2039- nir/opt_algebraic: Add an option to lower uclz. 2040- radv,aco: Lower uclz in NIR. 2041- nir/opt_algebraic: Optimize various find_msb_rev patterns. 2042- nir: Fix ifind_msb_rev constant folding. 2043- aco: fmaak/fmamk can't use SDWA. 2044- aco: Don't use opsel for p_insert. 2045- ac/llvm: Implement signed idot on GFX11. 2046- aco: Implement signed idot instructions on GFX11. 2047- radv: Use available dot product instructions on gfx11. 2048- radeonsi: Use available dot product instructions on gfx11. 2049- aco: Use opsel for the third operand. 2050- aco: Use s_pack_ll_b32_b16 for scalar zero extend. 2051 2052Gert Wollny (95): 2053 2054- radeonsi-raven/ci: Move spec@arb_timer_query@timestamp-get to flakes 2055- llvmpipe: Double number of SSBOs 2056- r600/sfn: Schedule shift instruction on R600 in t-slot 2057- r600/sfn: Add GS thread fix just like the TGSI code path 2058- r600/sfn: Sort FS inputs to make interpolated values come first 2059- r600/sfn: Fix color outputs when color0 writes all 2060- r600/sfn: Initialize out buffer when printing op 2061- r600: Force NOPs when loading AR on R600 class hardware 2062- r600/sfn: Handle R600 scratch read 2063- r600: Don't use SB with R600 style scratch reads 2064- r600: Fix SCRATCH OP de-assembly 2065- nir_lower_atomics_to_ssbo: Initialize deref struct 2066- r600/sfn: Handle color0 writes all on R700 like on EG 2067- r600/sfn: Don't tag mem-ring and stream instructions as exports 2068- r600/sfn: Don't schedule GDS instructions early 2069- r600/sfn: Don't scan the whole block for ready instructions 2070- r600/sfn: Use a heuristic to keep SSBO setup and store close 2071- r600: Fix reporting TGSI IR support 2072- r600/sfn: Use a low number for unused target register 2073- virgl: Fix buffer overflow warning: 2074- virgl: Fix ubsan warnings: 2075- virgl: when reading back wait first, then do the transfer 2076- virgl/ci: remove some flakes 2077- r600/sfn: override register ID when it doesn't matter 2078- r600/sfn: ACK all image stores, also at end of loop 2079- r600/sfn: more peephole optimization with conditionals 2080- r600/sfn: Override VPM if access in helpers is requested 2081- r600/sfn: Don't scalarize fdd instructions 2082- virgl: Add some formats that the CTS uses 2083- mesa/glsl: Add support for NV_shader_noperspective_interpolation 2084- r600: set nir option lower_cs_local_index_to_id 2085- virgl: Report CONSTANT_BUFFER_SIZE according to GL_MAX_UNIFORM_BLOCK_SIZE 2086- r600/sfn: Lower tex,txl,txb and txf to backend 2087- r600/sfn: lower txd to backend in nir 2088- r600/sfn: lower txf_ms in nir 2089- r600/sfn: copy-propagate single source texture values 2090- r600/sfn: print tex prepare instructions 2091- r600/sfn: Don't allocate un-used components in texture ops 2092- r600/sfn: Add an easy access to get an instruction as ALU 2093- r600/sfn: Don't assert when setting one value 2094- r600/sfn: Copy propagate into TEX source 2095- r600/sfn: drop some unused code 2096- r600/sfn: copy propagate register load chains 2097- r600/sfn: VS inputs are effectively SSA 2098- r600/sfn: Add a free-channel mask when testing whether a register can switch channel 2099- r600/sfn: fix some channel pinning 2100- r600/sfn: Handle shifts on Cayman 2101- r600/sfn: only use 3 channels on Cayman for trans ops 2102- r600/sfn: Make sure texture lowering is done in the right order 2103- r600/sfn: lower tg4 to backend in NIR 2104- r600: Add slot to ALU disassambly 2105- r600: Make NIR the default shader IR 2106- nir_lower_to_source_mods: Don't sneek in an abs modifier from parent 2107- r600/sfn: Only run 64 bit ops lowering passes when really needed 2108- r600/sfn: Always start a new CF after a KILL instruction 2109- r600/sfn: don't propagate registers into conditional test 2110- r600/sfn: Add peephole optimization for kill instructions 2111- r600/sfn: Delete final lowered nir shader early 2112- r600/sfn: Fix typo 2113- r600/sfn: Use the correct allocator for loop lists 2114- r600/sfn:explicitly initialize the memory pool 2115- r600/sfn: assert on use of abs modifier in op3 2116- r600/sfn: Unroll loops after doing some optimizations 2117- r600/sfn: Always enforce LDS operation order 2118- r600/sfn: Make sure all components are usable when lowering TF inputs 2119- r600/sfn: run cleanup passes after late algebraic opt 2120- nir: move fusing csel and comparisons to opt_late_algebraic 2121- r600/sfn: Unify the handling of resource IDs in instruction 2122- r600/sfn: Handle nir_op_seq and nir_op_sne 2123- r600: Fix printing t-channel in diss-assambly 2124- r600/sfn: Add test for channel changes in TEX source from opt 2125- r600/sfn: Allow copy-prop of group dest into origin 2126- r600/sfn: Trigger TEX CF based on max TEX CF size 2127- r600/sfn: improve scheduling of tex sources 2128- r600/sfn: Increase scheduling priority on uniform reads and non-ssa writes 2129- r600/sfn: elimiate dead registers too 2130- r600: Account for color and clipvertex when evaluating LDS space 2131- r600/sfn: evaluate LDS location for color and clip-vertex too 2132- r600/sfn: Add .clang-format file and apply style 2133- r600: declare for counter locally and fix signed/unsigned warning 2134- r600: Only count ALU registers that are not clause local 2135- r600: Print MOVA_INT dest on Cayman 2136- r600/sfn: Handle load_workgroup_size 2137- r600/sfn: lower uniforms to UBOs 2138- r600/sfn: remove load_uniform handling 2139- etnaviv: Create MSAA surfaces with PE compatible tiling 2140- nir/algeraic_opt: use double options too for lowering ftrunc\@64 2141- r600/sfn: Fix source modifiers for ffract64 2142- r600/sfn: Fix f2u32 and remove backend lowring of f2u64 and f2i64 2143- r600/sfn: Honor shader key w.r.t. atomic counter layout 2144- r600/sfn: Fix location for reading cube array image dimensions 2145- r600/sfn: sort FS color outputs before all other outputs 2146- virgl: don't a use staging when a resources created with the shared flag 2147- virgl: Fix injection of double from const mov instruction 2148- r600/sfn: always use four slots for Cayman trans ops 2149 2150Giancarlo Devich (4): 2151 2152- mesa: Fix stack corruption for PIPE_QUERY_TIMESTAMP 2153- d3d12: Add cache managers for separate buffer usage cases 2154- gallium/pipe: Align allocation size in \`pb_cache_manager_create_buffer` 2155- d3d12: Don't align already-aligned size in \`d3d12_bufmgr_create_buffer` 2156 2157Glenn Kennard (1): 2158 2159- nv30: Fix non-scissored clears after a scissor has been set 2160 2161Guilherme Gallo (29): 2162 2163- ci/lava: Customise sections timeouts via envvars 2164- ci/lava: Add timeout header info for LAVA GL Sections 2165- ci/bin: Add utility to find jobs dependencies 2166- ci/bin: Sort imports from ci_run_n_monitor.py 2167- ci/bin: Make ci_run_n_monitor finds dependencies automatically 2168- ci/bin: Add script to expand jobs manifest 2169- ci/bin: Cache GQL queries 2170- ci/bin: Print job needs DAG in ci_run_n_monitor 2171- ci/skqp: Add an option to run all tests 2172- ci/skqp: Show reports on crashes 2173- ci/skqp: Fix paths in skqp-runner 2174- ci/skqp: Fix Nima-Cpp fetching error 2175- ci/skqp: Build list_gpu_unit_tests and list_gms 2176- ci/skqp: Add support for commenting tests files 2177- ci/skqp: Use SKQP_BIN_DIR instead of hardcoded /skqp dir 2178- ci/skqp: Put generated tests files in artifacts 2179- ci/skqp: Supress irrelevant shellcheck warnings 2180- ci/skqp: Remove .baremetal-skqp-test in favor of .skqp-test 2181- ci/skqp: Add gitlab sections for uncluttering 2182- ci/freedreno: skqp: run with new tests files 2183- ci/radeonsi: skqp: Add fail test files for raven 2184- ci/radeonsi: Add zork jobs and rules 2185- ci/bin: Fix requirements.txt 2186- ci/bin: Remove whitespace from token files 2187- ci: Fix kernel+rootfs.* jobs 2188- ci: Update piglit with s3 support 2189- ci: Update piglit-traces tests expectations 2190- ci: Update ci-fairy in CI, rootfs and containers 2191- freedreno/ci: Skip civilization-v/CivilizationV-trim trace 2192 2193Hans-Kristian Arntzen (3): 2194 2195- vulkan: Update to 1.3.228 headers. 2196- radv: Implement VK_EXT_mutable_descriptor_type. 2197- vk/runtime: Fix narrowing of timeline signal and wait value to u32. 2198 2199Iago Toral Quiroga (87): 2200 2201- broadcom/compiler: don't use imprecise_32bit_lowering for idiv lowering 2202- broadcom/compiler: use nir_opt_idiv_const 2203- broadcom/compiler: simplify code emitted for centroid coordinates 2204- nir/lower_alu: drop unnecessary iand on uadd_carry result 2205- broadcom/compiler: track number of TMU operations in prog data 2206- v3dv: move check_needs_load/store helpers to unversioned code 2207- v3dv: track if a job is compatible with double-buffer mode 2208- v3dv: fix tile state allocation 2209- v3dv: add a v3dv_job_allocate_tile_state helper 2210- v3dv: postpone tile state allocation for render pass jobs 2211- v3dv: tell job_compute_frame_tiling whether we want to use double-buffer mode 2212- v3dv: add a heuristic for double-buffer mode 2213- v3dv: vkCmdClearAttachments no longer generates its own RCL 2214- v3d,v3dv: lower texel buffer aligment requirements 2215- v3dv: implement VK_EXT_texel_buffer_alignment 2216- v3dv: add a helper to destoy pipeline layouts 2217- v3dv: ref/unref pipeline layout objects 2218- v3dv: set maxBufferSize property 2219- v3dv: add a v3dv_image_init helper 2220- v3dv: add a get_image_memory_requirements helper 2221- v3dv: implement vkGetDeviceImageMemoryRequirementsKHR 2222- v3dv: add a get_buffer_memory_requirements helper 2223- v3dv: add a buffer_init helper 2224- v3dv: implement vkGetDeviceBufferMemoryRequirementsKHR 2225- v3dv: implement vkGetDeviceImageSparseMemoryRequirementsKHR 2226- v3dv: expose VK_KHR_maintenance4 2227- nir/lower_variable_initializers: implement non-scoped barrier path 2228- v3dv: implement VK_KHR_zero_initialize_workgroup_memory 2229- v3dv: also check GS stage on image access for binning barriers 2230- v3dv: implement vkCmdPipelineBarrier2 2231- v3dv: port remaining entry points to KHR_synchronization2 2232- v3dv: use VkPipelineStageFlagBits2 when checking semaphore wait stage 2233- v3dv: expose VK_KHR_synchronization2 2234- v3dv: implement VK_KHR_workgroup_memory_explicit_layout 2235- v3dv: switch to common code for command buffer lifecycles 2236- v3dv: expose VK_EXT_tooling_info 2237- v3dv: fix swap_rb and channel_reverse flags in image views 2238- v3dv: expose VK_EXT_border_color_swizzle 2239- v3dv: ref pipeline layout earlier during pipeline init 2240- v3dv: implement VK_EXT_depth_clip_control 2241- v3dv: expose VK_EXT_attachment_feeback_loop_layout 2242- v3dv: fix variable type 2243- v3dv: limit heap size to 4GB 2244- broadcom/simulator: add a helper to get the amount of free heap memory 2245- v3dv: implement VK_EXT_memory_budget 2246- v3dv: expose VK_EXT_primitive_topology_list_restart 2247- v3dv: don't load an attachment for unaligned render area if we are not storing 2248- v3dv: expose VK_EXT_load_store_op_none 2249- v3dv: don't return incompatible driver if GPU is not present 2250- v3dv: fix program id for binning shaders 2251- broadcom/compiler: detect unifa write from signal 2252- broadcom/compiler: check signal writes to magic regs when updating scoreboard 2253- v3dv: optimize ldunif load into unifa write 2254- v3dv: fix VK_EXT_texel_buffer_alignment 2255- broadcom/compiler: increase V3D_MAX_BUFFE_RANGE to 2^30 bytes 2256- broadcom/compiler: fix robust buffer access 2257- broadcom/compiler: handle shared stores with robust buffer access 2258- broadcom/compiler: don't apply robust buffer access to shared variables 2259- broadcom/compiler: rename v3d_nir_lower_robust_buffer_access.c 2260- broadcom/compiler: rename static helpers involved with robust buffer access 2261- broadcom/compiler: add a lowering for robust image access 2262- v3dv: expose VK_EXT_image_robustness 2263- v3dv: use NIR_PASS with v3d_nir_lower_robust_image_access 2264- v3dv: use enabled features from vk_device 2265- broadcom/compiler: trivial code clean-up 2266- vulkan/runtime: include robustness info when hashing a shader stage 2267- v3dv: implement VK_EXT_pipeline_robustness 2268- v3dv: drop layout refs for all allocated sets from a pool on destroy / reset 2269- broadcom/compiler: handle vec2 load/store index 2270- v3dv: refactor events 2271- v3dv: re-enable sync_fd import/export 2272- v3dv: return out of host memory if we fail to create event pipelines 2273- v3dv: split event implementation to a separate file 2274- v3dv: remove unnecessary check for NULL 2275- v3dv: do a better job at cleaning up the device on init failure 2276- v3dv: fix incorrect return type 2277- v3dv: handle allocation failure during pipeline initialization 2278- v3dv: do better cleanup on failure during pipeline cache operation 2279- v3dv: increase limit for active event objects 2280- v3dv: always check VK_ACCESS_2_MEMORY_READ_BIT for read accesses 2281- v3dv: make the helper to emit pipeline barriers public to other files 2282- v3dv: fix event synchronization 2283- v3dv: vkCmdWaitEvents2 takes an array of VkDependencyInfo 2284- v3dv: use vk_alloc instead of malloc 2285- broadcom/compiler: avoid using ldvary sequence to hide latency of branching 2286- v3dv: fix debug dump on BO free 2287- v3dv: ignore imported BOs when tracking BO memory usage 2288 2289Ian Romanick (8): 2290 2291- nir: spirv: Allow 32-bit version of nir_intrinsic_is_sparse_texels_resident 2292- radeonsi: r600: d3d12: st: Use NIR lowering for tg4 offset arrays instead of GLSL lowering 2293- glsl: Remove lower_offset_arrays pass 2294- nir/comparison_pre: See through an inot to apply the optimization 2295- intel/fs: Fix constant propagation into 32x16 integer multiplication 2296- nir/range_analysis: Set higher default maximum for max_workgroup_count 2297- nir/loop_analyze: Fix get_iteration for nir_op_ine 2298- nir/loop_analyze: Fix get_iteration for nir_op_fneu 2299 2300Igor Torrente (3): 2301 2302- vulkan: Add vk_clock_gettime and vk_time_max_deviation 2303- radv: anv: Use the new vk_clock_gettime and vk_time_max_deviation functions 2304- venus: Fix dEQP-VK.pipeline.timestamp.calibrated.host_domain_test failure 2305 2306Ikshwaku Chauhan (1): 2307 2308- Revert "radeon: add EFC support to only VCN2.0 devices" 2309 2310Illia Abernikhin (2): 2311 2312- utils: Move functions from debug.* to u_debug.* 2313- utils: Merge util/debug.* into util/u_debug.* and remove util/debug.* 2314 2315Illia Polishchuk (2): 2316 2317- mesa: skip extra state updates for clear calls 2318- driconf/Intel: Add lower_depth_range_rate option workaround for Homerun Clash misrendering issue 2319 2320Isaac Bosompem (1): 2321 2322- virgl: Set use_staging in resource_from_handle 2323 2324Italo Nicola (6): 2325 2326- etnaviv: skip disk cache initialization on standalone compiler 2327- etnaviv: fix standalone compiler to work without a tgsi backend 2328- clc: add 32-bit target 2329- rusticl: use 32-bit address format for 32-bit devices 2330- rusticl: correctly check global argument size 2331- rusticl: fix MemConstant invalid arg size check 2332 2333Iván Briano (13): 2334 2335- anv: emit scissors when the pipeline changes 2336- anv: pipelineStageCreationFeedbackCount is allowed to be 0 2337- anv: populate rt shader groups if they were found in the cache 2338- anv: Set meshShaderQueries for mesh shader ext features 2339- intel/utrace: create the callback events for xfb trace points 2340- anv/grl: Add a GRL file parser 2341- anv: set READ/WRITE_WITHOUT_FORMAT for buffer views 2342- vulkan_hasvk: set READ/WRITE_WITHOUT_FORMAT for buffer views 2343- anv: compile_upload_rt_shader expects a valid pointer 2344- anv: use DX rules for point rasterization 2345- hasvk: use DX rules for point rasterization 2346- anv: support VK_PIPELINE_CREATE_RAY_TRACING_SKIP_* 2347- anv: enable rayTraversalPrimitiveCulling feature 2348 2349James Park (2): 2350 2351- vulkan: Augment _WIN32 stub comparison 2352- meson,amd: Remove Windows libelf wrap 2353 2354James Zhu (4): 2355 2356- amd/common: some ASICs with gfx9 use compute rings for render 2357- util/format: add util format y8_400_unorm 2358- frontends/va: add support for yuv400 and yuv444 2359- radeonsi/vcn: enable jpeg decode of yuv444 and yuv400 2360 2361Jami Kettunen (1): 2362 2363- freedreno/ir3: Switch to NIR for a5xx's vertex id lowering. 2364 2365Jan Beich (1): 2366 2367- util: unify FreeBSD futex_wait signature with Linux/OpenBSD/Windows 2368 2369Faith Ekstrand (115): 2370 2371- nir: Clean up and improve nir_dedup_inline_samplers 2372- intel/fs_reg_allocate: Improve compressed instruction self-interference 2373- intel/rt: Handle halts in any-hit shaders properly 2374- intel/rt: Handle multiple exits in lower_shader_returns 2375- genxml: Add BVH data structures 2376- anv: Don't require 32-bit addresses for scratch on Gen12.5+ 2377- vulkan,anv,dozen: Use VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT 2378- radv: Use both aspects for depth/stencil blit destinations 2379- radv: Use VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT for render loops 2380- radv/meta: Respect render area for MSAA resolves 2381- radv/meta: Fix a stack use-after-free with sample locations 2382- radv/meta: Drop subpass_att from radv_fast_clear_color 2383- radv/meta: Use Begin/EndRendering for resolves 2384- radv/meta: s/dest/dst/ in a few functions 2385- radv/meta: Use Begin/EndRendering for clears 2386- radv: Drop set/restore_subpass 2387- radv/meta: Drop redundant depth_view_can_fast_clear 2388- radv: Add a image_transition helper for separate depth/stencil layouts 2389- radv: Don't assume layout == stencilLayout for stencil-only 2390- util,intel: Pull the bit packing helpers from genxml to a common header 2391- broadcom: Use util/bitpack_helpers.h 2392- util: Add util_bitpack_[su]fixed_clamp helpers 2393- asahi: Use util/bitpack_helpers.h 2394- panfrost: Use util/bitpack_helpers.h 2395- anv: Use nir_intrinsic_load_layer_id for ViewId in fragment shaders 2396- anv: Decouple primitive replication check from the fragment shader 2397- anv: Stop looking at the pipeline in multiview lowering 2398- nir: Track per-view outputs in shader_info 2399- intel/compiler: Store the number of position slots in the VUE map 2400- anv: Drop anv_pipeline::use_primitive_replication 2401- pvr: Only destroy the physcial device if it exists 2402- vulkan/cmd_buffer: add record_state to the common command buffer. 2403- vulkan/cmd_queue: Drop vk_cmd_queue::error 2404- vulkan: Add a concept of recycling an object 2405- vulkan: Re-order arguments to vk_command_pool_init 2406- vulkan: Re-order arguments to vk_command_buffer_init 2407- vulkan: Add a vk_command_buffer_ops struct 2408- vulkan: Add a common vkResetCommandBuffer() implementation 2409- vulkan: Add a common vkAllocateCommandBuffers() implementation 2410- vulkan: Recycle command buffers in vk_command_pool 2411- radv: Move to the common command pool framework 2412- tu: Move to the common command pool framework 2413- lvp: Move to the common command pool framework 2414- vulkan,docs: Add documentation Vulkan command pools 2415- vulkan: Dirty VP_VIEWPORTS/SCISSORS when copying viewports/scissors 2416- radv: Set the window scissor to the render area, not framebuffer 2417- radv: Only copy the render area from VRS to HTILE 2418- radv: Leave image layouts alone when doing HW MSAA resolves 2419- radv: Switch to dynamic rendering only 2420- rusticl: Call nir_scale_fdiv 2421- vulkan/runtime: Add a comon vk_descriptor_update_template 2422- anv: Switch to the common descriptor update template struct 2423- vulkan/runtime: Compact descriptor update templates 2424- panvk: Fix buffer views 2425- spirv: Don't use libclc for wait_group_events 2426- nir: Add a helper for finding a function by name 2427- nir/load_libclc: Don't add generic variants that already exist 2428- vulkan: Record more enabled robustness features 2429- vulkan: Add a helper for gathering pipeline robustness 2430- iris: Use a larger alignment for buffer allocations 2431- iris: Fix more BO alignments 2432- iris: Handle resource offsets in buffer copies 2433- iris: Stop looking at textures_used for samplers 2434- iris: Split max #defines for textures/samplers/images 2435- iris: Support up to 64 images 2436- iris: Support up to 128 textures 2437- intel/fs: Always use integer types for indirect MOVs 2438- intel/fs: SEL_EXEC uses the integer pipe for 64-bit stuff 2439- intel/mi_builder: add a way to reserve a register 2440- intel/mi_builder: Add a helper for incrementing reference counts 2441- anv/formats: Advertise ACCELERATION_STRUCTURE_VERTEX_BUFFER_BIT 2442- anv: Add an anv_address_map helper 2443- anv: Implement VK_KHR_pipeline_library 2444- anv: Set up the memory-backed FIFO buffer 2445- anv: Add extern "C" guards 2446- anv: Add support for compiling OpenCL-style kernels 2447- anv: Add support for OpenCL-style kernel dispatch 2448- anv: Import GRL 2449- anv/grl: Build OpenCL kernels 2450- anv/grl: Add a helper for dispatching our pre-built kernels 2451- intel/grl: Parse GRL files and generate C 2452- anv: Build BVHs on the GPU with GRL 2453- anv: Advertise ray-tracing on DG2 2454- intel/devinfo: DG2 supports ray-tracing 2455- vulkan/wsi: Support configuring swapchain images as part of swapchain init 2456- vulkan/wsi/x11: Configure images via params passed to wsi_swapchain_init() 2457- vulkan/wsi/wayland: Configure images via params passed to wsi_swapchain_init() 2458- vulkan/wsi/display: Configure images via params passed to wsi_swapchain_init() 2459- vulkan/wsi/win32: Configure images via params passed to wsi_swapchain_init() 2460- vulkan/wsi: Always configure images in wsi_swapchain_init() 2461- vulkan/wsi: Don't expose as many image configure helpers 2462- vulkan/wsi: Add a typedef for memory type select callbacks 2463- vulkan/wsi: Add a supports_scanout flag 2464- vulkan: Use altlen when available for array lengths 2465- vulkan: Rename viewport_state::negative_one_to_one 2466- vulkan: Add a dynamic state for vp.depth_clip_negative_one_to_one 2467- vulkan: Add more dynamic rasterizer state 2468- vulkan: Add dynamic state for tessellation domain origin 2469- vulkan: Add more dynamic multisample states 2470- vulkan: Add more dynamic color blend states 2471- vulkan,lavapipe: Use a tri-state enum for depth clip enable 2472- iris: Set SamplerCount in shader packets 2473- nir: Reformat a comment 2474- nir: Use nir_shader_instructions_pass in nir_lower_readonly_images_to_tex 2475- nir: Misc. style fixes to nir_lower_readonly_images_to_tex 2476- nir: Unconditionally call nir_trim_vector in nir_lower_readonly_images_to_tex 2477- nir/types: Add some asserts to glsl_get_struct_field() 2478- nouveau/codegen: Support bindless texture queries 2479- intel/lower_mem_access_bit_sizes: Compute alignments automatically 2480- intel: Don't cross DWORD boundaries with byte scratch load/store 2481- vulkan: Handle VK_SUBPASS_EXTERNAL at the end of a subpass 2482- vulkan: Unconditionally add barriers for missing external subpass deps 2483- vulkan: Add state for extraPrimitiveOverestimationSize 2484- vulkan: Add a common implementation of CmdSetSampleLocationsEnableEXT 2485- vulkan: Add a dummy vk_common_CmdSetColorBlendAdvancedEXT() 2486 2487Jeremy Rand (1): 2488 2489- llvmpipe: bump LP_MAX_THREADS to 32 2490 2491Jesse Natalie (82): 2492 2493- ci/windows: Re-enable Windows runners 2494- dzn: Disable extensions that require multi-view 2495- dzn: Use D3D12 constants instead of naked numbers for device limits 2496- ci/windows: Disable Windows builds while the runner is offline again 2497- ci/windows: Windows runner is back online 2498- ci/windows: Install Graphics-Tools package on the base image 2499- egl/wgl: Don't require a native display to support window surfaces 2500- egl/wgl: Implement QuerySurface 2501- wgl: Split pbuffer creation into two functions 2502- egl/wgl: Implement PBuffer surfaces 2503- egl/wgl: Set YInvert attribute to always true 2504- egl/wgl: Implement bind/release texture buffer 2505- wgl: Allow per-framebuffer swap interval overrides 2506- egl/wgl: Support eglSwapInterval 2507- egl/wgl: Support eglWaitClient 2508- egl/wgl: Support eglWaitNative 2509- egl,wgl: Support eglCreateImageKHR 2510- wgl: Pass smapi explicitly to context creation 2511- wgl: Pass smapi explicitly to framebuffer creation 2512- egl/wgl: Use a per-display st_manager 2513- egl/wgl: Hook up image validate/get in smapi 2514- egl/wgl: Support EGL_MESA_query_driver 2515- egl/wgl: Support sync objects 2516- wgl: Use pfi instead of iPixelFormat more often 2517- egl/wgl: Delete unused variables/code 2518- egl/wgl: Fix some awkward sizeof formatting 2519- meson: Don't include glsl compiler if flex/bison aren't found 2520- microsoft/compiler: Discard shouldn't be marked readnone 2521- dxil_nir_lower_int_cubemaps: When not lowering samplers, don't touch sampler types 2522- microsoft/compiler: Fix PSV struct when numthreads is 0 2523- microsoft/clc: Handle a null clc logger 2524- gallium/windows: Delete OpenGLOn12.dll target 2525- meson: Add an option to specify the WGL gallium megadriver filename 2526- mesa: Expose GL_NV_ES1_1_compatibility 2527- microsoft/compiler: Always emit a shader at the max-supported shader model 2528- microsoft/compiler: Support up to shader model 6.5 2529- microsoft/compiler: Add struct and function defs for SM6.6 handle funcs 2530- microsoft/compiler: Add dynamic create handle helper 2531- microsoft/compiler: Add getters for res bind/props structs 2532- microsoft/compiler: Delete double-assignment of sampler metadata field 2533- microsoft/compiler: Pass lower_bound, upper_bound, space to createhandle 2534- microsoft/compiler: Handle SM6.6 handles 2535- microsoft/compiler: SM6.6 is supported 2536- microsoft/compiler: Support SM6.7 2537- d3d12: Get max supported shader model 2538- dzn: Get max supported shader model 2539- u_atomic: Add a helper for pointer compare-exchange 2540- d3d12: Change displayable format logic 2541- ci/windows: Update build container image with all deps so wraps can be disabled 2542- deps: Update DirectX-Headers req to 1.606.4 2543- d3d12: Move some things from screen late-init to early-init 2544- d3d12: Store the d3d12 module reference on the screen 2545- d3d12: Support device factories in addition to global device creation 2546- d3d12: Attempt to use the Agility SDK 2547- d3d12: Use ID3D12DeviceConfiguration for root signature serialization 2548- d3d12: Release sharing contract in flush_frontbuffer 2549- dzn: Store the d3d12 module reference on the instance 2550- dzn: Support device factories in addition to global device creation 2551- dzn: Use architecture props to return correct device type 2552- dzn: Attempt to use the Agility SDK 2553- dzn: Use ID3D12DeviceConfiguration for root signature serialization 2554- ci/windows: Update vk-gl-cts to pick up new zlib dependency 2555- d3d12: Don't put permanently-resident resources in the residency bo list 2556- d3d12: Don't multiply cube array sizes by 6 2557- d3d12: Set PIPE_CAP_MAX_TEXTURE_UPLOAD_MEMORY_BUDGET 2558- dzn: Set factory flags before creating device 2559- d3d12: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT 2560- include: Add WGL version of GL interop extensions 2561- mesa, dri2: Move interop helpers from dri2 to state tracker 2562- include: Add a flush interop method 2563- mesa: Make fence_sync not static 2564- mesa/st: Split interop object lookup from export 2565- mesa/st: Implement the new flush method 2566- dri: Implement the new flush method 2567- glx: Implement the new flush method 2568- egl/dri2: Implement the new flush method 2569- wgl: Implement interop methods 2570- egl/wgl: Implement interop methods 2571- include: Add driver data to v2 of mesa_glinterop_device_info 2572- gallium: Add screen methods for driver-private data on interop interface 2573- mesa/st: Query driver for interop data 2574- d3d12: Hook up driver-private interop data 2575 2576Jessica Clarke (1): 2577 2578- panfrost/blend: Fix invalid const values leading to NIR validation errors 2579 2580Joan Bruguera (1): 2581 2582- nv50/ir/nir: ignore sampler for TXF/TXQ ops. 2583 2584John Brooks (5): 2585 2586- radv: Add get_addrlib function to radv_radeon_winsys 2587- vulkan: Introduce vk_format_is_block_compressed function 2588- radv: Only apply mipmap view adjustments to block compressed images 2589- radv: Fix corrupted mipmap copies on GFX9+ 2590- radv: Fix mipmap views on GFX10+ 2591 2592Jonathan (3): 2593 2594- gallium/u_threaded: buffer subdata merging (v2) 2595- gallium/u_threaded: Prepare for buffer sharedness tracking 2596- gallium/u_threaded: buffer sharedness tracking 2597 2598Jonathan Gray (2): 2599 2600- iris: check i915 features after hw gen 2601- util: include sys/time.h for timespec functions 2602 2603Jordan Justen (15): 2604 2605- iris: Drop extra file-descriptor dup in iris_drm_screen_create() 2606- intel/pci_ids: Update ADL-S strings 2607- intel/pci_ids: Add 0x468b ADL-S PCI-id 2608- intel/pci_ids: Update ATS-M device names 2609- intel/pci_ids: Update DG2 device names 2610- intel/pci_ids: Add dg2 0x5698 pci-id 2611- intel/pci_ids: Drop non-upstream dg2 pci-ids 2612- intel/compiler: Use builder to allocate fs regs for gs control data bits 2613- intel/devinfo: Add MTL platforms enums and intel_device_info_is_mtl() 2614- intel/l3: Use L3 full-way allocation setting for gfx12.5 (DG2, MTL) 2615- anv/meson: Use anv_flags and anv_cpp_flags in genX compiles 2616- meson: Deprecate vulkan-rt-drivers intel 2617- meson: Define with_intel_vk_rt based on with_intel_clc 2618- intel/compiler: Broadcast lower code should check 64-bit int support 2619- ci/d3d12: Update quick_shader results with 24 fixes from !19128 2620 2621Jose Maria Casanova Crespo (4): 2622 2623- CI: Igalia farm is down 2624- Revert "CI: Igalia farm is down" 2625- vc4: consolidate shader-db output 2626- v3d: Minor fixes on sand8 blit based on sand30 modifications 2627 2628Joshua Ashton (2): 2629 2630- radv: Remove workarounds for D3D9 feedback loops 2631- freedreno: Disable 8bpp_ubwc on a6xx gen2 2632 2633José Fonseca (1): 2634 2635- util: Add macro util_get_cpu_caps in u_cpu_detect.c for avoid accidentally call it in u_cpu_detect.c 2636 2637José Roberto de Souza (34): 2638 2639- intel: Simply intel_gem_create_context_engines() 2640- anv: Remove duplicated memset() in physical device creation 2641- anv: Remove anv_app_info 2642- anv: Free vmas in case device creation fails in pthread_mutex_init(&device->mutex 2643- anv: Do not duplicate intel_device_info memory in each logical device 2644- anv: Only wait for queue sync if execbuf was properly executed 2645- anv: Do not copy garbage to batch_bo 2646- anv: Nuke anv_execbuf_init() 2647- anv: Group all context operations during device creation 2648- anv: Nuke dead code 2649- anv: Return earlier in anv_gem_get_tiling() when not supported 2650- intel/compiler/fs: Fix compilation of shaders with SHADER_OPCODE_SHUFFLE of float64 type 2651- intel/compiler/fs: Use DF to load constants when has_64bit_int is not supported 2652- vulkan_hasvk: Nuke dead code around I915_ENGINE_CLASS_COMPUTE 2653- anv: Nuke cmd_parser_version 2654- intel/dev: Adjust prefetch_size values for MTL engines 2655- intel: Share code to read render timestamp 2656- intel: Make engine related functions and types not i915 dependent 2657- intel/dev: Split i915 specific parts of intel_get_device_info_from_fd() 2658- anv: Move fetch of i915 physical device parameters 2659- anv: Split i915 specific parts of anv_queue_submit_simple_batch() 2660- anv: Use Vulkan types for priority as much as possible 2661- anv: Split the debug part of anv_queue_exec_locked() 2662- intel: Convert missing i915 engine types to intel 2663- intel: Convert i915 engine type to intel in tools/ common/ and ds/ 2664- intel: Add and use intel_engines_class_to_string() 2665- hasvk: Nuke code around local memory 2666- hasvk: Fix build around intel_measure_state_changed() call 2667- iris: Fix enablement of protected contexts 2668- intel/perf: Use intel_device_info functions to compute subslice and eu totals 2669- iris: Nuke pci_id from iris_screen 2670- iris: Drop duplicated errno handling in iris_bo_wait() 2671- iris: Do not export iris_bo_wait() 2672- iris: Set priority for replaced engine context 2673 2674Juan A. Suarez Romero (3): 2675 2676- vc4/simulator: use i915/amd ioctls for BO 2677- vc4: properly restore vc4 debug option 2678- vc4: store tex sampler in proper register 2679 2680Julia Tatz (1): 2681 2682- zink: Advertise PIPE_CAP_NATIVE_FENCE_FD 2683 2684Juston Li (6): 2685 2686- venus: Enable VK_EXT_texture_compression_astc_hdr 2687- venus: sync venus protocol headers for VK_EXT_primitive_topology_list_restart 2688- venus: add support for VK_EXT_primitive_topology_list_restart 2689- venus: sync to latest venus protocol headers 2690- venus: add support for VK_EXT_multi_draw 2691- venus: use buffer cache for vkGetDeviceBufferMemoryRequirements 2692 2693Kai Wasserbäch (22): 2694 2695- chore(deps): clover: raise the minimum LLVM version to 11.0.0 2696- fix(FTBFS): meson: raise C++ standard to C++17 2697- chore(docs): rusticl: improve list of build dependencies 2698- fix: util/format: unused variable ‘desc’ [-Wunused-variable] 2699- fix: nir: unused variable ‘else_block’ [-Wunused-variable] 2700- fix: ac/llvm: unused variable ‘offset’ [-Wunused-variable] 2701- fix: mesa: unused variable ‘ret’ [-Wunused-variable] 2702- fix: mesa/st: variable ‘texobj’ set but not used [-Wunused-but-set-variable] 2703- fix: mesa/st: unused variable ‘src’ [-Wunused-variable] 2704- fix: gallivm: variable ‘type_kind’ set but not used [-Wunused-but-set-variable] 2705- fix: r600/sb: unused variable ‘repdep2’ [-Wunused-variable] 2706- fix: r600/sb: unused variable ‘fop’ [-Wunused-variable] 2707- fix: r600/sb: unused variable ‘r’ [-Wunused-variable] 2708- fix: r600/sfn: variable ‘opinfo’ set but not used [-Wunused-but-set-variable] 2709- fix: r600/sb: warning: unused variable ‘{b,nl}’ [-Wunused-variable] 2710- fix: r600/sfn: unused variable ‘splitpos’ [-Wunused-variable] 2711- fix: r600/sfn: unused variable ‘{splitpos,param}’ [-Wunused-variable] 2712- fix: r600/sfn: variable ‘fail’ set but not used [-Wunused-but-set-variable] 2713- fix: r600/sfn: unused variable ‘spi_sid’ [-Wunused-variable] 2714- fix: iris: unused variable ‘devinfo’ [-Wunused-variable] 2715- fix: zink: unused variable ‘intr’ [-Wunused-variable] 2716- fix: frontends/va: unused variable ‘max_pipe_hevc_slices’ [-Wunused-variable] 2717 2718Karmjit Mahil (30): 2719 2720- pvr: Fix bo mapping on alloc with PVR_BO_ALLOC_FLAG_CPU_ACCESS. 2721- pvr: Compete pvr_calc_fscommon_size_and_tiles_in_flight(). 2722- pvr: Fix calculation in rogue_max_compute_shared_registers(). 2723- pvr: Change indentation from tab to spaces for xml files. 2724- pvr: Set descriptor dirty flag based on other flags. 2725- pvr: Finish setting up job resolve info. 2726- pvr: Remove unimplemented push descriptor code. 2727- pvr: Add depth_bias_array handling on dbenable. 2728- pvr: Add clear program in pvr_device. 2729- pvr: Add static clear control stream templates in pvr_device. 2730- pvr: Add static clear VDM state in pvr_device. 2731- pvr: Add graphics pipeline barrier handling. 2732- Revert "pvr: Make pvr_cmd_pack() macro clearly internal" 2733- pvr: Implement clear ppp state emission from template. 2734- pvr: Add mid fragment pipeline barrier if needed. 2735- pvr: Handle VK_CULL_MODE_FRONT_AND_BACK. 2736- pvr: Emit cs words for load op on vkCmdBeginRenderPass(). 2737- pvr: Add basic skeleton for event sub cmd. 2738- pvr: Remove struct pvr_emit_state and emit header directly. 2739- pvr: Add csb helpers macros to write into raw buffer. 2740- pvr: Make control stream word writing stricter. 2741- pvr: Complete pvr_emit_ppp_state(). 2742- pvr: Add assert for texturestate being 0 in when emitting ppp state. 2743- pvr: Add EMIT_MASK in pvr_emit_ppp_state(). 2744- pvr: Fix possible seg fault on csb copy. 2745- pvr: Fix overflow before widen warning for rgn headers size calculation. 2746- pvr: Handle pipeline barrier vk_sync. 2747- pvr: Remove outdated comments. 2748- pvr: Add PBE accum format size in bytes. 2749- pvr: Change pvr_get_hw_clear_color() to pack based on accum formats. 2750 2751Karol Herbst (189): 2752 2753- printf: extract clovers printf impl 2754- printf: add some unit tests 2755- nir: serialize printf metadata for CL kernels 2756- nir/lower_images: extract from clover 2757- nir: extract the clc inline sampler dedup pass from clc 2758- microsoft/clc: drop inline sampler reordering 2759- clc: undefine spirv defs to work around LLVMs headers 2760- vtn: silence warning about linkage 2761- gallium: mark the input data as const in pipe_grid_info 2762- util/memstream: insert null terminator on windows 2763- ci: update fails list 2764- ci: bumping all tags to make sure we don't have hidden fails 2765- nvc0: limit max global and alloc size 2766- ci: update CI to reflect clovers LLVM version bump 2767- nv50: fix code heap after pipe_shader_enum change 2768- nouveau: use the contexts pushbuf and client where possible 2769- nouveau/mm: make code thread safe 2770- nouveau/buffer: simplify uses of nouveau_fence_work 2771- nouveau/fence: rework nouveau_fence_emit so we can call it on emitted fences 2772- nouveau: move nouveau_context initialization to common code 2773- nouveau: wrap nouveau_bo_map 2774- nouveau: wrap nouveau_bo_wait 2775- nouveau: wrap all nouveau_pushbuf_space calls 2776- nouveau: wrap nouveau_pushbuf_validate 2777- nouveau: wrap nouveau_pushbuf_refn 2778- nouveau: use PUSH_KICK instead of nouveau_pushbuf_kick 2779- nouveau/fence: make the fence struct part of the fence API 2780- nv50: remove nouveau_fence_signalled prior nouveau_fence_wait 2781- nv50: remove nv50_bufctx_fence call in vbo_kick_notify 2782- nouveau/fence: per context fence 2783- nouveau: per context client and pushbuf 2784- nouveau: make fencing race free 2785- nvc0: make state handling race free 2786- nv50: race free state tracking 2787- nv50/ir: fix OP_UNION resolving when used for vector values 2788- nv50: properly flush the TSC cache on 3D 2789- rusticl: added 2790- rusticl: finish implementing clBuildProgram 2791- rusticl/util: add static_assert macro 2792- rusticl: implement clFinish and clFlush 2793- rusticl: implement clEnqueueReadBuffer 2794- rusticl/mem: support ops on subbuffers 2795- rusticl/kernel: basic implementation 2796- rusticl: nir bindings 2797- rusticl: translate spirv to nir and first steps to kernel arg handling 2798- rusticl/kernel: more clGetKernelWorkGroupInfo props 2799- rusticl/kernel: clGetKernelWorkGroupInfo allows a NULL device 2800- rusticl/mem: use helper context for COPY_HOST_PTR buffers 2801- rusticl/event: wrong but non crashing impl of clWaitForEvents 2802- rusticl: hack for CL 3.0 2803- rusticl/mesa: add fencing support 2804- rusticl/event: proper eventing support 2805- rusticl/kernel: implement clCreateKernelsInProgram 2806- rusticl/kernel: implement clEnqueueTask 2807- rusticl/program: undefine __IMAGE_SUPPORT__ if images are unsupported 2808- rusticl/queue: fix clReleaseCommandQueue 2809- rusticl/mem: implement clCopyBuffer 2810- rusticl/mem: implement clFillBuffer 2811- rusticl/event: implement marker and barrier 2812- rusticl/mem: add clEnqueueMigrateMemObjects stub 2813- rusticl/program: support compiling libraries 2814- rusticl: implement clUnloadPlatformCompiler 2815- rusticl/api: add param to query which contains application provided values 2816- rusticl/program: implement clCreateProgramWithBinary 2817- rusticl/event: add fake impl of clGetEventProfilingInfo 2818- rusticl/kernel: implement CL_KERNEL_COMPILE_WORK_GROUP_SIZE 2819- rusticl/mem: finish clEnqueueMigrateMemObjects 2820- rusticl/kernel: run driver requested lowering passes 2821- rusticl/kernel: add support for offsets 2822- rusticl/kernel: add missing preprocessor definitions 2823- rusticl: advertize CL 1.1 and CL 1.2 extensions 2824- rusticl/context: implement clSetContextDestructorCallback 2825- rusticl/kernel: implement clCloneKernel 2826- rusticl/device: set required double fp config values 2827- rusticl/device: advertize atomic caps 2828- rusticl: claim support for generic pointers 2829- rusticl/program: add il stubs 2830- rusticl: add svm func stubs 2831- rusticl/icd: add more func pointers 2832- rusticl/kernel: report SIMD width as work group sizes 2833- rusticl/kernel: sweep nir to reduce peak memory usage 2834- rusticl/device: expose cles_khr_int64 for compatible embedded devices 2835- rusticl/mem implement the memory \*WithProperties API 2836- rusticl/pipe: add clGetPipeInfo stub end return CL_INVALID_MEM_OBJECT 2837- rusticl/program: add stubs for program ctors and dtors 2838- rusticl/kernel: add stub for clGetKernelSubGroupInfo 2839- rusticl/event: timeout wait on condvar to abort on reaped worker threads 2840- rusticl: add support for printf 2841- rusticl/spirv: add print method 2842- rusticl/mesa: support mapping textures 2843- rusticl/mem: support read/write/copy ops for images 2844- rusticl/memory: rework mapping tracking 2845- rusticl/mem: implement maping images 2846- rusticl/mem: implement fill image 2847- rusticl/mem: implement copy image 2848- rusticl/mem: implement clCreateImage2D and 3D 2849- rusticl/mem: implement copies between buffers and images 2850- rusticl/mesa/context: un Arc the PipeContext 2851- rusticl/mem: implement clCreateSamplerWithProperties 2852- rusticl/kernel: support for images 2853- rusticl/kernel: inline samplers 2854- rusticl/kernel: lower memcpy 2855- rusticl/device: add retain/release callbacks for luxmark v4 2856- rusticl/device: implement cl_khr_3d_image_writes 2857- rusticl/kernel: set CL_DEVICE_PREFERRED_WORK_GROUP_SIZE_MULTIPLE 2858- rusticl/kernel: run some more opt passes 2859- rusticl: get rid of a few clones 2860- rusticl/memory: don't map more than necessary for copy operations 2861- rusticl/device: fix api consistency_device_and_host_timer test 2862- rusticl/queue: implement missing CL 3.0 bits 2863- rusticl/device: fix compiler features_macro 2864- rusticl/mem: add user_ptr fallback shadow buffer 2865- rusticl/mesa: make resource_from_user_memory an optinal feature 2866- rusticl/kernel: implement image_format and image_order 2867- rusticl/mem: fix IMAGE1D_BUFFER 2868- rusticl/mem: only write pitch when required 2869- rusticl: port to Rust 2018 2870- rusticl: kernel caching 2871- rusticl/kernel: prepare for nir caching 2872- rusticl/kernel: cache the nir as well 2873- rusticl: fix compiler features_macro 2874- rusticl/kernel: optimize local size 2875- rusticl: the CTS is a piece of shit 2876- rusticl/kernel: implement CL_KERNEL_ATTRIBUTES 2877- rusticl/program: some boilerplate code for SPIR-V support 2878- rusticl/memory: kernel read_write images prep work 2879- rusticl: disable fp64 support 2880- rusticl/device: allow overwriting the device_type via env 2881- rusticl/program: parse quoted paths in args 2882- rusticl/kernel: fix local buffers 2883- rusticl/device: report mesas version for CL_DRIVER_VERSION 2884- rusticl/icd: implement clGetExtensionFunctionAddressForPlatform 2885- rusticl/icd: add some way of debugging CL function calls 2886- rusticl/kernel: use real references for arguments 2887- rusticl: proper PIPE_MAP flags for internal maps 2888- rusticl: call glsl_type_singleton_init_or_ref 2889- rusticl/program: some more API validation 2890- ci: rusticl with llvmpipe 2891- rusticl: add README file 2892- rusticl/device: print error when libclc fails to load 2893- rusticl/mem: return errors for OOB accesses 2894- rusticl/device: fix custom device detection 2895- rusticl/mem: fix image OOB checks 2896- rusticl/mem: align fill pattern buffer to 4 bytes 2897- rusticl: limit global mem to 2GB 2898- rusticl/kernel: assign locations before passing the nir to drivers 2899- ac/llvm: support non 32 bit sized workgroup ids 2900- ac/llvm: fix load_const for vectors with more than 4 elements 2901- ac/llvm: fix load/store_shared for vectors with more than 4 elements 2902- ac/llvm: fix load_ubo for vectors with more than 4 elements 2903- radeonsi: stop vectorizing unpack_32_2x16_split 2904- iris: use images_used instead of num_images 2905- iris: speed up walking global bindings 2906- iris: bump IRIS_MAX_GLOBAL_BINDINGS to 128 2907- rusticl/kernel: preserve fp16 denorms to fix vload/vstore_half 2908- rusticl: add support for coherent resources 2909- rusticl/mesa: add bx() method to PipeTransfer 2910- rusticl: rework resource mappings a little 2911- rusticl: add helper ctx wrapper for coherent and direct mapping 2912- rusticl/mem: rewrite the (un)mapping code 2913- lp: claim being UMA 2914- rusticl/mem: propper CL_MEM_ALLOC_HOST_PTR support 2915- radeonsi: fail creating textures from user memory 2916- nir/lower_cl_images: set binding 2917- radeonsi: lower hadd and fisnormal 2918- rusticl: remove blit code 2919- rusticl: add RUSTICL_ENABLE to have a global way of enabling devices 2920- docs: Add documentation for Rusticl's env variables 2921- ci/llvmpipe: update to new RUSTICL_ENABLE var 2922- zink: handle more opcodes for CL 2923- zink: handle f2f16_rtz 2924- zink: fix spirv_builder_spec_const_uint 2925- aux/trace: add set_global_binding 2926- rusticl: force BIND_LINEAR on staging resources 2927- rusticl/mem: can only map staging textures directly 2928- rusticl: add small bitset impl 2929- rusticl/nir: add reads_sysval wrapper 2930- rusticl/kernel: lower system values before gathering info 2931- rusticl/kernel: add work_dim lowering 2932- zink: remove leftover work_dim lowering code 2933- rusticl/kernel: fix more 32 bit problems 2934- nir/algebraic: add 8 and 64 bit urol and uror lowering 2935- nir/algebraic: support CL vector accessors 2936- nir/algebraic: generalize vector_cmp lowering 2937- nir/algebraic: add vec8/16 cmp lowering 2938- iris: invalidate sysvals if grid dimension changes 2939- glsl: fix buffer texture type 2940- rusticl/device: put space at the end of CL_DEVICE_VERSION 2941- nir/lower_int64: fix shift lowering 2942 2943Kenneth Graunke (53): 2944 2945- intel/compiler: Use named NIR intrinsic const index accessors 2946- iris: Pass devinfo to iris_resource_level_has_hiz() 2947- iris: Enable HiZ for non-8x4 aligned miplevels on Icelake and later 2948- iris: Always retain ISL_AUX_USAGE_HIZ_CCS_WT in texture aux usage 2949- iris: Clean up iris_sample_with_depth_aux() 2950- intel/compiler: Drop variable group size lowering 2951- iris: Fix PIPE_CAP_UMA 2952- iris: Use linear for exported resources if we can't convey tiling 2953- intel/compiler: Change dg2_plus check to devinfo->verx10 >= 125 2954- iris: Delete unused iris_screen::aperture_bytes field 2955- iris: Don't print out XXX messages if resource creation fails 2956- iris: Fall back if iris_map_copy_region can't create a staging resource 2957- iris: don't create staging resources larger than half the aperture 2958- crocus: Don't print out XXX messages if resource creation fails 2959- crocus: Fall back if iris_map_copy_region can't create a staging resource 2960- crocus: Fix memory leaks on iris_resource_create failure paths 2961- anv: Fail to create a device on ver < 9 2962- anv: Drop checks for version 8 or 9 2963- anv: Delete image param support. 2964- anv: Delete batch buffer growing code. 2965- anv: Delete shader constants UBO from descriptor sets 2966- anv/tests: Don't use relocations in a test case 2967- anv: Delete "back" allocation from state pool 2968- anv: Delete relocation support from anv_block_pool 2969- anv: Delete "back" allocation from anv_block_pool 2970- anv: Drop state pool relocation munging 2971- anv: Delete use_relocations flag 2972- anv: Delete softpin checks 2973- anv: Delete anv_reloc_list_add() 2974- anv: Delete wrapper BOs for relocations 2975- anv: Delete relocation support from batch submission 2976- anv: Delete has_a64_buffer_access flag 2977- anv: Delete has_bindless_images and has_bindless_samples flags 2978- anv: Make a helper function for pinning a state pool's BOs 2979- anv: Drop offset from anv_reloc_list_append 2980- anv: Inline write_reloc into the only remaining caller 2981- anv: Remove anv_batch_emit_reloc and just open-code it 2982- intel/compiler: Use subgroup invocation for ICP handle loads 2983- iris: Ignore aux for copy_region source if there's no unresolved color 2984- intel/compiler: Use an existing URB write to end TCS threads when viable 2985- st/mesa: Optionally call nir_vectorize_tess_levels() 2986- intel/compiler: Vectorize gl_TessLevelInner/Outer[] writes 2987- intel/genxml: Add XY_FAST_COLOR_BLT 2988- blorp: Make blitter_supports_aux accessible from multiple files. 2989- blorp: Implement blitter clears via XY_FAST_COLOR_BLT 2990- blorp: Fix typo in blorp_xy_block_copy_blt 2991- Revert "intel/compiler: Vectorize gl_TessLevelInner/Outer[] writes" 2992- iris: Add miplevel parameters to iris_resource_texture_aux_usage 2993- intel/compiler: Vectorize gl_TessLevelInner/Outer[] writes [v2] 2994- iris: Use nir_intrinsic_load_global_constant for large constants 2995- intel/compiler: Run nir_opt_large_constants before scalarizing consts 2996- st/mesa: Let nir_opt_access() infer non-readable 2997- nir: Drop infer_non_readable option for nir_opt_access() 2998 2999Konrad Kleine (1): 3000 3001- Update docs/drivers/llvmpipe.rst 3002 3003Konstantin Kharlamov (5): 3004 3005- loader/dri3: simplify check for reply 3006- loader/dri3: remove a no-op free call 3007- meson: remove source_root() call in nir compiler path 3008- meson: remove source_root() call in main meson.build 3009- .mailmap: change spelling for Constantine Kharlamov 3010 3011Konstantin Seurer (63): 3012 3013- radv/nir_lower_abi: Use instructions_pass 3014- radv: Move accel struct structs to bvh/bvh.h 3015- radv: Move radv_acceleration_structure 3016- radv: Remove acceleration structure host builds 3017- radv: Always emulate fmin/fmax 3018- radv: Remove accel_struct_build 3019- radv: Remove unused push constant structs 3020- radv: Add create_build_pipeline_spv helper 3021- radv: Add the basics for GLSL bvh kerrnels 3022- radv: Add a GLSL morton kernel implementation 3023- radv: Switch to the GLSL morton implementation 3024- radv: Add a GLSL internal kernel implementation 3025- radv: Switch to the GLSL internal implementation 3026- radv: Add a GLSL leaf kernel implementation 3027- radv: Switch to the GLSL leaf implementation 3028- lavapipe: Remove st_shader_stage_to_ptarget 3029- radv: Fix stack size calculation with stage ids 3030- radv: Inline bvh build headers 3031- radv/ci: Remove host build related fails 3032- lavapipe: Set ss_dirty in emit_state 3033- radv: Advertise subgroup ops for rt stages 3034- vulkan: Add common physical device management 3035- radv: Use the common physical device enumeration 3036- anv: Use the common physical device enumeration 3037- turnip: Use the common physical device enumeration 3038- lvp: Use the common physical device enumeration 3039- panvk: Use the common physical device enumeration 3040- v3dv: Use the common physical device enumeration 3041- radv: Proper handling for inactive instance nodes 3042- radv: Deduplicate push constant structs 3043- radv/rra: Map accel struct VAs to handles 3044- radv/rra: Remove redundant bounds validation 3045- radv: Make the radv_buffer_get_va parameter const 3046- radv/rra: Replace aliasing assert with a warning 3047- radv: Explicitly store the VA of accel structs 3048- radv: Cleanup radv_GetInstanceProcAddr 3049- radv: Add a common traversal build helper 3050- radv/rq: Use the common traversal helper 3051- radv/rtpso: Use the common traversal helper 3052- radv: Use scalar layout for BDA references 3053- radv: Add and use AS and scratch layout structs 3054- radv: Fix incorrect build info indexing 3055- radv: Use cache_uuid for accel struct compatibility 3056- radv: Remove main_loop_case_visited 3057- radv/rra: Transcode nodes recursively 3058- radv: Rename node_internal to node_box32 3059- radv: Add radv_bvh_node_box16 3060- radv: Use half floats for box16 coords 3061- radv/rra: Handle box16 nodes 3062- radv/rra: Calculate bvh size requirements recursively 3063- radv: Remove create_accel_build_shader 3064- radv/rra: Fix dumps in the case of aliasing 3065- radv/rra: Validate before gathering bvh info 3066- radv/rra: Add basic header validation 3067- radv/rra: Continue dumping accel structs if validation fails 3068- radv/rra: Use the accel struct type for header validation 3069- radv/rt: Fix setting tmax for opaque AABBs 3070- radv/rt: Load instance id and custom index on demand 3071- radv/rt: Set vars.arg in the traversal shader 3072- radv/rt: Restore prev barycentrics when rejecting hits 3073- radv/rra: Fix copying accel structs that were not built yet 3074- radv/ray_queries: Fix AABB handling 3075- radv/rt: Check space before emitting descriptors 3076 3077Kostiantyn Lazukin (1): 3078 3079- vulkan/cmd_queue: Do not generate unreachable vk_free_* calls. 3080 3081Kuixi Ren (1): 3082 3083- radeonsi/vcn: Add ability to encode with ltr 3084 3085Laurent Bigonville (1): 3086 3087- Try to fix FTBFS on kfreebsd architecture 3088 3089Leandro Ribeiro (9): 3090 3091- vulkan/wsi/wayland: move some structs to beginning of code 3092- vulkan/wsi/wayland: remove unnecessary spaces in struct fields 3093- vulkan/wsi/wayland: introduce struct wsi_wl_surface 3094- vulkan/wsi/wayland: take ownership of wsi_wl_surface when creating chain 3095- vulkan/wsi/wayland: move wl_surface and wl_display from chain to struct wsi_wl_surface 3096- vulkan/wsi/wayland: remove refcount from struct wsi_wl_display 3097- vulkan/wsi/wayland: add default dma-buf feedback support 3098- vulkan/wsi/wayland: add per-surface dma-buf feedback support 3099- vulkan/wsi/wayland: avoid useless re-allocations when receiving per-surface dma-buf feedback 3100 3101Leo Liu (2): 3102 3103- meson: add with_gallium_virgl to allow it as VA backend driver 3104- frontends/va: fix build error for vaSyncBuffer with older VA 3105 3106LingMan (10): 3107 3108- rusticl: Fix compilation if stdout and/or stderr aren't symbols 3109- rusticl/bindgen: Use \`allowlist-\*\` instead of \`whitelist-\*\` switches 3110- docs/rusticl: Document minimum required bindgen version 3111- rusticl/api: Use iterators in create_program_with_source 3112- rusticl/api: Drop UTF-8 conversion of input source 3113- rusticl/api: Fix creating a program if a nul byte is within the specified string length 3114- rusticl/api: Factor pushing to the Vec out of the if 3115- rusticl/api: Shrink unsafe block 3116- rusticl/api: Don't check the program source for nul bytes needlessly 3117- rusticl/api: Interpret \`lengths\` as a slice of Option<NonZeroUsize> 3118 3119Lionel Landwerlin (152): 3120 3121- anv: fixup PIPE_CONTROL restriction on gfx8 3122- intel/ds: track untyped dataport flushes 3123- intel/fs: store num of resume shaders in prog_data 3124- intel/fs: fixup simd selection with shader calls 3125- intel/compiler: document units of brw_ubo_range fields 3126- intel/nir: specify synchronous value for tracing op 3127- anv: don't return incorrect error code for vkCreateDescriptorPool 3128- intel/drm-shim: update shim to support DG2 3129- anv: fix GetPipelineExecutableStatistics for ray tracing pipelines 3130- anv: fixup assertions on lowered storage formats 3131- anv: fix assert in memory budget code when extension is not supported 3132- anv: prevent trying to mmap non host visible memory 3133- anv: remove the LOCAL_MEM allocation bit 3134- intel/fs: fix load_scratch intrinsic 3135- intel/fs: fixup scratch load/store handling on Gfx12.5+ 3136- intel/fs: switch compute push constant loads to LSC 3137- intel/fs: remove unused opcode 3138- intel/fs: fixup SEND validation check on overlapping src0/src1 3139- intel/fs: switch register allocation spilling to use LSC on Gfx12.5+ 3140- intel/fs: port block a64/surface messages to use LSC 3141- intel/fs: bump max SIMD size for A64 atomics with LSC 3142- intel: add a hasvk vulkan driver 3143- hasvk: remove entrypoints for gfx9+ 3144- hasvk: stop advertising Vk 1.3 on non-softpin 3145- hasvk: remove acceleration structure code 3146- hasvk: remove ray tracing code 3147- hasvk: remove mesh code 3148- anv: silence fedora build warning 3149- anv: remove support for gfx7/8 3150- anv: remove unused gfx7 code 3151- anv/tests: remove back allocation tests 3152- anv/hasvk: tweak loading failure messages 3153- anv: remove non present field in upstream spec 3154- vulkan: fixup 1.3.226 update 3155- anv: enable EXT_depth_clamp_zero_one 3156- hasvk: expose VK_EXT_depth_clamp_zero_one 3157- anv: add a new NO_LOCAL_MEM allocation flag 3158- anv: don't export flush_compute_state 3159- anv: don't export gfx state flushing helper 3160- anv: combine flushes in Draw/DrawIndexed/DrawIndirectByteCountEXT 3161- anv: add missing tracepoint 3162- blorp: defined operations for debug purposes 3163- intel/utrace: make blorp tracepoints more readable 3164- anv: trace xfb queries 3165- anv: clear descriptorsets if AllocateDescriptorSets fails 3166- anv: add support for EXT_mutable_descriptor_type 3167- anv: add missing wokraround for texture cache invalidate 3168- intel/nir/rt: fixup generate hit 3169- intel/nir/rt: spill/fill the entire ray query data 3170- intel/nir/rt: remove ray query mem hit writes at initialization 3171- intel/nir/rt: fix ray query proceed level 3172- intel/nir/rt: change scratch check validation 3173- intel/nir/rt: store ray query state in scratch 3174- intel/fs: fixup a64 messages 3175- intel/nir/rt: fixup alignment of memcpy iterations 3176- ci: add python3-ply to debian/fedora images 3177- ci: disable intel-clc on fedora 3178- ci: build our own version of the LLVM SPIRV translator 3179- ci/debian: don't use libclc from the system 3180- ci: bump llvm to 13 for some builders 3181- ci: disable intel-clc on debian-vulkan 3182- anv: remove HDC flush from invalidate bits 3183- meson: bump required llvm-spirv version with intel-clc 3184- intel/devinfo: Rename & implement num_dual_subslices 3185- intel/rt: fix procedural primitive ID access 3186- intel/nir: reuse rt helper 3187- intel/fs: take a builder arg for resolve_source_modifiers() 3188- intel/nir: fixup preserved metadata in rayquery lowering 3189- intel/nir: fix potential invalid function impl ptr usage 3190- intel/nir: disable assert on async stack id 3191- intel/fs: disable split_array_vars on opencl kernels 3192- genxml: add missing no duplicate anyhit flag 3193- intel/mi_builder: allow half GP registers for dereferencing 3194- anv: disable SIMD16 for RT shaders 3195- anv: add new command buffer space allocation 3196- anv: setup scratch space correctly for RT shaders 3197- anv: bump client visible address heap to 32GiB 3198- anv: use the right dispatch size for tracing shaders 3199- anv: flag BO for write combine when CPU visible and potentially in lmem 3200- iris: better error message with GuC loading failures 3201- intel: add INTEL_DEBUG=capture-all to capture everything upon hang 3202- pps: enable track_event in intel.cfg 3203- anv: add grl build dependency on entrypoints 3204- nir: fix NIR_DEBUG=validate_ssa_dominance 3205- vulkan/runtime: don't lookup the pipeline disk cache if disabled 3206- anv: remove unused code 3207- anv: comment out the Gfx8/9 VB cache key workaround for newer Gens 3208- anv: limit calls into cmd_buffer_flush_dynamic_state 3209- anv: optimize 3DSTATE_VF emission 3210- anv: remove multiple push descriptors 3211- isl: avoid gfx version switch cases on the hot path 3212- anv: limit push constant reemission 3213- anv: track descriptor set layout flags 3214- anv: initialization pipeline layout to 0s 3215- anv: add analysis for push descriptor uses and store it in shader cache 3216- anv: reduce BT emissions & surface state writes with push descriptors 3217- anv: add a layer for hitman3 3218- nir/lower_shader_calls: rematerialize more trivial values 3219- nir/lower_shader_calls: lower scratch access to format internally 3220- nir/lower_shader_calls: avoid respilling values 3221- nir/lower_shader_calls: rematerialize values in more complex cases 3222- nir/lower_shader_calls: add NIR_PASS_V internally 3223- nir/lower_shader_calls: cleanup shaders a bit more post split 3224- nir/lower_shader_calls: add an option structure for future optimizations 3225- nir/lower_shader_calls: add a pass to trim scratch values 3226- nir/lower_shader_calls: add a pass to sort/pack values on the stack 3227- nir/lower_shader_calls: move scratch loads closer to where they're needed 3228- nir/lower_shader_calls: run opt_cse after lower stack intrinsics 3229- radv: tweak lower_shader_calls parameters 3230- anv: enable localized loads for lower_shader_calls 3231- anv: disable mesh in memcpy 3232- anv: init major/minor before WSI 3233- vulkan/wsi/wl: correctly find whether the compositor uses the same GPU 3234- nir/divergence_analysis: add some missing RT intrinsics 3235- nir/divergence_analysis: add missing desc_set_address_intel 3236- intel/clc: assert when libclc shader is not found 3237- drm-uapi: bump headers 3238- isl: add new MOCS field for protected buffers 3239- intel/common: add detection of protected context support 3240- dri: rename PROTECTED_CONTENT in PROTECTED_SURFACE 3241- gallium: rename PROTECTED_CONTENT cap into PROTECTED_SURFACE 3242- egl: Add EGL_EXT_protected_content support 3243- st/gallium: plumb protected context creation 3244- iris: handle protected BO creation 3245- iris: Emit protection & session ID on protected command buffers 3246- iris: enable protected contexts 3247- intel/fs: require UNDEFs register offsets to be aligned to REG_SIZE 3248- intel/fs: make split_virtual_grfs deal with partial undefs 3249- intel/fs: reduce liveness of variables in lowering passes 3250- intel/fs: use fs implementation of dump_instructions 3251- intel/compiler: don't allocate compaction arrays on the stack 3252- anv: remove shader fp64 inspection after parsing 3253- anv: Reduce RHWO optimization (Wa_1508744258) 3254- anv: fix missing VkPhysicalDeviceExtendedDynamicState3PropertiesEXT handling 3255- anv: fixup invalid enum for nir environment 3256- anv: split internal surface states from descriptors 3257- anv: bump pool bucket max allocation size 3258- anv: get rid of ilog2_round_up 3259- intel/perf: allocate cleared counter infos 3260- intel/perf: fix B/C counters accumulation in non query mode 3261- nir/lower_explicit_io: fix metadata preserve 3262- nir/lower_shader_calls: update metadata before validation 3263- nir/lower_shader_calls: wrap only jumps rather than entire code blocks 3264- Revert "nir/lower_shader_calls: put inserted instructions into a dummy block" 3265- intel/fs: put scratch surface in the surface state heap 3266- anv: fix 3d state initialization 3267- anv: ensure CPS is initialized when KHR_fragment_shading_rate is disabled 3268- anv: fixup context initialization on DG2 3269- intel/fs: improve Wa_22013689345 workaround 3270- blorp: support negative offsets in addresses 3271- anv: generate correct addresses for state pool offsets 3272- nir/divergence: add missing btd_shader_type_intel 3273 3274Liviu Prodea (5): 3275 3276- d3d12/dzn/spirv2dxil: Require version library 3277- Microsoft clc: strip lib prefix Otherwise OpenCLon12 ICD can't load it 3278- meson: Microsoft / maybe Intel CLC need the all-targets workaround just like clover 3279- meson: Only draw with llvm depends on native directly Tests, softpipe or AMD drivers don't depend on it directly 3280- clc/clover: Link clang statically when shared-llvm is disabled 3281 3282Lucas Stach (38): 3283 3284- etnaviv: properly check number of layers in surface creation 3285- etnaviv: fix TS buffer allocation for 3D textures 3286- etnaviv: allow 3D textures with TS in transfer 3287- etnaviv: don't expose array and 3D texture support on pre-halti GPUs 3288- etnaviv: expose ARB_draw_instanced 3289- etnaviv: mark instanced draw extensions as supported in docs/features.txt 3290- etnaviv: move checking for MC2.0 for TS into screen init 3291- etnaviv: use linear PE rendering only on properly aligned surfaces 3292- etnaviv: add debug option to disable linear PE feature 3293- etnaviv: fill alpha channel for DXT1_RGB textures 3294- etnaviv: don't use TS with multi-layer resources 3295- etnaviv: texture-desc: sample 1D textures using 2D mode 3296- etnaviv: texture-desc: drop superfluous condition 3297- etnaviv: rewrite sampler TS check 3298- etnaviv: slim down etna_shader_key 3299- etnaviv: pass shader key by reference 3300- etnaviv: properly reference flush_resources 3301- etnaviv: move etna_layout_multiple into etnaviv_resource.c 3302- etnaviv: assert valid layout in etna_layout_multiple 3303- etnaviv: compute linear resource Y alignment in etna_layout_multiple 3304- etnaviv: handle compressed texture formats in etna_layout_multiple 3305- etnaviv: increase alignment for MSAA resources 3306- etnaviv: rs: fix MSAA alignment adjustment 3307- etnaviv: rs: try to find exact format match first 3308- etnaviv: properly size TS buffer for MSAA resources 3309- etnaviv: set LOGIC_OP_UNK24 for MSAA rendering on SMALL_MSAA GPUs 3310- etnaviv: disable PE_COLOR_FORMAT_OVERWRITE with MSAA 3311- etnaviv: blt: scale operations by MSAA mode 3312- etnaviv: blt: try to find exact format match first 3313- etnaviv: update headers from rnndb 3314- etnaviv: fix late Z with MSAA active 3315- etnaviv: fix shader register control with MSAA 3316- etnaviv: switch to late Z when linear PE is used 3317- etnaviv: fix tile status interaction with write mappings 3318- etnaviv: always use RS align when GPU has TEXTURE_HALIGN feature 3319- etnaviv: rs: fix blits with insufficient alignment for dual pipe operation 3320- etnaviv: blt: use correct TS offset in clear operations 3321- etnaviv: fix wrong surface TS clear size 3322 3323Luis Felipe Strano Moraes (8): 3324 3325- anv: fix FTBFS on grl due to changes in clang 15 3326- anv: reword info flag in intel_clc's getopt to avoid clash 3327- anv: add missing separator to help for intel_clc 3328- anv: fixing typo on description of output flag for intel_clc 3329- anv: added proper handling for input argument in intel_clc 3330- anv: adding parsetab.py to the .gitignore for grl 3331- meson: simplified meson for enabling ray-tracing on Intel 3332- meson: only enable intel-clc for x86_64 builds 3333 3334M Henning (1): 3335 3336- nv/nir: Set ssbo CacheMode from intrinsic access 3337 3338Marcin Ślusarz (53): 3339 3340- intel/compiler: fix mesh urb write regression 3341- nir/lower_task_shader: print shader after each step 3342- intel/compiler: add support for non-zero base in [load|store]_shared intrins 3343- Revert "nir/lower_task_shader: don't use base index for shared memory intrinsics" 3344- anv: disable task redistribution 3345- spirv, compiler: add "bool nv" to shader_info.mesh 3346- intel/compiler: implement EXT_mesh_shader 3347- anv: check EXT_mesh_shader whenever NV_mesh_shader is checked 3348- anv: replace VK_SHADER_STAGE_[TASK|MESH]_BIT_NV with VK_SHADER_STAGE_[TASK|MESH]_BIT_EXT 3349- anv: implement EXT_mesh_shader 3350- anv: implement draw calls for EXT_mesh_shader 3351- anv: enable EXT_mesh_shader 3352- anv: fix emission of primitive replication packet for mesh stage 3353- nir: add uses_wide_subgroup_intrinsics to task/mesh shader_info 3354- anv: add support for anv_assume_full_subgroups to task & mesh stages 3355- anv: small cleanup of anv_graphics_pipeline_compile 3356- nir, anv, hasvk, radv: pull uses_wide_subgroup_intrinsics into shader_info 3357- intel/compiler: remove second shading rate lowering for mesh 3358- intel/compiler: print shader after successful brw_nir_lower_shading_rate_output 3359- intel/compiler: use nir_lower_task_shader pass 3360- nir/lower_task_shader: lower small stores & loads to shared when requested 3361- intel/compiler/task: use shared memory for small task payload loads & stores 3362- intel/compiler: refactor brw_nir_lower_mem_access_bit_sizes 3363- intel/compiler: add support for 8/16 bits task payload loads 3364- radv: use nir_shader_instructions_pass in radv_nir_lower_ycbcr_textures 3365- iris: remove invalid nir_metadata_preserve from iris_fix_edge_flags 3366- crocus: remove invalid nir_metadata_preserve from crocus_fix_edge_flags 3367- glsl: use nir_shader_instructions_pass in gl_nir_lower_atomics 3368- glsl: use nir_shader_instructions_pass in gl_nir_lower_images 3369- glsl: use nir_shader_instructions_pass in gl_nir_lower_samplers_as_deref 3370- nir: use nir_shader_instructions_pass in nir_lower_alu 3371- nir: use nir_shader_instructions_pass in nir_lower_bool_to_bitsize 3372- nir: use nir_shader_instructions_pass in nir_lower_bool_to_float 3373- nir: use nir_shader_instructions_pass in nir_lower_bool_to_int32 3374- nir: use nir_shader_instructions_pass in nir_lower_64bit_phis 3375- nir: use nir_shader_instructions_pass in nir_lower_clamp_color_outputs 3376- nir: use nir_shader_instructions_pass in nir_lower_clip_disable 3377- nir: use nir_shader_instructions_pass in nir_lower_clip_halfz 3378- nir: use nir_shader_instructions_pass in nir_lower_drawpixels 3379- nir: use nir_shader_instructions_pass in nir_lower_fb_read 3380- nir: use nir_shader_instructions_pass in nir_lower_frexp 3381- nir: use nir_metadata_none instead of its value 3382- nir: use nir_shader_instructions_pass in nir_lower_interpolation 3383- nir: use nir_shader_instructions_pass in nir_lower_samplers 3384- nir: use nir_shader_instructions_pass in nir_split_var_copies 3385- nir: use nir_shader_instructions_pass in nir_split_per_member_structs 3386- intel/compiler: fix loading of draw_id from task & mesh payload 3387- anv: add support for mesh shading in INTEL_MEASURE 3388- intel/ds: add new category/stage for draw mesh events 3389- anv: add mesh shading tracepoints 3390- intel/genxml: fix width of 3DSTATE_TASK_CONTROL.MaximumNumberofThreadGroups 3391- anv: set 3DSTATE_[MESH|TASK]_CONTROL.MaximumNumberofThreadGroups 3392- anv: program 3DSTATE_MESH_DISTRIB with the recommended values 3393 3394Marek Olšák (135): 3395 3396- nir: add nir_intrinsic_image_samples_identical 3397- nir: add nir_intrinsic_image_descriptor_amd 3398- nir: add nir_texop_descriptor_amd 3399- nir: add shader_info::uses_resource_info_query for txs, levels, samples, etc. 3400- ac/llvm: implement nir_intrinsic_image_deref_samples_identical 3401- ac/llvm: implement nir_intrinsic_image_deref_descriptor_amd 3402- ac/llvm: implement nir_texop_descriptor_amd 3403- ac/nir: add ac_nir_lower_resinfo 3404- radeonsi,radv: run ac_nir_lower_resinfo 3405- ac/llvm: remove all resinfo code now that it's lowered 3406- radeonsi: implement a non-scaled compute blit+resolve and use it on gfx11 3407- radeonsi: don't assume that TC_ACTION_ENA invalidates L1 cache on gfx9 3408- radeonsi/ci: update failing tests on navi21 3409- radeonsi: fix a regression due to reordering PIPE_SHADER_* 3410- ac/llvm: handle external textures in ac_nir_lower_resinfo 3411- radeonsi: merge both fail paths in si_set_vb_descriptor 3412- radeonsi: add vertex buffers into the BO list in set_vertex_buffers 3413- radeonsi: remove vb_descriptors_gpu_list only used for debugging 3414- radeonsi: remove temporary si_context::vb_descriptor_user_sgprs 3415- radeonsi: use si_cp_dma_prefetch_inline for prefetching VBO descriptors 3416- radeonsi: use do..while loops and other cosmetic changes in display list path 3417- ttn: set the correct sampler declaration type in the presense of txs and lod 3418- gallivm: fix TXQ crash with MSAA samplers 3419- gallivm: initialize texture_unit_offset in emit_size_query 3420- gallium/u_blitter: remove unused code for integer MSAA resolve 3421- gallium/u_blitter: fix the has_txf support condition 3422- gallium/u_blitter: treat Z scaling as scaled blits 3423- gallium/u_blitter: clean up IMMs in util_make_fs_blit_msaa_gen 3424- gallium/u_blitter: make nearest filtering emulation using TXF conformant 3425- gallium/u_blitter: make the bilinear filter for MSAA resolving conformant 3426- glthread: unbind framebuffers in glDeleteFramebuffers 3427- glthread: call _mesa_glthread_DeleteBuffers unconditionally 3428- glthread: track GL_READ_FRAMEBUFFER bindings too 3429- glthread: track glBindFramebufferEXT, not just glBindFramebuffer 3430- glthread: add missing code for GL_ARB_sparse_texture 3431- util/xmlconfig: allow drivers to override option values 3432- radeonsi: rename stop_exec_on_failure -> allow_context_lost 3433- radeonsi: allow lost context with aux_contexts 3434- winsys/amdgpu: terminate process on CS rejection when unrobust context is lost 3435- winsys/amdgpu: flatten huge if and reorder code in amdgpu_cs_submit_ib 3436- winsys/amdgpu: change num_rejected_cs to a bool flag 3437- radeonsi/ci: make the running script easy to use 3438- Revert "mesa: implement a display list / glBitmap texture atlas" 3439- mesa: create glBitmap textures while creating display lists 3440- st/mesa: fix potential use-after-free in draw_bitmap_quad 3441- ci: update pass/fail results for spec\@!opengl 1.0\@gl-1.0-dlist-bitmap 3442- winsys/amdgpu: use cached GTT for command buffers and don't set the 32BIT flag 3443- ci: disable the freedreno farm. 3444- radeonsi/ci: add skips of tests not built on Linux 3445- radeonsi: don't pass num_patches via derived_tess_state, pass it via si_context 3446- radeonsi: make the primitive type constant with tessellation 3447- radeonsi: move \*rs to its only use in si_draw 3448- radeonsi: remove the prim_restart_tri_strips_only option 3449- radeonsi: move fixing ngg_culling into si_update_shaders 3450- radeonsi: unify the logic that sets rast_prim 3451- radeonsi: precompute GS_OUT_PRIM in advance 3452- radeonsi: set GS_STATE_OUTPRIM and PROVOKING_VTX_INDEX only when they change 3453- radeonsi: cosmetic changes in si_emit_rasterizer_prim_state 3454- radeonsi: remove 1 draw packet order codepath, keep the first one 3455- radeonsi: move patch_vertices-related tessellation updates out of si_draw 3456- radeonsi: move set_patch_vertices into si_state_shaders.cpp 3457- ac/gpu_info: handle LPDDR4 and 5 in ac_memory_ops_per_clock 3458- ac/surface: disallow 256KB swizzle modes on gfx11 APUs 3459- radeonsi: set VS_OUT_MISC_SIDE_BUS_ENA=1 for clip distance exports on gfx10.3 3460- mesa: fix glDrawRangeElementsBaseVertex should be compiled into display list 3461- glthread: fix draws not compiled into a display list should generate an error 3462- glthread: generate errors for glGet functions between glBegin/End 3463- glthread: always sync for glShaderSource because invalid params can crash 3464- glthread: fix glGetIntegerv(GL_CLIENT_ACTIVE_TEXTURE) 3465- glthread: work around GL_INVALID_OPERATION with OpenGL ES 1.x draws 3466- frontend/dri: make the backgroundCallable extension optional 3467- frontend/dri: sync glthread when calling from the app side 3468- radeonsi: simplify radeonsi_zerovram implementation 3469- driconf: disable glthread for DeusEx:MD and F1 2015 due to a perf drop 3470- radeonsi/ci: add glx\@glx-swap-event_async to CI failures for glthread 3471- radeonsi: don't flush asynchronously for fence_server_signal 3472- glthread: execute glSignalSemaphoreEXT synchronously 3473- radeonsi: enable glthread by default 3474- gallium/u_threaded: add an option for unsychronized create_fence_fd 3475- mesa: replace ALLOC_PRIMS with persistently-allocated context arrays 3476- mesa: remove _mesa_draw_gallium_fallback by adapting st_feedback_draw_vbo 3477- mesa: remove unused code using _mesa_prim 3478- mesa: clean up st_indirect_draw_vbo interface and callers 3479- mesa: trivial simplifications in _mesa_reference_buffer_object_ 3480- mesa: inline _mesa_reference_buffer_object 3481- mesa: inline _mesa_handle_bind_buffer_gen and correct things 3482- mesa: restructure bind_buffer_object for better performance 3483- mesa: remove unused gl_buffer_object::Data and Written 3484- mesa: reorder and pack gl_buffer_object 3485- vbo: don't call _glapi_set_dispatch for GL_COMPILE_AND_EXECUTE loopback 3486- mesa: don't restore the dispatch at the end of glCallList 3487- mesa: remove unused function _mesa_get_dispatch 3488- mesa: initialize OutsideBeginEnd directly instead of through Exec 3489- glapi: rename 'vtxfmt' to 'beginend' to make it clear 3490- api_hw_select_init_h.py: simplify the code 3491- mesa: remove deprecated comments 3492- mesa: rename GL dispatch initialization functions to make the intent clear 3493- glthread: use GLenum16 for enums, but clamp it to 0xffff to get correct errors 3494- glthread: use 8-bit GLenum for prim modes where it results in better packing 3495- glthread,hud: draw per-frame values and expose the number of batches per frame 3496- glthread: skip glMultMatrixf if it's identity 3497- mesa: use memcmp instead of floating-point comparisons in glMultMatrixf 3498- mesa: make glPopMatrix a no-op if the matrix hasn't changed 3499- radeonsi: rename si_create_multi_fence -> si_alloc_fence 3500- winsys/amdgpu: fix (enable) preemption for chained IBs 3501- radeonsi: fix max_dw computation for CS preambles 3502- gl_marshal.py: remove/simplify parameters 3503- gl_marshal.py: move the unmarshal table into a separately generated file 3504- gl_marshal.py: rework how the marshal dispatch table is initialized 3505- gl_marshal.py: remove the -O1 hack and manual SET_* inlining 3506- gl_marshal.py: inline functions for readability 3507- gl_marshal.py: simplify print_sync_call and rename to print_call 3508- glthread: use a constant expression instead of cmd_size in custom functions 3509- glthread: add more DrawArrays/Elements variants with fewer fields 3510- glapi: remove EXT and ARB suffixes from Draw functions 3511- glthread: demystify Draw function names 3512- glthread: rewrite CallList merging and do it in the app thread 3513- gallium/u_threaded: don't call simplify_draw_info redundantly 3514- gl_marshal.py: C style fixups 3515- glthread: merge and collapse glBindBuffer calls that unbind and then bind 3516- glthread remove the unused \*last pointer from unmarshal functions 3517- glthread: don't sync for glIsEnabled(GL_BLEND, GL_LIGHTING, GL_POLYGON_STIPPLE) 3518- cso: start without u_vbuf by default if it's not always used 3519- cso: constify some parameters to remove typecasts 3520- cso: fix broken optimization for sampler state lookups 3521- cso: don't destroy CSOs that are saved 3522- cso: inline more functions because some parameters like key_size are literals 3523- cso: make memcmp and hash computation use a literal key size for blend state 3524- driconf: add a workaround for a cursor issue with kwin_wayland 3525- driconf: add a workaround for Cossacks 3 3526- nir: add nir_intrinsic_optimization_barrier_vgpr_amd for LLVM 3527- radeonsi: force the MSAA resolve shader to use 1 clause for MSAA loads 3528- radeonsi/gfx11: fix compute scratch buffer - WAVES is always per SE 3529- st/mesa: suppress async glthread flushing for GLX_EXT_texture_from_pixmap 3530- st/vdpau: fix interop with GL 3531 3532Marek Vasut (1): 3533 3534- etnaviv: Use old set of state registers for PE configuration on GC880 3535 3536Mario Kleiner (1): 3537 3538- vulkan/wsi/display: Reset connector state in vkReleaseDisplay(). 3539 3540Mark Collins (8): 3541 3542- tu: Implement VK_EXT_non_seamless_cube_map 3543- tu: Allocate private memory per-device instead of per-pipeline 3544- tu: Clamp priority in DRM submitqueue creation 3545- tu: Expose VK_EXT_tooling_info using common implementation 3546- tu: Retain allocated CSes in tu_autotune_on_submit 3547- tu: Optimize hash_renderpass_instance by removing XXH64_update 3548- tu: Only write \`A6XX_PC_PRIMITIVE_CNTL_0` if changed 3549- tu: Clean up variable usage in \`tu6_draw_common` 3550 3551Mark Janes (6): 3552 3553- Revert "anv: Do not copy garbage to batch_bo" 3554- anv: Track BOs that need a write-combined mapping 3555- anv: Use WC mapped local memory for block pool BO 3556- anv: Allocate buffers with write-combined local memory 3557- anv: compile anv_acceleration_structure.c 3558- anv: work around improper buffer usage in hitman3 3559 3560Martin Krastev (1): 3561 3562- meson: add svga gallium driver to 'auto' group on arm/aarch64 3563 3564Martin Roukala (né Peres) (20): 3565 3566- radv/ci: document a recent regression 3567- radv/ci: move some tests from the vega10 fail to its flake list 3568- radv/ci: add more tests to the flake list of vega10 3569- radv/ci: skip host_write_transfer_src.1048576 on renoir 3570- radv/ci: bump the console activity timeout of VanGogh to 3 minutes 3571- radv/ci: uprev boot2container 3572- radv/ci: disable the POLARIS10 jobs 3573- radv/ci: disable kabini's vkd3d job 3574- radv/ci: run vkcts on the two steam decks in parallel 3575- radv/ci: document an unstable test 3576- radv/ci: move some tests from the renoir fail to its flake list 3577- ci/b2c: add support for the new format of CI_RUNNER_TAGS 3578- radv/ci: bump the job count on navi21 3579- radv/ci: run two jobs in parallel to reduce the execution time 3580- zink: update the RADV expectations 3581- zink: mark a test as a flake in RADV expectations 3582- zink/ci: mark another test as fixed after the frontend caching series 3583- zink/ci: add another subtest to the list of known failures for radv 3584- Revert "glx: Fix drawable refcounting for naked Windows" 3585- Revert "glx/dri: Fix DRI drawable release at MakeCurrent time" 3586 3587Matt Coster (3): 3588 3589- util: Include stddef.h in util/macros.h 3590- pvr: Add pvr_csb_unpack(). 3591- pvr: Fix assignment to ISP back_a.sref 3592 3593Matt Turner (4): 3594 3595- intel/tools: Also look for 'batch' tag 3596- freedreno/ir3: Don't use the broken idiv lowering 3597- freedreno/ir3: Use nir_opt_idiv_const 3598- intel/dev: Set display_ver = 13 on all ADL/RPL/DG2 3599 3600Mauro Rossi (5): 3601 3602- util: glsl2spirv.py: ensure '#endif' is printed in new line 3603- hasvk: fix android build and reported API version 3604- Android.mk: Fix gnu++14 related build failures 3605- r600/sfn: allow building with clang 6 (Android 9) 3606- AOSP: Add intel_hasvk vulkan library suffix 3607 3608Max Kellermann (2): 3609 3610- gallium/u_threaded: add missing reference counts for draw_multi slots 3611- gallium/u_threaded: fix offset calculation for draw_multi slots 3612 3613Michael Skorokhodov (1): 3614 3615- egl: Return EGL_BAD_MATCH for invalid share_list 3616 3617Michael Tang (3): 3618 3619- microsoft/compiler: Add dxil_logger type and parameter to nir_to_dxil 3620- nir: add nir_instr_as_str 3621- microsoft/compiler: Change calls to NIR_INSTR_UNSUPPORTED to log to dxil_logger instead 3622 3623Michel Dänzer (6): 3624 3625- gallivm: Fix LLVMAtomicRMWBinOpFMax spelling 3626- lavapipe: Fix float32_atomic_min_max spelling 3627- winsys/amdgpu: Set RADEON_FLAG_32BIT again 3628- radeonsi/ci: Test piglit quick_shader profile as well 3629- Revert "egl/glx: add fallback for zink loading" 3630- vulkan/wsi/wayland: Set num_modifier_lists = 0 if num_drm_modifiers == 0 3631 3632Mihai Preda (21): 3633 3634- gallivm: LLVM-15 opaque pointers: disable LLVMGetElementType(ptr_type) 3635- gallivm: drop unused "emit_load" in lp_llvm_buffer_member() 3636- gallium: drop unused macros in draw_llvm.h 3637- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_nir_aos.c 3638- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_nir_soa.c 3639- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_jit_types.c 3640- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_format_s3tc.c 3641- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_struct.c 3642- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_format_soa.c 3643- gallium/draw: fix LLVM opaque pointer warnings in store_clip() 3644- gallium/draw: fix LLVM opaque pointer warnings in generate_clipmask() 3645- gallium/draw: fix LLVM opaque pointer warnings in draw_gs_llvm_fetch_input() 3646- gallium/draw: fix LLVM opaque pointer warnings in draw_gs_llvm_end_primitive() 3647- gallium/draw: fix LLVM opaque pointer warnings in draw_gs_llvm_epilogue() 3648- gallium/draw: fix LLVM opaque pointer warnings in draw_tcs_llvm_emit_fetch_input() 3649- gallium/draw: fix LLVM opaque pointer warnings in draw_tcs_llvm_emit_fetch_output() 3650- gallium/draw: fix LLVM opaque pointer warnings in draw_tcs_llvm_emit_store_output() 3651- gallium/draw: fix LLVM opaque pointer warnings in store_aos() 3652- gallivm: LLVM opaque pointer fixes in lp_bld_sample.c 3653- gallivm: LLVM opaque pointers: add lp_build_array_get[_ptr]2 in lp_bld_struct.c 3654- gallivm: LLVM opaque pointers: use lp_build_array_get_ptr2() in lp_bld_nir_soa.c 3655 3656Mike Blumenkrantz (443): 3657 3658- zink: handle mutable swapchain images with dmabuf 3659- zink: store VkFormatFeatureFlags on creation 3660- zink: use stored format features for blit functionality 3661- zink: use modifier feature flags during surface creation when necessary 3662- zink: init cache_put program fence on program creation 3663- zink: always set vertex dynamic states 3664- zink: hook up pipeline_library extensions 3665- zink: change zink_create_gfx_program signature 3666- zink: break out program primtype->idx conversion function 3667- zink: reorganize zink_gfx_pipeline_state a little 3668- zink: set program pipeline array idx earlier in zink_get_gfx_pipeline 3669- zink: add a graphics pipeline library implementation 3670- tgsi_to_nir: fix clipdistance store writemask 3671- tgsi_to_nir: handle compact arrays for clipdistance 3672- zink: force flush clears on fb change if fb geometry changes 3673- zink: force a new framebuffer for clear_depth_stencil if the clear region is big 3674- zink: use maintenance4 3675- zink: allow sw loading with D3D_ALWAYS_SOFTWARE for nine 3676- zink: drop mode_changed check from linewidth/depthbias draw updates 3677- zink: handle unscaled depth bias from nine 3678- zink: handle !half_pixel_center 3679- zink: fix return for PIPE_CAP_DEPTH_CLIP_DISABLE 3680- zink: explicitly set nir cursor in rewrite_and_discard_read 3681- zink: handle invalid gl_TexCoord reads 3682- zink: rewrite all undefined shader reads as 0001 instead of undef 3683- zink: export PIPE_CAP_MULTISAMPLE_Z_RESOLVE 3684- zink: support PIPE_QUERY_TIMESTAMP_DISJOINT 3685- nine: check return on resource_get_handle 3686- zink: add u_queue.h to kopper header 3687- zink: demote dmabuf tiling to linear if modifiers aren't supported 3688- zink: use modifier_aspect to check for modifier plane in zink_resource_get_param 3689- zink: don't add modifiers if EXT_image_drm_format_modifier isn't present 3690- zink: add all format modifiers when adding for dmabuf export 3691- zink: don't fixup sparse texops 3692- zink: remove an old RGBA4 nvidia workaround 3693- vulkan: Update the XML and headers to 1.3.224 3694- radv: fix return type for meta resolve shaders 3695- nir/validate: clamp unsized tex dests to 32bit 3696- nir/validate: add some (light) validation for sampler type matching 3697- radv: remove stupid conditional from draw path 3698- radv: force inlining on another draw function 3699- mesa: fix blending when using luminance/intensity emulation 3700- mesa: require render target bind for A/L/I in format selection 3701- mesa/st: add ignore_srgb_decode param to st_convert_sampler 3702- mesa/st: plumb glsl130_or_later through sampler creation 3703- mesa/st: use plumbed value for srgb_skip_decode in sampler conversion 3704- gallium: convert PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE to enum 3705- gallium: add PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_ALPHA_NOT_W 3706- util/format: handle SRGB in util_format_luminance_to_red() 3707- zink: stop modifying samplerview swizzle on base struct 3708- zink: add some format functions for emulating alpha formats 3709- zink: reject native blits for emulated alpha formats 3710- zink: handle emulated alpha format clears 3711- zink: support emulating alpha formats using new border color quirk 3712- zink: add a driver workaround for broken L4A4 on nvidia 3713- mesa: track format swizzles on teximage objects 3714- mesa/st: use pre-calculated format swizzle for samplerviews 3715- mesa: track gl_texture_object swizzle internally 3716- mesa/st: use tracked samplerview swizzle values 3717- mesa/st: simplify sampler border color swizzling 3718- zink: remove cached descriptor mode 3719- zink: remove checks for lazy descriptor mode + templates ext 3720- zink: remove descriptor vtable 3721- zink: delete a lot of descriptor code 3722- zink: move lazy descriptor functions into zink_descriptors.c 3723- zink: delete unused struct types 3724- zink: move all types/enums into zink_types.h 3725- zink: move lazy descriptor data types into zink_types.h 3726- zink: delete some unused descriptor struct members 3727- zink: remove descriptor dummy_set 3728- zink: merge context descriptor data structs 3729- zink: merge batch descriptor data structs 3730- zink: merge context descriptor data onto context struct 3731- zink: merge program descriptor data onto program struct 3732- zink: merge batch descriptor data onto batch state 3733- zink: reorder some descriptor structs 3734- zink: rename _lazy descriptor stuff 3735- zink: use statically sized array for descriptor allocation 3736- zink: ZINK_SHADER_COUNT -> ZINK_GFX_SHADER_COUNT 3737- zink: explicitly define ZINK_GFX_SHADER_COUNT 3738- zink: PIPE_SHADER_TYPES -> MESA_SHADER_STAGES 3739- zink: remove all pipe_shader_type usage 3740- zink: delete zink_shader_stage() 3741- zink: fix program cache comparisons 3742- zink: require EXT_non_seamless_cube_map for GPL support 3743- zink: (correctly) require extendedDynamicState2PatchControlPoints for GPL 3744- zink: use a maybe more accurate wild guess for pcp-less gpl 3745- zink: require VK_KHR_descriptor_update_template 3746- zink: add an id member for zink_descriptor_pool_key 3747- zink: use a dynarray instead of hash table for tracking pools 3748- zink: use a single allocation for zink_descriptor_layout_key 3749- zink: rework descriptor pool overflow 3750- nir: add uses_bindless flag for shader_info 3751- zink: bail out of dmabuf resource creation when srgb explosion is expected 3752- zink: iterate over all modifiers and nuke the ones that aren't supported 3753- zink: try to reuse swapchain modifier for dmabuf export 3754- ci: remove broken device-select-layer from build 3755- vk/render_pass: don't deref null resolve attachments 3756- Revert "zink: try to reuse swapchain modifier for dmabuf export" 3757- Revert "zink: add all format modifiers when adding for dmabuf export" 3758- zink: deduplicate some program creation code 3759- zink: deduplicate some program destroy code 3760- zink: move gfx program update/creation into zink_program.c 3761- zink: remove cpu conditional render hack 3762- zink: add c++ guards to zink_pipeline.h 3763- zink: move pipeline cache structs to zink_program.h 3764- zink: move gfx pipeline recalc to c++ 3765- zink: move pipeline state comparison to c++ template 3766- zink: use template to eliminate more instructions in pipeline state comparison 3767- zink: remove some gfx pipeline state members 3768- zink: fix shader key struct packing 3769- zink: rework bit iteration in update_gfx_shader_modules to use for loop 3770- zink: simplify shader key comparison by splitting out non-generated tcs 3771- zink: plumb some screen info through gfx shader creation 3772- zink: plumb screen info through shader key comparison 3773- zink: split out nonseamless/inline shader module info calculation 3774- zink: split out gfx shader module creation from cache searching 3775- zink: split out first-time shader module creation 3776- zink: use a list for free batch states 3777- zink: add templates for pipeline stage comparison 3778- zink: use stage comparison template to ignore vertices_per_patch 3779- zink: cache dynamic render vs renderpass pipelines separately 3780- zink: make program cache values more intelligible 3781- zink: change u_foreach_bit to regular for loop in zink_gfx_program_update 3782- zink: use dynarray instead of list for internal shader cache 3783- zink: pass shader stage as param for gfx module update 3784- zink: constify shader module variable 3785- zink: make zink_bo.h compile in c++ 3786- zink: add more c++ guards to headers 3787- zink: remove c++ guard from zink_resource.h 3788- zink: move zink_batch_resource_usage_set to be static inline 3789- zink: add is_buffer param to zink_batch_resource_usage_set 3790- zink: use PIPE_CAP_RGB_OVERRIDE_DST_ALPHA_BLEND 3791- zink: merge all the batch state sets onto the struct 3792- zink: make batch internal api static again 3793- zink: split resource tracking into two sets for buffer/texture 3794- zink: track which stages in gfx programs are using inlined uniforms 3795- zink: add gfx pipeline lookup shortcut 3796- st_pbo/compute: handle download failures with fallback 3797- st_pbo/compute: fix 1D_ARRAY offsets 3798- st_pbo/compute: fix 1D coord dimension by pre-trimming vectors 3799- st_pbo/compute: pre-clamp shader geometry to 16bit 3800- st_pbo/compute: pre-clamp loaded geometry based on coord components 3801- zink: handle nir_intrinsic_sparse_residency_code_and mechanics 3802- zink: ignore nir_texop_lod for tex dest matching 3803- vulkan: Update the XML and headers to 1.3.225 3804- zink: support PIPE_CAP_FBFETCH_COHERENT 3805- tu: fix invalid free on alloc failure 3806- lavapipe: stop tracking descriptor image layout 3807- lavapipe: use cso for compute samplers too 3808- lavapipe: remove lvp_sampler::state 3809- lavapipe: replace lvp_sampler internals with pipe_sampler_state 3810- lavapipe: use pipe_sampler_state directly in descriptor info 3811- lavapipe: delete lvp_buffer::offset 3812- lavapipe: use gallium buffer descriptor types directly 3813- lavapipe: apply VK_WHOLE_SIZE to bufferviews at creation 3814- lavapipe: create gallium descriptor image/sampler view types for views 3815- lavapipe: delete some code 3816- zink: use screen param for zink_descriptor_program_deinit 3817- zink: allow programs to solely manage descriptor deinit 3818- zink: create compute programs from compute shaders directly 3819- zink: initialize cache_get_thread unconditionally 3820- zink: add use_local_size flag to compute programs 3821- zink: don't call util_queue_fence_init in zink_screen_get_pipeline_cache() 3822- zink: add param to allow cache_get jobs to run directly 3823- zink: add a pipeline shortcut for basic compute programs 3824- zink: merge compute program hash table onto program struct 3825- zink: don't add compute base pipeline to hash table 3826- zink: remove use_local_size from compute pipeline state 3827- zink: stop zeroing local size if current compute doesn't use it 3828- zink: use different key comparison for local_size-using compute pipelines 3829- zink: track whether shader has cube samplers 3830- zink: move compute pipeline cache update to caller 3831- zink: add a param to allow zink_screen_update_pipeline_cache to run directly 3832- zink: move compute descriptor binds down in execution 3833- zink: add precompilation for compute shaders 3834- zink: add partial async shader compile support 3835- zink: add locking for descriptor layout caches 3836- zink: add a context pointer to zink_program struct 3837- zink: init fbfetch/bindless ctx stuff on shader create 3838- zink: generate sha1 for compute programs 3839- zink: add function to check whether a shader has cube samplers 3840- zink: defer all compute shader creation 3841- zink: don't emit entrypoints for function temp variables 3842- zink: check the variable mode before taking samplemask path in ntv 3843- zink: explicitly use unsigned types for bit shifts 3844- zink: store VkImageViewUsageCreateInfo for surface creation 3845- zink: only add srgb mutable for images with modifiers 3846- zink: avoid needing mutable for z/s sampling 3847- zink: don't set MUTABLE by default 3848- zink: don't emit illegal interpolation 3849- vulkan: initialize a variable 3850- llvmpipe: don't assume pipe_context is always available in flush_frontbuffer 3851- mesa: add GLSL_SOURCE 3852- zink: add spirv builder function for demote 3853- zink: hook up demote extension 3854- zink: fix sharedmem ops with bit_size!=32 3855- zink: flag all assigned output slots as mapped 3856- zink: add a ci skip for anv 3857- mesa/st: split out builtin shader finish 3858- st_pbo/compute: use split shader finish funcs 3859- mesa/st: move compute pbo shutdown to compute pbo file 3860- gallium: add pipe_screen::driver_thread_add_job 3861- st_pbo/compute: use new shader interface to perform async shader creation 3862- mesa/st: add specialized pbo download shaders 3863- mesa/st: add MESA_COMPUTE_PBO env var 3864- zink: implement async nir creation 3865- lavapipe: VK_EXT_attachment_feedback_loop_layout 3866- lavapipe: ARM/EXT_rasterization_order_attachment_access 3867- docs: add more features 3868- radv: avoid bottlenecking on sequential sparse buffer binds 3869- zink: handle split acquire/present 3870- zink: uncap ssbo size 3871- zink: remove special-casing for 64bit runtime array emission 3872- zink: always set var used by get_ssbo_size to the 32bit var 3873- zink: simplify ntv shader descriptor emission 3874- zink: emit Aliased decoration for aliased bo descriptors 3875- zink: fix atomic ssbo indexing with non-32bit values 3876- zink: handle 64bit float atomics 3877- zink: export PIPE_CAP_IMAGE_ATOMIC_FLOAT_ADD 3878- zink: export PIPE_CAP_SHADER_ATOMIC_INT64 3879- zink: don't add void clears if a full clear already exists 3880- zink: split up get_clear_data() 3881- zink: make void clears more robust 3882- zink: rewrite clears on fb bind if only the format has changed 3883- zink: add have_vulkan13 to device info 3884- zink: add functions for using '2' variants of pipeline barriers 3885- zink: add screen interfaces for pipeline barriers 3886- zink: use screen interfaces for pipeline barriers 3887- zink: fix/relax resolve geometry check 3888- zink: defer acquire semaphore destruction 3889- zink: fix atomic ssbo fadd offsets 3890- st/bitmap: use normalized coords in samplers with lowered rects 3891- st/drawpixels: use normalized coords in samplers with lowered rects 3892- zink: disable GPL when sample shading is in use 3893- anv: force inline more pipe flush functions 3894- ci: add a panfrost flake 3895- ci: add nouveau flakes 3896- egl/glx: add fallback for zink loading 3897- zink: don't call CmdBindVertexBuffers2EXT with no attributes 3898- zink: don't always set VK_DYNAMIC_STATE_VERTEX_INPUT_BINDING_STRIDE_EXT with gpl 3899- zink: clamp z/s formats to aspected format for border colors 3900- zink: move color channel clamping to zink_format.c 3901- zink: add border color clamping when customBorderColorWithoutFormat isn't supported 3902- zink: add srgb border color clamping 3903- zink: add stencil border color clamping 3904- zink: check format compatibility up front when creating framebuffer surface 3905- zink: rework flush_frontbuffer to always flush 3906- zink: defer more semaphore destruction 3907- zink: rework batch tracking for resources 3908- zink bump batch state limit a bit 3909- zink: split out bind_sampler_states hook for drivers without EXT_non_seamless_cube_map 3910- zink: remove batch tracking/usage from view types 3911- zink: remove batch usage from sampler states 3912- zink: delete more sw conditional render handling 3913- zink: delete some descriptor caching cruft 3914- glsl: fix gl_CullDistance xfb linking 3915- zink: handle culldistance xfb outputs like clipdistance 3916- zink: track a mask of gfx stages remaining in gfx programs 3917- zink: track the owner context for gfx programs 3918- zink: use screen params for program reference functions 3919- zink: don't try to delete programs from the cache if stages have been modified 3920- zink: add locking for program caches 3921- zink: use program's context for cache deletion 3922- zink: use screen param for zink_shader_free() 3923- zink: PIPE_CAP_SHAREABLE_SHADERS 3924- radv: ALWAYS_INLINE radv_after_draw 3925- radv: ALWAYS_INLINE radv_flush_descriptors 3926- radv: ALWAYS_INLINE radv_is_streamout_enabled() 3927- radv: store pointer to sgprs for last vertex stage 3928- radv: use direct access to last_vgt_api_stage_locs for sgpr emission 3929- zink: add a bool flag for decomposed vertex elements 3930- zink: check on init whether decomposed attrs are needed 3931- zink: wrap zink_shader_key_base access for nonseamless cubemap use 3932- zink: remove a bunch of casts from shader key inlines 3933- zink: handle inline uniforms during first-time shader generation 3934- zink: copy the non-dirty shader stages when updating gfx program 3935- zink: move to AoS for gfx program shader modules 3936- zink: ALWAYS_INLINE bind_gfx_stage 3937- zink: add some padding to small shader keys for byte-alignment 3938- zink: add 'optimal_keys' handling for shader keys 3939- zink: use optimal_keys to massively reduce size of pipeline cache comparisons 3940- zink: split dirty_shader_stages for gfx and compute 3941- zink: simplify bit tracking in zink_gfx_program_update() 3942- zink: be even more granular with optimal_key program updates 3943- zink: unspaghettify some program update code 3944- lavapipe: set writemask for compute shader buffers 3945- lavapipe: propagate shader access info across libraries 3946- zink: copy optimal key always 3947- zink: add asserts to verify optimal key state 3948- zink: fix some batch tracking leaks 3949- zink: free vertex_state_cache on shutdown 3950- zink: always unset var->data.explicit_xfb_buffer 3951- zink: fix sparse queue creation 3952- zink: fix handling for ssbos that are just runtime arrays 3953- zink: name bo variables using bitsize 3954- zink: avoid overflow when clamping bufferviews 3955- zink: allow reordered clear_buffer calls 3956- zink: remove color_write_missing driver workaround 3957- zink: only emit runtime array for ssbos if present 3958- mesa: set normalized_coords for bindless texture buffer samplers 3959- zink: enable VK_ATTACHMENT_LOAD_OP_DONT_CARE for zs in renderpasses 3960- zink: remove zink_context::new_swapchain 3961- zink: rename fb_binds -> fb_bind_count 3962- zink: reorder zink_resource a little 3963- zink: add a mask of fb attachment idx for resources 3964- zink: use feedback loop layout to correctly handle implicit feedback loops 3965- zink: always set VK_PIPELINE_CREATE_COLOR_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT sometimes 3966- nir/opt_undef: add a pass to clean up 64bit undefs 3967- zink: don't flatten 64bit arrays during rewrite 3968- zink: always run optimize_nir after lower_64bit_vars 3969- zink: split up lower_64bit_vars pass 3970- zink: also lower 64bit function temps 3971- aux/trace: dump line_rectangular member of rasterizer state 3972- aux/trace: dump depth_clamp for rasterizer state 3973- zink: always unflag unordered_write when binding image descriptors 3974- zink: set layouts before possibly reordering image copies 3975- zink: move zink_screen_get_pipeline_cache() calls out to callers 3976- zink: rework ZINK_PIPELINE_LIBRARY_FORCE into ZINK_DEBUG flag 3977- zink: move create_pipeline_lib to zink_program.c 3978- zink: hook up VK_EXT_depth_clamp_zero_one 3979- zink: pass actual screen->threaded to zink_tc_context_unwrap() 3980- vulkan: Update the XML and headers to 1.3.230 3981- lavapipe: dynamic state3 3982- zink: add a draw barrier when unbinding a sampler-bound fb surface 3983- zink: replace mixed_zs with zs feedback loops 3984- glthread: use signed vertex buffer offsets when available, don't require them 3985- glthread: handle DeleteBuffers(n=-1) gracefully 3986- zink: fix/improve handling for multi-component bitfield ops 3987- zink: export PIPE_CAP_ALLOW_MAPPED_BUFFERS_DURING_EXECUTION 3988- zink: export PIPE_CAP_MAP_UNSYNCHRONIZED_THREAD_SAFE 3989- pipe-loader: fix zink driinfo header path 3990- zink: enable glthread by default 3991- zink: call flush_resource when presenting garbage 3992- lavapipe: improve blend handling on pipeline binds 3993- zink: use dynamic state3 rasterization states when available 3994- zink: use more dynamic state3 when available 3995- zink: add tgl ci skip 3996- zink: don't access prog->shaders array during pipeline compile 3997- zink: compress zink_gfx_output_key members 3998- zink: further compress zink_gfx_output_key lookups 3999- zink: set VK_PIPELINE_CREATE_RETAIN_LINK_TIME_OPTIMIZATION_INFO_BIT_EXT 4000- zink: require optimal_keys for GPL 4001- zink: add param to disable optimization when combining pipeline libraries 4002- zink: move binding_map to hw_state part of vertex state 4003- zink: reorder zink_destroy_gfx_program() 4004- zink: don't deref prog->shaders during pipeline construction 4005- zink: use an extra pipeline state bit to track coherent fbfetch usage for gpl outputs 4006- zink: reenable GPL 4007- zink: define zink_pipeline_dynamic_state3 as zink_rasterizer_hw_state 4008- zink: set depth clamp 4009- remove xvmc 4010- delete rbug 4011- delete graw tests 4012- vk/graphics_state: don't set default sample locations if rast samples is dynamic 4013- zink: allow creating full pipelines unoptimized 4014- zink: gfx_pipeline_cache_entry -> zink_gfx_pipeline_cache_entry 4015- zink: change zink_gfx_program::last_pipeline -> struct zink_gfx_pipeline_cache_entry 4016- zink: implement async gfx precompile 4017- zink: add ZINK_DEBUG=shaderdb 4018- zink: move descriptor layout/pool stuff to screen object 4019- zink: add a safety assert in descriptor updates 4020- zink: add some bda handling 4021- zink: handle global and scratch vars 4022- zink: move descriptor function up in file 4023- zink: handle oom better during descriptor updating 4024- zink: fix pool overflow handling on batch reset 4025- zink: rework/improve descriptor pool overflow handling on batch reset 4026- zink: delete view objects when unsetting resource usage in batch reset 4027- zink: prevent ballooning of view object memory 4028- lavapipe: store compiler options to physical device 4029- zink: check core feature for pipeline cache control 4030- zink: don't add other usage bits for transient images 4031- zink: clamp line_stipple_factor to 1 if stipple is disabled 4032- zink: unset rp_changed after initializing renderpass attachments 4033- zink: disable fbfetch when flushing clears 4034- glthread: fix buffer allocation size with non-signed buffer offset path 4035- zink: enable tc cpu storage on some buffers 4036- zink: clamp feedback loop layout to GENERAL if the ext isn't supported 4037- zink: delete some big struct members from zink_context 4038- zink: reorder context init to handle COMPUTE_ONLY 4039- zink: clamp shader bindings for compute descriptors 4040- zink: add ci flake 4041- zink: move spec constant emission to the types/consts block 4042- zink: set default spec constant value to 1 4043- zink: add docs for zink_batch.c 4044- zink: delete/cleanup some descriptor #defines 4045- zink: delete unused descriptor struct field 4046- zink: start on some type docs 4047- zink: fix duplicate VK_DYNAMIC_STATE_LINE_STIPPLE_EXT setting 4048- zink: fix ds3 line stipple enable handling 4049- driconf: add mesa_glthread=false for gfxbench 4050- lavapipe: store binding offset for buffers 4051- lavapipe: fix BufferDeviceAddress usage 4052- zink: fix gpl compile optimization 4053- zink: decrement ZINK_FBFETCH_BINDING 4054- zink: don't flag ubo0 as used in shaders with indirect ubo access 4055- zink: allow direct memory mapping for any COHERENT+CACHED buffer 4056- zink: allocate/place memory using memoryTypeIndex directly 4057- zink: support multiple heaps per memory type 4058- zink: delete unused zink descriptor mode 4059- zink: rename some variables in descriptor code 4060- zink: ZINK_DESCRIPTOR_TYPES -> ZINK_DESCRIPTOR_BASE_TYPES 4061- zink: add more zink_descriptor_type enum values to improve readability 4062- zink: delete unused descriptor function 4063- zink: stop passing pointer to changed_sets to populate_sets() 4064- zink: add some descriptor docs 4065- zink: add some docs for c++ pipeline state functions 4066- zink: simplify some program descriptor helpers 4067- zink: split out uniform decriptor bindings in shader data 4068- zink: flag push state changed in batch changed conditional 4069- zink: rescope/simplify some push descriptor code 4070- zink: elide a buffer samplerview update conditional 4071- zink: handle broken resource mapping deadlocks 4072- zink: simplify swapchain imageview handling 4073- zink: delete stale comment for zink_surface 4074- zink: simplify conditional for surface rebind no-ops 4075- zink: use zink_resource_object::views to defer deferred storage view deletion 4076- zink: delete unused zink_surface member 4077- zink: add some breadcrumbs for VK_EXT_multisampled_render_to_single_sampled 4078- zink: add docs for zink_surface 4079- zink: match bitsizes in bo rewriting 4080- zink: pass KERNEL shaders through successfully 4081- zink: rework sampler emission 4082- zink: pass image type to image emission 4083- zink: add a nir pass for CL image typing and sampler tracking 4084- zink: simplify image deref handling 4085- zink: add some spirv builder handling for sampled image ops 4086- zink: emit explicit samplers in ntv based on sampler_mask 4087- zink: always take the compact descriptor size path with kernel shaders 4088- zink: add handling on the vk side for discrete sampler descriptors 4089- zink: add handling for CL-style discrete shader samplers 4090- zink: rip out unused kernel push constant 4091- zink: fix weird formatting 4092- zink: don't force attachment usage for images when unsupported 4093- zink: always add MUTABLE with EXTENDED_USAGE for image creation 4094- zink: don't double-deref bindless texture arrays in shaders 4095- util/tc: split out flush and deferred flush calls 4096- util/tc: split out dsa and fs state cso handling 4097- util/tc: implement renderpass tracking 4098- zink: add a context flag to indicate when blitter is running 4099- zink: use tc renderpass tracking to optimize renderpasses 4100- zink: enable renderpass optimizing for turnip jobs 4101 4102Ming Li (1): 4103 4104- docs: Add d3d10umd and lavapipe to the doc for the gallium frontends. 4105 4106Mykhailo Skorokhodov (8): 4107 4108- util: Add glsl2spirv.py script 4109- nir: Make lower_double_ops recognize SPIR-V mangling 4110- nir: Add assert in nir_lower_doubles 4111- glsl/meson: Add variable to export float64.glsl 4112- anv/meson: Add float64_spv_h custom target 4113- anv: Add softtp64 workaround 4114- drirc: Add fp64_workaround_enabled option 4115- drirc: Apply fp64_workaround_enabled to DOOM Eternal 4116 4117Nanley Chery (8): 4118 4119- mesa/st: Set PIPE_MAP_DISCARD_RANGE in fallback_copy_image 4120- mesa/st: Add and use st_texture_image_insert_transfer 4121- mesa/st: Add and use st_texture_image_transfer::box 4122- mesa/st: Don't map compressed texture fallbacks for reads 4123- mesa/st: Fix indentation in st_MapTextureImage 4124- mesa/st: Drop st_texture_image_transfer::map 4125- iris: Enable INTEL_MEASURE for compute dispatches on XeHP 4126- iris: Reduce use of RHWO optimization (Wa_1508744258) 4127 4128Nicholas Bishop (2): 4129 4130- crocus: Advertise PIPE_CAP_NATIVE_FENCE_FD 4131- crocus: fix off-by-one error when clearing stale syncobjs 4132 4133Oleksii Bozhenko (2): 4134 4135- vulkan/wsi: Pass wsi_image_create_info into anv_GetPhysicalDeviceImageFormatProperties2 4136- anv: Allow aliasing with modifiers for WSI images 4137 4138Omar Akkila (5): 4139 4140- ci: Merge common builds in debian test jobs 4141- ci: Move crosvm to the base test image 4142- ci: Build crosvm with opaque fd mapping support 4143- ci: Enable virtio-experimental in testing 4144- ci: Add vulkan dEQP tests for venus 4145 4146Paul Gofman (1): 4147 4148- vbo/dlist: keep buffers used in loopback_vertex_list() mapped. 4149 4150Pavel Ondračka (22): 4151 4152- r300: fix variables detection for paired ALU and TEX instructions in different branches 4153- r300: allow constant swizzles with inline constant 4154- r300: fix negate mask computation when merging movs 4155- r300: check for identical saturate mode when merging MOVs 4156- r300: run dataflow optimizations in separate loops 4157- r300: generalize the merge_movs pass 4158- r300: don't merge w channel in fragment shaders 4159- r300: merge MOVs into ADD using the 0 swizzle 4160- r300: merge MOVs with MULs or ADDs in merge channels 4161- r300: Set more shadow sampler lowering in precompiles. 4162- r300: allow presubtract when both ADD sources are negative 4163- r300: run copy propagate once more after merging channels 4164- r300: merge together MOV and MAD instructions 4165- r300: add special path for merging movs with the same source 4166- r300: add some synchronization for KIL 4167- r300: fix register rewrite when converting rbg instructions to alpha 4168- r300: reduce CPU overhead in IF transformation pass 4169- Revert "r300: be less agresive with copy propagate in loops" 4170- r300: fix reader detection with breaks 4171- r300: be more careful when pair merging with presubtract 4172- r300: fix generation of invalid swizzles in regalloc 4173- r300: respect buffer offset in r300_set_constant_buffer 4174 4175Pedro J. Estébanez (5): 4176 4177- nir2dxil: Lower texture projections 4178- nir2dxil: Use native helper lane intrinsic on SM >= 6.6 4179- microsoft/spirv2dxil: Refactor to make SM and validator versions public 4180- microsoft/spirv2dxil: Change \`spirv_to_dxil()` to receive SM and validator versions 4181- microsoft/spirv2dxil: Support pack_32_2x16 and pack_32_4x8 4182 4183Pierre Moreau (11): 4184 4185- nv50,nvc0: Do not resize global residents if unnecessary 4186- nv50: Rename fixups to relocs 4187- nv50: Rename interps to fixups 4188- nv50: Report actual VRAM size 4189- nv50: Disallow allocating more than VRAM size 4190- nv50: Mark RESOURCE_FROM_USER_MEMORY_COMPUTE_ONLY as unsupported 4191- nv50/ir: Handle non-32-bit values when cst folding SPLIT 4192- nv50/peephole: Disallow combining sub 4-byte ld/st for now 4193- nv50/ra: Fix the offset computation for compounds 4194- nv50/ir: Avoid generating splits of splits 4195- nv50/nir: A group barrier is CTA-level not global-level 4196 4197Pierre-Eric Pelloux-Prayer (72): 4198 4199- loader: don't return empty string in loader_get_dri_config_device_id 4200- loader: allow DRI_PRIME=vendor_id:device_id syntax 4201- vulkan/device_select: print the dri_prime warning only if needed 4202- vulkan/device_select: allow DRI_PRIME=vendor_id:device_id 4203- docs: document DRI_PRIME 4204- gallium/dri2: make protected buffer checks opt-in 4205- amdgpu/bo: update uses_secure_bos when importing buffers 4206- radeonsi: allocate BIND_RENDER_TARGET as tmz instead of SCANOUT 4207- radeonsi: print radeon_bo_flag when AMD_DEBUG=vm is used 4208- nir: add a nir_opt_if_options enum 4209- radeonsi/tests: remove left-over debug print 4210- radeonsi/tests: add testing of the khr-single tests 4211- radeonsi/tests: update raven expected results 4212- radeonsi/tests: update navi21 expected results 4213- radeonsi/tests: cosmetic changes 4214- radeonsi: prevent u_blitter recursion in si_update_ps_colorbuf0_slot 4215- radeonsi: use LOAD_CONTEXT_REG_INDEX for VGT_STRMOUT_DRAW_OPAQUE 4216- ac/llvm: add a return value to ac_nir_translate 4217- radeonsi: deal with ac_nir_translate failures 4218- radv: deal with ac_nir_translate failures 4219- radeonsi: use nir_opt_large_constants earlier 4220- radeonsi/sqtt: set stable pstate if possible 4221- mesa: avoid reading back textures from VRAM 4222- mesa: simplify _mesa_texstore_z24_s8 and s8_z24 4223- mesa: remove fallback for GL_DEPTH_STENCIL 4224- radeonsi: invalidate L2 when using dcc stores 4225- egl/wayland: fix glthread crashes 4226- winsys/amdgpu: init 'r' before using it 4227- radeonsi/gfx11: don't set VERTS_PER_SUBGRP to 0 4228- tc: do a GPU->CPU copy to initialize cpu_storage 4229- tc: don't use CPU storage for glBufferData 4230- ac/llvm: remove unused vars hash table 4231- radeonsi: remove USE_LDS_SYMBOLS 4232- radeonsi: use nir_lower_explicit_io to get rid of mem_shared deref 4233- ac/llvm: drop visit_deref 4234- ac/llvm: simplify get_memory_ptr 4235- ac/llvm: add helpers to get pointer types of ac_arg 4236- ac/llvm: remove LLVMBuildGEP usages 4237- ac/llvm: add a ac_llvm_pointer type 4238- ac/llvm: port functions to use ac_llvm_pointer 4239- ac/llvm: switch ac_build_gep_ptr to LLVMBuildGEP2 4240- ac/llvm: switch ac_build_pointer_add to LLVMBuildGEP2 4241- ac/llvm: add type param to ac_build_load_invariant 4242- ac/llvm: add xxxx2 functions to iteratively port to opaque pointers 4243- ac/llvm: store lds as ac_llvm_pointer 4244- ac/llvm: store constant_data and scratch as ac_llvm_pointer 4245- ac/llvm: remove unneeded casts 4246- radv/llvm: use ac_build_gep0_type to get args types 4247- radeonsi: use ac_get_arg_pointee_type to get the right type 4248- radeonsi: use LLVMBuildGEP2 in si_build_gep_i8_var 4249- radeonsi: make ngg_gs_get_vertex_storage return a ac_llvm_pointer 4250- radeonsi: port the remaining code to opaque pointers 4251- ac/llvm: remove gep_2 and others temporary functions 4252- radv/llvm: replace LLVMBuildLoad by LLVMBuildLoad2 4253- ac/llvm: add ac_get_ptr_arg 4254- ac/llvm: pass ac_llvm_ptr instead of separate type/ptr 4255- Revert "ac: use LLVMContextSetOpaquePointers if available" 4256- radeonsi: zero init using the correct components count 4257- radeonsi/sqtt: notify sqtt when we skip a flush 4258- radeonsi/sqtt: re-export shaders in a single bo 4259- radeonsi/sqtt: allow to disable spm counters 4260- radeonsi/tests: handle flakes file for all test suite 4261- radeonsi/tests: simplify the script 4262- radeonsi/tests: update expected results for navi21 4263- radeonsi/tests: update expected results for navi10 4264- radeonsi/tests: update expected results for polaris11 4265- radeonsi: simplify si_prefetch_shaders 4266- radeonsi/sqtt: simplify condition to determine if sqtt is on 4267- radeonsi: store the shader gpu adress in si_shader 4268- radeonsi: handle sqtt pipeline in shader prefetch 4269- radeonsi/gfx11: enable sdma copy DRI_PRIME 4270- radeonsi: update db_eqaa even if msaa is disabled 4271 4272PixelyIon (1): 4273 4274- tu: Make KGSL wait IOCTLs interrupt-safe 4275 4276Qiang Yu (62): 4277 4278- nir/lower_gs_intrinsics: fix primitive count for points 4279- radeonsi: fix tcs_out_lds_offsets arg alignment 4280- winsys/amdgpu: fix non-page-aligned sparse buffer creation 4281- nir,ac/llvm: add nir_intrinsic_load_half_line_width_amd 4282- nir,ac/llvm: add nir_intrinsic_load_viewport_xy_scale_and_offset 4283- ac/nir/ngg,radv: use nir_load_viewport_xy_scale_and_offset 4284- ac/nir/cull: support caller react when primitive is rejected 4285- ac/nir/ngg: support line culling 4286- ac/nir/ngg: fix and simplify gs store output lower 4287- ac/nir/ngg: use same driver location for gs output 4288- ac/nir/ngg: save and restore output bit size for gs 4289- ac/nir/ngg: save and restore position output base for nogs 4290- ac/nir/ngg: add gs culling 4291- ac/nir/ngg: support component position store 4292- ac/nir/ngg: support clipdist culling 4293- ac/llvm: cast tes_u/v_replaced to float 4294- radeonsi: implement nir_intrinsic_load_half_line_width_amd 4295- radeonsi: implement nir_intrinsic_load_viewport_xy_scale_and_offset 4296- radeonsi: implement cull nir intrinsics 4297- ac/llvm: move culling replaced registers to abi 4298- radeonsi: use replaced values when present 4299- radeonsi: vs load input re-calculate vertex index after culling 4300- radeonsi: add and implement load_user_clip_plane abi 4301- nir: fix nir_xfb_info buffer_to_stream length 4302- nir: add nir_intrinsic_load_num_vertices_per_primitive_amd 4303- nir: add nir_intrinsic_load_streamout_buffer_amd 4304- nir,ac/llvm: add nir_intrinsic_load_ordered_id_amd 4305- nir: add nir_intrinsic_ordered_xfb_counter_add_amd 4306- ac/llvm: implement nir_intrinsic_ordered_xfb_counter_add_amd 4307- ac/nir/ngg: cleanup prim id to prepare for streamout 4308- ac/nir/ngg: nogs support streamout 4309- ac/nir/ngg: ngg_gs_load_out_vtx_primflag support stream 4310- ac/nir/ngg: support multi stream per output slot for gs 4311- ac/nir/ngg: support gs streamout 4312- radeonsi: implement nir_intrinsic_load_num_vertices_per_primitive_amd 4313- radeonsi: implement nir_intrinsic_load_streamout_buffer_amd 4314- radeonsi: print out remove_streamout shader key 4315- nir,ac/llvm,radv: add nir_intrinsic_load_provoking_vtx_in_prim_amd 4316- ac/nir/ngg: use nir_load_provoking_vtx_in_prim_amd in ngg lower 4317- radv: count gen_prims_queries_enabled 4318- radv: split active_pipeline_gds_queries 4319- nir,ac/nir/ngg,radv: split shader_query_enabled_amd 4320- ac/nir/ngg,radv: use different counters for shader queries 4321- nir,ac,radv: add primitive count add intrinsics 4322- nir: add nir_load_prim_xfb_query_enabled_amd 4323- ac/nir/ngg: add query param to ac_nir_lower_ngg_gs 4324- ac/nir/ngg: add streamout emitted primitive query 4325- radeonsi: implement llvm abi atomic_add_prim_count 4326- radeonsi: implement nir shader query enabled intrinsics 4327- ac/llvm: get back intrinsics used by NGG 4328- ac/nir/ngg: support user edge flags for ngg lower 4329- ac/nir/ngg: pass primitive_id_location as param for nogs lower 4330- nir: add two amd ngg lds base load intrinsics 4331- ac/nir/ngg,radv: move LDS layout calculation out of nir ngg lowering 4332- ac/nir/ngg: allow passthrough with vs primitive id output 4333- ac/nir/ngg: save and restore no_varying/no_sysval_output 4334- ac/nir/ngg,ac/llvm,aco: save nogs ngg culling one lds dword 4335- ac/nir/ngg: add one odd dword to nogs culling pervertex lds 4336- ac/nir/ngg: fix nogs culling with nuw add 4337- ac/nir/ngg: remove nuw for negative value add 4338- ac/llvm: fix gfx11 fs input load for 16bit varying 4339- ac/llvm: fix 16bit varying llvm compile error 4340 4341Rajnesh Kanwal (48): 4342 4343- pvr: Implement vkResetDescriptorPool API. 4344- pvr: Fix clang-format issue. 4345- pvr: Implement vkGetPhysicalDeviceImageFormatProperties2 API. 4346- pvr: Implement vkCmdDrawIndirect API. 4347- pvr: Implement vkCmdDrawIndexedIndirect API. 4348- pvr: Fix multiple file descriptor leaks. 4349- pvr: Implement vkCreateEvent and vkDestroyEvent APIs. 4350- pvr: Implement vkCmdSetEvent2 API. 4351- pvr: Implement vkCmdResetEvent2 API. 4352- pvr: Implement vkCmdWaitEvents2 API. 4353- pvr: Implement vkSetEvent API. 4354- pvr: Implement vkResetEvent API. 4355- pvr: General reformatting and renaming some fields. 4356- pvr: Replace pvr_renderpass_surface_initop with VkAttachmentLoadOp. 4357- pvr: Add support for generating render pass hw setup data. 4358- pvr: Use ~0 for masks instead of ~0U. 4359- pvr: Add missing output register write in pvr_CreateRenderPass2. 4360- pvr: Rename userpass_spawn to isp_userpass. 4361- pvr: Add const to structs that are not supposed to be modified. 4362- pvr: Add attachment resolve support. 4363- pvr: Add support to create subpass load ops. 4364- pvr: Pass load-op struct instead of idx to support subpass load-ops. 4365- pvr: Implement vkCmdNextSubpass API. 4366- pvr: implement vkcmddispatchindirect api. 4367- pvr: Handle indirect buffer address in pvr_setup_descriptor_mappings. 4368- pvr: Convert attachment indexes to unsigned. 4369- pvr: Remove redundant instruction buffer check. 4370- pvr: Remove redundant assignment error.instruction. 4371- pvr: Remove framebuffer NULL check from pvr_CmdBeginRenderPass2. 4372- pvr: Remove dead code from pvr_perform_start_of_render_clears. 4373- pvr: Zero init pbe_cs_words to avoid passing on garbage. 4374- pvr: Fix Unintentional integer overflow (OVERFLOW_BEFORE_WIDEN) errors. 4375- pvr: Call VALGRIND_FREELIKE_BLOCK before unmapping to avoid use after free. 4376- pvr: Add assert to check for non-NULL binding_desc. 4377- pvr: Implement vkGetEventStatus API. 4378- pvr: Add pvr_csb_emit_link function to support secondary buffer links. 4379- pvr: Fix check in pvr_cmd_uses_deferred_cs_cmds. 4380- pvr: Add support for PVR_CMD_STREAM_TYPE_GRAPHICS_DEFERRED stream. 4381- pvr: Create deferred control stream for secondary command buffers. 4382- pvr: Reserve space for vdm state for deferred secondary buffer. 4383- pvr: Add support to copy a control stream to another control stream. 4384- pvr: Implement vkCmdExecuteCommands API. 4385- pvr: Update buffer type macro names for consistency. 4386- pvr: Fix heap type of availability_buffer allocation. 4387- pvr: Fix allocation size passed in pvr_cmd_buffer_alloc_mem. 4388- pvr: Remove double error reporting. 4389- pvr: Split pds compute shader create and upload code for reuse. 4390- pvr: Add vulkan shader factory headers for Query and clear APIs. 4391 4392Renato Pereyra (1): 4393 4394- venus: Increase vn_relax sleep time 4395 4396Rhys Perry (117): 4397 4398- aco: fix LdsBranchVmemWARHazard with 2+ branch chains 4399- aco: set has_VMEM,has_DS=false after a branch 4400- aco: only add vscnt wait when visiting VMEM/DS 4401- aco: improve VcmpxPermlaneHazard workaround 4402- aco: fix hash statistic 4403- docs/radv: advertise VK_KHR_global_priority 4404- aco: fix consecutive exec writes when finding exec_copy instruction 4405- aco: rename is_cmp to is_fp_cmp 4406- aco: fix assembly of vopc_sdwa writing exec 4407- aco: fix re-write of uses of exec_val's lo/hi half 4408- aco: test branch opcode if removing it in try_optimize_branching_sequence 4409- aco: remove val_and_copy_adjacent 4410- aco: improve vcc check for instructions between exec_val and exec_copy 4411- aco: test for one and_savexec opcode in try_optimize_branching_sequence 4412- aco: fix long-jump version of discard early exit 4413- radv: remove claimed support for sRGB vertex buffer formats 4414- radv: fix 16-bit support in radv_lower_vs_input 4415- aco: fix 16-bit VS inputs 4416- radv/llvm: fix packed VS inputs on GFX6/GFX10+ 4417- aco: don't expand vec3 VS input load to vec4 on GFX6 4418- aco: allow direct_fetch=true for vec4 VS input loads 4419- nir/print: support nir_texop_descriptor_amd 4420- aco: add SCC clobber in build_cube_select 4421- nir/lower_tex: ignore width of cube textures 4422- radv: enable ac_nir_lower_resinfo for ACO 4423- aco: remove dead code for querying image size/samples/levels 4424- util/ralloc: add HEADER_ALIGN macro 4425- util: add freelist allocator with mark/sweep 4426- nir/serialize: remove unused parameter from read_src() 4427- Revert "nir: Drop the unused instr arg for src/dest copy functions." 4428- nir: adjust nir_src_copy signature to take a nir_instr * 4429- nir: use a GC context for instructions 4430- util/format,vulkan/format: update p_format for Vulkan vertex buffers 4431- ac: add ac_vtx_format_info 4432- radv: add radv_is_vertex_buffer_format_supported 4433- radv,aco: use pipe_format for static vertex input state 4434- radv,aco: use pipe_format for dynamic vertex input state 4435- radv: remove radv_translate_vertex_format 4436- radv: enable R8G8B8, B8G8R8 and R16G16B16 vertex formats 4437- radv,aco: implement 64-bit vertex inputs 4438- radv: enable 64-bit vertex formats 4439- aco/ra: handle empty def_reg interval in get_regs_for_copies 4440- aco/ra: remove bounds parameter from get_regs_for_copies() 4441- aco/ra: rework fixed operands 4442- nir/algebraic: optimize bits=umin(bits, 32-(offset&0x1f)) 4443- nir/algebraic: optimize fabs(bcsel(b, fneg(a), a)) 4444- radv: disable EXT_vertex_input_dynamic_state when using DGC 4445- radv: shrink zero-initialization in vkCmdSetVertexInputEXT 4446- radv: use nir_ubfe_imm 4447- radv: remove unnecessary .align_mul=4 4448- radv: fix dynamic RT stack size with VGPR spilling 4449- aco: DCE ra_ctx::defs_done 4450- aco: rename Interp_instruction to VINTRP_instruction 4451- aco: add reg() helper to assembler 4452- aco: fix assembly of MUBUF-to-LDS loads 4453- aco: add GFX11 opcode numbers 4454- aco/gfx11: don't use more than 1 NSA dword 4455- aco: update assembler for GFX11 4456- aco: limit GFX11 to 128 VGPRs for now 4457- aco: add LDSDIR instruction format 4458- aco: add VINTERP instruction format 4459- aco: omit read-only memory_sync_info when printing 4460- aco/tests: add GFX11 assembly tests 4461- aco: mostly implement FS input loads on GFX11 4462- aco: fix VMEMtoScalarWriteHazard s_waitcnt mitigation 4463- aco: improve VMEMtoScalarWriteHazard s_waitcnt mitigation 4464- aco: use some helpers in GFX10 hazard workarounds 4465- aco: improve printing of sgpr_null 4466- aco: improve printing of s_waitcnt_depctr 4467- aco: add VMEMtoScalarWriteHazard tests 4468- aco/gfx11: swap ds_cmpst_* data operands 4469- aco: improve wait_imm unpack 4470- aco/gfx11: fix s_waitcnt printing 4471- aco: update sendmsg enum from LLVM 4472- aco/gfx11: deallocate VGPRs at the end of the shader 4473- aco/gfx11: update form_hard_clauses 4474- aco: limit hard clauses to 63 instructions 4475- aco: fix assembler.gfx11.vinterp test 4476- aco: add search_backwards helper 4477- aco/gfx11: workaround VcmpxPermlaneHazard 4478- aco/gfx11: workaround LdsDirectVALUHazard 4479- aco/gfx11: workaround LdsDirectVMEMHazard 4480- aco/gfx11: workaround VALUTransUseHazard 4481- aco/gfx11: workaround VALUPartialForwardingHazard 4482- aco/gfx11: workaround VALUMaskWriteHazard 4483- aco: add ACO_DEBUG=force-waitdeps 4484- nir/algebraic: optimize b<<a + c<<a 4485- ac/llvm: fix mixing non-uniform/uniform sampler/texture descriptors 4486- ac/llvm: ignore g16 if the image opcode doesn't use derivatives 4487- aco/gfx11: optimize LS/HS load_local_invocation_index 4488- aco: swap v_perm_b32 operands 4489- radv,nir: add intrinsics for streamout and GS copy shaders 4490- ac/nir: add legacy streamout and GS copy shader helpers 4491- ac/nir: micro-optimize boolean expression 4492- ac/nir: add ac_nir_lower_ngg_options 4493- aco: add storage_gds 4494- aco: insert waitcnt before/after ds_ordered_count 4495- nir,ac/nir,aco,radv: replace has_input_*_amd with more general intrinsics 4496- aco: don't split swizzled store_buffer_amd on GFX9+ 4497- ac/nir: lower gfx11 vertex parameter exports 4498- radv/gfx11: don't create fmask copy/expand pipelines 4499- radv/gfx11: don't create texop_samples_identical 4500- radv,aco: don't use lower_to_fragment_fetch_amd on GFX11+ 4501- aco: fix typo in branch lowering 4502- aco/gfx11: perform FS input loads in WQM 4503- aco/gfx11: fix FS input loads in quad-divergent control flow 4504- radv/llvm: fix GS shaders on GFX8/9 4505- docs: update new_features.txt for non-extension additions to RADV 4506- radv: increase gfx1100/gfx1101 physical vgprs 4507- ac/gpu_info: increase gfx1100/gfx1101 physical vgprs 4508- aco/gfx11: increase gfx1100/gfx1101 physical vgprs 4509- radeonsi: increase gfx1100/gfx1101 physical vgprs 4510- nir/lower_bit_size: lower uadd_carry 4511- radv: lower 8/16-bit uadd_carry/usub_borrow 4512- radv: disable MRT compaction with dual-source blending 4513- aco: ensure MRT0 is written with dual source blending 4514- ac/nir: mask shift operands 4515 4516Riteo (1): 4517 4518- vulkan/device_select_wayland: fix a memory leak with DRM device handling 4519 4520Rob Clark (106): 4521 4522- gbm: Add USE_FRONT_RENDERING flag 4523- freedreno: Disable UBWC for frontbuffer rendering 4524- freedreno/drm: Fix potential bo cache vs export crash 4525- freedreno/gmem: Fix col0 calc 4526- freedreno: Drop fixed upper bound on # of tiles 4527- freedreno/drm: Add FD_VA_SIZE param 4528- freedreno: Cap reported video memory by VA size 4529- freedreno/drm: Avoid lock in fd_pipe_del() 4530- egl: atrace support 4531- freedreno/drm/virtio: Add some atrace 4532- util: Fix vs2019 build error 4533- egl: Drop _eglSetFuncName() return 4534- egl: Drop unused _EGL_FUNC_START() arg 4535- egl: Remove redundant fxn param 4536- egl: Convert to simple_mtx_t 4537- egl: Add some locking asserts 4538- egl/dri2: Indenting fix 4539- egl/dri2: Error path cleanups 4540- llvmpipe: Add some missing locking 4541- freedreno: Extract helper to check for batch dependency 4542- freedreno: Simplify add_dep logic 4543- freedreno: Avoid deferred-flush dependency loops 4544- Revert "ci: disable the freedreno farm." 4545- freedreno/a6xx: Random indent fixes 4546- freedreno/drm: Let backend hint about upload vs map 4547- freedreno/drm: Add offset param for fd_bo_upload() 4548- freedreno: Add support for upload transfers 4549- Revert "ci: disable the freedreno farm." 4550- egl/wgl: Make ref_count atomic 4551- egl/dri2: Make ref_count atomic 4552- egl/dri2: Add display lock 4553- egl: Make RefCount atomic 4554- egl: Introduce rwlock to protect eglTerminate() 4555- egl: Relax locking 4556- freedreno: Misc indent fix 4557- freedreno: nr_rings -> nr_priorities 4558- freedreno: Remap high/norm/low priorities 4559- freedreno/virtio: Don't upload if we have valid range 4560- freedreno/drm/virtio: Don't prefer upload for mapped buffers 4561- freedreno/drm/virtio: Handle read after upload 4562- freedreno: We really don't need aligned vbo's 4563- freedreno: Update github wiki links 4564- freedreno/ir3: GC unused macro 4565- freedreno/drm: Inline fd_bo_get_iova() 4566- freedreno/drm: Add fd_ringbuffer_attach_bo() 4567- freedreno/a6xx: Drop "hardpin" support 4568- freedreno/drm: Simplify emit_reloc_common 4569- freedreno/a6xx: Skip IBO state when unused 4570- freedreno/a6xx: Pre-calculate user const state size 4571- freedreno: Drop unused arg 4572- freedreno/a6xx: Move user const upload to bind 4573- freedreno/a6xx: Simplify fd6_build_user_consts() 4574- freedreno: Handle xfb invalidation 4575- freedreno: Use TC cpu-storage to shadow buffers 4576- freedreno/drm: Don't call kernel with no ops 4577- freedreno: Add perf-debug trace 4578- isaspec: Fix out of date comment 4579- gallium/u_threaded: Add some atrace/perfetto 4580- gallium/u_threaded: Add optional call-id tracing 4581- freedreno: Fix fence unref race 4582- docs: Update freedreno features 4583- mesa/st: ARB_vertex_attrib_64bit depend on glsl>=410 4584- freedreno/a6xx: Remove unused param 4585- freedreno/ir3: Fix clipvertex with GS+tess 4586- freedreno/ir3: Lower all the 64b 4587- freedreno/a6xx: Fix MAX_GEOMETRY_OUTPUT_VERTICES cap 4588- freedreno/a6xx: Fix primitives-generated query 4589- freedreno/a6xx: Update caps 4590- freedreno/a6xx: Drop max_scissor tracking 4591- Revert "egl: Factor some common terminate cleanup up to common code" 4592- nir: Add helper to create passthrough TCS shader 4593- radeonsi: Use nir passthrough TCS helper 4594- freedreno: Dirty the TCS state if patch_vertices change 4595- freedreno/ir3: Passthrough TCS support 4596- freedreno: Add tess_state support 4597- freedreno: Passthrough TCS support 4598- v3d/ci: Add a flake 4599- Revert "mesa/st: ARB_vertex_attrib_64bit depend on glsl>=410" 4600- freedreno: Don't advertise TGSI support for GS/tess 4601- freedreno/a6xx: Move layer_zero handling to interp state 4602- freedreno/ir3: Drop unused view_zero/layer_zero lowering 4603- freedreno/a6xx: Fix buffer size clamping 4604- freedreno/a6xx: Support AMD_vertex_shader_layer 4605- freedreno: support multiple viewports 4606- freedreno: Massage scissor state at bind time 4607- freedreno: Move guardband calc to bind time 4608- freedreno/a6xx: Support GL_ARB_viewport_array + gl43 4609- freedreno/a6xx: Enable GL_ARB_texture_mirror_clamp_to_edge 4610- freedreno: Enable GL_ARB_enhanced_layouts 4611- freedreno/ci: Small bit of xfails cleanup 4612- docs: Update features.txt 4613- nir: Add way to create passthrough TCS without VS nir 4614- freedreno: Add ARB_gl_spirv support 4615- freedreno/a6xx: Use box to pass 2d clear params 4616- freedreno/a6xx: Implement ARB_clear_texture 4617- freedreno/a3xx+: Enable ARB_shader_texture_image_samples 4618- freedreno/a3xx+: Enable ARB_derivative_control 4619- freedreno/ir3: Unconditionally lower subgroup ops 4620- freedreno/a6xx: Enable ARB_shader_group_vote 4621- freedreno/a6xx: Fix occlusion queries 4622- freedreno/a6xx: Remove unused field 4623- freedreno/a5xx+a6xx: Add base class for query samples 4624- freedreno/batch: Add a global epilogue 4625- freedreno: Core ARB_query_buffer_object support 4626- freedreno/a6xx: ARB_query_buffer_object support 4627- freedreno/a6xx: Mark gl45 supported 4628 4629Rohan Garg (3): 4630 4631- anv, iris: Disable pre fetching the binding table entries on DG2 4632- intel/compiler: Support 16 bit float ops 4633- anv: Enable 16 bit float ops on devices that have a LSC 4634 4635Roland Scheidegger (4): 4636 4637- lavapipe: initialize index_bias to zero for non-indexed draws 4638- gallivm: always do per-pixel LOD for cube maps 4639- llvmpipe: fix front/back face system value in the fs 4640- gallivm: remove dead cube map lod calculation code 4641 4642Roman Gilg (2): 4643 4644- loader/dri3: Simplify buffer age query 4645- loader/dri3: Add DRI performance option to wait for next buffer on swap 4646 4647Roman Stratiienko (6): 4648 4649- v3dv: Enable sync_fd importing/exporting on Android 4650- v3dv: Limit API version to v1.0 for Android 4651- v3dv: Avoid swapchain_info handling on Android 4652- lima: Ensure jobs initialized before calling lima_job_fini() 4653- Android: Use libgbm_mesa name for SDK30+ 4654- meson: Enable system_has_kms_drm for android 4655 4656Ruijing Dong (30): 4657 4658- frontends/va: improve enc quality interface change 4659- radeonsi/vcn: add enc quality bits interface 4660- radeonsi/vcn: dpb change for supporting pre-encoding 4661- radeonsi/vcn: support encoding preset modes 4662- radeonsi/vcn: support VBAQ modes 4663- radeonsi/vcn: remove rate control double begin IBs 4664- radeonsi/vcn: add AUD syntax to h264 encoding 4665- frontend/va: remove some unnecessary code 4666- frontends/va: change getEncParamPreset location 4667- frontends/va: add max_frame_size into rate control 4668- frontends/va: add HRD, filler data enable and etc 4669- frontends/va: enable sao in hevc encoding 4670- radeon/vcn: process invalid frame rate in encoding 4671- radeonsi/vcn: support async 4672- radeonsi/vce: using fixed value for vce 52 4673- gallium/pipe/video: group avc encoder seq structure 4674- radeonsi/vce: apply avc seq parameters 4675- frontends/omx: apply avc seq parameters 4676- frontends/va: apply avc seq parameters 4677- radeonsi/vcn: use avc seq paramters 4678- frontends/va: add aspect ratio and vui timing info 4679- radeonsi/vcn: support aspect ratio and vui timing info 4680- d3d12: Updating refactored fields in pipe_h264_enc_picture_desc 4681- frontends/va: fix av1 decoding image distortion issue 4682- gallium/pipe/video: add data struct for dbk in avc encoding. 4683- frontends/va: add h264 dbk enc parameters 4684- radeonsi/vcn: enable dbk in avc encoding 4685- radeonsi/vcn: enable multi-slice encoding 4686- frontends/va: fixed an av1 decoding image corruption issue. 4687- frontends/va: fixed an av1 dec image corruption. 4688 4689Ryan Houdek (1): 4690 4691- vulkan/wsi: Add dep_libudev to idep dependencies 4692 4693Sagar Ghuge (5): 4694 4695- intel/isl: Setting L1 caching policy to Write-back mode 4696- anv: Specify Untyped L1 cache policy for stateless accesses 4697- iris: Specify Untyped L1 cache policy for stateless accesses 4698- anv: Handle bits to flush data-port's Untyped L1 data cache 4699- iris: Handle new untyped dataport cache flush PIPE_CONTROL field 4700 4701Sajeesh Sidharthan (1): 4702 4703- radeonsi/vcn: set current pic index correctly 4704 4705Sami Kyöstilä (1): 4706 4707- intel/ds: Update to Perfetto API v28.0 4708 4709Samuel Pitoiset (329): 4710 4711- vulkan: add support for VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT 4712- radv: implement VK_EXT_attachment_feedback_loop_layout 4713- radv: ignore out-of-order rasterization if stencil write mask is dynamic 4714- radv: rework shaders ref counting 4715- radv: use ref counting for VS prologs and PS epilogs 4716- radv: remove RADV_META_SAVE_SAMPLE_LOCATIONS 4717- radv: simplify saving/restoring all dynamic states 4718- radv: remove unused states parameter from some radv_emit_XXX() helpers 4719- radv: simplify radv_bind_dynamic_state() slightly 4720- radv: fix gathering XFB info if there is dead outputs 4721- radv: respect the render area for depth/stencil resolves 4722- radv: fix cleaning the meta query state if an error occured 4723- radv: properly ignore PSO states that are dynamic 4724- radv: remove one useless check about line width and dynamic state 4725- radv: simplify dynamic states uses when determining NGG culling settings 4726- radv: do not set registers for PSO states that are dynamic 4727- radv: remove radv_invariant_geom workarounds for DX games 4728- radv: explain why radv_invariant_geom is still needed for SotTR DX 4729- radv: remove unecessary radv_pipeline::uses_dynamic_stride 4730- radv: copy line stipple factor/pattern only if the state isn't dynamic 4731- radv: copy the sample locations only if the state isn't dynamic 4732- zink: stop skipping bunch of GLESx tests 4733- vulkan: initialize discardRectangleCount even if the state is dynamic 4734- vulkan: fix initializing the primitive topology when it's dynamic 4735- vulkan: add attachment sample count info AMD support 4736- radv: store the provoking vertex mode in the graphics pipeline info 4737- radv: simplify the depth clip enable/disable logic 4738- radv: do not translate polygon mode during gfx info initialization 4739- radv: do not translate logic op during gfx info initialization 4740- radv: do not translate blend op/factor during gfx info initialization 4741- radv: do not translate primitive topology during gfx info initialization 4742- radv: slightly change the color write enable mask 4743- radv: call vk_graphics_pipeline_state_fill() when initializing gfx pipelines 4744- radv: convert radv_tessellation_info to vk_tessellation_state 4745- radv: convert radv_viewport_info to vk_viewpoert_state 4746- radv: convert radv_discard_rectangle_info to vk_discard_rectangles_state 4747- radv: convert radv_rasterization_info to vk_rasterization_state 4748- radv: convert radv_depth_stencil_info to vk_depth_stencil_state 4749- radv: convert radv_fragment_shading_rate_info to vk_fragment_shading_rate_state 4750- radv: convert radv_input_assembly_info to vk_input_assembly_state 4751- radv: convert radv_multisample_info to vk_multisample_state 4752- radv: convert radv_color_blend_info to vk_color_blend_state 4753- radv: convert radv_rendering_info to vk_render_pass_state 4754- radv: convert radv_vertex_input_info to vk_vertex_input_state 4755- radv: remove radv_graphics_pipeline_info completely 4756- radv: advertise VK_KHR_global_priority 4757- radv: remove unused pCreateInfo from radv_pipeline_init_blend_state() 4758- radv: add radv_graphics_pipeline_import_info() helper 4759- radv: introduce a new pipeline type for graphics libs 4760- radv: store vk_graphics_pipeline_state to radv_graphics_pipeline 4761- radv: prevent accessing rasterization state when it's NULL 4762- radv: add a very initial implementation of VK_EXT_graphics_pipeline_library 4763- radv: do not compress DCC in presence of render loops on GFX10+ 4764- radv: remove useless check when creating the noop FS 4765- radv: copy linked I/O info for merged stages in radv_fill_shader_info 4766- radv: copy the whole radv_shader_info struct for merged stages 4767- radv: add a radv_postprocess_nir() helper 4768- radv: move lowering FS intrinsics to radv_postprocess_nir() 4769- radv: move dumping NIR shaders to the previous similar loop 4770- radv: remove unused parameter in radv_lower_io_to_mem() 4771- radv: use NIR_PASS with radv_export_implicit_primitive_id 4772- radv: remove useless parameter from radv_link_shaders() 4773- radv: add radv_remove_point_size() linking helper 4774- radv: rework linking shaders in NIR 4775- radv: move assigning driver locations to the new linking helpers 4776- radv: move lowering patch vertices to radv_pipeline_link_tcs() 4777- radv: move lowering the view index to radv_pipeline_link_shaders() 4778- radv: remove redundant assignment of tcs.tcs_vertices_out 4779- radv: remove unused tcs_vertices_out assignment for VS 4780- radv: remove unused num_tess_patches assignment for VS 4781- radv: merge gather_tess_info() with radv_fill_shader_info() 4782- radv: stop setting CB_COLOR_CONTROL.ROP3 from the pipeline 4783- radv: stop clearing bitfields for registers that are emitted dynamically 4784- radv: cleanup dynamic states in radv_emit_graphics_pipeline() 4785- radv: stop emitting RMW context registers for updating sample locations 4786- radv: re-emit viewports if negative one to one or depth clamp mode changed 4787- radv: remove bogus assertion about independent set layouts with GPL 4788- radv: fix missing initialization of the pipeline layout when creating a lib 4789- radv: destroy the pipeline layout if creating a library failed 4790- aco: fix wrong size for 1D images and A16 on GFX9 4791- ac: constify ac_compute_cs_workgroup_size() 4792- radv: stop duplicating radv_es_output_info 4793- radv: use esgs_itemsize when calling ac_nir_lower_es_outputs_to_mem 4794- radv: compute the ESGS itemsize outside of radv_nir_shader_info_pass() 4795- radv: remove dead code about task ring when binding a compute pipeline 4796- radv: replace cs.uses_task_rings by ms.has_task 4797- radv/llvm: remove unused parameter in handle_vs_outputs_post() 4798- radv: stop duplicating radv_vs_output_info 4799- radv: fill radv_vs_output_info unconditionally for vertex related stages 4800- radv: remove redundant VS output parameter assignments 4801- radv: add a helper that links shader info between stages 4802- radv: link primitive ID/clip distance shader info from the new helper 4803- radv: stop gathering info for FS before other stages 4804- radv: move filling cs.block_size 4805- radv: add gather_shader_info_fs() helper 4806- radv: add gather_shader_info_mesh() helper 4807- radv: add gather_shader_info_gs() helper 4808- radv: add gather_shader_info_tes() helper 4809- radv: add gather_shader_info_tcs() helper 4810- radv: add gather_shader_info_vs() helper 4811- radv: add gather_shader_info_cs() helper 4812- radv: add gather_shader_info_task() helper 4813- radv: move more CS info to gather_shader_info_cs() 4814- radv: move more MS info to gather_shader_info_ms() 4815- radv: move determining NGG shader info to radv_fill_shader_info() 4816- radv: store binning settings into the physical device 4817- radv: make sure to emit BREAK_BATCH when color write enable is dynamic 4818- radv: remove useless parameter in si_write_scissors() 4819- radv: split emitting guardband into a separate helper 4820- radv: rework dirtying scissors when the line width changes 4821- radv: emit the guardband state separately from the scissor state 4822- radv: allow null descriptor set layouts in pipeline layouts 4823- radv: determine if the topology uses adjacency from the shaders on GFX9+ 4824- radv: fix bogus assertion with RADV_FORCE_VRS 4825- aco: remove unused isel_context::tcs_num_patches 4826- radv: stop setting as_ls for the TCS stage 4827- radv: stop setting as_es for the GS stage 4828- radv: stop setting num_tess_patches for the GS stage 4829- radv: stop setting esgs_itemsize to the GS stage 4830- radv: cleanup helpers that compute NGG info and GS info on GFX9+ 4831- radv: move computing wave_size/ballot_bit_size to the shader info pass 4832- radv: move computing NGG info and GS info to radv_nir_shader_info_link() 4833- radv: refactor radv_nir_shader_info_link() 4834- radv: move determining other NGG settings in radv_link_shaders_info() 4835- radv: simplify determining if NGG passthrough can be enabled 4836- radv: cleanup computing the workgroup size for all stages 4837- radv: add and use a helper that merges shader info for merged stages on GFX9+ 4838- radv: remove unused parameters in radv_fill_shader_info() 4839- radv: determine as_ls/as_es in radv_link_shaders_info() 4840- radv: constify radv_shader_info where it's missing 4841- radeonsi/ci: trigger radeonsi-raven-skqp for any RADV changes 4842- radv: gather MRTs that are written by the fragment shader 4843- radv: compact MRTs to save PS export memory space 4844- radv: only expose sparseResidencyImage3D on GFX9+ 4845- radv: emit SQTT markers for RT related commands 4846- radv: capture RT pipelines from the SQTT layer 4847- radv: fix reporting RT shaders in RGP 4848- radv: do not lower color exports for FS that need an epilog 4849- radv: add radv_remove_color_exports() helper 4850- radv: do not try to remove color exports for FS that need an epilog 4851- radv: keep track of the code size for VS prologs and PS epilogs 4852- radv: create a PS epilog from a library without the main FS 4853- radv: add support for emitting and prefetching PS epilogs 4854- radv: import PS epilog from libraries if present 4855- radv: fix hw remapping of MRT holes with color attachments without export 4856- radv: fix pipelineStageCreationFeedbackCount when it's 0 4857- radv: add a pointer to radv_shader_binary in radv_shader 4858- radv/ci: re-enable dEQP-VK.renderpass2.depth_stencil_resolve.*_samplemask 4859- radv: store the VS prologs/PS epilogs VA at upload time 4860- radv: split upload_shader_part() in two parts 4861- radv: store the binary to radv_shader_part 4862- radv: upload the PS epilog in the existing pipeline BO 4863- radv: reword a comment about dynamic states when rasterization is disabled 4864- radv: emit the LDS size for TCS directly from the pipeline on GFX9+ 4865- radv: add radv_pipeline_key::dynamic_patch_control_points 4866- radv: skip computing some tess info when patch control points is dynamic 4867- radv: set workgroup_size to 256 when patch control points is dynamic 4868- aco: prevent a division by zero when patch control points is dynamic 4869- radv: add shader arguments for dynamic patch control points 4870- radv: add ABI lowering support for dynamic patch control points 4871- radv: pass the number of patch control points to si_get_ia_multi_vgt_param() 4872- radv: move emitting PRIMGROUP_SIZE for <= GFX9 from the cmdbuf 4873- radv: move emitting GE_CNTL for non-NGG pipelines from the cmdbuf 4874- radv: implement dynamic patch control points 4875- radv: advertise extendedDynamicState2PatchControlPoints 4876- radv: stop setting redundant viewport/scissor for internal operations 4877- radv: do not bind NULL graphics pipeline when restoring the meta state 4878- radv: reset the compute pipeline when the saved one was NULL 4879- radv: stop dirtying the graphics pipeline when restoring it 4880- radv: stop checking for NULL pipelines in radv_CmdBindPipeline() 4881- radv/ci: cleanup lists of failures/flakes 4882- radv: disable VK_EXT_graphics_pipeline_library with LLVM 4883- radv: rename radv_pipeline_key::vs::dynamic_vs_input to has_prolog 4884- radv: prepare the VS input state for prologs created with GPL 4885- radv: bind the VS input state for prologs created with GPL 4886- radv: enable the VS prologs cache if graphicsPipelineLibrary is enabled 4887- radv: do not remove PSIZ for VS when the topology is unknown 4888- Revert "radv: upload the PS epilog in the existing pipeline BO" 4889- nir: add nir_intrinsic_load_sample_positions_amd 4890- radv: add nir_intrinsic_load_sample_positions_amd in the ABI 4891- radv: run nir_opt_cse before lowering FS intrinsics 4892- radv,aco: lower barycentric_at_sample in NIR 4893- radv: do not link shaders when the next stage is unknown 4894- radv: allow to build the main FS in a graphics pipeline library 4895- radv,aco: do not compact MRTs if the pipeline uses a PS epilog 4896- radv: rename radv_thread_trace_set_pstate() to radv_device_set_pstate() 4897- radv: acquire pstate on-demand when capturing with RGP 4898- radv: constify radv_lookup_user_sgpr() 4899- radv: add radv_pipeline_key::dynamic_rasterization_samples 4900- radv: declare shader arguments for the number of samples for FS 4901- nir: add nir_load_rasterization_samples_amd 4902- radv: lower nir_load_rasterization_samples_amd in ABI 4903- nir,radv: pass the number of samples to load_sample_positions_amd 4904- radv: add barycentric_at_sample lowering when the number of samples is dynamic 4905- radv: emit the rasterization samples through an user SGPR if needed 4906- radv: make sure to initialize wd_switch_on_eop before checking its value 4907- radv: pass a VkSampler to write_sampler_descriptor() 4908- radv: remove unnecessary radv_buffer_view::vk_format 4909- radv: rework and rename radv_make_buffer_descriptor() 4910- zink/ci: skip ext_external_objects from Piglit with RADV 4911- zink/ci: skip arb_shader_clock with RADV 4912- zink/ci: skip GLX tests with RADV 4913- zink/ci: update list of expected failures with RADV on NAVI10 4914- zink/ci: update list of flakes for Piglit with RADV on NAVI10 4915- radv/ci: add piglit testing with Zink on NAVI10 4916- radv: do not free the GS copy shader binary if created from a library 4917- radv: import the GS copy shader from a library if present 4918- radv: determine the last VGT api stage from the active_stages bitfield 4919- radv: allow to build the pre-rasterization stages in a library 4920- radv: remove useless gfx10_ngg_info::enable_vertex_grouping 4921- radv: determine the last VGT api stage also for GPL 4922- radv: use the maximum number of vertices per primitives for NGG with GPL 4923- radv: enable NGG culling unconditionally for GPL but disable it dynamically 4924- radv: rework multiview lowering in NIR slightly 4925- radv: fix handling ViewportIndex with GPL 4926- radv: fix handling primitive ID and clip/cull distances with GPL 4927- radv: fix handling multiview with GPL 4928- radv: fix emitting RBPLUS state when MRTs are compacted 4929- radv: remove NULL checks when binding a graphics pipeline 4930- radv: fix initializing the attribute mask for VS input state and GPL 4931- radv: fix initializing the blend state for NULL render pass with GPL 4932- radv: ignore shader stages that don't need to be imported with GPL 4933- radv: add a new drirc option to enable a unified heap on APUs 4934- radv: enable the unified heap on APUs for Red Dead Redemption 2 4935- radv: fix and rework shaders upload with GPL 4936- radv: always re-emit patch control points when a pipeline with tess is bound 4937- radv: move the disk cache to the vk_physical_device struct 4938- radv: use SHA1_DIGEST_LENGTH in the pipeline cache 4939- radv: do not create a noop FS when the FS is imported from a library 4940- radv: discard the PS epilog when the pipeline doesn't use a fragment shader 4941- ac/llvm: add missing nir_intrinsic_bindless_image_atomic_{fmin,fmax} 4942- radv: set SAMPLE_MASK_TRACKER_WATERMARK to 15 on GFX11 4943- radv: use correct VGT_TESS_DISTRIBUTION settings on GFX11 4944- radv: do not use memory for waiting for cache flushes on GFX11 4945- radv: drop the ES vertex count requirement on GFX11 4946- radv: add RADEON_FLAG_DISCARDABLE 4947- radv: create and configure the ring BO for attributes 4948- radv: declare shader arguments for attributes via memory 4949- nir: add nir_intrinsic_load_ring_attr_{offset}_amd 4950- radv: add lowering for nir_intrinsic_load_ring_attr_{offset}_amd 4951- radv: allocate more space for pipeline statistics query on GFX11 4952- ac: add radeon_info::has_vrs_ds_export_bug 4953- radv: advertise fragmentShadingRateWithShaderDepthStencilWrites on NAVI23+ 4954- aco: fix tcs_wave_id unpacking on GFX11 4955- radv: declare new dynamic states 4956- radv: add support for dynamic polygon mode 4957- radv: add support for dynamic tessellation domain origin 4958- radv: add support for dynamic logic op enable 4959- radv: add support for dynamic line stipple enable 4960- radv: add support for dynamic alpha to coverage enable 4961- radv: add support for dynamic sample mask 4962- radv: add support for dynamic depth clip enable 4963- radv: add support for dynamic conservative rasterization mode 4964- radv: add support for dynamic depth clip negative one to one 4965- radv: add support for dynamic provoking vertex mode 4966- radv: add support for dynamic depth clamp enable 4967- radv: advertise VK_EXT_extended_dynamic_state3 4968- radv: disable VK_AMD_texture_gather_bias_lod on GFX11 4969- radv: disable AMD_shader_explicit_vertex_parameter on GFX11 4970- radv/llvm: do not set COMPR for exports on GFX11 4971- ac/nir,radv/llvm: add support for the attribute ring 4972- aco,radv/llvm: do not export parameters on GFX11 4973- ac: fix has_vrs_ds_export_bug for VanGogh 4974- radv: disable {shader,sparse}ImageFloat32AtomicMinMax on GFX11 4975- radv: disable shaderBufferFloat64AtomicMinMax on GFX11 4976- radv: disable dual source blending in more situations 4977- ac/llvm: add support for device clock on GFX11 4978- aco: add support for s_sendmsg_rtn_b{32,64} 4979- aco: split the sendmsg enumeration into sendmsg_rtn 4980- aco: add support for device clock on GFX11 4981- ac/nir/ngg: fix emitting streamout output by using packed location 4982- radv: fix suspending/resuming pipeline statistics queries with GDS 4983- radv: fix VRS limit when attachmentFragmentShadingRate is disabled 4984- radv/llvm: fix dual source blending on GFX11 4985- radv: move nir_opt_idiv_const/nir_lower_idiv after NGG lowering 4986- ac/nir/ngg: fix emitting streamout output by using packed location 4987- radv: implement transform feedback queries with NGG streamout 4988- Revert "radv: add a pointer to radv_shader_binary in radv_shader" 4989- aco: create a new builder variant for ds_add_rtn 4990- aco: implement NIR intrinsics for NGG streamout 4991- aco: remove invalid assertions for NGG streamout 4992- radv/llvm: prevent emitting streamout outputs for NGG 4993- radv: call nir_io_add_intrinsic_xfb_info() after IO lowering 4994- radv: lower NIR intrinsics for NGG streamout 4995- radv: do not unconditionally disable NGG streamout lowering in NIR 4996- radv: reword an incorrect comment about primitives generated query 4997- radv: use 32-bit GDS counters for primitives generated query with NGG 4998- radv: suspend/resume primitives generated query separately for legacy/NGG 4999- radv: fix primitives generated query with NGG only 5000- radv: set the correct buffer size for NGG streamout 5001- radv: enable all possible features with NGG streamout 5002- radv: allow to enable NGG streamout with RADV_PERFTEST=ngg_streamout 5003- radv: enable VK_EXT_transform_feedback with NGG streamout on GFX11 5004- ac/nir: stop using nir_gather_xfb_info_from_intrinsics 5005- radv: assign IO var locations for VS/TES with NGG streamout 5006- radv/amdgpu: add a kernel GDS management workaround for -ENOMEM 5007- radv: use defines instead of magic values for GDS counters offset 5008- radv: move GDS counters after reserved GDS offsets for streamout 5009- radv: fix fallback for extreme geometry with tessellation on GFX11 5010- radv: fix setting MIN_LOD for texture descriptors on GFX11 5011- radv: make the GDS/GDS OA buffer objects resident 5012- radv: invalidate L2 instead of only writeback L2 when using DCC stores 5013- radv: re-emit the guardband state when restoring meta operations 5014- ac/nir,radv: rework and fix NGG queries enables for VS/TES 5015- ac/nir: do not convert GS outputs to the expected variable size on GFX11 5016- radv: re-emit NGG culling settings when conservative rast mode is dynamic 5017- radv: enable lowering of subgroup shuffle in NIR on GFX11+ 5018- aco: fix p_interp_gfx11 to not overwrite SCC 5019- aco: fix missing SCC for p_interp_gfx11 in emit_interp_mov_instr() 5020- aco: add p_dual_src_export_gfx11 for dual source blending on GFX11 5021- aco: fix dual source blending on GFX11 5022- aco: fix FS inputs loads in WQM with 16-bit 5023- radv: suspend/resume XFB queries with NGG for meta operations 5024- zink: require extendedDynamicState3ColorBlendEquation for full ds3 5025- radv: stop overallocating LDS for VS/TES when NGG streamout is enabled 5026- radv: fix computing the pervertex LDS size with NGG streamout 5027- radv: enable NGG XFB queries only if streamout is enabled 5028- radv: re-mit streamout buffers to unbind them when NGG streamout is disabled 5029- radv: fix NGG streamout when it's never enabled in runtime 5030- radv: fix possible hangs with NGG streamout and secondary cmdbuf 5031- aco: fix emitting DEALLOC_VGPRS in the discard block 5032- radv: re-emit dynamic depth clamp enable if depth clip enable changed 5033- radv: make sure to mark DCC as compressed on GFX11 5034- radv: introduce RADV_DEBUG=nofmask 5035- radv: set missing SPI_SHADER_PGM_xxx registers on GFX11 5036- radv: set INTERPOLATE_COMP_Z to 0 on GFX11 5037- radv: do not enable DCC for MSAA images without FMASK 5038- radv: do not enable NGG culling on GFX11 5039- radv: disable VRS entirely on GFX11 5040 5041Sarah Walker (3): 5042 5043- pvr: Update FWIF compute register structure 5044- pvr: Remove STRIP_RENDERING members from FWIF 5045- pvr: Update FWIF 3d and compute register structures 5046 5047Sathishkumar S (2): 5048 5049- frontends/va: support yuv 400/444 rt_formats in vaconfig 5050- frontends/va: reallocate surface for yuv400/yuv444 picture 5051 5052Sergei Chernyadyev (1): 5053 5054- radv: fix setting results for initialization failures in thread trace and trap handler 5055 5056Shuicheng Lin (1): 5057 5058- iris: return failure if iris_resource_configure_main fail 5059 5060Sil Vilerino (76): 5061 5062- gallium/va: vaDeriveImage to check PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP 5063- d3d12: Implement cap PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP 5064- frontends/va: Add HEVC decode slice descriptors 5065- frontends/va: Add HEVC decode args: IntraPicFlag, no_pic_reordering_flag, no_bipred_flag 5066- frontends/vdpau: Fill HEVC Decode param IntraPicFlag 5067- frontends/omx: Fill HEVC Decode param IntraPicFlag 5068- frontends/va: Mark IsLongTerm in HEVC decode args 5069- frontends/va: Add HEVC Encode support multi slice and extend pipe args 5070- frontends/va: Extend single to multiple L0-L1 references for HEVC Encode 5071- frontends/va: Support HEVC caps regarding features, block sizes, prediction direction 5072- d3d12/va: Name convention rename PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP to PIPE_VIDEO_CAP_SUPPORTS_CONTIGUOUS_PLANES_MAP 5073- gallium/vl: Allow vl_zscan.h to be included from C++ 5074- gallium/vl: Rename s_addr variable in vl_idct.c as it conflicts with windows existing inaddr.h keyword definition 5075- d3d12: Add HEVC Decode/Encode 5076- d3d12: Avoid extra allocation, copies when generating DXVA_Slice_Hxxx_Short arrays 5077- d3d12: Avoid local allocations for D3D12_RESOURCE_BARRIER on hot paths 5078- d3d12: Avoid heap allocations on hot path d3d12_video_decoder_dxva_picparams_from_pipe_picparams_hevc 5079- d3d12: Fill feedback in d3d12_video_encoder_encode_bitstream so vaSyncSurface properly populates buf->coded_size 5080- d3d12: Fix leak in d3d12_video_proc when re-creating ID3D12VideoProcessor 5081- d3d12: Fix winsys displaytarget leak in d3d12_resource 5082- d3d12: Fix leak in d3d12_resource_from_resource and usage in d3d12 video dec, enc 5083- d3d12: Add support for importing d3d12_video_buffer from handle 5084- d3d12: Allow formats other than NV12 in d3d12_video_buffer 5085- d3d12: Allow video processing for formats other than NV12 5086- d3d12: Add VPBlit processor check for D3D12_FEATURE_VIDEO_PROCESS_MAX_INPUT_STREAMS 5087- d3d12: d3d12_video_buffer_create_impl make resident after checking for resource creation 5088- frontends/va: Implement vaSyncBuffer 5089- d3d12: Update HEVC Encode GOP on I frames too 5090- d3d12: Video process - Remove unnecessary batches flush 5091- d3d12: Add initialization values for d3d12_video_encoder 5092- d3d12: Report PIPE_VIDEO_CAP_REQUIRES_FLUSH_ON_END_FRAME 5093- d3d12: Add support for PIPE_VIDEO_CAP_ENC_SUPPORTS_ASYNC_OPERATION 5094- d3d12: Change type of m_FenceValue to uint64_t in d3d12_video_encoder 5095- d3d12: Export some util functions from d3d12_fence for d3d12 video 5096- d3d12: Keep track of feedbacks from previous encode executions 5097- d3d12: Change d3d12_video_encoder to not automatically flush 5098- d3d12: Move encode configuration and capabilities into separate structs 5099- d3d12: Encoder to use independent command allocators per in flight resource set 5100- d3d12: d3d12_video_encoder add reset() to prevent leaks on re-allocation 5101- d3d12: Have d3d12_video_encoder keep separate references for enc, heap and dpb allocations for in flight resources 5102- d3d12: Make d3d12 encode operations async and do not block waiting 5103- d3d12: Add output buffer to inflight resourceset 5104- d3d12: Fix redundant/inconsistent initializations for d3d12_video_encoder 5105- d3d12: Enhance get_feedback not enough buffers message 5106- d3d12: Extract into common variable for metadata slot calculation 5107- d3d12: Make get_feedback return correctly on error when writing \*size=0 5108- d3d12: Relax slice caps requirements for video encode 5109- d3d12: Fix out of bounds index check for texture array DPB 5110- d3d12: Add debug verbosity for output reconstructed picture 5111- d3d12: Fix texture array DPB encode path 5112- d3d12: Fix usage of %ld with PRIu64 instead for cross-platform build 5113- d3d12: Replace size_t with uint64_t in d3d12_video_encoder definitions for cross-plat compatibility 5114- frontends/va: Add more params from VADecPictureParameterBufferAV1 and VASliceParameterBufferAV1 to pipe_av1_picture_desc 5115- frontends/va: Support AV1 Decode with multiple tiles (num_elements > 1) in a single VASliceParameterBufferAV1 buffer 5116- frontends/va: Use is_video_format_supported in vlVaVidEngineBlit instead of format list 5117- frontends/va: Fix AV1 decode qm_* parameters reading from VADecPictureParameterBufferAV1 5118- d3d12: Add support for AV1 Video Decode 5119- d3d12: Support more video formats 5120- d3d12: Fix aliased usage of m_SliceControlBuffer in d3d12_video_decoder_prepare_dxva_slices_control_XXXX 5121- d3d12: Style variable fix in d3d12_video_decoder_prepare_dxva_slices_control_av1 5122- frontends/va: Add VP9 decode multi slice information 5123- frontends/va: Add VP9 decode ref/mode_deltas information 5124- d3d12: Add VP9 Decode support 5125- d3d12: Fix get_index7bits - Reuse previously freed indices 5126- d3d12: HEVC Encode workaround for edge case in caps reporting not contemplated by upper layer interface 5127- d3d12: Fix HEVC wrong caps detection due to bad parenthesis in condition 5128- d3d12: HEVC Set D3D12_VIDEO_ENCODER_CODEC_CONFIGURATION_HEVC_FLAG_USE_ASYMETRIC_MOTION_PARTITION when required by caps 5129- gallium/vl: Add vl_winsys_win32 support 5130- frontends/va: Add windows VA frontend support via vl_winsys_win32 and libva-win32 5131- d3d12: resource_from_handle to validate importing resource ptr from same d3d12 device 5132- CI: Add gallium-va and video-codecs in windows-vs2019 and debian-mingw32-x86_64 5133- d3d12: Create d3d12_resource objects for video usage keeping their DXGI\_..._UNORM formats 5134- d3d12: HEVC Decode - Fix slice size and offsets translation from VA to DXVA 5135- frontend/va: Fix WIN32 VA_DRIVER_INIT_FUNC declaration. Remove declspec as it uses .def file 5136- ci: Add va frontend to windows-build-rules 5137- d3d12: Video Screen - Do not crash if HEVC not supported, return no support instead 5138 5139Simon Fels (1): 5140 5141- venus: drop duplicated YCBCR_2_PLANE_444_FORMATS_FEATURES_EXT 5142 5143Simon Ser (2): 5144 5145- vulkan/wsi/x11: detect Xwayland via the XWAYLAND extension 5146- egl: fix error string for dri2_create_image_dma_buf() 5147 5148Simon Zeni (10): 5149 5150- mesa: fix EXT_EGL_image_storage target validation 5151- gallium: track if st_egl_image was created by a dmabuf 5152- mesa: simplify st_egl_image binding process for texture storage 5153- mesa/st: check egl image and texture target match before binding 5154- mesa: fix GL_INVALID_OPERATION in glEGLImageTargetTexStorageEXT 5155- mesa: unlock texture on error path in glEGLImageTargetTexStorageEXT 5156- egl: fix Xlib include in egldisplay.h 5157- egl: set API symbols public 5158- EGL: sync headers with Khronos 5159- meson: remove unused EGL_NO_X11 definition 5160 5161Sonny Jiang (1): 5162 5163- radeonsi/vcn: add decode support for gfx1101 and gfx1103 5164 5165SoroushIMG (22): 5166 5167- zink: Fix incorrect emission of SPIR-V shift ops 5168- zink: track min_samples state for per sample shading 5169- zink: lower 64bit vars for drivers without float64 5170- zink: optimise lower_64bit_vars for when int64 is supported 5171- zink: lower 64bit pack when float64 not supported 5172- zink: fix nir_op_unpack_64_2x32 emission 5173- zink: fix spirv temp func variable class emission 5174- nir: track whether a loop contains soft fp64 ops 5175- nir: allow to fine tune unrolling for loops with soft fp64 ops 5176- zink: enable loop unrolling for loops that have soft fp64 5177- zink: enable native ETC2 if vk driver supports it 5178- zink: fix invalid Offset set for variables which do not need an offset 5179- zink: stop enabling minmax filtering when not supported 5180- zink: fix isNan mismatch between NIR and SPIR-V 5181- lavapipe: stop allocating 0 size const buffer 5182- zink: cleanup pushconst interface between driver/compiler 5183- zink: use unified pushconst layour for passthorugh tcs 5184- zink: cleanup zink_pipeline_layout_create 5185- zink: add pushconst only pipeline layout 5186- zink: add new framebuffer_is_layered state 5187- zink: clamp gl_Layer output to 0, if framebuffer is not layered 5188- zink: limit gl_Layer clamping to drivers that need it 5189 5190Sunil Khatri (1): 5191 5192- winsys/amdgpu: clamp up the alignment if zero 5193 5194Sviatoslav Peleshko (2): 5195 5196- iris: Always initialize shader compilation queue ready fence 5197- driconf/intel: Use fake vendor to WA bad detection in Source engine games 5198 5199Tapani Pälli (40): 5200 5201- glsl: allow image*Shadow keywords on ES and GLSL >= 420 5202- glsl: add check for too large atomic counter buffer offset 5203- iris: implement Wa_14015946265 for DG2 5204- anv: implement Wa_14015946265 for DG2 5205- intel/compiler: implement Wa_14014595444 for DG2 5206- intel/compiler: fix assert from ver to verx10 5207- intel/genxml: add VFG_PREEMPTION_CHICKEN_BITS register 5208- anv: disable preemption on VFG, Wa_14015207028 for DG2 5209- iris: disable preemption on VFG, Wa_14015207028 for DG2 5210- anv: remove vk_sample_locations_state from emit_multisample 5211- intel/common: clamp sample location coordinate range 5212- anv: remove primitive_topology from 3DPRIMITIVE calls 5213- mesa/st: fix a set_sampler_views call parameter order 5214- anv: implement Wa_14016118574 5215- iris: implement Wa_14016118574 5216- anv: limit to default fast clear color when image used for transfer 5217- intel: revert preemption disable via VFG changes 5218- intel/fs: mark debug variables with ASSERTED 5219- anv: mark debug variables with ASSERTED 5220- mesa: mark debug variables with ASSERTED 5221- anv: dynamic state for polygon mode 5222- anv: dynamic state for tessellation domain origin 5223- anv: dynamic rasterization stream 5224- anv: dynamic line stipple enable 5225- anv: dynamic line rasterization mode 5226- anv: dynamic state for logic op enable 5227- anv: dynamic alpha to one enable 5228- anv: dynamic states for depth clip and clamp 5229- anv: dynamic provoking vertex mode 5230- anv: dynamic color write mask 5231- anv: dynamic color blend equation 5232- anv: dynamic multisample sample mask 5233- anv: toggle on EXT_extended_dynamic_state3 5234- anv: move emit_shading_rate to gfx8_cmd_buffer 5235- anv: fill AlphaToCoverageEnable lazily from state 5236- intel/compiler: run nir_opt_idiv_const before nir_lower_idiv 5237- anv: remove some unused functions 5238- hasvk: remove some unused functions 5239- iris: setup stage bitmask for Wa_22011440098 5240- anv: setup stage bitmask for Wa_22011440098 5241 5242Tatsuyuki Ishi (4): 5243 5244- radv: Implement radv_flush_before_query_copy to workaround UE Vulkan bugs. 5245- radv: Remove unused functions. 5246- radv: Remove unused radv_deferred_queue_submission. 5247- radv,lvp,anv: Delete pre-common dispatch leftover. 5248 5249Thomas Debesse (7): 5250 5251- r600: set clear_buffer = u_default_clear_buffer 5252- r600: info.stage MESA_SHADER_KERNEL as MESA_SHADER_COMPUTE 5253- nv50: call nir_lower_flrp 5254- clover: implement CLOVER_DEVICE_TYPE like RUSTICL_DEVICE_TYPE 5255- Revert "gallium/clover: pass -no-opaque-pointers to Clang", opaque pointers are now implemented 5256- gallium/clover: pass -opaque-pointers to Clang on LLVM 15 and 16 5257- crocus: set clear_buffer = u_default_clear_buffer 5258 5259Thomas H.P. Andersen (15): 5260 5261- nvc0: fix a warning -Wconstant-conversion 5262- radeonsi: avoid a use-after-free 5263- d3d12: fix warnings for missing-braces on clang 5264- r600: fix warnings for missing-braces on clang 5265- ci: remove missing-braces from no-error 5266- util: avoid deprecated builtin has_trivial_destructor 5267- spirv: avoid allocating memory twice 5268- hasvk: Switch to the common descriptor update template struct 5269- panvk: Implement VK_KHR_descriptor_update_template 5270- spirv: mark function unused 5271- llvmpipe: mark function unused 5272- llvmpipe: drop unused function 5273- panfrost: avoid warning about unused function 5274- vulkan: avoid warning about unused function 5275- vulkan: drop const for descriptor set 5276 5277Timothy Arceri (24): 5278 5279- Revert "nir: Preserve offsets in lower_io_to_scalar_early" 5280- glsl: dont lower precision for textureGatherOffsets 5281- glsl: make packed varying helper needs_lowering() external 5282- glsl: fix location for array subscript 5283- nir: update nir_is_supported_terminator_condition() 5284- nir: support loop unrolling with inot conditions 5285- glsl: remove GLSL IR inverse comparison optimisations 5286- nir/loop_analyze: delay instruction cost calculation 5287- nir/loop_analyze: remove cost of redundant selects 5288- util/conf: enable init to zero workaround for Exanima 5289- util/radeonsi: enable zerovram workaround for Exanima 5290- radv: add radv_zero_vram workarounds for OpenGL games 5291- glthread: fix matrix stack depth tracking 5292- mesa: add EXT_debug_label support 5293- glthread: leave dlist dispatch in place for Begin/End 5294- glsl/glsl_to_nir: remove unreachable code 5295- glsl: move rule inside lower_packing_builtins() 5296- glsl: drop sub to add neg lowering in GLSL IR 5297- glsl: always do {CARRY,BORROW}_TO_ARITH lowering 5298- glsl: move lower instructions logic inside that pass 5299- st/glsl_to_ir: remove stale comment 5300- st/glsl_to_ir: remove stale include 5301- mesa: fix typo from adding glGetObjectLabelEXT 5302- nir: fix typo in lower_double options handling 5303 5304Timur Kristóf (64): 5305 5306- ac/nir/cull: Move some code from cull_bbox into helper functions. 5307- ac/nir/cull: Move the contents of cull_bbox into ac_nir_cull_triangle. 5308- ac/nir/cull: Change if condition for bounding box culling. 5309- ac/nir/ngg: Move LDS store of accepted flag into the inner branch. 5310- ac/nir/cull: Make cull functions more consistent. 5311- aco: Optimize branching sequence during SSA elimination. 5312- aco: Remove branch instruction when exec is constant non-zero. 5313- ac/nir/cull: Tweak phi for cull_small_primitive branch. 5314- ac/nir/cull: Fix typo in bounding box culling. 5315- aco: Add faster code path to store_lds for consecutive write mask. 5316- aco: Fix invalidated reference in branching sequence optimization. 5317- aco: Check for instructions that inhibit the branching sequence optimization. 5318- aco/optimizer_postRA: Don't try to optimize dead instructions. 5319- aco: Support s_cselect_b64 in SCC no-compare optimization. 5320- aco: Improve SCC nocompare optimization when SCC is clobbered. 5321- aco: Fix p_init_scratch for task shaders. 5322- vulkan, spirv: Update to Vulkan 1.3.226 and latest SPIR-V headers. 5323- spirv: Support EXT_mesh_shader indices and mark them per-primitive. 5324- spirv: Add mesh_shading capability for EXT_mesh_shader. 5325- spirv: Support EXT_mesh_shader mesh/task stages. 5326- spirv: Support EXT_mesh_shader SetMeshOutputsEXT. 5327- spirv: Support the CullPrimitiveEXT mesh shader built-in. 5328- spirv: Support TaskPayloadWorkgroupEXT storage class. 5329- spirv, nir: Handle EmitMeshTasksEXT opcode. 5330- ac/nir/ngg: Add EXT_mesh_shader primitive indices. 5331- ac/nir/ngg: Add EXT_mesh_shader CullPrimitiveEXT output. 5332- ac/nir/ngg: Add EXT_mesh_shader vertex/primitive count. 5333- radv: Implement EXT_mesh_shader draw calls. 5334- radv: Enable EXT_mesh_shader on RDNA2 with RADV_PERFTEST=ext_ms 5335- nir/lower_system_values: Add shortcut for 1D workgroups. 5336- nir/gather_info: Clear cross-invocation output mask. 5337- radv: Replace NV_mesh_shader macros with EXT. 5338- radv: Enable subgroup feature bits for mesh and task shaders. 5339- radv: Change max preferred task workgroup invocations to 64. 5340- radv: Remove dead shader temps after linking. 5341- ac/nir/ngg: Don't use LDS for same-invocation indices and cull outputs. 5342- ac/nir/ngg: Don't initialize same-invocation mesh shader outputs. 5343- aco/tests: Add post-RA optimizer testcase for partially overwritten VCC. 5344- aco/tests: Add post-RA DPP test cases with control flow. 5345- aco/tests: Add post-RA SCC no-compare tests cases with control flow. 5346- aco/optimizer_postRA: Mark a register overwritten when predecessors disagree. 5347- aco/optimizer_postRA: Don't assume all operand registers were written by same instr. 5348- aco/optimizer_postRA: Fix logical control flow handling. 5349- aco/optimizer_postRA: Clarify terminology. 5350- aco: Change inverse-comparison optimization to work with s_not 5351- spirv: Treat EmitMeshTasksEXT as a terminating instruction. 5352- radv: Use a fallback for marketing name when libdrm doesn't know it. 5353- ac/nir/ngg: Fix cross-invocation indices and cull outputs. 5354- aco: Fix build error with std::max on GCC 12 5355- nir: Document the flatten/dont_flatten selection control options. 5356- nir: Add selection control enum for always taken divergent branches. 5357- aco: Allow explicitly removing jumps on GFX10+ when beneficial. 5358- ac/nir/ngg: Remove jumps from some branches where we know LGKMCNT==0. 5359- ac/nir/tess: Remove jump from tess factor writes. 5360- nir, ac, aco: Add ACCESS intrinsic index to load/store_buffer_amd. 5361- aco: Cleanup load_vmem_mubuf and store_vmem_mubuf functions. 5362- nir, ac, aco: Add index src to load_buffer_amd/store_buffer_amd. 5363- aco: Optimize MUBUF 0 offset when idxen is also being used. 5364- aco/optimizer_postRA: Use unique_ptr + array for instruction indices. 5365- aco/optimizer_postRA: Speed up reset_block() with predecessors. 5366- aco/optimizer_postRA: Properly handle vccz/execz/scc in reset_block. 5367- aco/optimizer_postRA: Delete dead instructions more efficiently. 5368- aco: Move is_dead to aco_ir.h to allow it to get inlined. 5369- aco: Add ACO_DEBUG=novalidateir option. 5370 5371Tomeu Vizoso (16): 5372 5373- radeonsi/ci: Move libva and piglit jobs to Raven Ridge boards 5374- radv/ci: Put one more board to run the CTS on Stoney Ridge 5375- iris/ci: Test on Jasper Lake Chromebooks 5376- iris/ci: Set FDO_CI_CONCURRENT for all Chromebook jobs 5377- ci: Report failures to GitLab for LAVA jobs 5378- ci: Make sure that the Crosvm control socket has been removed 5379- Revert "ci: set venus on lavapipe to manual due to flakes" 5380- ci: Crosvm won't remove the control socket file on stop 5381- Revert "Revert "Revert "ci: set venus on lavapipe to manual due to flakes""" 5382- ci: Uprev ci-templates 5383- ci: Use --no-install-recommends to avoid problems with --no-remove 5384- ci: Install sysvinit-core without --no-remove 5385- ci: Stop explicitly passing env vars to FDO_DISTRIBUTION_EXEC command 5386- gallium: Add PIPE_CAP_QUERY_TIMESTAMP_BITS 5387- ci: Run RadeonSI jobs on changes to src/amd/ci 5388- radeonsi,radv/ci: Increase coverage 5389 5390Victor Hermann Chiletto (1): 5391 5392- radv: refactor descriptor set layout tracking 5393 5394Vinson Lee (10): 5395 5396- bin/perf-annotate-jit.py: Update to Python 3. 5397- zink: Remove duplicate variable zero. 5398- pan/bi: Fix memory leaks. 5399- egl/dri2: Fix missing return with dri2_egl_error_unlock. 5400- radv: Use count_tes_user_sgprs return value. 5401- d3d12: Initialize d3d12_video_encoder_references_manager_hevc::m_current_frame_idx. 5402- radv: Fix file descriptor leak. 5403- zink: Remove duplicate variable zm. 5404- r600/sfn: Initialize TestInstrFromString member m_instr_factory. 5405- v3d: Fix initializer-overrides warning. 5406 5407Väinö Mäkelä (1): 5408 5409- hasvk: Correctly set NonPerspectiveBarycentricEnable on gfx7 5410 5411Xin LI (1): 5412 5413- meson: Improve detection of qsort_r(). 5414 5415Yiwei Zhang (52): 5416 5417- venus: increase busy wait order in vn_relax 5418- venus: double the abort timeout to allow long shader compiles 5419- venus: fix external memory ext filtering 5420- venus: require necessary extensions for common wsi support 5421- venus: avoid feedback for external fence 5422- venus: use a separate sync fence for Android wsi 5423- venus: lazily create queue wait fence and make it non-external 5424- venus: avoid pre-allocating the feedback pool 5425- venus: sync to latest venus protocol headers for syncFdFencing 5426- venus: stub out renderer sync_fd fencing features 5427- venus: put android wsi on the sub-optimal path 5428- venus: fix vn_GetFenceFdKHR 5429- venus: fix vn_GetSemaphoreFdKHR 5430- venus: query renderer sync_fd props to fill the feature stubs 5431- venus: re-implement sync_fd external fence 5432- venus: re-implement sync_fd external sempahore 5433- venus: avoid scrubing wsi/external sempahores 5434- venus: allow no external memory handle when renderers lacks support 5435- venus/ci: mark a test as flake 5436- venus: some clang format fixes 5437- venus: enable zink required extensions on Android 5438- venus: enable KHR_driver_properties on Android 5439- venus: double the abort timeout 5440- loader: use os_get_option for driver override 5441- zink: fix core support on Android 5442- zink: fix zink_create_fence_fd to properly import 5443- zink: fix in-fence lifecycle 5444- zink: implement fence_get_fd required by EGL android platform 5445- venus: clean up vn_QueueSubmit 5446- venus: force synchronous submission for external signal semaphore 5447- venus: fix an oob in descriptor pool state restoration 5448- venus: vn_ResetDescriptorPool to reset mutable type states 5449- venus: zero out the pipeline handles 5450- venus: avoid fixing pipeline if not derivative 5451- venus: ignore pInheritanceInfo if not secondary command buffer 5452- venus: ensure to clear VkPipelineCreationFeedback::flags if needed 5453- mesa/st: emit string marker based on PIPE_CAP_STRING_MARKER 5454- meson: libelf is optional 5455- venus/ci: remove redundant crosvm patch 5456- venus: move vendor ext to bottom and sort the ext list 5457- docs: update venus extension support 5458- docs: move VK_EXT_extended_dynamic_state3 out from 1.3 5459- ci/venus: enable render server for venus testing 5460- venus: avoid accessing local var in VN_ADD_EXT_TO_PNEXT_OF 5461- venus: remove redundant codes 5462- venus: handle VkAndroidHardwareBufferFormatProperties2ANDROID 5463- venus: sync to latest venus protocol headers 5464- venus: add VK_EXT_primitives_generated_query support 5465- venus: add VK_EXT_depth_clip_control support 5466- docs: update to latest venus driver support 5467- venus: handle VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT 5468- venus: fix android wsi with global fencing disabled 5469 5470Yonggang Luo (218): 5471 5472- meson: Place all pre_args about HAVE_DRI* into single place 5473- meson: Cleanup usage dri_drivers as it's a dead option 5474- d3d12: Fixes compile error with mingw/gcc-x64 when static linkage to runtime library 5475- meson: clc compiler test only works on win32 5476- microsoft/clc: Fixes compiling errors with clang/mingw64 in clc/clc_compiler_test.cpp 5477- util: open_memstream also support on MacOS 5478- c11: Fixes the declaration of timespec_get in time.h for macOS 5479- egl: Remove eglextchromium.h and import eglext_angle.h 5480- egl: wire up ANGLE_sync_control_rate 5481- egl/x11: Split declaration of dri2_format_for_depth into header platform_x11.h 5482- egl/x11: implement ANGLE_sync_control_rate 5483- ci/windows: Setup proper MESA_IMAGE_TAG for Windows build/test image 5484- c11: It's should detect PTHREAD_DESTRUCTOR_ITERATIONS when define TSS_DTOR_ITERATIONS 5485- c11: call impl_tss_dtor_invoke with tls callback for win32 5486- c11: Implement and use of thrd_current properly on win32 5487- util: Implement util_thread_get_time_nano on win32 5488- util: Fixes typo in function os_get_total_physical_memory 5489- util: Implement os_get_available_system_memory on Win32 5490- util: Use \`DETECT_OS_WINDOWS` instead \`WIN32` in u_process.c 5491- util: Enable tests that disabled for msvc/mingw 5492- zink: Use DETECT_OS_WINDOWS instead of 'WIN32' 5493- meson: Enable graw tests on mingw 5494- util: Add os_create_anonymous_file support on win32 5495- nine: Fixes compile of nine_pipe.h with clang ../../src/gallium/frontends/nine/nine_pipe.h:377:9: error: unannotated fall-through between switch labels [-Werror,-Wimplicit-fallthrough] default: ^ ../../src/gallium/frontends/nine/nine_pipe.h:377:9: note: insert 'break;' to avoid fall-through default: 5496- panfrost: Do no use designated initializer for union 5497- intel: Fixes compile error of aubinator_viewer.cpp with gcc 5498- util: Fixes memory leak in create_cache_item_header_and_blob 5499- util: Fixes memory leak in u_printf_test.cpp 5500- util: Fixes invalid assumption that return non null by function util_format_fetch_rgba_func 5501- util: Disable usage of __attribute__((__const__)) when the compiler is clang 5502- ci: Improve MacOS github ci 5503- ci: Enable llvmpipe on MacOS 5504- c11: Remove mtx_try from mtx enums as it's not a part of c11 standard 5505- c11: #include <threads.h> when the os/platform provide it 5506- anv: Fixes struct anv_device::info is not initialized with struct anv_physical_device 5507- meson: Use different STRACEDIR folder for intel_devinfo_override_test 5508- meson: add enable-glcpp-tests option 5509- ci: disable glcpp tests for now 5510- Revert "ci: disable unit tests" 5511- util: Use literal instead call to list_inithead in u_queue.c 5512- mapi: Remove unused ENTRY_CURRENT_TABLE_GET 5513- mapi: Remove usage of _glapi_check_multithread and _glapi_destroy_multithread 5514- mapi: Remove reference to unused _gl_DispatchTSD 5515- mapi: Move shared _glapi_set_context and _glapi_set_dispatch into u_current.c 5516- mapi: Move shared stub into u_current.c 5517- mapi: Access _glapi_tls_Dispatch directly without need of macro 5518- docs: Remove document about USE_ELF_TLS 5519- mapi: Improve comment about _glapi_tls_Dispatch and _glapi_tls_Context 5520- aco: Use unreachable instead assert(false) 5521- amd/compiler: Fixes warning [-Wunused-variable] in test_to_hw_instr.cpp 5522- amd/compiler: Fixes warning [-Wunused-variable] in test_optimizer_postRA.cpp 5523- nir: Fixes [-Wdeprecated-declarations] in serialize_tests.cpp 5524- util: Fix [-Wdeprecated-declarations] in mesa-sha1_test.cpp 5525- glsl: Fixes [-Wdeprecated-declarations] in list_iterators.cpp 5526- intel/compiler: Fixes [-Wdeprecated-declarations] in test_eu_compact.cpp 5527- intel/compiler: Fixes [-Wdeprecated-declarations] in test_eu_validate.cpp 5528- virgl: Fixes [-Wdeprecated-declarations] in virgl_staging_mgr_test.cpp 5529- osmesa: Fixes [-Wdeprecated-declarations] in test-render.cpp 5530- ci: Remove -Wno-error=deprecated-declarations cpp option from debian-clang 5531- util: Remove unused rtasm_cpu_has_sse2 5532- util: Rename util_cpu_detect to _util_cpu_detect_local 5533- util: Improve has_sse has_sse2 has_sse3 has_sse4_1 has_avx disable handling 5534- util: Handling GALLIUM_NOSSE in u_cpu_detect.c 5535- util: Handling LP_FORCE_SSE2 in u_cpu_detect.c 5536- llvmpipe: Remove the hack for LLVM4 in lp_bld_init.c 5537- gallium: Fixes memory leak of translate_test.c 5538- util: Use environment variable GALLIUM_OVERRIDE_CPU_CAPS to remove usage of util_cpu_caps in translate_test.c 5539- util: Move detect_done out of struct util_cpu_caps_t 5540- util: Add api util_call_once_with_context 5541- util: Add extern "C" guard in simple_mtx.h 5542- util: Remove the need of _MTX_INITIALIZER_NP in simple_mtx.h 5543- meson/panfrost: Add dep_valgrind for libpanfrost_pixel_format to fixes the compiling error: 5544- llvmpipe: Fixes warning [-Wimplicit-fallthrough] in lp_bld_swizzle.c 5545- llvmpipe: Fixes warning [-Wimplicit-fallthrough] in lp_bld_sample_soa.c 5546- radeonsi: Fixes warning [-Wimplicit-fallthrough] in si_descriptors.c 5547- freedreno: Fixes warning [-Wimplicit-fallthrough] in freedreno_util.h 5548- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd4_texture.h 5549- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd2_texture.c 5550- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd3_texture.c 5551- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd5_texture.h 5552- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd5_emit.c 5553- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd6_emit.c 5554- vdpau: Fixes warning [-Wimplicit-fallthrough] in vdpau_private.h 5555- v3dv: Fixes warning [-Wimplicit-fallthrough] in v3dvx_pipeline.c 5556- v3dv: Remove redundant unreachable in v3dvx_pipeline.c 5557- st: Fixes warning [-Wimplicit-fallthrough] in st_texture.c 5558- llvmpipe: Remove llvm version guard LLVM_VERSION_MAJOR <=> 4 5559- meson: bump the minimal required llvm version to 5.0.0 5560- util: Refactoring check_cpu_caps_override out 5561- util: Add ssse3 and avx cpu caps override option 5562- util: Move the detect of max_vector_bits into u_cpu_detect.c 5563- util: add support for detecting avx512 vector bit size 5564- llvmpipe: Always using util_get_cpu_caps to get cpu caps for llvm on x86 5565- llvmpipe: Enable avx512 support in llvm 5566- docs: document LP_NATIVE_VECTOR_WIDTH GALLIUM_OVERRIDE_CPU_CAPS GALLIUM_NOSSE LP_FORCE_SSE2 5567- ci: EXTRA_OPTIONS -> EXTRA_OPTION for debian-clang 5568- ci: Enable GALLIUM_DUMP_CPU=true only in the clang job 5569- ci: Enable all possible meson build options for clang 5570- ci: Add debian-clang-release build pipeline 5571- ci/fedora: Re-enable vulkan-layers=device-select,overlay 5572- gallium/hud: Fixes compile error in hud/hud_context.c: 5573- util: Fixes comment about debug_get_option 5574- mesa: direct use of mtx_recursive is not c11 conformance 5575- radeonsi: direct use of mtx_recursive is not c11 conformance 5576- svga: direct use of mtx_recursive is not c11 conformance 5577- c11: Update the values of enum mtx_plain, mtx_recursive, mtx_timed to make sure mtx_recursive != mtx_plain | mtx_recursive 5578- drm-shim: drop gnu99 override 5579- clover: Fixes use of designated initializers requires in c++ that doesn't support by MSVC 5580- radv: Remove the redundant #include <gelf.h> and #include <libelf.h> in ac_binary.c 5581- aco: Convert to use u8 literal for Unicode character to fixes msvc warning 5582- util: Add util_call_once for optimize call to util_call_once_with_context out for hot path 5583- util: getting get_tracefile in u_trace.c to be thread safe 5584- util: Remove the macro DEBUG_GET_ONCE_FILE_OPTION 5585- radv/r600/clover: Getting libelf to be optional 5586- radv: Getting radeon_icd to be generated properly on win32 5587- dzn: rename module_dir to icd_lib_path for consistence 5588- lavapipe/meson: rename module_dir to icd_lib_path and swap two lines for consistence 5589- Update 22.2.0.rst 5590- ci: debian-clang: drop -Wno-error=unused-function 5591- r600/sfn: Fixes 'numeric_limits' is not a member of 'std' error in sfn/sfn_instr.cpp 5592- aco: Fixes compiling error about char8_t with c++20 5593- ci/docs: Stick to specific version of python packages 5594- util: Remove the include of windows.h when compiling with MSVC 5595- util: Implement atomic operations consistently across compilers and testing for it 5596- util/mesa/wide: Rename _SIMPLE_MTX_INITIALIZER_NP to SIMPLE_MTX_INITIALIZER 5597- util: Turn -DWINDOWS_NO_FUTEX to be pre_args 5598- ci/windows: Enable gles1 for msvc 5599- ci/windows: Remove -Dlibelf:warning_level=1 as libelf subproject are already removed 5600- ci/windows: Getting the default supported windows version to be 7 when using mingw 5601- gallium: delete graw 5602- nir: No need redefine snprintf anymore in nir.h 5603- d3d10ump: No need define snprintf anymore in InputAssembly.cpp 5604- util: Move the implementation of futex_wake and futex_wait from futex.h to futex.c 5605- docs: Remove graw related words 5606- docs: Remove rbug related documents 5607- nouveau: Remove XVMC_VL environment variable usage 5608- docs: Remove document about XvMC 5609- meson: add dep_valgrind dependency for 'dxil_compiler' 5610- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_stack.c 5611- util: Call function mtx_init on exit_mutex to replace the usage of _MTX_INITIALIZER_NP in u_queue.c 5612- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_symbol.c 5613- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_stack_android.cpp 5614- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_memory.c 5615- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_M in u_debug_refcnt.c 5616- mapi: Use util_call_once to init exec_mem and mutex instead _MTX_INITIALIZER_NP 5617- mapi: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in stub.c 5618- gallium/auxiliary: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER 5619- egl/wgl: Use util_call_once_data to replace usage of mtx_t glFlushMutex 5620- egl/dri2: Use util_call_once_data to replace usage of mtx_t glFlushMutex 5621- mesa: Use util_call_once_data instead of mtx_t init_once_lock 5622- util: #include "c11/threads.h" when necessary in simple_mtx.h 5623- util: #include "util/macros.h" in u_cpu_detect.h for ATTRIBUTE_CONST 5624- c11: Remove the code for Windows XP 5625- meson/mesa/wide: define WIN32_LEAN_AND_MEAN with pre_args 5626- util: Remove the guard WIN32_WINNT >= 0x0600 in u_thread.h 5627- c11: Getting the macro guard of HAVE_PTHREAD in c11/threads.h to be consistence with util/u_uthread.h 5628- util: #include <windows.h> in disk_cache_os.c 5629- llvmpipe: #include <windows.h> in lp_rast.c 5630- util: Remove include of windows.h in memstream.h 5631- util: Remove include of windows.h in rwlock.h 5632- util: Move the function implementation from u_thread.h to u_thread.c 5633- glx: Guard usage of infer_zink && !explicit_zink in glxext.c 5634- glx: Trim trailing spaces in in glx/glxext.c 5635- glx: Remove redundant #ifndef GLX_USE_APPLEGL in glx/glxext.c 5636- glx: Adopt guard of glx_direct and glx_accel in #if defined(GLX_DIRECT_RENDERING) && !defined(GLX_USE_APPLEGL) 5637- glx: Adot #ifdef GLX_USE_WINDOWSGL in #if defined(GLX_DIRECT_RENDERING) && !defined(GLX_USE_APPLEGL) 5638- st: Remove st_api_type because ST_API_OPENVG is not an option anymore 5639- ci: libxvmc-dev is not a dependence anymore 5640- meson: upgrade zlib wrap 5641- ci: upgrade zlib to 1.2.13 in mesa_deps_build.ps1 5642- ci: Add Get-Date in each package build in mesa_deps_build.ps1 5643- ci: Move building libva before clone llvm-project in mesa_deps_build.ps1 5644- ci: Install the clang-cl compiler within mesa_deps_vs2019.ps1 5645- ci: Trigger building visual studio docker image 5646- meson: with_glx never assigned to 'gallium-xlib', so need add with_glx == 'xlib' for dep_glproto 5647- meson: with_glx never assigned to 'gallium-xlib', remove 'gallium-xlib' 5648- xlib: Remove macro guard with GLX_INDIRECT_RENDERING in xlib.c 5649- ci/macos: Building both dri and xlib version of glx 5650- hgl: Remove non-nused hgl_create_st_api 5651- st: move function st_visual_have_buffers into st_manager.c 5652- st: OpenVG are removed, remove st_api::profile_mask as it's constant 5653- st: OpenVG are removed, remove st_api::feature_mask as it's constant 5654- st: Remove st_api::destroy 5655- st: Turn st_api::destroy_drawable to be direct accessed function st_api_destroy_drawable 5656- st: Turn st_api::get_current to be direct accessed function st_api_get_current 5657- st: Turn st_api::make_current to be direct accessed function st_api_make_current 5658- st: Turn st_api::create_context to be direct accessed function st_api_create_context 5659- st: Turn st_api::query_versions to be direct accessed function st_api_query_versions 5660- xlib: Implement xmesa_get_name by return literal "Mesa " PACKAGE_VERSION directly 5661- st: Remove struct st_api, st_gl_api_create and st_gl_api.h 5662- aco: Do not define NOMINMAX as it's already defined in pre_args now 5663- util: Remove non used UTIL_MAX_L3_CACHES in u_thread.h 5664- util: Merge duplicated function prototype in u_thread.h that named with util_barrier* 5665- util: Move UTIL_MAX_CPUS from u_thread.h to u_cpu_detect.h 5666- util: #include "c11/threads.h" instead of #include "util/u_cpu_detect.h" u_format.c 5667- xlib: Fixes compiling on linux 5668- xlib: Remove the linkage hack in lib.c by use meson link_whole option 5669- gallium: Remove mgl* prefix in linker script osmesa.sym and libgl-xlib.sym 5670- xlib: Indent with space and trim trailing spaces of xlib files 5671- ci/macOS: Getting the installed binary to be artifacts 5672- ci: Getting debian-clang-release to building with glx=xlib 5673- xlib: Getting xmesa_strict_invalidate to be function 5674- xlib: Remove the usage of global constructor in xlib.c, so the xm_public.h are removed 5675- docs: -Ddri-drivers= option is deprecated, do not use it anymore 5676- docs: There is only glx=xlib option now 5677- docs: There is no more swrast driver 5678- android: -Ddri-drivers= is not needed anymore 5679- glx: Fixes apple/apple_visual.c that include non-exist util/debug.h 5680- ci: Fixes macos.yml 5681- radv: Fixes prototypes 5682- meson: Refactoring shared gen_vs_module_defs_normal_command out 5683- meson: fixes mingw-clang32 building 5684- clover: Rename \*OpenCL.def to \*OpenCL.def.in 5685- clover: Fixes building with mingw-x86 5686- meson: Fixes name_prefix for clover on mingw 5687- clover: empty soversion when on win32 5688- util: Test __PPC64__ for getting PIPE_ARCH_PPC_64 respond to __PPC64__ take effect 5689- gallivm: use valid LLVM_VERSION_MAJOR instead of LLVM_VERSION 5690 5691Yusuf Khan (7): 5692 5693- nouveau: implement and enable PIPE_CAP_QUERY_MEMORY_INFO 5694- tu: remove tu_syncobj_to_fd 5695- nouveau: get timestamp from os_time_get_nano instead of os_time_get 5696- nouveau: put nv04 push macros in nouveau_winsys 5697- nv50/ir: nir_op_b2i8 and nir_op_b2i16 5698- gallium/util: add a helper for get_timestamp 5699- nouveau: enable PIPE_CAP_UMA when appropriate 5700 5701jenneron (1): 5702 5703- docs/panfrost: Document Mali-T620 support 5704 5705noasakurajin (1): 5706 5707- disable zinks shader cache when the needed functions do not exist 5708 5709nyanmisaka (2): 5710 5711- gallium: Do not include offscreen winsys if glx is disabled 5712- radeonsi: Inherit the old modifier when reallocating the texture. 5713 5714semjon00 (4): 5715 5716- hasvk: don't export flush_compute_state 5717- hasvk: don't export gfx state flushing helper 5718- hasvk: combine flushes in Draw/DrawIndexed/DrawIndirectByteCountEXT 5719- hasvk: force inline more pipe flush functions 5720 5721sjfricke (2): 5722 5723- isl: fix bug where sb.MOCS is not being set 5724- anv: fix assert to build with shader cache disabled 5725