1Mesa 24.2.0 Release Notes / 2024-08-14 2====================================== 3 4Mesa 24.2.0 is a new development release. People who are concerned 5with stability and reliability should stick with a previous release or 6wait for Mesa 24.2.1. 7 8Mesa 24.2.0 implements the OpenGL 4.6 API, but the version reported by 9glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / 10glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. 11Some drivers don't support all the features required in OpenGL 4.6. OpenGL 124.6 is **only** available if requested at context creation. 13Compatibility contexts may report a lower version depending on each driver. 14 15Mesa 24.2.0 implements the Vulkan 1.3 API, but the version reported by 16the apiVersion property of the VkPhysicalDeviceProperties struct 17depends on the particular driver being used. 18 19SHA256 checksum 20--------------- 21 22:: 23 24 c02bb72cea290f78b11895a0c95c7c92394f180d7ff66d4a762ec6950a58addf mesa-24.2.0.tar.xz 25 26 27New features 28------------ 29 30- VK_KHR_dynamic_rendering_local_read on RADV 31- VK_EXT_legacy_vertex_attributes on lavapipe, ANV, Turnip and RADV 32- VK_MESA_image_alignment_control on RADV 33- VK_EXT_shader_replicated_composites on ANV, dozen, hasvk, lavapipe, nvk, RADV, and Turnip 34- VK_KHR_maintenance5 on v3dv 35- VK_KHR_maintenance7 on RADV 36- VK_EXT_depth_clamp_zero_one on v3dv 37- GL_ARB_depth_clamp on v3d 38- Defaulting to a new shader cache implementation, reducing filesystem overhead. 39 40 41Bug fixes 42--------- 43 44- u_debug_stack_test.capture_not_overwritten fails on i386 45- turnip: vulkaninfo crashed 46- turnip-freereno Build error (/usr/local/include/xf86drm.h:40:10: fatal error: drm.h: No such file or directory) 47- tu: compilation failing when compiling turnip with only kgsl and X11 enabled 48- samplerCube constructor in fragment shader no longer converting uvec2 bindless texture handles ( segmentation fault , core dumped ) 49- anv: gen9.5 flakiness in dEQP-VK.multiview.dynamic_rendering.depth.* 50- vaapi decoding corruption with green blocks 51- interpolateAt precision lowering unhandled by glsl_to_nir() 52- [anv] CS2 crashes on LNL 53- [anv] Dota2 does not start on LNL 54- [radeonsi][bisected][regression] glClientWaitSync() quickly times out with INT64_MAX timeout 55- d3d10umd: Build regression in 24.2.0-devel 56- zink/tu: glcts flake on a750 57- nouveau: advertises GL_EXT_memory_object without implementing \`*UID` callbacks 58- LIBGL_DRIVERS_PATH gone 59- [Regression][Vulkan][TGL][Bisected]vkCmdCopyQueryPoolResults failed to write buffer with compute pipeline on Mesa 24.1 60- Worms Revolution: not rendering explosion effects 61- crash on pushbuf_validate nvc0_blit do_blit_framebuffer 62- piglit: cl-api-build-program crashes 63- i915g: glGenerateMipmap() fails with 2048×2048 textures 64- [radeonsi] Asterix & Obelix XXLRomastered: river misrendered (completely black) 65- Build fails without Vulkan 66- No dependency check for PyYAML in meson.build 67- GPU Hang in Metal Gear Rising Revengeance 68- VK_ERROR_DEVICE_LOST A770 DXVK Fallout 3 69- [Bisected] Recent compile issue in libnak 70- anv: Wrong push constant values for bytes? 71- anv: dEQP-VK.protected_memory tests GPU hang on MTL 72- RustiCL (or maybe not...): radeonsi freezes after 2 hours of simulation, zink works just fine 73- ci_run_n_monitor.py doesn't monitor manual jobs 74- Crash in util_idalloc_resize due to glBindTexture with a way-too-large ID 75- mesa-24.1.2 fails to compile: ast_to_hir.cpp:5371:39: error: '%s' directive argument is null 76- [regression][bisected] VMware Xv video displays as black rectangle 77- Blender 4.2,4.3 crashes when rendering with motion blur on RDNA3 cards (OpenGL/radeonsi) 78- nvk: regression with multiple games crashing 79- Transparent background in Blender 3D view with nouveau 80- turnip: latest git does not build anymore 81- ACO Unimplemented intrinsic instr 82- RADV/ACO: assert on per-sample interpolation 83- radv: large descriptor layout creation is slow 84- Gnome shell (wayland) crashes when opening any window 85- DRI Intel drivers fix a problem in Redhat 7 (Mesa 18), but are not included for Redhat 8 (Mesa versions v23, v24) 86- Vulkan: ../src/nouveau/vulkan/nvk_physical_device.c:1109: VK_ERROR_INCOMPATIBLE_DRIVER 87- RADV: Smooth lines affect triangle rendering 88- [armhf build error][regression] error: ‘StringMapIterator’ was not declared in this scope; did you mean ‘llvm::StringMapIterator’? 89- Build fails with latest llvm 19: error: no matching function for call to unwrap(LLVMOpaqueModule*&) 90- tu: support KHR_8bit_storage 91- Incorrect colours on desktop and apps 92- nir: Incorrect nir_opt_algebraic semantics for signed integer constants causing end-to-end miscompiles 93- src/gallium/frontends/clover/meson.build:93:40: ERROR: Unknown variable "idep_mesaclc". 94- panfrost: mpv is broken on T604 95- Nightly CI is broken 96- [radv] GPU hang in Starfield on RX 5700 XT 97- anv, isl, iris: Clarify and improve CCS + FCV on gfx12 98- isl: CPCB horizontal and vertical alignment requirements unknown 99- Indika: flickering black artifacting on the snow 100- intel/isl: Split Xe2 changes into new files of Xe2. 101- rusticl: Generated rusticl_mesa_bindings.c fails to find include 102- isl: Remove 512B pitch requirement for non-displayable CCS 103- MESA 24.1 - broken zink OpenGL under Windows 104- Blue flickering rectangles on AMD RX 7600 105- GPU hangs on AMD Radeon RX 6400 on a fragment shader 106- v3dv: vkcube-wayland crashes 107- intel/brw: scoreboarding regression 108- regression in !29436 for radv+angle on stoney 109- [radv][regression] Starfield invisible terrain on a 7900 XTX 110- free_zombie_shaders() leave context in a bad state (access violation occurs) 111- r300: X11 fails to start with the modesetting driver (glamor is broken with R300/R400 gpus). 112- [NINE]Far Cry 1 trees flicker regression[bisected][traces] 113- Vulkan: Most sync2 implementations are missing new access flags 114- Incorrect buffer_list advance when writing disjoint image descriptors 115- ANV: Block shadows in Cyberpunk on Intel A770 116- ACO ERROR: Temporary never defined or are defined after use 117- [ANV] Graphics memory allocation in Total War: Warhammer 3 118- DG2: God of War trace fails to play 119- Borderlands trace fails to play on dg2 120- NVK: Vulkan apps simply terminated with segfault under wayland and Xwayland 121- NVK: VK_ERROR_OUT_OF_DEVICE_MEMORY on swapchain creation 122- anv/zink regression: piglit.spec.arb_fragment_layer_viewport.layer-no-gs 123- [anv] failures when upgrading vulkancts 1.3.6 -> 1.3.7 on intel mesa ci 124- RustiCL: deadlock when calling clGetProfilingInfo() on callbacks 125- [Intel][Vulkan][Gen12] Vulkan compute shader is 3x slower than the same OpenCL kernel 126- turnip: Broken AHB support 127- zink: nir validation failures in Sparse code 128- nir: nir_opt_varyings uses more stack than musl libc has 129- dEQP-VK.pipeline.pipeline_library.shader_module_identifier.pipeline_from_id.graphics regression 130- freedreno + perfetto missing dependency on adreno_common.xml.h 131- anv: unbounded shader cache 132- radv: Crash due to nir validation fail in Enshrouded 133- vulkan/wsi/wayland: valgrid reports invalid read in \`vk_free` call in \`wsi_wl_surface_analytics_fini` 134- android: sRGB configs no longer exist after !27709 135- bisected: turnip: deqp regressions 136- aco: Radeonsi unable to use rusticl 137- anv: clean up default_pipeline_cache in anv_device 138- [24.1-rc4] fatal error: intel/dev/intel_wa.h: No such file or directory 139- Turnip driver is crashing since turnip: ANB/AHB support got merged 140- vcn: rewinding attached video in Totem cause [mmhub] page fault 141- When using amd gpu deinterlace, tv bt709 properties mapping to 2 chroma 142- a530: ir3_context_error assertion (unknown vertex shader output name: VARYING_SLOT_EDGE) 143- VCN decoding freezes the whole system 144- [RDNA2 [AV1] [VAAPI] hw decoding glitches in Thorium 123.0.6312.133 after https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28960 145- radv regression bisected: Flickering textures (vega) 146- [Regression][Bisected] EGL/Wayland: QT applications terminated by SIGSEGV (Address boundary error) when using dGPU 147- WSI: Support VK_IMAGE_ASPECT_MEMORY_PLANE_i_BIT_EXT for DRM Modifiers in Vulkan 148- nvk: Tracker issue for gamescope support 149- nvk: Implement VK_EXT_image_drm_format_modifier 150- nvk: NVK_DEBUG=zero_memory is hitting an assert 151- nvk: Implement VK_EXT_conservative_rasterization 152- zink sparse: Improper semaphore handling 153- zink sparse: Reference to mip tails should be refcounted 154- radv: Enshrouded GPU hang on RX 6800 155- NVK Zink: Wrong color in Unigine Valley benchmark 156- intel vulkan incremental build takes forever 157- intel vulkan incremental build takes forever 158- 24.0.6: build fails 159- shader with multidimensional array in shader storage buffer takes too long to compile 160- panforst: T604 issue with using u32 for flat varyings 161- lp_screen.c:601:4: error: 'snprintf' will always be truncated; specified size is 16, but format string expands to at least 17 162- [anv] FINISHME: support YUV colorspace with DRM format modifiers 163- gen9/11 test became flaky: piglit.spec.!opengl 1_4.blendminmax 164- mesa 24 intel A770 KOTOR black shadow smoke scenes 165- nvk: Implement VK_EXT_pipeline_robustness 166- [bisected][regression] kitty fails to start due to \`glfwWindowHint(GLFW_SRGB_CAPABLE,true)` 167- r600: bisected 5eb0136a3c561 breaks a number of piglits 168- [bdw][bisected][regression] assertion failure in nir_validate.c 169- Graphical glitches in RPCS3 after updating Vulkan Intel drivers 170- [R600] OpenGL and VDPAU regression in Mesa 23.3.0 - some bitmaps get distorted. 171- VAAPI radeonsi: VBAQ broken with HEVC 172- tu: weird fail in packing tests 173- radv/video: 10-bit support 174- radv: vkCmdWaitEvents2 is broken 175- anv: add support for EDS3::extendedDynamicState3AlphaToCoverageEnable 176- ci: switch from CI_JOB_JWT to id_tokens 177- Zink: enabled extensions and features may not match 178- anv: share embedded samplers 179 180 181Changes 182------- 183 184Adam Jackson (8): 185 186- mesa: Enable EXT_shadow_samplers for GLES2 187- gallium: Rename ${target}/target.c to ${target}/{$target}_target.c 188- treewide: Include mesa_interface.h not dri_interface.h 189- mesa_interface: Set ourselves free 190- mesa_interface: Move out of GL/internal/ 191- gallium/dril: Compatibility stub for the legacy DRI loader interface 192- dri: Let dril handle the DRI driver link farm 193- gallium/meson: Deconflate swrast/softpipe/llvmpipe 194 195Adrian Perez de Castro (1): 196 197- Revert "egl/wayland: Remove EGL_WL_create_wayland_buffer_from_image" 198 199Alejandro Piñeiro (29): 200 201- v3dv/cmd_buffer: always bind pipeline static state 202- v3dv/ci: dEQP-VK.dynamic_state.*.double_static_bind are fixed now 203- v3dv: port dynamic state tracking to use Mesa Vulkan 204- v3dv: provide implementation for vkCmdBindVertexBuffers2 205- v3dv: provide implementation for CmdSetViewportWithCount 206- v3dv: CullMode and FrontFace are dynamic now 207- v3dv: DepthBoundsTestEnable is dynamic now 208- v3dv: move depth CFG bits setting to cmd buffer emission 209- v3dv: ez_state/incompatible_ez_test could be recomputed at cmd_buffer 210- v3dv: PrimitiveTopology is now dynamic 211- v3dv: StencilOp and StencilTestEnable are now dynamic 212- v3dv/ci: update expected list due VK_EXT_extended_dynamic_state 213- v3dv: enable VK_EXT_extended_dynamic_state 214- v3dv/cmd_buffer: missing updates due PrimitiveTopology being dynamic now 215- v3dv: fixes StencilTestEnable handling 216- v3dv: PrimitiveRestartEnable is now dynamic. 217- v3dv: DepthBiasEnable is dynamic now 218- v3dv: SetRasterizerDiscardEnable is dynamic now 219- v3dv: enable VK_EXT_extended_dynamic_state2 220- v3dv: add debug option to disable custom pipeline caches for meta operations 221- v3dv/meta_clear: take into account multiview for the custom clear pipeline caches 222- v3dv/meta_clear: use v3dv_renderpass used as parameter 223- v3dv/device: compute maxDescriptorSet*Limits multiplying per-stage by 4 224- v3dv/device: set DescriptorUpdateAfterBind limits 225- v3d/devinfo: unify comment style 226- broadcom: move HW-dependant constants to v3d_device_info 227- v3d,v3dv: document cl_emit_with_prepacked 228- v3dv/pipeline: ensure vk_graphics_pipeline_all_state alive when still needed 229- drm-shim: stub synobj_timeline_wait and query ioctl 230 231Aleksi Sapon (5): 232 233- lavapipe: fixes for macOS support 234- lavapipe: build "Windows" check should use the host machine, not the \`platforms` option. 235- util: fix memory related OS calls on macOS 236- wsi: fix compilation on macOS 237- util: macOS support for cnd_monotonic 238 239Alessandro Astone (1): 240 241- egl/gbm: Walk device list to initialize DRM platform 242 243Alex Deucher (1): 244 245- ac/surface: fix version check for gfx12 DCC 246 247Alexandre Marquet (2): 248 249- pan/mdg: quirk to disable auto32 250- panfrost: implement SFBD raw format support on v4 251 252Alexandros Frantzis (2): 253 254- egl/wayland: Pass dri2_wl_formats to create_dri_image 255- egl/wayland: Fail EGL surface creation if opaque format is unsupported 256 257Ali Homafar (1): 258 259- lavapipe: Set ICD api_version to 1.3 260 261Alyssa Rosenzweig (222): 262 263- vulkan: optimize vk_dynamic_graphics_state_any_dirty 264- vulkan: add helper to fill out spirv caps automatically 265- nir/lower_subgroups: add filter 266- nir/lower_subgroups: add generic scan/reduce lower 267- nir/lower_subgroups: relax ballot_type_to_uint 268- nir/lower_robust_access: also handle image derefs 269- docs: add header-stub for vk_enum_to_str 270- vulkan: add vk_debug_ignored_stype helper 271- nvk: use common stype debug 272- broadcom: use common stype debug 273- pvr: use common stype debug 274- anv,hasvk: use common stype debug 275- dzn: use common stype debug 276- nir: add is_first_fan_agx sysval 277- nir: add texops for AGX border colour emulation 278- nir: add quad_ballot_agx intrinsic 279- nir,agx: add depth=never workaround 280- nir,agx: fix load_active_subgroup_index 281- compiler: add ACCESS_IN_BOUNDS_AGX 282- agx: optimize and/or with booleans 283- agx: enable more lowering 284- agx: fix query LOD of array 285- agx: fix some ms texture packing 286- agx: handle cross-workgroup memory barriers 287- agx: allow 8-bit bcsel 288- agx: fix phi translation corruption 289- agx: fix load_helper_invocation with sample shading 290- agx: fix frag sidefx with sample shading 291- agx: handle subgroup barriers 292- agx: fix spilling inside sample loop 293- agx: switch to demote internally 294- agx: lower nir_intrinsic_load_num_subgroups 295- agx: delete unreachable blocks 296- agx: model more subgroup ops 297- agx: lower shuffle 298- agx: handle non-immediate shuffles in divergent CF 299- agx: handle quad_broadcast 300- agx: handle quad swaps 301- agx: add missing b2b16 implementation 302- agx: forbid uniforms on ballots 303- agx: lower 8-bit subgroups 304- agx: flesh out subgroup lowering 305- agx: report uses_txf 306- agx: expose agx_link_libagx 307- agx: document another sample_mask restriction 308- agx: reserve scratch registers for mem<-->mem swaps 309- agx: optimize txf with lod 0 310- agx: fix bogus unit test 311- agx: stash early_fragment_test info 312- agx: handle quad reduce 313- agx: implement quad_ballot 314- agx: lower more quad ops 315- agx: optimize elect() 316- agx: fix UB in cursor comparison 317- ail: constify everything 318- asahi: mark eMRT loads as in-bounds 319- asahi: calculate validity when unpacking 320- asahi: agx_translate_sample_count 321- asahi: assert bo size > 0 322- asahi: unwrap pointless null check 323- asahi: implement PIPE_CAP_QUERY_MEMORY_INFO 324- asahi: rm unused #include 325- asahi: resize key 326- asahi: cleanup fs epilog link info 327- asahi: move agx_link_varyings_vs_fs 328- asahi: fix prolog emit 329- asahi: pack UVS key properly 330- asahi: plumb shader stage into info 331- asahi: get debug in common 332- asahi: rm deadcode 333- asahi: drop rgb10a2_sint rendering 334- asahi: add missing rgba4 format 335- asahi: fix 1D array atomics 336- asahi: fix txf/image_load robustness with arrays 337- asahi: rework VBO lower for divisor=0 338- asahi: mv AGX_MAX_OCCLUSION_QUERIES define 339- asahi: handle agx_ppp_fragment_face_2 with no info 340- asahi: clarify format code in image lowering 341- asahi: fix rgb565 blending 342- asahi: fix store_output component/offset 343- asahi: fix sample ID with multiblock epilogs 344- asahi: lower texture instructions with epilogs 345- asahi: fix cull unknown bits 346- asahi: simplify image atomic lowering 347- asahi: move primitive MSAA field 348- asahi: free libagx if we don't use a ralloc memctx 349- asahi: eliminate troublesome empty uniforms 350- asahi: rearrange VS uniforms 351- asahi: set src_type for store_output 352- asahi: rm dead code 353- asahi: add agx_index_size_to_B helper 354- asahi: move some GS lowering into lower_gs 355- asahi: don't use load_num_vertices in geometry shaders 356- asahi: mv vertex_id_for_topology_class into GS lowering 357- asahi: rm another num_vertices use 358- asahi: rm dated comment 359- asahi: rm unused lower 360- asahi: rm num_vertices uses 361- asahi: rm redundant input_vertices 362- asahi: mv initialization of grid z for indirect GS 363- asahi: rm more dead lowering 364- asahi: rm always true param 365- asahi: update comment 366- asahi: update comment for maint5 367- asahi: eliminate num_workgroups for VS->GS + VS->TCS 368- asahi: drop bogus assertion 369- asahi: pack tilebuffer usc word ahead-of-time 370- asahi: add agx_ppp_push_merged helper 371- asahi: use ppp_merge 372- asahi: don't allocate varyings ourselves 373- asahi: don't allocate for ppp updates 374- asahi: extend varying linking for tri fan weirdness 375- asahi: plumb tri fan flatshading through common 376- asahi: don't ralloc in agx_fast_link 377- asahi: extend epilog key for force early frag handling 378- asahi: don't reserve extra UVS space for layer 379- libagx: use sub_group_scan_inclusive_add 380- libagx: add query copy kernel 381- libagx: don't use get_group_id() 382- asahi/decode: QoL improvements 383- asahi: track imports for decode 384- asahi: clean up bg/eot counts 385- asahi: rename meta -> bg/eot 386- asahi: don't allocate for USC words 387- asahi: split frag shader words 388- asahi: split CDM Launch words 389- asahi: unify naming for COUNTS structs 390- nir/tex_instr_result_size: handle subpass_ms 391- nir/lower_point_size: support lowered i/o 392- asahi/decode: drop Apple-specific decode check 393- libagx: rm unused field 394- libagx: fix static assert 395- libagx: fix triangle fan + prim restart + GS/XFB 396- libagx: drop unused !indexed path 397- libagx: add libagx_copy_xfb_counters helper 398- asahi: be robust against out of sync shader info 399- agx: fix draw param gather for sw vs 400- asahi: split param structs for GS internal kernel 401- agx: rework libagx I/O lowering 402- asahi: add missing lowerings 403- asahi: force bindless for eMRT 404- asahi: bounds check eMRT stores 405- asahi: support bigger buffer textures 406- asahi: add AGX_TEXTURE_FLAG_CLAMP_TO_0 flag 407- agx: handle discard with force early tests 408- asahi: pack blend key 409- agx: switch to combined clip/cull 410- asahi: add flag controlling sample mask without MSAA 411- asahi: use scalar outputs for rast shaders 412- asahi: move null descriptor routines to common 413- asahi: implement rba2 for uniform texel buffers 414- asahi: implement rba2 for storage texel buffers 415- agx: prepare for lower_wpos_center 416- asahi: extract agx_calculate_vbo_clamp 417- agx: fix indirect CF accounting 418- mesa: fix duplicate initializer 419- nir/lower_wpos_center: clean up 420- nir/builtin_builder: factor out nir_build_texture_query 421- asahi: use nir_build_texture_query 422- gallium: remove ability to probe asahi 423- asahi: add broken bits of unstable Linux UAPI 424- agx: fix 64-bit bcsel ingestion 425- agx: fix fmin/fmax with (-0, 0) pair 426- libagx: fix uint8_t definition 427- libagx: make index buffer fetch robust 428- libagx: generalize query copies 429- asahi: implement rba2 semantics for vbo 430- asahi: fix vbo clamp with stride=0 431- asahi: implement robustness2 for msaa image stores 432- asahi: be more clever about GS side effects 433- nir/lower_robust_access: handle MSAA images 434- nir: add nir_metadata_control_flow 435- treewide: use nir_metadata_control_flow 436- nir: document restriction on load_smem_amd constantness 437- vulkan: reference count vk_descriptor_update_template 438- vulkan: handle push DUT with emulated secondaries 439- vulkan: fix potential UAF with vk_cmd_enqueue_CmdPushDescriptorSetKHR 440- vulkan: handle enqueueing CmdPushConstants2KHR 441- vulkan: handle enqueueing CmdPushDescriptorSet2KHR 442- lvp: use common push constant enqueue 443- lvp: use common push descriptor set enqueue 444- lvp: fix silly casting for sampler desc updates 445- lvp: use common descriptor update templates 446- nir/format_convert: remove unorm bit size assert 447- nir: add nir_def_replace helper 448- treewide: use nir_def_replace sometimes 449- agx: fix insidious ballot optimizer bug 450- agx: add unit test for ballot bug 451- agx: set discard_is_demote 452- nir: add nir_break_if helper 453- treewide: use nir_break_if 454- nir: fix miscompiles with rules with INT32_MIN 455- nir/algebraic: explicitly suffix constants 456- nir/opt_constant_folding: fix array size define 457- zink: move print_pipeline_stats 458- zink: print pipeline stats for compute shader-db 459- util: add dui/uid helpers 460- nir: add nir_alu_instr float controls queries 461- nir/search: use ALU float control helpers 462- nir: use MIN2/MAX2 opcodes for imin/umax folding 463- nir: strengthen fmin/fmax definitions with signed zero 464- glsl/float64: handle signed zero with min/max 465- nir/lower_double_ops: handle signed zero with min/max 466- nir/lower_alu: remove dead #define 467- nir: add lower_fminmax_signed_zero 468- agx: set lower_fminmax_signed_zero 469- agx: do not flush denorms for fp16 fmin/fmax 470- asahi: drop old comment 471- asahi: drop stale comment 472- asahi: make agx_pack opencl compatible 473- asahi: tuck in null query check 474- libagx: specify heap size explicitly 475- asahi,libagx: tessellate on device 476- libagx: add kernel for incrementing CS counter 477- asahi: handle CS pipeline stat with indirect dispatch 478- libagx: handle VS/IA pipeline stats on GPU 479- asahi: eliminate load_num_workgroups from TCS unrolled ID 480- nouveau/drm-shim: set ram_user 481- nvk: add instruction count exec property 482- nir/opt_constant_folding: fix array size define, pt 2 483- zink: remove extraneous \\n with shaderdb 484- zink: match shader-db report.py format 485 486Amber (1): 487 488- tu: Disable depth and stencil tests when attachment state requires it 489 490Amit Pundir (1): 491 492- android: Fix zink build failure 493 494Amol Surati (1): 495 496- nine: avoid using post-compacted indices with state expecting pre-compacted ones 497 498Antoine Coutant (1): 499 500- drisw: fix build without dri3 501 502Antonio Ospite (2): 503 504- meson: fix deprecation warning in create-android-cross-file.sh 505- android: simplify building libgallium_dri on Android 506 507Arthur Huillet (4): 508 509- nvk: update 3d classes for conservative raster 510- nvk: implement VK_EXT_conservative_rasterization 511- nvk: import SPH headers files from open-gpu-doc 512- nvk: generate Rust bindings from SPH header files 513 514Asahi Lina (1): 515 516- asahi: Make asahi_clc build work on x86_64->x86 builds 517 518Bas Nieuwenhuizen (5): 519 520- radv: Use zerovram for Enshrouded. 521- util/disk_cache: Fix cache marker refresh. 522- util/disk_cache: Delete the old multifile cache if using the default. 523- util/cache_test: Add tests for old cache deletion. 524- relnotes: Add an entry about the new cache default. 525 526Benjamin Lee (1): 527 528- vk/graphics_state: Add last bits for extraPrimitiveOverestimationSize 529 530Boris Brezillon (81): 531 532- panvk: Prepare things so we can push sysvals to our push uniform buffer 533- panvk: Put dynamic uniform buffers at the end of the UBO array 534- panvk: Move the dynamic SSBO descriptors to their own UBO 535- panvk: Prepare dynamic buffer descriptors at bind time 536- panvk: Lower sysvals to push uniforms 537- panvk: Kill panvk_sysval_vec4 538- panfrost: do not write outside num_wg_sysval 539- panfrost: Add the BO containing fragment program descriptor to the batch 540- pan/kmod: Fix a syncobj leak in the panthor backend 541- pan/kmod: Make default allocator thread-safe 542- panvk: clang-format fixups 543- panvk: Kill panvk_queue_get_device() 544- panvk: Get rid of panvk_descriptor_state::dirty 545- panvk: Move panvk_cmd_state::batch to panvk_cmd_buffer::cur_batch 546- panvk: Kill unused dynamic state bits 547- panvk: Get rid of special attribute support 548- panvk: Split the graphics and compute state at the cmd_buffer level 549- panvk: Split compute/graphics pipeline objects 550- panvk: Use memory pools to store pipeline shaders/descriptors 551- panvk: Kill the panvk_pipeline_builder object 552- panvk: Transition the graphics pipeline logic to vk_graphics_pipeline_state 553- panvk: Fully transition to vk_viewport_state 554- panvk: Fully transition to vk_rasterization_state 555- panvk: Fully transition to vk_input_assembly_state 556- panvk: Use vk_color_blend_state to fill our blend constant 557- panvk: Fully transition to vk_depth_stencil_state 558- panvk: Fully transition to vk_multisample_state 559- panvk: Set unused attribute buffers descriptors to zero 560- panvk: Rename non_vs_attribs into img_attribs 561- panvk: Prevent re-emission of image attributes used in vertex shaders 562- panvk: Move VS attribute/buffer state to panvk_cmd_graphics_state 563- panvk: Emit VS-accessible image attributes at a fixed offset 564- panvk: Leave holes in the attribute locations used by a shader 565- panvk: Fully transition to vk_vertex_binding_state 566- pan/blend: Move constant inlining out of pan_blend_create_shader() 567- pan/blend: Expose pan_blend_create_shader() 568- panvk: Add a blend library to deal blend shaders/descriptors 569- panvk: Don't pass the stage to shader_create() 570- panvk: Simplify shader initialization in the pipeline logic 571- panvk: Fix/simplify the shader linking logic 572- panvk: Replace the stages array in panvk_draw_info by vs/fs fields 573- panvk: Move fs_rsd fields to an fs sub-struct 574- panvk: Emit the fragment shader RSD dynamically 575- panvk: Lower global memory IOs 576- panvk: Clean Midgard leftovers in the cmd_close_batch() path 577- panvk: Use vk_pipeline_shader_stage_to_nir() 578- panvk: Kill cmd_get_tiler_context() 579- panvk: Make sure we dump memory mappings before crashing 580- pan/decode: Be robust to NULL texture payload 581- pan/desc: Add missing format in translate_s_format() 582- pan/jc: Drop unused pool argument passed to pan_jc_add_job() 583- panvk: Add a render state to panvk_cmd_graphics_state 584- panvk: Take VK_RENDERING_{RESUM,SUSPEND}ING_BIT flags into account 585- panvk: Force a preload when the render area is not 32x32 aligned 586- panvk: Skip depth/stencil attachments with non-matching aspect mask 587- panvk: Fix dynamic rendering with images containing both depth and stencil 588- panvk: Make sure we don't lose clear-only operations 589- panvk: Make sure we run the fragment shader if alpha_to_coverage is enabled 590- panvk: Make sure replay of command buffers containing Dispatch calls works 591- panvk: Override the default GetRender[in]AreaGranularityKHR() 592- egl: Use gbm_bo_create_with_modifiers2() when the surface has non-zero flags 593- panvk: Fix formatting around OpaqueCaptureAddress implementation 594- panvk/ci: Flag exact_sampling.*.edge_right test as fails 595- pan/bi: Make sure global loads/stores don't exceed 16 bytes 596- pan/bi: Fix dynamic indexing of push constants 597- panvk: Fix Cube/2DArray/3D img -> buf copies 598- panvk: Don't bail out when allocationSize is zero in AllocateMemory() 599- panvk: Prepare for Valhall image views 600- panvk: Prepare for Valhall buffer views 601- panvk: Prepare things for compiling valhall source files 602- panvk: Extend Valhall descriptor set implementation to support Bifrost 603- panvk: Overhaul the Bifrost descriptor set implementation 604- panvk: Refcount private BOs 605- panvk: Store private BOs in lists instead of dynarrays 606- panvk: Prepare panvk_mempool for shared device memory pools 607- panvk: Use memory pools for internal GPU data attached to vulkan objects 608- pan/desc: Extend pan_emit_fbd() to support multilayer rendering 609- pan/desc: Prepare things for fragment job chaining 610- pan/blitter: Let pan_preload_fb() callers queue the jobs to the job chain 611- panvk: Use IDVS jobs when we can 612- panvk: Add support for layered rendering 613 614Caio Oliveira (35): 615 616- intel/brw: Don't print IP as part of the dump 617- intel/brw: Hide register pressure information in dumps 618- intel/brw: Use \`vNN` instead of \`vgrfNN` when printing instructions 619- intel/brw: Fix commas when dumping instructions 620- spirv: Add MESA_SPIRV_DEBUG=values to dump all values 621- intel/brw: Track the number of uses of each def in def_analysis 622- intel/brw: Fix typo in DPAS emission code 623- intel/brw: Add unit tests for scoreboard handling FIXED_GRF with stride 624- intel/brw: Make component_size() consistent between VGRF and FIXED_GRF 625- glsl: Fix warning related to tg4_offsets in release mode 626- intel/brw: Print SWSB information when dumping instructions 627- intel/brw: Reorganize lowering of LocalID/Index to handle Mesh/Task 628- anv: Use brw_nir_lower_cs_intrinsics for lowering Mesh/Task LocalID 629- intel/brw: Remove unused brw_reg related functions 630- intel/brw: Remove RALLOC helper from fs_reg 631- intel/brw: Remove unused variable from test 632- intel/brw: Move fs_reg data members up to brw_reg 633- intel/brw: Use public inheritance for fs_reg/brw_reg 634- intel/brw: Move most member functions from fs_reg to brw_reg 635- intel/brw: Remove conversion from fs_reg to brw_reg 636- intel/brw: Replace some fs_reg constructors with functions 637- intel/brw: Remove duplicated functions between fs_reg/brw_reg 638- intel/brw: Rename brw_reg() helper to brw_make_reg() 639- intel/brw: Make fs_reg an alias of brw_reg 640- intel/brw: Replace uses of fs_reg with brw_reg 641- intel/brw: Rename fs_reg_* helpers to brw_reg_* 642- intel/brw: Move brw_reg helpers into brw_reg.h 643- intel/brw: Don't set acc_wr_control for Xe2 644- intel/brw: Use brw_inst_set_group() to set QtrCtrl and NibCtrl 645- intel/brw: Account for reg_unit() in assembler 646- intel/brw: Don't print extra newlines in assembler 647- intel/brw: Split off assembler logic into library 648- spirv: Don't warn about FPFastMathMode if not OpenCL 649- intel/brw: Convert missing uses of ralloc to linear in fs_live_variables 650- intel/elk: Convert missing uses of ralloc to linear in fs_live_variables 651 652Chia-I Wu (8): 653 654- gallium: add pipe_picture_desc::flush_flags 655- frontends/va: track whether there are imported/exported surfaces 656- frontends/va: set PIPE_FLUSH_ASYNC when possible 657- radeonsi: prep for pipe_picture_desc::flush_flags 658- radeonsi: respect pipe_picture_desc::flush_flags 659- radv: check gs_copy_shader directly for executable props 660- radv: make radv_pipeline_has_ngg static 661- drm-shim: intercept access as well 662 663Christian Gmeiner (42): 664 665- isaspec: Add method to get the displayname of BitSetEnumValue 666- isaspec: Improve 'meta' handling 667- etnaviv: isa: Drop capturing of python output 668- etnaviv: isa: Add clang-format special comments 669- etnaviv: isa: Print dst_full for ALU 670- etnaviv: isa: Switch to enum isa_thread 671- etnaviv: isa: Add more flags to etna_inst 672- etnaviv: isa: Rework modeling of left shift for store/load 673- etnaviv: isa: Add name for full writemask 674- mr-label-maker: Add teflon marker 675- etnaviv: isa: Do src swizzle with isaspec 676- clc: Always use spir for 32 bit 677- etnaviv: Zero init all srcs passed to etna_emit_alu(..) 678- ci: uprev mold to 2.32.0 679- gallium: Add vkms entrypoint 680- nak: Move nak_optimize_nir declaration to nak_private.h 681- meson: Update proc-macro2 subproject 682- meson: Update syn subproject 683- meson: Add pest rust dependencies 684- meson: Add roxmltree rust dependency 685- meson: Add indexmap rust dependencies 686- etnaviv: isa: Add meta elements to instructions 687- etnaviv: isa: Generate Rust FFI bindings for asm.h 688- etnaviv: isa: Make header C++ safe 689- etnaviv: isa: Add meson version check 690- etnaviv: isa: Add IsaParser proc_macro_derive 691- etnaviv: isa: Add struct etna_asm_result 692- etnaviv: isa: Make etna_asm_result usable in Rust 693- etnaviv: isa: Add EtnaAsmResultExt trait 694- etnaviv: isa: Add parser module 695- etnaviv: isa: Add C function impl 696- etnaviv: isa: Add cli assembler 697- etnaviv: isa: Extend disasm test 698- ci/etnaviv: Drop shaders\@glsl-bug-110796 line 699- etnaviv: isa: Drop 1:1 mapping of opc to bits 700- etnaviv: isa: Add support for extended instructions 701- nak: Update comment about explicit padding 702- etnaviv: isa: Add support for bitset's displayname 703- etnaviv: isa: Rework branch instruction 704- nak: Set has_imad32 conditionally 705- nak: Move imad late optimization to nir 706- dri: fix driver names 707 708Christopher Michael (3): 709 710- v3d: Move spec\@arb_texture_view\@rendering-formats, Crash in broadcom-rpi4-fails 711- v3d: Move spec\@!opengl 1.1\@getteximage-formats, Fail in broadcom-rpi4-fails 712- broadcom: fix issue of ‘addr’ is used uninitialized 713 714Colin Marc (3): 715 716- radv/video: don't truncate frame_num and POC to 32 717- vulkan/video: generate profile_tier_level structure correctly 718- vulkan/video: correctly set sub-layer ordering in H.265 VPS/SPS 719 720Collabora's Gfx CI Team (6): 721 722- Uprev Piglit to 7aa7bc1b01d57b4b091c4fc82a94a6ff47f38ebf 723- Uprev Piglit to 8a6ce9c6fc5c8039665655bca4904d5601c6dba0 724- Uprev Piglit to e180f96239edba441f22f58dfc852cafb902844a 725- Uprev Piglit to fdf3fc09deb6beecdf212e65a16c645112540b59 726- Uprev Piglit to 647d0725024f72bc49bbc91c686c5f61168a1fe8 727- Uprev Piglit to 582f5490a124c27c26d3a452fee03a8c85fa9a5c 728 729Cong Liu (1): 730 731- nir: Fix out-of-bounds access in ntt_emit_store_output() 732 733Connor Abbott (92): 734 735- ir3: Add scan_clusters_macro to ir3_valid_flags() 736- ir3: Add scan_clusters.macro to is_subgroup_cond_mov_macro() 737- ir3: Validate tied sources better 738- ir3/ra: Don't demote movmsk instructions to non-shared 739- ir3: Rewrite postsched dependency handling 740- ir3/legalize: Use define for register size 741- ir3: Rewrite regmask implementation 742- ir3/ra: Prepare for shared half-regs 743- ir3/ra: Fix printing shared reg file 744- ir3/ra: Prepare for shared phis 745- ir3: Fix lowering shared parallel copies with immed src 746- ir3/lower_pcopy: Fix immed/const flags for copy from shared 747- ir3: Fix shared parallel copy validation 748- ir3: Don't use swz with shared registers 749- ir3/lower_copies: Handle HW bug with shared half-floats 750- ir3/lower_copies: Fix "inaccessible" half reg lowering with shared regs 751- ir3/ra: Use ra_reg_get_num() for validating num 752- ir3: Use INVALID_REG in array store 753- ir3: Reset num when creating parallel copies 754- ir3: Validate that shared registers are in-bound 755- ir3: Allow propagation of normal->shared copies 756- ir3: Moves with shared destination are always legal 757- ir3/legalize: Take (ss) into account in WaR hazards 758- ir3/legalize: Remove bad (eq) micro-optimization 759- ir3/legalize: any/all/getone are non-prefetch helper users 760- ir3: Use correct category for OPC_PUSH_CONSTS_LOAD_MACRO 761- ir3: Add support for "scalar ALU" 762- ir3: Implement source restrictions for shared ALU 763- ir3: Validate scalar ALU sources 764- ir3: Immediate source for stc is invalid 765- ir3: Don't emit single-source collects 766- ir3/cp: Support swapping mad srcs for shared regs 767- ir3/cf: Don't fold shared conversions 768- ir3: Distinguish lowered shared->normal moves 769- ir3: Add support for ldc.u 770- ir3: Add builder support for shared immediates 771- ir3: Create reduce identity directly 772- ir3: Make type_flags() return a bitmask enum 773- ir3: Support scalar ALU in the builder 774- ir3: Add scalar ALU-specific passes 775- ir3: Get sources before emitting scan_clusters.macro 776- ir3: Rewrite shared reg handling when translating from NIR 777- ir3: Directly use shared registers when possible 778- ir3/nir: Fix imadsh_mix16 definition 779- ir3: Use scalar ALU instructions when possible 780- ir3: Don't scalarize all SSBO instructions 781- ir3: Don't manually scalarize SSBO loads 782- freedreno/a7xx: Add AQE-related registers from kgsl 783- freedreno/a7xx: Add A7XX_HLSQ_DP_STR location from kgsl 784- freedreno/crashdec: Initial a7xx support 785- freedreno: Update HLSQ_*_CMD registers for a7xx 786- docs/android: Fix example meson cross file again 787- ir3: Put VS->TCS barrier after preamble 788- ir3/legalize: Insert dummy bary.f after preamble 789- freedreno,ir3: Add has_early_preamble 790- tu: Workaround early preamble HW bug 791- freedreno/a6xx: Workaround early preamble HW bug 792- ir3: Add ir3_info::early_preamble 793- tu: Implement early preamble 794- freedreno/a6xx: Implement early preamble 795- ir3: Enable early preamble 796- tu: Use image aspects for feedback loops 797- tu: Support VK_EXT_attachment_feedback_loop_dynamic_state 798- tu: Use a7xx terminology for flushes 799- freedreno, tu: Use CLEAN events on a7xx 800- tu: Fix unaligned indirect command synchronization 801- tu: Don't WFI after every dispatch 802- freedreno/a7xx: Fix register file size 803- ir3: Make sure constlen includes stc/ldc.k/ldg.k instructions 804- freedreno: Disable early preamble on a6xx gen4 805- ir3, tu, freedreno: Move early_preamble to ir3_shader 806- tu: Add early preamble statistic 807- ir3: Introduce elect_any_ir3 808- ir3: Use elect_any_ir3 in preambles 809- freedreno: Fix RBBM_NC_MODE_CNTL variants 810- tu: Add support for aligned substreams 811- ir3: Fix UBO size with indirect driver params 812- tu: Make cs writeable for GMEM loads when FDM is enabled 813- tu: Fix fdm_apply_load_coords patchpoint size 814- tu: Support VK_EXT_fragment_density_map on a750 815- tu: Support bufferDeviceAddressCaptureReplay on kgsl 816- freedreno: Fix decoding primitive counter events on a7xx 817- tu: Add VPC hardware workaround for a750 818- ir3: Fix stg/ldg immediate offset on a7xx 819- nir/instr_set: Return the matching instruction 820- nir/instr_set: Don't remove matching instruction 821- ir3: Split out bindless tex/samp encoding 822- ir3: Don't consider r63.x as a GPR 823- ir3: Plumb through descriptor prefetch intrinsics 824- ir3: Make preamble rematerialization common code 825- ir3: Expand preamble rematerialization 826- ir3: Add descriptor prefetching optimization on a7xx 827 828Constantine Shablia (5): 829 830- pan/bi: fix 1D array tex coord lowering 831- pan/bi: clean up tex coord lowering 832- panfrost: report correct MAX_VARYINGS 833- panvk: remove descriptor pool counters 834- panvk: enable KHR and EXT BDA 835 836Corentin Noël (8): 837 838- zink: Always call deinit_multi_pool_overflow when destroying zink_descriptor_pool_multi 839- ci: Allow to override the virglrenderer render server 840- venus: sync protocol for VK_KHR_maintenance5 841- venus: enable VK_KHR_maintenance5 842- venus/ci: add more recently found flakes 843- wsi: Make sure to return a valid wayland id string 844- venus/ci: Update expectations 845- ci: Make sure to install libraries in the right directory on debian 846 847Craig Stout (8): 848 849- util: detect_os: add DETECT_OS_FUCHSIA and DETECT_OS_POSIX_LITE 850- util: u_thread: add Fuchsia support 851- util: os_misc: add Fuchsia support 852- util: u_dl: add Fuchsia support 853- util: os_time: add Fuchsia support 854- vulkan/util: add missing dependencies 855- meson: remove unnecessary line continuation 856- vulkan/runtime: add spirv_info_h to vulkan_lite_runtime_header_gen_deps 857 858Daniel Lundqvist (1): 859 860- radeonsi: Fix unused variable when LLVM is not used for AMD. 861 862Daniel Schürmann (69): 863 864- aco/ra: fix kill flags after renaming fixed Operands 865- aco/ra: assert that the register file is empty after register allocation completed 866- aco/lower_phis: simplify check for uniform predecessors 867- aco: introduce aco_opcode::p_boolean_phi 868- aco/vn: copy-propagate trivial phis 869- aco/lower_phis: generalize init_state() so that it works with any scalar phis 870- aco/lower_phis: implement SGPR phi lowering 871- aco: use SGPR phi lowering for uniform phis in divergent merge blocks 872- aco: use SGPR phi lowering for all loop header phis 873- aco: use SGPR phi lowering for all scalar phis 874- aco/optimizer: remove p_linear_phi handling from optimizer 875- radv: mark nir_opt_loop() as not idempotent 876- radv: move nir_opt_dead_cf() before nir_opt_loop() 877- panfrost: skip gles-3.0-transform-feedback-uniform-buffer-object on Mali G52 and G57 878- nir/loop_analyze: adjust negative (or huge) iteration count check for bit size 879- nir/opt_if: don't split ALU of phi into otherwise empty blocks 880- nir/opt_loop: add loop peeling optimization 881- aco/ra: fix handling of killed operands in compact_relocate_vars() 882- aco/ra: Fix array access when finding register for subdword variables 883- aco/ra: refactor get_reg_simple() with increased stride. 884- aco/ra: move can_write_m0() check into get_reg_specified() 885- aco/ra: re-use registers from killed operands 886- aco/ra: change heuristic to first fit 887- aco/ra: use round robin register allocation 888- aco/assembler: fix MTBUF opcode encoding on GFX11 889- aco/assembler: slightly refactor MTBUF assembly for more readability 890- aco/assembler: fix GFX67 MTBUF opcode encoding 891- aco/scheduler: remove unused register_demand parameter 892- aco: move live var information into struct Program 893- aco/reindex_ssa: replace live_var parameter with boolean 894- aco: make aco::monotonic_buffer_resource declaration visible for aco::IDSet 895- aco: use aco::monotonic_allocator for IDSet 896- spirv: make gl_HelperInvocation volatile if demote is being used 897- radv: emit discard as demote by default 898- nir: introduce discard_is_demote compiler option 899- nir/opt_peephole_select: handle nir_terminate{_if} 900- nir: remove nir_intrinsic_discard 901- zink: pass zink_screen to nir_to_spirv(). 902- nir/shader_info: remove uses_demote 903- spirv: workaround for tests assuming that OpKill terminates invocations or loops 904- aco/scheduler: fix register_demand validation debug code 905- aco/spill: Unconditionally add 2 SGPRs to live-in demand 906- aco: calculate register demand per instruction as maximum necessary to execute the instruction 907- aco: track and use the live-in register demand per basic block 908- aco: remove get_demand_before() 909- aco/live_var_analysis: slightly refactor handling of additional register demand for Operand copies 910- aco/live_var_analysis: ignore dead phis 911- aco/spill: don't remove spilled phis 912- aco/ra: use live_in_demand in should_compact_linear_vgprs() 913- aco: add RegisterDemand member to Instruction 914- aco/util: skip empty blocks in IDSet::insert(IDSet) 915- aco/live_var_analysis: refactor using ctx struct 916- aco/live_var_analysis: ignore phi definition and operand demand at predecessors 917- aco/live_var_analysis: inline block->register_demand updates 918- aco/live_var_analysis: remove unused includes 919- aco/live_var_analysis: use separate allocator for temporary live sets 920- aco/ra: remove special-casing of p_logical_end 921- nir: implement loop invariant code motion (LICM) pass 922- radv: use NIR loop invariant code motion pass 923- nir/opt_sink: ignore loops without backedge 924- aco: compute live-in variables in addition to live-out variables 925- aco/ra: use live-in variables directly rather than computing them 926- aco/spill: use live-in variables directly rather than computing them 927- aco/cssa: use live-in variables instead of live-out variables 928- aco/validate: use live-in variables for RA validation 929- aco/print_ir: print live-in instead of live-out variables 930- aco: remove live-out variables from IR 931- aco/spill: Don't add phi definitions to live-in variables 932- util/disk_cache: enable Mesa-DB disk cache by default 933 934Daniel Stone (27): 935 936- Revert "ci: disable g52" 937- gbm: Support fixed-rate compression allocation 938- venus/ci: Fix timeout 939- venus/ci: Significantly reduce CTS fraction 940- venus/ci: Temporarily disable jobs 941- dri: Fix BGR format exclusion 942- egl/surfaceless: Enable RGBA configs 943- egl/gbm: Enable RGBA configs 944- egl/dri2: Use createImageFromNames for DRM buffers 945- dri: Remove old createImageWithModifiers 946- dri: Remove createImageFromFds 947- dri: Stop answering DRI_IMAGE_ATTRIB_FORMAT 948- gallium/dri: Delete unused helper function 949- gallium/dri: Drop mesa_format indirection for lookup 950- loader/dri3: Use FourCC for create-image entrypoints 951- egl/x11: Update to createImageFromNames 952- dri: Delete createImageFromName 953- dri: Unify createImage and createImageWithModifiers 954- egl/x11: Remove __DRI_IMAGE_FORMAT remnants 955- loader/dri3: Use FourCC for buffer allocations 956- u_format: Rewrite format table to use YAML 957- format: Generate endian-independent format aliases 958- format: Generate sRGB<->linear conversions from table 959- u_format: Reword introduction 960- build: Check for PyYAML in Meson build 961- dri: Allow INVALID for modifier-less drivers 962- gbm/dri: Remove erroneous assert 963 964Danylo Piliaiev (38): 965 966- tu: Handle non-overlapping WaW hazard with buffer copy/fill/update 967- tu/a7xx: Don't set FLUSH_PER_OVERLAP_AND_OVERWRITE for feedback loops 968- tu/a750: Disable HW binning when there is GS 969- freedreno/devices: Add support for Adreno A32 (G3x Gen 2) 970- util/u_trace: Allow mixing of ArgStruct and Arg 971- tu: Add more info to renderpass tracepoint 972- vulkan/wsi: Make current_frame usable in all cases 973- util/u_trace: Pass explicit frame_nr argument to delimit frames 974- tu: Use current_frame from vk device to delimit u_trace frames 975- anv: Use current_frame from vk device to delimit u_trace frames 976- freedreno: Make fd_pps_driver.h usable without including other FD sources 977- turnip/msm: Do rd dump only when there are commands in submission 978- turnip: Implement VK_EXT_depth_clamp_zero_one 979- freedreno/a7xx: Update TPL1_DBG_ECO_CNTL1 to fix UBWC corruption 980- ir3/a7xx: Fix FS consts corruption when other FS has zero constlen 981- tu: Add LRZ disable reason to renderpass tracepoint 982- util/u_trace: Add support for fixed-length string params in tracepoints 983- tu: Add attachments' UBWC info to renderpass tracepoint 984- freedreno/rddecompiler: Make possible to use original shader 985- freedreno/replay: Fix replaying without SET_IOVA 986- freedreno/ir3: mova has special meaning for (r) flag 987- ir3: Correctly assemble mova1 with (r) on const 988- tu: Fix issues with render_pass tracepoint 989- freedreno: Rename TPL1_DBG_ECO_CNTL1.UBWC_WORKAROUND into TP_UBWC_FLAG_HINT 990- tu: Add enable_tp_ubwc_flag_hint feature to a7xx 991- freedreno/devices: Turn off enable_tp_ubwc_flag_hint for a740 by default 992- freedreno/devices: Fix magic regs for Adreno A32 993- freedreno: Describe LRZ feedback mechanism 994- freedreno/devices: Define and appropriately set has_lrz_feedback 995- tu: Use LRZ feedback in gmem 996- tu: Enable LRZ feedback in sysmem 997- freedreno: Use LRZ feedback in gmem 998- ir3: Print bindless samp/tex ids for tex prefetch 999- ir3/tests: Make possible to specify raw instr value as uint64 1000- ir3/tests: Make possible to add generated disasm tests 1001- ir3: Fix decoding of stib.b/ldib.b with offset 1002- turnip/kgsl: Support external memory via ION/DMABUF buffers 1003- tu: Have single Flush/Invalidate memory entrypoints 1004 1005Dave Airlie (27): 1006 1007- radv/video/encode: fix quality params on v2 hw. 1008- Revert "zink: use a slab allocator for zink_kopper_present_info" 1009- nvk: Only enable WSI modifiers if the extension is supported. 1010- draw/texture: handle mip_offset[0] being != 0 for layered textures. 1011- nouveau/nvc0: increase overallocation on shader bo to 2K 1012- nvidia: fixup classes import and import new classes. 1013- nouveau/push: add support for m2mf/i2mf to dumper 1014- nouveau/nvc0: add support for using common pushbuf dumper 1015- radv/video: fix layered decode h264/5 tests. 1016- radv/video: use vcn ip versions for encoder detection. 1017- ac/radv/radeonsi: move av1 ctx/probs size/filling to common code. 1018- ac/radv/radeon: move film grain init to common code. 1019- st/mesa: drop u_simple_shaders.h include where not used. 1020- gallivm: create a pass manager wrapper. 1021- gallivm: move ppc denorm disable to inline 1022- gallivm: split some code out from init module. 1023- gallivm: make lp_bld_coro.h c++ include safe. 1024- gallivm: export target init code for orc-jit to reuse 1025- gallivm: split out generating LLVM Mattrs 1026- llvmpipe: Introduce llvmpipe_memory_allocation 1027- nvk: use 2k overallocation for shader heap. 1028- anv/video: use correct offset for MPR row store scratch buffer. 1029- radv/video: advertise mutable/extended for dst video images. 1030- draw/orcjit: supply stub function for tcs coro 1031- llvmpipe/cs/orcjit: add stub function name for coro 1032- gallivm/sample: fix sampling indirect from vertex shaders 1033- nvc0: fix null ptr deref on fermi due to debug changes. 1034 1035David (Ming Qiang) Wu (2): 1036 1037- radeonsi/vcn: set accurate size for dec header and index_codec 1038- radeonsi/vcn: support DPB_MAX_RES on VCN5 1039 1040David Heidelberg (58): 1041 1042- turnip: rename tu_queue_submit struct to follow ODR 1043- ci: fail pipeline for users who got access to restricted traces 1044- ci/traces: majanes has no longer access to the restricted traces 1045- ci/deqp: correct EGL_EXT_config_select_group detection 1046- egl/x11: Move RGBA visuals in the second config selection group 1047- mailmap: add Freya Gentz entry 1048- etnaviv: migrate from piglit include to generic deqp and toml spec 1049- freedreno/ci: move platform to the deqp toml file for a530 1050- freedreno/ci: move the disabled jobs from include to the main file 1051- freedreno/ci: Switch a306_* to deqp-runner 1052- freedreno/ci: do not depend on single job rules for another jobs 1053- freedreno/ci: switch a306 to weston 1054- freedreno/ci: re-enable a306_piglit 1055- ci/panfrost: disable G52 until machines gets fixed 1056- ci: drop unused piglit-test and integrate it into piglit-traces-test 1057- freedreno/ci: Drop duplicated include and add missing stages 1058- freedreno/ci: Implement nightly piglit job for Adreno 630 and 618 1059- ci/freedreno: update expectations from the nightly run 1060- ci: bump ANGLE 1061- ci: Revert "ci: update failures list with angle for jsl, tgl" 1062- ci/intel: add new jsl flake 1063- ci/panfrost: Revert "ci/panfrost: disable G52 until machines gets fixed" 1064- ci/alpine: re-enable Mold linker 1065- ci/etnaviv: add flakes from nightly runs 1066- winsys/i915: depends on intel_wa.h 1067- subprojects: uprev perfetto to v45.0 1068- ci/r300: update flake list from nightly reports 1069- ci/nouveau: move disabled jobs back from include into main gitlab-ci.yml 1070- ci/nouveau: separate HW definition from SW 1071- ci/nouveau: adjust and add DEVICE_TYPE 1072- ci/freedreno: a3xx will never have Vulkan support 1073- docs: correct svga3d redirected URLs 1074- ci/radv: dEQP-GLES3.functional.polygon_offset.fixed16_render_with_units passes now 1075- ci: re-enable shader-db for nouveau 1076- ci: do not build Nine in debian-build-testing 1077- ci/piglit: be explicit about what we building 1078- ci/lava: enable Piglit OpenCL tests so we can test rusticl on the HW 1079- ci/lava: do not build Vulkan for armhf images 1080- ci/lava: move wayland-protocols to the main section 1081- ci/freedreno: document new failure after piglit update 1082- ci/etnaviv: skip Vulkan tests on GC2000 1083- ci/etnaviv: remove duplicated line from skips 1084- mailmap: update my email 1085- ci/arm64: rustify the build 1086- ci/lava: add support for RustiCL 1087- ci/meson: reuse meson installation 1088- ci: move (c)bindgen to own shell script 1089- ci/radv: Document recent flake 1090- ci/lava: the containers take sometimes more than 60m 1091- ci: propagate RUSTICL_ENABLE and DEBUG variables to the DUTs 1092- rusticl: add -cl-std only when it's not defined 1093- ci/freedreno: some A306 tests now pass/skip since proper GL detection in Piglit 1094- ci: introduce tool for comparing nightly runs 1095- util: bump blake3 from 1.3.3 to 1.5.1, improve armv7 and aarch64 performance 1096- build: pass licensing information in SPDX form 1097- intel/debug: allow silencing CL warnings 1098- llvmpipe: Silence "possibly uninitialized value" warning for ssbo_limit (cont) 1099- ci/alpine: use llvm variables 1100 1101David Rosca (39): 1102 1103- radv/video: Set correct bit depth and format for 10bit input 1104- radv/video: Check encode profiles and bit depth in capabilities query 1105- radv/video: Report maxBitrate in encode capabilities 1106- radeonsi/vcn: Allocate session buffer in VRAM 1107- radeonsi/vcn: Fix 10bit HEVC VPS general_profile_compatibility_flags 1108- radeonsi/vcn: Only enable VBAQ with rate control mode 1109- frontends/va: Fix AV1 slice_data_offset with multiple slice data buffers 1110- Revert "radeonsi/vcn: AV1 skip the redundant bs resize" 1111- frontends/va: Only increment slice offset after first slice parameters 1112- radeonsi: Update buffer for other planes in si_alloc_resource 1113- frontends/va: Store slice types for H264 decode 1114- radeonsi/vcn: Ensure DPB has as many buffers as references 1115- radeonsi/vcn: Allow duplicate buffers in DPB 1116- radeonsi/vcn: Ensure at least one reference for H264 P/B frames 1117- frontends/va: Fix leak when destroying VAEncCodedBufferType 1118- radeonsi/vcn: Avoid copy when resizing bitstream buffer 1119- frontends/va: Send all bitstream buffers to driver at once 1120- frontends/va: Fix crash in vaRenderPicture when decoder is NULL 1121- radv/video: Add missing VCN 3.0.2 to decoder init switch 1122- radeonsi: Make si_compute_clear_image work with 422 subsampled formats 1123- gallium/vl: Init shaders on first use 1124- frontends/va: Don't require exact match for packed headers 1125- gallium: Add is_video_target_buffer_supported 1126- radeonsi: Implement is_video_target_buffer_supported 1127- frontends/va: Use is_video_target_buffer_supported for EFC 1128- frontends/va: Rework EFC logic 1129- frontends/va: Check if target buffer is supported in vlVaEndPicture 1130- gallium: Remove PIPE_VIDEO_CAP_EFC_SUPPORTED 1131- frontends/va: Simplify AV1 slice parameters handling 1132- frontends/va: Move slice_data_offset to context 1133- frontends/va: Rename slice_idx to have_slice_params and move to context 1134- frontends/va: Support multi elements slice parameter buffers for H264/5 1135- gallium: Remove pipe_h264_picture_desc.slice_parameter.slice_count 1136- radeonsi/vcn: Limit size to target size in AV1 decode 1137- radeonsi: Add debug option to enable low latency encode 1138- radeonsi/vcn: Add low latency encode support 1139- frontends/va: Support frame rate per temporal layer for AV1 1140- radeonsi/vcn: Support 10bit RGB for EFC input 1141- radeonsi/vcn: Add decode DPB buffers as CS dependency 1142 1143Deborah Brouwer (1): 1144 1145- ci/lava: Detect a6xx gpu recovery failures 1146 1147Derek Foreman (13): 1148 1149- wsi/wayland: refactor wayland dispatch 1150- egl/wayland: Use loader_wayland_dispatch 1151- perfetto: Add flows 1152- wsi/wayland: Add perfetto flows to image acquisition and presentation 1153- wsi/wayland: Add flow id to presentation feedback 1154- wsi/wayland: Add timing debugging 1155- perfetto: Add simple support for counters 1156- wsi/wayland: Add latency information to perfetto profiling 1157- perfetto: Add some functions for timestamped events 1158- wsi/wayland: Add a perfetto track for image presentation 1159- wsi/wayland: Add tracepoint in wsi_wl_swapchain_wait_for_present 1160- wsi/wayland: Fix use after free from improperly stored VkAllocationCallbacks 1161- wsi/wayland: Use different queue names for different queries 1162 1163Dmitry Baryshkov (1): 1164 1165- freedreno/registers: drop display-related register files 1166 1167Dmitry Osipenko (1): 1168 1169- venus: make cross-device optional 1170 1171Doug Brown (1): 1172 1173- xa: add missing stride setup in renderer_draw_yuv 1174 1175Dr. David Alan Gilbert (1): 1176 1177- treewide: Cleanup unused structs 1178 1179Dylan Baker (12): 1180 1181- meson: use glslang --depfile argument when possible 1182- clc: remove check for null pointer that cannot be true in llvm_mod_to_spirv 1183- compiler/glcpp: don't recalculate macro 1184- intel/compiler: move predicated_break out of backend loop 1185- anv/grl: add some validation that we're not going to overflow 1186- egl/wayland: fix memory leak in error handling case 1187- compilers/clc: Add missing break statements. 1188- mesa: fix memory leak when using shader cache 1189- util/glsl2spirv: fixup the generated depfile when copying sources 1190- tgsi_to_nir: free disk cache value if the size is wrong 1191- crocus: properly free resources on BO allocation failure 1192- crocus: check for depth+stencil before creating resource 1193 1194Echo J (3): 1195 1196- nvk: Add sha1_h as a dependency 1197- d3d10umd: Use pipe_resource_usage enum in translate_resource_usage() 1198- util: Fix the integer addition in os_time_get_absolute_timeout() 1199 1200Eli Schwartz (2): 1201 1202- meson: create libglsl declared dependency to propagate order-only deps 1203- meson: add various generated header dependencies as order-only deps 1204 1205Emma Anholt (13): 1206 1207- nir,panfrost,agx: Fix driver PIXEL_COORD_INTEGER setting and drop workaround. 1208- dri: Fix a pasteo in dri2_from_names() 1209- dri: Consistently use createImageWithModifiers2() 1210- dri: Consistently use createImageFromFds2(), not createImageFromFds() 1211- dri: Replace createImageFromDmaBufs() with createImageFromDmaBufs3() 1212- dri: Drop old createImageFromRenderbuffer() 1213- dri: Consistently use createImageFromDmabufs() not createImageFromFds() 1214- dri: Drop createImageFromFds2() in favor of createImageFromDmaBufs() 1215- dri: Move EGL image lookup/validate setup to dri_init_screen() 1216- mesa: Drop some version checking around ValidateEGLImage 1217- dri: Collapse dri2_validate_egl_image() into dri_validate_egl_image() 1218- dri: Fold lookup_egl_image_validated into its one caller 1219- dri: Drop the old lookupEGLImage wrapper function. 1220 1221Eric Engestrom (295): 1222 1223- VERSION: bump to 20.2 1224- docs: reset new_features.txt 1225- docs: add release notes for 24.0.6 1226- docs: update calendar for 24.0.6 1227- docs: add an extra 24.0.x release 1228- docs: add sha256sum for 24.0.6 1229- docs: update calendar for 24.1.0-rc1 1230- ci: fix container rules on release branches and tags 1231- panvk/ci: add WSI testing to all the deqp-vk jobs 1232- lavapipe/ci: add WSI testing to all the deqp-vk jobs 1233- freedreno/ci: add flake 1234- lavapipe/ci: add flakes 1235- ci: pass MESA_VK_ABORT_ON_DEVICE_LOSS through to the DUT 1236- rpi3/ci: drop duplicate comment without any corresponding actual skip line 1237- v3dv/ci: skip all the WSI tests, they are way too flaky to be worth it 1238- spirv: deduplicate default debug log level 1239- v3dv/ci: add rpi5 failure 1240- ci: mark microsoft farm as offline 1241- meson: simplify \`-gsplit-dwarf` compiler argument check 1242- egl+glx: fix two #ifdef that should be #if like the rest 1243- meson: always set USE_LIBGLVND 1244- meson: use bool.to_int() instead of manually converting 1245- lavapipe/ci: drop fixed test from failures 1246- lavapipe/ci: add the rest of the failures introduced by the 1.3.8.2 uprev 1247- lavapipe/ci: skip another test that goes over the timeout 1248- meson: move tsan-blacklist.txt to build-support with the other build support files 1249- llvmpipe/ci: fix indentation 1250- llvmpipe/ci: only run jobs when their corresponding files are changed 1251- lavapipe/ci: fix indentation 1252- lavapipe/ci: avoid running all lavapipe jobs when llvmpipe ci is changed 1253- lavapipe/ci: only run jobs when their corresponding files are changed 1254- docs: update calendar for 24.1.0-rc2 1255- llvmpipe/ci: trigger jobs on draw & gallivm changes 1256- lavapipe/ci: trigger jobs on draw & gallivm changes 1257- lavapipe/ci: add flakes seen lately 1258- lavapipe/ci: generalize flakes list to all formats for these flaky tests 1259- lavapipe/ci: skip ray tracing tests that sometimes time out 1260- vc4/ci: add fails seen overnight 1261- ci: uprev mold to 2.31.0 1262- lavapipe/ci: skip two more timing out ray query tests 1263- ci: backport fix for gl_PointSize bug in CTS 1264- lavapipe/ci: move a few skips out from under the "llvm jit" comment 1265- mr-label-maker: fix yaml syntax 1266- docs: add release notes for 24.0.7 1267- docs: update calendar for 24.0.7 1268- docs: add sha256sum for 24.0.7 1269- docs: update calendar for 24.1.0-rc3 1270- ci/debian-build-testing: drop extra nesting section 1271- ci/shader-db: drop extra nesting section 1272- rpi4/ci: use deqp-runner suite for vk job as well 1273- rpi5/ci: use deqp-runner suite for vk job 1274- microsoft/clc: fix incorrect changes that got through while the Windows CI was down 1275- llvmpipe: wrap the push/pull in the ifdef as well 1276- radv/ci: add navi21 flakes 1277- zink: avoid designated initializers as they are not supported in C++ < 20 1278- Revert "ci: fail pipeline for users who got access to restricted traces" 1279- radeonsi/ci: document new crash (assert) 1280- util/format: add missing null check in util_format_is_srgb() 1281- ci: drop default VKD3D_PROTON_RESULTS file name 1282- ci: hardcode \`-vkd3d` namespace for VKD3D_PROTON_RESULTS 1283- amd/ci: track changes to VKD3D_PROTON_RESULTS files 1284- mr-label-maker: mark \*-vkd3d.txt files as CI results expectations files 1285- ci: reuse dead .vkd3d-proton-test to make vkd3d less radv-specific 1286- ci: fix section_end in debian-build-testing 1287- ci: rename debian version variable job to include the word "version" 1288- ci: factor out all the deps to build the debian containers into .debian-container 1289- ci: inherit the debian container building infra for test container images 1290- ci/b2c: rename B2C_TIMEOUT_FIRST_* to B2C_TIMEOUT_FIRST_CONSOLE_ACTIVITY_* 1291- ci/b2c: rename B2C_TIMEOUT_* to B2C_TIMEOUT_CONSOLE_ACTIVITY_* 1292- ci/b2c: allow setting timeouts in seconds 1293- ci: drop dead VK_CPU option 1294- ci/piglit-traces: drop re-definition of VK_DRIVER_FILES 1295- ci/init-stage2: set VK_DRIVER_FILES for both xorg and wayland 1296- ci/vkd3d: un-hardcode architecture 1297- ci/vkd3d: fix version sanity check 1298- ci/vkd3d: fail job when failing to get driver version 1299- ci/b2c: remove dead rules: that's always overwritten 1300- ci/env: move dead-code-with-comment to the end of the list to make it clearer 1301- zink/ci: rename .zink-lvp-venus-rules to .zink-venus-lvp-rules to match the rest of the names 1302- README: update links to our own docs 1303- docs: update calendar for 24.1.0-rc4 1304- mailmap: add entry to unify Roman Stratiienko's contributions 1305- nvk/ci: add nvk job on a GA106 (RTX 3060) 1306- zink/ci: add zink+nvk glcts+piglit job on a GA106 (RTX 3060) 1307- zink+nvk/ci: skip glx piglit tests as they all fail 1308- zink+nvk/ci: skip timing out test 1309- zink+nvk/ci: skip more tests that times out 1310- zink+nvk/ci: document flakes seen during stress-testing 1311- zink+nvk/ci: update expected failures 1312- docs: add release notes for 24.0.8 1313- docs: update calendar for 24.0.8 1314- docs: add sha256sum for 24.0.8 1315- docs: add release notes for 24.1.0 1316- docs: add sha256sum for 24.1.0 1317- docs: update calendar for 24.1.0 1318- ci: fix build-kernel.sh -> download-prebuilt-kernel.sh 1319- ci: drop dead variables (see previous commit) 1320- ci: rename debian/arm*_test to debian/baremetal_arm*_test to be clear about which infra uses that 1321- ci: prepare base debian test image for multi-arch 1322- ci: prepare GL debian test image for multi-arch 1323- ci: prepare VK debian test image for multi-arch 1324- ci/image-tags: rename DEBIAN_X86_64_TEST_*_TAG to drop the x86 mention 1325- ci: add debian/arm64_test images for gl & vk 1326- zink/ci: rename zink-turnip collabora rule to make it unambiguous 1327- ci/b2c: add aarch64 tests for gl & vk 1328- turnip/ci: add vkcts jobs on the a750 1329- turnip+zink/ci: add gl & gles CTS jobs on the a750 1330- nvk/ci: adjust the regex for "dut is broken and needs to be rebooted" 1331- nvk/ci: mark the job as failing in case of hangs, instead of silently rebooting 1332- nvk/ci: add missing .test rules to avoid running nvk tests in post-merge pipeline 1333- radv/ci: move amdgpu-specific kernel message warning to src/amd/ci/ 1334- ci/b2c: make B2C_JOB_WARN_REGEX optional 1335- zink+nvk/ci: more KHR-GL46.packed_pixels.varied_rectangle.* flakes, so mark the group as flaky 1336- zink+nvk/ci: add more flakes seen in nightly 1337- zink+nvk/ci: spec\@ext_external_objects\@vk-vert-buf-reuse has been fixed 1338- mr-label-maker: label src/vulkan/wsi/ as wsi 1339- .mailmap: fix email address for \@cpmichael 1340- v3dv/ci: fix typo in \`renderer_check` 1341- ci: disable debian-build-testing until it can be fixed 1342- vc4/ci: skip VK piglit tests 1343- freedreno/a6xx: fix kernel -> compute handling 1344- zink+nvk/ci: add flakes seen in latest nightly run 1345- docs/calendar: add 24.2 branchpoint and release candidates schedule 1346- panfrost/ci: add missing genxml trigger path 1347- panfrost: mark tests as fixed 1348- etnaviv/ci: skip VK piglit tests 1349- radv/ci: document angle regressions from !29436 on stoney 1350- zink+nvk/ci: add flakes seen in latest nightly run 1351- docs/meson: replace deprecated pkgconfig with pkg-config 1352- zink+nvk/ci: add flakes seen in latest nightly run 1353- v3dv: add missing bounds check in VK_EXT_4444_formats 1354- docs: add release notes for 24.1.1 1355- docs: add sha256sum for 24.1.1 1356- docs: update calendar for 24.1.1 1357- turnip/ci: add a750 flakes seen in the latest nightly 1358- radv/ci: fix manual rules 1359- radv/ci: move radv manual rules into their own group 1360- nvk+zink/ci: add another flake seen in nightly 1361- docs: add release notes for 24.0.9 1362- docs: update calendar for 24.0.9 1363- docs: add sha256sum for 24.0.9 1364- venus/ci: add flake that's been blocking MRs 1365- v3d/drm-shim: emulate a rpi4 instead of a rpi3 1366- nvk+zink/ci: add another flake seen in nightly 1367- radv/ci: document navi31 regression from !29235 1368- ci: set a common B2C_JOB_SUCCESS_REGEX with the message that's printed for all jobs 1369- ci/deqp: uprev gl & gles cts 1370- radeonsi/ci: mark a bunch of tests as fixed on vangogh 1371- radv/ci: drop duplicate navi21-aco flakes line 1372- radv/ci: drop duplicate navi31-aco flakes line 1373- turnip+zink/ci: mark a dEQP-GLES(2|3).functional.rasterization.(fbo|primitives).line_(strip_|)wide as fixed 1374- turnip/ci: add a750 flakes seen in the latest nightly 1375- panfrost/ci: remove duplicate path 1376- nvk+zink/ci: mark KHR-GL46.sparse_texture2_tests.SparseTexture2* as fixed 1377- nvk+zink/ci: add flakes seen in nightly pipeline 1378- nvk+zink/ci: consider all the \`double` tests in spec\@glsl-4.00\@execution\@built-in-functions to be flaky 1379- freedreno/ci: disable mid-testing reboot on a750 1380- driconf: drop param for setting default gpu vendor id in DRI_CONF_FORCE_VK_VENDOR() 1381- egl: fix teardown when using xcb 1382- egl: move android-specific code into an android branch 1383- egl: ensure future platforms get their teardown implemented 1384- egl/device: drop unnecessary intermediate variable 1385- ci: fix meson install script 1386- lavapipe/ci: update trace checksum following nir change 1387- lavapipe/ci: document regression while it's being worked on 1388- turnip+zink/ci: mark dEQP-GLES3.functional.fbo.depth.depth_test_clamp.* tests as fixed 1389- bin/ci: escape literal url in regex 1390- glx: fix build -D glx-direct=false 1391- nvk+zink/ci: mark spec\@ext_image_dma_buf_import\@ext_image_dma_buf_import-refcount-multithread as fixed 1392- nvk+zink/ci: add flakes seen over the last few nightlies 1393- asahi/lib: generate git_sha1.h for agx_device.c 1394- ci/vkd3d: deduplicate the diff between the expectation and the results 1395- ci/vkd3d: print a message when the expected failures file is missing 1396- ci/vkd3d: drop override of job artifacts 1397- ci/vkd3d: fix error message printing 1398- ci/vkd3d: stop ignoring errors in a block where errors can't happen 1399- ci/vkd3d: don't ignore errors 1400- ci/vkd3d: group version check lines together 1401- ci/vkd3d: limit the vulkaninfo capture to the driverInfo line 1402- ci/vkd3d: print a real error message when failing to get the list of failing tests 1403- ci/vkd3d: rename vkd3d test log file to end in .txt 1404- ci/vkd3d: print URL to the vkd3d-proton.log file to make it easier to access 1405- ci/vkd3d: put \`then` on the same line as the \`if` to match the rest of the code style 1406- ci/vkd3d: drop the "clear results folder without deleting the folder" logic 1407- ci/vkd3d: drop \`quiet` wrapper 1408- ci/vkd3d: drop redundant "vkd3d-proton execution: SUCCESS" 1409- docs: add release notes for 24.1.2 1410- docs: add sha256sum for 24.1.2 1411- docs: update calendar for 24.1.2 1412- venus/ci: fix indentation of list nested in a dict item 1413- venus/ci: add manual/nightly venus-lavapipe-full 1414- venus/ci: skip timed out test 1415- nvk+zink/ci: add flakes seen over the last two nightly runs 1416- nvk+zink/ci: catch more \`double` flakes 1417- venus+zink/ci: drop fraction and add missing timeout on zink-venus-lvp 1418- loader: use os_get_option() to allow android to set LIBGL_DRIVERS_PATH, GBM_BACKENDS_PATH, GALLIUM_PIPE_SEARCH_DIR 1419- gallium/hud: use os_get_option() to allow android to set GALLIUM_HUD and related vars 1420- egl: use os_get_option() to allow android to set EGL_LOG_LEVEL 1421- venus/ci: make sure nightly job doesn't get retried 1422- venus/ci: drop fixed test from fails list 1423- docs/ci: fix indentation of list nested in a dict item 1424- docs/ci: merge test-docs and test-docs-mr 1425- docs/ci: auto-run test-docs in fork pipelines 1426- docs/ci: drop .no_scheduled_pipelines-rules from test-docs 1427- ci: reorder alpine/x86_64_build rules to fix the nightly pipelines 1428- drm-shim: stub syncobj_timeline_signal ioctl 1429- llvmpipe/ci: add comment for later on weird-looking code 1430- llvmpipe/ci: fix indentation of list nested in a dict item 1431- llvmpipe/ci: set rusticl variables in deqp-runner instead of passing them down from the job 1432- ci: include rusticl in the arm64 build 1433- llvmpipe,rusticl/ci: move rusticl files rule out of llvmpipe 1434- v3d/ci: add nightly job for rusticl testing 1435- panfrost/ci: drop duplicate job rules 1436- panfrost/ci: split gl & vk jobs rules 1437- radeonsi/ci: mark test as fixed 1438- lavapipe/ci: skip timing out test 1439- broadcom/ci: disable auto-retry on manual jobs 1440- docs/features: mark VK_KHR_maintenance7 as implemented on anv and lvp 1441- docs: add release notes for 24.1.3 1442- docs: update calendar for 24.1.3 1443- docs: add sha256sum for 24.1.3 1444- ci_run_n_monitor: add support for new \`canceling` job status 1445- ci_run_n_monitor: be coherent about using sets for \`element in group` checks 1446- ci_run_n_monitor: use COMPLETED_STATUSES in more places 1447- ci_run_n_monitor: add RUNNING_STATUSES and use it where appropriate 1448- bin/ci: allow bugfixes in requirements.txt 1449- ci: split .no-auto-retry out of .scheduled_pipeline-rules 1450- ci: simplify setting .no-auto-retry now that it isn't bundled with unrelated rules: 1451- v3d/ci: include results of GL full run in expectations 1452- v3d/ci: include results of CL run in expectations 1453- zink+nvk/ci: ascii-sort fails 1454- zink+nvk/ci: document regression from !30033 1455- turnip+zink/ci: add two more CS related flakes 1456- lvp+zink/ci: document a flake seen in a merge pipeline 1457- v3d/ci: add disabled job for GL testing on the RPi5 1458- v3d/ci: rename "rusticl on v3d" suite to \`v3d-rusticl` 1459- v3d/ci: add disabled job for CL testing on the RPi5 1460- features.txt: specify that VK_EXT_depth_clamp_zero_one is only supported on v3dv/vc7+ 1461- features.txt: specify that VK_EXT_depth_clip_enable is only supported on v3dv/vc7+ 1462- features.txt: specify that GL_ARB_depth_clamp is only supported on v3d/vc7+ 1463- docs: add release notes for 24.1.4 1464- docs: update calendar for 24.1.4 1465- docs: add sha256sum for 24.1.4 1466- ci: replace gallium-drivers=swrast with gallium-drivers=llvmpipe,softpipe 1467- bin/ci_run_n_monitor: explain that the 'Universal Recycling symbol' ♲ emoji means these jobs were cancelled 1468- bin/ci_run_n_monitor: add text labels next to the emojis 1469- bin/ci_run_n_monitor: replace ♲ with to represent cancelled jobs 1470- meson: fix filename printed when generating devenv files 1471- meson/megadriver: fix install message to match the rest of meson 1472- meson/megadriver: stop removing the "master" .so file 1473- meson/megadriver: replace hardlinks with symlinks 1474- ci/vkd3d: fix LD_LIBRARY_PATH 1475- v3d/ci: mark spec\@amd_performance_monitor\@vc4 tests as fixed 1476- llvmpipe/ci: mark spec\@!opengl 1.1\@gl_select tests as fixed 1477- Revert "bin/ci_run_n_monitor: explain that the 'Universal Recycling symbol' ♲ emoji means these jobs were cancelled" 1478- VERSION: bump for 24.2.0-rc1 1479- .pick_status.json: Update to 0cc23b652401600e57c278d8f6fe6756b13b9f6a 1480- radeonsi/ci: skip timing out test 1481- freedreno/ci: double job timeout for a306 1482- freedreno/ci: document extra variants of failing tests on a618 and a630 1483- anv+zink/ci: mark some tests as fixed 1484- anv+zink/ci: document two tests, one failing and one crashing 1485- anv+zink/ci: mark a couple of tests as flaky 1486- venus/ci: skip timing out test 1487- loader: gc loader_get_extensions_name() and __DRI_DRIVER_{GET\_,}EXTENSIONS defines 1488- .pick_status.json: Update to 3b6867f53a6718de80bbff4acb84ffd5aca8a8c8 1489- nak: fix meson typo 1490- venus: initialize bitset in CreateDescriptorPool() 1491- v3d/ci: mark spec\@amd_performance_monitor\@vc4 tests as flaky 1492- meson: xcb & xcb-randr are needed by the loader whenever x11 is built 1493- .pick_status.json: Update to c30e5d44b1027ed03a8fd542829df0055d3e1a96 1494- .pick_status.json: Update to 6cd4372460b197fea98d257217328ddc3406e6ad 1495- docs: add stub header for u_format_gen.h 1496- .pick_status.json: Update to c33d2db06ac0ea4d3d5372caa93bee3bbbe028c7 1497- VERSION: bump for 24.2.0-rc2 1498- .pick_status.json: Update to ad90bf0500e07b1bc35f87a406f284c0a7fa7049 1499- ci/baremetal: fix logic for retrying boot when it failed 1500- meson: don't select the deprecated \`swrast` option ourselves 1501- meson: improve wording of "incompatible llvm options" error 1502- ci: remove llvmpipe in the job that disables llvm 1503- .pick_status.json: Update to aa9745427b917bb0613b753ccd59c6c1e6f07584 1504- VERSION: bump for 24.2.0-rc3 1505- .pick_status.json: Update to 366e7e2ddc7d3b340bbf040eca1d3223219e6122 1506- meson,ci: remove dead \`kmsro` option in \`gallium-drivers` 1507- .pick_status.json: Mark 93f9afa1e039cbf681adcc6d170aec987d9f0f65 as denominated 1508- .pick_status.json: Mark f427c9fe233e862bfa30d0c7441ce77592ce4654 as denominated 1509- .pick_status.json: Update to d58f7a24d1be7b8b50ebdc0c1c3ce26bd65317a5 1510- .pick_status.json: Update to d9849ac46623797a9f56fb9d46dc52460ac477de 1511- .pick_status.json: Update to ef88af846761ca9e642f7ed46011db7d3d6b61fd 1512- VERSION: bump for 24.2.0-rc4 1513- .pick_status.json: Update to c90e2bccf756004e48f9e7e71e555db0d03c1b98 1514- ci: pass MESA_SPIRV_LOG_LEVEL from job to the test 1515- android: fix build in multiple ways 1516- .pick_status.json: Update to 214b6c30406f844560bdf35a54ff8a51ee248709 1517- .pick_status.json: Update to cc2dbb8ea5329b509d79eedb6c0cbb9a1903b5ad 1518 1519Eric R. Smith (8): 1520 1521- panfrost: add a barrier when launching xfb jobs in CSF 1522- get_color_read_type: make sure format/type combo is legal for gles 1523- glsl: test both inputs when sorting varyings for xfb 1524- glsl: make the xfb varying sort stable 1525- panfrost: fix some omissions in valhall flow control 1526- panfrost: change default rounding mode for samplers 1527- panfrost: fix texture.border_clamp regression for valhall 1528- panfrost: use RGB1 component ordering for R5G6B5 pixel formats 1529 1530Erico Nunes (6): 1531 1532- ci: lima farm maintenance 1533- lima/ci: update piglit ci expectations 1534- Revert "ci: lima farm maintenance" 1535- lima: fix surface reload flags assignment 1536- mesa/st: don't set lower_fdot in draw_nir_options 1537- dri: fix sun4i-drm driver name 1538 1539Erik Faye-Lund (106): 1540 1541- panfrost: add PAN_MAX_TEXEL_BUFFER_ELEMENTS define 1542- panfrost: clamp buffer-size to max-size 1543- panfrost: remove nonsensical assert 1544- panfrost: do not deref potentially null pointer 1545- panfrost: check return-value from u_trim_pipe_prim 1546- panfrost: assert that drmSyncobjWait returns 0 1547- panfrost: check return-code of drmSyncobjWait 1548- panfrost: correct first-tracking for signature 1549- panvk: drop needless null-check 1550- panvk: do not leak bindings 1551- panvk: drop needless null-checks 1552- panvk: avoid dereferencing a null-pointer 1553- docs/panfrost: compact gpu-table 1554- docs/panfrost: move details to separate articles 1555- docs/panfrost: link to conformant products 1556- panfrost: simplify panfrost_texture_num_elements 1557- panfrost: explicitly loop over surfaces 1558- panfrost: untangle faces from layers 1559- util/format: correct a typo 1560- mesa/main: rewrite mipmap generation code 1561- mesa/main: remove unused function 1562- mesa/main: rework GL_IMAGE_PIXEL_TYPE query 1563- mesa/main: clean up _mesa_uncompressed_format_to_type_and_comps 1564- mesa/main: clean up switch statement 1565- mesa/main: do not return _REV format for uncompressed format 1566- mesa/main: prefer non-suffixed enums 1567- mesa/main: fixup indent 1568- mesa/main: updates for EXT_texture_format_BGRA8888 1569- docs: wrap long words instead of overflowing 1570- meson: bump test-timeout 1571- mesa/main: remove unused function 1572- panfrost: lower maxVertexInputStride to match vulkan runtime 1573- mesa/main: remove stale prototype 1574- mesa/main: remove duplicate error-checks 1575- mesa/main: require EXT_texture_integer for GL 3.0 1576- mesa/main: do not allow RGBA_INTEGER et al in gles3 1577- mesa/main: factor out format/type enum checking 1578- mesa/main: use extension-helper 1579- mesa/main: tighten rg/half-float interaction 1580- mesa/main: use _mesa_is_gles1()-helper 1581- mesa/main: remove needless check 1582- mesa/main: simplify conditions 1583- mesa/main: merge identical checks 1584- panvk: move macro-definition to header 1585- mailmap: invert tomeu's mapping 1586- mailmap: merge Robert and Bob Beckett into one 1587- mailmap: invert my mailmapping 1588- mailmap: map collabora.co.uk to collabora.com 1589- mailmap: move konstantin to the right sorted position 1590- mailmap: use consistent spelling for constantine 1591- mailmap: update rohan's primary email address 1592- nir: fix utf-8 encoding-issue 1593- Revert "docs: use html_static_path for static files" 1594- docs: edgeflag -> edge flag 1595- docs: zink -> Zink 1596- docs: Anv -> ANV 1597- docs: tgsi -> TGSI 1598- docs: hw -> HW 1599- docs: mooth -> smooth 1600- docs: unify spelling of front/back-facing 1601- docs: eg. -> e.g. 1602- docs: url -> URL 1603- docs: nabled -> enabled 1604- docs: sommelier -> Sommelier 1605- docs: remove apostrophe from uppercased 1606- docs: utrace -> trace 1607- docs: google -> Google 1608- docs: Nvidia -> NVIDIA 1609- docs: ssbo/ubo -> SSBO/UBO 1610- docs: cpu -> CPU 1611- docs: gpu -> GPU 1612- docs: renderpass -> render pass 1613- docs: spell out "stencil reference" 1614- docs: submision -> submission 1615- docs: Steamos -> SteamOS 1616- docs: colour -> color 1617- docs: occured -> occurred 1618- docs: precidence -> precedence 1619- docs: undifined behaviour -> undefined behavior 1620- docs: debian -> Debian 1621- docs: zink -> Zink 1622- docs: vulkan -> Vulkan 1623- docs: attachements -> attachments 1624- docs: acress -> across 1625- docs: pluggins -> plug-ins 1626- docs: pusbuf -> pushbuf 1627- docs: metadatas -> metadata 1628- docs: use os.pardir 1629- docs: allow out-of-tree docs build 1630- meson: build html-docs 1631- docs: automatically generate depfile 1632- meson: error when missing hawkmoth 1633- meson: allow specifying html-docs-path 1634- ci: build docs using meson 1635- panvk: support x11 wsi 1636- vulkan/runtime: tne -> the 1637- vulkan/runtime: initizlie -> initialize 1638- vulkan/runtime: abreviation -> abbreviation 1639- vulkan/runtime: multiesample -> multisample 1640- vulkan/runtime: implementaiton -> implementation 1641- docs: fix bootstrap-extension 1642- docs/panfrost: fix numbered list 1643- docs/panfrost: fix math-notation 1644- docs/panfrost: use math-role more 1645- docs/panfrost: use c:func-role for function 1646- docs/panfrost: quote identifiers 1647 1648Esdras Tarsis (1): 1649 1650- nvk: Enable 8bit and 16bit access in VK_KHR_workgroup_memory_explicit_layout. 1651 1652Faith Ekstrand (297): 1653 1654- nak: Don't saturate depth writes 1655- nvk: Only clip Z with the guardband 1656- nouveau/class_parser.py: Fix the docs for --out-rs 1657- nvk: Advertise VK_EXT_pipeline_robustness 1658- nouveau/headers: Clean up the meson a bit 1659- spirv: Auto-generate spirv_info.h 1660- spirv: Update the JSON and headers 1661- spirv: Better handle duplicated enums in the JSON parser 1662- spirv: Generate a spirv_capabilities struct 1663- spirv: Record capabilities rather than ad-hoc bools 1664- mesa: Stop pretending to support SPV_AMD_gcn_shader in OpenGL 1665- spirv: Move the old AMD extensions out of capabilities 1666- spirv: Move the printf enable out of capabilities 1667- spirv: Add supported_capabilities to vtn_builder 1668- spirv: Use supported_capabilities for various checks 1669- spirv: Drop the SubgroupUniformControlFlow check 1670- spirv: Add a table of all implemented capabilities 1671- spirv: Check capabilities using the supported_capabilities table 1672- spirv: Add support for specifying caps through the new struct 1673- spirv: Use spirv_capabilities in tests 1674- mesa: Flip the script on SPIR-V extension enabling 1675- mesa: Use the new spirv_capabilities struct 1676- clover: Use the new spirv_capabilities struct 1677- rusticl: Use the new spirv_capabilities struct 1678- vulkan: Set SPIR-V caps from supported features 1679- radv: Use vk_physical_device_get_spirv_capabilities() 1680- intel/kernel: Use the new capabilities struct 1681- asahi/clc: Use the new spirv_capabilities struct 1682- zink: Use the new spirv_capabilities struct 1683- anv: Use spirv_capabilities for the float64 shader 1684- ir3: Use spirv_capabilities in ir3_cmdline 1685- microsoft: Use spirv_capabilities for spirv_to_dxil 1686- spirv: Get rid of the old caps struct 1687- nvk: Re-emit sample locations when rasterization samples changes 1688- nvk/meta: Restore set_sizes[0] 1689- nvk: Get rid of sets_dirty 1690- nvk: Don't rely on push_dirty for which push sets exist 1691- nouveau/headers: Add a bool for whether or not to dump offsets 1692- nvk/upload_queue: Only upload one line of data 1693- nvk/upload_queue: Add some useful asserts 1694- nvk/upload_queue: Add a _fill method 1695- nvk: Use the upload queue for NVK_DEBUG=zero_memory 1696- nvk: Improve the GetMemoryFdKHR error 1697- nouveau/winsys: Take a reference to BOs found in the cache 1698- nouveau/winsys: Make BO_LOCAL and BO_GART separate flags 1699- nvk: Allow GART for dma-bufs 1700- nil: Use the right PTE kind for Z32 pre-Turing 1701- nvk: Set color/Z compression based on nil_image::compressed 1702- nil: Default to NV_MMU_PTE_KIND_GENERIC_MEMORY on Turing+ 1703- nvk: Allow VK_IMAGE_ASPECT_MEMORY_PLANE_0_BIT 1704- drm-uapi: Sync nouveau_drm.h 1705- nouveau/winsys: Add back nouveau_ws_bo_new_tiled() 1706- nvk: Support image creation with modifiers 1707- nvk: Set tile mode and PTE kind on dedicated dma-buf BOs 1708- nvk: Implement DRM format modifier queries 1709- nvk: Advertise VK_EXT_queue_family_foreign 1710- nvk: Advertise VK_EXT_image_drm_format_modifier 1711- vulkan/wsi: Bind memory planes, not YCbCr planes. 1712- nvk/wsi: Advertise modifier support 1713- zink: Set workarounds.can_do_invalid_linear_modifier for NVK 1714- nvk: Fix misc. whitespace and style issues 1715- nvk: Go wide for query copies 1716- nvk: Store descriptor set addresses in descriptor state 1717- nvk: Add static asserts for nvk_buffer_address layout 1718- nvk: Store an nvk_buffer_address for each set in the root table. 1719- nvk: Advertise 32 descriptor sets 1720- nvk: Move and better document set_dynamic_buffer_start 1721- nvk: Add an NVK_MAX_SAMPLES #define 1722- nvk: Refactor nvk_meta_begin() to use a desc helper 1723- nvk/meta: Save and restore set_dynamic_buffer_start 1724- nak: Emit !PT for carries on IADD3 1725- nak: Add with -0 for fabs() 1726- nak: Don't emit a plop3 for immediate shift sources 1727- nak: Encode LDC directly 1728- vulkan: Update XML and headers to 1.3.286 1729- spirv: Update the JSON and headers 1730- nir: Handle cmat types in lower_variable_initializers 1731- spirv: Handle constant cooperative matrices in OpCompositeExtract 1732- spirv: Assert that non-vector composites have the right length 1733- spirv: Implement SPV_EXT_replicated_composites 1734- nvk: Advertise VK_EXT_shader_replicated_composites 1735- anv: Advertise VK_EXT_shader_replicated_composites 1736- hasvk: Advertise VK_EXT_shader_replicated_composites 1737- radv: Advertise VK_EXT_shader_replicated_composites 1738- turnip: Advertise VK_EXT_shader_replicated_composites 1739- lavapipe: Advertise VK_EXT_shader_replicated_composites 1740- dozen: Advertise VK_EXT_shader_replicated_composites 1741- nir/print: Improve divergence information 1742- nak: Fix NAK_DEBUG=serial for warp barriers 1743- nak: Only convert the written portion of the buffer in NirInstrPrinter 1744- nak: Fix BasicBlock::phi*() for OpAnnotate 1745- nak: BMov is always variable-latency 1746- nak: Only copy-prop neg into iadd2/3 if no carry is written 1747- nak: Get rid of OpINeg 1748- nak: Expose a BasicBlock::map_instrs() helper 1749- nak: Add some helpers for uniform instructions and registers 1750- nak: Add OpR2UR 1751- nak: Clean up bindless cbuf handles 1752- nak/ra: Move an assert 1753- nak: Make SSARef::file() return Option<RegFile> 1754- nak: Drop BasicBlock::new() 1755- nak: Add a concept of uniform blocks 1756- nak/to_cssa: Resolve phi register file mismatches 1757- nak/ra: Spill UGPRs and UPreds 1758- nak/ra: Never move uniform regs in non-uniform blocks 1759- nak: Support uniform regs in lower_copy_swap() 1760- nak/sm70: Defer ALU src processing until encode_alu() 1761- nak/sm70: Rework ALU source encode helpers 1762- nak/sm70: Add support for encoding uniform ALU ops 1763- nak/sm70: Fix encoding of fadd/fsetp and friends with UGPRs 1764- nak/sm70: Implement a bunch of uniform ops on SM75+ 1765- nak/legalize: Fold immediate sources before instructions 1766- nak/legalize: Drop some pointless plop3 logic 1767- nak/legalize: Be more precise about shfl and out 1768- nak/legalize: Fix imad and ffma legalization on SM50 1769- nak/legalize: Patch a RegFile through to copy helpers 1770- nak/legalize: Handle uniform sources in warp instructions 1771- nak/legalize: Ensure all SSA values for a given ref are in the same file 1772- nak/legalize: Copy uniform vectors in non-uniform control-flow 1773- nak/legalize: Uniform instructions can't have cbuf sources 1774- nak/legalize: Explicitly ignore OpPhiSrcs and OpPhiDsts 1775- nak/calc_instr_deps: Rename a couple variables 1776- nak/calc_instr_deps: Rewrite calc_delays() again 1777- nak/calc_instr_deps: Add latencies for uniform instructions 1778- nak: Add a opt_uniform_instrs() pass 1779- nak/copy_prop: Rewrap a couple comments 1780- nak/copy_prop: Don't propagate UBOs into uniform instructions 1781- nak/lower_cf: Parent scopes are never NULL 1782- nak/lower_cf: Track block divergence 1783- nak: Convert to LCSSA before divergence analysis 1784- nak/lower_cf: Flag phis as convergent when possible 1785- nak/from_nir: Clean up phi annotations 1786- nak: Add a UniformBuilder 1787- nak/from_nir: Emit uniform instructions when !divergent 1788- nak/sm70: Properly encode bindless cbufs 1789- nak/dce: Account for bindless CBuf handles 1790- nak/calc_instr_deps: Account for bindless CBufs 1791- nak/bitset: Add an iterator 1792- nak/ra: Handle bindless CBufs 1793- nak/ra: Pull searching for unused/unpinned regs into a helper 1794- nak/ra: Rename PinnedRegAllocator to VecRegAllocator 1795- nak/ra: Add a concept of pinned registers to RegAllocator 1796- nak: Add OpPin and OpUnpin 1797- nak/legalize: Allow pinned uniform vectors in non-uniform blocks 1798- nak/legalize: Bindless cbufs must be pinned in non-uniform blocks 1799- nak/copy_prop: Don't propagate bindless cbufs into non-uniform blocks 1800- nir: Add some new _nv intrinsics 1801- nvk,nak: Switch to nir_intrinsic_ldc_nv 1802- nak: Implement r2ur_nv 1803- nak: Implement [un]pin_cx_handle_nv 1804- nir: Add nir_foreach_block_in_cf_node_safe() iterators 1805- nak: Lower non-uniform ldcx_nv to global loads 1806- nak: Implement nir_intrinsic_ldcx_nv 1807- nvk: Split SSBO and UBO address formats 1808- nvk: Split write_[dynamic\_]buffer_desc into UBO and SSBO variants 1809- nvk: Align buffer descriptors 1810- nvk: Rename nvk_cmd_buffr_get_cbuf_descriptor() 1811- nvk: Make nvk_min_cbuf_alignment() inline 1812- nvk/lower_descriptors: Add a descriptor_type_is_ubo/ssbo() helper 1813- nvk: Move the zero offset optimization to load_descriptor_for_idx_intrin() 1814- nvk: Allow the cbuf optimization for VK_DESCRIPTOR_TYPE_MUTABLE_EXT 1815- nvk/descriptor_set_layout: Record which dynamic buffers are UBOs 1816- nvk: Use bindless cbufs on Turing+ 1817- nvk: Be much more conservative about rebinding cbufs 1818- nvk: Use cbuf loads for variable pointers dynamic SSBO descriptors 1819- nvk: s/draw_idx/draw_index/g 1820- nvk: Pass the base workgroup and global size to flush_compute_state() 1821- nvk: Use helper macros for accessing root descriptors 1822- nvk: Pass the queue to draw/dispatch_state_init() 1823- nvk: Use inline constant buffer updates for CB0 1824- nvk: Only write draw parameters to cb0 when they change 1825- nvk: Refactor build_cbuf_map() 1826- nak,nir: Drop r2ur_nv in favor of as_uniform 1827- nouveau: Fix a race in nouveau_ws_bo_destroy() 1828- nvk: Use NVK_VK_GRAPHICS_STAGE_BITS in dirty_cbufs_for_descriprots() 1829- nvk: Dirty cbufs in CmdPushDescriptorSetWithTemplate2KHR 1830- intel/blorp: Set nir_shader::options up-front before building 1831- util/format_pack: Fix packing of signed 1010102 SSCALED formats 1832- util/format_pack: Also use iround for SCALED formats 1833- util/format_pack: Clamp SNORM values to [-1, 1] when unpacking 1834- util/format: Round to nearest even when converting to R11G11B20F 1835- util/format: Handle denorms when converting to R11G11B10F 1836- nir/format_convert: Smash NaN to 0 in pack_r9g9b9e5() 1837- nir/format_convert: Use fmin/fmax to clamp R9G9B9E5 data 1838- nir: Add a nir_intrinsic_use for unit tests 1839- nir: Move most of nir_format_convert to a C file 1840- nir: Support 0 and 32 bits in some format conversion helpers 1841- util: Make format_srgb.h C++ safe 1842- nir: Add a format pack helper and tests 1843- nir: Add a format unpack helper and tests 1844- nir/format_convert: Assert that UNORM formats are <= 16 bits 1845- ci: Update trace SHAs 1846- vulkan/meta: Use demote instead of discard 1847- nvk: Fix whitespace issues around conservative rasterization 1848- nvk: Re-order conservative rasterization checks 1849- nvk: Don't emit conservative rasterization before Maxwell B 1850- nvk: Silently fail to enumerate if not on nouveau 1851- util/cnd_monotonic: Move the guts to a c file 1852- util/cnd_monotonic: Use a void * on Windows 1853- vulkan/wsi/wayland: Use mtx_t and u_cnd_monotonic 1854- vulkan/wsi/x11: Use c11/threads for thread spawning 1855- vulkan/wsi/x11: Use mtx_t and u_cnd_monotonic 1856- vulkan/wsi/display: Use mtx_t and u_cnd_monotonic 1857- vulkan/wsi/queue: Use mtx_t and u_cnd_monotonic 1858- vulkan/wsi: Delete wsi_init_pthread_cond_monotonic 1859- vulkan: Use u_cnd_monotonic for vk_sync_timeline 1860- nvk: Why are nvk_image.c/h writeable? 1861- nvk: Bump the sparse alignment requirement on buffers to 64K 1862- nvk: Align sparse-bound images to the sparse binding size 1863- nvk: Fetch debug flags from the physical device 1864- nvk: Initialize the debug flags in nvk_instance 1865- nvk: Add the start of a KMD abstraction 1866- nvk/nvkmd: Implement dev and pdev for nouveau 1867- nvk: Use the NVKMD interface for device enumeration 1868- nvk/nvkmd: Add memory and virtual address interfaces 1869- nvk/nvkmd: Implement the mem and va interfaces for nouveau 1870- nvk: Add static wrappers for image/buffer binding 1871- nvk: Use nvkmd_mem for nvk_device_memory 1872- nvk: Use nvkmd_mem for nvk_image::linear_tiled_shadow_mem 1873- nvk: Use nvkmd_mem for nvk_cmd_pool 1874- nvk: Use nvkmd_mem for nvk_descriptor_pool 1875- nvk: Use nvkmd_mem in nvk_upload_queue 1876- nvk: Use nvkmd_mem for descriptor tables 1877- nvk: Use nvkmd_mem for shader and event heaps 1878- nvk: Use nvkmd_mem for query pools 1879- nvk: Use an nvkmd_mem for the SLM area 1880- nvk: Drop extra_bos from nvk_queue_submit_simple() 1881- nvk: Use nvkmd_mem for the nvk_queue_state::push 1882- nvk: Use nvkmd_mem for the zero page, VAB, and CB0 1883- nvk/nvkmd: Add a context interface 1884- nvk/nvkmd: Implement nvkmd_ctx for nouveau 1885- nvk: Convert the upload queue to nvkmd_ctx 1886- nvk: Use an nvkmd_ctx for sparse binding 1887- nvk: Use nvkmd_ctx for queue submit 1888- nvk: Remove the last vestages of nouveau/winsys from core NVK 1889- nouveau/mme: Don't dereference an empty vector 1890- nouveau/mme: Don't leak data_bo 1891- nouveau/mme: Use fixed BO addresses in the MME test 1892- nvk: Move Heaps and BO binding into nvkmd 1893- nvk: Move debug flags int nvk_debug.h 1894- nvk/nvmkd: Plumb parent pointers through everywhere 1895- nvk/nvkmd: Re-implement NVK_DEBUG=vm 1896- nvk: Do mem maps directly in nvkmd on nouveau 1897- nvk/nvkmd: Add real mem<->bo flag translation 1898- nvk/nvkmd: Flip the script on NO_SHARED 1899- nvk: Drop nvk_buffer::is_local 1900- nvk/nvkmd: Rework memory placement flags 1901- nvk/nvkmd: Be more specific about memory alignments 1902- nvk/nvkmd: Be a lot more pedantic about VA alignments 1903- nvk: Put CB0 in VRAM 1904- nvk: Put descriptors in VRAM 1905- nouveau/push: Cache the last header DW to avoid read-back 1906- nak/sph: Stop storing the shader model in ShaderProgramHeader 1907- nak: Move encode_sm* to to sm*.rs 1908- nak/sm50: Get rid of the hand-rolled align_up/down() helpers 1909- nak: Plumb a ShaderModel trait through everywhere 1910- nak/ra: Move the NAK_DEBUG=spill logic into RA 1911- nak: Move RegFile::num_regs() into ShaderModel 1912- nak: Move Instr::can_be_uniform() into ShaderModel 1913- nak: Move instruction encoding into ShaderModel 1914- nak/sm70: Move instruction encoding into a trait 1915- nak/sm70: Re-organize the code a bit 1916- nak/legalize: Move a bunch of helpers to a trait 1917- nak/legalize: Handle OpBreak and OpBSSy specially 1918- nak/legalize: Handle RA instructions up-front 1919- nak/sm70: Move legalization into SM70Op 1920- nak/sm50: Move instruction encoding into a trait 1921- nak/sm50: Move legalization into SM50Op 1922- nak: Add a legalize() method to ShaderModel 1923- nak/sm50: Re-order all the ops 1924- nak/sm50: Fix immediates for IMnMx 1925- zink/kopper: Set VK_COMPOSITE_ALPHA_OPAQUE_BIT when PresentOpaque is set 1926- nak: gather instr count explicitly 1927- nvk/nvkmd: nouveau uses the OS page size 1928- nvk: Drop the sparse alignment back down to 4096 1929- nvk: Use the page size queried from NVKMD 1930- nak/nir: Use an indirect load for sample locations 1931- nak/copy_prop: Propagate OpSel with a selector of SrcRef::Zero 1932- nak/copy_prop: Ignore the top 16 bits of OpPrmt::sel 1933- nak: Don't print the destination of OpIpa twice 1934- nir,nak: Add a nir_op_prmt_nv 1935- nak/nir: Use prmt in texture lowering 1936- nak/nir: Use prmt for barycentric offset lowering 1937- nak/nir: Make interpolate_at_sample more efficient 1938- nak: Add some helpers for working with OpPrmt selectors 1939- nak: Optimize nested OpPrmt 1940- nak: Add a pass macro for more consistent debug printing 1941- nak: Run copy-prop again after opt_prmt and opt_lop 1942- nvk: Fix indirect cbuf binds pre-Turing 1943- nvk: Don't advertise sparse residency on Maxwell A 1944- nvk: Reject sparse images on Maxwell A and earlier 1945- nak/spill_values: Don't assume no trivial phis 1946- meson/megadriver: Don't invoke the megadriver script with no drivers 1947- nak: Sample locations are byte-aligned 1948- nvk: Require color or depth/stencil attachment support for input attachments 1949- nvk: Support STORAGE_READ_WITHOUT_FORMAT on buffers 1950- zink: Align descriptor buffers to descriptorBufferOffsetAlignment 1951 1952Francisco Jerez (33): 1953 1954- intel/brw/xe2+: Keep PS sample mask in the f1.0 register whether or not kill is used. 1955- intel/brw: Don't emit Z coordinate interpolation if CPS isn't in use. 1956- intel/brw/xe2+: Fix indirect extended descriptor setup for scratch space. 1957- iris: Allocate fixed amount of space for blend state. 1958- blorp: Allocate fixed amount of space for blend state. 1959- intel/brw/xe2+: Don't use SEL peephole on 64-bit moves. 1960- intel/brw/xe2+: Fix 64-bit subgroup scan intrinsics not to rely on SEL instructions. 1961- intel/brw/xe2+: Lower 64-bit SHUFFLE and CLUSTER_BROADCAST. 1962- intel/xe2+: Enable native 64-bit integer arithmetic. 1963- nir: Add option to lower 64-bit uadd_sat. 1964- intel/brw/xe2+: Lower 64-bit integer uadd_sat. 1965- intel/brw/xe2+: Round up spill/unspill data size to nearest reg_size multiple. 1966- intel/xe2+: Enable native 64-bit integer arithmetic. 1967- iris,anv/xe2+: Enable the DX10/OGL border mode for YCrCb as per Wa_14014226147. 1968- iris,anv/xe2+: Set tessellation redistribution regions per patch to recommended values. 1969- iris,anv/xe2+: Use pipelined variant of 3DSTATE_DRAWING_RECTANGLE. 1970- intel/brw/xe2+: Use active-thread-only barriers available since Xe2+. 1971- iris/xe2+: Fix format of scratch space surface address in various 3DSTATE packets. 1972- anv/xe2+: Fix format of scratch space surface address in various 3DSTATE packets. 1973- intel/fs/gfx20+: Fix surface state address on extended descriptors for NIR scratch intrinsics. 1974- intel/fs/xe2+: Ask driver for PS payload registers based on barycentric load intrinsics in use. 1975- iris/gfx11+: Request PS payload fields for ALU-based interpolation via 3DSTATE_PS_EXTRA. 1976- anv/gfx11+: Request PS payload fields for ALU-based interpolation via 3DSTATE_PS_EXTRA. 1977- intel/fs/xe2+: Don't lower barycentric load offsets to fixed-point format on Xe2+. 1978- intel/fs/xe2+: Add ALU-based implementation of barycentric interpolation at a per-channel offset. 1979- intel/fs/xe2+: Add ALU-based implementation of barycentric interpolation at a per-channel sample. 1980- intel/dev: Add GRF size information to the intel_device_info struct. 1981- anv/xe2+: Align push constant ranges to GRF boundaries. 1982- intel/brw: Implement null push constant workaround. 1983- intel/dev: Add devinfo flag for TBIMR push constant workaround. 1984- anv/gfx12.5: Pass non-empty push constant data to PS stage for TBIMR workaround. 1985- iris/gfx12.5: Pass non-empty push constant data to PS stage for TBIMR workaround. 1986- iris: Pin pixel hashing table BO from iris_batch submission instead of from iris_state. 1987 1988Friedrich Vock (7): 1989 1990- aco/tests: Insert p_logical_start/end in reduce_temp tests 1991- aco/spill: Insert p_start_linear_vgpr right after p_logical_end 1992- radv: Use max_se instead of num_se where appropriate 1993- radeonsi: Use max_se instead of num_se where appropriate 1994- radv/rt: Fix memory leak when compiling libraries 1995- aco/spill: Don't spill phis with all-undef operands 1996- aco: Limit rt stages to 128 vgprs 1997 1998GKraats (3): 1999 2000- i915g: fix generation of large mipmaps 2001- i915g: fix mipmap-layout for npots 2002- i915g: fix max_lod at mipmap-sampling 2003 2004Ganesh Belgur Ramachandra (4): 2005 2006- radeonsi: add GL_EXT_texture_filter_minmax extension 2007- radeonsi: add GL_ARB_texture_filter_minmax extension 2008- radeonsi: fix eptich on chips without image opcodes (e.g. gfx940) 2009- amd/common: skip lane size determination for chips without image opcodes (e.g. gfx940) 2010 2011Georg Lehmann (88): 2012 2013- aco/tests: don't use undef for descriptors 2014- aco/tests/post_ra: fix various validation errors 2015- aco/lower_to_hw: fix v_cvt_pk_u16_u32 instruction format 2016- aco/lower_to_hw: fix 16bit p_insert on gfx8 2017- aco/tests: validate before and after post-ra tests 2018- spirv: preserve signed zero in modf 2019- aco/lower_to_hw: don't use regClass to identify subdword reductions 2020- aco: add a subdword lowering pass 2021- aco: add tests for lower_subdword 2022- aco/ra: remove gfx6/7 subdword paths 2023- aco/lower_to_hw: remove gfx6/7 subdword paths 2024- ac/nir: explicitly use pack_half_2x16_rtz 2025- radv, radeonsi: don't use D16 for f2f16_rtz 2026- radv: always run nir_opt_16bit_tex_image 2027- nir/opt_16bit_tex_image: pass options to opt_16bit_dest 2028- nir/opt_16bit_tex_image: optimize packed conversions too 2029- aco/gfx11+: use v_cvt_pk_u8_f32 for 8bit constant copies 2030- aco/gfx10: use v_add_u16 with literal for constant copies 2031- aco/tests: simplify small constant copy test 2032- aco/gfx11+: optimize v_fma_mix throughput 2033- zink: use bitcasts instead of pack/unpack double opcodes 2034- aco/gfx11: use v_swap_b16 2035- aco/optimizer: remove ineffective vcc opt 2036- aco/optimizer: remove ineffective undef opt 2037- aco: remove perfwarn 2038- aco: don't pass program to emit_bpermute 2039- aco/lower_to_hw: add copy_constant_sgpr 2040- aco: small constant copy optimizations 2041- aco/lower_to_hw: use copy_constant_sgpr for masks 2042- aco/lower_to_hw: optimize split 64bit constant copies 2043- aco/optimizer: use p_create_vector to create mask when a copy can't be used 2044- nir: remove unpack_half_flush_to_zero 2045- nir/opt_uniform_atomics: handle inverse_ballot when detecting single lane ifs 2046- aco: optimize branching sequence with p_create_vector exec producer 2047- nir: sink/move inverse_ballot like moves 2048- ac: set has_pack_32_4x8 2049- nir: lower pack_uvec4_to_uint to pack_32_4x8 if supported 2050- nir/opt_algebraic: alternative 8bit pack_[us]norm_4x8 lowering 2051- aco: rework how affinities for acc operands are determined 2052- aco: add affinities for possible sopk optimizations 2053- aco/gfx11+: fix inline constants for v_pk_fmac_f16 2054- aco: move literal unswizzle opt to RA 2055- aco/ra: use a switch to check vop2acc instruction support 2056- aco: move s_add_u32 -> s_addk_i32 optimization fully to ra 2057- amd/common: set COMPUTE_STATIC_THREAD_MGMT_SE2-3 correctly on gfx10-11 2058- aco: add more anonymous namespaces 2059- aco: make local functions static in files without anonymous namespace 2060- radv: inline partial push constant loads 2061- nir: add ford, funord, fneo, fequ, fltu, fgeu 2062- aco: implement ford, funord, fneo, fequ, fltu, fgeu 2063- ac/llvm: implement ford, funord, fneo, fequ, fltu, fgeu 2064- ac/nir: enable ford, funord, fneo, fequ, fltu, fgeu 2065- nir/opt_algebraic: look through fabs/fneg when matching fmulz/ffmaz 2066- nir/optimize cmp(a, -0.0) 2067- nir/opt_algebraic: optimize cmp(fneg(a), #b) and feq with fabs 2068- nir/opt_algebraic: add various unordered/ordered patterns from aco 2069- aco: remove ordered/unordered optimizations 2070- aco/ir: remove unused vopc helpers 2071- iris/ci: update trace checksums 2072- aco/ra: fix affinity for s_addk 2073- aco: fix s_delay_alu with salu and trans dependency 2074- aco,nir: add dpp16_shift_amd intrinsic 2075- radv/nir: add a pass to optimize shuffle/booleans dependent only on tid/consts 2076- radv: use radv_nir_opt_tid_function for shuffles 2077- radv: use radv_nir_opt_tid_function to create inverse_ballot 2078- aco/gfx12: use trans s_delay_alu for pseudo scalar 2079- aco/gfx12: don't allow vgpr operands for pseudo scalar 2080- aco/gfx11.5: select s_cvt_[ui]32_f32 2081- aco/gfx11.5: select s_(ceil|floor|trunc|rndne) 2082- aco: add aco_opcode::p_s_cvt_f16_f32_rtne 2083- aco/gfx11.5: select SALU float conversions 2084- aco/gfx11.5: fix s_fmac acc to definition 2085- aco/gfx11.5: select SOP2 float instructions 2086- aco/gfx11.5: select SOPC float instructions 2087- aco/gfx11.5: select SALU fsat 2088- aco/gfx11.5: select SALU fsign 2089- aco/gfx11.5+: allow sgpr dst for trans ops and use pseudo scalar ops on gfx12 2090- aco/gfx11.5: select SALU fneg/fabs 2091- aco/gfx11.5: select SALU fquantize2f16 2092- aco: micro optimize VALU fquantize2f16 2093- aco: handle clustered uniform reductions correctly 2094- nir: constant fold inverse_ballot 2095- aco: remove optimize_cmp_subgroup_invocation 2096- spirv: ignore more function param decorations 2097- aco/optimizer: update temp_rc when converting to uniform bool alu 2098- aco/gfx11+: don't use VOP3 v_swap_b16 2099- nir/lower_int64: replace uadd_sat with ior for find_lsb64 and ufind_msb64 2100- aco/gfx10+: set lateKill for sgprs used by wave64 VALU writing a mask 2101 2102Gert Wollny (4): 2103 2104- zink/kopper: Wait for last QueuePresentKHR to finish before acquiring for readback 2105- mesa/st: don't use base shader serialization when uniforms are not packed 2106- r600/sfn: Set bit size for newly created store intrinsic 2107- zink: limit minSampleShading to a maxium value of 1.0 2108 2109Guilherme Gallo (3): 2110 2111- ci: Add S3 id_token for all jobs 2112- ci: Use id_tokens for JWT auth 2113- ci/lava: Fix cmdline for UART/fastboot devices 2114 2115Hans-Kristian Arntzen (5): 2116 2117- vulkan: Update XML and headers to 1.3.285. 2118- ac/surface: Add surface flags to prefer 4K and 64K alignment. 2119- radv: Implement VK_MESA_image_alignment_control 2120- wsi/common: Do not update present mode with MESA_VK_WSI_PRESENT_MODE. 2121- wsi/x11: Bump maximum number of outstanding COMPLETE events. 2122 2123Heinrich Fink (1): 2124 2125- zink: remove workaround of FB modifiers forcing present state 2126 2127Iago Toral Quiroga (53): 2128 2129- v3dv: fix VK_KHR_vertex_attribute_divisor 2130- v3d,v3dv: stop hard-coding max attrib divisor 2131- broadcom/compiler: assert on array overflow 2132- v3d: fix array_len when precompiling outputs for shader-db 2133- broadcom/compiler: fix num_textures for precompiled shaders 2134- broadcom/compiler: don't read excess channels on image loads 2135- broadcom/compiler: simplify v3d_vir_emit_tex 2136- broadcom/cle: fix up shader record for V3D 7.1.10 / 2712D0 2137- v3d: support 2712D0 2138- v3dv: support 2712D0 2139- broadcom/compiler: make add_node return the node index 2140- broadcom/compiler: don't assign payload registers to spilling setup temps 2141- broadcom/compiler: apply payload conflict to spill setup before RA 2142- broadcom/compiler: check if vertex shader writes point size 2143- v3dv: only flag 'shader writes point size' if the shader actually writes it 2144- v3dv: emit a default point size when drawing points 2145- v3dv: drop unused stride field from v3dv_pipeline_vertex_binding 2146- v3dv: fix incorrect index buffer size 2147- v3dv: use pSizes paramater in vkCmdBindVertexBuffers2 2148- v3dv: implement vkCmdBindIndexBuffer2KHR 2149- v3dv: handle VkBufferUsageFlags2CreateInfoKHR 2150- v3dv: handle VkPipelineCreateFlags2CreateInfoKHR 2151- v3dv: lower maxVertexInputBindingStride to match vulkan runtime 2152- v3dv: shader modules are deprecated with VK_KHR_maintenance5 2153- v3dv: implement vkGetImageSubresourceLayout2KHR 2154- v3dv: refactor create_image 2155- v3dv: add a get_image_subresource_layout helper 2156- v3dv: implement vkGetDeviceImageSubresourceLayoutKHR 2157- v3dv: implement vkGetRenderingAreaGranularityKHR 2158- v3dv: fix pipeline leaks when meta pipeline cache is disabled 2159- v3dv: fix a few asserts that check layerCount instead of array_layers 2160- v3dv: allow VK_REMAINING_ARRAY_LAYERS in VkImageSubresourceLayers 2161- v3dv: remove blit shader restriction on depth/stencil not being linear 2162- v3dv: disable some TLB paths for cases of linear depth/stencil stores 2163- v3dv: support VK_FORMAT_A1B5G5R5_UNORM_PACK16_KHR 2164- v3dv: add more checks for device loss 2165- v3dv: fix handling of pipeline flags when pipeline init fails 2166- v3dv: expose VK_KHR_maintenance5 2167- broadcom/compiler: initialize payload_conflict for all initial nodes 2168- v3dv: don't call wsi_device_init too early 2169- broadcom/compiler: don't spill in between multop and umul24 2170- broadcom/compiler: fix per-quad spilling 2171- broadcom/compiler: validate rtop + thrsw hazard 2172- broadcom/compiler: drop multop if we dce umul24 2173- broadcom/compiler: add missing signal compatibilities for V3D 7.x 2174- broadcom/compiler: add new float32 unpack modifiers in V3D 7.x 2175- broadcom/compiler: disallow copy propagation of FMOV exclusive modifiers 2176- broadcom/compiler: implement nir_op_fsat 2177- v3d: don't lower fsat on V3D 7.x 2178- v3dv: make nir helpers receive nir compiler options from caller 2179- v3dv: don't lower fsat on V3D 7.x 2180- v3d: skip tlb loads when emitting clears with a draw call 2181- v3d: rename job->clear to job->clear_tlb 2182 2183Ian Romanick (33): 2184 2185- intel/brw: Fix optimize_extract_to_float for i2f of unsigned extract 2186- intel/brw: Avoid optimize_extract_to_float when it will just be undone later 2187- intel/elk: Fix optimize_extract_to_float for i2f of unsigned extract 2188- nir/algebraic: Optimize some extract_* expressions 2189- spirv: Use fp16 fp_fast_math settings when lowering fp16 asin and acos 2190- intel/brw: Remove dsign optimization 2191- intel/elk: Remove dsign optimization 2192- intel/brw: Use fs_inst::resize_sources in brw_fs_opt_algebraic 2193- intel/brw: Add support for fcsel opcodes 2194- intel/brw: Handle fsign optimization in a NIR algebraic pass 2195- intel/brw: Update CSEL source type validation 2196- intel/brw: Combine constants and constant propagation for CSEL 2197- intel/brw: Algebraic optimizations for CSEL 2198- intel/brw: Implement more strictly correct fsign lowering 2199- intel/brw: Use range analysis to optimize fsign 2200- nir/algebraic: Add nir_lower_int64_options::nir_lower_iadd3_64 2201- nir/search: Fix is_16_bits for vectors 2202- nir/search: Refactor is_16_bits 2203- nir/algebraic: More patterns to generate iadd3 2204- nir/algebraic: intel/fs: Optimize some patterns before lowering 64-bit integers 2205- intel/brw: Temporarily disable result=float16 matrix configs 2206- intel/brw: Major rework of lower_cmat_load_store 2207- intel/brw/xe2+: Catch invalid uses of writes_accumulator earlier 2208- intel/brw/xe2+: Adjust size_read() for DPAS 2209- intel/brw/xe2+: Scale size_written by reg_unit for DPAS 2210- intel/brw/xe2+: Adjust DPAS lowering to DP4A to accommodate larger GRF and SIMD16 2211- intel/brw/xe2+: Allow vec16 for cooperative matrix 2212- nir: dpas_intel second source can have different number of components 2213- intel/brw/xe2+: Add LNL cooperative matrix configurations 2214- intel/tools: Advertise I915_PARAM_HAS_EXEC_TIMELINE_FENCES 2215- intel/brw: Test corner case CSE of ADD3 instructions 2216- intel/brw: Don't propagate saturate to an instruction that writes flags 2217- intel/elk: Don't propagate saturate to an instruction that writes flags 2218 2219Icenowy Zheng (7): 2220 2221- llvmpipe: add shader cache support for ORCJIT implementation 2222- gallivm: orcjit: use a mutex to protect symbol looking up 2223- util: detect LoongArch architecture 2224- gallivm: add LoongArch support to the mattrs setting code 2225- llvmpipe: add LoongArch support in ORCJIT 2226- gallivm: orcjit: keep the ownership of tm for LPJit 2227- gallivm: orcjit: use atexit to release LPJit singleton at exit 2228 2229Italo Nicola (1): 2230 2231- nir: add {load,store}_global_etna intrinsics 2232 2233Iván Briano (21): 2234 2235- compiler: reorder FLOAT_CONTROLS enums 2236- nir: track some float controls bits per instruction 2237- spirv: gather some float controls bits per instruction 2238- nir: check inf/nan/sz preserve per-instruction 2239- nir/algebraic: support float controls conditions per instruction 2240- nir/algebraic: move float control conditions to be per instruction 2241- vtn: support float controls2 2242- anv: enable VK_KHR_shader_float_controls2 2243- anv: check requirements for VK_IMAGE_USAGE_FRAGMENT_SHADING_RATE 2244- anv: fix casting to graphics_pipeline_base 2245- anv: consolidate DestroyPipeline for graphics and graphics_lib 2246- intel/brw: fix subgroup size of geometry stages for lnl+ 2247- anv: check cmd_buffer is on a transfer queue more properly 2248- intel/brw: add fetch_viewport_index function 2249- intel/brw: always read LAYER/VIEWPORT from the FS payload 2250- vulkan/runtime: pColorAttachmentInputIndices is allowed to be NULL 2251- vulkan/properties: handle LayeredApiPropertiesListKHR 2252- anv: enable VK_KHR_maintenance7 2253- anv: get scratch surface from the correct pool 2254- anv: set MOCS for protected memory when needed 2255- intel/rt: fix terminateOnFirstHit handling 2256 2257JCWasmx86 (1): 2258 2259- meson: Fix invalid kwarg name 2260 2261Jeremy Gebben (1): 2262 2263- radv: Return hang status from radv_check_gpu_hangs() 2264 2265Jesse Natalie (14): 2266 2267- nir_opt_algebraic: Add a couple optimizations for lowered unpack(pack()) 2268- wgl: Delete pixelformat support query 2269- wgl: Fix flag check for GDI compat 2270- nir_range_analysis: Use fmin/fmax to fix NAN handling 2271- d3d12: Use GetResourceAllocationInfo instead of GetCopyableFootprints for residency sizes 2272- nir: Remove assert-only variable by inlining its single use 2273- zink: Add ASSERTED to assert-only local variable 2274- mesa: Add ASSERTED to assert-only local variable 2275- subprojects: Use depth=1 in the git wrap files 2276- blake3: fix Windows ARM64 build and detect ARM64EC as ARM64 2277- ci/windows: Disable zlib in LLVM 2278- ci/windows: Specify numpy < 2.0 to prevent breaking changes 2279- microsoft/clc: Split struct copies before vars_to_ssa in pre-inline optimizations 2280- meson: Add an error message for llvmpipe without llvm draw support 2281 2282Jessica Clarke (3): 2283 2284- Revert "meson: Do not require libdrm for DRI2 on hurd" 2285- Revert "meson: fix with_dri2 definition for GNU Hurd" 2286- meson: egl: Build egl_dri2 driver even for plain DRI 2287 2288Jianxun Zhang (43): 2289 2290- intel/isl: Allow multi-sample on depth aux usage (xe2) 2291- isl: Add a heading 4KB to MCS surface (xe2) 2292- isl: Add AUX MCS encoding into aux modes (xe2) 2293- blorp: Scaledown rectangle of MSAA fast clear (xe2) 2294- blorp: Fix offset when ambiguating MCS buffer (xe2) 2295- isl: Clone from isl_gfx12.* files (xe2) 2296- isl: Update isl_gfx20 code (xe2) 2297- isl: Add isl_gfx20 into build (xe2) 2298- isl: Add dispatching in isl.c (xe2) 2299- isl: Implement a part of WA_22018390030 (xe2) 2300- isl: Remove code for Xe2 from isl_gfx12.c 2301- isl: Update render CMF mapping (xe2) 2302- isl: Don't set clear values or their address (xe2) 2303- blorp: Get fast clear rectangle of non-MSAA surfaces (xe2) 2304- blorp: Pass down fast clear color value (xe2) 2305- intel/genxml,blorp,common: Update 3DSTATE_PS command (xe2) 2306- iris: Update aux state for color fast clears (xe2) 2307- iris: Limit FCV_CCS_E to platforms that enable it 2308- anv: Don't enable compression with modifiers (xe2) 2309- iris: Add more restrictions on compression (Xe2) 2310- anv: Don't enable compression on external bos (xe2) 2311- iris: Disable PAT-based compression on depth surfaces (xe2) 2312- anv: Disable PAT-based compression on depth images (xe2) 2313- iris: Update synchronization of fast clear (xe2) 2314- iris: Workaround: Don't allocate compressed bo from cache (xe2) 2315- isl: Remove restriction of CCS_E support on formats (xe2) 2316- blorp: Don't convert ccs_e formats for copy (xe2) 2317- isl: Initialize the last usage in isl_encode_aux_mode[] (xe2) 2318- anv: Update synchronization of fast clear (xe2) 2319- iris: Disable predraw resolve (xe2) 2320- blorp: Ensure MSAA fast clear in correct modes (xe2) 2321- intel/dev: Select a compressed PAT entry (xe2) 2322- isl: Add some formats not covered in CMF table (xe2) 2323- anv: Disable tracking fast clear and aux state (xe2) 2324- anv: Fix Vulkan CTS failure related to MCS (xe2) 2325- anv: Support arbitrary fast-clear value on all layouts (xe2) 2326- anv: Disable tracking of clear color on color attachment 2327- intel/common: Ensure SIMD16 for fast-clear kernel (xe2) 2328- intel/common: Remove blank lines in intel_set_ps_dispatch_state() (xe2) 2329- anv: Fix assertion failures on BMG (xe2) 2330- iris: Fix an assertion failure with compressed format 2331- anv: Disable compression on legacy modifiers (xe2) 2332- anv: Disable legacy CCS setup in binding (xe2) 2333 2334Job Noorman (33): 2335 2336- ir3: simplify cat5 parsing 2337- ir3: add encoding for isam.v 2338- ir3: use isam.v for multi-component SSBO loads 2339- ir3: add encoding of ldib/stib offsets 2340- ir3: lower SSBO access imm offsets 2341- nir/opt_offsets: add callback for max base offset 2342- nir/opt_offsets: add option to allow offset wrapping 2343- nir/opt_offsets: add load/store_ssbo_ir3 2344- ir3: use nir_opt_offsets for SSBO accesses 2345- ir3: optimize SSBO offset shifts for nir_opt_offsets 2346- ir3: remove spilled splits in shared RA 2347- ir3: set wrmask for spilled splits in shared RA 2348- ir3: print sharedness/halfness of merge set regs 2349- ir3: print intervals when dumping merge sets 2350- ir3: print dst_offset of spill.macro 2351- ir3: debug print limit pressure and post-spill max pressure 2352- ir3: set current instruction before all validation asserts 2353- ir3: fix crash in try_evict_regs with src reg 2354- ir3: fix handling of early clobbers in calc_min_limit_pressure 2355- ir3: set offset on splits created while spilling 2356- ir3: correctly set wrmask for reload.macro 2357- ir3: don't remove intervals for non-killed tex prefetch sources 2358- ir3: don't remove collects early while spilling 2359- ir3: expose instruction indexing helper for merge sets 2360- ir3: make indexing instructions optional in ir3_merge_regs 2361- ir3: index instructions before fixing up merge sets after spilling 2362- ir3: move liveness recalculation inside ir3_ra_shared 2363- ir3: restore interval_offset after liveness recalculation in shared RA 2364- ir3: add ir3_cursor/ir3_builder helpers 2365- ir3: refactor ir3_spill.c to use the ir3_cursor/ir3_builder API 2366- ir3: only add live-in phis for top-level intervals while spilling 2367- ir3: print rounding mode for cov 2368- ir3: set rounding mode for all floating point conversions 2369 2370Jordan Justen (33): 2371 2372- blorp: Update programming for XY_FAST_COLOR_BLT on xe2 2373- intel/genxml: Add XY_FAST_COLOR_BLT for xe2 2374- intel/genxml: Update 3DSTATE_BTD for xe2 2375- intel/dev: Allow setting FORCE_PROBE for intel PCI IDs 2376- intel/dev: Support INTEL_FORCE_PROBE env-var 2377- docs: Document INTEL_FORCE_PROBE env-var 2378- intel/dev: Add LNL device info 2379- pci_ids/intel: Add LNL PCI IDs (with FORCE_PROBE set) 2380- anv/grl: Set INTEL_FORCE_PROBE=* when running intel_clc 2381- intel/brw: Simplify enabling brw_fs_test_dispatch_packing 2382- intel/brw: Allow xe2 in brw_stage_has_packed_dispatch() 2383- intel/brw: Fix SSBO/shared load offset register size for Xe2 2384- anv/grl: Build for xe2 2385- Revert "anv: Disable Ray Tracing on xe2 until our compiler supports Xe2 RT" 2386- intel/dev/mesa_defs.json: Update LNL WA entries 2387- intel/dev: Add INTEL_PLATFORM_BMG enum, BMG WA info 2388- intel/dev: Add BMG device info 2389- intel/dev: Add BMG PCI IDs (with FORCE_PROBE set) 2390- intel/dev: Silence INTEL_FORCE_PROBE warning for intel_clc 2391- intel/dev: If building the driver, always allow getting device info 2392- Revert "anv/grl: Set INTEL_FORCE_PROBE=* when running intel_clc" 2393- intel/compiler: Don't set size written in brw_lower_logical_sends.cpp 2394- intel/tools: Fix intel_dev_info --hwconfig switch 2395- isl: Move isl_get_render_compression_format in isl_genX_helpers.h 2396- isl: Implement isl_get_render_compression_format for xe2 2397- intel/brw: Retype some regs to BRW_TYPE_UD for Xe2 indirect accesses 2398- intel/perf/xe: Fix free pointer location in xe_add_config() 2399- intel/dev: Enable LNL PCI IDs without INTEL_FORCE_PROBE 2400- anv/generated_indirect_draws: Adjust xe2 simd32 sends_count_expectation 2401- intel/dev: Disable LNL PCI IDs on Mesa 24.2 (require INTEL_FORCE_PROBE) 2402- intel/brw/validate: Simplify grf span validation check by not using a mask 2403- intel/brw/validate: Update dst grf crossing check for Xe2 2404- intel/brw/validate: Convert access mask to be grf based 2405 2406Jordan Petridis (1): 2407 2408- Revert "ci: mark microsoft farm as offline" 2409 2410Jose Maria Casanova Crespo (9): 2411 2412- v3d: fix CLE MMU errors avoiding using last bytes of CL BOs. 2413- v3dv: fix CLE MMU errors avoiding using last bytes of CL BOs. 2414- v3d: Increase alignment to 16k on CL BO on RPi5 2415- v3dv: Increase alignment to 16k on CL BO on RPi5 2416- v3dv: V3D_CL_MAX_INSTR_SIZE bytes in last CL instruction not needed 2417- v3dv/ci: Add more dEQP-VK subgroups that are currently skipped 2418- v3dv: Emit stencil draw clear if needed for GFXH-1461 2419- v3dv: really fix CLE MMU errors on 7.1HW Rpi5 2420- v3d: really fix CLE MMU errors on 7.1HW Rpi5 2421 2422Josh Simmons (3): 2423 2424- radv: Fix crash when using SQTT and NO_COMPUTE 2425- radv: Add \`RADV_PROFILE_PSTATE` envvar 2426- radv: Fix shader mask for SQ_WGP SPM counters 2427 2428José Expósito (2): 2429 2430- meson: Update proc_macro2 meson.build patch 2431- llvmpipe: Init eglQueryDmaBufModifiersEXT num_modifiers 2432 2433José Roberto de Souza (87): 2434 2435- intel/perf: Nuke platform_supported 2436- intel/perf: Remove i915_drm.h include from gen_perf.py 2437- intel/perf: Fix the error check of i915_add_config() 2438- intel/perf: Change oa_format to uint64_t 2439- intel/perf: Store pointer intel_device_info to in intel_perf_config 2440- intel/perf: Add intel_perf_free() 2441- intel/perf: Add intel_perf_free_context() 2442- intel/ds: Free perf config and context 2443- intel/ds: Nuke ralloc_ctx and ralloc_cfg 2444- anv: Free intel_perf_config when destroying physical device 2445- hasvk: Free intel_perf_config when destroying physical device 2446- iris: Free intel_perf_config and intel_perf_context 2447- crocus: Free intel_perf_config and intel_perf_context 2448- intel/perf: Add and use a function to return platform OA format 2449- intel/perf: Add function to open perf stream 2450- intel/perf: Fix return of read_oa_samples_until() 2451- anv: Nuke perf_query_pass from anv_execbuf 2452- intel/perf: Replace I915_OA_FORMAT_* usage by platform check 2453- intel/perf: Move code that will be shared by both KMDs 2454- intel/perf: Move i915 specific code from common code 2455- intel/perf: Move i915 specific code to load configurations to i915 file 2456- intel/perf: Allocate sseu in heap memory 2457- intel/perf: Replace drm_i915_perf_record_header by intel_perf_record_header 2458- intel/perf: Add a macro with header + sample length 2459- intel/perf: Add intel_perf_stream_read_samples() 2460- intel/dev: Add LNL stepping mapping 2461- intel/dev: Add BMG stepping mapping 2462- intel: Move slm functions from brw_compiler.h to intel_compute_slm.c/h 2463- intel/common: Implement Xe2 SLM encode 2464- intel/common: Implement preferred SLM encode 2465- intel/dev: Use topology variables to calculate strides in Xe KMD 2466- intel/dev: Add function to get the number of EUs per subslice 2467- intel: Set preferred SLM allocation size >= than SLM size for Xe2 2468- anv: Set maxComputeSharedMemorySize value for Xe2 platforms 2469- intel: Compute the optimal preferred SLM size per subslice 2470- anv: Initialize variable to fix static analyzer warning 2471- intel/genxml/gfx20: Sync POSTSYNC_DATA struct with spec 2472- anv/xe2: Enable compute walker and BTD thread preemption 2473- anv/xe2: Add STATE_COMPUTE_MODE individual masks 2474- anv: Remove block promoting non CPU mapped bos to coherent 2475- intel/isl: Set dummy_aux_address to implement Wa_14019708328 2476- anv: Implement Wa_14019708328 2477- iris: Implement Wa_14019708328 2478- anv: Implement Wa_14019857787 2479- iris: Implement Wa_14019857787 2480- intel/dev: Add compressed PAT entry 2481- anv: Add support for compressed images allocation in Xe2 2482- anv: Give apps the choice of compressed or uncompressed but cpu visible images 2483- iris: Add support for compressed images allocation in Xe2 2484- anv: Fix assert in xe_gem_create() 2485- intel/perf: Change order of if blocks 2486- intel/perf: Add assert to check if allocated enough query fiels 2487- intel/dev: Add engine_class_supported_count to intel_device_info 2488- intel/perf: Add LNL OA XML 2489- intel/perf: Add INTEL_PERF_QUERY_FIELD_TYPE_SRM_OA_PEC 2490- intel/perf: Extend intel_perf_query_result_read_gt_frequency() to gfx 20 2491- intel: Sync xe_drm.h 2492- intel/perf: Implement function that returns OA format for Xe KMD 2493- intel/perf: Add function to check if OA/perf is supported by Xe KMD 2494- intel/perf: Replace i915_perf_version and i915_query_supported by a feature bitmask 2495- intel/perf: Refactor and add Xe KMD support to add and remove configs 2496- intel/perf: Add Xe KMD perf stream open function 2497- intel/perf: Refactor and add Xe KMD support to enable and disable perf stream 2498- intel/perf: Refactor and add Xe KMD support to change stream metrics id 2499- tool/pps: Add Xe KMD support 2500- intel/perf: Remove i915_drm.h includes from common code 2501- intel/perf: Implement Xe KMD perf stream read 2502- anv: Implement Xe KMD query pools 2503- intel/perf: Enable perf on Xe KMD 2504- intel/perf: Implement intel_perf_query_result_accumulate() for gfx 20+ 2505- intel/perf: Add support for LNL OA sample format size 2506- intel/perf: Return LNL OA sample format 2507- intel/perf: Do not add INTEL_PERF_QUERY_FIELD_TYPE_SRM_OA_PEC 2508- intel/perf: Adjust EU count for Xe2+ 2509- intel/dev: Replace intel_device_info::apply_hwconfig by a gfx version check 2510- intel: Rename XE_PERF to XE_OBSERVATION 2511- anv: Fix return of PAT index for compressed bos for discrete GPUs 2512- intel/dev: Drop DG1 PAT entries 2513- intel/dev: Add documentation about intel_device_info_pat_entry::mmap 2514- intel/dev: Drop coherency from intel_device_info_pat_entry 2515- intel/dev: Add comment documenting the PAT entries 2516- intel/dev: Use GPU WB PAT for Xe2 writecombining 2517- intel/dev: Drop writeback_incoherent from Xe2 2518- isl: Fix Xe2 protected mask 2519- anv: Propagate protected information to blorp_batch_isl_copy_usage() 2520- intel: Sync xe_drm.h 2521- intel/dev: Support new topology type with SIMD16 EUs 2522 2523Juan A. Suarez Romero (57): 2524 2525- vc4/ci: update results 2526- vc4/v3d/ci: update expected list 2527- vc4: set src type on storing sample mask 2528- broadcom/compiler: remove unused parameters in vpm read 2529- broadcom/compiler: do not run lowering I/O for FS 2530- v3d/vc4/ci: set full renderer version check 2531- nir,v3d: rename tlb_color_v3d intrinsic 2532- vc4: use tlb_color_brcm intrinsic 2533- .gitignore: add .cache folder 2534- vc4: use IO semantics for location 2535- v3d: use BITSET for the masks 2536- v3d: remove handled cases for devices <= 42 2537- ci: define SNMP base interface on runner 2538- v3d: use screen name in disk cache 2539- v3d,v3dv: add compatibility revision in GPU name 2540- broadcom/ci: update expected results 2541- v3dv/ci: add expected failure 2542- v3dv/ci: fix spurious line in expected 2543- v3dv/ci: add new timeouts 2544- dri: cast constant to uint for bitshift 2545- util: do not access member of a NULL structure 2546- util: use unsigned types when performing bitshift 2547- vulkan: do not access member of a NULL structure 2548- nir: fix overflow when negating maxint in constant expressions 2549- nir: use unsigned types when performing bitshifting 2550- glsl: fix downcasting addresses to wrong object types 2551- egl: do not access member of a NULL structure 2552- mesa: use unsigned types when performing bitshifting 2553- mesa: do not pass NULL pointer to function not expecting NULLs 2554- ci: disable Igalia farm 2555- broadcom/compiler: use unsigned types when performing bitshifting 2556- v3dv: do not access member of a NULL structure 2557- v3dv: do not pass NULL pointer to function not expecting NULLs 2558- v3dv: restrict to channels when encoding border color 2559- v3dv: fix misalignment in descriptor layout structure 2560- v3d: do not access member of a NULL structure 2561- v3d: do not pass NULL pointer to function not expecting NULLs 2562- vc4: use unsigned types when performing bitshifting 2563- vc4: do not access member of a NULL structure 2564- vc4: do not pass NULL pointer to function not expecting NULLs 2565- vc4: do not create 0-bytes variable length arrays 2566- vc4: fix out-of-bounds access to array 2567- Revert "ci: disable Igalia farm" 2568- v3d: use original enabled_mask on setting vertex buffers 2569- broadcom/ci: read 32-bit kernel from arm32 path 2570- broadcom/ci: remove arch from hardware name 2571- vc4/ci: run tests in 64-bits 2572- broadcom/ci: run some GL tests in arm32 arch 2573- broadcom/qpu: clean all versions not supported 2574- broadcom: follow version naming convention 2575- broadcom/ci: add more jobs to test with rpi5 2576- broadcom/ci: update traces for rpi4 2577- v3d/ci: update expected list 2578- v3dv: adversise VK_EXT_depth_clamp_zero_one 2579- v3d: expose ARB_depth_clamp in V3D 7.x 2580- v3dv: free temp image created when copying with blit 2581- v3dv: don't leak cache key 2582 2583Julian Orth (1): 2584 2585- egl/wayland: ignore unsupported driver configs 2586 2587Juston Li (8): 2588 2589- venus: refactor out image requirements helpers 2590- venus: extend image cache to vkGetDeviceImageMemoryRequirements 2591- sync protocol for VkRingPriorityInfoMESA 2592- venus: forward nice priority when creating ring 2593- zink: disable cpu_storage for PIPE_USAGE_STREAM 2594- venus: add missing sTypes for vk_set_physical_device_properties_struct 2595- venus: sync protocol for conditionally ignored dyn arrays 2596- anv/android: set ANV_BO_ALLOC_EXTERNAL for imported AHW 2597 2598Karmjit Mahil (6): 2599 2600- ir3: Don't set saturation on \`flat.b` 2601- zink: Add missing currentExtent special value handling 2602- turnip: Remove workaround for CTS bug zero-sized inline uniform block 2603- mailmap: Add Karmjit Mahil 2604- freedreno/isa: Fix isaspec map for a3xx-ld 2605- tu: Set \`TU_ACCESS_CCHE_READ` for transfer ops with read access 2606 2607Karol Herbst (159): 2608 2609- nir: add SYSTEM_VALUE_BASE_WORKGROUP_ID 2610- nir/divergence_analysis: handle load_base_global_invocation_id 2611- intel/compiler: lower workgoup id to index only for mesh shaders 2612- v3d: call nir_lower_compute_system_values to get rid of base intrinsics 2613- lavapipe: lower base_workgroup_id to zero 2614- mesa/st: lower base invoc and workgroup id 2615- nir: remove global_invocation_id_zero_base 2616- nir: remove workgroup_id_zero_base 2617- nir: document base_global_invocation_id and base_workgroup_id 2618- core/kernel: skip validating unique kernel signatures 2619- rusticl/program: Arc the stored KernelInfo 2620- rust/program: remove Program::kernels 2621- nouveau: fix potential double-free in nouveau_drm_screen_create 2622- nir: fix nir_shader_get_function_for_name for functions without names. 2623- rusticl: use stream uploader for cb0 if prefered 2624- rusticl/kernel: properly handle grid and offsets being usize 2625- rusticl: lower huge grids 2626- rusticl: add RUSTICL_MAX_WORK_GROUPS 2627- rusticl/event: use Weak refs for dependencies 2628- rusticl/icd: remove CLObject 2629- rusticl/spirv: enable more caps 2630- Revert "rusticl/event: use Weak refs for dependencies" 2631- event: break long dependency chains on drop 2632- rusticl/device: add DeviceCaps and move timestamp stuff into it 2633- rusticl/device/caps: move enough for has_images 2634- rusticl/device: properly handle devices with no support for images 2635- rusticl/mesa/context: flush context before destruction 2636- rusticl: merge rusticl_nir and rusticl_mesa_bindings_inline_wrapper targets 2637- rusticl: move mesa_version_string out of the inline wrapper 2638- rusticl: bump bindgen req to 0.65 2639- rusticl: bump meson req to 1.4 2640- rusticl: make use of new \`output_inline_wrapper` meson.rust.bindgen feature 2641- nir/lower_cl_images: set binding also for samplers 2642- nouveau: import nvif/ioctl.h file from libdrm_nouveau 2643- gallium/vl: stub vl_video_buffer_create_as_resource 2644- gallium/vl: remove stubs which are defined in mesa_util 2645- meson: centralize galliumvl_stub handling 2646- rusticl: link against libgalliumvl_stub 2647- wgl: link against libgalliumvl_stub 2648- gallium/drivers: do not link against libgalliumvl directly 2649- rusticl/event: fix deadlock when calling clGetEventProfilingInfo inside callbacks 2650- iris: fix PIPE_RESOURCE_PARAM_STRIDE for buffers 2651- rusticl/icd: make sure returned function pointers are of the right type 2652- rusticl/kernel/launch: fix mapping usize types to GPU pointer sizes 2653- rusticl/kernel/launch: remove useless upload of the input 2654- rusticl/kernel: move most of the code in launch inside the closure 2655- rusticl/kernel/launch: move allocation of resources vec 2656- rusticl/kernel/launch: rework how the printf buffer is allocated 2657- rusticl/kernel/launch: get rid of Arc clones for global resources 2658- rusticl/kernel/launch: add helper to bind global buffers 2659- broadcom/compiler: handle load_workgroup_size 2660- v3d: add support for load_workgroup_size 2661- rusticl/spirv: do not pass a NULL pointer to slice::from_raw_parts 2662- rusticl/memory: copies might overlap for host ptrs 2663- gallium: reduce pipe_resource.usage to 4 bits 2664- gallium: properly type pipe_resource.usage with the enum 2665- gallium: properly type fields of pipe_resource.usage 2666- nir_lower_mem_access_bit_sizes: support unaligned store_scratch 2667- nir: add global_atomic_2x32 variants to nir_get_io_offset_src_number 2668- broadcom/compiler: support global load/store intrinsics 2669- broadcom/compiler: use nir_lower_mem_access_bit_sizes for memory lowering 2670- broadcom/compiler: convert 2x32 global operations to scalar variants 2671- broadcom/compiler: only handle load_uniform explicitly in v3d_nir_lower_load_store_bitsize 2672- broadcom/compiler: rework scratch lowering 2673- rusticl/meson: add build root dir to the include dirs of rusticl_c 2674- rusticl: depend on the spirv_info target 2675- util/u_printf: properly handle %% 2676- rusticl/memory: assume minimum image_height of 1 2677- rusticl/memory: fix clFillImage for buffer images 2678- rusticl: add new CL_INVALID_BUFFER_SIZE condition for clCreateBuffer 2679- rusticl: add bsymbolic to linker flags 2680- rusticl/icd: rename all entry points to the actual correct name 2681- radeonsi: set bo_size for user memory allocations 2682- rusticl/queue: gracefully stop the worker thread 2683- rusticl/queue: run rustfmt 2684- nir/lower_alu: support 8 and 16 bit bit_count 2685- nir/opt_sink: add load_kernel_input 2686- gallium: add PIPE_CAP_TEXTURE_SAMPLER_INDEPENDENT 2687- rusticl/device: require PIPE_CAP_TEXTURE_SAMPLER_INDEPENDENT for image support 2688- rusticl/mesa/context: handle clear_buffer not set by driver 2689- rusticl/mesa/screen: handle get_timestamp not set by driver 2690- rusticl/kernel/launch: fix global work offsets for 32 bit archs again 2691- broadcom/compiler: add generated v3d_nir_lower_algebraic 2692- broadcom/compiler: handle fp16 conversion ops 2693- broadcom/compiler: fix iu2f32 for 8 and 16 bit inputs 2694- broadcom/compiler: try handling 8/16 bit alu operations 2695- broadcom/compiler: handle up to vec16 load_uniforms 2696- broadcom/compiler: abort on unknown intrinsics 2697- broadcom/compiler: implement load_kernel_input 2698- broadcom/compiler: call nir_lower_64bit_phis 2699- broadcom/compiler: handle variable shared memory 2700- v3d: implement gallium APIs for OpenCL support 2701- v3d: treat SHADER_KERNEL as SHADER_COMPUTE 2702- v3d: lower CL alus 2703- v3d: lower 64 bit ALUs 2704- v3d: support variable shared memory 2705- v3d: fix MAX_GLOBAL_SIZE and MAX_MEM_ALLOC_SIZE 2706- v3d: never replace a mapped bo 2707- rusticl: enable v3d 2708- nir/schedule: add write dep also for shared_atomic 2709- meson: rename with_gallium_opencl to with_gallium_clover 2710- rusticl/program: move binary parsing into its own function 2711- rusticl/program: make binary API not crash on errors 2712- rusticl/program: use blob.h to parse binaries 2713- rusticl/program: update binary format 2714- rusticl/program: use default in more places 2715- Revert "rusticl/queue: run rustfmt" 2716- Revert "rusticl/queue: gracefully stop the worker thread" 2717- rusticl/buffer: harden bound checks against overflows 2718- rusticl/context: move SVM pointer tracking into own type 2719- rusticl/ptr: add a few APIs to TrackedPointers 2720- rusticl/memory: complete rework on how mapping is implemented 2721- rusticl: remove unused interfaces to simplify code 2722- rusticl/mesa: remove ResourceType::Cb0 2723- rusticl/memory: optimize sw_copy when the row_pitch matches the height 2724- rusticl/mesa: make PipeResource repr(transparent) 2725- v3d: support unnormalized coords 2726- rusticl/spirv: support more caps 2727- rusticl/device: fix image_3d_write_supported for embedded 2728- rusticl/device: turn image_3d_write_supported into a cap 2729- rusticl/device: fix advertizement of 3d write images support 2730- rusticl: require PIPE_CAP_IMAGE_STORE_FORMATTED for image support. 2731- rusticl/event: make set_status handle error status properly 2732- rusticl/queue: do not overwrite event error states 2733- rusticl/queue: properly check all dependencies for an error status 2734- rusticl/event: properly implement CL_EXEC_STATUS_ERROR_FOR_EVENTS_IN_WAIT_LIST 2735- rusticl/queue: properly implement in-order queue error checking 2736- rusticl/event: return execution errors when doing a blocking enqueue 2737- rusticl/mesa: handle failures with u_upload_data 2738- rusticl/mesa: set take_ownership to true in set_constant_buffer_stream 2739- rusticl/event: fix outdated comment in call 2740- rusticl/queue: format file 2741- zink: fix OpenCL read_write images 2742- rusticl: support read_write images 2743- spirv: generate info for FunctionParameterAttribute 2744- spirv: initial parsing of function parameter decorations 2745- spirv: handle function parameters passed by value 2746- nak: allow clippy::not_unsafe_ptr_arg_deref lints 2747- clc: force linking of spirvs with mismatching pointer types in signatures 2748- rusticl: fix clippy lint having bounds defined in multiple places 2749- rusticl/program: protect against 0 length in slice::from_raw_parts 2750- rusticl/api: protect against 0 length in slice::from_raw_parts 2751- rusticl/spirv: protect against 0 length in slice::from_raw_parts 2752- nouveau: handle realloc failure inside cli_kref_set 2753- mesa: check for enabled extensions for \*UID enums 2754- nouveau/winsys: fix handling of NV_DEVICE_TYPE_IGP 2755- nouveau: use nv_devince_info and fill in PCI and type information 2756- nouveau: add nv_device_uuid 2757- nouveau: implement driver_uuid and device_uuid 2758- nvk: use nv_device_uuid 2759- zink: lower 64 bit find_lsb, ufind_msb and bit_count 2760- zink: lower 8/16 bit alu ops vk spirv doesn't allow 2761- rusticl/kernel: properly respect device thread limits per dimension 2762- rusticl/memory: Fix memory unmaps after rework 2763- rusticl/image: take pitches into account when allocating memory for maps 2764- rusticl/image: properly sync mappings content for 1Dbuffer images 2765- rusticl/queue: add clSetCommandQueueProperty 2766- util/u_printf: do not double print format string with unused arugments 2767- rusticl/memory: fix sampler argument size check 2768 2769Kenneth Graunke (63): 2770 2771- isl: Set MOCS to uncached for Gfx12.0 blitter sources/destinations 2772- intel/brw: Delete gfx10 table for align1 3src type encoding 2773- intel/brw: Drop NF type support 2774- intel/brw: Rework BRW_REGISTER_TYPE's representation semantics 2775- intel/brw: Stop using long BRW_REGISTER_TYPE enum names 2776- intel/brw: Reindent after shortening BRW_REGISTER_TYPE_* to BRW_TYPE_* 2777- intel/brw: Use newer brw_type_is_* shorter names 2778- intel/brw: Replace brw_reg_type_from_bit_size by brw_type_with_size 2779- intel/brw: Replace type_sz and brw_reg_type_to_size with brw_type_size_* 2780- intel/brw: Combine a1/a16 3src type encoding functions 2781- intel/brw: Combine a1/a16 3src type decoding functions 2782- intel/brw: Rename brw_reg_type_to_hw_type to brw_type_encode 2783- intel/brw: Don't use inst return value when it isn't needed 2784- intel/brw: Make a helper for finding the largest of two types 2785- intel/brw: Add builder helpers for math functions 2786- intel/brw: Add builder helpers that allocate temporary destinations 2787- intel/brw: Use new builder helpers that allocate a VGRF destination 2788- intel/brw: Print W/UW immediates correctly 2789- intel/brw: Do not create empty basic blocks when removing instructions 2790- intel/brw: Support CSE on more ops 2791- intel/brw: Don't include unnecessary undefined values in texture results 2792- intel/brw: Add a new VEC() helper. 2793- intel/brw: Use VEC for load_const 2794- intel/brw: Use VEC for gl_FragCoord 2795- intel/brw: Use VEC for TCS/TES/GS input/output loads 2796- intel/brw: Use VEC for FS outputs 2797- intel/brw: Use VEC for output stores 2798- intel/brw: Use VEC for NIR vec*() sources 2799- intel/brw: Use VEC for emit_unzip() 2800- intel/nir: Set src_type on TCS quads workaround store_output 2801- intel/brw: Blockify convergent load_shared on Gfx11-12 as well 2802- intel/brw: Recreate GS output registers after EmitVertex 2803- intel/brw: Skip fs_nir_setup_outputs for compute shaders 2804- intel/brw: Handle scratch address swizzling of constants 2805- intel/brw: Add a idom_tree::dominates(a, b) helper. 2806- intel/brw: Make brw_reg::bits publicly accessible from fs_reg 2807- intel/brw: Update instructions_match() to compare more fields 2808- intel/brw: Drop compiler parameter from try_constant_propagate() 2809- intel/brw: Drop BRW_OPCODE_IF from try_constant_propagate 2810- intel/brw: Refactor try_constant_propagate() 2811- intel/brw: Refactor code to commute immediates into legal positions 2812- intel/brw: Delete SAD2 and SADA2 opcodes 2813- intel/brw: Make VEC() perform a single write to its destination. 2814- intel/brw: Make gl_SubgroupInvocation lane index loading SSA 2815- intel/brw: Skip LOAD_PAYLOADs after every texture instruction if possible 2816- intel/brw: Add a new def analysis pass 2817- intel/brw: Print defs in dump_instructions 2818- intel/brw: Write a new global CSE pass that works on defs 2819- intel/brw: Switch to the new defs-based global CSE pass 2820- intel/brw: Delete old local common subexpression elimination pass 2821- intel/brw: Introduce a new SSA-based copy propagation pass 2822- intel/brw: Use the defs-based copy propagation along with the old one 2823- intel/brw: Make opt_copy_propagation_defs clean up its own trash 2824- intel/brw: Build the scratch header on the fly for pre-LSC systems 2825- intel/brw: Skip discarding the interference graph 2826- intel/brw: Delay liveness calculations in saturate propagation 2827- intel/brw: Make an alu2 builder helper 2828- intel/brw: Make bld.ADD(x, 0) emit no instructions and return x directly 2829- intel/brw: Support CSE of ADD3 2830- intel/brw: Add a lower_csel pass and allow building it for all types 2831- intel/nir: Don't needlessly split u2f16 for nir_type_uint32 2832- intel/brw: Don't mix types for unary extended math instructions 2833- intel/brw: Disallow scalar byte to float conversions on DG2+ 2834 2835Kevin Chuang (6): 2836 2837- anv: Properly fetch partial results in vkGetQueryPoolResults 2838- anv: Properly handle cases for different query types in copy_query_results_with_shader 2839- intel/genxml: add task/mesh shader statistics registers 2840- anv: Update pipeline statistics mask for task/mesh shader invocations 2841- anv: implement mesh shader queries 2842- anv: toggle meshShaderQueries based on whether we support mesh_shader or not 2843 2844Khem Raj (1): 2845 2846- amd: Include missing llvm IR header Module.h 2847 2848Konstantin (4): 2849 2850- docs: Add documentation about debugging GPU hangs on RADV 2851- ac/debug,radv: Read UMR wave dumps into memory before parsing 2852- radv: Use a struct for the trace_bo layout 2853- radv: Trace indirect dispatch sizes 2854 2855Konstantin Seurer (59): 2856 2857- radv: Handle all dependencies of CmdWaitEvents2 2858- nir/print: Do not access invalid indices of load_uniform 2859- radv: Fix radv_shader_arena_block list corruption 2860- radv: Remove arenas from capture_replay_arena_vas 2861- radv: Zero initialize capture replay group handles 2862- radv/ci: Add back pipeline library flakes 2863- radv/ci: Document recent flakes 2864- gitlab: Reference hang debugging documenttion 2865- radv: Remove radv_cmd_dirty_dynamic_bits 2866- llvmpipe: Use a second LLVMContext for compiling sample functions 2867- radv: Add locking to radv_replay_shader_arena_block 2868- radv: Replace is_rt_shader with RADV_SHADER_TYPE_RT_PROLOG 2869- radv: Remove uses_dynamic_rt_callable_stack 2870- radv/rt: Track ray_launch_id reads 2871- radv/rt: Track ray_launch_size reads 2872- radv/rt: Remove load_rt_dynamic_callable_stack_base_amd 2873- radv: Return a block from radv_replay_shader_arena_block 2874- ac/llvm: Fix DENORM_FLUSH_TO_ZERO with exact instructions 2875- ac/llvm: Enable helper invocations for vote_all/any 2876- radv/ci: Bring back vkcts-navi21-llvm-valve 2877- khronos-update: Add ANDROID guards to vk_android_native_buffer.h 2878- zink: Always include renderdoc_app.h 2879- zink: Blit using one triangle for nearest filtering 2880- llvmpipe: Lock shader access to sample_functions 2881- llvmpipe: Stop using a sample_functions pointer as cache key 2882- llvmpipe: Only evict cache entries if a fence is available 2883- lavapipe: Always call finish_fence after lvp_execute_cmd_buffer 2884- radv: Clean up pipeline barrier handling 2885- radv: Remove dead access bits 2886- radv/meta: Use READ access for dst_access_flush 2887- radv/rra: Detect BVHs with back edges 2888- radv/rra: Move some code into handle_accel_struct_write 2889- radv/rra: Fix disabling the ray history 2890- radv/rra: Fix reporting the isec invocations 2891- radv/rra: Bump rt_driver_interface_version to 8.0 2892- radv/rra: Reduce the memory requirement of copy_after_build 2893- radv/rra: Rework calculating the ray history size 2894- radv/rra: Enable RADV_RRA_TRACE_COPY_AFTER_BUILD by default 2895- util: Add a helper for querying sparse tile sizes 2896- lavapipe: Do not allocate 0 sized buffers for descriptor sets 2897- gallium: Add a memory range parameter to resource_bind_backing 2898- llvmpipe: Use an anonymous file for memory allocations 2899- lavapipe: Implement sparse buffers and images 2900- lavapipe: Implement shaderResourceResidency 2901- venus: Refactor hiding sparse features and properties 2902- venus: Disable sparse binding on lavapipe 2903- vulkan: Handle group stages in vk\_.*_access2_for_pipeline_stage_flags2 2904- vulkan: Add vk_expand_(dst|src)_access_flags2 2905- radv: Use vk_expand_(src|dst)_access_flags2 2906- radv: Remove no-op access flag handling 2907- radv: Remove handling for expanded access flags 2908- radv: Remove write access handling from radv_dst_access_flush 2909- radv: Handle AS access bits like shader storage access bits 2910- radv: Refactor radv_(dst|src)_access_flush 2911- radv: Fix smooth lines with dynamic polygon mode and topology 2912- radv: Always use dynamic line smoothing 2913- nir: Stop using "capture : true" for nir_opt_algebraic 2914- nir: Add FLOAT_CONTROLS\_.*_PRESERVE 2915- aco: print s_delay_alu INSTSKIP>3 correctly 2916 2917Leo Liu (4): 2918 2919- ac/surface: add GFX12 256B tile mode for video 2920- ac/surface/tests: add the test for ADDR3_256B_2D 2921- radeon/vcn: use pipe video buffers for dpb 2922- radeon/vcn: enable dpb to use pipe video buffer with swizzle mode 2923 2924Lionel Landwerlin (125): 2925 2926- anv: disable dual source blending state if not used in shader 2927- anv: reuse embedded samplers across shaders 2928- anv: simplify multisampling check 2929- anv: fixup indentation 2930- anv: factor out wm_prog_data get in runtime flush 2931- intel/brw: fixup wm_prog_data_barycentric_modes() 2932- intel/fs: decouple alphaToCoverage from per sample dispatch 2933- intel/brw: add min_sample_shading value in wm_prog_data 2934- anv: track sample shading enable & min sample shading 2935- anv: add dirty tracking of fs_msaa_flags in runtime 2936- anv: move 3DSTATE_WM::BarycentricInterpolationMode programming to runtime 2937- anv: move more PS_EXTRA programming to runtime 2938- anv: move 3DSTATE_PS to partial packing 2939- anv: move 3DSTATE_MULTISAMPLE to partial emission 2940- anv: remove fs_msaa_flags from the graphics pipeline 2941- anv: enable EDS3 AlphaToCoverageEnable & RasterizationSamples 2942- anv: fixup alloc failure handling in reserved_array_pool 2943- anv: fix leak of custom border colors 2944- anv: avoid requirement to put flush_data as first field 2945- anv: move device initialization as the last step of vkCreateDevice 2946- anv: move empty_vs_input to physical device 2947- anv: VK_EXT_legacy_vertex_attributes 2948- docs: update anv features 2949- anv: fix ycbcr plane indexing with indirect descriptors 2950- intel/hang_replay: use newer API of i915 execbuffer 2951- intel/hang_replay: use hw image param 2952- intel/tools: add README file 2953- brw: add more condition for reducing sampler simdness 2954- intel: move debug identifier out of libintel_dev 2955- brw: drop dependency on libintel_common 2956- anv: fix push constant subgroup_id location 2957- nir/divergence: add missing load_printf_buffer_address 2958- nir: add a base offset for printf indexing 2959- nir: add ptr_bit_size parameter to nir_lower_printf 2960- nir: add a low level printf emission helper 2961- intel/nir: remove unused prototypes 2962- intel/nir: add reloc delta to load_reloc_const_intel intrinsic 2963- intel/compiler: store u_printf_info in prog_data 2964- intel/nir: add printf lowering 2965- anv: add debug shader printf support 2966- intel/clc: enable printfs support 2967- anv: shader printf example 2968- anv: switch to vk_device::mem_cache field for default cache 2969- anv: use weak_ref mode for global pipeline caches 2970- anv: fix shader identifier handling 2971- intel/brw: ensure find_live_channel don't access arch register without sync 2972- anv: fix utrace compute walker timestamp captures 2973- anv: fix timestamp copies from secondary buffers 2974- anv: move last compute command pointers to the state structure 2975- u_trace: extend tracepoint end_of_pipe bit into flags 2976- anv: optimize POSTSYNC_DATA rewrites in timestamp emissions 2977- intel: fix HW generated local-id with indirect compute walker 2978- brw: use a single virtual opcode to read ARF registers 2979- brw: limit dependencies on SR register 2980- brw: better model READ_ARF_REG opcode 2981- anv: fix Gfx9 fast clears on srgb formats 2982- anv: rewrite Wa_18019816803 tracking to be more like state 2983- anv: factor out some more gpu_memcpy setup 2984- anv: fix pipeline flag fields 2985- anv: expose VK_MESA_image_alignment_control 2986- anv: support setting CFE_STATE::StackIDControl per application 2987- anv: limit aux invalidations to primary command buffers 2988- anv: ensure completion of surface state copies before secondaries 2989- anv: simplify TRTT initialization 2990- anv: reuse setup_execbuf_fence_params for utrace submissions 2991- anv: rework utrace submission 2992- anv: move trtt submissions over to the anv_async_submit 2993- anv: use reserved array pool for legacy custom border colors 2994- anv: make device initialization more asynchronous 2995- mi-builder: rename relocated api 2996- mi-builder: c++ warning fix 2997- mi-builder: make instruction pointer manipulation more obvious 2998- mi-builder: add missing write completion check 2999- mi-builder: add relocated register/memory writes 3000- mi-builder: add a write check parameter 3001- anv: centralize mi_builder setup 3002- anv: use the new relocated write mi-builder api 3003- anv: move more MI_SDI to mi_builder 3004- anv: use default mocs for memory bits only touched by CS 3005- anv: set query mi-builder mocs only once 3006- anv: use new mi-builder write check API to avoid stalls 3007- genxml: add MI_MEM_FENCE for Gfx20 3008- mi-builder: add read/write memory fencing support on Gfx20+ 3009- intel/fs: fix lower_simd_width for MOV_INDIRECT 3010- anv: add custom mi write fences 3011- anv: emit conditional after gfx state flushing 3012- anv: factor out STATE_BASE_ADDRESS filling to helper function 3013- anv: predicate emission of STATE_BASE_ADDRESS 3014- anv: reuse device local variable 3015- anv: avoid initalizing TRTT stuff without sparseBinding 3016- anv: fix vkCmdWaitEvents2 handling 3017- anv: don't apply descriptor array bound checking 3018- brw: add missing break 3019- brw: factor out source extraction for rematerialization 3020- brw: improve rematalization of surface/sampler handles 3021- brw: bound the amount of rematerialized NIR instructions 3022- brw: remove rematerialization assert 3023- brw: remove some brackets 3024- brw: enable rematerialization of non 32bit uniforms 3025- brw: always use new registers for load address increments 3026- brw: annotation send instructions with surface handles generated with exec_all 3027- brw: avoid Wa_1407528679 in uniform cases 3028- brw: blockify load_global_const_block_intel 3029- brw: enable A64 loads source rematerialization 3030- anv: limit vertex fetch invalidation on indirect read 3031- anv: add a protected scratch pool 3032- anv: prepare 2 variants of all shader instructions 3033- anv: allocate compute scratch using the right scratch pool 3034- anv: emit the right shader instruction for protected mode 3035- anv: workaround flaky xfb query results on Gfx11 3036- anv: fix u_trace on < Gfx12.0 3037- intel/ds: remove duplicate arguments 3038- hasvk: move cmd_emit_timestamp initialization to genX 3039- hasvk: pass anv_address to predicate helper 3040- brw: fix uniform rebuild of sources 3041- anv: get rid of the second dynamic state heap 3042- isl: account for protection in base usage checks 3043- anv: properly flag image/imageviews for ISL protection 3044- anv: propagate protected information for blorp operations 3045- anv: fix check on pipeline mode to track buffer writes 3046- vulkan/runtime: allow null/empty debug names 3047- anv: reuse object string for RMV token 3048- anv: add missing MEDIA_STATE_FLUSH for internal shaders 3049- anv/blorp: force CC_VIEWPORT reallocation when programming 3DSTATE_VIEWPORT_STATE_POINTERS_CC 3050- brw/rt: fix ray_object_(direction|origin) for closest-hit shaders 3051 3052Louis-Francis Ratté-Boulianne (20): 3053 3054- dri_interface: add interface for EGL_EXT_surface_compression 3055- gallium: add interface for fixed-rate surface/texture compression 3056- egl/wayland: factor out common part of DRI image creation 3057- egl: wire up EGL_EXT_surface_compression extension 3058- st/dri2: add support for fixed-rate compression interface 3059- egl/dri2: add support for EGL_EXT_surface_compression 3060- mapi: add EXT_texture_storage_compression extension 3061- mesa/st: add compression parameter to st_texture_create 3062- mesa: implement EXT_texture_storage_compression extension 3063- mesa: implement EXT_EGL_image_storage_compression extension 3064- panfrost: Add AFRC overlay in v10 xml specification 3065- panfrost: add device querying for AFRC support 3066- panfrost: add utils for AFRC fixed-rate support 3067- panfrost: encode component order as an inverted swizzle (v10) 3068- panfrost: add support for AFRC textures 3069- panfrost: add support for AFRC render targets 3070- panfrost: add support for AFRC modifiers 3071- panfrost: add translation between modifier and compression rates 3072- panfrost: add support for fixed-rate compression 3073- panfrost: add PAN_AFRC_RATE env var to force a compression rate 3074 3075Luc Ma (4): 3076 3077- loader: silence implicit-load zink error by the loader 3078- gallium: properly propagate the usage of resource 3079- gallium: inline trivial needs_pack() 3080- meson: Build pipe-loader when build-tests is true 3081 3082Lucas Fryzek (7): 3083 3084- llvmpipe: query winsys support for dmabuf mapping 3085- u_gralloc/fallback: Set fd from handle directly 3086- egl/x11/sw: Implement swapbuffers with damage 3087- vulkan/wsi: Update sw x11 wsi to only copy damage regions 3088- egl/x11/sw: Implement shm support 3089- egl/x11: Remove force software check for exporting SBWD 3090- lp: only map dt buffer on import from dmabuf 3091 3092Lucas Stach (2): 3093 3094- etnaviv: drm: don't skip flush when there are active PMRs 3095- etnaviv: always flush pending queries on get_query_result 3096 3097M Henning (2): 3098 3099- nir: Handle texop_*_nv in nir_tex_instr_is_query 3100- nak: Add minimum bindgen requirement 3101 3102Maaz Mombasawala (2): 3103 3104- svga: Retry DRM_VMW_SYNCCPU ioctl on failure. 3105- svga: Replace shared surface flag and simplify surface creation 3106 3107Marcin Ślusarz (2): 3108 3109- intel/genxml/xe2: update MESH_CONTROL 3110- anv,intel/compiler/xe2: fill MESH_CONTROL.VPandRTAIndexAutostripEnable 3111 3112Marek Olšák (174): 3113 3114- ac/gpu_info: set tcc_rb_non_coherent only if number of TCCs != number of RBs 3115- ac/surface: disable DCC for 3D textures on gfx9 to improve performance 3116- ac/surface: enable thick tiling for 3D textures for better perf on gfx6-8 3117- radeonsi: don't invalidate L2 for internal compute without DCC stores 3118- radeonsi: fix KHR-GL46.texture_lod_bias.texture_lod_bias_all on gfx10-11 3119- radeonsi: validate IO semantics in scan_io_usage 3120- radeonsi: add workarounds for DCC MSAA for gfx9-10 3121- radeonsi: enable DCC for MSAA on gfx10-10.3 3122- radeonsi: check for FMASK correctly in gfx10_get_bin_sizes 3123- amd/ci: 17 piglit failures are fixed for raven 3124- nir: add ACCESS_CP_GE_COHERENT_AMD 3125- nir: add nir_atomic_op_ordered_add_gfx12_amd 3126- nir: add streamout intrinsics for AMD GFX12 3127- nir: add sleep intrinsics for AMD 3128- nir: validate src_type of store_output intrinsics, require bit_size >= 16 3129- nir: add more build helpers 3130- nir: add shader_info::use_aco_amd 3131- nir/lower_tex: support FMASK loads with a 16-bit sample index 3132- nir/lower_image: support FMASK loads with a 16-bit sample index 3133- drm-uapi: update amdgpu_drm.h and drm_fourcc.h for gfx12 3134- amd: import gfx12 addrlib 3135- amd: add gfx12 register definitions 3136- amd: add gfx12 register definitions into the register header generator 3137- amd: add initial common code for gfx12 3138- ac/nir: update ac_nir_lower_resinfo for gfx12 3139- ac/nir,llvm: add GS VGPR changes for gfx12 3140- ac/llvm: use new s_wait instructions and split the existing ones for gfx12 3141- ac/llvm: add new cache flags for gfx12 3142- ac/llvm: add CS SGPR changes for gfx12 3143- ac/llvm: update inline assembly for buffer_load_format_xyzw with TFE for gfx12 3144- ac/nir: add ac_nir_sleep and handle the intrinsics 3145- ac/nir: add gfx12 streamout NIR code 3146- ac/llvm: handle nir_atomic_op_ordered_add_gfx12_amd 3147- ac/llvm: implement nir_intrinsic_ordered_xfb_counter_add_gfx12_amd 3148- ac/llvm: add a workaround for nir_intrinsic_load_constant for LLVM on gfx12 3149- ac/surface: add gfx12 3150- ac/surface/tests: add gfx12 tests 3151- radeonsi: add gfx12 3152- util: shift the mask in BITSET_TEST_RANGE_INSIDE_WORD to be relative to b 3153- ac/llvm: improve/simplify/fix load_ssbo 3154- radeonsi: serialize shader disassembly string to fix asm dumps for ACO 3155- radeonsi: fix the size of the query result SSBO 3156- radeonsi: validate the buffer range in si_set_shader_buffer 3157- radeonsi: remove GDS tests 3158- radeonsi: set flags directly instead of having needs_db_flush 3159- radeonsi/gfx11: use a lighter workaround for Navi31 dEQP failures 3160- radeonsi: get NIR options from si_screen instead of calling get_compiler_options 3161- radeonsi: minor simplifications of clear/copy_buffer shaders 3162- radeonsi: simplify the complex clear/copy_buffer shader 3163- radeonsi: use set_work_size for all internal compute dispatches 3164- radeonsi: replace the clear_12bytes_buffer shader with the DMA compute shader 3165- radeonsi: remove slow code from si_msaa_resolve_blit_via_CB 3166- radeonsi/ci: fix caselists for vk-gl-cts/main 3167- radeonsi/ci: update failures for all generations 3168- radeonsi/ci: remove some gfx11 flakes 3169- radeonsi: constify struct pipe_vertex_buffer * 3170- nir/algebraic: eliminate pack+unpack and unpack+pack pairs 3171- ac: move radv_mem_vectorize_callback to common code 3172- ac/llvm: global stores should have no holes in the writemask 3173- radeonsi: call nir_lower_int64 later to fix ACO failure with Tomb Raider 3174- radeonsi: vectorize load/stores and shrink stores 3175- amd: update addrlib 3176- amd: add more gfx11 APUs 3177- amd: enable 32B minimum DCC block size for gfx1151 3178- ac/llvm: fix incorrect parameter type in llvm.amdgcn.s.nop 3179- radeonsi: vectorize loads/store after ABI lowering and optimizations 3180- radeonsi/gfx12: fix the alpha ref value 3181- radeonsi/gfx12: fix incorrect condition for when to do clear_buffer via compute 3182- radeonsi/gfx12: disable CU1 instead of CU0 for GS due to SQTT 3183- radeonsi/gfx12: fix a regression in si_set_mutable_tex_desc_fields 3184- radeonsi/gfx12: fix depth bounds register values 3185- radeonsi/gfx12: fix a regression in si_init_depth_surface 3186- radeonsi: don't lower UBO/SSBOs to descriptors if they are already lowered 3187- radeonsi: lower NIR resource srcs to descriptors last 3188- ac/descriptors: fix gfx12 regressions 3189- ac/nir/lower_ngg: use global_atomic_amd to fix gfx12 streamout 3190- ac/nir/lower_ngg: use voffset in global_atomic_add for xfb 3191- ac: add gfx12 DCC shared code 3192- radeonsi/gfx12: fix GPU deadlocks due to query result incoherency 3193- radeonsi: assume si_set_ring_buffer is only used by gfx6-10.3 3194- radeonsi: remove cp_to_L2 and L2_to_cp, inline the values 3195- radeonsi: remove RADEON_FLAG_READ_ONLY 3196- radeonsi: allow RADEON_HEAP_BIT_GL2_BYPASS for VRAM 3197- radeonsi: remove leftover comment of non-existent RADEON_FLAG_MALL_NOALLOC 3198- radeonsi/gfx12: add DCC 3199- ac/surface: pass the correct addrlib handle to Addr3GetPossibleSwizzleModes 3200- amd: update addrlib 3201- ac/nir/lower_ngg: don't use gfx12 xfb defs outside their basic block on gfx11 3202- radeonsi/gfx12: fix stencil corruption 3203- gallium/u_blitter: add option to override fragment shader for util_blitter_blit 3204- radeonsi: don't declare 3D coordinates in the compute blit if they aren't needed 3205- radeonsi: use better workgroup sizes for compute blits to improve perf 3206- radeonsi: ignore PIPE_SWIZZLE_1 for 40% VGPR usage reduction for compute blits 3207- radeonsi: remove fp16_rtz from the compute blit 3208- radeonsi: use MIMG D16 (16-bit data) for image instructions in compute blits 3209- radeonsi: optimize unaligned compute blits 3210- radeonsi: fix sample0_only for the compute blit 3211- radeonsi: reject unsupported parameters as the first thing in the compute blit 3212- radeonsi: don't use si_can_use_compute_blit in the compute blit 3213- radeonsi: don't fail due to DCC when using the compute blit on compute queues 3214- radeonsi/gfx11: enable MSAA image stores in the compute blit 3215- radeonsi: document better how X/Y flipping in the compute blit works 3216- radeonsi: cosmetic and robustness changes for the compute blit 3217- radeonsi: extend the compute blit to do image clears as well 3218- radeonsi: switch compute image clears to the compute blit shader 3219- radeonsi: rename si_compute_blit "testing" parameter to "fail_if_slow" 3220- radeonsi: rename si_compute_copy_image -> si_compute_copy_image_old 3221- radeonsi: add a new version of si_compute_copy_image using the compute blit 3222- radeonsi: switch the old compute image copy to the new one using the blit 3223- radeonsi: remove the old si_compute_copy_image 3224- radeonsi: convert the compute blit shader hash table to u64 keys 3225- radeonsi: split xy_clamp_to_edge to separate X and Y flags for the compute blit 3226- radeonsi: restructure (rewrite) the compute blit shader 3227- radeonsi: adds flags parameter into si_compute_blit to replace fail_if_slow 3228- radeonsi: change the compute blit to clear/blit multiple pixels per lane 3229- radeonsi: extend NIR compute helpers to allow returning 16-bit results 3230- radeonsi: use MIMG A16 (16-bit image coordinates) in compute blits 3231- radeonsi: print the compute shader blit key for AMD_DEBUG 3232- radeonsi: use shader_info::use_aco_amd to determine whether to use ACO 3233- radeonsi: add use_aco into CS blit shader key 3234- radeonsi: clear color buffers via compute for special tiling cases 3235- radeonsi: add a custom MSAA resolving pixel shader 3236- radeonsi: add fail_if_slow parameter into si_msaa_resolve_blit_via_CB 3237- radeonsi: add a new blit microbenchmark 3238- radeonsi: add decision code to select when to use CB_RESOLVE for performance 3239- radeonsi: add decision code to select when to use compute blit for performance 3240- ac/nir: import the MSAA resolving pixel shader from radeonsi 3241- ac/nir: import the universal compute clear/blit shader 3242- ac/nir: import the dispatch logic for the universal compute clear/blit shader 3243- Revert "radeonsi: fix initialization of occlusion query buffers for disabled RBs" 3244- radeonsi/ci: update gfx10.3 failures 3245- nir/lower_io_to_scalar: add new_component temporary variable 3246- nir/lower_io_to_scalar: don't create output stores that have no effect 3247- nir: add nir_opt_vectorize_io, vectorizing lowered IO 3248- glsl/linker: vectorize lowered IO 3249- nir: add a NIR option flag nir_io_prefer_scalar_fs_inputs 3250- ac/nir/cdna: allow 16-bit coordinates 3251- ac/nir/cdna: ignore image_descriptor intrinsics 3252- ac/nir/cdna: don't use image_descriptor intrinsics if the src is a descriptor 3253- mesa: switch remaining shader functions from SHA1 to BLAKE3 3254- radeonsi: replace shader SHA1 hashes with BLAKE3 3255- radeonsi: don't use CP DMA on GFX940 3256- nir: rename ordered_xfb_counter_add_gfx12_amd -> ordered_add_loop_gfx12_amd 3257- ac/nir: remove sleeps from gfx12 streamout code 3258- ac/llvm: remove s_nop from ordered_add_loop_gfx12_amd 3259- ac/llvm: fix inline assembly register constraints for ordered_add_loop_gfx12_amd 3260- as/llvm: add s_nops before the ordered add loop and s_wait_alu workaround 3261- radeonsi: implement nir_intrinsic_load_ssbo_address 3262- radeonsi: expose internal buffer bindings to compute shaders 3263- radeonsi/gfx12: always set BO metadata, not just during export 3264- radeonsi/gfx12: fix compute register settings for global_atomic_ordered_add 3265- ac/surface: finish display DCC for gfx11.5 3266- ac/surface: finish display DCC for gfx12 3267- radeonsi: add fail_if_slow parameter into compute_clear/copy_buffer 3268- radeonsi: use a hash_table and define a shader key for the DMA compute shader 3269- radeonsi: add dwords_per_thread parameter into si_compute_clear_copy_buffer 3270- radeonsi: clear buffers with a 12B clear value by clearing 4 dwords per thread 3271- radeonsi: rewrite the clear/copy_buffer microbenchmark 3272- radeonsi/ci: update gfx11 failures 3273- radeonsi: replace si_shader::scratch_bo with scratch_va, don't set it on gfx11+ 3274- radeonsi: don't update compute scratch if the compute shader doesn't use it 3275- ac: add radeon_info::has_scratch_base_registers 3276- radeonsi: lock a mutex when updating scratch_va for compute shaders 3277- util: make util_idalloc_exists private 3278- util: don't use variable names that can appear in args of idalloc foreach macros 3279- util: add util_idalloc_sparse, solving the excessive virtual memory usage 3280- mesa: switch ID allocation to util_idalloc_sparse to reduce virtual memory usage 3281- nir/opt_algebraic: use fmulz for fpow lowering to fix incorrect rendering 3282- radeonsi/gfx12: fix a GPU hang due to an invalid packet with window rectangles 3283- radeonsi: ensure TC_L2_dirty is set if we don't sync after internal SSBO blits 3284- radeonsi: fix buffer coherency issues on gfx6-8,12 due to missing PFP->ME sync 3285- radeonsi/gfx12: fix register programming to fix GPU hangs 3286- radeonsi/gfx12: fix VS output corruption with streamout 3287- ac/surface/gfx12: turn off HiZ for pre-production samples 3288 3289Mark Burton (1): 3290 3291- gallivm: Fix compilation errors when using LLVM 13. 3292 3293Mark Collins (21): 3294 3295- vdrm: Add fixed VA parameter for mapping memory 3296- tu: Handle VkDeviceMemory BO unmapping in VkUnmapMemory 3297- tu: Implement VK_EXT_map_memory_placed 3298- docs/features: Add VK_EXT_map_memory_placed 3299- tu/shader: Allow LRZ when write pos with explicit early frag test 3300- tu/lrz: Emit GRAS_LRZ_CNTL2 on A7XX 3301- tu/lrz: Use actual CHIP rather than hardcoding A6XX 3302- fd/a7xx: Initialize magic register 8C34 to 0 3303- fd/a7xx: Initialize magic register 8008 to 0 3304- tu: Allow LRZ on A7XX 3305- tu/lrz: Add structure for LRZ FC layout 3306- tu: Update LRZ FC allocation for A7XX layout 3307- tu: Update LRZ FC dirty clear for A7XX 3308- tu: Specify LRZ FC depth clear value on A7XX 3309- tu: Enable LRZ fast-clear for A7XX 3310- fd/a7xx: Document \`LRZ_FLIP_BUFFER` event 3311- docs/freedreno: Add documentation on A7XX LRZ 3312- tu: Emit GRAS_LRZ_DEPTH_BUFFER_INFO correctly 3313- tu/kgsl: Spin unti KGSL reports queue timestamp during profiling 3314- tu/kgsl: Fix profiling buffer GPU IOVA 3315- fd/meson: Only build 'ds' when system has DRM 3316 3317Martin Krastev (2): 3318 3319- svga: convert license block to SPDX 3320- svga: update timespan in copyright message 3321 3322Martin Roukala (né Peres) (9): 3323 3324- ci/b2c: Reduce the length of the kernel cmdline 3325- nvk+zink/ci: rename the ga106 jobs to be more in line with RADV 3326- nvk+zink/ci/ga106: make the expectations codename-specific 3327- nvk+zink/ci: document more flakes in the ga106 3328- turnip/ci: document a missing flake from the a750_vk job 3329- turnip/ci: bump the a750_vk timeout 3330- turnip+zink/ci: add more flakes to the expectations 3331- radv+zink/ci: document recent flakes 3332- radv/ci: add a bunch of flakes 3333 3334Mary Guillemard (86): 3335 3336- nak: Pass has_mod to all form of src2 requiring it 3337- panvk: Ensure we lower load_base_workgroup_id to 0 3338- panfrost: Skip new failure from VKCTS 1.3.8.x 3339- nvk, nak: Wire up conservative rasterization underestimate 3340- docs/features: Add EXT_conservative_rasterization for NVK 3341- agx: speed-up dce 3342- panvk: Only clear UBOs descriptors when set isn't present 3343- nouveau: nvidia_header: Add AMPERE_B class generation 3344- nak: Set SPH version to 4 on SM75+ 3345- nak: Migrate sph.rs to use SPH headers defintion 3346- bi: Reformat code 3347- midgard: Reformat code 3348- bi: Alloc replacement array once in opt_cse 3349- pan/lib, panvk: Ensure data_size is on 64 bits 3350- panvk: Fix shader destruction when vk_shader_module_to_nir fail 3351- panvk: Remove panvk_lower_blend 3352- panvk: Remove dynarray from panvk_shader 3353- panvk: Keep panvk_shader alive in panvk_pipeline_shader 3354- panvk: Upload shader in panvk_shader 3355- panvk: Upload copy tables in panvk_shader 3356- panvk: Upload render state in panvk_shader 3357- panvk: Move the linking bits to panvk_shader 3358- panvk: Kill panvk_pipeline_shader and use panvk_shader directly 3359- panvk: Link shaders at draw time 3360- panvk: Move compile logic out of shader_create 3361- panvk: Move NIR lower logic out of shader_create 3362- panvk: Move preprocess logic out of shader_create 3363- panvk: Implement vk_shader 3364- panvk: Remove panvk_pipeline 3365- pan/va: Ensure no clash with other defs in disassembler 3366- bi: Make disassembler take a const void* 3367- midgard: Make disassembler take a const void* 3368- bi: Move bi_disasm definitions to their own header 3369- panfrost: Add pan_shader_disassemble 3370- panvk: Implement executable IR reporting 3371- panvk: Advertise VK_KHR_pipeline_executable_properties 3372- panvk: Generate proper device and driver UUIDs 3373- panvk: Advertise VK_EXT_pipeline_creation_cache_control and VK_EXT_pipeline_creation_feedback 3374- panvk: Advertise VK_EXT_shader_module_identifier 3375- panvk: Advertise VK_KHR_pipeline_library and VK_EXT_graphics_pipeline_library 3376- panvk: Enable pipeline library in CI for Mali-G52 3377- docs: Update features.txt to add panvk for BDA extensions 3378- panvk: Advertise VK_KHR_device_group and VK_KHR_device_group_creation 3379- panvk: Reorder extensions by name 3380- panvk: Advertise VK_KHR_maintenance3 3381- panvk: Add missing null check in DestroyCommandPool 3382- panvk: Add missing clean up in blend_shader_cache_init 3383- panvk: Make mempool detect NULL BOs 3384- panvk: Check for maxBufferSize in panvk_CreateBuffer 3385- panvk: Make panvk_kmod_zalloc use correct allocation scope on non-transient 3386- panvk: Ensure to unref transient bo in reset for mempools 3387- panvk: Fix device mempool leaks 3388- panvk: Add more allocation checks in create_device 3389- panvk: Implement CmdDispatchBase 3390- panvk: Enable device_init, null_handle and object_management in CI for Mali-G52 3391- panvk: Advertise shaderModuleIdentifier feature 3392- panvk: Report correct min value for discreteQueuePriorities 3393- panvk: Enable dEQP-VK.info tests in CI for Mali-G52 3394- panvk: Clamp viewport scissor to valid range 3395- panvk: Enable offscreen_viewport tests in CI for Mali-G52 3396- panvk: Skip dispatch on empty workgroup 3397- panvk: Report proper workgroup invocation and size 3398- panvk: Enable compute pipeline in CI for Mali-G52 3399- panvk: Advertise VK_EXT_private_data 3400- panvk: Do not emit blend shader when color_mask is 0 3401- panvk: Run nir_lower_io_to_vector for fragment shader 3402- panvk: Enable glsl.440.linkage in CI for Mali-G52 3403- panvk: Implement and advertise anisotropy support 3404- panvk: Advertise VK_KHR_sampler_mirror_clamp_to_edge 3405- panvk: Enable texture filtering in CI for Mali-G52 3406- pan/kmod: Avoid deadlock on VA allocation failure on panthor 3407- panfrost: Handle context_init errors correctly 3408- panfrost: Handle gracefully resource BO alloc failures 3409- ci/panfrost: Update t760 fails 3410- rusticl: Add panthor when panfrost is present in RUSTICL_ENABLE 3411- bi: Clean up mem_access_size_align_cb 3412- bi: Enable lower_pack_64_4x16 3413- bi: Lower pack_32_4x8_split and pack_32_2x16_split in algebraic 3414- bi: Enable lower_pack pass in compiler 3415- bi: Implement basic 8-bit vec support 3416- panfrost: Rewrite set_global_binding to make resources truly global 3417- panfrost: Do not recreate bo if already mapped 3418- panfrost: Increase address space to 48-bit 3419- panfrost: Fetch available system memory 3420- panvk: Fix image support in vertex jobs 3421- panvk: Pass attrib_buf_idx_offset to desc_copy_info 3422 3423MastaG (1): 3424 3425- gallivm: Call StringMapIterator from llvm:: scope 3426 3427Matt Coster (1): 3428 3429- docs: List VK_EXT_debug_utils 3430 3431Matt Turner (8): 3432 3433- intel: Build float64 shader only for Vulkan 3434- intel/clc: Free parsed_spirv_data 3435- intel/clc: Free disk_cache 3436- intel/brw: Use REG_CLASS_COUNT 3437- intel/elk: Use REG_CLASS_COUNT 3438- docs: Drop references to LIBGL_DRIVERS_PATH 3439- util: Add ATTRIBUTE_OPTIMIZE(flags) 3440- util: Force emission of stack frame in stack unit test 3441 3442Mauro Rossi (1): 3443 3444- intel/common: fix building error in intel_common.c 3445 3446Maíra Canal (7): 3447 3448- v3dv: Use errno when logging an error to stderr 3449- drm-uapi: Update v3d_drm.h 3450- broadcom/common: Add maximum number of perf counters to v3d_device_info 3451- v3dv: Use DRM_IOCTL_V3D_GET_COUNTER to get perfcnt information 3452- v3d: Use DRM_IOCTL_V3D_GET_COUNTER to get perfcnt information 3453- broadcom/simulator: Add DRM_V3D_PARAM_MAX_PERF_COUNTERS parameter support 3454- broadcom/simulator: Add DRM_IOCTL_V3D_GET_COUNTER to simulator 3455 3456Michel Dänzer (4): 3457 3458- wsi/wayland: Dispatch event queue in wsi_wl_swapchain_queue_present 3459- wsi: Call drmSyncobjQuery only once for all images 3460- egl/dri: Use packed pipe_format 3461- dri: Go back to hard-coded list of RGBA formats 3462 3463Mike Blumenkrantz (162): 3464 3465- glthread: check for invalid primitive modes in DrawElementsBaseVertex 3466- zink: reconstruct features pnext after determining extension support 3467- zink: prune zink_shader::programs under lock 3468- zink: fully wait on all program fences during ctx destroy 3469- kopper: fix bufferage/swapinterval handling for non-window swapchains 3470- zink: slightly better swapinterval failure handling 3471- kopper: don't set drawable buffer age 3472- zink: handle swapchain currentExtent special value 3473- zink: clean up accidental debug print 3474- dri: rename 'implicit' param from earlier series 3475- tu: support VK_EXT_legacy_vertex_attributes 3476- llvmpipe: add KHR-Single-GL45.arrays_of_arrays_gl.AtomicUsage skip 3477- ci: disable lavapipe-vk-asan job 3478- lavapipe: VK_EXT_legacy_vertex_attributes 3479- zink: clamp buffer_indices_hashlist resets to used region 3480- zink: delete GS conditional in update_so_info 3481- zink: use zink_shader_key_optimal unions for pipeline state asserts 3482- zink: use info.fs.uses_sample_qualifier instead of manual scan 3483- zink: simplify confusing return in rewrite_tex_dest 3484- zink: simplify flagging legacy shadow samplers 3485- zink: rename zink_shader variable in create functions 3486- zink: break out shadow sampler scanning 3487- zink: always block the precompile threads when pruning shaders 3488- zink: more effectively synchronize separate shader program precompiles 3489- zink: use zink_shader type directly in zink_create_gfx_shader_state() 3490- zink: split shader create into 2-stage functions 3491- zink: reorder precompile_separate_shader_job() in file 3492- zink: split generated tcs creation into 2-stage functions 3493- zink: move gfx shader init to thread 3494- zink: reorder some code in zink_create_gfx_program() 3495- zink: reorder fencing in zink_create_gfx_program() 3496- zink: split gfx program creation into 2-stage functions 3497- zink: precompile_job() -> gfx_program_precompile_job() 3498- zink: move blocking gfx program init functions to thread 3499- ci: disable g52 3500- egl/x11: disable dri3 with LIBGL_KOPPER_DRI2=1 as expected 3501- zink: add a batch ref for committed sparse resources 3502- u_blitter: stop leaking saved blitter states on no-op blits 3503- freedreno/replay: use inttypes format string for 64bit 3504- zink: delete unused zink_batch struct member 3505- zink: move in_rp to zink_context 3506- zink: move ref_lock from zink_batch to zink_batch_state 3507- zink: move has_work from zink_batch to zink_batch_state 3508- zink: rename last_was_compute -> last_work_was_compute 3509- zink: move last_work_was_compute from zink_batch to zink_context 3510- zink: move work_count from zink_batch to zink_context 3511- zink: move swapchain from zink_batch to zink_context 3512- zink: rename zink_batch::state -> zink_batch::bs 3513- zink: delete all zink_batch uses from zink_query.c 3514- zink: remove zink_batch usage from zink_clear.c 3515- zink: remove all uses of zink_batch from zink_batch.c 3516- zink: remove all zink_batch usage from zink_resource.h 3517- zink: remove all zink_batch usage from zink_draw.cpp 3518- zink: remove all zink_batch usage from zink_render_pass.c 3519- zink: remove all zink_batch usage from zink_context.c 3520- zink: delete zink_batch 3521- zink: zink_batch_state::has_barriers -> has_reordered_work 3522- zink: reset all the has_work flags in the same place 3523- zink: check all has_work flags for flushes 3524- zink: rely on zink_get_cmdbuf() to set has_work flags 3525- zink: flag has_work in a few more places 3526- zink: stop flagging has_work on batch tracking 3527- zink: don't submit main cmdbuf if has_work is not set 3528- frontends/dri: only release pipe when screen init fails 3529- frontends/dri: always init opencl_func_mutex in InitScreen hooks 3530- zink: use u_minify for sparse calcs 3531- zink: always commit full miptails 3532- zink: refcount miptails 3533- zink: clean up semaphore arrays on batch state destroy 3534- zink: add a batch array for tracked semaphores 3535- zink: stop leaking sparse semaphores 3536- zink: rework sparse semaphore waits 3537- ci: bump VVL to snapshot-2024wk19 3538- zink: hook up VK_EXT_legacy_vertex_attributes 3539- zink: set all spirv caps for the vvl vtn pass 3540- ci: bump VVL to v1.3.285 3541- zink: make unassigned io variables unreachable 3542- zink: minor tweaks to shader io assignment 3543- zink: outdent assign_producer_var_io() 3544- zink: outdent assign_consumer_var_io() 3545- zink: pass a struct through io assignment functions 3546- zink: track masks of io locations used during linking 3547- zink: unify io assignment 3548- zink: move 'reserved' into io assign struct 3549- zink: split slot map between regular varyings and patch 3550- zink: ci updates 3551- egl/dri2: fix error returns on dri2_initialize_x11_dri3 fail 3552- nir/lower_aaline: fix for scalarized outputs 3553- nir/linking: fix nir_assign_io_var_locations for scalarized dual blend 3554- lavapipe: split out DGC into separate file 3555- lavapipe: plumb print_cmds through NV DGC 3556- lavapipe: lvp_indirect_command_layout -> lvp_indirect_command_layout_nv 3557- zink: remove dgc debug mode 3558- zink: add atomic image ops to the ms deleting pass 3559- build/amd: add amd-use-llvm build option 3560- ir3: flag progress from nir_lower_io_to_scalar 3561- ir3: assert that no further optimizations can be done if !progress 3562- gallium: add drawid_offset to draw_mesh_tasks interface 3563- gallium: stop dropping drawid_offset param with util_draw_indirect 3564- vulkan: Update XML and headers to 1.3.287 3565- zink: add HKP to tiler mode switch 3566- lavapipe: fix mesh+task binding with shader objects 3567- mesa/st: fix zombie shader handling for non-current programs 3568- zink: null check pipe loader config before use 3569- zink: split out msaa replication 3570- zink: implement msaa replication with dynamic rendering 3571- radeonsi: enable compute pbo blits 3572- ci: kill filament trace globally 3573- zink: add a driver workaround to disable 2D_VIEW_COMPATIBLE+sparse 3574- zink: free sparse page for miptail on uncommit 3575- zink: remove adreno from broken_cache_semantics driver workaround 3576- egl: deduplicate MESA_image_dma_buf_export enablement 3577- egl: only enable MESA_image_dma_buf_export with PIPE_CAP_DMABUF 3578- lavapipe: maint7 3579- st/pbo: fix MESA_COMPUTE_PBO=spec crash on shutdown 3580- st/pbo_compute: special case stencil extraction from Z24S8 3581- mesa/st: use compute pbo download for readpixels 3582- ci: bump vvl to v1.3.289 3583- zink: add an a750 skip 3584- zink: enable compute pbos for turnip 3585- aux/tc: update docs to indicate replaced buffers have multiple pipe_resources 3586- zink: don't lower fpow 3587- zink: propagate valid buffer range to real buffer when mapping staging 3588- zink: track the "real" buffer range from replacement buffers 3589- zink: modify some buffer mapping behavior for buffer replacement srcs 3590- zink: move all driverID checks to a helper function 3591- zink: hook up maintenance7 3592- zink: use maint7 to capture venus driver and more accurately use workarounds 3593- mesa/st: load state params for feedback draws with allow_st_finalize_nir_twice 3594- egl/x11/sw: fix partial image uploads 3595- egl/x11/sw: plug in swap_buffers_with_damage handling 3596- winsys/radeon: take the full winsys struct in radeon_get_drm_value() 3597- winsys/radeon: wrap fd access with util function 3598- winsys/radeon: switch to rendernode when card node doesn't work 3599- winsys/radeon: revert recent changes 3600- glx: directly link to gallium 3601- egl: link with libgallium directly 3602- gbm: link directly with libgallium 3603- loader: delete loader_open_driver() 3604- loader/dri3: check xfixes version in loader_dri3_open() 3605- loader/dri3: avoid killing the xcb connection if dri3 not found 3606- loader/glx: move multibuffers check to loader 3607- egl: use loader's multibuffer check to deduplicate lots of code 3608- vl/dri3: use loader's dri3 init code and delete everything else 3609- zink: move image aoa access to nir pass 3610- zink: use PIPE_CAP_NIR_SAMPLERS_AS_DEREF 3611- gallium: install gallium-$version.so to libdir 3612- ci: prune dri from LD_LIBRARY_PATH 3613- dril: rework config creation 3614- llvmpipe: handle vma allocation failure 3615- llvmpipe: only use vma allocations on linux 3616- dri: fix kmsro define 3617- Revert "vl/dri3: use loader's dri3 init code and delete everything else" 3618- glx: include src/gallium for apple 3619- dri: link with libloader 3620- kopper: check swapchain size after possible loader image resize 3621- pipe-loader: fix driconf memory management 3622- dril: always take the egl init path 3623- egl: fix zink init 3624- dri: fix kms_swrast screen fail 3625- egl/wayland: bail on zink init in non-sw mode if extension check fails 3626- zink: fix partial update handling 3627 3628Mike Lothian (2): 3629 3630- radeonsi,aco: Run ac_nir_lower_global_access pass 3631- ac/llvm: Remove global access ops handling 3632 3633Mingcong Bai (2): 3634 3635- meson: set default drivers for ppc, ppc64 3636- meson: set default Vulkan drivers for ppc, ppc64 3637 3638Mohamed Ahmed (4): 3639 3640- nil: Add a nil_image::compressed bit 3641- nil: Add some helpers for DRM format modifiers 3642- nil: Support creating images with DRM modifiers 3643- nvk: enable rendering to DRM_FORMAT_MOD_LINEAR images 3644 3645Mykhailo Skorokhodov (2): 3646 3647- egl/wayland: Fix sRGB format look up for config 3648- ci/lima: expect fail of window_8888_colorspace_srgb on wayland 3649 3650Nanley Chery (29): 3651 3652- intel/isl: Add and use _isl_surf_info_supports_ccs 3653- intel/isl: Reduce halign for disabled CCS on XeHP 3654- intel/isl: Update quote for XeHP's CCS halign rule 3655- intel/isl: Allow sampling from 3D HIZ_CCS_WT 3656- intel/blorp: Factor bpb into the fast-clear rect 3657- intel/blorp: Allow gfx12 fast-clears without CCS surf 3658- intel/isl: Add and use ISL_DRM_CC_PLANE_PITCH_B 3659- anv: Refactor modifier plane layout queries 3660- intel/aux_map: Add and use INTEL_AUX_MAP_MAIN_PITCH_SCALEDOWN 3661- intel/aux_map: Add and use INTEL_AUX_MAP_META_ALIGNMENT_B 3662- intel/aux_map: Add and use INTEL_AUX_MAP_MAIN_SIZE_SCALEDOWN 3663- intel/isl: Add and use ISL_MAIN_TO_CCS_SIZE_RATIO_XE 3664- intel/isl: Add and use multi-engine surf usage bits 3665- iris: Simplify bo import in memobj_create_from_handle 3666- intel/isl: Assert alignments of surface addresses 3667- anv: Rely on the primary surf usage to disable aux 3668- anv,hasvk: Drop anv_get_isl_format_with_usage 3669- anv: Support multiple aspects in anv_formats_ccs_e_compatible 3670- anv: Rely more on ISL_SURF_USAGE_DISABLE_AUX_BIT 3671- anv: Restrict CCS ISL surface creation to gfx9-11 3672- iris: Add and use comp_ctrl_surf_offset on gfx12 3673- intel/isl: Drop support for the gfx12 CCS ISL surf 3674- intel/isl: Add and use isl_drm_modifier_needs_display_layout 3675- iris,anv: Disable gfx12.0 fast-clears with unaligned pitch 3676- intel/isl: Consolidate some tiling checks for CCS 3677- intel/isl: Require display flag for 512B pitch alignment 3678- intel/isl: Pad the pitch on gfx12.0 for fast-clears 3679- anv+zink/ci: Change sparse test result from crash to fail 3680- intel/isl: Enable Tile4 for CPB surfaces 3681 3682Natanael Copa (1): 3683 3684- nir/opt_varyings: reduce stack usage 3685 3686Neha Bhende (2): 3687 3688- svga: Retrieve stride info from hwtnl->cmd.vdecl for swtnl draws 3689- dri: fix macro name check to detect svga driver 3690 3691Oskar Viljasaar (8): 3692 3693- vulkan/properties: support Android in the property generator 3694- v3dv: constify arguments of vendor/device id getters 3695- v3dv: Use common runtime vk_properties 3696- vulkan/properties: Document RENAMED_PROPERTIES in the property generator 3697- anv: Move completely over to common runtime GetPhysicalDeviceProperties2 3698- hasvk: switch to use runtime physical device properties infrastructure 3699- vulkan: add a property struct setter function 3700- venus: Use common physical device properties 3701 3702Patrick Lerda (8): 3703 3704- gallium/auxiliary/vl: fix typo which negatively impacts the src_stride initialization 3705- clover: fix pipe_box update regression 3706- clover: fix memory leak related to optimize 3707- r600: fix vertex state update clover regression 3708- mesa/main: fix stack overflow related to the new mipmap code 3709- radeonsi: fix assert triggered on gfx6 after the tessellation update 3710- clover: fix meson opencl-spirv option 3711- st/pbo_compute: fix async->nir memory leak 3712 3713Paulo Zanoni (31): 3714 3715- isl: add ISL_TILING_64_XE2 to isl_tiling_to_name() 3716- anv/sparse: add the MSAA block shape tables 3717- anv/sparse: we can't do multi-sampled depth/stencil sparse images 3718- anv/sparse: properly reject sample counts we don't support 3719- anv/sparse: reject all sample flags that non-sparse doesn't support 3720- anv/sparse: fix block_size_B when the image is multi-sampled 3721- anv/sparse: exclude Xe2's Tile64's non-standard block shapes 3722- anv/sparse: flush the tile cache when resolving sparse images 3723- anv/sparse: enable MSAA for Sparse when applicable 3724- anv: check for VK_RENDERING_SUSPENDING_BIT once at CmdEndRendering 3725- anv+zink/ci: add failures related to multi-sampled sparse binding 3726- anv/sparse: assert a format can't be standard and non-standard 3727- anv/xe: fix declaration of memory flags for integrated non-LLC platforms 3728- anv/sparse: reject 1D sparse residency images 3729- anv/sparse: fix the image property sizes for multi-sampled images 3730- anv/sparse: fix reporting of VK_SPARSE_IMAGE_FORMAT_SINGLE_MIPTAIL_BIT 3731- intel/isl: pass struct isl_tile_info to choose_image_alignment_el() 3732- anv/sparse: dump info about opaque binds when DEBUG_SPARSE 3733- anv/sparse: fix TR-TT page table bo size and flags 3734- anv/sparse: remove obsolete linear tiling code path 3735- anv/sparse: unify and rework tile size calculation 3736- anv/sparse: use ANV_SPARSE_BLOCK_SIZE instead of tile_size when possible 3737- anv: properly store the engine_class_supported_count values 3738- anv: LNL+ doesn't need the special flush for sparse 3739- anv: reimplement the anv_fake_nonlocal_memory workaround 3740- iris: fix iris_xe_wait_exec_queue_idle() on release builds 3741- anv/trtt: fix the process of picking device->trtt.queue 3742- anv/xe: try harder when the vm_bind ioctl fails 3743- anv: don't expose the compressed memory types when DEBUG_NO_CCS 3744- anv: disable CCS for Source2 games on Xe2 3745- intel: fix compute SLM sizes on Xe2 and newer 3746 3747Pavel Ondračka (13): 3748 3749- r300: replace constant size field with usemask 3750- r300: move dead constants pass earlier for vertex shaders 3751- r300: switch to a new constant remap table format 3752- r300: compact scalar uniforms into empty slots 3753- r300: better packing for immediates 3754- r300/ci: fails list update 3755- r300: fix cycles counting for KIL 3756- r300: fix writemask rewrite when converting to omod 3757- r300: fix for ouput modifier and DDX/DDX 3758- r300: fix RC_OMOD_DIV_2 modifier 3759- r300: missing whitespace in shader stats 3760- r300: vectorization tweaks for R300/R400 3761- r300: bias presubtract fix 3762 3763Philipp Zabel (7): 3764 3765- etnaviv: drm: Suppress get-param error message for non-existent core 3766- etnaviv: drm: Stop after model query failure 3767- etnaviv: Pass npu to etna_screen_create in a separate parameter 3768- etnaviv: Add a separate NPU pipe 3769- etnaviv: Allow collecing both GPU and NPU specs 3770- etnaviv/nn: Pipe through input/accumulation buffer depth from hwdb 3771- etnaviv: update headers from rnndb 3772 3773Pierre-Eric Pelloux-Prayer (34): 3774 3775- radeonsi/sqtt: use ac_sqtt_get_shader_mask for spm counters 3776- radeonsi/sqtt: cleanup si_sqtt_add_code_object a bit 3777- radeonsi/sqtt: support sqtt buffer auto-resizing 3778- radeonsi: add new si_shader_binary_upload_at method 3779- radeonsi/sqtt: use si_shader_binary_upload_at to reupload shaders 3780- radeonsi: allocate sqtt and spm buffers in GTT 3781- radeonsi: add testmemperf mem bandwidth test 3782- radv/sqtt: use radeon_check_space before emit_spm_* 3783- radeonsi: use the common SQTT implementation 3784- radeonsi/sqtt: add AMD_THREAD_TRACE_INSTRUCTION_TIMING 3785- ac/sqtt: make VA helpers static 3786- ac/llvm: implement WA in nir to llvm 3787- radeonsi: store the total binary size in si_shader 3788- radeonsi: handle DBG(TEX) after tc_compatible_htile is set 3789- radeonsi/tests: don't match gfx10_3 baseline for gfx10 family 3790- radeonsi/tests: add a shortcut to re-run only failing tests 3791- ac/surface: reject modifiers with retile_dcc and bpe != 32 3792- radeonsi: add gfx11 workaround for upgraded_depth 3793- ac/nir: don't use the compute blit for PIPE_FORMAT_R5G6B5_UNORM 3794- radeonsi/tests: update tests baseline 3795- radeonsi/tests: clarify the output when results changes 3796- radeonsi: fix buffer_size in si_compute_shorten_ubyte_buffer 3797- Revert "ac, radeonsi: remove has_syncobj, has_fence_to_handle" 3798- ac/info: remove has_syncobj 3799- winsys/radeon: fill lds properties 3800- radeonsi: fix crash in si_update_tess_io_layout_state for gfx8 and earlier 3801- radeonsi/tests: correctly parse the family name 3802- radeonsi: fix ac_create_shadowing_ib_preamble parameter 3803- radeonsi, radv: bump libdrm_amdgpu version requirement 3804- ci: bump Fedora and Android libdrm2 to 2.4.122 3805- radeonsi: fix si_get_dmabuf_modifier_planes for gfx12 3806- frontends/dri: add error logs to dri2_create_image_from_fd 3807- amd: use a valid size for ac_pm4_state allocation 3808- egl,gbm,glx: fix log message spam 3809 3810Qiang Yu (8): 3811 3812- glsl: respect GL_EXT_shader_image_load_formatted when image is embedded in a struct 3813- radeonsi: add missing nir_intrinsic_bindless_image_descriptor_amd 3814- nir: fix lower array to vec metadata preserve 3815- nir: fix clip cull distance lowering metadata preserve 3816- nir: add filter parameter to nir_lower_array_deref_of_vec 3817- nir: nir_vectorize_tess_levels support indirect access 3818- nir: consider more deref types when fixup deref 3819- glsl: fix indirect tess factor access for compact_arrays=false drivers 3820 3821Rebecca Mckeever (9): 3822 3823- panvk: Add jm and bifrost dirs 3824- panvk: Add push_uniform/constant helpers 3825- panvk: Make helper functions panvk_cmd_buffer agnostic 3826- panvk: Move panvk_descriptor_state to bifrost subdir 3827- panvk: Move vkCmdDraw* functions to their own file 3828- panvk: Move vkCmdDispatch* functions to their own file 3829- panvk: Move vkCmd*Event functions to their own file 3830- panvk: Add Valhall DescriptorSetLayout implementation 3831- panvk: Add Valhall Descriptor{Set,Pool} implementations 3832 3833Renato Pereyra (2): 3834 3835- anv: Attempt to compile all pipelines even after errors 3836- intel/perf: Move sysmacros.h include from header to implementation 3837 3838Rhys Perry (95): 3839 3840- aco/tests: add tests for hidden breaks/continues 3841- aco/tests: add tests for divergent merge phi with undef 3842- nir/dead_cf: stop reindexing blocks for each non-block cf node 3843- aco/stats: fix s_waitcnt parsing 3844- aco/stats: don't use VS counter pre-GFX10 3845- aco/waitcnt: fix DS/VMEM ordered writes when mixed 3846- aco: make wait_imm indexable 3847- aco/waitcnt: add target_info 3848- aco/waitcnt: refactor for indexable wait_imm 3849- aco/stats: refactor for indexable wait_imm 3850- aco: add wait_imm::unpack and wait_imm::max 3851- radv: keep track of unaligned dynamic vertex access 3852- aco: form hard clauses in VS prologs 3853- aco: copy VS prolog constants after loads 3854- aco: support VS prologs with unaligned access 3855- aco/util: improve small_vec assertion 3856- radv: advertise VK_EXT_legacy_vertex_attributes 3857- aco: don't count certain pseudo towards VMEM_STORE_CLAUSE_MAX_GRAB_DIST 3858- aco/tests: support GFX12 3859- aco: add SFPU/ValuPseudoScalarTrans instr class 3860- aco: add GFX11.5+ opcodes 3861- aco: support GFX12 in assembler 3862- aco/tests: add GFX12 assembler tests 3863- aco: don't change prefetch mode on GFX11.5+ 3864- aco/gfx12: disable s_cmpk optimization 3865- aco: add GFX12 wait counters 3866- aco/waitcnt: support GFX12 in waitcnt pass 3867- aco/stats: support GFX12 in collect_preasm_stats() 3868- aco: update VS prolog waitcnt for GFX12 3869- aco/lower_phis: create loop header phis for non-boolean loop exit phis 3870- aco: create lcssa phis for continue_or_break loops when necessary 3871- aco: use scalar phi lowering for lcssa workaround 3872- aco: remove nir_to_aco 3873- aco/lower_phis: don't create boolean loop header phis in some situations 3874- radv: malloc graphics pipeline stages 3875- aco: support GFX12 in insert_NOPs 3876- aco/gfx12: implement subgroup shader clock 3877- aco/gfx12: implement workgroup barrier 3878- aco/gfx12: sign-extend s_getpc_b64 3879- aco/gfx12: don't create v_fmac_legacy_f32 3880- aco/gfx12: use ttmp9/ttmp7 for workgroup id 3881- radv/gfx12: don't add workgroup id shader args 3882- aco/gfx12: remove MIMG vector affinity 3883- aco/gfx12: decrease max_nsa_vgprs for VSAMPLE 3884- aco/gfx12: disallow SCC and most constants for BUF SOFFSET 3885- aco: fix fddx/y with uniform inf/nan input 3886- meson: remove --depfile for aco_tests 3887- ac/llvm: implement load_subgroup_id 3888- aco/gfx12: implement load_subgroup_id 3889- ac/nir: skip subgroup_id/local_invocation_index lowering for gfx12 3890- aco/gfx12: fix s_wait_event immediate 3891- aco: don't combine vgpr into writelane src0 3892- aco: implement nir_atomic_op_ordered_add_gfx12_amd 3893- aco: implement nir_intrinsic_nop_amd and nir_intrinsic_sleep_amd 3894- ac/nir: support lowering of sub-dword push constants 3895- radv: lower sub-dword push constants 3896- ac/llvm: remove support for sub-dword push constants 3897- aco: remove support for sub-dword push constants 3898- aco/gfx6: set glc for buffer_store_byte/short 3899- aco: inline store_vmem_mubuf/emit_single_mubuf_store 3900- aco: use ac_hw_cache_flags 3901- aco: use GFX12 scope/temporal-hint 3902- ac: stop using radeon_info for ac_get_hw_cache_flags 3903- aco: use ac_get_hw_cache_flags() 3904- aco: remove some missing label resets 3905- nir/opt_loop: rematerialize derefs instead of creating phis 3906- nir/opt_loop: fix formatting 3907- aco: insert s_nop before discard early exit sendmsg(dealloc_vgpr) 3908- radv: lower push constants in NIR 3909- ac/llvm: remove push constants 3910- aco: remove push constants 3911- aco/insert_exec_mask: ensure top mask is not a temporary at loop exits 3912- vtn: ensure TCS control barriers have a large enough memory scope 3913- aco: use 1.5x vgprs for gfx1151 and gfx12 3914- aco: skip continue_or_break LCSSA phis when not needed 3915- aco: use s_pack_ll_b32_b16 for pack_32_2x16_split 3916- aco: combine extracts into s_pack_ll_b32_b16 3917- aco: use s_pack_*_b32_b16 more in p_insert/p_extract lowering 3918- aco: turn split(vec()) into p_parallelcopy instead of p_create_vector 3919- aco: add missing isConstant()/isTemp() checks 3920- aco: fix follow_operand with combined label_extract and label_split 3921- aco: use alignment information in visit_load_constant() 3922- aco: fix wmma raw hazard 3923- aco: replace constant v_bfrev_b32 with v_mov_b32 to create vopd 3924- aco/gfx11: don't use v_bfrev_b32 with wave64 3925- glsl: always lower non-TCS outputs to temporaries 3926- gallium: remove PIPE_CAP_SHADER_CAN_READ_OUTPUTS 3927- nir/linking_helpers: remove special case for read mesh outputs 3928- nir/linking_helpers: remove varying accesses in nir_remove_unused_io_vars 3929- nir/linking_helpers: remove nested IF 3930- radv: remove unecessary nir_remove_unused_varyings cleanup passes 3931- aco/gfx11.5: workaround export priority issue 3932- aco: fix validation of v_s\_ opcodes 3933- docs: update ACO_DEBUG documentation for scheduler options 3934- docs: update ACO_DEBUG documentation for perfwarn 3935 3936Rob Clark (63): 3937 3938- tu: Add missing error path cleanup 3939- tu: Fix a6xx lineWidthGranularity 3940- freedreno/ir3: Skip DAG validation on release builds 3941- llvmpipe: Fix build error with clang-18 3942- freedreno/ci: Switch a618_piglit to deqp-runner 3943- vulkan/android: Add helper to probe AHB support 3944- vulkan: Don't request Ycbcr conversion for rgb 3945- vulkan: Add helper to resolve Android external format 3946- tu: Skip YUV conversion for RGB formats 3947- tu: Support VkExternalFormatANDROID 3948- freedreno/ci: Remove some skips 3949- freedreno/ci: Remove some obsolete skips 3950- freedreno/ci: Refactor out common a6xx skips list 3951- freedreno/ci: Skip unsupported legacy gl stuff 3952- freedreno/ci: Skip max-texture-size 3953- freedreno/ci: Add a common skips file to a618_piglit. 3954- freedreno/ci: Skip built-in-functions VS/GS tests 3955- freedreno/ci: Skip some slow tests 3956- freedreno/ci: Increase a630/a618 piglit fraction 3957- freedreno/ir3: Fix ldg/stg offset 3958- egl/android: Fix sRGB visuals 3959- docs/features: Add missing AHB for tu 3960- tu: Don't advertise AHB handle time on non-android 3961- freedreno: Namespace DEFINE_CAST() 3962- virgl: Update headers 3963- loader: Add better support for virtgpu nctx driver loading 3964- freedreno/loader: Switch over to probe_nctx 3965- vulkan/android: Fix suggestedYcbcrModel with !mapper4 3966- tu: Fix imageview + ahb 3967- vulkan/android: Fix YcbcrRange for !mapper4 3968- ir3: Add some more missing progress accumulation 3969- gallium/tc: Add optional buffer replacement limit 3970- freedreno: Use buffer replacement limit 3971- gallium/tc: Allow replacement if replacing valid_range 3972- freedreno/drm: Add rd dumper support 3973- st/mesa/pbo: Set src type on image_store 3974- freedreno: Handle non-null cb with null buffer 3975- u_blitter+d3d12: Move stencil fallback clear to caller 3976- freedreno/a6xx: Implement S8 support 3977- freedreno: Implement stencil blit fallback 3978- freedreno: Use LINEAR for staging resources 3979- freedreno/a6xx: Drop 16b packed image formats 3980- freedreno/bc: Rework flush order 3981- freedreno/a6xx: Tweak blitter traces 3982- freedreno/a6xx: Skip blitter for L/A conversions 3983- freedreno/a6xx: Add more format swizzles 3984- freedreno/a6xx: Allow blit based transfers 3985- freedreno: Enable the X1-85 3986- tu: Fix issues with 16k (or larger) page sizes 3987- freedreno/drm/virtio: Fix issues with 16k (or larger) page sizes 3988- freedreno/a6xx: Implement reg stomper support 3989- freedreno/a7xx: Fix GRAS_UNKNOWN_80F4 writes 3990- freedreno/cffdec: Fix a7xx CP_EVENT_WRITE decoding 3991- tu/drm/virtio: Add missing a7xx case 3992- freedreno/drm: Handle a7xx case 3993- freedreno: Move GENX/CALLX magic to common 3994- freedreno: Extract out common UBWC helper 3995- freedreno: Extract out shared LRZFC layout helpers 3996- freedreno/a6xx: Allocate lrcfc when needed for direction tracking 3997- freedreno/a6xx: Refactor CP_EVENT_WRITE emit 3998- freedreno/a6xx: Rework CCU_CNTL emit for a7xx 3999- freedreno/a6xx: Initial a7xx support 4000- gallium: Add option to not add version to libgallium filename 4001 4002Robert Mader (3): 4003 4004- egl: Implement EGL_EXT_config_select_group 4005- egl: Implement EGL_MESA_x11_native_visual_id 4006- egl/x11: Allow all RGB visuals to match 32-bit RGBA EGLConfigs 4007 4008Rohan Garg (21): 4009 4010- anv: formatting fix when printing pipe controls 4011- anv: allocate space for generated indirect draw id's using the temporary allocation helper 4012- intel/brw: update Xe2 max SIMD message sizes 4013- Revert "iris: slow clear higher miplevels on single sampled 8bpp resources that have TILE64" 4014- intel/eu/xe2+: Fix src1 length bits of SEND instruction with UGM target. 4015- intel/brw: Advertise fp64 atomic add's when we have 64 bit float support and a LSC 4016- intel/brw: We no longer have atomic fmin/fmax ops for fp64 in xe2 4017- intel/genxml: add the new state byte stride instruction 4018- intel/genxml: update 3DSTATE_CPSIZE_CONTROL_BUFFER for xe2+ 4019- isl: enable compression for CPS buffers on xe2+ 4020- intel/genxml: update CFE_STATE for LNL 4021- intel/genxml: Update XY_BLOCK_COPY_BLT 4022- intel/genxml: update MI_SEMAPHORE_WAIT for Xe2 4023- intel/genxml: Update STATE_COMPUTE_MODE for Xe2 4024- anv: 3D stencil surfaces have fewer layers for higher miplevels 4025- isl: disable CCS for 3D depth/stencil surfaces when WA is applicable 4026- isl: Enable volumetric STC_CCS,HiZ+CCS on gfx12.0 4027- intel/genxml: Add RESOURCE_BARRIER for xe2 4028- intel/compiler: fix shuffle generation on LNL 4029- anv: flag WSI images as scanout images for ISL 4030- anv: reuse existing macro to query for flushes 4031 4032Roland Scheidegger (1): 4033 4034- lavapipe: add option to enable snorm blending 4035 4036Romain Naour (1): 4037 4038- glxext: don't try zink if not enabled in mesa 4039 4040Roman Stratiienko (11): 4041 4042- vulkan/android: Add basic u_gralloc support 4043- vulkan/android: Add common vkGetSwapchainGrallocUsage{2}ANDROID 4044- vulkan/android: Add android buffer classification to vk_image 4045- vulkan/android: Add common helpers for the ANB extension 4046- vulkan/android: Add common helpers for the AHB extension 4047- vulkan/android: Add common vkGetAndroidHardwareBufferPropertiesANDROID 4048- turnip/android: Migrate to common ANB code 4049- v3dv/android: Migrate ANB and AHB to use common helpers 4050- u_gralloc/fallback: Extract modifier from QCOM native_handle 4051- turnip/android: Use DETECT_OS_ANDROID in tu_device 4052- turnip/android: Use DETECT_OS_ANDROID in freedreno_rd_output 4053 4054Romaric Jodin (1): 4055 4056- intel/brw: allocate large table in the heap instead of the stack 4057 4058Ruijing Dong (14): 4059 4060- radeonsi/vcn: add vcn5 encoding interface change 4061- radeonsi/vcn: add vcn5.0 for h264 enc only 4062- radeonsi/vcn: add hevc support for vcn5 4063- radeonsi/vcn: enable decoding in vcn5. 4064- radeonsi/vcn: correct tile_size_bytes_minus1 4065- radeonsi/vcn: add cdef modes for vcn5 encoding 4066- radeonsi/vcn: apply cdef mode to vcn5 4067- radeonsi/vcn: share functions between vcn4/vcn5 4068- frontends/va: parsing uniform_tile_spacing flag 4069- radeonsi/vcn: add header files for vcn5 av1 tile 4070- radeonsi/vcn: enable av1 encoding in vcn5 4071- radeonsi/vcn: enable roi feature for vcn5 4072- radeonsi/vcn: remove tile_config_flag 4073- radesonsi/vcn: update vcn4 tile processing logic 4074 4075Ryan Neph (7): 4076 4077- venus: reclaim signal semaphore feedback resources for wasteful clients 4078- venus: sync headers for VK_EXT_external_memory_acquire_unmodified 4079- venus: enable VK_EXT_external_memory_acquire_unmodified 4080- venus: factor image memory barrier fixes to common implementation 4081- venus: refactor image memory barrier fix storage and conventions 4082- venus: skip barrier fixes as early as possible 4083- venus: chain VkExternalMemoryAcquireUnmodifiedEXT for wsi ownership transfers 4084 4085Rémi Bernon (2): 4086 4087- zink: Add VKAPI_PTR specifier to zink_stub_function_not_loaded. 4088- zink: Add VKAPI_PTR specifier to generated stub functions. 4089 4090Sagar Ghuge (8): 4091 4092- intel/compiler: Fix destination type for CMP/CMPN 4093- intel/disasm: Fix cache load/store disassembly for URB messages 4094- iris: Load 32-bit MMIO PREDICATE register from buffer 4095- intel/compiler: No need to re-type the destination register 4096- intel/fs: Adjust destination register size for untyped atomic on Xe2+ 4097- intel/fs: Adjust destination register size for global atomic on Xe2+ 4098- intel/compiler: Don't use half float param for sample_b 4099- intel/compiler: Add indirect mov lowering pass 4100 4101Samuel Pitoiset (399): 4102 4103- radv: fix image format properties with fragment shading rate usage 4104- docs: Add an alternative way to debug GPU hangs with RADV 4105- radv/rt: add radv_ray_tracing_state_key 4106- radv/rt: pass radv_ray_tracing_state_key to radv_rt_pipeline_compile() 4107- radv/rt: rework the helper that hashes a ray tracing pipeline 4108- radv/ci: add more flakes 4109- radv: simplify DB_Z_INFO.NUM_SAMPLES with null ds target on GFX11 4110- radv: remove bogus VkShaderCreateInfoEXT::flags being 0 assert for compute 4111- radv: simplify radv_emit_primitive_restart_enable() 4112- radv: inline radv_get_pa_su_sc_mode_cntl() in radv_emit_culling() 4113- radv: remove useless DB_Z_INFO.NUM_SAMPLES when emitting the MSAA state 4114- radv: pre-compute VGT_TF_PARAM.DISTRIBUTION_MODE 4115- radv: use the bound GS copy shader when emitting shader objects 4116- radv: add GS copy shader BO to the cmdbuf BO list at bind time 4117- radv: add RT prolog BO to the cmdbuf BO list at bind time 4118- radv: add shaders BO to the cmdbuf BO list at bind time 4119- radv: emit compute pipelines directly from the cmdbuf 4120- radv: precompute compute/task shader register values 4121- radv: clear unwritten color attachments for monolithic PS earlier 4122- radv: compact SPI_SHADER_COL_FORMAT as late as possible 4123- radv: rename col_format_non_compacted to spi_shader_col_format 4124- radv: store cb_shader_mask for fragment shaders and epilogs 4125- radv: add a new dirty state for emitting the color output state 4126- radv/ci: document a recent regression on GFX6-8 4127- radv: split cmdbuf dirty flags into dirty/dirty_dynamic 4128- radv: precompute existing legacy GS register values later 4129- radv: precompute fragment shader register values 4130- radv: precompute mesh shader register values 4131- radv: precompute legacy GS register values 4132- radv: precompute vertex shader register values 4133- radv: precompute DB_SHADER_CONTROL for fragment shaders later 4134- vulkan: Update XML and headers to 1.3.284 4135- aco: add support for remapping color attachments 4136- radv: implement VK_KHR_dynamic_rendering_local_read 4137- radv: advertise VK_KHR_dynamic_rendering_local_read 4138- radv: add a new mechanism for tracking registers per cmdbuf 4139- radv: move common registers between VS/GS and NGG 4140- radv: precompute NGG register values 4141- radv: remove unused parameter to radv_pipeline_emit_pm4() 4142- radv: stop recomputing the last VGT API stage when emitting graphics shaders 4143- radv: do not emit non-context registers to radv_pipeline::ctx_cs 4144- radv: track and bind more VRS states from the graphics pipeline 4145- radeonsi: remove the _unused parameter in all radeon_xxx macros 4146- radv: remove gfx10_emit_ge_pc_alloc() 4147- radv: do not emit VGT_GS_OUT_PRIM_TYPE to ctx_cs on GFX11 4148- radv: simplify radv_emit_hw_ngg() slightly 4149- radv: simplify radv_emit_hw_vs() slightly 4150- radv: simplify radv_emit_hw_gs() slightly 4151- radv: fix the late scissor workaround for GFX9 since a recent refactoring 4152- radv: make radv_conv_gl_prim_to_gs_out() a non-static function 4153- radv: emit graphics pipelines directly from the cmdbuf 4154- radv: add graphics shaders context registers that need to be tracked 4155- radv: add more radeon_opt_set_xxx variants 4156- radv: track all graphics shaders context registers 4157- radv: simplify radv_emit_ps_inputs() slightly 4158- radv: stop using radv_physical_device for radeon helpers 4159- radv: introduce radeon_set_reg_seq() 4160- radv: remove redundant radeon_set_perfctr_reg() helper 4161- radv: rename radeon perfctr uconfig helpers 4162- radv: add a helper to configure ring buffer descriptors 4163- radv: only enable VK_MESA_image_alignment_control on GFX9-11.5 4164- radv: reject unsupported buffer formats earlier 4165- ac,radv,radeonsi: add a helper to translate buffer numformat 4166- ac,radv,radeonsi: add a helper to translate buffer dataformat 4167- radv: simplify radv_emit_default_sample_locations() 4168- radv: pass radv_physical_device to radv_emit_default_sample_locations() 4169- radv: use float instead of double for viewport zscale/ztranslate 4170- radv: add more helpers to emit viewports 4171- radeonsi: refactor si_translate_border_color() 4172- ac,radv,radeonsi: introduce a helper to build a sampler descriptor 4173- radv: stop checking the output value of radv_translate_tex_numformat 4174- radv: use PIPE_FORMAT in radv_translate_tex_numformat() 4175- ac,radv,radeonsi: add a function to translate tex numformat 4176- radv: use PIPE_FORMAT in radv_translate_colorswap() 4177- ac,radv,radeonsi: add a function to translate colorswap 4178- radv: use PIPE_FORMAT in radv_translate_dbformat() 4179- ac,radv,radeonsi: add a function to translate db format 4180- ac,radv,radeonsi: add a function to get the color format endian swap 4181- radv: allow 3d views with VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT 4182- radv: simplify creating gfx10 texture descriptors for sliced 3d/2d view of 3d 4183- radv: remove redundant check for VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 on GFX6-10 4184- radv: stop checking the return value of ac_get_cb_number_type() 4185- radv: add radv_is_colorbuffer_format_blendable() 4186- amd/common: move some format related helpers to ac_formats.c 4187- ac,radv,radeonsi: add helper to know if a format is supported by CB 4188- ac,radv,radeonsi: add helper to know if a format is supported by DB 4189- ac,radv,radeonsi: introduce a helper to build a FMASK descriptor 4190- radeonsi: reject some texture formats but only on GFX8/CARRIZO 4191- radv: mark some formats as unsupported on GFX8/CARRIZO 4192- radv: set image view descriptors as buffer for non-graphics GPU 4193- ac,radv,radeonsi: add a helper to get the tile mode index 4194- radv: tidy up swizzle in radv_set_mutable_tex_desc_fields() 4195- radv: tidy up custom pitch for gfx10.3 in radv_set_mutable_tex_desc_fields() 4196- radv: tidy up meta_va in radv_set_mutable_tex_desc_fields() 4197- radv: set ITERATE_256 for GFX10+ in radv_set_mutable_tex_desc_fields() 4198- radv: stop clearing unnecessary bitfields in radv_set_mutable_tex_desc_fields() 4199- ac,radv,radeonsi: add a helper to set mutable tex desc fields 4200- ac,radv,radeonsi: add a function for getting border color swizzle 4201- radv: only set ALPHA_IS_ON_MSB if the image has DCC on GFX6-9 4202- radv: adjust determining if alpha should be on MSB 4203- ac,radv,radeonsi: add function to determine if alpha should be on MSB 4204- ac,radv,radeonsi: add a common helper for translating swizzle 4205- amd/common: only pass gfx_level to ac_get_gfx10_format_table() 4206- amd/common: add new helpers to build buffer descriptors 4207- radv: use new common helpers for building buffer descriptor 4208- aco: use new common helpers for building buffer descriptors 4209- radv: remove unused radv_translate_buffer_dataformat() 4210- radv: remove useless check about FIXED formats 4211- radv: simplify radv_is_vertex_buffer_format_supported() 4212- radv: replace vk_to_non_srgb_format() by vk_format_no_srgb() 4213- radv: fix setting a custom pitch for CB on GFX10_3+ 4214- radv: replace db_{z,stencil}_{read,write}_base by db_{depth,stencil}_base 4215- radv: tidy up radv_initialise_ds_surface() 4216- radv: separate non-mutable vs mutable fields for ds surface 4217- amd/common: add a function to initialize ds surface 4218- radv: use the common helper for initializing DS surfaces 4219- radeonsi: use the common helper for initializing DS surfaces 4220- radv: only enable DB_STENCIL_INFO.ITERATE_FLUSH when necessary 4221- radv: do not check image usage for ITERATE256 with TC-compat HTILE 4222- ac,radv,radeonsi: add function to get the number of ZPLANES 4223- ac,radv,radeonsi: a function that sets mutable DS surface fields 4224- ac,radv,radeonsi: move ZRANGE_PRECISION to mutable DS fields 4225- amd/common: add ac_gpu_info::has_etc_support 4226- radv: use PIPE_FORMAT in radv_translate_tex_dataformat() 4227- ac,radv,radeonsi: add a function to translate texture data format 4228- amd/common: add a helper to set the third word of buffer descriptor 4229- ac,radv: add a stride parameter to ac_build_attr_ring_descriptor() 4230- radeonsi: use common build buffer descriptor helpers 4231- radv: align DCC control settings to RadeonSI for GFX8 4232- amd/common: add a function to initialize cb surface 4233- radv: use the common helper for initializing CB surfaces 4234- radeonsi: use the common helper for initializing CB surfaces 4235- ac,radv,radeonsi: a function that sets mutable CB surface fields 4236- radv: apply the workaround for no PS inpputs and LDS on GFX11 only 4237- radv: apply the SQ_THREAD_TRACE_WPTR workaround on GFX11 only 4238- radv: fix flushing DB meta cache on GFX11.5 4239- radv: only emit streamout enable for legacy streamout 4240- amd/common: define SDMA v7.0 for GFX12 4241- ac,radv,radeonsi: add ac_gpu_info::has_tc_compatible_htile 4242- radv: update NUM_THREAD_FULL bitfields 4243- radv: enable GS_FAST_LAUNCH_2 by default on GFX12 4244- radv: do not emit non-existent registers on GFX12 4245- radv: update configuring sample locations on GFX12 4246- radv: update configuring viewport/scissor on GFX12 4247- radv: update configuring PS states on GFX12 4248- radv: update configuring NGG states on GFX12 4249- radv: update configuring VGT states on GFX12 4250- radv: update configuring DB states on GFX12 4251- radv: update configuring rasterization states on GFX12 4252- radv: update configuring some CB states on GFX12 4253- radv: update configuring occlusion query state on GFX12 4254- radv: update configuring MSAA state on GFX12 4255- radv: update configuring GFX preamble on GFX12 4256- radv: update configuring tess rings on GFX12 4257- radv: update binning settings on GFX12 4258- radv: update emitting discard rectangles on GFX12 4259- radv: update shader input arguments for GS stage on GFX12 4260- aco: adjust loading local invocation ID for GS on GFX12 4261- radv: do not emulate clear state for shadowed regs on GFX12 4262- radv: update cache flush emission on GFX12 4263- radv: update emitting stipple line on GFX12 4264- radv: disallow merging multiple draws into one wave on GFX12 4265- radv: emit SQ_NON_EVENT packets after drawing with streamout on GFX12 4266- radv: update configuring the number of patch control points on GFX12 4267- radv: update configuring VGT_SHADER_STAGES_EN on GFX12 4268- radv: enable GE_CNTL.DIS_PG_SIZE_ADJUST_FOR_STRIP on GFX12 4269- radv: update NUM_THREAD_FULL bitfields on GFX12 4270- radv: update global graphics shader pointers on GFX12 4271- radv: update SDMA resource type on GFX12 4272- radv: update VS input VGPRs on GFX12 4273- radv: do not enable HTILE for depth/stencil storage images 4274- radv: allow STORAGE for depth formats 4275- radv: fix configuring the number of patch control points on GFX6 4276- radv: configure DB_Z_INFO.NUM_SAMPLES on GFX12 4277- radv: configure DB_RENDER_CONTROL to zero on GFX12 4278- radv: do not enable MEM_ORDERED on GFX12 4279- radv: update configuring the attribute ring on GFX12 4280- radv: do not flush L2 metadata on GFX12 4281- radv: mark all images coherent with TC L2 on GFX12 4282- radv: update configuring SPI_SHADER_PGM_LO_LS on GFX12 4283- radv: update configuring SPI_SHADER_PGM_LO_ES on GFX12 4284- radv: update configuring SPI_SHADER_PGM_RSRC4_{HS,GS,PS} on GFX12 4285- radv: update configuring GE_CNTL.PRIM_GRP_SIZE_GFX11 on GFX12 4286- radv: update configuring SPI_PS_IN_CONTROL on GFX12 4287- radv: configure PA_SC_HISZ_CONTROL on GFX12 4288- radv: configure SPI_SHADER_GS_OUT_CONFIG_PS on GFX12 4289- radv: update configuring GS_VGPR_COMP_CNT on GFX12 4290- radv: do not set DX10_CLAMP on GFX12 4291- radv: fix VRS subpass attachments with mipmaps 4292- ac,radeonsi: set COLOR_SW_MODE for mutable CB surfaces on GFX12 4293- radv: configure PA_SC_SAMPLE_PROPERTIES on GFX12 4294- radv: update number of input VGPRs for VS on GFX12 4295- radv: update configuring color buffers on GFX12 4296- radv: update configuring depth stencil buffers on GFX12 4297- radv: update configuring PA_SC_WINDOW_SCISSOR on GFX12 4298- radv: do not emit SPI_SHADER_PGM_RSRC3_GS on GFX12 4299- radv: fix configuring NGG registers on GFX12 4300- radv: do not set VGT_PRIMITIVEID_EN.PRIMITIVEID_EN on GFX12 4301- radv: cleanup radv_precompute_registers_hw_{ngg,fs} 4302- radv: assert that GDS/GDS OA buffers can't be created on GFX12 4303- radv: only set valid bitfields for CB/DS surfaces address 4304- radv: only emit VGT_GS_MAX_PRIMS_PER_SUBGROUP on GFX9 4305- radv: only emit SQ_PERFCOUNTER_MASK on GFX7-9 4306- radv: do not set VGT_SHADER_STAGES_EN.DYNAMIC_HS on GFX9 4307- radv: only emit SPI_SHADER_PGM_SRC3_GS on GFX7+ 4308- radv: only emit CB_COLOR0_DCC_CONTROL on GFX8 4309- radv: use pipe_format when building image view descriptors 4310- ac,radv,radeonsi: add a function to build texture descriptors 4311- amd/common: add MIN_LOD for texture descriptors on GFX12 4312- Revert "radv/ci: Bring back vkcts-navi21-llvm-valve" 4313- radv: update configuring depth clamp enable on GFX12 4314- radv: update configuring COVERAGE_TO_SHADER_SELECT on GFX12 4315- radv: fix emitting VGT_PRIMITIVEID_RESET in the GFX preamble on GFX12 4316- radv: only set valid bitfields for CB/DS surfaces address on GFX12 4317- radv: add a helper to get image VA 4318- ac,radeonsi import PM4 state from RadeonSI 4319- ac,radeonsi: add a function to initialize compute preambles 4320- radv: initialize compute preambles with the common helper 4321- radv: fix creating unlinked shaders with ESO when nextStage is 0 4322- radv: pass a radv_shader to radv_get_compute_pipeline_metadata() 4323- radv: don't assume that TC_ACTION_ENA invalidates L1 cache on gfx9 4324- ac,radv: add a helper for SQTT control register 4325- ac,radv,radeonsi: add more parameters to ac_sqtt 4326- amd: allow to emit privileged config registers in PM4 4327- amd: mark more registers that need RESET_FILTER_CAM in PM4 4328- amd: add a common implementation for SQTT using PM4 4329- radv: emit more consecutive registers for SQTT on GFX8-9 4330- radv: use the common SQTT implementation 4331- radv: update VGT_TESS_DISTRIBUTION.ACCUM_ISOLINE value 4332- radv: do not set registers set by CLEAR_STATE in the preamble on GFX10-11.5 4333- radv: emit SPI_GS_THROTTLE_CNTL1 when the attr ring is emitted 4334- radv: fix incorrect buffer_list advance for multi-planar descriptors 4335- radv: use BDA in the DGC prepare shader 4336- radv: remove dynamic uniform/storage buffers support with DGC 4337- radv: do not use nir_pkt3() when the packet len is constant with DGC 4338- radv: add new macros for emiting packets in DGC 4339- radv: remove redundant nir_builder param in some DGC helpers 4340- radv: add a helper to load the pipeline VA for DGC 4341- radv: store a pointer to the logical device in dgc_cmdbuf 4342- radv: allow VK_NV_device_generated_commands_{compute} with LLVM 4343- radv: always save/restore all shader objects for internal operations 4344- radv: update configuring WALK_ALIGN8_PRIM_FITS_ST on GFX12 4345- ac/surface: add NBC view support on GFX12 4346- radv: declare a new user SGPR for the streamout state buffer on GFX12 4347- radv/nir: lower nir_intrinsic_load_xfb_state_address_gfx12_amd 4348- radv: implement streamout on GFX12 4349- radv: force using indirect descriptor sets for indirect compute pipelines 4350- radv: emit indirect sets for indirect compute pipelines with DGC 4351- radv: fix emitting indirect descriptor sets in the DGC prepare shader 4352- radv: cleanup getting AC_UD_TASK_RING_ENTRY for mesh shader 4353- radv: use radv_shader_info::user_data_0 for task shaders 4354- radv: remove dead mesh shader code for indirect draws 4355- radv: remove useless masking in radv_cs_emit_indirect_mesh_draw_packet() 4356- radv: remove useless draw_id to radv_emit_userdata_task() 4357- radv: add the DGC preprocess BO to the cmdbuf BO list 4358- radv/amdgpu: allow cs_execute_ib() to pass a VA instead of a BO 4359- radv/amdgpu: use the non-IB path for dumping CS with external IBs 4360- ac/parse_ib: dump PKT3_DISPATCH_{TASKMESH_GFX,TASKMESH_DIRECT_ACE} 4361- radv/amdgpu: fix chaining CS with external IBs on compute queue 4362- radv: add a helper to execute a DGC IB 4363- radv: add support for computing the DGC ACE IB size 4364- radv: prepare for DISPATCH_TASKMESH_GFX emission in the DGC shader 4365- radv: prepare for DISPATCH_TASKMESH_DIRECT_ACE emission in the DGC shader 4366- radv: refactor some DGC helpers in preparation for the ACE IB 4367- radv: add a helper to pad DGC IB 4368- radv: add support for preparing the ACE IB in DGC 4369- radv: add support for executing the DGC ACE IB 4370- radv: fix incorrect cache flushes before decompressing DCC on compute 4371- radv: improve clarity of DGC offset computations 4372- radv: pre-compute the base upload offset in radv_prepare_dgc() 4373- radv: add a helper that determines if DGC uses task shaders 4374- radv: split allocating and emitting push constants with DGC 4375- radv: rework emitting push constants with DGC 4376- radv: reserve space for push constants in the DGC ACE IB 4377- radv: adjust the base upload offset when DGC uses task shaders 4378- radv: emit push constant for task shaders with DGC 4379- radv: disable conditional rendering with DGC and task shaders 4380- radv: fix a synchronization issue with non-preprocessed DGC with task shader 4381- radv: enable task shaders support with NV DGC 4382- radv: suspend user conditional rendering when DGC has task shaders 4383- radv: rename radv_get_user_sgpr() to radv_get_user_sgpr_info() 4384- radv: add radv_get_user_sgpr{_loc}() helpers 4385- radv: use radv_get_user_sgpr_loc() for the GS copy shader too 4386- radv: remove unused parameter to dgc_emit_draw_mesh_tasks_ace() 4387- radv: do not emit compute userdata for empty dispatches 4388- radv: cleanup using vtx_base_sgpr for userdata with DGC 4389- radv: use radv_dgc_with_task_shader() more 4390- radv: move radv_CmdPreprocessGeneratedCommandsNV() to radv_cmd_buffer.c 4391- radv: use the graphics pipeline from the DGC info 4392- radv: use radv_get_user_sgpr() more in DGC 4393- vulkan: Update XML and headers to 1.3.289 4394- radv: advertise VK_KHR_maintenance7 4395- ci: bump vkd3d-proton to 3d46c082906c77544385d10801e4c0184f0385d9 4396- radv: remove unused parameter to radv_pipeline_import_retained_shaders() 4397- radv: simplify importing libraries with retained shaders 4398- radv: remove unused get_vs_output_info() function 4399- radv: remove unnecessary radv_pipeline_has_ngg() function 4400- radv: move radv_hash_shaders() to radv_graphics_pipeline.c 4401- radv: simplify determining when the rasterization primitive is unknown 4402- radv: simplify determining when a VS prolog is needed 4403- radv: stop passing a pipeline to some graphics related helpers 4404- radv: rework generating all graphics state for compiling pipelines 4405- radv: remove radv_descriptor_set_layout::shader_stages 4406- radv: use blake3 for hashing descriptor set layouts 4407- radv: use blake3 for hashing pipeline layouts 4408- radv: disable VK_EXT_sampler_filter_minmax on TAHITI and VERDE 4409- ac,radeonsi: add ac_is_reduction_mode_supported() 4410- radv: use ac_is_reduction_mode_supported() 4411- radv: fix marking RADV_DYNAMIC_COLOR_ATTACHMENT_MAP as dirty 4412- nir/gather_info: handle uses_fbfetch_output for sparse image loads 4413- nir/gather_info: handle uses_fbfetch_output for texture operations 4414- radv: destroy the perf counter BO in radv_device_finish_perf_counter() 4415- radv: add radv_device_init_perf_counter() 4416- radv: add helpers for init/deinit device memory cache 4417- radv: add helpers for init/deinit RGP 4418- radv: simplify keeping shader info for GPU hangs debugging 4419- radv: add radv_device_init_trap_handler() 4420- radv: add helpers for init/deinit device fault detection 4421- radv: add radv_device_init_rmv() 4422- radv: regroup all tools initialization in one helper 4423- radv: use zero allocation for the device queues 4424- radv/meta: remove non-valuable comments 4425- radv/meta: remove unnecessary blit2d_dst_temps struct 4426- radv/meta: remove redundant check for hw resolve pipelines 4427- radv/meta: remove unused number of rectangles for internal operations 4428- radv/meta: remove useless checks for NULL handles before destroying 4429- radv/meta: add a helper to create compute pipeline 4430- radv/meta: add a helper to create pipeline layout 4431- radv/meta: add a helper to create descriptor set layout 4432- zink/ci: skip arb_shader_image_load_store also on NAVI31/VANGOGH 4433- zink/ci: remove redundant arb_shader_image_load_store skips on POLARIS10 4434- radv: do not expose ImageFloat32AtomicMinMax on GFX11_5 4435- radv: fix programming DB_RENDER_CONTROL for NULL depth/stencil on GFX11_5 4436- radv: expose BufferFloat32AtomicMinMax on GFX11_5 4437- radv: disable SPM trace on GFX11_5 4438- ac/rgp: assume GFX11_5 use the same SQTT/RGP versions as GFX11 4439- radv: allow to capture with RGP on GFX11_5 4440- radv/meta: fix potential race condition when creating the copy VRS pipeline 4441- radv/meta: rework creating the VRS copy HTILE pipeline 4442- radv/meta: remove the depth resummarize operation 4443- radv/meta: avoid potential NULL deref with the gfx depth decompress pipeline 4444- radv/meta: move locking around the gfx depth decompress pipeline 4445- radv/meta: remove unused parameter to radv_get_depth_pipeline() 4446- radv/meta: rework creating the gfx depth decompress pipeline 4447- radv/meta: create the compute depth decompress pipeline on-demand 4448- radv/meta: cleanup creating the compute depth decompress pipeline 4449- radv/meta: separate creating the fill/copy pipelines 4450- radv/meta: create the fill/copy pipelines on-demand 4451- radv/meta: cleanup radv_device_init_meta_blit_{color,depth,stencil]() 4452- radv/meta: move the locking around creating blit pipelines 4453- radv/meta: cleanup meta_emit_blit() 4454- radv/meta: rework creating blit pipelines 4455- radv/meta: create fmask expand layouts regardless on-demand 4456- radv/meta: rework creating FMASK expand pipelines 4457- radv/meta: create fmask copy layouts regardless on-demand 4458- radv/meta: rework creating copy expand pipelines 4459- radv/meta: fix potential race condition when creating DCC retile pipelines 4460- radv/meta: fix potential memleak when creating DCC retile pipelines 4461- radv/meta: rework creating DCC retile pipelines 4462- radv/meta: remove useless memset when destroying DCC retile state 4463- radv/meta: rework creating GFX depth/stencil resolve pipelines 4464- radv/meta: rework creating GFX color resolve pipelines 4465- radv/meta: rework creating compute color resolve pipelines 4466- radv/meta: rework creating compute depth/stencil resolve pipelines 4467- radv/meta: cleanup creating HW resolve pipelines 4468- radv/meta: rework creating HW resolve pipelines 4469- radv/meta: rework creating DCC decompress compute pipelines 4470- radv/meta: rework creating clear HTILE mask pipeline 4471- radv/meta: create clear HTILE mask pipeline on-demand when needed 4472- radv/meta: create DCC comp-to-single pipelines on-demand when needed 4473- radv/meta: add a helper to create itob pipelines 4474- radv/meta: create itob pipelines on-demand when needed 4475- radv/meta: add a helper to create btoi pipelines 4476- radv/meta create btoi pipelines on-demand when needed 4477- radv/meta: add a helper to create btoi r32g32b32 pipeline 4478- radv/meta: create btoi r32g32b32 pipeline on-demand when needed 4479- radv/meta: update the helper that creates itoi pipelines 4480- radv/meta: create itoi pipelines on-demand when needed 4481- radv/meta: add a helper to create itoi r32g32b32 pipeline 4482- radv/meta: create itoi r32g32b32 pipelines on-demand when needed 4483- radv/meta: update the helper that creates clear pipelines 4484- radv/meta: create clear pipeliones on-demand when needed 4485- radv/meta: add a helper to create clear r32g32b32 pipeline 4486- radv/meta: create clear r32g32b32 pipelines on-demand when needed 4487- radv: fix shaders cache corruption with indirect pipeline binds 4488- radv/meta: stop checking that creating NIR shaders failed 4489- radv/meta: remove unnecessary goto 4490- radv/meta: stop creating similar pipeline layouts for depth decompress 4491- radv/meta: create the layouts for blit pipelines on-demand 4492- radv/meta: create the layouts for FS resolve pipelines on-demand 4493- radv/meta: create the layouts for depth decompress on-demand 4494- radv/meta: create the layouts for FMASK copy on-demand 4495- radv/meta: create the layouts for FMASK expand on-demand 4496- radv/meta: create the layouts for compute resolve on-demand 4497- radv/meta: create the louts for DCC comp-to-single clear on-demand 4498- radv/meta: rework getting clear color pipelines 4499- radv/meta: create the layout for clear color on-demand 4500- radv/meta: rework getting depth stencil clear pipelines 4501- radv/meta: create the layout for clear depth/stencil on-demand 4502 4503Saroj Kumar (2): 4504 4505- mesa: Add functions to print blake3 4506- mesa: replace shader_info::source_sha1 4507 4508Sathishkumar S (3): 4509 4510- util/format: add planar3 y8_u8_v8_440 pipe format 4511- frontends/va,gallium/vl: add support for yuv440 format 4512- radeonsi/vcn: enable yuv440 jpeg decode 4513 4514Sebastian Wick (1): 4515 4516- vulkan/wsi/wayland: refactor wsi_wl_swapchain_wait_for_present 4517 4518Sergi Blanch Torne (21): 4519 4520- mr-label-maker: specialize CI labels 4521- ci: kernel stored in a different s3 bucket 4522- ci: identify and label S3 buckets 4523- ci: disable Collabora's farm due to maintance 4524- Revert "ci: disable Collabora's farm due to maintance" 4525- ci: fix stress counter in run'n'monitor 4526- ci: disable Collabora's farm due to maintenance 4527- Uprev Piglit to cf8daaf5ba90fc9b8a0e144355026e2a14c79944 4528- Revert "ci: disable Collabora's farm due to maintenance" 4529- ci: disable Collabora's farm due to runners maintenance 4530- Revert "ci: disable Collabora's farm due to runners maintenance" 4531- ci: continue stress run'n'monitor 4532- ci: Fix parse GitLab pipeline url 4533- ci: run_n_monitor, collect and summarize 4534- ci: disable Collabora's farm due to maintenance 4535- Revert "ci: disable Collabora's farm due to maintenance" 4536- ci: run_n_monitor, arguments review and unicode 4537- ci: run_n_monitor, pretty duration with padding 4538- ci: run_n_monitor, listing job names with a padding 4539- ci: run_n_monitor, sort by name when listing jobs 4540- ci: fix run_n_monitor single execution 4541 4542Sil Vilerino (5): 4543 4544- d3d12: Fix static analysis issues due to bad parenthesis closing 4545- nir: Mark variable as ASSERTED to fix unused variable warning treated as error 4546- d3d12: Video Encode - Fix inputs for older OS support query cap 4547- d3d12: Add missing case for CQP in d3d12_video_encoder_disable_rc_qualitylevels 4548- Revert "d3d12: Video Encode - Remove PIPE_VIDEO_PROFILE_MPEG4_AVC_BASELINE as not supported" This reverts commit d6bb4ddc638f3ee37fbbe066c631dad80aaeb2d3. Fixes: d6bb4ddc638 ("d3d12: Video Encode - Remove PIPE_VIDEO_PROFILE_MPEG4_AVC_BASELINE as not supported") 4549 4550Simon Ser (1): 4551 4552- glapi: fix param type in TexGenxOES 4553 4554Sushma Venkatesh Reddy (4): 4555 4556- drm-uapi: Sync i915_drm.h with a78313bb206e 4557- anv/drirc: add option to provide low latency hint 4558- anv: Fix I915_PARAM_HAS_CONTEXT_FREQ_HINT check 4559- intel/clflush: Utilize clflushopt in intel_invalidate_range 4560 4561Sviatoslav Peleshko (5): 4562 4563- anv: Fix descriptor sampler offsets assignment 4564- anv,driconf: Add fake non device local memory WA for Total War: Warhammer 3 4565- intel/brw: Actually retype integer sources of sampler message payload 4566- intel/elk: Actually retype integer sources of sampler message payload 4567- mesa: Fix PopAttrib not restoring states that changed on deeper stack level 4568 4569Tapani Pälli (14): 4570 4571- iris: change stream uploader default size to 2MB 4572- anv: skip gfx push constants alloc optimization on gfx9/11 4573- iris: ForceZeroRTAIndexEnable if last geom stage does not write layer id 4574- vulkan/runtime: add a subpass bit for legacy dithering 4575- anv: VK_EXT_legacy_dithering support 4576- docs/features: add VK_EXT_legacy_dithering 4577- ci: update failures list with angle for jsl, tgl 4578- anv/android: enable emulated astc for applications 4579- anv: implement WA 14018283232 4580- mesa: remove some conditions in mipmap code 4581- isl: fix condition for enabling sampler route to lsc 4582- isl/iris/anv: provide drirc toggle intel_sampler_route_to_lsc 4583- anv: move some pc was to batch_emit_pipe_control_write 4584- anv: fix a cmd_buffer reference in simple shader 4585 4586Tatsuyuki Ishi (5): 4587 4588- radv: Remove radv_queue::device again 4589- vk_entrypoints_gen: Add missing ATTR_WEAK for instance and physdev entrypoints 4590- vk_entrypoints_gen: Rework ATTR_WEAK to unify Unix and MinGW 4591- vk_entrypoints_gen: Apply hidden visibility to generated symbols 4592- vk_cmd_queue_gen: Exclude CmdDispatchGraphAMDX 4593 4594Thomas H.P. Andersen (2): 4595 4596- nvk: advertise EXT_depth_range_unrestricted 4597- nvk/upload_queue: fix the _fill method 4598 4599Tim Huang (2): 4600 4601- amd: add GFX v11.5.2 support 4602- amd/vpelib: support VPE IP v6.1.3 4603 4604Timothy Arceri (36): 4605 4606- glsl: wrap nir_opt_loop in NIR_PASS() 4607- glsl: use hash table when serializing resource data 4608- glsl: move geom input array sizing to nir linker 4609- lima: drop unrequired opt from standalone compiler 4610- glsl: remove unused detect_recursion_linked() 4611- lima: remove the standalone compiler 4612- glsl: add support for glsl es 310/320 to standalone compiler 4613- nir: clarify and update loop conditional instruction 4614- nir: more aggressively remove in loop during partial unroll 4615- nir: support more loop unrolling for logical operators 4616- nir: add merge loop terminators optimisation 4617- nir: add test for opt_loop_merge_terminators 4618- nir: correctly track current loop in nir_opt_loop() 4619- nir: test opt_loop_merge_terminators() skips unhandled loops 4620- nir: add additional opt_loop_merge() test of deref handling 4621- glsl: drop dump-builder support from standalone compiler 4622- glsl: remove Par-linking from the standalone linker 4623- glsl: remove do_function_inlining() 4624- glsl: make glsl_to_nir() more generic 4625- glsl: remove unused symbol table functionality 4626- glsl: remove out of date TODO 4627- glsl: move call to create explicit ifc layout out of glsl_to_nir 4628- glsl: drop glsl ir optimisation from the standalone compiler 4629- glsl: make warning tests pass linking 4630- glsl/mesa: remove UniformHash field 4631- glsl/standalone: init EmptyUniformLocations 4632- glsl/tests: fix test_gl_lower_mediump 4633- mesa: remove _mesa_get_log_file() wrapper 4634- util/mesa: move mesa/main log code to util 4635- mesa: add unreachable to _mesa_shader_stage_to_subroutine_prefix() 4636- glsl: set how_declared to hidden for compiler temps 4637- glsl: fix cross validate globals 4638- glsl: remove out of date comment 4639- nir: set disallow_undef_to_nan for legacy ARB asm programs 4640- glsl: fix glsl to nir support for lower precision builtins 4641- glsl: always copy bindless sampler packing constructors to a temp 4642 4643Timur Kristóf (25): 4644 4645- ac/nir/esgs: Slightly refactor emitting IO loads and stores. 4646- ac/nir/tess: Slightly refactor emitting LS outputs. 4647- ac/nir: Add helper macros for emitting IO code. 4648- ac/nir/esgs: Implement packed 16-bit ES->GS I/O using helper macros. 4649- ac/nir/tess: Implement packed 16-bit LS->HS I/O using helper macros. 4650- ac/nir/tess: Implement packed 16-bit HS->TES I/O using helper macros. 4651- aco: Add missing nir_builder include. 4652- ac/nir: Move some helpers to new file. 4653- ac/nir: Add helper for pre-rasterization output info. 4654- ac/nir/ngg: Use new pre-rasterization output info helper. 4655- ac/nir/legacy: Use new pre-rasterization output info helper. 4656- nir: Add nir_opt_load_store_update_alignments. 4657- radv: Add TES num_linked_patch_inputs. 4658- radv: Add shader stats for inputs and outputs. 4659- radv: Fix TCS -> TES I/O linking typo of VARYING_SLOT vs. BIT. 4660- nir/opt_varyings: Print FS VEC4 type when debugging relocate_slot. 4661- nir/opt_varyings: Don't promote flat inputs when moving post-dominator. 4662- ac/nir/tess: Adjust TCS->TES output mapping for linked shaders. 4663- radv: Properly link TCS->TES IO again. 4664- nir/lower_io: Add option to implement mediump as 32-bit. 4665- radv: Ignore mediump IO flag. 4666- ac/nir/tess: Only write tess factors that the TES reads. 4667- ac/nir/tess: Fix per-patch output LDS mapping. 4668- ac/nir/tess: Fix per-patch output VRAM mapping. 4669- radv: Use number of TES inputs for TCS-TES linking. 4670 4671Tomeu Vizoso (2): 4672 4673- etnaviv/nn: Make parallel jobs disabled by default 4674- etnaviv: handle missing alu conversion opcodes 4675 4676Turo Lamminen (1): 4677 4678- radv: Optimize memcpy in write_image_descriptor 4679 4680Tvrtko Ursulin (1): 4681 4682- intel/hang_replay: fix batch address 4683 4684Valentine Burley (40): 4685 4686- docs: Update VK_EXT_legacy_vertex_attributes entries 4687- tu: Add missing VK_EXT_legacy_vertex_attributes feature 4688- tu: Change commas to semicolons in VK_EXT_map_memory_placed features 4689- drm-shim: Stub syncobj reset ioctl 4690- tu: Expose VK_EXT_nested_command_buffer 4691- freedreno/devices: Fix indentation for Adreno A32 4692- freedreno/ci: Update expectations 4693- wsi: Guard DRM-dependent function implementations with HAVE_LIBDRM 4694- tu: Add support for VkBindMemoryStatusKHR 4695- tu: Add support for NULL index buffer 4696- tu: Add support for version 2 of all descriptor binding commands 4697- tu: Advertise VK_KHR_maintenance6 4698- tu: Move event related related code to tu_event.cc/h 4699- tu: Handle all dependencies of CmdWaitEvents2 4700- mr-label-maker: Update nouveau directories 4701- mr-label-maker: Separate freedreno and turnip labels 4702- tu: Handle the new sync2 flags 4703- tu: Remove declaration of unused update_stencil_mask function 4704- tu: Switch to vk_ycbcr_conversion 4705- tu: Use vk_sampler 4706- tu: Use device->vk.enabled_features instead of iterating twice 4707- tu: Move sampler related code to tu_sampler.cc/h 4708- tu: Drop tu_init_sampler helper function 4709- tu: Advertise VK_KHR_shader_float_controls2 4710- tu: Use the common version of vkGetBufferMemoryRequirements2 4711- tu: Move buffer related code to tu_buffer.cc/h 4712- tu: Use the common version of vkQueueBindSparse 4713- tu: Use vk_buffer_view 4714- tu: Drop tu_buffer_view_init helper function 4715- tu: Move buffer view related code to tu_buffer_view.cc/h 4716- tu: Rename tu_query.cc/h to tu_query_pool.cc/h 4717- tu: Use the common versions of vkBegin/EndQuery() 4718- tu: Use vk_query_pool 4719- tu: Don't disable 2 10-bit formats 4720- freedreno,tu,ir3: Move threadsize_base and max_waves to fd_dev_info 4721- freedreno/ci: Use the common a6xx-skips on a750 4722- tu: Enable VK_KHR_shader_subgroup_uniform_control_flow 4723- tu/kgsl: Remove unused variable 4724- vulkan/wsi: Refactor can_present_on_device 4725- tu: Always report that we can present on kgsl 4726 4727Vignesh Raman (3): 4728 4729- virtio/ci: separate hiden jobs to -inc.yml files 4730- ci: add farm variable for devices in collabora farm 4731- ci/lava: add farm in structured log files 4732 4733Vinson Lee (2): 4734 4735- panvk: Remove duplicate variable src_idx 4736- panvk: Fix assert 4737 4738Vlad Schiller (2): 4739 4740- pvr: Handle VK_STRUCTURE_TYPE_EXPORT_MEMORY_ALLOCATE_INFO 4741- pvr: Handle VK_STRUCTURE_TYPE_IMAGE_FORMAT_LIST_CREATE_INFO 4742 4743WANG Xuerui (2): 4744 4745- meson: Force use of LLVM ORCJIT for hosts without MCJIT support 4746- meson: Additionally probe -mtls-dialect=desc for TLSDESC support 4747 4748Weifeng Liu (1): 4749 4750- anv/anroid: Query gralloc for tiling mode 4751 4752X512 (2): 4753 4754- egl/haiku: fix double free of BBitmap 4755- egl/haiku: fix synchronization problems, add missing header 4756 4757Yiwei Zhang (31): 4758 4759- venus: avoid client allocators for ring internals 4760- venus: silence a stack array false alarm 4761- venus: workaround excessive dma-buf import failure on turnip 4762- venus: fix to destroy all pipeline handles on early error paths 4763- meson: disallow Venus debug + LTO build via GCC 4764- turnip: msm: clean up iova on error path 4765- turnip: msm: fix racy gem close for re-imported dma-buf 4766- venus: drop the workaround for excessive dma-buf import oom on turnip 4767- turnip: virtio: fix error path in virtio_bo_init 4768- turnip: virtio: fix iova leak upon found already imported dmabuf 4769- turnip: virtio: fix racy gem close for re-imported dma-buf 4770- vulkan: cast to avoid -Wswitch for Android struct beyond VkStructureType 4771- venus: directly use vk drm and pci props in renderer info 4772- venus: move custom props fill from GPDP2 to props init 4773- venus: move props sanitization to a separate helper 4774- venus: define VN_SET_VK_PROPS(_EXT) to simplify vk props init 4775- vulkan: drop redundant core props query and copy helpers 4776- venus: drop internal memory pools 4777- venus: allow non-wsi image alias path to passthrough upon bind memory 4778- ci/venus: skip a timeout test 4779- anv: use os_get_option instead of getenv 4780- venus: defer qfb buffer init upon query being used 4781- venus: refactor vn_android_image_from_anb 4782- venus: refactor to add vn_android_image_from_anb_internal 4783- venus: support VK_ANDROID_NATIVE_BUFFER_SPEC_VERSION 8 4784- vulkan: properly ignore unsupported feature structs 4785- venus: tentative fix for test flakiness from invalid ring wait 4786- venus: simplify cached mem type emulation 4787- venus: clarify wsi image ownership 4788- venus: fix a race condition between gem close and gem handle tracking 4789- Revert "meson: disallow Venus debug + LTO build via GCC" 4790 4791Yogesh Mohan Marimuthu (4): 4792 4793- radeonsi: remove si_query_hw_ops table and call func directly 4794- radeonsi: use reseults_end instead of unprepared to init query buffer 4795- radeonsi: rename query_hw_ops to hw_query_ops match sw 4796- radeonsi: add more comments in si_query.c 4797 4798Yonggang Luo (2): 4799 4800- util: Rename DETECT_OS_UNIX to DETECT_OS_POSIX 4801- gallivm: add lp_context_ref for combine usage of LLVMContextSetOpaquePointers 4802 4803Yukari Chiba (7): 4804 4805- llvmpipe: add gallivm_add_global_mapping 4806- llvmpipe: make unnamed global have internal linkage 4807- util: detect RISC-V architecture 4808- gallivm: add riscv support to the mattrs setting code 4809- llvmpipe: add function name to gallivm_jit_function 4810- llvmpipe/tests: add a new test for multiple symbols for orc jit testing 4811- llvmpipe: add an implementation with llvm orcjit 4812 4813Yusuf Khan (7): 4814 4815- nouveau: Fix crash when destination or source screen fences are null 4816- nouveau/headers: Make nvk_cl**** turn to nv_push_cl**** 4817- nvk: remove NVK_MME_COPY_QUERIES 4818- zink/query: begin time elapsed queries even if we arent in a rp 4819- nvc0/vbo: wrap draw_vbo for multidraw performance 4820- nv50/vbo: wrap draw_vbo to avoid ovehead from multidraw 4821- aux/draw: Use the draw info we get passed in instead of our own 4822 4823Zach Battleman (2): 4824 4825- intel/brw: update comment to accurately reflect intended behavior 4826- intel/brw: update Wa_1805992985 to use workarounds mechanism 4827 4828Zack Middleton (2): 4829 4830- gles1: fix GL_OES_vertex_array_object 4831- gles1: fix glBufferSubData() 4832 4833Zan Dobersek (14): 4834 4835- fdperf: use snprintf instead of asprintf 4836- fdperf: select_counter() should work with a countable value 4837- fdperf: prettify logic around the reserved CP counter 4838- fdperf: improve reads of counter values 4839- fdperf: simplify counter value output 4840- freedreno: add a7xx perfcounter support 4841- tu: fix ZPASS_DONE interference between occlusion queries and autotuner 4842- tu: avoid memory polling in occlusion query endings using ZPASS_DONE 4843- tu: use either the 16-bit or 32-bit descriptor 4844- ir3_nir_opt_preamble: handle 8-bit preamble loads and stores 4845- ir3: rework TYPE_S8 as TYPE_U8_32 4846- tu: support KHR_8bit_storage 4847- tu: add format feature flag checks for VK_IMAGE_USAGE_INPUT_ATTACHMENT_BIT 4848- freedreno/drm: add mesautil dependency 4849 4850bbhtt (1): 4851 4852- nvk: Clean up unused header from libdrm_nouveau 4853 4854chiachih (9): 4855 4856- amd/vpelib: Resolve mismatch with shader 4857- amd/vpelib: Remove linear_0_125 TF 4858- amd/vpelib: Remove gamma cached table 4859- amd/vpelib: Remove support for non-linear FP16 4860- amd/vpelib: adding blend gamma bypass 4861- amd/vpelib: Remove checks for pitch alignment 4862- amd/vpelib: Fix Color Adjustment Failing Test Cases 4863- amd/vpelib: Fix blndgam bypass flag assignment 4864- amd/vpelib: Bypass de/regam on HLG 4865 4866msizanoen (1): 4867 4868- egl/wayland: Fix direct scanout with EGL_EXT_present_opaque 4869 4870nyanmisaka (1): 4871 4872- frontends/va: add support for A2RGB10/X2RGB10/A2BGR10/X2BGR10 4873 4874tarsin (4): 4875 4876- turnip: Change tu_image to use common initialization helpers 4877- turnip: Convert tu_device_memory to use vk_device_memory 4878- turnip: Split tu_image_init to use layout setting logic separately 4879- turnip: Support AHardwareBuffer 4880