Mesa 24.1.0 Release Notes / 2024-05-22¶
Mesa 24.1.0 is a new development release. People who are concerned with stability and reliability should stick with a previous release or wait for Mesa 24.1.1.
Mesa 24.1.0 implements the OpenGL 4.6 API, but the version reported by glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. Some drivers don’t support all the features required in OpenGL 4.6. OpenGL 4.6 is only available if requested at context creation. Compatibility contexts may report a lower version depending on each driver.
Mesa 24.1.0 implements the Vulkan 1.3 API, but the version reported by the apiVersion property of the VkPhysicalDeviceProperties struct depends on the particular driver being used.
SHA256 checksum¶
b7eac8c79244806b1c276eeeacc329e4a5b31a370804c4b0c7cd16837783f78b mesa-24.1.0.tar.xz
New features¶
VK_EXT_map_memory_placed on RADV, ANV and NVK
VK_KHR_shader_subgroup_rotate on RADV and ANV and NVK
VK_KHR_load_store_op_none on RADV, ANV, NVK and Turnip
VK_KHR_line_rasterization on RADV, ANV, NVK and Turnip
VK_KHR_index_type_uint8 on RADV, ANV, NVK and Turnip
VK_KHR_shader_expect_assume on all Vulkan drivers
VK_KHR_shader_maximal_reconvergence on RADV, ANV and NVK
VK_KHR_shader_quad_control on RADV
OpenGL 4.6 on Asahi
OpenGL ES 3.2 on Asahi
Mali G610 and G310 on Panfrost
Mali T600 on Panfrost
VK_KHR_shader_subgroup_uniform_control_flow on NVK
alphaToOne/extendedDynamicState3AlphaToOneEnable on RADV
VK_EXT_device_address_binding_report on RADV
VK_EXT_external_memory_dma_buf for lavapipe
VK_EXT_queue_family_foreign for lavapipe
VK_EXT_shader_object on RADV
VK_EXT_nested_command_buffer on NVK and RADV
VK_EXT_queue_family_foreign on NVK
VK_EXT_image_drm_format_modifier on NVK
Bug fixes¶
anv: unbounded shader cache
radv: Crash due to nir validation fail in Enshrouded
bisected: turnip: deqp regressions
android: sRGB configs no longer exist after !27709
[24.1-rc4] fatal error: intel/dev/intel_wa.h: No such file or directory
vcn: rewinding attached video in Totem cause [mmhub] page fault
When using amd gpu deinterlace, tv bt709 properties mapping to 2 chroma
ci: switch from CI_JOB_JWT to id_tokens
VCN decoding freezes the whole system
[RDNA2 [AV1] [VAAPI] hw decoding glitches in Thorium 123.0.6312.133 after https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28960
nvk: Tracker issue for gamescope support
nvk: Implement VK_EXT_image_drm_format_modifier
WSI: Support VK_IMAGE_ASPECT_MEMORY_PLANE_i_BIT_EXT for DRM Modifiers in Vulkan
[Regression][Bisected] EGL/Wayland: QT applications terminated by SIGSEGV (Address boundary error) when using dGPU
radv: Enshrouded GPU hang on RX 6800
NVK Zink: Wrong color in Unigine Valley benchmark
panforst: T604 issue with using u32 for flat varyings
[anv] FINISHME: support YUV colorspace with DRM format modifiers
gen9/11 test became flaky: piglit.spec.!opengl 1_4.blendminmax
24.0.6: build fails
mesa 24 intel A770 KOTOR black shadow smoke scenes
[bisected][regression] kitty fails to start due to `glfwWindowHint(GLFW_SRGB_CAPABLE,true)`
r600: bisected 5eb0136a3c561 breaks a number of piglits
Graphical glitches in RPCS3 after updating Vulkan Intel drivers
[R600] OpenGL and VDPAU regression in Mesa 23.3.0 - some bitmaps get distorted.
VAAPI radeonsi: VBAQ broken with HEVC
radv/video: 10-bit support
radv: vkCmdWaitEvents2 is broken
Zink: enabled extensions and features may not match
glRasterPos: “Assertion `prog->base_serialized_nir’ failed.” if a shader is loaded from the shader cache
radv: mesa-9999/src/amd/vulkan/radv_image_view.c:147: radv_set_mutable_tex_desc_fields: Assertion `(plane->surface.u.gfx9.surf_pitch * plane->surface.bpe) % 256 == 0’ failed.
ACO doesn’t hide lds_param_load latencies
ACO doesn’t form a VMEM clause for image stores in one case on GFX11
r600: Valheim hangs CAYMAN gpu (regression/bisected)
r600: Artifacts in Oxygen Not Included around air ducts and pipes (regression, bisected)
radv: UMR wave parsing format is outdated
radv: GetImageMemoryRequirements2 does not look at VkImagePlaneMemoryRequirementsInfo
RADV, regression : Objects randomly appear/disappear on Unreal Engine 4 titles using D3D12 backend on Polaris
mesa 23.1.0-rc3 flickering textures/lighting in Unreal 4 games Polaris10
ACO tests SIGSEGV in debian-vulkan job with LTO enabled
radv: Address binding report for images is incorrect.
blorp: avoid dirtying push constants in 3D
anv: flaky vkd3d-proton test_buffer_feedback_instructions_sm51
FTBFS: commit aaccc25a4dd9ccfc134e51a7e81168334d63a909 broke mesa snapshot build
d3d12_screen.cpp:60:10: fatal error: ShlObj.h: No such file or directory
r300: crash when compiling some GSK shaders
anv: vkd3d-proton test_stress_suballocation failure
d3d12: Zwift renders with bad textures/lighting
nir_opt_remove_phis breaks divergence analysis
intel: Require 64KB alignment when using CCS and multiple engines
NVK: Misrendering with Civilization 6
radv: RDR2 might need zerovram
intel-clc build failure, i think?
Issues rendering gtk4 window decorations on v3d on Fedora-40/mesa-24.0
clc: Failure when linking with llvm+clang 18.1 (-Dshared-llvm=disabled)
LLVM-18 build issue
vulkan/wsi/x11: VK_SUBOPTIMAL_KHR is never reported by the swapchain
Broken vaapi encoding on Radeon RX 6900XT
RUSTICL creating a shared reference to mutable static is discouraged and will become a hard error
anv: GPU hang on Assassin’s Creed Valhalla while running benchmark
nvk: dota 2 crashes after ~5 seconds in game
dzn: conflicting defines with DirectX headers 1.613.0
VAAPI: Incorrect HEVC block size reported with radeonsi
radv: WWE 2K24 has very quirky DCC issues on RDNA2
anv: Dirt 5 crashes at tryCreatingPipelineStateFromCache
freedreno: remove headergen2
freedreno: remove headergen2
vulkan/wsi: crash in dEQP-VK.wsi.wayland.swapchain.simulate_oom.min_image_count
Document that Zink on MoltenVK is not expected to work
KiCAD 3D Viewer - rounded pads rendered incorrectly (texture mapping or stencil test error)
OpenSCAD rendering incorrect and inconsistent on radeonsi
intel/fs: regression on MTL with 64bit values in UBO
ci: split debian-build-testing?
[freedreno] Black background on SuperTux Kart with postmarketOS and Oneplus 6T
[radv] Half-Life Alyx renders solid black for reflective surfaces
iris: iris_resource_get_handle returns wrong modifier
[RX 7900 XTX] Helldivers 2 cause GPU reset
radeon: Crash in radeon_bo_can_reclaim_slab
regression/bisected: commit 4e3f3c10e14d8778781c81f39ced659b5ec2f148 broke mesa snapshot build
RV530 renders improperly at non 4:3 resolutions.
anv: new cooperative matrix failures with CTS 1.3.8.0
nvk: Missing implementation of VkImageSwapchainCreateInfoKHR and VkBindImageMemorySwapchainInfoKHR
mesa > 23.1.9 [opencl,video_cards_nouveau] fails to build due to missing symbol vl_video_buffer_is_format_supported
intel/meson: Make intel_stub_gpu work with `meson devenv`
Follow-up from “iris: Fix plane indexing and handling on image import”
nvk,nak: Implement shaderStorageImageMultisample
nvk,nak: Implement VK_KHR_shader_subgroup_uniform_control_flow
`[gfxhub0] no-retry page fault` triggered by `AMD_TEST=testdmaperf` on gfx90c APU
nvk: glcts hangs
v3d: Line rendering broken when smoothing is enabled
PowerVR reports minMemoryMapAlignment of 64
RADV: GPU crash when setting ‘RADV_DEBUG=allbos’
[intel] mesa ftbfs with time_t64
d3d12_resource.cpp:307:49: error: no matching function for call to ‘ID3D12Heap::GetDesc()’
radv regression between a337a0c8072d0be487e43c2b7b132e003c6d5a5e and 83f741124b66818053b6b1b2f7e42f5217a27004
[build failure] [armhf] - error: #error “_TIME_BITS=64 is allowed only with _FILE_OFFSET_BITS=64”
R400 should have native support for sin/cos in VS
[radv] Crash when VkGraphicsPipelineCreateInfo::flags = ~0u
intel: all workarounds disabled with ATS skus
vulkan: GPL now broken
Gen4 assertion `force_writemask_all’ failed.
src/gallium/auxiliary/rtasm/rtasm_x86sse.c:198:10: runtime error: store to misaligned address 0x7fabba0cd011 for type ‘int’, which requires 4 byte alignment
[radv] Holographic projection texture glitch in Rage 2
RustiCL: Callbacks are not called upon errors
MTL: regressions in vulkancts due to BO CCS allocations
zink: spec@ext_external_objects@vk-image-overwrite fail
vaapi: radeonsi: surface_region.{x,y} is not honored in processing when source is RGB
nvk: Implement VK_EXT_shader_object
nvk: Implement VK_EXT_graphics_pipeline_library
turnip: UBWC disabled for MSAA
KHR-Single-GL46.arrays_of_arrays_gl.AtomicUsage fails on MTL
GTF-GL46.gtf42.GL3Tests.texture_storage.texture_storage_texture_as_framebuffer_attachment fails on MTL
nvk: Implement VK_KHR_maintenance5
[intel][anv][build][regression] - genX_grl.h:27:10: fatal error: grl/grl_cl_kernel.h: No such file or directory
RX 6600 VDPAU not recognizing HEVC_MAIN_10 correctly
Running an app on another AMD GPU (offload, DRI_PRIME) produces corrupted frames on Wayland.
regression in radeonsi since 9aa205668bcbf701f8f694551c284cd8e4cc17a3 (crashes in vbo_save_playback_vertex_list)
clang/libclc related Mesa build failures
Ninja Install Error
anv: add a dri config to enable implicit fencing on external memory interop
VDPAU declares a texture as “immutable” without also setting its ImmutableLevels attribute.
Segfault in glsl_to_nir.cpp nir_visitor::visit when assigning interface block
[rusticl]WARNING: Project targets ‘>= 1.1.0’ but uses feature deprecated since ‘1.0.0’: module rust has been stabilized. drop “unstable-” prefix from the module name
RX6600 hardware HEVC video decode fails for VDPAU but works for VA-API. (Can lock up GPU!)
Rusticl panics when getting program build logs using opencl.hpp
ue5 game issues lighting Rog Ally 7080u (z1e)
Missing textures in RoboCop: Rogue City with mesh shaders enabled
Intel/anv: Allow pre-compiled shader caches to be reused across multiple devices
radv: Multiview PSO forgets to export layer in some cases.
-Dintel-rt=enabled fails to build on 32-bit
MTL: regressions in vulkancts due to BO CCS allocations
intel: build failures
regression/bisected commit 4de62731f4db56360026cbb6a3b8566f86f22466 broke HW acceleration in the Google Chrome
i386 intel build failure: meson.build:45:6: ERROR: Unknown variable “prog_intel_clc”.
rusticl: clEnqueueFillBuffer (among others) fails on buffers created from GL object.
MTL raytracing regression
[ANV/DG2] Unexpectedly slow replay of RenderDoc frame capture of Resident Evil 4 Remake
zink: flickering artifacts in Selaco
[ADL] gpu hang on dEQP-VK.synchronization.internally_synchronized_objects.pipeline_cache_graphics
Turnip spam on non-turnip devices
Intermittent compiler failures when building valhall tests
panfrost: graphical artifacts on T604 (T600)
Dying Light native artifacts on Intel A770
r300: Amnesia: The Dark Descent heavy corruption
[ANV/DG2] Age of Empires IV fullscreen “banding” artefacts
[mtl][anv] dEQP-VK.pipeline.monolithic.depth.format.d32_sfloat.compare_ops.* failures when run multithreaded
[mtl][anv] flaky tests in pipeline.monolithic.extended_dynamic_state*stencil_state_face* series
Broken colors/dual-source blending on PinePhone (Pro) since 23.1.0
r600/sfn: “Indexed locks of kcache banks 14 and 15 are ignored” in the ALU clause documentation
turnip: Logarithmic-time subgroup reductions using brcst.active and getlast.w8 instructions on a6xx gen4+
GTF-GL46.gtf42.GL3Tests.texture_storage.texture_storage_compressed_texture_data regression
microsoft/compiler: Missing globally-coherent logic
Regression between 23.0.4 and 23.1.0: texture glitches in osgEarth
[Broadcom] Warning when runnin every OpenGL game on Vulkan using ZINK
radeonsi unsynchronized flips/tearing with KMS DRM rendering on 780M
radeonsi has an unchecked hard dependency on libelf
DR crashes with mesa 24 and rusticl (radeonsi)
Piglit tests assert on gen9 with zink
vlc crashes when playing 1920x1080 video with Radeon RX6600 hardware acceleration and deinterlacing enabled.
[radeonsi] Regression: graphical artifacting on water texture in OpenGOAL
Assertion when creating dmabuf-compatible VkImage on Tigerlake
Palworld fails to launch on Intel Arc unless “force_vk_vendor” is set to “-1”.
panfrost: implement line smoothing
r300: backend DCE fails in piglit glsl-vs-copy-propagation-1.shader_test
[AMDGPU RDNA3] Antialiasing is broken in Blender
MTL: vulkan cooperative matrix tests gpu hang on MTL
nvk: Implement VK_KHR_zero_initialize_workgroup_memory
Assassin’s Creed Odyssey wrong colors on Arc A770
VAAPI: EFC on VCN2 produces broken H264 video and crashes the HEVC encoder
etnaviv, modesetting, and glxgears
The Finals fails to launch with DX12 on Intel Arc unless “force_vk_vendor” is set to -1.
nvk: `VK_KHR_zero_initialize_workgroup_memory` and `VK_KHR_shader_subgroup_extended_types` not marked as complete in features.txt
nvk: Implement variableMultisampleRate
VA-API CI tests freeze
radv: games render with garbage output on RX5600M through PRIME with DCC
Warning when use ALIGN over uint64_t and uintptr_t
Changes¶
Adrian Perez de Castro (1):
Revert “egl/wayland: Remove EGL_WL_create_wayland_buffer_from_image”
Agate, Jesse (6):
amd/vpelib: Studio Range Handling
amd/vpelib: White Screen Fix
amd/vpelib: VPT Failing Test Cases
amd/vpelib: VPE integration for HLG
amd/vpelib: Add PQ Norm to VPE interface
amd/vpelib: Refactor norm factor logic
Alan Liu (4):
radeonsi/vpe: Add environment variable to set embbuf number
radeonsi/vpe: Don’t map and unmap emb_buffer every time in process_frame
amd/vpelib: remove unused header file
radeonsi/vpe: support vpe 1.1
Alejandro Piñeiro (3):
broadcom/compiler: fix coverity warning (unitialized pointer read)
v3dv/bo: use mtx_lock/unlock on cache_init too
v3dv: expose VK_EXT_depth_clip_enable
Alexandre Marquet (1):
pan/mdg: quirk to disable auto32
Alyssa Rosenzweig (328):
nir/lower_ssbo: rewrite
nir/lower_blend: return progress
nir/lower_io_to_temporaries: return prog
nir/lower_clip_cull_distance_arrays: return prog
nir: return prog from drawpixels
nir/lower_bitmap: return prog
nir/lower_alpha_test: rewrite with intrinsics_pass
nir/lower_point_size_mov: return prog
nir/lower_passthrough_edgeflags: return progress
nir/lower_io_arrays_to_elements: return prog
nir/lower_flatshade: fix metadata
glsl: return progress in point size linking
glsl: don’t use NIR_PASS_V
glsl: fix metadata in gl_nir_zero_initialize_clip_distance
mesa/st: return progress in st_nir_lower_wpos_ytransform
mesa/st: use instructions_pass for plane lowering
mesa/st: return progress lowering builtins
mesa/st: don’t use NIR_PASS_V
agx: rm deadcode
agx: ingest undefs in the backend
agx: stop lowering in opt loop
agx: only lower vars to ssa once
agx: fix metadata in layer lowering
agx: unset silly nir opts
agx: return progress from passes
asahi: return progress from passes
asahi: avoid silly internal NIR_PASS in gs lowering
asahi: don’t use NIR_PASS for removing entrypoints
asahi: don’t use NIR_PASS_V
nir/passthrough_gs: plug leak
compiler,gallium: move u_decomposed_prim to common
nir/passthrough_gs: flesh out gs_in_prim
compiler: add a vs.tes_agx bit
asahi: add more uapi stubs
asahi: gut macOS related code
asahi: lower poly stipple
asahi: Implement skeleton for tessellation
asahi: fix metadata for images with VS lowered to GS
asahi: implement VBO robustness
asahi: implement reset queries
asahi: enable robustness
asahi: fix unbound ssbos
asahi: optimize more when linking libagx
asahi: decode uniform_high records
agx: implement load_subgroup_invocation
agx: lower more subgroups
agx: introduce ballot pseudo
agx: fuse ballot+cmp
nir: add active_subgroup_invocation_agx sysval
agx: implement active_subgroup_invocation_agx
agx: optimize first_invocation
agx: optimize vote_eq
asahi: fix prim restart unrolling with indirects
asahi: delete bogus assertion
asahi: plug passthrough tcs leak
asahi: rework meta shader infra
asahi: plug geometry shader leaks
asahi: plug pre-gs leak
asahi: plug early_serialized_nir leak
asahi: plug so target leak
asahi: plug glsl type leak
asahi: plug geometry heap leak
asahi: fix UB in qbo’s
agx: add some more bitop tests
Revert “asahi: don’t canonicalize nans/flush denorms when copying”
asahi: sync with query mismatches
asahi: enable tcs caching
asahi: don’t sync for uninitialized buffer
asahi: fix valid buffer tracking for SSBO/image/XFB
asahi: handle read-only SSBOs
asahi: honour discard_whole | persistent
agx: only run early tests if needed
docs/asahi: fix strided linear note
ail: add tests for linear<–>twiddled copies
ail: port tiling routines to c++
ail: use template for tiled memcpy
agx: don’t inline imms into stack_store
agx: optimize b2x(inot)
agx: reassociate bcsel with ior/iand
asahi: implement pipeline stats as a checkbox
asahi: log geometry shaders separate from xfb
asahi: don’t use util_resource_size
asahi: fix vbo dirty track
asahi: force inline ppp update logic
asahi: skip set if tested
asahi: rm dead
asahi: track bit count, not word count
asahi: enable compblit behind dbg flag
asahi: allow disk cache with compblit
asahi: assert invariant
asahi: drop silly else
asahi: rewrite queries
asahi: split up stage uniform upload
asahi: dirty track stage uniforms
asahi: collapse stage uniform upload
asahi: optimize “no changes” case
asahi: optimize no changes descriptor case
asahi: move some code into dirty tracking
asahi: drop any_draws
asahi: fix instance count with indirect draw
asahi: collapse if
asahi: hoist xfb code
asahi: hoist layer id code
asahi: rm blank
asahi: track batches with incoherent writes
asahi: optimize memory_barrier
asahi,agx: use intrinsics pass
agx: clamp register file based on workgroup size
agx: improve scratch size accounting
asahi: add has_scratch to shader key
agx: set nr_preamble_gprs for preamble scratch
asahi: allocate preamble scratch
agx: allow 16-bit immediate on stack load/store
agx: print register vectors
agx: introduce “memory variables”
agx: add spill/fill lowering pass
agx: unit test memory parallel copies
agx: unit test spill/fill lowering
agx: add parallel copy printing
agx: add =spill debug option
asahi: bump max threads per wg
asahi: drop xfb hack
asahi: allow vertex/geom/tess side effects
agx: fix buffer overflow with varying slots
asahi,agx: use hw clip distance
asahi: fix dirty tracking issue
asahi: rip out existing MDI+GS implementation
libagx: fix buggy align macro
asahi: make GS flatshade_first more dynamic
libagx: use native static_assert on host
libagx: use real PACKED macro
libagx: static assert some sizes
libagx: generalize vertex_id_for_topology
asahi: simplify IA mode handling
asahi: add shader_info::outputs for gs lower
asahi: add geometry parameters for separable GS
asahi: rework shader stage handling a bit
asahi: separate GS from VS
asahi: rm arrayed output lowering
asahi: allow bindful GS textures
asahi: shrink GS key
asahi: infer stage in descriptor update
asahi: be a bit more methodical with shader stages
nir: rm load_vert_id_in_prim_agx
asahi: allow lowering bindings after lowering textures
asahi: collapse indirection with GS
asahi: support stage override in sysval lower
asahi: set gs_grid[0] even for direct draws
asahi: use load_instance_id in gs lowering
asahi: fix vertex out size calc
asahi: invert geometry shaders
asahi: implement GS disk caching
asahi: rm dead
asahi: simplify expressions involving xfb
asahi: avoid silly psiz writes even with gs
asahi: eliminate tri fan %
asahi: make provoking vertex dynamic
asahi: make gs topology dynamic
asahi: support GS in shaderdb
asahi: always support ARB_clip_control
asahi: make clip_halfz dynamic
asahi: rm ia key
agx: remove discard -> zs_emit lower
agx: rm dead sample count argument
agx: call agx_nir_lower_sample_mask earlier
agx: rm unused backend nr_samples
agx: rm unused opt_ixor_bcsel
agx: sink wait_pix
asahi: Implement ARB_texture_barrier by decompression
asahi: quelch gcc warning
agx: rm ridiculous dependency
agx: decouple compiler from genxml
agx: use #pragma once
asahi/lib: use #pragma once
ail: use #pragma once
asahi: use #pragma once
asahi: clean up format table renderability
asahi: split out genxml/ directory
agx: move SSBO lowering
agx: call texture lowering in the driver
agx: move texture lowering into lib
agx: decouple from libagx
asahi: reorder compiler before clc
asahi: precompile helper program
agx: add “is helper program?” key bit
asahi: advertise GL4.6 and ES3.2
docs: update for GL4.6 and ES3.2 on asahi
vulkan: add vk_index_type_to_restart helper
tu: use vk_index_to_restart
anv,hasvk: use vk_index_to_restart
util/hash_table: add u64 foreach macro
util/ralloc: add memdup
treewide: use ralloc_memdup
panfrost: Add a library to build CSF command streams
panfrost: Add support for the CSF job frontend
nir/opt_shrink_vectors: hoist alu helpers
nir/opt_shrink_vectors: shrink some intrinsics from start
util: add _mesa_hash_table_u64_num_entries
nir/print: do not print empty lists on intrinsics
util/hash_table: add DERIVE macro
panfrost: derive ht
asahi: derive ht
nvk: derive ht
radeonsi: derive ht
v3d: derive ht
glsl_types: derive ht
asahi: bump maximum samplers for Blender
asahi: allow more samplers for shaderdb
asahi: move more code out of agx_preprocess_nir
asahi/lib: fix overread with stateful
asahi: fix overread with samplers
asahi: clarify how unroll index buffers are offsetted
asahi: zero more in the unroll path
asahi: fix unit mismatch with unroll path
asahi: fix stage accounting for meta compute shaders
asahi: export build_meta_shader
asahi: add flush_query_writers helper
asahi: add helper to classify queries
asahi: accelerate QBO copies
asahi: fix depth bias interactions with points/lines
asahi: implement CDM stream linking for GS
asahi: be robust against tess batch changes
asahi: stop merging VS and TCS
asahi: drop TCS key
asahi: drop asahi_vs_next_stage
libagx: improve static assert message
asahi/clc: fix mem leaks
agx/opt_cse: alloc less
agx: fix stack smash with spilling
agx: fix allocating phi sources past the reg file
agx: add more asserts
agx: add num_successors helper
agx: fix 16-bit mem swaps
agx: scalarize vector phis
agx: allow vector phis to pass validation
agx: assert phis don’t have .kill set
agx: fix bogus implicit cast with 2d msaa arrays
agx: sink harder
agx: implement live range splits of phis
agx: don’t leak shuffle copies
agx: add more iterator macros
agx: add temp_like helper
agx: add before_function cursor
agx: add limit for max sources per non-phi
agx: coalesce phi webs
agx: try to coalesce moves
agx: drop scratch regs for spilling
agx: validate phi sources for consistency
agx: add SSA reindexing pass
agx: add SSA repair pass
agx: add Braun-Hack spiller pass
agx: switch to Braun-Hack spiller
agx: use dense reg_to_ssa map
agx: make add_successor public
agx: add helpers for multiblock unit tests
agx: add tests for SSA repair
agx: move spill/fills accounting to shaderdb
agx: enable indirect temps
agx: generalize remat code
agx: implement get_sr remat
asahi: use less bindless samplers
agx: add more shaderdb stats
agx: fix lowering uniforms with abs/neg
agx: restrict high uniforms with textures
agx: extract “accepts uniform?” ISA query
agx: model 64-bit uniform restriction on ALU
agx: extract agx_is_float_src
agx: promote constants to uniforms
agx: compact 32-bit constants
agx: test constant compaction
agx: implement load_subgroup_id
libagx: polyfill glsl ballot()
libagx: accelerate restart unroll across a subgroup
libagx: accelerate prim restart unroll across wg
libagx: deal with silly NIR
libagx: parallelize prefix sum over 1024 threads
agx: use funop short form
agx: split select opt into its own pass
agx: vectorize uniform_store
agx: start a crude cycle model
agx/opt_preamble: improve preamble cost function
agx/opt_preamble: restrain ourselves
agx/opt_preamble: preamble cycle estimates
agx/opt_preamble: improve rewrite cost est
docs/asahi: document UVS
nir: add offset to load_coefficients_agx
nir: add intrinsics for lowered VS outputs
asahi: add agx_push_packed
asahi: drop =varyings debug
asahi: extract agx_cf_binding
agx: explicitly assign coeff registers
agx: pack indirect CF
agx: handle indirect varyings
asahi: advertise indirect fs inputs
agx: rm unnecessary iter hack
agx: pack indirect st_vary
agx: inline imm into st_vary
asahi: rewrite varying linking
asahi: drop now-empty base key
asahi: make point size replacement dynamic
asahi: stop using GLSL indirect lowering
agx/lower_vbo: dce as we go
asahi: drop dead linked_so code
asahi: use ht derive more
asahi: fix _packed USC structs
asahi: delete layer id code
asahi: don’t set writes_memory for tib spilling
agx: optimize out wait_pix in some cases
agx: inline sampler states
agx: always reserve sampler #0 for txf
asahi: fix bit sizes in point sprite lower
nir: add samples_log2_agx sysval
nir: add export/load_exported_agx intrinsics
agx: wire up samples_log2 sr
agx: generalize preloaded cache
agx: implement exports
agx: document non-monolithic ABI
asahi: add agx_usc_push_packed helper
asahi: constify agx_build_tilebuffer_layout
asahi: don’t allocate tib space for gaps
nir: add intrinsics for non-monolithic agx shaders
agx: drop shader stage assertion
asahi: static assert blend key size
agx: add agx_shader_part data structure
agx: add main_size info
asahi: add fast linker
asahi/clc: stop padding binaries
asahi: switch to VS/FS prolog/epilog system
Amber (3):
tu: wideLines support for a7xx.
tu: Add MESA_VK_DYNAMIC_RS_LINE_WIDTH to tu_rast_state.
tu: re-emit vertex buffer on MESA_VK_DYNAMIC_VI_BINDINGS_VALID dirty.
Amber Harmonia (1):
freedreno/common: Fix register stomper ranges for A7XX
Andres Calderon Jaramillo (1):
radeonsi: get enc/dec caps from kernel only on amdgpu
Antoine Coutant (2):
clc: retrieve libclang path at runtime.
drisw: fix build without dri3
Anton Bambura (2):
panfrost: Enable Mali-T600
docs/panfrost: Document Mali-T600 support
Antonio Gomes (6):
mesa/st: Skip querying PCI values in interop_query_device_info if version >= 4
rusticl/gl: Bump mesa_glinterop_device_info to version 4
gallium: Add new PIPE_CAP_CL_GL_SHARING
iris: Set PIPE_CAP_CL_GL_SHARING to true
radeonsi: Set PIPE_CAP_CL_GL_SHARING to true
rusticl/device: Verify for PIPE_CAP_CL_GL_SHARING when enabling gl_sharing
Arthur Huillet (1):
nvk: remove useless MME scratch 26 usage
Asahi Lina (17):
asahi: libagx: introduce AGX_STATIC_ASSERT
agx: Rename some SRs
nir: Add AGX-specific helper opcodes
agx: Hook up AGX helper NIR intrinsics
agx: Hook up helper intrinsics into CL
agx: Add scaffolding to build the helper shader at device init
agx: compiler: Add fence_helper_exit_agx barrier
agx: compiler: Export scratch size to the driver
agx: compiler: Enable stack_adjust
asahi: libagx: Move PACKED and GLOBAL macros to libagx.h
asahi: cmdbuf: Fix scratch bucket offset/size
asahi: Implement scratch allocation
asahi: scratch: Add feature to debug core IDs
asahi: Hook up scratch
asahi: Allocate scratch for shaders
asahi: Enable scratch debugging
asahi: batch: Trace before waiting for syncobj
Assadian, Navid (2):
amd/vpelib: Apply inverse gamut remap to background
amd/vpelib: Use uint64 for buffer size
Axel Davy (5):
frontend/nine: Fix ff ps key
frontend/nine: Fix programmable vs check
frontend/nine: Fix missing light flag check
frontend/nine: Fix destruction race
frontend/nine: Reset should EndScene
Bas Nieuwenhuizen (10):
util/disk_cache: Add marker on cache usage.
radv: Remove ray_launch_size_addr_amd system value.
radv: Add winsys argument to buffer map/unmap.
radv/winsys: Use radv_buffer_map wrapper.
radv/amdgpu: Use mmap directly.
radv: Support for mapping a buffer at a fixed address.
radv: Implement reserving the VA range on unmap.
radv: Expose VK_EXT_map_memory_placed.
radv: Fix differing aspect masks for multiplane image copies.
radv: Use zerovram for Enshrouded.
Benjamin Lee (14):
nak: support predicate swaps on SM50
nak: support predicate sel on SM50
nak: fix frnd on SM50
nak: implement FSWZADD on SM50
nak: implement FLO on SM50
nak: fix iabs on SM50 with an explicit i2i op
nak: implement rro op on SM50
nak: use rro when emitting mufu on SM50
nak: implement kill op on SM50
nak: implement cs2r op on SM50
nak: handle nop ops from NAK IR on SM50
nak: fix lod mode encoding for SM50 tld op
nak: fix tex offset encoding on SM50
nvk: disable shaderResourceMinLod on pre-sm70
Benjamin Tissoires (3):
CI: add mr-label-maker.yml config
.mr-label-maker.yml: fix wrong label
CI: add a test for checking the validity of .mr-label-maker.yml
Biju Das (1):
gallium: Add Renesas rzg2l-du DRM entry point
Blisto (1):
driconf: set vk_x11_strict_image_count for Atlas Fallen Vulkan
Bob Beckett (2):
panfrost: Add an entry for panthor in the renderonly_drivers[] array
panfrost: Add the gallium glue to get panfrost loaded when panthor is detected
Boris Brezillon (193):
panvk: Fix tracing
panvk: Fix access to unitialized panvk_pipeline_layout::num_sets field
panfrost: Kill unused forward declarations in pan_texture.h
panfrost: Add a per-gen panfrost_format_from_pipe_format() helper
panfrost: Add a per-gen panfrost_blendable_format_from_pipe_format() helper
panfrost: Make panfrost_format_to_bifrost_blend() a per-gen helper
panfrost: Add panfrost_[blendable]_format_table() helpers
panfrost: Move panfrost_is_yuv() to pan_format.h
panfrost: Move YUV-debugging out of panfrost_new_texture()
panfrost: Stop passing a panfrost_device to panfrost_new_texture()
panfrost: Don’t pass a panfrost_device to panfrost_format_supports_afbc()
panfrost: Don’t pass a panfrost_device to panfrost_afbc_can_tile()
panfrost: Stop passing a panfrost_device to pan_blend_get_internal_desc()
panfrost: Stop exposing pan_blend_create_shader()
panfrost: Stop passing a panfrost_device to pan_blend_create_shader()
panfrost: Stop passing a panfrost_device to pan_inline_rt_conversion()
panfrost: Make the pan_blend logic panfrost_device-agnostic
panfrost: Get rid of unused panfrost_device arguments in pan_blitter.c
panfrost: Pass the tile buffer budget through pan_fb_info
panfrost: Pass the sample position array through pan_fb_info
panfrost: Pass no_hierarchical_tiling info through pan_tiler_context
panfrost: Pass tiler heap info through pan_tiler_context
panvk: Inline pan_wls_mem_size()
panfrost: Make pan_desc.{c,h} panfrost_device agnostic
panfrost: Drop unused panfrost_device forward declaration in pan_shader.h
panfrost: Make pan_layout.c panfrost_device agnostic
panfrost: Make pan_sample.c panfrost_device agnostic
panfrost: Make pan_encoder.h panfrost_device agnostic
panfrost: Remove unused header inclusions from pan_blitter.h
panfrost: Make pan_blitter.h includable from non per-gen files
panfrost: Make pan_blitter.{c,h} panfrost_device agnostic
panfrost: Make pan_indirect_dispatch panfrost_device agnostic
panfrost: Make pan_pool.h panfrost_{device,bo} agnostic
panfrost: Make pan_props.c panfrost_device agnostic
panfrost: Make pan_texture.{c,h} panfrost_bo agnostic
panfrost: Make pan_desc.{c,h} panfrost_bo agnostic
panfrost: Remove uneeded pan_device.h inclusions
panfrost: Make panfrost_texfeatures.c panfrost_device agnostic
panfrost: Make pan_perf panfrost_device agnostic
panfrost: Add a helper to retrieve a panfrost_bo from a pan_kmod_bo
panvk: Get rid of unused pdev arguments passed to some meta helpers
panvk: Stop passing panfrost_device around in internal meta helpers
panvk: Store various physical device properties at the physical_device level
panvk: Use vk_device::drm_fd instead of going back to the physical device
panvk: Move panfrost_device and panvk_meta to panvk_device
panvk: Add a decode context at the panvk_device level
panvk: Instantiate our own blitter/blend_shader caches
panvk: Add pan_kmod_{vm,dev} objects to panvk_device
panvk: Add the concept of private BO
panvk: Transition panvk_pool to panvk_priv_bo
panvk: Transition panvk_descriptor_set to panvk_priv_bo
panvk: Transition panvk_pipeline to panvk_priv_bo
panvk: Transition panvk_{image,buffer}_view to panvk_priv_bo
panvk: Track blit src/dst using pan_kmod_bo objects
panvk: Keep a ref to a pan_kmod_bo in panvk_image
panvk: Keep a ref to a pan_kmod_bo in panvk_buffer
panvk: Keep tiler_heap and sample_positions BOs at the panvk_device level
panvk: Move away from panfrost_{bo,device}
panfrost: Move pan_{bo,device}.{c,h} to the gallium driver dir
panfrost: Clamp the render area to the damage region
panfrost: v4 doesn’t have Blend descriptors
panfrost: Pad compute jobs with zeros on v4
pan/va: Add missing valhall_enums dep to valhall_disasm
pan/kmod: Fix typo in pan_kmod_vm_op_check() helper
pan/kmod: Add a PAN_KMOD_VM_FLAG_TRACK_ACTIVITY flag
pan/kmod: Reject pre 1.1 panfrost kernel drivers
panfrost: Rework the way we compute thread info
panfrost: Prepare support for GPU variants
pan/perf: Reject panthor kernel driver
drm-uapi: Add panthor uAPI
pan/kmod: Add a backend for panthor
panfrost: Add v10 support to libpanfrost
pan/genxml: Various CS related improvements in v10.xml
pan/decode: Introduce the concept of usermode queue
panfrost: Don’t allocate a tiler heap buffer on v10+
pan/genxml: Make sure pan_pack() evaluates ‘dst’ only once
panfrost: Relax position result alignment constraint on v10+
panfrost: Add arch-specific context init/cleanup hooks
panfrost: Add a panfrost_context_reinit() helper
panfrost: Add a cleanup_batch() method to panfrost_vtable
panfrost: Enable v10 in the gallium driver
panfrost: Advertize G610 support
panfrost: Advertize G310 support
panfrost: Update the release note to mention G310/G610 addition
vk/meta: Add the PUSH_DESCRIPTOR_BIT flag when creating blit pipeline layouts
vk/meta: Fix base_type selection in build_{clear,blit}_shader()
panvk: Fix call ordering in panvk_DestroyDevice()
panvk: clang-format the source files
panvk: Kill the panvk_pack_color() prototype
panvk: Add VKAPI_{ATTR,CALL} specifiers to all panvk-specific entrypoints
panvk: Do not handle invalid NULL memory object in BindImageMemory2()
panvk: Get rid of unused panvk_image_get_plane_size() helper
panvk: Get rid of the custom device lost handling
panvk: Fix allocation scope of command buffer sub-objects
panvk: Add missing util_dynarray_init() in panvk_cmd_open_batch()
panvk: Don’t open-code panvk_cmd_open_batch() in CmdBeginRenderPass2()
panvk: Don’t allocate a TEXTURE descriptor in CreateImageView()
panvk: s/panvk_event_op/panvk_cmd_event_op/
panvk: Allocate descriptor set arrays using vk_multialloc_zalloc()
panvk: Don’t pass a device where we don’t need one
panvk: Get rid of unused panvk_cmd_buffer fields
panvk: Kill panvk_{Create,Destroy}SamplerYcbcrConversion()
panvk: Drop panvk_framebuffer
panvk: Get rid of panvk_pipeline_cache
panvk: Make panvk_buffer_view inherit from vk_buffer_view
panvk: Make panvk_device_memory inherit from vk_device_memory
panvk: Make pan_AllocateMemory() robust to errors
panvk: Add extra checks to panvk_MapMemory()
panvk: Implement {Map,Unmap}Memory2KHR
panvk: Make panvk_sampler inherit from vk_sampler
panvk: Fix GetPhysicalDeviceProperties2() to report accurate info
panvk: Get rid of fields we already have in vk_xxx objects
panvk: Disable global offset on varying and non-VS attribute descriptors
panfrost: Move the image attribute offset adjustment to a NIR pass
panvk: Implement dynamic rendering entry points
nir: Extend nir_get_io_offset_src_number() to support load_push_constant
nir: Extend nir_lower_mem_access_bit_sizes() to support push constants
pan/bi: Lower push constant accesses
pan/bi: Lower load_push_constant with dynamic indexing
pan/bi: Update the push constant count when emitting load_push_constant
panvk: Move some macros to panvk_macros.h
panvk: Move image related definitions to panvk_image.{h,c}
panvk: Move the VkBuffer logic to its own source file
panvk: Move the VkBufferView logic to its own file
panvk: Move the VkDeviceMemory logic to panvk_device_memory.{c,h}
panvk: Move the VkSampler logic to its own file
panvk: Move panvk_pipeline definition to panvk_pipeline.h
panvk: Move VkImageView logic to its own source files
panvk: Move the VkEvent logic to panvk_event.{c,h}
panvk: Move panvk_descriptor_{set,pool} definitions to panvk_descriptor_set.h
panvk: Move VkDescriptorSetLayout logic to panvk_descriptor_set_layout.{c,h}
panvk: Move VkPipelineLayout logic to its own file
panvk: Move shader related definitions to panvk_[vX_]shader.{c,h}
panvk: Kill panvk_[vX_]cs.{c,h}
panvk: Move panvk_{draw,dispatch}_info definitions to panvk_vX_cmd_buffer.c
panvk: Move the VkCommandPool logic to panvk_cmd_pool.{c,h}
panvk: Move VkQueue logic to panvk_[vX_]queue.{c,h}
panvk: Add a panvk_arch_dispatch_ret() variant
panvk: Make the device creation/destruction per-arch
panvk: Move the VkInstance logic to panvk_instance.{c,h}
panvk: Move the VkPhysicalDevice logic to panvk_physical_device.{c,h}
panvk: Move panvk_meta definitions to panvk_meta.h
panvk: Move panvk_device definition to panvk_device.h
panvk: Move the panvk_cmd_buffer definitions in panvk_cmd_buffer.h
panvk: Move the panvk_priv_bo logic to panvk_priv_bo.{c,h}
panvk: Move panvk_wsi definitions to panvk_wsi.h
panvk: Kill panvk_private.h
panvk: Make panvk_buffer_view per-gen
panvk: Make panvk_image_view per-gen
panvk: Make panvk_sampler a per-gen
panvk: Make panvk_cmd_buffer per-gen
panvk: Make panvk_shader per-gen
panvk: Make panvk_descriptor_set per-gen
panvk: Make panvk_descriptor_set_layout per-gen
panvk: Make panvk_pipeline per-gen
panvk: Make panvk_queue per-gen
panvk: Make panvk_pipeline_layout per-gen
panvk: Fix attach-less rendering
panvk: Fix the colorAttachmentCount check in begin_rendering_init_fbinfo()
pan/bi: Support fragment store_output() with a non-zero offset
panvk: Don’t assume VkGraphicsPipelineCreateInfo::pColorBlendState != NULL
pan/bi: Allow subpass sampler dims
panvk: Fix input attachment support
panvk: Fill pan_tls_info::wls::instances
panvk: Make sure the sample_pattern is set in the tiler descriptor
panvk: We don’t support resolve operations yet
pan/bi: Extend bi_emit_texc() to support wider direct tex/sampler idx
panvk: Don’t assume pViewportState != NULL
panvk: Fix img2buf copies with image X offset not aligned on 16 pixels
panvk: Fix has_non_vs_attribute() test in panvk_draw_prepare_vs_attribs()
panvk: Make sure we pick a valid wrap_mode_r value for unnormalizedCoordinates
panvk: Fix depth/stencil image views
panvk: Make sure we have a decode context created when we need one
panvk: Don’t advertize vertex_buffer cap on sRGB formats
panvk: Swizzle the border color on v7 when the format is BGR
panvk: Re-order things in panvk_physical_device_init()
panvk: Fill maxCustomBorderColorSamplers
panvk: Skip tiler jobs when the vertex shader doesn’t write the position
panvk: Make sure we use the proper format for views of depth+stencil images
panvk: Abort on fault when PANVK_DEBUG=sync
panvk/ci: Make sure we catch GPU faults
panvk/ci: Enable dEQP-VK.pipeline.monolithic.*
panvk: Add support for KHR_push_descriptor
panvk/ci: Re-enable copy_and_blit tests
panvk: Stop declaring one push constant array per graphics stage
panvk: Pass the push constant array to draw/dispatch calls
panvk: Stop lowering push constant loads to UBO loads
panvk: Dissociate UBO and push_constant emission
nir/lower_blend: Fix nir_blend_logicop() for 8/16-bit integer formats
panfrost: do not write outside num_wg_sysval
panfrost: Add the BO containing fragment program descriptor to the batch
pan/kmod: Fix a syncobj leak in the panthor backend
pan/kmod: Make default allocator thread-safe
Boyuan Zhang (5):
radeonsi/vcn: only use multi slices reflist when available
meson: bump the minimal required vdpau version to 1.4
ac/gpu_info: Add vcn dec and enc version query
radeonsi/vcn: choose rc_per_pic by encode verison
radeonsi/vcn: mark rc_per_pic as obsoleted
Błażej Szczygieł (2):
gallivm/ssbo: replace run time loop by compile time loop
gallivm/ssbo: mask offset with exec_mask instead of building the ‘if’
Caio Oliveira (268):
intel/compiler/xe2: Implement instruction compaction for DPAS.
intel/compiler: Add couple of tests for fs_combine_constants
intel/compiler: Fix rebuilding the CFG in fs_combine_constants
intel: Use an intel enum for cmat scope
intel/compiler: Enable lower_rotate_to_shuffle in subgroup lowering
anv: Advertise VK_KHR_shader_subgroup_rotate
iris: Remove unused brw_* includes
iris: Remove prototypes for unsupported Gfx versions
iris: Remove unused paramater
iris: Call blorp_finish() when destroying context
crocus: Call blorp_finish() when destroying context
intel/compiler: Rename brw_image_param to isl_image_param
intel/compiler: Rename BRW_WM_MSAA_* enums to INTEL_MSAA_*
intel/compiler: Rename BRW_TESS_* enums to INTEL_TESS_*
intel/compiler: Rename DISPATCH_MODE_* enums to INTEL_DISPATCH_MODE_*
intel/compiler: Rename brw_vue_map to intel_vue_map
intel/compiler: Rename brw_cs_dispatch_info to intel_cs_dispatch_info
intel/compiler: Move disassemble functions to own header file
intel/compiler: Include brw_disasm_info.h where its used
intel/compiler: Merge intel_disasm.[ch] into corresponding brw files
intel: Rename i965_{asm,disasm} tools to brw_{asm,disasm}
intel/blorp: Don’t require specific prog_data type in callback
intel/blorp: Remove brw_ prefix when not applicable
intel/blorp: Simplify blorp_compile_fs() interface
intel/blorp: Simplify blorp_compile_cs() interface
intel/blorp: Use a struct to return blorp_compile_*() results
intel/blorp: Remove outdated reference in comment
intel/blorp: Move brw_blorp_get_urb_length helper
intel/blorp: Avoid brw types in blorp_priv.h
intel/blorp: Move brw_compiler.h include to where is needed
intel/blorp: Use a Meson dependency for blorp
intel: Add missing dependencies on blorp
intel/decoder: Move decoder to a separate module
intel/compiler: Collect NIR-only passes in intel_nir.h
intel/compiler: Rename the passes and files related to intel_nir.h
intel/compiler: Rename brw_gfx_ver_enum.h to intel_gfx_ver_enum.h
intel: Remove brw_ prefix from process debug function
intel/isl: Include compiler generic header
anv: Remove lower_atomics from storage image lowering opts
iris: Remove no-ops from storage image lowering
intel/compiler: Use “intel” prefix for walk_order enum
iris: Add stage to iris_compiled_shader
iris: Don’t use prog_data to guard 3DSTATE_CONSTANT_* code
iris: Reduce dependency on brw_*_prog_data structs
iris: Take ownership of prog_data when applying it
iris: Use uint32_t instead of brw_param_builtin
iris: Move compiler creation to iris_program.c
iris: Add IRIS_MAX_* constants to replace BRW_MAX_* usage
iris: Add helper to access use_tcs_multi_patch
iris: Add helper for indirect_ubos_use_sampler
iris: Move iris_get_compiler_options to iris_program.c
iris: Include brw_compiler.h only when needed
intel/meson: Remove usage of meson.source_root and meson.build_root
intel/meson: Fix warning about broken str.format
intel/elk: Fork Gfx8- compiler by copying existing code
intel/elk: Compile ELK library, tests and tools
intel/elk: Remove compiler specific devinfo hash
intel/elk: Remove a bunch of files that don’t apply for Gfx8-
intel/elk: Use common code in intel/compiler
intel/elk: Remove stages not used in Gfx8-
intel/elk: Remove DPAS lowering
intel/elk: Rename files to use elk prefix
intel/elk: Rename header guards
intel/elk: Update doxygen-like file comments
intel/elk: Rename C++ namespace
intel/elk: Rename symbols
intel/elk: Don’t include elk_eu_defines.h in elk_nir.h
intel/elk: Create separate header for opcodes
intel/blorp: Move brw specific code to a separate file
intel/blorp: Explicitly include brw_compiler.h header
intel/blorp: Add ELK support
intel/blorp: Remove Gfx9+ references in elk code
intel/decoder: Add ELK support
crocus: Use ELK compiler
hasvk: Use ELK compiler
iris: Rename screen->compiler to screen->brw
iris: Use ELK compiler for Gfx8
intel/tools: Add ELK support for aubinator
intel/tools: Add ELK support for aubinator_error_decode
intel/tools: Add ELK support for intel_hang_replay
intel/tools: Add ELK support for aubinator_viewer
intel/tools: Add ELK support for intel_hang_viewer
intel: Use _brw suffix for genX headers that rely on brw
intel/meson: Rename libintel_compiler to libintel_compiler_brw
intel/tools: Add extra compiler device sha only for Gfx9+
intel/elk: Move nir_options to its own c/h file pair
intel-clc: Use correct set of nir_options when building for Gfx8
intel/elk: Use anonymous namespace in fs_combine_constants
intel/elk: Remove tests for Gfx9+
intel/brw: Remove assembler tests for Gfx8-
intel/brw: Remove EU compaction tests for Gfx8-
intel/brw: Remove EU validation tests for Gfx8-
intel/brw: Remove pass test cases for Gfx8-
intel/brw: Assert Gfx9+
intel/compiler: Remove has_render_target_reads from wm_prog_data
intel/brw: Remove Gfx8- passes from optimize()
intel/brw: Pull opt_copy_propagation out of fs_visitor
intel/brw: Pull opt_cmod_propagation out of fs_visitor
intel/brw: Pull opt_saturate_propagation out of fs_visitor
intel/brw: Pull dead_code_eliminate out of fs_visitor
intel/brw: Pull opt_combine_constants out of fs_visitor
intel/brw: Pull opt_cse out of fs_visitor
intel/brw: Pull bank_conflicts out of fs_visitor
intel/brw: Pull peephole_sel out of fs_visitor
intel/brw: Pull redundant_halt out of fs_visitor
intel/brw: Pull opt_algebraic out of fs_visitor
intel/brw: Pull split/compact virtual_grf opts out of fs_visitor
intel/brw: Pull opt_split_sends out of fs_visitor
intel/brw: Pull opt_zero_samples out of fs_visitor
intel/brw: Pull eliminate_find_live_channel out of fs_visitor
intel/brw: Pull remove_extra_rounding_modes out of fs_visitor
intel/brw: Pull register_coalesce out of fs_visitor
intel/brw: Pull lower_constant_loads out of fs_visitor
intel/brw: Pull lower_pack out of fs_visitor
intel/brw: Pull lower_simd_width out of fs_visitor
intel/brw: Pull lower_barycentrics out of fs_visitor
intel/brw: Pull lower_logical_sends out of fs_visitor
intel/brw: Pull fixup_nomask_control_flow out of fs_visitor
intel/brw: Pull lower_integer_multiplication out of fs_visitor
intel/brw: Pull lower_sub_sat out of fs_visitor
intel/brw: Pull lower_derivatives out of fs_visitor
intel/brw: Pull lower_regioning out of fs_visitor
intel/brw: Pull fixup_sends_duplicate_payload out of fs_visitor
intel/brw: Pull lower_uniform_pull_constant_loads out of fs_visitor
intel/brw: Pull lower_find_live_channel out of fs_visitor
intel/brw: Pull lower_load_payload out of fs_visitor
intel/brw: Use references for a couple of backend_shader passes
intel/brw: Simplify OPT macro usage in fs_visitor::optimize
intel/brw: Pull fixup_3src_null_dest out of fs_visitor
intel/brw: Pull emit_dummy_memory_fence_before_eot out of fs_visitor
intel/brw: Pull emit_dummy_mov_instruction out of fs_visitor
intel/brw: Pull lower_scoreboard out of fs_visitor
intel/brw: Pull optimize() out of fs_visitor
intel/brw: Move optimize and small optimizations to brw_fs_opt.cpp
intel/brw: Move virtual GRF opts into their own file
intel/brw: Move fs algebraic to its own file
intel/brw: Move small lowering passes into brw_fs_lower.cpp
intel/brw: Move lower_integer_multiplication to its own file
intel/brw: Expose flag_mask/bit_mask fs helpers
intel/brw: Move lower_simd_width to its own file
intel/brw: Move workarounds to a separate file
intel/blorp: Remove Gfx8- references in BRW code
intel/brw: Move brw_compile_* functions out of vec4-specific files
intel/brw: Move type_size_* functions out of vec4-specific file
intel/brw: Always use scalar shaders
intel/brw: Remove vec4 backend
intel/brw: Remove now unused vec4-only opcodes
intel/brw: Remove unused legacy shader stages
intel/brw: Remove Gfx8- code from disassembler
intel/brw: Remove Gfx8- code from assembler
intel/brw: Remove Gfx8- code from brw_compile_* functions
intel/brw: Remove Gfx8- code from scheduler
intel/brw: Remove Gfx8- code from register allocator
intel/brw: Remove Gfx8- code from thread payload
intel/brw: Remove Gfx8- code from NIR conversion
intel/brw: Remove Gfx8- code from lower storage image pass
intel/brw: Remove Gfx8- code from lower logical sends
intel/brw: Remove Gfx8- code from generator
intel/brw: Remove Gfx8- code from backend passes
intel/brw: Remove Gfx8- code from EU compaction
intel/brw: Remove Gfx8- code from IR performance analysis
intel/brw: Remove Gfx8- code from EU emission
intel/brw: Remove Gfx8- code from EU validation
intel/brw: Remove Gfx8- code from NIR passes
intel/brw: Remove Gfx4-5 manual compression selection
intel/brw: Remove Gfx8- code from EU codegen helpers
intel/brw: Remove Gfx8- code from NIR options
intel/brw: Remove Gfx8- code from register type helpers
intel/brw: Remove Gfx8- specific EU inst helpers
intel/brw: Remove Gfx8- code from inst FC and F macros
intel/brw: Replace inst F8 macro with F macro
intel/brw: Remove Gfx8- code from inst F20 macros
intel/brw: Remove Gfx8- code from inst FD20 and FV20 macros
intel/brw: Remove Gfx8- code from inst FI macros
intel/brw: Remove Gfx8- code from inst BRW_IA*_ADDR_IMM macros
intel/brw: Remove Gfx8- code from inst FFDC, FDC and FD macros
intel/brw: Update comments for FK macro
intel/brw: Replace inst FF macro with F or F20 macros
intel/brw: Remove F16TO32 and F32TO16 opcodes
intel/brw: Remove Gfx8- code from builder
intel/brw: Remove Gfx8- code from fs_inst
intel/brw: Remove Gfx8- code from VUE map
intel/brw: Remove Gfx8- code from SIMD lowering
intel/brw: Remove Gfx8- code from visitor
intel/brw: Remove Gfx8- remaining opcodes
intel/brw: Remove MRF type
intel/brw: Inline brw_nir_apply_sampler_key code
intel/brw: Remove unused attrib workarounds
intel/brw: Remove edgeflag_is_last VS parameter
intel/brw: Remove Gfx8- fields from *_prog_key structs
intel/brw: Remove Gfx8- fields from *_prog_data structs
intel/brw: Use a single register set
intel/brw: Remove runtime_check_aads_emit
intel/brw: Remove automatic_exec_sizes
intel/brw: Use fs_visitor instead of backend_shader in various passes
intel/brw: Fold fs_instruction_scheduler into instruction_scheduler
intel/brw: Change cfg_t to refer to fs_visitor
intel/brw: Move dump_* functions into fs_visitor
intel/brw: Fold backend_shader into fs_visitor
intel/brw: Remove extra stage_prog_data field in fs_visitor
intel/brw: Remove brw_shader.h
intel/meson: Add dependencies for brw and elk
intel/compiler: Remove nir_print_instr hack in disasm_info
intel/brw: Use C++ for brw_disasm_info.c
intel/brw: Hide the definition of cfg_t et al from C code
intel/brw: Use fs_inst in cfg_t
intel/brw: Use fs_inst explicitly in various passes
intel/brw: Use fs_inst in disasm_annotate()
intel/brw: Move functions from backend_instruction into fs_inst
intel/brw: Fold backend_instruction into fs_inst
intel/brw: Remove typedefs from fs_builder
intel/brw: Fold backend_reg into fs_reg
intel/brw: Simplify usage of reg immediate helpers
intel/compiler: Fix SIMD lowering when instruction needs a larger SIMD
intel/elk: Remove split sends
intel/elk: Remove DPAS opcode
intel/elk: Remove BTD and RT opcodes
intel/elk: Remove DP4A opcode
intel/elk: Remove ROR and ROL opcodes
intel/elk: Remove IADD3 opcode
intel/elk: Remove EU compaction logic for Gfx9+
intel/elk: Remove encoding for Gfx9+
intel/elk: Remove SYNC opcode and SWSB annotations
intel/elk: Remove Gfx12 SFIDs and related LSC code
intel/elk: Remove Gfx9+ sampler messages and modes
intel/elk: Rename symbols for A64 OWord Block R/W messages
intel/elk: Remove Gfx9+ dataport messages
intel/elk: Remove FB_READ opcodes
intel/elk: Remove Gfx12.5 URB message
intel/elk: Remove ex_desc and ex_mlen from elk_inst
intel/elk: Remove Xe2 logical sends lowering
intel/elk: Remove unused sources from ELK_SHADER_OPCODE_SEND
intel/elk: Remove unused SEND features
intel/elk: Remove validation code for Gfx9+
intel/elk: Remove Gfx9+ from nir conversion
intel/elk: Remove Gfx9+ from compile/run functions
intel/elk: Remove FB_WRITE_LOGICAL_SRC_SRC_STENCIL
intel/elk: Remove Gfx9+ from passes
intel/elk: Remove Gfx9+ from thread payload
intel/elk: Remove Gfx9+ from EU emission
intel/elk: Remove coarse pixel handling
intel/elk: Remove Gfx9+ from FS generator
intel/elk: Remove Gfx9+ from Reg related code
intel/elk: Remove Gfx9+ from asm grammar
intel/elk: Remove Gfx9+ from disasm
intel/elk: Remove Gfx9+ from NIR auxiliary code
intel/elk: Remove use_tcs_multi_patch
intel/elk: Remove Gfx9+-only passes
intel/elk: Remove uses of intel_device_info_is_9lp()
intel/elk: Remove remaining Gfx9+ code
intel/elk: Remove multi-polygon support
intel/elk: Clean up unused code in elk_compiler.h
intel/brw: Use hstride instead of stride for accumulator
intel/brw: Use helper to create accumulator register
intel/brw: Fix validation of accumulator register
anv: Enable VK_KHR_shader_maximal_reconvergence
intel/tools: Make intel_stub_gpu work when using meson devenv
intel/brw: Implement quad_vote_any and quad_vote_all
intel/brw: Use predicates for quad_vote_any and quad_vote_all when available
anv: Enable VK_KHR_shader_quad_control
intel/brw: Handle Xe2 in brw_fs_opt_zero_samples
intel/brw: Remove vestiges of sources on IF opcode, only valid on Gfx6
intel/brw: Add a src array for the common case in fs_inst
intel/brw: Refactor FS validation macros
intel/brw: Remove two duplicated validate calls in optimizer
intel/brw: Move validate out of fs_visitor
intel/brw: Support FIXED_GRF when generating code for CLUSTER_BROADCAST
intel/brw: Lower VGRFs to FIXED_GRFs earlier
Casey Bowman (1):
anv: Override VendorID for Hitman 3
Charlie Turner (2):
amd, radeonsi: Lower minimum supported video dimensions for AV1
{vulkan,radv,anv}/video: fix issue in H264 scaling lists derivation
Chia-I Wu (7):
radv: fix pipeline stats mask
meson: fix a build ereror
radv: hide the sparse queue when radv_legacy_sparse_binding
radv: hide the sparse queue on older kernels
radv: set VK_SYNC_FEATURE_GPU_MULTI_WAIT
aco: fix nir_op_pack_32_4x8 handling
radv: fix 2d/3d image copy on compute queue
Chris Rankin (4):
vdpau: Declare texture object as immutable using helper function.
vdpau: Refactor query for video surface formats.
meson: bump the minimal required vdpau version to 1.5
frontends/vdpau: Add support for VDPAU AV1 decoding.
Christian Duerr (1):
panfrost: Fix dual-source blending
Christian Gmeiner (100):
.gitignore: Add .venv folder
etnaviv/isa: Add missing dep of encode.py/decode.py calls on isa.py
isaspec: encode.py: Include assert.h
isaspec: encode.py: Include util/log.h
etnaviv: Remove no_oneconst_limit from etna_inst
isaspec: encode: Constify encode.type
isaspec: encode: Constify bitset_params
etnaviv: Remove not used etna_assemble_set_imm(..)
etnaviv: Fix how we determine the max supported number of varyings
etnaviv: isa: Remove duplicate #instruction-alu-atomic
etnaviv: isa: Add dsx and dsy opcodes
etnaviv: isa: Add frc opcode
etnaviv: isa: Add norm_dp2, norm_dp3 and norm_dp4 opcodes
etnaviv: isa: Add bit_extract opcode
etnaviv: isa: Correct dp2 opcode
etnaviv: isa: Add branch_any opcode
etnaviv: isa: Name cond enum value 22
etnaviv: isa: Add movai opcode
etnaviv: isa: Add bit_rev opcode
etnaviv: isa: Add texldb opcode
etnaviv: isa: Add texldl opcode
etnaviv: isa: Add texldd opcode
etnaviv: isa: Remove note about GC3000
etnaviv: isa: Add div opcode
etnaviv: isa: Reorder instructions
etnaviv: isa: Rename reg_group u2 to u
etnaviv: isa: Add internal register group
etnaviv: isa: Add movar opcode
etnaviv: isa: Move {TEX_SWIZ}
etnaviv: isa: Correct SRC0_AMODE
etnaviv: isa: Correct #instruction-cf-src1-src2 bitset name
etnaviv: isa: Correct #instruction-alu-no-dst-maybe-src1-src2 name
etnaviv: isa: Correct #instruction-alu-no-dst-has-src0-src1 expr name
etnaviv: isa: Combine branch and branch_if
etnaviv: isa: Support unary branch instruction
etnaviv: isa: Support unary texkill instruction
etnaviv: isa: Support multiple encodings for texldl
etnaviv: isa: Fix #instruction-tex-src0-src1-src2 bitset
etnaviv: isa: Support multiple encodings for texldb
isaspec: Remove not used isa_decode_hook
isaspec: deocde: Hide all the internals ISA details
isaspec: decode: Add isa specific functions
isaspec: decode: Make isa_decode_bitset(..) private
freedreno/isa: Rework meson dependency for libir3decode
etnaviv: isa: Rework meson dependency for libetnaviv_decode
isaspec: deocde: Make isa_bitset arrays static
isaspec: deocde: Make isa_decode_field(..) private
isaspec: decode: Add libisaspec
isaspec: deocde: Remove generic functions from public interface
etnaviv: isa: Define a dontcare bit in atomic instructions
etnaviv: isa: Add name attributes
etnaviv: isa: Generate c header containing enums
etnaviv: isa: Generate opcode enum
etnaviv: isa: Add an empty libetnaviv_encode
etnaviv: Link against libetnaviv_encode
etnaviv: Move struct etna_inst to src/etnaviv
etnaviv: isa: Make use of generated enums
etnaviv: isa: Add rouding to etna_inst
etnaviv: Set dst.use for MOVAR
etnaviv: isa: Add encode support
etnaviv: isa: Add isa_assemble_instruction(..)
etnaviv: Switch to isa_assemble_instruction(..)
etnaviv: Move swizzle related macros to scr/etnaviv
etnaviv: Switch to macros from isa.h
etnaviv: Remove isa.xml.h
etnaviv: Do not set tex.amode for rounding
ci/etnaviv: Remove duplicates
ci/etnaviv: Do not skip tex-miplevel piglits
etnaviv: Remove offline shader compiler
etnaviv: Introduce common etna_core_info
etnaviv: drm: Make use of etna_core_info
etnaviv: drm: Add etna_gpu_get_core_info(..)
etnaviv: Switch to etna_core_info
etnaviv: Move hw header to common place
etnaviv: Introduce etna_feature enum
etnaviv: common: Add feature bitset
etnaviv: drm: Initialize etna_core_info based on kernel features
etnaviv: Switch to etna_core APIs
etnaviv: drm: Query some id values in etna_gpu_new(..)
etnaviv: hwdb: Import gc_feature_database from NXP
etnaviv: hwdb: Import gc_feature_database from Amlogic
etnaviv: hwdb: Import gc_feature_database from ST
ci: Install python3-pycparser in build container
etnaviv: hwdb: Generate hwdb.h
etnaviv: hwdb: Add etna_query_feature_db(..)
etnaviv: drm: Make use of hwdb
etnaviv: common: Add enum etna_core_type
etnaviv: common: Add some limit values
etnaviv: hwdb: Fill limits
etnaviv: drm: Fill limits
etnaviv: Copy values from etna_core_info
etnaviv: drm: Remove fallback value for ETNA_GPU_NUM_CONSTANTS
etnaviv: Drop not needed check if seamless cube map is supported
etnaviv: hwdb: Drop stdint.h dependency
nvk: Remove duplicate DRM_NODE_RENDER check
meson: Add missing newline at eof
etnaviv: Switch to etna_core_disable_feature(..)
etnaviv: Fix disabling of features
etnaviv: drm: Drop NPU-related params
clc: Always use spir for 32 bit
Collabora’s Gfx CI Team (4):
Uprev Piglit to e9316bcd12544aaf7e753ce37fe50d64165d9598
Uprev Piglit to 2a1c49a81cd9a6bf5d0c3a9b87225be94771ca96
Uprev Piglit to 1e631479c0b477006dd7561c55e06269d2878d8d
Uprev Piglit to dd6f7eaf82e8dd442da28b346c236141cbcce0b1
Connor Abbott (56):
freedreno: Add a7xx crashdump-related registers and enums
ir3/ra: Add specialized shared register RA/spilling
ir3: Set branchstack earlier
ir3: Rewrite (jp) and branchstack handling
ir3: Calculate physical edges correctly
ir3: Fix comment thinko
ir3/ra: Fix bug with collect source handling
tu: Add more info to ldg inline uniform path
ir3/a7xx: Fix load_global_ir3 with immediate offset
ir3: Initial support for pushing globals with ldg.k
tu: Follow pipeline compatibility rules for dynamic descriptors
tu: Reenable MSAA UBWC on a6xx gen1
tu: Enable UBWC for SNORM formats on a740+
tu: Enable UBWC for storage images on a7xx
vk/graphics_state: Remove bogus assert in CmdSetSampleMaskEXT
vk/graphics_state: Add stubs required by VK_EXT_shader_objects
freedreno/afuc: Decode (peek) modifier
freedreno/afuc: Add missing ALU encode case for bic
freedreno/afuc: Bump max instructions for a7xx
freedreno/afuc: Fix setbit/clrbit parsing
freedreno/afuc: Use left recursion in parser
freedreno/afuc: Improve jump table handling
freedreno/afuc: Add .align directive
freedreno/afuc: Add more general T_IDENTIFIER in lexer
freedreno/afuc: Add support for multiple sections when assembling
freedreno/afuc: Allow -e option on a7xx
freedreno/afuc: Emulate THREAD_SYNC on a660
freedreno/afuc: Run entire bootstrap routine
freedreno/afuc: Add a7xx test case
freedreno/afuc: Add magic control reg values for a740
freedreno/afuc: Add section on reassembling firmwares and relocations
freedreno/a7xx: Add CP_CCHE_INVALIDATE
tu: Implement CCHE invalidation
nir/divergence_analysis: Add ir3-specific intrinsics
nir/divergence_analysis: Add uniform_load_tears option
nir/divergence_analysis: Fix load_view_index divergence in VS
ir3: Allow single-predecessor phis
ir3: Run divergence analysis at the end
ir3: Remove loop shared copy check
ir3: Use divergence analysis for (jp) and physical CFG
freedreno/afuc: Switch to using the GPU ID in the firmware
freedreno/afuc: Add a7xx new-style branch instructions
freedreno/afuc: Add initial support for a750
freedreno: Make has_ibo_ubwc a7xx specific
freedreno,tu: Disable UBWC for storage images on a750
ir3/legalize: Fix intra-block state propagation with loops
ir3: Rewrite nop insertion
docs/android: Fix example meson cross file
docs/android: Improve instructions for replacing driver
ir3: Don’t use non-contiguous component masks for FS
ir3: Don’t pack FS inlocs
freedreno/a7xx: Register updates from kgsl
ir3: Add scan_clusters_macro to ir3_valid_flags()
ir3: Add scan_clusters.macro to is_subgroup_cond_mov_macro()
ir3/ra: Don’t demote movmsk instructions to non-shared
docs/android: Fix example meson cross file again
Constantine Shablia (6):
panvk: implement vkGetBufferDeviceAddress
panvk: advertise bufferDeviceAddress
vulkan/runtime: fix typo
mesa: fix typo
pan/bi: fix 1D array tex coord lowering
panfrost: report correct MAX_VARYINGS
Corentin Noël (37):
zink: Avoid the use of negative array offsets
zink: Use memmove when dealing with overlapping memory
glsl: Make sure to not cast ir_dereference_variable into ir_variable
glsl: Make sure that the variable is a ir_variable before unreferencing it
zink: Initialize zink_shader_object
zink: Initialize zink_bindless_descriptor to zero on creation
zink: Initialize pipe_query_result
zink: Do not shadow the variable ret
zink: Avoid variable shadowing everywhere
zink: Only call reapply_color_write if EXT_color_write_enable is available
ci_run_n_monitor: Allow the upstream format to not exist
zink: use symbolic values instead of 0
zink: do not use undefined stage mask if on missing KHR_synchronization2
glsl: Ensure that we are dealing with ir_variable and ir_rvalue
venus: sync protocol for VK_EXT_attachment_feedback_loop_layout
venus: enable VK_EXT_attachment_feedback_loop_layout
zink: Return early if the file descriptor could not have been duplicated/acquired
ci: Update virglrenderer and crosvm
zink: Make wrap_surface return a zink_ctx_surface directly
zink: Use an intermediary variable for create_surface
zink: Separate the template from the wrapped surface
zink: Return early if the source could not have been acquired
zink: Move zink_surface_destroy before zink_create_surface
zink: Make sure to not leak anything on surface creation failure
zink: Change zink_get_surface to return a zink_surface
zink: Add error logging on surface creation failure
st_pbo/compute: Use the correct structure type when allocating a specialized key
zink: Make sure to initialize all the fields of VkMemoryBarrier
dri/kopper: Assume a non-null drawable in flush_frontbuffer
zink: Removed unused function
zink: Removed unused num_texel_buffers member
zink: Removed unused push_valid member
zink: Remove ctx from zink_gfx_program
ci: Change propagated variables into an array
ci: Add VK_DRIVER_FILES passthrough from jobs to tests
ci: Allow to pass LIBGL_ALWAYS_SOFTWARE to the guest environment
ci: Add zink-venus-lvp job
Daniel Almeida (17):
nak/sm50 add support for suld
nak/sm50: add support for suatom
nak/sm50: add support for isberd
nak: sm50: add support for OpOut
nak: sm50: fadd: ensure src[0] is in a register
nak/sm50: legalize: display instruction on panic
nak/sm50: add support for brev
nak: sm50: fix some legalization issues
nak/sm50: add a memstream abstraction
nak/sm50: add an annotate debug flag
nak/sm50: support annotations through OpAnnotate
nak/sm50: sprinkle OpAnnotate in optimization passes
meson,ci: Add the paste crate
nil: Add the start of a Rust library
nil: Rewrite nil_format in rust
nil: Re-implement nil_image in Rust
nil: Rewrite the TIC code in Rust
Daniel Schürmann (52):
aco/insert_exec_mask: unify exec restore code after divergent control flow
aco/insert_exec_mask: replace phi for loop restore mask with explicit copies
aco/insert_exec_mask: only create loop phis for exec mask if necessary
aco: give spiller more room to assign spilled SGPRs to VGPRs
spirv: Fix SpvOpExpectKHR
vulkan: enable VK_KHR_shader_expect_assume
spirv: Update headers and grammar JSON
aco/insert_exec_mask: Fix unconditional demote at top-level control flow.
aco/insert_exec_mask: tiny refactor
aco: always terminate quads if they have been demoted entirely
aco/insert_exec_mask: Reduce latency when switching to WQM.
spirv: implement SPV_KHR_maximal_reconvergence
aco: enable WQM if demote is used with maximal reconvergence
radv: enable VK_KHR_shader_maximal_reconvergence
spirv: implement SPV_KHR_quad_control
radv: enable VK_KHR_shader_quad_control
radv: fix initialization of radv_shader_layout->use_dynamic_descriptors
aco: rematerialize constants in every basic block during optimizer
aco: reorder code and use namespaces in aco_interface.cpp
aco/util: small_vec few additions
aco: use small_vec as Block::edge_vec for predecessors and successors
aco/spill: refactor SSA repairing
aco/spill: don’t allocate extra spill_id for phi operands in add_coupling_code()
aco/spill: add spills_entry interferences only when necessary
aco/spill: refactor adding spilled vars into separate function add_to_spills()
aco/spill: keep live-out variables spilled at branch blocks
aco/spill: don’t prefer to spill phis at merge blocks
aco/spill: add interferences with variables spilled at loop headers
aco/spill: avoid re-spilling loop-carried variables in process_block()
aco/spill: avoid re-spilling loop-carried variables in add_coupling_code()
aco/spill: keep loop-carried variables spilled at loop headers
aco/spill: keep loop-carried variables spilled at merge blocks
aco/spill: select more loop-carried variables to be spilled
aco/spill: keep loop variables spilled during nested loops
aco: use instr_class::branch to identify SOPP branches
aco: remove SOPP_instruction::block member
aco: unify different SALU types into single struct SALU_instruction
aco/builder: use accessor functions instead of casting to subtypes
aco: change return type of create_instruction() to Instruction*
aco: defer instruction size from aco::Format in create_instruction()
aco: remove create_instruction() template parameter
aco: move create_instruction() to aco_ir.cpp
aco/spill: Fix assertion for nested loops
aco/spill: pass live_vars to spill_ctx
aco/spill: compute live-in variables from live-out
aco/spill: maintain valid live vars at any point
aco/spill: use live variables instead of next_use_distances in add_coupling_code()
aco/spill: gather information about average use distances
aco/spill: use average use distances in process_block()
aco/spill: use average use distances in init_live_in_vars() for merge blocks
aco/spill: use average use distances to spill loop variables
aco/ra: fix kill flags after renaming fixed Operands
Daniel Stone (50):
egl: Return BAD_CONFIG when robust access unsupported
st/dri: Use correct pipe_resource for GL texture image export
dri: Redeclare __DRI_IMAGE_FORMAT_* as PIPE_FORMAT_*
st/dri2: Remove __DRI_IMAGE_FORMAT conversion
st/dri2: Pass pipe_format to driCreateConfigs
st/dri2: Use u_format to get config format information
util: Add util_format_get_component_shift
st/dri: Remove format tables from driCreateConfigs
st/dri: Completely remove mesa_format from config setup
st/dri: Add transient HAS_ZS() helper
st/dri: Rework depth/stencil format selection
st/dri: Use pipe_format for Z/S modes
st/dri: Check format properties from format helpers
st/dri: Store pipe_format in gl_config
egl/wayland: Remove format-query fallback
st/dri: Reuse stored renderbuffer format
st/dri: Reuse stored texture format
dri/kopper: Move format -> FourCC translation up a level
dri/kopper: Add translations for sRGB formats
dri/kopper: Reorder format tables
dri/kopper: Flatten pipe_format/DRIImage/FourCC conversion
egl/wayland: Query image FourCC for linear copies
egl/wayland: s/DRI_IMAGE_FORMAT/pipe_format/g
egl/wayland: Add opaque-equivalent FourCCs
egl/wayland: Fix EGL_EXT_present_opaque
egl/wayland: Use pipe_format to look up configs
egl/wayland: Use FourCC to look up wl_buffer support
egl/wayland: Add helper to check server format support
egl/wayland: Use helper to look up visual
egl/wayland: Eliminate double loop for configs
egl/wayland: Simplify alternate-format fallback for configs
egl/wayland: Remove WL_SHM_* format listings
egl/wayland: Use pipe_format for format names
egl/wayland: Remove shift/size masks
egl: Fail display creation if no EGLConfigs created
egl/wayland: Remove check for EGLConfig presence
egl/gbm: Remove check for EGLConfig presence
egl/x11: Remove check for EGLConfig presence
egl/android: Remove check for EGLConfig presence
egl/{surfaceless,device}: Remove check for EGLConfig presence
egl: Automatically set EGLConfig ID
egl: Use pipe_format for pbuffer configs
gbm/dri: Query DRIImage for FourCC directly
gbm: Remove hardcoded color-channel data
egl/android: Remove hard-coded color-channel data
egl/x11: Compare config shifts/sizes locally
egl: Remove shifts/sizes from dri2_add_config argument
st/dri: Use pipe_format from config directly
egl/dri: Use pipe_format instead of DRI_IMAGE_FORMAT
egl/wayland: Remove EGL_WL_create_wayland_buffer_from_image
Danylo Piliaiev (37):
freedreno/replay: Delete all buffers after each submission
freedreno/replay: Correctly free iova on msm backend
freedreno/replay: Add WSL backend for Windows
ir3: Fix “print” meta instruction synchronization
ir3: Add fullsync and fullnop ir3 dbg options for over-syncing
freedreno/replay: Make meta “print” instruction take any number of regs
tu: Do not print anything on systems without Adreno GPU
tu/a7xx: Make A7XX_RB_UNKNOWN_8E06 value configurable per-gen
tu: Define and set to zero all SP_*_VGPR_CONFIG regs
ir3: Add ldg.k instruction
tu/a7xx: Correctly set A7XX_HLSQ_UNKNOWN_A9AE.SYSVAL_REGS_COUNT
tu/a7xx: Do not preload shaders, HW does it by default
tu: Use SS6_INDIRECT consts upload path for 3d blits
turnip,ir3/a750: Implement consts loading via preamble
tu: Use SS6_INDIRECT for VS params
turnip,ir3/a750: Implement inline uniforms via ldg.k
tu/a750: Consider vertex attr buff in gmem allocation
freedreno,tu: Move varying interp and varying repl modes to xml
freedreno/devices: Update magic regs for a7xx
tu: Exclude more a7xx regs from stomping
tu: Add workaround for D3D11 games accessing UBO out of bounds
tu/a7xx: Write even more magic regs to fix rendering issues on Android
tu: Do not emit zero-sized fs params
freedreno/a7xx: Fix base_align for non-UBWC depth-stencil
tu/autotuner: Use CP_EVENT_WRITE7 for submission fence
tu: Update prim restart state when we switch from/to indexed draw
tu: Fix dynamic state not always being emitted
meson: Correctly get sizeof_pointer with cross-compilers
freedreno/devices: Do not write to 8E79 on a750, KGSL has it protected
freedreno/replay: Use real queueid for submissions and waits
freedreno,tu/a7xx: Add PC_TESS_PARAM_SIZE and PC_TESS_FACTOR_SIZE
tu: Update RP state depending on pipeline in first RP draw
tu: Emit non-draw-state state at the first draw call
freedreno/devices: Add A740v3 from Quest 3
util/vma: Add function to get max continuous free size
freedreno/replay: Allocated maximum available size for cs overriding
ir3: Do not set clip/cull mask if no one writes clip/cull
Dario Mylonopoulos (1):
llvmpipe: fixed race condition in lp_rast_destroy that causes a crash on windows
Dave Airlie (46):
vulkan/video: drop unused function.
vulkan/video: rename some of the parameter tracking structs.
vulkan/video: start to wrap the video structs for deep copies.
vulkan/video: start deep copying the parameters structures
vulkan/video: constify the encoding apis.
radv/video: refactor sq start/end code to avoid decode hangs.
radv: don’t submit empty command buffers on encoder ring.
gallivm: fix coroutines with llvm 18
gallivm: passing fp16_split_fp64 to fp16 lowering.
nvk: allow 3d compressed textures
nvk: mem cannot be null in binding buffers/images.
zink: use sparse residency for buffers.
vulkan: update registry/includes to 1.3.277
vulkan/video: add AV1 decode support to common code
radv: fix correct padding on uvd
radv: init decoder ip block earlier.
radv/uvd: uvd kernel checks for full dpb allocation.
radv: don’t submit 0 length on UVD either.
egl: don’t bind zink under dri2/3
glx/dri3: handle zink fallback if loader picks it.
loader: handle picking zink for nouveau for certain GPUs.
nouveau/winsys: fix bda heap leak.
nvk: fix dri options leak.
egl/dri2: if zink is preferred from dri3 skip dri2 paths.
radv/video: fix filling out decode operations.
radv/video: use vcn ip version in more places.
radv: rename it_ptr to it_probs_ptr in advance of adding av1
radv/video: use proper struct sizes for decoder structs.
radv/video: add VK_KHR_video_decode_av1 support.
nvk: free leaked cmd_buffer descriptors state.
nvk: only unmap heap bos that were mapped
nvk: enable a mappable bar heap when rebar is disabled.
radv/video: fix h265 decode with unaligned w/h
mesa: reorder st context teardown
vulkan/video: copy the profile over for h264 encode.
radv/video: export unified queue header/tail functions.
radv: add direct cs emit for a dword.
radv: add encoder queue support pieces and encoder queries.
radv/video: add parameter patching calls.
radv/video: add initial support for encoding with h264.
radv/video: add h265 encode support
radv/video: enable video encoding behind perftest flag
radv/video: handle encode control parameters better.
radv/video: don’t advertise timestamp bits for decode/encode
egl/dri2: don’t bind dri2 for zink
radv/video/encode: fix quality params on v2 hw.
David (Ming Qiang) Wu (1):
frontends/va: make vlVaSyncSurface blocking
David Heidelberg (57):
ci/deqp: uprev deqp-runner for Linux too to 0.18.0
ci/lima: update expectations, failing tests are being skipped
ci: bump kernel to 6.6.12, modularize i915, add Transparent Huge Pages
ci: shorter kernel tag, included Vivante NPU patches
ci: disable Valve farm in Keywords
ci: bump libdrm to 2.4.120
ci/VK-GL-CTS: add patches to fix dEQP-VK.glsl.derivate crashes
ci: Valve farm (Keywords location) works again
meson: upgrade zlib wrap to 1.3.1
util: use crc32_z instead of crc32 and bump zlib dep to 1.2.9
ci: bump kernel to 6.6.16 + enable X2APIC
ci/freedreno: add fail found by new Piglit
ci/etnaviv: update expectations
ci: temporarily disable Collabora farm
ci: enable Collabora farm
ci: re-enable Collabora farm after maintenance
ci/intel: decompose anv-tgl-test so we can specify custom devices for TGL
ci/intel: add acer-cp514-2h-11{30,60}g7-volteer
ci/intel: move machine definition to the intel-tgl-skqp job
ci/intel: split asus-cx9400-volteer into acer-cp514-2h-11{30,60}g7-volteer
drm-shim: Avoid invalid file and time bits combination
intel/tools: avoid invalid time and file bits combination
ci/deqp: backport Implement support for the EGL_EXT_config_select_group extension GL-CTS patch
ci/freedreno: update expectations comment
ci/deqp: add EGL patch for correct suite (GLES, not GL)
nine: convert licenses block to SPDX
nine: fill missing licenses headers and copyrights
nine: drop useless and a bit too long line
ci: uprev kernel to 6.6.21
ci/freedreno: disable workarounds for Adreno 618, 630, and 660
ci/freedreno: mark fails resolved by “drm/msm/gem: Add metadata uapi”
ci: reduce irrelevant output to a simple list of libraries
util: move gen_zipped_file into generic util and rename to gen_zipped_xml_file
ci/r300: implement rules for d3d9 testing
ci/svga: add missed test and gl-rules include
r300: convert to SPDX license block and fix small typos
r300: add missing licence to the r300_public.h
r300: add missing copyright header
docs: we support EGL 1.5 for a long time
ci/amd: meld radv-traces into radv-raven-traces
ci/amd: drop old PIGLIT_REPLAY_DESCRIPTION_FILE surpassed by PIGLIT_TRACES_FILE
frontend/nine: fix typos
r600: update licensing to SPDX header
r600: add license header to r600_formats.h
r600: add license info to the r600_opcodes.h
r600: add license information to the sfn_shader_gs.h
r600: fix typos
ci: disable sona devices, all devices are offline
ci/intel: sona device_type is back online
ci: temporarily disable Android test builds
ci: disable Igalia farm
meson: implement split-debug
freedreno/ci: move the disabled jobs from include to the main file
ci/deqp: correct EGL_EXT_config_select_group detection
egl/x11: Move RGBA visuals in the second config selection group
winsys/i915: depends on intel_wa.h
subprojects: uprev perfetto to v45.0
David Rosca (31):
radeonsi/vcn: Fix H264 slice header when encoding I frames
frontends/va: Fix updating AV1 rate control parameters
radeonsi/vcn: Don’t reinitialize encode session on bitrate/fps change
frontends/va: Only set VP9 segmentation fields when segmentation is enabled
frontends/va: Separate QP for I/P/B frames
radeonsi/vcn: Use temporal_layer_index to select temporal layer
radeonsi/vcn: Implement separate QP for I/P/B frames
radv/video: Set maxActiveReferencePictures to 16 for H264/5
frontends/vdpau: Fix cdef strengths and lr_unit_shift in AV1 decode
frontends/vdpau: Support creating VDP_CHROMA_TYPE_420_16 surfaces
radv/video: Fix setting slice QP
radv/video: Set correct bitstream buffer size
radv/video: Set VBV buffer size and level
radv/video: Select temporal layer when encoding each frame
radv/video: Set maxSublayerCount to 4 for H265
radv/video: Avoid resetting rate control every frame
radv/video: Implement per picture type min/max QP
radv/video: Set correct bit depth and format for 10bit input
radv/video: Check encode profiles and bit depth in capabilities query
radv/video: Report maxBitrate in encode capabilities
radeonsi/vcn: Allocate session buffer in VRAM
radeonsi/vcn: Fix 10bit HEVC VPS general_profile_compatibility_flags
radeonsi/vcn: Only enable VBAQ with rate control mode
frontends/va: Fix AV1 slice_data_offset with multiple slice data buffers
Revert “radeonsi/vcn: AV1 skip the redundant bs resize”
frontends/va: Only increment slice offset after first slice parameters
radeonsi: Update buffer for other planes in si_alloc_resource
frontends/va: Store slice types for H264 decode
radeonsi/vcn: Ensure DPB has as many buffers as references
radeonsi/vcn: Allow duplicate buffers in DPB
radeonsi/vcn: Ensure at least one reference for H264 P/B frames
David Stern (1):
vulkan/wsi/x11: Explicitly discard errors from xcb_present_pixmap.
David Tobolik (1):
rusticl: implement cl_khr_suggested_local_work_size
Derek Foreman (10):
egl/wayland: Fix possible buffer leak
loader/wayland: Add named queue fallback
egl/wayland: Give names to our Wayland event queues
vulkan/wsi/wayland: Give names to our Wayland event queues
vulkan/wsi/wayland: Remove confusing comment
vulkan/wsi/wayland: Adjust presentation id locking
vulkan/wsi/wayland: Use wl_display_dispatch_queue_timeout
vulkan/wsi/wayland: More descriptive name for swapchain queue
vulkan/wsi/wayland: Fix use after free
vulkan/wsi/wayland: Remove unused get_min_image_count_for_mode_group
Dmitry Baryshkov (11):
freedreno/drm: don’t crash for unsupported devices
freedreno/regs: define the wide bus enable bit in DSI_VID_CFG0
freedreno/registers: fix generation dependencies
freedreno/registers: add missing copyright imports
freedreno/registers: inline mdp4_csc group
freedreno/registers: fix WB doffsets array in mdp5.xml
freedreno/registers: support processing display display headers
freedreno/registers: limit the rules schema
freedreno/registers: drop unsupported features from schema
freedreno/rnn: drop headergen2
freedreno/rnn: drop custom aprintf function
Dmitry Osipenko (2):
virtio/vdrm: Fix lockup in vdrm_host_sync()
iris: Use Mesa internal drm-uapi headers
Dylan Baker (8):
intel/vulkan: assume() that we don’t use “ISL_NUM_FORMATS”
intel/hasvk: assume() we don’t get ISL_NUM_FORMATS
meson: drop intel-cl deprecation of ‘false’
meson: rework intel-rt option to be a feature
meson: Allow building intel-clc for the host if it can be run
intel/brw: track last successful pass and leave the loop early
nvk: drop meson version check that is always true
nouveau: require cbindgen >= 0.25
Echo J (9):
nvk: Set ICD version to 1.3
nvk: Implement the VR-related display extensions
nak: Rip out a few dead_code statements
nvk: Add NVK to the Vulkan device name
nvk: Advertise VK_VALVE_mutable_descriptor_type
nvk: Implement calibrated timestamps
vulkan: Add implicit pipeline caching support
nvk: Use implicit pipeline cache
nvk: Don’t advertise residencyAlignedMipSize on MaxwellB+
Emma Anholt (2):
ci: Add full-run xfails missed in the 1.3.7.0 CTS update.
ci: Disable VK full runs that time out since 1.3.7.0 (hasvk, anv-tgl, a630)
Emmanuel Vadot (1):
util: Allow kcmp on FreeBSD
Eric Engestrom (282):
VERSION: bump to 24.1
docs: reset new_features.txt
docs: update calendar for 24.0.0-rc1
ci: make sure we evaluate the python-test rules first
docs: fix syntax highlighting on non-code text snippet
docs: fix syntax highlighting on shell commands
ci/deqp: ensure that in `default` builds, wayland + x11 + xcb are all built
zink+anv/ci: add known failures
ci: fix job dependency error in MRs for bin/ci/* scripts
nouveau/ci: don’t run nouveau (gl) tests on nvk changes
amd/ci: simplify deqp config
amd/ci: add flakes seen today
docs: update calendar for 24.0.0-rc2
zink+radv/ci: drop duplicates flakes lines
CODEOWNERS: add myself as as person of contact for CI changes
CODEOWNERS: remove myself as a person of contact for a few things
radv/ci: sort navi21 flakes
amd/ci: add flakes seen today
amd/ci: consider much more of dEQP-VK.query_pool.statistics_query.host_query_reset.* to be flaky
r300/ci: add flakes
ci/deqp: backport fix for zlib.net not allowing tarball download anymore
rpi3/ci: update piglit & deqp expectations
rpi4/ci: skip more of the dEQP-VK.ssbo.phys.layout.* tests that timeout occasionally
rpi3/ci: add flake seen today
rpi4/ci: add timeouts seen today
rpi5/ci: add flake seen today
docs: add release notes for 23.3.4
docs: update calendar for 23.3.4
docs: add sha256sum for 23.3.4
docs: update calendar for 24.0.0-rc3
ci_run_n_monitor: drop always-true condition
ci_run_n_monitor: allow passing multiple targets
ci/deqp: fix default target check when target is not specified
ci/deqp: simplify version log dump
ci/deqp: avoid storing the huge list of vk tests on android builds
ci/deqp: move editable part to the top of the file
ci/deqp: split vk and gl builds
ci/deqp: drop the implicit DEQP_TARGET; explicitly set `default` in VK builds
ci/deqp: only compile EGL tests in GL builds, not VK builds
ci/deqp: only compile the test binaries that are relevant to the build
ci/deqp: only keep the mustpass lists that are relevant to the build
ci: bump the image tags to rebuild all the deqp variants
Revert “bin/ci: Add GitLab basic token validation”
Reapply “bin/ci: Add GitLab basic token validation”
util: rename __check_suid() to __normal_user()
tree-wide: use __normal_user() everywhere instead of writing the check manually
zink+anv/ci: add a couple more flakes
util: simplify logic in __normal_user()
util: check for setgid() as well in __normal_user()
ci: always skip dEQP-VK.info.device_extensions
vk/util: fix ‘beta’ check for physical device features
vk/util: fix ‘beta’ check for physical device properties
ci: when specifying a driver remove all other ones
docs: update calendar for 24.0.0
docs: add release notes for 24.0.0
docs: add sha256sum for 24.0.0
docs/release-calendar: add planned 24.0.x bugfix releases
docs: add release notes for 23.3.5
docs: update calendar for 23.3.5
docs: add sha256sum for 23.3.5
v3d-rpi4-gl: reduce the parallelism from 10 to 8
docs/calendar: add 24.1 branchpoint and release schedule
ci: drop dash in image tags dates
ci: enforce maximum image tag length
ci: reduce maximum image tags length from 30 to 20
ci: explain purpose of the word after the date in image tags
panfrost: fix UB caused by shifting signed int too far
ci_run_n_monitor: avoid spamming a ton of “new status: created” for all the jobs at the beginning
ci: build panvk in debian-vulkan job
nouveau/tests: fix null dereference
ci: build nvk in debian-vulkan job
v3dv/ci: test the WSI on rpi4 and rpi5
radv: enable VK_EXT_headless_surface on all platforms except Windows
v3dv: enable VK_EXT_headless_surface on all platforms except Windows
tu: enable VK_EXT_headless_surface on all platforms except Windows
anv: enable VK_EXT_headless_surface on all platforms except Windows
hasvk: enable VK_EXT_headless_surface on all platforms except Windows
dzn: enable VK_EXT_headless_surface on all platforms except Windows
nvk: enable VK_EXT_headless_surface on all platforms except Windows
panvk: enable VK_EXT_headless_surface on all platforms except Windows
vn: enable VK_EXT_headless_surface on all platforms except Windows
lvp: enable VK_EXT_headless_surface on all platforms except Windows
pvr: enable VK_EXT_headless_surface on all platforms except Windows
ci_run_n_monitor: warn user if they forgot to push the branch
ci_run_n_monitor: add some types for gitlab objects
ci_run_n_monitor: update job when it goes through enable_job()
ci_run_n_monitor: add method to get a pipeline job by its id
ci_run_n_monitor: track new job when retrying a job
ci_run_n_monitor: refresh job state when starting it
gitlab_gql: print error returned by server in –print-merged-yaml
ci_run_n_monitor: implicitly include `parallel:` jobs
ci_run_n_monitor: print the target regex before adding the X/N bit
docs: add release notes for 24.0.1
docs: add sha256sum for 24.0.1
docs: add release notes for 23.3.6
docs: update calendar for 23.3.6
docs: add sha256sum for 23.3.6
docs: update calendar for 24.0.1
ci_run_n_monitor: explain why/when there might be no tracked remote
ci_run_n_monitor: allow detached heads as well
docs: add release notes for 24.0.2
docs: add sha256sum for 24.0.2
docs: update calendar for 24.0.2
ci_run_n_monitor: fix handling of optional jobs again
ci_run_n_monitor: read job logs as utf-8
vk/util: trivial cleanups in vk_icd_gen.py
vk/util: print a nice error in vk_icd_gen.py when VK_HEADER_VERSION is not defined
ci/android: use a specific version of android-cuttlefish
ci: document which image tags to bump when touching build-mold.sh
ci: uprev mold to the latest release
ci/image-tags: move KERNEL_ROOTFS_TAG to group the test images together
ci/deqp: only apply the android patches to the android build
ci/deqp: build deqp-egl using mold as well
ci/deqp: make deql-egl for android less of a special case
ci/deqp: control the GL release independently of VK
ci/deqp: control the GLES release independently of GL
r300/ci: group tex-miplevel-selection flakes together
r300/ci: add another tex-miplevel-selection flake
iris/ci: add pbuffer flakes for amly, same as apl and glk
panfrost/ci: skip dEQP-GLES31.functional.copy_image.non_compressed.* on t760 as they hang
rpi3/ci: update expectations for vc4-rpi3-gl-piglit-full:arm32 2/4
freedreno/ci: add another a618 flake
zink+anv: update expectations
r300/ci: add flakes
radeonsi/ci: add vangogh piglit flake
zink+radv: update navi31 expectations (one test fixed)
softpipe: update expectations
ci/deqp: drop zlib url patch
ci/deqp: split vk/gl/gles patches
ci/deqp-runner: inline never-used DEQP_VARIANT variable
ci/deqp: use the proper gl/gles releases for deqp-gl*, deqp-gles*, deqp-egl
ci/venus-lavapipe: drop unused DEQP_VER that’s being overwritten by DEQP_SUITE anyway
ci/lavapipe: fold `DEQP_VER: vk` and drop .deqp-test-vk
docs: delay 24.1 branchpoint by 2 weeks
vk/update-aliases: drop VK_ERROR_ prefix substitution
ci/deqp-runner: do a release build instead of debug
ci/deqp-runner: set android rust target in the caller (debian/x86_64_test-android.sh)
ci/deqp-runner: bring “install from crate” & “install from git” to feature parity
ci/deqp-runner: update repo url
ci/deqp-runner: fix list of image tags to update
ci/image-tags: re-generate all the images building deqp-runner
docs: add release notes for 24.0.3
docs: add sha256sum for 24.0.3
ci/deqp: document which build produces which binary
ci: include all the src/**/gitlab-ci.yml files
nouveau: add missing vl lib
nouveau/ci: fix yaml indentation
nouveau/ci: only trigger jobs for relevant changes
Revert “nouveau: add missing vl lib”
ci/deqp: backport fix for dEQP-VK.wsi.direct_drm.* bug
vc4/ci: add flake
radeonsi/ci: udpate expected failures
r300: mark new fails
v3dv/ci: update expectations
v3d/ci: mark spec@ext_framebuffer_blit@fbo-blit-check-limits as fixed
vc4/ci: add another `spec@!opengl 1.1@depthstencil-default_fb-drawpixels` flake
vc4/ci: add another `spec@arb_vertex_buffer_object@vbo-subdata-many draw` flake
v3dv/ci: mark the `dEQP-VK.wsi.*.maintenance1.deferred_alloc.*` flakes seen so far as happening on all platforms
v3dv/ci: add other flakes seen during nightly run
ci: fix shader-db job existence condition
v3dv/ci: assume dEQP-VK.wsi.wayland.swapchain.simulate_oom.* have been fixed
v3dv/ci: add more flakes
v3dv/ci: assume list of dEQP-VK.wsi.*.maintenance1.present_modes.* flakes is the same between xcb & xlib and between rpi4 & rpi5
ci: enable MESA_VK_ABORT_ON_DEVICE_LOSS globally
ci/deqp-runner: split gl & gles groups to use the correct binary
ci/deqp-runner: print deqp-gles version log as well
ci: deduplicate converting the current job runtime into %M:%S
ci: convert the job start date into a timestamp only once
ci: simplify unnecessarily complex printf
radv/ci: sort tahiti flakes
radv/ci: add a bunch of flakes seen recently
v3dv/ci: track regression
rpi/ci: add flakes
radv/ci: add more flakes
v3dv/ci: add more flakes
docs: update calendar for 24.0.3
docs: update calendar for 24.0.4
docs: add release notes for 24.0.4
docs: add sha256sum for 24.0.4
v3dv/ci: another batch of flakes
radv/ci: another batch of flakes
radv/ci: another batch of flakes
radv/ci: dEQP-VK.spirv_assembly.type.vec4.i8.mod_geom Fail -> Crash on tahiti
ci: don’t run rustfmt on every core change
ci_run_n_monitor: explain how to pass multiple targets without having to use regexes
rpi/ci: another batch of flakes
docs: mesa also implements gles 3.0+
docs/egl: various wording improvements
ci: take kws farm offline
ci: restore kws farm
radv/ci: simplify tahiti flakes list
ci: fold .test-check into its only user, python-test
ci: run python-test when editing the CI itself
ci: run python-test automatically only in merge pipelines
docs/macos: drop reference to former github mirror
docs/nir: vec4 reference
docs/envvars: fix reference
docs/isl: fix references to ISL_AUX_USAGE_CCS_*
docs/isl: stop trying to link to classic drivers code
docs/isl: VK_FORMAT_xxx_PACKEDn is not a real format, don’t try to link to it
docs/isl: fix enum references
docs: fix inline c identifier reference -> inline code
isl: fix inline c identifier reference -> inline code
nir: add missing stdint include
docs/anv: fix envvar documentation
docs/nvk: fix envvar documentation
ci: mark vmware farm as offline
ci: add missing rule to disable vmware farm
ci: raise the log level threshold of spirv logs
docs/envvars: document some vulkan loader env vars
docs: replace references to the deprecated VK_ICD_FILENAMES with the new VK_DRIVER_FILES
docs: replace references to the deprecated VK_INSTANCE_LAYERS with the new VK_LOADER_LAYERS_ENABLE
docs/zink: format the envvar value as code instead of plain text
meson: add VK_DRIVER_FILES to devenv, alongside the old VK_ICD_FILENAMES
ci: drop unused VK_ICD_FILENAMES passthrough from jobs to tests
ci: use the new VK_DRIVER_FILES env var
ci/deqp: backport fix for dEQP-VK.pipeline.*.render_to_image.*.huge.*
ci: fix nightly build
ci: fix nightly build (v2)
ci/llvmpipe: make sure manual jobs don’t auto-retry
ci/llvmpipe: fix out of date fails list
ci/lavapipe: fix out of date fails list
ci/lavapipe: skip test that sometimes times out
ci: add nightly full run of llvmpipe
ci: add nightly full run of lavapipe
gallium/dri: reuse existing meson variables
meson: regroup glvnd lines to get an easier-to-review diff in the next commit
meson: turn `glvnd` option into a feature
ci: explicitly disable glvnd to avoid regression when making it auto
meson: auto-enable glvnd when libglvnd is installed
mr-label-maker: include */gitlab-ci-inc.yml in GitLab CI changes
mr-label-maker: be explicit about the various CI files
docs: add release notes for 24.0.5
docs: update calendar for 24.0.5
docs: add sha256sum for 24.0.5
ci: delete mistaken duplicate llvmpipe-{fails,skips}.txt
etnaviv: avoid re-defining prog_python
egl: drop dead dri2_dpy param in dri2_wl_visual_idx_from_config()
lavapipe: add 1 new failure and 1 new timeout since CTS uprev to 1.3.8.0
vk/overlay-layer: drop unused imports
vk/overlay-layer: fix None checks
vk/overlay-layer: simplify print and make it more readable
docs/rusticl: add an intro explaining what Rusticl is
wsi/x11: drop unused param in x11_present_to_x11_sw()
radv: initialize a couple of variables
util: simplify loop logic in util_format_get_first_non_void_channel()
util/futex: replace double-cast check with a simple sign check
docs/ci: explain how gitlab considers “changes” when pushing on a fork branch
rpi5/ci: sort flakes
rpi5/ci: add flakes from last night’s run
rpi4/ci: sort flakes
rpi4/ci: add new flakes from last night’s run
radeonsi/ci: update vangogh expectations after piglit uprev
llvmpipe/ci: update expectations after piglit uprev
VERSION: bump for 24.1.0-rc1
.pick_status.json: Update to 4660ee1deaace6457bf5fbf3fc8810e4a2453cb5
ci: fix container rules on release branches and tags
.pick_status.json: Update to 84632dce93f44e8d88cda47648cfd4cc0958918f
.pick_status.json: Update to 8248cc0bf45d0d7558cc3d77a63dcd078a96aa66
ci: pass MESA_VK_ABORT_ON_DEVICE_LOSS through to the DUT
.pick_status.json: Update to 86281ef15fca378ef48bcb072a762168e537820d
.pick_status.json: Update to 47f6e24ad5dfcb59dd1511800aee8c56b4f8fee4
meson: simplify `-gsplit-dwarf` compiler argument check
meson: move tsan-blacklist.txt to build-support with the other build support files
VERSION: bump for 24.1.0-rc2
.pick_status.json: Update to 603982ea802b3846e91a943b413a7baf430e875d
.pick_status.json: Update to 569c2fcf952a3ec13ddf77c0058e769bf68f3aaf
.pick_status.json: Update to 9666756f603f0285d8a93ef93db1c7ec702b671f
.pick_status.json: Update to b8e79d2769b4a4aed7e2103cf0405acc5bdadb86
VERSION: bump for 24.1.0-rc3
.pick_status.json: Update to 18c53157318d6c8e572062f6bb768dfb621a55fd
.pick_status.json: Update to 406dda70e7c9baa59c975eb64025e7c3b210c3bc
.pick_status.json: Update to 5502ecd7716045e76f13f007a4aa5f5653c80ecd
util/format: add missing null check in util_format_is_srgb()
.pick_status.json: Update to d516721cd0cb16d0b601c42c01de0fdcc4ae887b
.pick_status.json: Update to aa9244c8f6bfa3fb33cf233104b00fc44fc9459f
.pick_status.json: Mark a45f1990860db3a8da6d7251bb627a314dfb8423 as denominated
VERSION: bump for 24.1.0-rc4
.pick_status.json: Update to b2282e3a571f18b48b8b717ec32da1d0ed93f1b5
.pick_status.json: Update to 471ac97a4af751226bc51076130deae252bb481e
.pick_status.json: Update to 2487a875527f636565a7b39036690fbf7c5d46db
.pick_status.json: Update to 3584fc64828ad2ad4d486572ec915aab8321aadd
Eric R. Smith (13):
panfrost: fix panfrost drm-shim
panfrost: add lowering pass for multisampled images
panfrost: support multi-sampled image load/store
panfrost: protect alpha calculation from accessing non-existent component
panfrost: make drm-shim work again for panfrost
panfrost: make sure blends always have 4 components
panfrost: mark indirect compute buffer as read
gallium: handle copy_image of depth textures
panfrost: fix polygon offset calculation for floating point Z
panfrost: fix a GPU/CPU synchronization problem
panfrost: mark separate_stencil as valid when surface is valid
panfrost: fix an incorrect stencil clear optimization
panfrost: add a barrier when launching xfb jobs in CSF
Erico Nunes (2):
Revert “ci: lima farm is down”
ci: enable shader-db on lima
Erik Faye-Lund (32):
panfrost: add support for forcing sample-counts
panfrost: pass reduced primitive type instead of points
panfrost: add line_smooth shader-key and lowering
panfrost: clean up active_prim update
panfrost: implement line-smoothing
mesa/main: add support for EXT_texture_storage
mesa: fix error-handling for ETC2/RGTC textures
glapi: move EXT_texture_storage to the right position
targets/va: override LIBVA_DRIVERS_PATH in devenv
mesa/main: fix _mesa_base_tex_format for BGRA
mesa/main: mark GL_BGRA as color-renderable
mesa/main: mark GL_BGRA8_EXT as color-renderable
mesa/main: work around chrome/firefox bug
mesa/main: allow GL_BGRA for FBOs
panvk: do not handle illegal null
glsl: Make error_value a real ir_rvalue type
panfrost: give afbc-packing its own flag
panfrost: add driconf infrastructure
panfrost: add pan_force_afbc_packing driconf
mesa: prefer read-format of RG for snorm
gallium: remove always-false parameter
panvk: use integers instead of strings
panfrost: silence compiler warning
panfrost: add tiler-heap driconfs
panvk: wire up version-overriding
panfrost: implement a driver-specific max-miplevel
panfrost: use perf_debug_ctx instead of perf_debug
panfrost: perf_debug_ctx -> perf_debug
panfrost: use util_debug_message for perf_debug
panfrost: do not deref potentially null pointer
panfrost: correct first-tracking for signature
panvk: avoid dereferencing a null-pointer
Erik Kurzinger (2):
wsi/wayland: don’t use explicit sync with sw
wsi/x11: support explicit sync
Faith Ekstrand (284):
nvk: Add an explicit mapping from shader stages to cbuf bindings
nvk: Return an nvk_cbuf_map from nvk_lower_nir()
nvk: Use s instead of set_idx in CmdBindDescriptorSets
nvk: Rework descriptor set binding
nvk: Make dynamic cbuf indices relative to the descriptor set
nvk: Handle missing descriptor sets in nvk_nir_lower_descriptors
nvk: Invalidate state after secondary command buffers
nvk: Set a minimum of one patch control point
nak: Disallow gl_FragData and set MRT correctly
nak: Add explicit padding to nak_shader_info
nvk: Emit SET_ANTI_ALIAS at draw time when no render targets are bound
nvk: Move SET_HYBRID_ANTI_ALIAS_CONTROL to draw time
nvk: Advertise variableMultisampleRate and EDS3RasterizationSamples
nvk: Add a couple more features to features.txt
nak: Stop passing –explicit-padding to bindgen
nak: Implement nir_op_pack_half_2x16_rtz_split
nak: Implement nir_op_ufind_msb_rev
nak: Rename OpBrev to OpBRev
nak: Implement nir_op_bfm
nouveau/mme/fermi: Stop truncating iadd immediates
nouveau/mme: Stop using isaspec
nvk: Set framebufferIntegerColorSampleCounts
nvk: Unref shaders on pipeline free
nvk: Add a #define for max shared memory size
nvk: Properly configure the min/max shared mem size
nvk: Implement VK_KHR_zero_initialize_workgroup_memory
nir,spirv: Add support for SPV_NV_shader_sm_builtins
nak: Add support for SPV_NV_shader_sm_builtins
nvk: Advertise VK_NV_shader_sm_builtins
nvk/draw: Map cbuf slots to shaders, not cbuf_maps
nak: Refactor shader upload math
nvk: Wire up nir_opt_large_constants
nak: Enable NIR fuse_ffmaN
nak: Legalize OpBMsk
nvk: Don’t exnore ExternalImageFormatInfo
nvk: Set maxInlineUniformTotalSize
nak: Fix TCS output reads
anv: Add helpers for getting the surface state from an image view
anv: Advertise VK_EXT_attachment_feedback_loop_layout
nak: Choose S2R vs CS2R based on sysval index
nak: Add a source barrier intrinsic
nak: Loop to ensure we get accurate shader clocks
nvk: Stop requiring dedicated allocations
nvk: Advertise Vulkan 1.3
nvk: Do a second submit to check for errors in the sync case
nvk: Whitespace fixes
nvk: Disable all cbufs in nvk_queue_init_context_draw_state()
nvk: Call lower_compute_system_values after zer_initialize_workgroup_memory
nak/nir: Stop lowering load_local_invocation_index
nil: Set the level offset to 0 in nil_image_for_level
nvk: Fix whitespace in nvk_image.c
nouveau/winsys: Re-order channel creation
nouveau/winsys: Allow only allocating a subset of engines
nvk/queue: Pull DRM specfc stuff into nvk_queue_drm.c
nvk/queue: Refactor the push builder a bit
nvk: Move the nouveau_ws_context to nvk_queue
nvk: Add an array of queue families to nvk_physical_device
nvk/queue: Rework context state init
nvk/queue: Only initialize the necessary engines
nvk: Use VM_BIND for contiguous heaps instead of copying
nvk: Only map heaps that explicitly request maps
nvk: Add an upload queue
nvk: Add an upload queue to nvk_device
nvk: Use the upload queue for shader uploads
nvk: Don’t set CONSTANT_BUFFER_SELECTOR with a zero size
nvk/heap: Use nvk_heap_bo::addr instead of bo->offset
nvk/heap: Rework over-allocation
nvk: Convert shader addresses to offsets in nvk_shader.c
vulkan: Update XML and headers to 1.3.278
nvk: Use nouveau_ws_bo_new_mapped() for descriptors
nouveau/winsys: Add a fixed_addr to nouveau_ws_bo_map
nvk: Implement VK_EXT_map_memory_placed
nvk: Invalidate the texture cache before MSAA resolves
nvk: Don’t use WAIT_AVAILABLE in nvk_upload_queue_sync
drm-uapi: Sync nouveau_drm.h
nouveau/winsys: Add a vram_used query
nvk: Add a nouveau_ws_device to nvk_physical_device
nvk: Add a hand-rolled nvk_memory_heap struct
nvk: Use 3/4 of total system memory for the VRAM heap
nvk: Add an available query to nvk_memory_heap
nvk: implement EXT_memory_budget
nouveau/winsys: Getch the BAR size from the kernel
nvk/heap: Upload shaders on the CPU when we have a map
nvk: Upload shaders on the CPU when we have ReBAR
nvk: Expose a host-visible VRAM type when we have REBAR
nvk: Only expose VK_KHR_present_id/wait when we have WSI
nvk: Advertise VK+KHR_incremental_present
nil: Add PIPE_FORMAT_R5G5B5A1_UNORM
nak: Add writes_point_size to nak_shader_info
nvk: Handle missing gl_PointSize in the last geometry stage
nvk/copy: Handle VK_REMAINING_ARRAY_LAYERS
vulkan/meta: Handle VK_REMAINING_ARRAY_LAYERS in blit and resolve
nvk: Use VkPipelineCreateFlags2 flag names
nvk: Advertise VK_KHR_maintenance5
vulkan: Add a vk_get_subgroup_size() helper
vulkan: Move the descriptor set limit to vk_limits.h
vulkan: Add runtime code for VK_EXT_shader_object
vulkan: Add a vk_render_pass_state_has_attachment_info() helper
vulkan: Rework vk_render_pass_state::attachments
vulkan: Add a new dynamic state for render pass attachments
vulkan: Add a vk_pipeline base struct
vulkan: Add push constant ranges to vk_pipeline_layout
vulkan: Add a BLAKE3 hash to vk_descriptor_set_layout
vulkan: Add generic graphics and compute VkPipeline implementations
nvk: Populate vk_descriptor_set_layout::blake3
nvk/shader: Refactor some helpers
nvk: Move populate_fs_key to nvk_shader.c
nvk: Pass an array of descriptor sets to nvk_lower_nir
nvk: Move nir_lower_patch_vertices to nvk_lower_nir()
nvk: Use vk_render_pass_state::attachments for write masks
nvk: Switch to shader objects
nvk: Advertise VK_KHR_graphics_pipeline_library
nvk: Advertise VK_EXT_shader_object
nak: Implement nir_op_iadd3 on SM70+
nir: Add an imad opcode
nak: Move NAK_FS_OUT_COLOR next to the enum
nak: Add support for imad on Volta+ and enable it in simple cases
nvk: Advertise a CTS version of 1.3.7.3
nvk: Drop the non-conformant warning on Turing-Ada
nvk: Don’t print the NVK_I_WANT_A_BROKEN_VULKAN_DRIVER warning in release builds
meson: Rename nouveau-experimental to nouveau and build by default on x86
vulkan/pipeline: Whitespace fix
vulkan/pipeline: Handle fully compiled library shaders properly
nvk: Advertise VK_KHR_pipeline_library
docs/nvk: Update the conformance status section
docs/nvk: Update the NVK_DEBUG docs
docs/nvk: Document NAK_DEBUG
nil: Enable A8_UNORM for storage buffers
vulkan/pipeline: Always init pipeline cache objects
nak: Fix printing of OpIsberd
nak/sm50: Fix encoding of immediates in OpFFma
nak/sm50: Use OpBfe instead of OpBRev for nir_op_find_lsb
nak: Support F2I for 8-bit integers on SM50
nvk: Return os_page_size for minMemoryMapAlignment
nouveau: Import g_nv_name_released.h from NVIDIA OGK
nvk: Report official GPU names from NVIDIA when we have them
nvk: Use row_stride_B instead of width for render and copies
nil: Rework tiling calculations
nil: Add a concept of width to tile sizes
nil: Add a concept of sliced 3D image views
nvk: Use “real” 3D image views
nvk/queue: Add a push_bind helper
nvk: Refactor opaque image binds
nvk/queue: Add support for non-opaque sparse binds
nak: Rename resident to fault
nak: Plumb is_sparse through from NIR for texture ops
nak/nir: Add sparse support to shrink_image_load()
nak: Wire up sparse residency for texture ops
nil: Fix a typo in a comment
nvk: Document the register name for the helper load workaround
nvk: Always wait for the FALCON in set_priv_reg
nvk: Disable the Out Of Range Address exception
nvk: Drop a bunch of dev->pdev and just use pdev
nvk: Add and use more cmd_buffer_*_cls helpers
nvk: Replace more dev->pdev with nvk_device_physical()
nvk: Drop nvk_device::pdev
zink: Remove interpolateAtSample() when not multi-sampling
nil: Move Z slice offset calculations to a helper
nvk: Add a nil_image helper variable in BeginRendering
nvk: Manually offset array and Z slices in BeginRendering
nil: Advertise support for PIPE_FORMAT_R5G6B5_UNORM
nil: Whitespace fix
nil: Add support for larger textures on Pascal+
nil: Add a helper to view a MSAA image as samples
nil: Expose nil_pix_extent_sa()
nvk: Use a HW generation names instead of chipsets
nvk: Stop pretending to handle Intel image intrinsics
nvk: Use different descriptor layouts for storage vs. sampled images
nvk: Implement shaderStorageImageMultisample
zink: Rework sparse texture lowering
nvk: Ignore rasterizationSamples when handling sampleShadingEnable
nvk: Always set SET_ATTRIBUTE_POINT_SIZE
Revert “nvk: Enable VK_KHR_shader_subgroup_uniform_control_flow”
nvk: Move the mutableDescriptorType enable
nir: Take a nir_def in nir_goto_if()
nir/print: Inline print_ssa_use()
nir/builder: Correctly handle decl_reg or undef as the first instruction
nir: Improve the comment for nir_block::imm_dom
nir: Add a sort_unstructured_blocks() helper
nir: Validate that unstructured blocks are in reverse PDFS order
nir/lower_reg: Remove dead reg_decl intrinsics
nir/lower_reg: Support unstructured control-flow
nir/repair_ssa: Support unstructured control-flow
nir/gather_types: Support unstructured control-flow
nir: Mark divergent regs in phis_to_regs_block()
nir: Add a lower_terminate_to_demote pass
nak: Add a copy_fs_outputs_nv intrinsic
nak: Move barrier removal into its own pass
nak: Add a condition to bar_break_nv
nak/nir: Add a control-flow lowering pass
nak: Add more NIR wrappers for walking the NIR CFG
nak: Add NIR helpers for jump instructions
nak: Add helpers for emitting jumps
nak: Handle unstructured NIR
nak: Use the new lowering pass on SM70+
nak: Remove the old barriers pass
nak/nir: Use nir_lower_terminate_to_demote()
nvk: Advertise VK_KHR_shader_maximal_reconvergence
nvk: Advertise VK_KHR_shader_subgroup_uniform_control_flow
nak/nir: Emit nir_intrinsic_ald_nv directly for system values
nak/nir: Rename load_interpolated_input
nak/nir: Add a load_fs_input hepler for flat inputs
nak/nir: Emit nir_intrinsic_ipa_nv directly for FS system values
nak/nir: Use nir_io_semantics for varyings and attributes
nak: Break lower_fs_inputs into its own file
nak/nir: Clean up lower_fs_inputs a bit
nak: Call nir_lower_io_to_temporaries for FS outputs
nak/nir: Use nir_io_semantics for FS outputs
nak: Drop lower_io_arrays_to_elements_no_indirects for FS outputs
nak: Simplify over-all I/O lowering
nak: Don’t write undefined FS outputs
nak: Plumb through LDC modes
nak: Implement load_ubo with an indirect cbuf index
nvk: Support VkBindMemoryStatusKHR
nvk: Advertise VK_KHR_maintenance6
nir: Delete the rest of the CF list when adding a halt
nak: Don’t do a scope break cascade for nir_jump_halt
nil: Add a CSV version of the format table
nil: Re-organize the format table
nil: Switch to using the CSV generated table
nil: Drop bogus color formats from non-renderable luminance/alpha formats
nil: Remove 2-bit SNORM from the format table
nil: Drop unneeded types from formats
nvk: Use the page-aligned BO size for the descriptor pool
nvk: Use a VMA heap for descriptor memory
nvk: Use a linked list for descriptor sets in a pool
nvk: Add a _pad field to nvk_cbuf
nvk: Delete dead descriptor code
nvk: Add a _pad field to nvk_fs_key
nvk: Add a bunch of -Wpadded errors
vulkan: Add a bunch of -Wpadded errors
nouveau: Move .rustfmt.toml from NAK to src/nouveau/
nouveau: Use hyphenated arguments to class_parser.py
nouveau/headers: Add initial Rust bindings
nouveau/headers: Add Rust bindings for texture headers
ci: Add cbindgen to the build images
nil: Move nil_tic_format to nil_format_table.h
nil: Move to a single header file
nvk: Stop using nvk_extent4d short names
nil: Rename nil_tiling::gob_height_8 to gob_height_is_8
nak/bitview: Add a SetField<f32> implementation
nil: Delete unused USAGE bits
nil: Make the Rust library the primary build target
nil: Add Extent/Offset4D::new() helpers
nil: Drop the nil_extent/offset4d() helpers
nil: Take a format in el_to_B()
nil: Enforce units via Rust types
nil: cbindgen is required
nvk: Improve the unsupported handle type error
nvk: Restrict shaderFloat16 to Ampere+ for now
nouveau/headers: Move the classes into a submodule for Rust
nouveau/headers: Generate Rust for QMDs
nak: Add helpers for filling QMDs
nvk: QMDs are 64 dwords
nvk: Use the NAK helpers to fill QMDs
nouveau: Import the hwref headers from Nvidia OGK
nouveau/headers: Add the MMU headers to the Rust crate
nil: Use the enums from the hwref headers for PTE kinds
nil: add s8 pte kind
nil: Be more speicific about Maxwell in the format table
nil: Advertise S8_UINT on MaxwellB+
nvk: Hash ycbcr conversions in the descriptor set layout hash
nvk: Re-emit sample locations when rasterization samples changes
nvk/meta: Restore set_sizes[0]
nvk/upload_queue: Only upload one line of data
vulkan/wsi: Bind memory planes, not YCbCr planes.
nvk: Improve the GetMemoryFdKHR error
nouveau/winsys: Take a reference to BOs found in the cache
nouveau/winsys: Make BO_LOCAL and BO_GART separate flags
nvk: Allow GART for dma-bufs
nil: Use the right PTE kind for Z32 pre-Turing
nvk: Set color/Z compression based on nil_image::compressed
nil: Default to NV_MMU_PTE_KIND_GENERIC_MEMORY on Turing+
nvk: Allow VK_IMAGE_ASPECT_MEMORY_PLANE_0_BIT
drm-uapi: Sync nouveau_drm.h
nouveau/winsys: Add back nouveau_ws_bo_new_tiled()
nvk: Support image creation with modifiers
nvk: Set tile mode and PTE kind on dedicated dma-buf BOs
nvk: Implement DRM format modifier queries
nvk: Advertise VK_EXT_queue_family_foreign
nvk: Advertise VK_EXT_image_drm_format_modifier
nvk/wsi: Advertise modifier support
zink: Set workarounds.can_do_invalid_linear_modifier for NVK
nvk/meta: Save and restore set_dynamic_buffer_start
Felix DeGrood (6):
driconf: Change vendorid on Palworld for Intel
driconf: Fake vendorid for RDR2
mesa-overlay: defer listening to socket until first frame
driconf: add SotTR DX12 to Intel XeSS workaround
iris: Increase target batch size to 128 KB
intel/ds: add pipe control reasons to perfetto flushes
Francisco Jerez (36):
intel/fs: Use full 32-bit sample masks when immediate.
intel/eu/validate: SEND instructions don’t have immediate encodings on Gen12+.
intel/eu/gfx12.5+: Don’t fail validation with ARF register restriction error for indirect addressing.
intel/compiler/xe2: Add Xe2 bounds to FF() macro.
intel/compiler/xe2: Implement codegen of general instruction controls.
intel/compiler/xe2: Implement codegen of 2-source instruction operands.
intel/compiler/xe2: Implement codegen of indirect immediates.
intel/compiler/xe2: Implement codegen of three-source instructions.
intel/compiler: Add assume() checks to brw_compact_inst_(set_)bits().
intel/compiler/xe2: Implement codegen of compact instructions.
intel/compiler/xe2: Implement instruction compaction.
intel/compiler/xe2: Fix for NibCtrl field removal.
intel/compiler/xe2: Fix for the removal of most predication modes.
intel/compiler/xe2: Add extra flag registers.
intel/compiler/xe2: Fix for the removal of AccWrCtrl.
intel/ir/xe2+: Add support for 32 SBID tokens to performance model.
intel/fs/xe2+: Disable bank conflict mitigation pass for now.
intel/eu/xe2+: Translate brw_reg fields in REG_SIZE units to physical 512b GRF units during codegen.
intel/fs: Set the default execution group to 0 when not representable by the platform.
intel/fs: Emit QUAD_SWIZZLE instructions with WE_all for derivative lowering.
intel/fs/xe2+: Allow SIMD16 MULH instructions.
intel/brw/xe2: Render target reads have been removed from the hardware.
intel/brw/xe2+: Update encoding of FB write descriptor message control.
intel/brw/xe2+: Update encoding of FB write extended descriptor.
intel/brw/xe2+: Double allowed SIMD width of FB write SEND messages.
intel/brw/xe2+: Allow FS stencil output in SIMD16 dispatch mode.
intel/brw/xe2+: Allow dual-source blending in SIMD16 mode.
intel/blorp/xe2+: Don’t use replicated-data clears.
intel/brw/gfx12: Setup PS thread payload registers required for ALU-based pixel interpolation.
intel/brw/xe2+: Setup PS thread payload registers required for ALU-based pixel interpolation.
iris/xe2+: Disable coherent framebuffer fetch.
intel/brw/xehp+: Replace lsc_msg_desc_dest_len()/lsc_msg_desc_src0_len() with helpers to do the computation.
intel/eu/xehp+: Don’t initialize mlen and rlen descriptor fields from lsc_msg_desc*().
intel/brw/xehp+: Drop redundant arguments of lsc_msg_desc*().
intel/fs/gfx20+: Implement sub-dword integer regioning restrictions.
intel/fs/gfx20+: Handle subdword integer regioning restrictions in copy propagation.
Frank Binns (3):
CODEOWNERS: update Imagination maintainers
pvr: fix up some includes
pvr: split out device info into per GPU headers
Friedrich Vock (27):
radv/rt: Add workaround to make leaves always active
radv: Fix shader replay allocation condition
nir: Make is_trivial_deref_cast public
nir: Handle casts in nir_opt_copy_prop_vars
radv/amdgpu: Fix build on BSD
winsys/amdgpu: Fix build on BSD
util: Provide a secure_getenv fallback for platforms without it
vulkan: Use secure_getenv for trigger files
aux/trace: Guard triggers behind __normal_user
vtn: Use secure_getenv for shader dumping
mesa/main: Use secure_getenv for shader dumping
radv: Use secure_getenv in radv_builtin_cache_path
radv: Use secure_getenv for RADV_THREAD_TRACE_TRIGGER
util/disk_cache: Use secure_getenv to determine cache directories
radv/rt: Write inactive node data in ALWAYS_ACTIVE workaround
radv/rt: Optimize update shader VGPR usage
radv,driconf: Enable active AS leaf workaround for Jedi Survivor
radv/rt: Handle monolithic pipelines in capture/replay
vulkan/runtime: Allow more than 8 DRM devices
radv: Set SCRATCH_EN for RT pipelines based on dynamic stack size
radv/rt: Fix frontface culling with emulated RT
radv/rt: Force active leaves for every updateable accel struct
radv,driconf: Remove active accel struct workaround
radv: Only enable SEs that the device reports
radeonsi: Only enable SEs that the device reports
aco/tests: Insert p_logical_start/end in reduce_temp tests
aco/spill: Insert p_start_linear_vgpr right after p_logical_end
Ganesh Belgur Ramachandra (1):
compiler,glsl: fix warning when -finstrument-functions is used
Georg Lehmann (60):
aco: reassign split vector to SOPC
aco: stop scheduling at p_logical_end
nir: add ballot_relaxed and as_uniform intrinsics
aco: implement as_uniform and ballot_relaxed
ac/llvm: implement as_uniform and ballot_relaxed
nir: add lowering for boolean shuffle
radv: lower boolean shuffle
radeonsi: lower boolean shuffle
aco: remove boolean shuffle isel
aco: fix printing dpp8
aco: validate v_permlane opsel correctly
aco: support v_permlane64_b32
aco/gfx11: use v_nop to resolve VcmpxPermlaneHazard
aco/gfx11: resolve VcmpxPermlaneHazard for v_permlane64
aco: implement rotate
radv: enable VK_KHR_shader_subgroup_rotate
radv: report rotate subgroup feature bits
anv: report rotate subgroup feature bits
aco/gfx11+: disable v_pk_fmac_f16_dpp
aco: add packed fma dpp note to README-ISA
aco: don’t remove branches that skip v_writelane_b32
aco/print_ir: don’t use alloca for input modifiers
aco: print neg prettier for packed math
aco: don’t print hi() for permlane opsel
aco: print permlane16 bc/fi
aco: print exec/vcc_lo/hi for single dword access
aco/gfx11+: limit hard clauses to 32 instructions
radv/gfx11+: add rtwave32 perftest option
aco: use fmamk/ak instead of fma with inline constant for more VOPD
nir: remove rotate scope
nir/divergence_analysis: fix subgroup mask
aco: create pseudo instructions with correct struct
aco/post-ra: rename overwritten_subdword to allow additional uses
aco/post-ra: assume scc is going to be overwritten by phis at end of blocks
aco: store if pseudo instr needs scratch reg
aco/post-ra: track pseudo scratch sgpr/scc clobber
aco/ssa_elimination: check if pseudo scratch reg overwrittes regs used for v_cmpx opt
aco/builder: improve v_mul_imm for negative imm
aco/builder: use 24bit mul if low bits of imm are zero
aco/optimizer: combine v_mul_i32_i24 and add to mad
aco: avoid full 32bit imul for uniform reduce/scan
aco: don’t combine mul+add_clamp to mad_clamp
aco/ra: use SDWA for 16bit instructions when the second byte is blocked
aco/vn: remove instruction hash templates
aco: use v1 definition for v_interp_p1lv_f16
aco/assembler: add vintrp high_16bit support
aco: swap opsel and wait_exp for vinterp
aco: support high_16bits FS IO
aco/tests: add assembler tests for interp high_16bits
aco/gfx9: all non legacy opsel instructions only write 16bits
aco: use v_interp_p2_f16 opsel
aco: add ra test for hi v_interp_p2_f16
radv: sink alu
radv: move alu
nir: don’t try to optimize exclusive min/max scan to inclusive
nir: rename to nir_opt_16bit_tex_image
ac/nir: add ac_nir_opt_pack_half
radv: use ac_nir_opt_pack_half
radv, radeonsi: don’t use D16 for f2f16_rtz
zink: use bitcasts instead of pack/unpack double opcodes
George Ouzounoudis (1):
vulkan: Fix null pointer dereferencing on sample locations state
Gert Wollny (76):
r600: lower dround_even also on hardware that supports fp64
virgl: Use better reporting for mirror_clamp features
ci: Upref virglrenderer
zink: Factor out create buffer from resource_object_create
zink: shorten lifetime of success variable in resource_object_create
zink: Factor out create sampler conversion in resource_object_create
zink: factor out get_format_feature_flags in resource_object_create
zink: factor out get_image_memory_requirement in resource_object_create
zink: reduce number of #ifdefs in resource_create_object
zink: extract get_export_flags from resource_object_create
zink: extract function allocate_bo from resource_create_object
zink: redesign the allocation try loop to test all heaps
zink: extract function create_image from resource_object_create
zink: extract function update_alloc_info from resource_object_create
zink: extract update_obj_info from resource_object_create
zink: extract debug_resource_mem from resource_object_create
zink: drop duplicate assignment to obj->alignment
zink: extract allocate_bo_and_update_obj from resource_object_create
zink: Move more code to create_image and create_buffer
zink: simplify call to get_export_flags
zink: remove duplicate arguments and use VkMemoryRequirements locally
zink: use enums as return values in resource_object_create
radv: Fix compilation with gcc-13 and tsan enabled
nir/lower_int64: Fix compilation with gcc-13 and tsan enabled
nir/builder: Fix compilation with gcc-13 when tsan is enabled
zink: Fix return type and values of create_buffer and create_images
zink: extract check_unordered_exec from zink_get_cmdbuf
zink: remove duplicate check and assignment in zink_resource_image_needs_barrier
zink: extract emit_memory_barrier from zink_resource_image_barrier
zink: extract emit_memory_barrier::for_buffer from zink_resource_buffer_barrier
zink: extract update_unordered_access_and_get_cmdbuf
zink/sync: remove duplicate assignments in UNSYNCHRONIZED case
zink: move zink_resource_copies_reset out of exportable_lock
zink: remove invalid scope in bo allocation loop
r600: handle indirect access to kcache 14 and 15
zink/nir_to_spirv: emit ViewportIndex cap also for inputs
zink: use only ZINK_BIND_DESCRIPTOR
zink: decrease aggressiveness of increasing descriptor data space adaptive
zink/nir-to-spirv: Make sure sampleid for InterpolateAtSample is int
nir-to-spirv: Cast SSBO input pointer when needed
zink: set handle type also for user memory
zink: acquire - maybe clear timeout after waiting for presentation fence
nir_to_spirv: Allow LOD for external images
zink: ctx->last_fence really wants to be a batch_state, so accomodate it
zink: another fence that is better off as a batch state
ntv: remove store_def_raw
ntv: remove store_ssa_def
ntv: pass def->index to store_def
ntv: simplify increasing the number of dest componnets for sparse tex
zink/ntv: introduce structure using the source params
zink/ntv: extract get_tex_srcs
zink/ntv: use new struct to pass texture parameters
zink/ntv: extract find_sampler_and_texture_index
zink/ntv: simplify evaluation of sampled_type
zink/ntv: extract get_texture_load
zink/ntv: extract get_texop_dest_type
zink/ntv: Extract move_tex:proj_into_coord
zink/ntv: replace if-chain with switch in emit_tex
zink/ntv: extract picking the image to load from
zink/ntv: extract emit_tex_readop as function that reads texture pixel data
zink/ntv: pull result out of cases and use a common store_def
zink: if AcquireNextImageKHR fails with VK_NOT_READY or VK_TIMEOUT retry
meson: Add blacklist when compiling with tsan
futex: disable futexes when compiling with tsan
util/u_queue: read fence->signalled locked with TSAN
tsan-blacklist: ignore race when reading lp_fence signalled status
llvmpipe: Don’t emit certain debug code when TSAN is enabled
tsan-blacklist: Ignore race in get_max_abs_timeout_ns
tsan-blacklist: surpress two race conditions in TC
r600/sfn: Add array element parent also to array
r600/sfn: Use dependecies to order barriers and LDS/RAT instructions
r600/sfn: call nir_lower_doubles explicitely
r600/sfn: when emitting fp64 op2 groups pre-load values
r600/sfn: Don’t put b2f64 conversion into ALU group
zink/kopper: Wait for last QueuePresentKHR to finish before acquiring for readback
mesa/st: don’t use base shader serialization when uniforms are not packed
Guilherme Gallo (25):
ci/lava: Turn the r8152 issue check into a counter
ci/lava: Detect r8152 issue during boot phase
ci/lava: Detect hard resets during test phase
bin/ci: Don’t submit jobs on integration test
ci/lava: Ignore DUT feedback messages
ci/lava: Fix the integration test
bin/ci: Propagate the token to GitlabGQL
bin/ci: Move get_token_from_default_dir to common
bin/ci: Refactor read_token function
bin/ci: Add GitLab basic token validation
ci/lava: Broader R8152 error handling
radv+zink/ci: Update xfiles based on nightly run
radv/ci: Update xfiles based on nightly run
v3d/ci: Update xfiles based on nightly run
freedreno/ci: Update xfiles based on nightly run
etnaviv/ci: Update xfiles based on nightly run
r300/ci: Update xfiles based on nightly run
ci/a618: Rebalance a618-limozeen jobs
ci/a618: Add zink-tu-a618-full
ci/lava: A few formatting cleanups
ci/lava: Introduce unretriable exception handling
ci/lava: Don’t run jobs if the remaining execution time is too short
ci/lava: Fix how exception entry in structured log
ci: Add S3 id_token for all jobs
ci: Use id_tokens for JWT auth
Gurchetan Singh (6):
mesa/util: Check __ANDROID__ when for detecting for Android
mesa/util: add <linux/fcntl.h>
mesa/util: use DETECT_OS_ANDROID in anon_file.c
mesa/vulkan: use a simpler path for header in trampoline gen
mesa/vulkan: use DETECT_OS_ANDROID
vk_image.c: #ifndef _WIN32 –> DETECT_OS_LINUX + DETECT_OS_BSD
Haihao Xiang (1):
anv: Fix typo in transition_color_buffer
Hannes Mann (2):
gallium/pipe: Add contiguous planes per-surface attribute
frontends/va: Only export one handle for contiguous planes
Hans-Kristian Arntzen (20):
wsi/x11: Remove unused vk_alpha in get_dri3_modifiers.
wsi/x11: Compare modifiers before signalling SUBOPTIMAL.
wsi/x11: Add drirc option to ignore SUBOPTIMAL.
wsi/x11: Add workaround for Detroit Become Human.
wsi/x11: Rewrite implementation to always use threads.
wsi/x11: Implement VK_EXT_swapchain_maintenance1.
wsi/x11: Keep track of multiple presentation requests.
wsi/x11: Make chain->status atomic.
wsi/wl: Refactor out code to update current present ID.
wsi/wl: Improve fallback for present_wait.
wsi/common: Allow KHR_present_wait on WL.
wsi/x11: Disable vk_xwayland_wait_ready by default on most drivers.
wsi/x11: Rewrite logic for how we consider minImageCount.
radv: export multiview in VS/TES/GS for depth-only rendering
wsi/wl: Fix deadlock in dispatch_queue_timeout.
wsi/wayland: Replace surface pilfer with retired bool.
wsi/wayland: Init outstanding list earlier.
wsi/x11: Return OUT_OF_DATE on sw resize.
vulkan/runtime: Check correct callback list for binding report.
radv: Store range rather than bo_size in VkBuffer/VkImage.
Helen Koike (6):
ci/ci_run_n_monitor: move get_gitlab_pipeline_from_url() to gitlab_common
ci/ci_gantt_chart: add tool to analyse pipeline execution time
ci/ci_gantt_chart: add timeout vertical line
ci/ci_gantt_chart: add option to save output to a file
ci/ci_gantt_chart: show duration on hover
ci/ci_post_gantt: add script that post gantt to Marge’s messages
Hsieh, Mike (4):
amd/vpelib: add new tf enum and add flag for geometric scaling
amd/vpelib: skip gamma remap and cs conversion when geometric scaling
amd/vpelib: geometric scaling fix
amd/vpelib: Add UID for 3d Lut and control logic
Hyunjun Ko (3):
anv/video: fix out-of-bounds read
anv/video: fix scan order for scaling lists on H265 decoding.
anv/video: Fix to set correct offset and size for parsing h265 slice header.
Iago Toral Quiroga (49):
broadcom/compiler: fix incorrect flags setup in non-uniform if path
broadcom/compiler: fix incorrect flags update for subgroup elect
broadcom/compiler: add new SFU instructions in V3D 7.x
broadcom/compiler: don’t move subgroup reduction instructions above setmsf
broadcom/compiler: support subgroup ballot
broadcom/compiler: support subgroup shuffle
broadcom/compiler: support subgroup vote
broadcom/compiler: support subgroup quad
v3dv: expose more subgroup features on V3D 7.x
broadcom/compiler: be more careful with unifa in non-uniform control flow
brodcom/compiler: implement non-compute TSY barrier
broadcom/compiler: support subgroup reduction operations from fragment shaders
v3dv: allow subgroup operations in fragment shaders
broadcom/compiler: fix lane selection for subgroups in fragment shaders
v3d,v3dv: fix BO allocation for shared vars
v3dv: fix subpass clear with draw call for multi-layered framebuffers
v3dv: always set view index before drawing
v3dv: fix copying v3dv_end_query_info into primaries from secondaries
v3dv: refactor checking and adding pending jobs
v3dv: add a helper to constrain clip window to render area
v3dv: add helper to check if we need to use a draw for a depth/stencil clear
v3dv: add helper to build a render pass for dynamic rendering
v3dv: add a helper to setup a framebuffer for dynamic rendering
v3dv: add a vk_render_pass_state to pipelines
v3dv: don’t assume that pipelines have a render pass
v3dv: implement vkCmdBeginRendering and vkCmdEndRendering
v3dv: implement dynamic rendering resume/suspend
v3dv: rename SECONDARY job type to INCOMPLETE
v3dv: fix resume address patching for secondary command buffers
v3dv: handle render pass continue flag with dynamic passes
v3dv: also emit subpass clears with secondary command buffers
v3dv: enable VK_KHR_dynamic_rendering
broadcom/ci: skips for tests that don’t check for extension support correctly
broadcom/ci: add new expected test failures
broadcom/ci: add a test that fails only in CI
broadcom/ci: add skips for unsupported features
v3dv: fix image creation when exceeding maxResourceSize
v3d: implement fix for GFXH-1602
broadcom/compiler: fix workaround for GFXH-1602
v3dv: require multisync kernel
v3dv: drop single sync kernel interface
v3dv: add a v3dv_job_clone helper
v3dv: fix job pointers from cloned CLs
v3dv: store the offset of the BRANCH instruction in a CL
v3dv: fix job suspend with command buffer simultaneous use flag
broadcom/compiler: enable perquad with uses_wide_subgroup_intrinsics
v3d/simulator: size counter_values array correctly on V3D 7.x
broadcom/ci: document external causes for some CTS 1.3.8 failures
v3dv: fix VK_KHR_vertex_attribute_divisor
Ian Romanick (54):
nir: Minor clean up in nir_alu_srcs_negative_equal
intel/compiler: Disable DPAS instructions on MTL
intel/compiler: Use u_foreach_bit64 in brw_get_compiler_config_value
intel/compiler: Track lower_dpas flag in brw_get_compiler_config_value
intel/compiler: Track mue_compaction and mue_header_packing flags in brw_get_compiler_config_value
intel/fs: Fix shift counts for 8- and 16-bit types
intel/rt: Don’t directly generate umul_32x16
intel/compiler/xe2: Update get_sampler_lowered_simd_width
intel/fs: Move opcode modification before the switch that emits srcs
intel/compiler/xe2: Use new sample_*_mlod messages
nir: Pack texture LOD and array index to a single 32-bit value
intel/compiler/xe2: Emit texture instructions w/ combined LOD and array index
intel/compiler/xe2: Set SIMD mode for sampler messages
nir: Add documentation for subgroup_.._mask
intel/fs: Delete stale comment in nir_intrinsic_ballot implementation
nir: Mark nir_intrinsic_load_global_block_intel as divergent
intel/fs: Enable nir_opt_uniform_atomics in all shader stages
intel/fs: Use constant of same type to write flag
intel/fs: Add fast path for ballot(true)
nir: Initial framework for optimizing uniform subgroup operations
intel/fs: Use nir_opt_uniform_subgroup
nir: Optimize uniform iadd, fadd, and ixor reduction operations
nir: Optimize uniform vote_all and vote_any
i915: Fix value returned for PIPE_CAP_MAX_TEXTURE_CUBE_LEVELS
intel/brw: Silence “statement may fall through” warning
intel/brw: Correctly dump subnr for FIXED_GRF in INTEL_DEBUG=optimizer
intel/compiler: Enforce 64-bit RepCtrl restriction in eu_validate
intel/brw: Integer multiply w/ DW and W sources is not commutative
intel/brw: Combine constants for src0 of integer multiply too
intel/brw: Combine constants for src0 of POW instructions too
intel/brw: Avoid a silly add with zero in assign_curb_setup
intel/fs: Don’t allow 0 stride on MOV destination
intel/brw/xe2: Correctly disassemble RT write subtypes
intel/brw: Fix handling of accumulator register numbers
intel/brw: Allow SIMD16 F and HF type conversion moves
intel/brw: Remove last vestiges of could_coissue
intel/brw: Clear write_accumulator flag when changing the destination
intel/brw: Use enums for DPAS source regioning
nir: intel/brw: Change the order of sources for nir_dpas_intel
intel/brw/xe2+: DPAS must be SIMD16 now
intel/brw/xe2+: Use phys_nr and phys_subnr in DPAS encoding
intel/brw/xe2: Update brw_nir_analyze_ubo_ranges to account for 512b physical registers
intel/brw/xe2: Update uniform handling to account for 512b physical registers
intel/compiler: Ensure load_barycentric_at_sample and load_interpolated_input remain together
intel/brw: Don’t call nir_opt_remove_phis before nir_convert_from_ssa
intel/elk: Don’t call nir_opt_remove_phis before nir_convert_from_ssa
intel/brw: Delete stray nir_opt_dce
intel/elk: Delete stray nir_opt_dce
intel/brw/xe2+: Implement Wa 22016140776
intel/brw/xe2+: Only apply Wa 22016140776 to math instructions
intel/brw: Fix handling of cmat_signed_mask
nir: intel/brw: Remove cmat_signed_mask from dpas_intel intrinsic
intel/brw: Fix optimize_extract_to_float for i2f of unsigned extract
intel/elk: Fix optimize_extract_to_float for i2f of unsigned extract
Isaac Marovitz (1):
asahi: Add >16 Sampler Access for Ryujinx
Iván Briano (10):
anv: flush query clears for all gens
anv, hasvk: pMutableDescriptorTypeLists can be out of range on pool creation
compiler/types: fix serialization of cooperative matrix
intel/cmat: fix stride calculation in cmat load/store
nir/algebraic: avoid double lowering of some fp64 operations
nir/lower_doubles: preserve sign of zero if we are asked to
nir/lower_doubles: preserve NaN when asked to do so
anv, hasvk: check requirements for USAGE_INPUT_ATTACHMENT properly
anv: check requirements for VK_IMAGE_USAGE_FRAGMENT_SHADING_RATE
anv: fix casting to graphics_pipeline_base
JCWasmx86 (1):
meson: Fix invalid kwarg name
Jan Beich (1):
util: mimic KCMP_FILE via KERN_FILE on DragonFly and FreeBSD
Jani Nikula (1):
docs: fix doc build ‘intel/dev/intel_device_info_gen.h’ file not found
Javier Martinez Canillas (2):
clc: silence a warn_unused_result
gallium: Add ssd130x to the list of kmsro drivers
Jesse Natalie (115):
ci/windows: Update WARP to 1.0.9 NuGet
mesa: Consider mesa format in addition to internal format for mip/cube completeness
ci/windows: Rev Vulkan SDK and piglit
d3d12: Set up spirv-as and fix expectations
microsoft/compiler: Declare shader model 6.8 / validator 1.8
microsoft/compiler: Handle comparison bias/gradient sampling
dzn: Add a debug option to enable experimental shader models
microsoft/compiler: Add feature flags for new comparison sampling ops
dzn: Implement maintenance3 VariableDescriptorCountLayoutSupport
dzn: Fix enhanced barrier layout for depth blits
dzn: Handle VkBindImageMemorySwapchainInfoKHR
dzn: Disable depth/stencil for partial binding from dynamic rendering
spirv2dxil: Fix the spirv2dxil command line tool
spirv2dxil: Handle aliasing/overlapping UBO/SSBO variables
util: Detect arm64ec as aarch64 (and x86_64)
glsl: Work around MSVC arm64 optimizer bug
dzn: Don’t set view instancing mask until after the PSO
dzn: Fix path passed to CreateDeviceFactory
d3d12: Fix path passed to CreateDeviceFactory
microsoft/compiler: Use double pack/unpack instead of int for reduce ops on doubles
dzn: Add a stencil blit fallback
dzn: Add missing condition to immutable sampler init loop
dzn: Add missing blit source barriers for enhanced barriers
microsoft/compiler: Respect ACCESS_COHERENT in UAV variable data
microsoft/compiler: Add a pass for promoting ACCESS_COHERENT on loads/stores
spirv2dxil: Lower the Vulkan memory model and coherent loads/stores
dzn: Add missing handling of VK_PIPELINE_STAGE_2_DRAW_INDIRECT_BIT
dzn: Add barrier to copy source for DispatchIndirect copies
dzn: Support non-static samplers for meta
dzn: Add a debug flag for forcing off native view instancing
dzn: Don’t resolve for RESOLVE_MODE_NONE
dzn: Use correct format for depth/stencil resolves
dzn: Use blits for all non-averaging resolves
microsoft/compiler: Only use simplified subgroup ID algorithm for compute
d3d12: Subgroup ballot
microsoft/compiler: Relax assert for SPIR-V barriers
spirv2dxil: Remove dead branches early during shader compilation
spirv2dxil: Trivial fixes for tessellation shaders
dzn: Simultaneous-access is mutually exclusive with MSAA
dzn: Fix tessellation shader insertion into PSO desc
dzn: Add a driconf option to disable dzn for specific apps and use it for RDR2
microsoft/compiler: For emulating scan, ensure all threads are active when reading cross-lane
microsoft/compiler: Fix wave size control for SM6.6+
microsoft/compiler: Fix wave size control for SM6.8+
wgl: Support a single-buffered winsys framebuffer
wgl: Flush frontbuffer when calling swapbuffers on single-buffered fb
wgl: Add no-gdi-single-buffered and gdi-double-buffered PFDs
wgl: Enable WGL_ARB_pixel_format_float
wgl: Add HDR pixel formats
winsys/d3d12: Support single-buffered mode
d3d12: Support R16G16B16A16_FLOAT display targets
microsoft/compiler: Fix SM6.6 non-bindless handle annotation for UAV counter
dzn: Fix conditions for barrier in texture-converting copy case
wgl: Check for stw_device->screen before trying to destroy it
spirv2dxil: Set push constant register space to nonzero
microsoft/compiler: Remove deref load/store/atomic ops that statically go out of array bounds
microsoft/compiler: Remove code after discard/terminate in later optimization steps
wgl: Initialize DEVMODE struct
d3d12: Point sprite lowering pass needs to handle arrays
nir_lower_tex_shadow: For old-style shadows, use vec4(result, 0, 0, 1)
spirv2dxil: Support buffer_device_address
dzn: Support bufferDeviceAddress
wgl: Delete unused context param to swap
wgl: Check for null before dereferencing ctx in swap
nir_tests: Add /bigobj when compiling with MSVC
dzn: Include vulkan_core.h instead of vulkan.h in the device enum header
dzn: Initialize memoryTypeBits for querying properties on imported handles
microsoft/compiler: domainLocation component index needs to be i8
microsoft/compiler: Disable GS streams workaround for validator 1.8
ci/windows: Update DirectX-Headers, Agility SDK, zlib, DXC, and WARP
ci/debian: Update DirectX-Headers
nir: Handle ptr_as_array for build_deref_follower
microsoft/compiler: Don’t store static-indexing handles that are dynamically emitted
microsoft/clc: When possible, compute a part-constant “pointer” value for kernel inputs
microsoft/compiler: Simplify code emitting CL globals
clc: Move libclc helpers back to microsoft/clc
microsoft/clc: Add linkage capability to libclc build to silence warning
microsoft/clc: Adjust order of UAV binding assignment
microsoft/clc: Install clon12compiler
wgl: The default swap interval is supposed to be 1
d3d12: Fix d3d12_lower_triangle_strip if multiple vars are in a single location
microsoft/compiler: When sorting variables, put unused variables last
microsoft/compiler: Move kill-unused/undefined varying pass from spirv to common
microsoft/compiler: Simplify I/O component type enum handling
microsoft/compiler: Expect front-facing var as an input
microsoft/compiler: Improve linking helpers
microsoft/compiler: Don’t duplicate work from gather_info in var sorting
d3d12: Move some lowering passes to pre-variant
d3d12: Lower uniforms to UBO by nir options
d3d12: Minor logging improvements
d3d12: Fix var splitting pass writemasks
d3d12: Explicitly add tess factor vars to tess signatures
d3d12: Forward front-facing for passthrough GS
d3d12: Capture always_active_io in varying data
d3d12: Use TES inputs rather than VS outputs for TCS variant key
d3d12: Add primitive ID sysval to input bitmask (for GS in)
d3d12: Gather info less and before the final compilation steps
d3d12: Remove variables instead of adding them for linking
d3d12: Don’t compile useless variants during shader creation
microsoft/compiler: Add a fractional var mask for variable sorting
d3d12: Set fractional var masks
d3d12: Add a debug flag for loading WinPixGpuCapturer.dll
ci/windows: Bump Agility SDK to 1.613.2 for ExecuteIndirect validation fix
microsoft/compiler: Handle base vertex/instance sysvals as DXIL intrinsics
spirv2dxil: Support passing first vertex / base instance to DXIL backend
spirv2dxil: Output more specific metadata for whether draw sysvals are needed
dzn: Delete dzn structs for indirect draw args and use D3D ones
dzn: Query options21
dzn: Understand whether first-vertex and base-instance are needed for a pipeline
dzn: Update pipeline cache params to take all options into account
dzn: Rework indirect drawing keys for shaders and command signatures
dzn: Add a hash table of command signatures with non-default strides
dzn: Don’t copy app indirect args if we don’t need to
glsl: Use a stable attr sort for VS in / FS out
d3d12: Include <shlobj.h> with lowercase name
Job Noorman (43):
tu: support l1 dcache size on musl
ir3: fix setting shared flag on parallel copy arguments
ir3: optimize read_first.macro to a mov
ir3: fix printing of brcst.active and quad_shuffle
ir3: optimize subgroup operations using brcst.active
ir3: set reconvergence for scan_clusters.macro
ir3: add disassembly for flat.b
ir3: update a0/a1 users when cloning instructions
ir3: fix alignment of spill slots
ir3: validate instruction block pointer
ir3: add terminators to blocks
ir3: fix instruction count before kill_sched
ir3: print branch sources
ir3: remove OPC_B and brtype from cat0
ir3: remove comp1/2 from cat0
ir3: allow liveness calculation for different register types
ir3: allow finding SSA uses for a subset of uses
ir3: implement RA for predicate registers
ir3: validate no registers are invalid after RA
ir3: integrate predicates into RA validation
ir3: optimize bitwise ops that can directly write predicates
ir3: insert predicate conversions after their source
ir3: fold negations into cmps.ne zero
nir: add search helper is_only_used_by_if
ir3: fold and/or and negations into branches
freedreno/ci: Update pixmark-piano-v2 hash
ir3: fix freeing incorrect register in loops
ir3: fix returning false instead of NULL
freedreno/registers: fix installation of schema
zink: print shaderdb info via debug message callback
ir3: calculate SSA uses at the start of predicates RA
ir3: fix finding uses of reloaded defs in predicates RA
ir3-disasm: run clang-format
ir3-disasm: remove unused #includes
ir3-disasm: add options to specify GPU by chip ID or name
ir3-disasm: add option to disassemble hex number
freedreno,computerator: support initialization of buffers
ir3: remove unnecessary tessellation epilogue
ir3: model predt/predf without sources
ir3: add support for precolored sources in predicate RA
ir3: add support for predication
freedreno/drm-shim: add a730, a740, and a750
freedreno/drm-shim: remove duplicate entry for a630
Jonathan Gray (4):
intel/dev: update DG2 device names
intel/dev: update DG2 device names
intel/dev: update DG2 device names
intel/dev: 0x7d45 is mtl-u not mtl-h
Jonathan Marek (1):
tu/a750: Basic a750 support
Jordan Justen (26):
intel/dev/common: Add xe2 support to get_l3_list()
intel/dev: Add ARL platform enums
intel/dev: Add intel_device_info_is_mtl_or_arl()
intel/l3: Define l3 config for ARL
iris: Extend MTL modifiers to ARL devices
intel/i915: ARL also supports the set-PAT uapi
intel/dev: Define engine prefetch for ARL
isl: Define MOCS for ARL
isl: Handle ARL in isl_drm_modifier_get_score()
intel/compiler: Lower DPAS instructions on ARL except ARL-H
anv/drirc: Extend option to disable FCV optimization to ARL
anv/query: Follow MTL code paths on ARL
intel/dev: Add device info for ARL
intel/compiler: Set branch shader required-width as 16 for xe2
intel/compiler: Implement nir_intrinsic_load_topology_id_intel for xe2
intel/compiler: Verify SIMD16 is used for xe2 BTD/RT dispatch
intel/dev: Add 2 additional ADL-N PCI ids
intel/compiler: Adjust fs_visitor::emit_cs_terminate() for Xe2
intel/dev: Adjust device strings for ATS-M devices
intel/dev: Add ATS-M PCI ID for Data Center GPU Flex 170G
intel/compiler/fs: Restore SIMD32 restriction for ray_queries on Xe2
intel/compiler: nib_ctrl no longer exists on Xe2+
intel/dev/mesa_defs.json: Add LNL WA entries
intel/dev: Add 0x56be and 0x56bf DG2 PCI IDs
intel/dev: Change ATS-M 0x56c2 string from 170G to 170V
intel/brw: Avoid getting a stride of 0 for nir_intrinsic_exclusive_scan
Jose Maria Casanova Crespo (7):
ci: Adds /usr/local/bin to PATH at piglit-traces.sh
v3d: Fix indentation at v3d_flush_jobs_writing_resource
v3d: Only flush jobs that write texture from different job submission.
v3d: Early return if job is not writing the resource
v3d: Implement GL_ARB_texture_barrier
broadcom/compiler: needs_quad_helper_invocation enable PER_QUAD TMU access
ci: re-enable Igalia farm
Joshua Ashton (34):
winsys/amdgpu: Hook up guilt to amdgpu_ctx_set_sw_reset_status
winsys/amdgpu: Limit usage of query_reset_state2
radv/amdgpu: Handle -ENODATA and -ETIME from cs_submit
radv: Mark device loss if QueueSubmit failed immediately
radv: Remove check_status
radv/amdgpu: Remove ctx_query_reset_status
radv: Add radv_get_tdr_timeout_for_ip helper
radv: Ensure vkGetQueryPoolResults returns in finite-time
android: Use system = ‘android’ in crossfile
meson: Enable zink in gallium_drivers by default
meson: Enable d3d12 in gallium_drivers by default on Windows
anv: Enable EXT_swapchain_maintenance1
v3dv: Enable EXT_swapchain_maintenance1
lavapipe: Enable EXT_swapchain_maintenance1
v3dv: Enable EXT_swapchain_colorspace
lavapipe: Enable EXT_swapchain_colorspace
wsi: Pass wsi_drm_image_params to wsi_configure_native_image
wsi: Pass wsi_drm_image_params to wsi_configure_prime_image
wsi: Add explicit_sync to wsi_image_info
wsi: Add explicit_sync to wsi_drm_image_params
build: Add linux-drm-syncobj-v1 wayland protocol
wsi: Track if timeline semaphores are supported
wsi: Add acquired member to wsi_image
wsi: Track CPU side present ordering via a serial
wsi: Get timeline semaphore exportable handle types
wsi: Add common infrastructure for explicit sync
ci: Bump wayland-protocols version to 1.34
ci: Bump DEBIAN_BASE_TAG for now
meson: Update wayland-protocols wrap to 1.34
meson: Bump wayland-protocols requirement to 1.34
wsi: Implement linux-drm-syncobj-v1
tu: Expose VK_EXT_surface/swapchain_maintenance1
radv: Enable KHR_video_queue if encode is enabled
radv: Properly initialize imageCreateFlags in GetPhysicalDeviceVideoFormatPropertiesKHR
José Expósito (2):
zink: add render-passes HUD query
meson: Update proc_macro2 meson.build patch
José Roberto de Souza (88):
intel/isl/xe2: Disable route of Sampler LD message to LSC
anv: Fix PAT entry for userptr in integrated GPUs
intel/genxml/xe2: Remove L3ALLOC
intel/dev: Reduce usage of intel_device_info_compute_system_memory()
intel: Make memory heaps consistent between KMDs
anv: Fix calculation of syncs required in Xe KMD
iris: Avoid read of uninitialized value in blorp_clear_stencil_as_rgba()
iris: Fix return of iris_wait_syncobj()
iris: Wait for drm_xe_exec_queue to be idle before destroying it
intel/common: Add functions to handle async vm bind
anv: Start to use intel_bind_timeline
iris: Start to use intel_bind_timeline
anv: Switch to truly asynchronous VM binding in Xe KMD
iris: Switch to truly asynchronous VM binding in Xe KMD
intel: Fix intel_get_mesh_urb_config()
anv: Drop include to common/i915/intel_gem.h
intel/common: Fix location of C++ support macro in intel_gem.h
intel: Remove circular dependency between intel/dev and intel/common
intel/common: Add intel_engines_supported_count()
anv: Use intel_engines_supported_count()
iris: Use intel_engines_supported_count()
intel: Sync i915_drm.h
intel/common: Implement i915_engines_is_guc_semaphore_functional()
intel: Sync xe_drm.h
intel/common: Implement xe_engines_is_guc_semaphore_functional()
iris: Fix iris_batch_is_banned() check
anv: Use DRM_XE_VM_BIND_OP_UNMAP_ALL to unbind whole bos
docs/anv: Add recommended GuC firmware version
iris: Set (EXEC_OBJECT_SUPPORTS_48B_ADDRESS | EXEC_OBJECT_PINNED) in a single place
iris: Remove iris_bo::kflags
iris: Move i915 set and get tiling uAPI calls to i915 specific code
iris: Remove more i915_drm.h includes from common code
intel: Move intel_define.h to i915/intel_define.h
intel/common: Remove more i915_drm.h includes from common code
intel/tools/error_decode: Add function to try to open error dump file
intel/tools/error_decode: Simply error message handling
intel/tools/error_decode: Add support to search for Xe KMD error dumps
intel/tools/error_decode: Detect and split error dump file parsing by KMD
intel: Sync xe_drm.h
anv/xe: Add VMs to error dump
iris/xe: Add VMs to error dump
intel/tools/error_decode: Move code that can be shared between i915 and Xe error decoders
intel/tools/error_decode: Parse Xe KMD error dump file
intel/tools: Fix compilation in 32 bits
intel/nullhw: Fix 32bits compilation warnings
iris: Add IRIS_HEAP_DEVICE_LOCAL_CPU_VISIBLE_SMALL_BAR heap type
iris: Force lmem cpu accessible for bos with clear-color
iris/xe: Consider pat_index while unbinding the bo
anv: Call flush_pipeline_select_gpgpu() for compute engines in compute code paths
anv: Skip cmd_buffer_emit_bt_pool_base_address() in blitter and video engines
intel: Drop pre-production steppings
anv: Fix Xe KMD userptr unbind
intel/dev: Nuke ‘ver == 10’ check
intel/dev: Nuke display_ver
intel: Enable Xe KMD support by default
iris: Set BO_ALLOC_NO_SUBALLOC when allocating bo for slab
anv: Replace the 2 sparse booleans by 1 enum
anv: Set VK_QUEUE_PROTECTED_BIT during queue families initialization
anv: Set VM control to true in Xe KMD
intel/tools/error_decode: Fix parsing in Xe decoder
intel/tools/error_decode: Add function to print batch in Xe decoder
intel/tools/error_decode: Parse HW context in Xe decoder
iris: Move tiling_to_modifier() implementation to i915 folder
iris: Remove i915_drm.h include from iris_indirect_gen.c
intel/decoder: Fix binding table pointer entry being marked as invalid
anv: Set STATE_COMPUTE_MODE mask bit when zeroing compute mode
intel/genxml: Add more instdone registers
intel/genxml/gfx125: Fix definition of INTERFACE_DESCRIPTOR_DATA::Thread group dispatch size
intel/genxml/xe2: Update definition of INTERFACE_DESCRIPTOR_DATA
anv: Create protected engine context when i915 supports vm control
anv: Remove protected memory types from default_buffer_mem_types
intel/tools/error2hangdump: Print out_filename when failed to open it
intel/tools/error2hangdump: Replace drm_i915_gem_engine_class by intel_engine_class
intel/tools: Move Xe KMD error decode functions to a separated file
intel/tools: Move ascii85_decode_char() to error_decode_lib
intel/tools: Move more Xe KMD error decode functions to error_decode_xe_lib
intel/tools/error2hangdump: Move code that will be shared with Xe parser to error2hangdump_lib
intel/tools/error2hangdump: Move i915 parser to a function
intel/tools/error2hangdump: Add Xe KMD support
anv: Add missing ANV_BO_ALLOC_INTERNAL
iris: Add comments to BO_ALLOC flags
iris: Avoid creation of slabs and cache buckets of lmem heaps in integrated gpus
iris: Avoid allocation of not needed iris_bucket_cache
intel/tools/aubinator_error_decode: Move definition of option_color to header
intel/decoder: Add intel_print_group_custom_spacing()
intel/tools: Parse INSTDONE registers in Xe KMD error dump
intel: Sync xe_drm.h
intel/dev: Read GFX IP version during runtime
Juan A. Suarez Romero (32):
Revert “v3d: use kmsro to create drm screen on real hw”
v3d: show warning on creating a v3d screen on real hw
v3d/vc4/ci: reset the list of timeout tests
Revert “v3d: show warning on creating a v3d screen on real hw”
broadcom/simulator: protect simulator BO rallocs with mutexes
v3d/ci: run OpenGL 3.1 tests
v3dv/ci: increase timeout for full jobs in 30min
ci: disable Igalia farm
Revert “ci: disable Igalia farm”
Revert “ci: disable Igalia farm”
v3d/ci: update expected results
v3d/ci: update expected list
Revert “v3d/ci: update expected list”
vc4/ci: update expected list
v3d/ci: add new failures
v3dv/ci: update expected list
v3dv/ci: remove crashes from expected list
v3d,v3d: use new simulator
v3dv: disable Early Z for multisampled 16-bit depth buffers
v3d: disable Early Z for multisampled 16-bit depth buffers
broadcom/compiler: fix SFU check for 7.1
v3dv: mark some promoted extensions as supported
v3d: add load_fep_w_v3d intrinsic
v3d: fix line coords with perspective projection
compiler,gallium: move u_reduced_prim to common
v3dv: assume that rasterization state can be NULL
v3dv: enable smooth line rendering
broadcom/ci: add new expected failures
v3d: configure polygon mode when enabled
broadcom/ci: update expected results
v3dv/ci: update expected list
nir/lower_clip: update inputs/ouputs read/written bitmask
Juston Li (13):
venus: refactor query feedback cmds
venus: acquire mutex when recycling query feedback cmds
venus: free query batches for VK_COMMAND_POOL_RESET_RELEASE_RESOURCES_BIT
venus: add comments for query feedback batch free list
venus: recycle linked query feedback immediately during submission
venus: handle empty resolved query feedback list
venus: fix image reqs cache store locking
venus: extract cache hash/equals functions into common
venus: image format properties cache
venus: move feedback on empty last batch to prior batch
venus: fix VkDeviceGroupSubmitInfo cmd counts from feedback
venus: extend device format prop cache with VkFormatProperties3
Revert “zink: store last pipeline directly for zink_gfx_program::last_pipeline”
Kai Wasserbäch (2):
fix(FTBFS): clc: adapt to new LLVM 19 DiagnosticHandlerTy
fix(FTBFS): clover: adapt to new LLVM 19 DiagnosticHandlerTy
Karol Herbst (131):
nak/algebraic: merge run and main function
nak/algebraic: write code to an output file
clc: use spirv triple starting with llvm-17
clc: add support for the native spir-v backend
rusticl/kernel: run opt/lower_memcpy later to fix a crash
rusticl/kernel: add a few comments in regards to pass ordering
rusticl/kernel: no need to reset the scratch size anymore
nir/printf: remove treat_doubles_as_floats
clc: require LLVM-14
clc: merge blocks handling optional features
clc: require LLVM-15
nir: rework and fix rotate lowering
rusticl/program: rework debug logging option
rusticl/spirv: do not attempt to parse spirv after failed link
rusticl/spirv: use bool::then_Some inside SPIRVBin::link
rusticl/program: add clc_validator_options helper function
rusticl/program: add debug option to validate internal spirvs
nak/opt_out: fix comparison in try_combine_outs
nak: simplify phi_dsts
nak: make it compile with clippy
rusticl/meson: use rust_abi instead of rust_crate_type
rust/spirv: fix clippy lint on unneeded late initialization
rusticl/kernel: check that local size on dispatch doesn’t exceed limits
nak/meson: specify rust flags globally and allow some clippy lints
nak: fix clippy::extra_unused_lifetimes warnings
nak: fix clippy::mem_replace_with_default warnings
nak: fix clippy::useless_conversion warnings
nak: fix clippy::needless_lifetimes warnings
nak: fix clippy::needless_borrow warnings
nak: fix clippy::while_let_loop warnings
nak: fix clippy::match_like_matches_macro warnings
nak: fix clippy::needless_return warnings
nak: fix clippy::redundant_closure warnings
nak: fix clippy::unwrap_or_default warnings
nak: fix clippy::manual_while_let_some warnings
nak: fix clippy::clone_on_copy warnings
nak: fix clippy::single_match warnings
rusticl/util: add a wrapper around “thread-safe” C types
rusticl/mesa/device: convert to ThreadSafeCPtr
rusticl/mesa/screen: convert to ThreadSafeCPtr
rusticl/mesa: add thread-safe wrapper for pipe_image_views
rusticl/context: store SVM pointers as usize
rusticl/gl: mark GLCtxManager as Send + Sync
rusticl/mem: make Mem Send/Sync by storing mapping ptrs as usize
rusticl/program: mark NirKernelBuild as Send and Sync
meson: remove opencl-external-clang-headers option and rely on shared-llvm
clc: force fPIC for every user when using shared LLVM
nir/lower_cl_images: record image_buffers and msaa_images
rusticl/mem: properly handle buffers
rusticl/mem: support GL_TEXTURE_BUFFER
rust/api: add RustTypes enum
rusticl/util: support nested structs in offset_of!
rusticl/api: allow CLObjectBase to be placed anywhere
rusticl/icd: move get_ref_vec_from_arr into the Rusticl type
rusticl/icd: move refcnt() and get rid of needless atomic ops
rusticl/icd: move retain() and release()
rusticl/icd: move get_arc_vec_from_arr and rename it
rusticl/icd: fold leak_ref into its only consumer
rusticl/icd: move get_ref()
rusticl/device: deduplicate devices with sorting
rusticl/icd: move from_arc() and rename it
rusticl/event: drop from_cl_arr and use arcs_from_arr
rusticl/icd: move get_arc() and rename it
rusticl/icd: split Arc part out of CLObject into new trait
rusticl/device: get rid of pointless Arc overhead
rusticl/icd: actually allow dispatching CL types
rusticl/mem: split into Buffer and Image
rusticl/mem: use pattern matching in is_parent_buffer
rusticl/mem: move fill methods into concrete types
core/memory: drop Arc for &Arc<Queue> function parameters
rusticl/mem: move map methods into concrete types
rusticl/mem: move shadow sync methods into concrete types
rusticl/mem: split unmap into Buffer and Image versions
rusticl/mem: move copy and write buffer impls into Buffer
rusticl/mem: split read_to_user_rect into Buffer and Image versions
rusticl/mem: split write_from_user_rect into buffer and image
rusticl/mem: move copy_to_rect into Buffer
rusticl/mem: split copy_to into Buffer and Image
rusticl/mem: split Buffer::copy_to into Buffer and Image versions
rusticl/mem: split Image::copy_to into Buffer and Image versions
rusticl/mem: get rid of pixel_size
rusticl/mem: move tx_image into Image
rusticl/mem: fold tx_raw into tx
rusticl/image: call tx on the parent buffer directly
rusticl/mem: move is_parent_buffer into Image
rusticl/mem: move tx into Buffer
rusticl/mem: remove get_res
rusticl/mem: move comment describing how mapping works
rusticl/mem: reorganize Image::map
rusticl/mem: move MemBase::map into the users
rusticl/mem: move tx_raw_async methods into Buffer and Image
rusticl/mem: move Buffer and Image specific fields into the subtypes
rusticl/mem: reorganize MemBase::from_gl a little
rusticl/mem: move pipe_image_host_access into Image
rusticl/kernel: recalculate scratch and shared memory after opts
rusticl/program: fix CL_PROGRAM_BINARIES for devs with no builds
meson/rusticl: import rust instead of unstable-rust
clc: include opencl-c.h for extensions needing it
meson: do not pull in clc for clover
intel: Only build shaders with anv and iris
zink: lower unaligned memory accesses
rusticl/context: complete conversion of SVM pointers to usize
rusticl/memory: store host_ptr as usize
rusticl/memory: make closures Send and Sync
rusticl/event: make EventSig Send + Sync
rusticl/spirv: mark SPIRVBin as Send and Sync
rusticl/kernel: make it Send and Sync
rusticl/icd: verify all cl classes are Send and Sync
rusticl/meson: remove -Aclippy::arc-with-non-send-sync flag
rusticl/kernel: make builds private
rusticl/event: we need to call the CL_COMPLETE callback on errors as well
rusticl/kernel: assign sampler locations before DCEing variables
rusticl/device: support query_memory_info to retrieve available memory
drm-uapi: Sync nouveau_drm.h
nvk: use c.get_supported_arguments for compiler flags
nouveau: import libdrm_nouveau
nouveau: call glsl_type_singleton_init_or_ref earlier
nouveau/drm: drop immediate parameter from nouveau_pushbuf_new
nouveau/drm: rely on nouveau_pushbuf::channel being always set
nouveau/drm: drop unsued chan argument from nouveau_pushbuf_kick
nouveau/drm: remove nouveau_client::id
rusticl/util: make create_pipe_box indepentend of pipe_box’s field types
meson: fix link failure with llvm-18
rusticl/program: handle -cl-no-subgroup-ifp
nouveau: fix potential double-free in nouveau_drm_screen_create
nir: fix nir_shader_get_function_for_name for functions without names.
rusticl: use stream uploader for cb0 if prefered
rusticl/icd: remove CLObject
event: break long dependency chains on drop
rusticl/mesa/context: flush context before destruction
nir/lower_cl_images: set binding also for samplers
Kenneth Graunke (76):
iris: Don’t return timestamps modulo 36-bits
intel/dev: Fix typo (ajust -> adjust)
iris: Implement query_memory_info() on discrete cards
intel/nir: Pass devinfo and prog_data to brw_nir_lower_cs_intrinsics
intel: Add driver support for hardware generated local invocation IDs
intel: Use hardware generated compute shader local invocation IDs
driconf: Advertise GL_EXT_shader_image_load_store on iris for SVP13
iris: Implement INTEL_DEBUG=heaps
intel/fs: Don’t include sync.nop in instruction count statistics
intel/fs: Don’t rely on CSE for VARYING_PULL_CONSTANT_LOAD
intel/brw: Delete enum brw_urb_write_flags
intel/brw: Delete more unused defines
intel/brw: Delete legacy SFIDs
intel/brw: Delete SIMD4x2 URB opcodes
intel/brw: Delete more unused compression stuff
intel/brw: Delete SINCOS
intel/brw: Delete constant_buffer_0_is_relative
intel/brw: Delete compiler->supports_shader_constants
intel/brw: Delete enum gfx6_gather_sampler_wa
intel/brw: Delete brw_wm_prog_key::line_aa
intel/brw: Delete unnecessary brw_wm_prog_data fields
intel/brw: Delete some swizzling functions
intel/brw: Delete brw_eu_util.c
intel/brw: Change unit tests to use TEX_LOGICAL instead of TEX
intel/brw: Delete SHADER_OPCODE_TXF_CMS[_LOGICAL]
intel/brw: Delete SHADER_OPCODE_TXF_UMS
intel/brw: Allow CSE on TXF_CMS_W_GFX12_LOGICAL
intel/brw: Delete legacy texture opcodes
intel/brw: Mark FIND[_LAST]_LIVE_CHANNEL as not writing the flag
intel/brw: Replace CS_OPCODE_CS_TERMINATE with SHADER_OPCODE_SEND
intel/brw: Avoid copy propagating any fixed registers into EOTs
intel/brw: Handle SHADER_OPCODE_SEND without src[3] in copy prop
intel/brw: Add assertions that EOT messages live in g112+
intel/brw: Copy the smaller payload in fixup_sends_duplicate_payload
intel/brw: Make register coalescing obey the g112-g127 restriction
intel/brw: Call constant combining after copy propagation/algebraic
intel/brw: Remove SIMD lowering to a larger SIMD size
intel/brw: Unindent code after previous change
iris: Fix tessellation evaluation shaders that use scratch
intel/brw: Emit better code for read_invocation(x, constant)
iris: Remove suballocation in iris_flush_resource()
iris: Eliminate prototype introduced in the previous patch
ra: Add debug functions for printing spill costs and benefits
intel/fs: Avoid generating useless UNDEFs for every SSA def
intel/brw: Split out 64-bit lowering from algebraic optimizations
intel/brw: Don’t consider UNIFORM_PULL_CONSTANT_LOAD a send-from-GRF
intel/brw: Eliminate top-level FIND_LIVE_CHANNEL & BROADCAST once
intel/brw: Fix check for 64-bit SEL lowering types
intel/brw: Assert that min/max are not happening in 64-bit SEL lowering
intel/brw: Use correct execution pipe for lowering SEL on DF
intel/brw: Unify DF and Q/UQ lowering for MOV
Revert “intel/brw: Don’t consider UNIFORM_PULL_CONSTANT_LOAD a send-from-GRF”
intel/brw: Fix opt_split_sends() to allow for FIXED_GRF send sources
intel/brw: Fix register coalescing’s LOAD_PAYLOAD dst offset handling
intel/brw: Fix destination stride assertion in copy propagation
intel/brw: Allow changing types for LOAD_PAYLOAD with 1 source
intel/brw: Delete brw_fs_lower_minmax
anv, hasvk: Save the original instance ID
anv, hasvk: Move multiview remapping loop below output stores
anv, hasvk: Fix nir_lower_multiview to re-emit outputs before EmitVertex
intel/brw: Stop checking mlen on math opcodes in CSE pass
intel/brw: Rearrange fs_inst fields
intel/brw: Fix generate_mov_indirect to check has_64bit_int not float
intel/brw: Fix lower_regioning for BROADCAST, MOV_INDIRECT on Q types
intel/brw: Update comments for indirect MOV splitting
intel/brw: Don’t mention gfx7 limitations in shuffle comments
intel/brw: Drop dead CHV checks.
intel/brw: Drop align16 support in brw_broadcast()
intel/brw: Drop gfx7 scratch message setup code
intel/brw: Delete if_depth_in_loop
intel/brw: Delete fs_visitor::vgrf helper
intel/brw: Drop default size of 1 from bld.vgrf() calls
intel/brw: Use SHADER_OPCODE_SEND for coherent framebuffer reads
intel/brw: Replace FS_OPCODE_LINTERP with BRW_OPCODE_PLN
intel/brw: Make an fs_builder::SYNC helper
isl: Set MOCS to uncached for Gfx12.0 blitter sources/destinations
Konrad Dybcio (1):
freedreno/registers: Add some HWCG regs
Konstantin (8):
util/printf: Include stdio.h
util/printf: Expose util_printf_prev_tok
ac/debug: Handle the output of recent umr versions
radv/debug: Canonicalize shader addr
radv: Canonicalize addresses in radv_find_shader
radv/debug: Try to find unbound shaders
radv/debug: Dump descriptor binding information
ac/parse_ib: Always print the value of the whole register
Konstantin Seurer (105):
nak/repair_ssa: Remap PHI sources as well
ac/llvm: Enable helper invocations for quad OPs
radv: Vectorize load_global_constant
lavapipe: Fix DGC vertex buffer handling
gallivm: Use saturating fpto*i conversions
lavapipe: Mark vertex elements dirty if the stride changed
lavapipe: Report the correct preprocess buffer size
radv: Implement NIR debug printf
llvmpipe: Stop refcounting sample functions
llvmpipe: Compile sample functioins on demand
radv/rt: Use doubles inside intersect_ray_amd_software_tri
llvmpipe: Fix building with llvm11
nir/print: Don’t print shared_size twice
nir/print: Rename workgroup-size to workgroup_size
radv/radix_sort: clang-format
radv: Reduce the amount of radv_device_to_handle calls
radv: Make radv_write_user_event_marker non-static
radv: Emit user events during acceleration structure builds
radv: Skip unused acceleration structure build paths
radv/sqtt: Set SeparateCompiled for monolithic RT pipelines
radv/sqtt: Handle ray tracing pipelines with no traversal shader
radv/rt: Lower ray payloads like hit attribs
radv/rra: Rename rra_chunk_type to rra_chunk_version
radv/rra: Use memcpy for chunk descriptions
radv/rra: Remove useless variable
radv/rra: Refactor error handling
radv/rra: Dump basic ray history tokens
docs: Document RADV_RRA_TRACE_HISTORY_SIZE
radv/rra: Implement ahit/isec counters
amd/common: Use the correct register table for GFX10_3
radv: Wire up ac_gather_context_rolls
zink: Always set mfence->submit_count to the fence submit_count
Revert “zink: always force flushes when originating from api frontend”
llvmpipe: Use full subgroups when possible
gallivm: Consider the initial mask when terminating loops
lavapipe: Advertise VK_KHR_shader_maximal_reconvergence
ci: Update llvmpipe trace checksums
ac/parse_ib: Add and use print_addr
ac/parse_ib: Dump the ADDR field of PKT3_SET_BASE
ac/parse_ib: Annotate addresses with UAF/OOB info
ac/parse_ib: Handle 32bit PKT3_DISPATCH_INDIRECT addrs
ac/parese_ib: Handle more packets
radv/rra: Avoid reading past the ray history buffer
radv/meta: Add shader - device mapping for radv_build_printf
vulkan/cmd_queue: Implement CmdBuildAccelerationStructuresKHR
lavapipe: Implement VK_KHR_acceleration_structure
lavapipe: Add ray traversal code
lavapipe: Implement VK_KHR_ray_query
lavapipe: Advertise VK_KHR_deferred_host_operations
lavapipe: Advertise VK_KHR_acceleration_structure
lavapipe: Advertise VK_KHR_ray_query
lavapipe/ci: Document ray query failures
docs: Document lavapipe ray tracing features
vulkan: Implement DebugMarkerSetObjectNameEXT
radv/rt: Implement RADV_DEBUG=shaderstats
radv/rt: Add radv_ray_tracing_stage_info
radv/rt: Fixup constant args
aco: Only fix used variables to registers
radv/rt: Avoid passing unused data to the next stage
radv/rt: Inline constant trace_ray srcs into the traversal shader
radv/rt: Inline constant information about ray flags
radv/rt: Fix raygen_imported condition
zink: Handle aoa derefs of images
ac: Annotate context rolls
ac/parse_ib: Replace the parameter list with ac_ib_parser
ac/parse_ib: Implement annotations
radv: Add support for IB annotations
radv: Add an IB annotation layer
ac: Improve context roll readability
radv: Use radv_buffer_map for parsing IBs
radv/rt: Use 32-bit offsets for load_sbt_entry
radv: Skip more acceleration structure build markers
radv/printf: Use fprintf instead of printf
nir/print: Fix printing booleans with bit_size>1
nir/serialize: Encode data for temporaries
nir: Add lavapipe ray tracing intrinsics
llvmpipe: Fix function call handling
lavapipe: Add lvp_spirv_to_nir
lavapipe: Make lvp_shader_init non-static
lavapipe: Make lvp_create_pipeline_nir non-static
lavapipe: Lower mem_constant variables
lavapipe: Defer binding compute state
lavapipe: Remove unused ray tracing variables
lavapipe: Add more ray tracing helpers
lavapipe: Pass lvp_ray_flags into lvp_aabb_intersection_cb
lavapipe: Use the pipeline type in get_pcbuf_size
lavapipe: Inline fill_ubo0
lavapipe: Add an api_stage parameter to update_pcbuf
lavapipe: Fix a memory leak in lvp_push_internal_buffer
lavapipe: Implement VK_KHR_ray_tracing_pipeline
lavapipe: Implement KHR_ray_tracing_maintenance1
lavapipe: Implement VK_EXT_pipeline_library_group_handles
lavapipe: Implement VK_KHR_ray_tracing_position_fetch
radv: Destroy leaf_updateable_pipeline
lavapipe: Handle accel struct queries in handle_copy_query_pool_results
lavapipe: Implement ray_tracing_maintenance1 queries
lavapipe: Do nort use NIR_PASS during lowering
lavapipe: Handle multiple planes in GetDescriptorEXT
lavapipe: Explicitely support ycbcr formats
Revert “gallivm/ssbo: mask offset with exec_mask instead of building the ‘if’”
radv: Handle all dependencies of CmdWaitEvents2
nir/print: Do not access invalid indices of load_uniform
radv: Fix radv_shader_arena_block list corruption
radv: Remove arenas from capture_replay_arena_vas
radv: Zero initialize capture replay group handles
Krzysztof Kurek (1):
panfrost: fix shift overflow in `bi_fold_constant`
Leo Liu (2):
radeonsi: fix video processing path without VPE enabled
ac/gpu_info: Fix broken UVD firmware query
Lepton Wu (1):
llvmpipe: Set “+64bit” for X86_64
Lin, Ricky (1):
amd/vpelib: Rename the parameters of init vpe function
Lionel Landwerlin (186):
anv: fix disabled Wa_14017076903/18022508906
intel/aux_map: fix fallback unmapping range on failure
anv: hide vendor ID for The Finals
intel/decoder: make vertex data decoding optional
intel/decoder: don’t ignore BT entries at offset 0
intel/genxml: add CCS_INSTDONE register
intel/genxml: add GAM done register description
intel/hang_viewer: add aux-tt view
anv: export descriptor flushing functions
anv: fix include guards
anv: fix missing header
anv: move generated draw flush helper to its own file
anv: move draw commands to their own file
anv: move compute/ray-tracing commands to their own file
anv: rename video command file
nir/alu_srcs_negative_equal: bail earlier if possible
nir/comparison_pre_tests: update expectations
anv: using a single struct for kernel upload
anv: fix pipeline executable properties with graphics libraries
isl: add print helpers for debug
anv: implement undocumented tile cache flush requirements
anv: reorder anv_astc_emu.c
anv: remove unused perfetto declarations
anv: rename layers entrypoints
anv: add BO flag for internal driver allocations
anv: track total state stream allocated blocks from the pool
anv: track imported ray tracing pipeline groups
anv: initial RMV support
vulkan/runtime: handle new image layout
anv: don’t prevent L1 untyped cache flush in 3D mode
anv: promote EXT_index_type_uint8 to KHR
anv: promote EXT_line_rasterization to KHR
anv: promote EXT_load_store_op_none to KHR
anv: add missing alignment for AUX-TT mapping
intel/ds: track predication of blorp operations
vulkan/runtime: add helper to query attachment layout
anv: ensure consistent layout transitions in render passes
anv: add check that in renderpass barriers apply to attachments
anv: handle image feedback loop usage
anv: implement VK_EXT_attachment_feedback_loop_dynamic_state
anv/hasvk: don’t report error when intel_get_device_info_from_fd fails
anv: factor out aux-tt binding logic for future reuse
anv: rename aux_tt image field
anv: retain ccs image binding address
anv: fix transfer barriers flushes with compute queue
vulkan/runtime: handle new dynamic states for attachment remapping
docs/features: drop gen8+/gen9+ on Anv
docs/features: synchronize new features for Anv
vulkan/multialloc: bump max number to 16
vulkan/runtime: rework VK_KHR_dynamic_rendering_local_read state tracking
anv: reduce cache flushing for indirect commands on Gfx12.5+
anv: don’t unmap AUX ranges at BO delete
isl: printout sparse usage
isl: add a no-aux-align usage flag
anv: move ALLOC_HOST_CACHED_COHERENT as define
anv: use address helper to compute address u64 value
intel/aux_map: add BSpec reference
intel/aux_map: add helper to compute offset in aux data
anv: re-introduce BO CCS allocations
intel/dev: fix missing dependency on generated packing heaers
anv: factor out post submit queue debug code
intel/fs: indent lowering code to make it more readable
intel/fs: rerun divergence prior to lowering non-uniform interpolate at sample
anv: fix incorrect flushing on shader query copy
meson: add a new option to enable intel-clc without building RT shaders
intel/compiler: make default NIR compiler options visible
intel-clc: move ISA generation to its own function
intel/clc: add ability to output NIR
intel-clc: print text input
genxml: enable opencl code generation
genxml: generate opencl packing headers
genxml: remove NDEBUG_UNUSED
intel/ds: new tracepoints for generated commands
meson: add option to install intel-clc
ci: build a host version of mesa for cross builds
anv: rewrite internal shaders using OpenCL
intel/shaders: add iris variant of indirect draws generation shader
intel/shaders: enable gfx8 support
iris: make binding table shifting values available outside iris_state.c
iris: make KSP helper available outside iris_state.c
iris: make URB programming available outside iris_state.c
iris: factor out index buffer emission
iris: add an option for not emit draw parameters
iris: enable generated indirect draws
meson: enforce build of intel-clc with anv/iris
anv: remove redundant asserts
anv: don’t allocate aux padded BOs with host pointers
anv: fix buffer marker cache flush issues on MTL
anv: enable query clear/copy using shaders on MTL/ARL
anv: fixup push descriptor shader analysis
anv: factor out descriptor buffer flushing
anv: reenable ANV_ALWAYS_BINDLESS
anv: remove unused definition
anv: fix Wa_16013994831 macros
anv: fix emission of Wa_14015055625
genxml: generate opencl temporary variables with private qualifier
intel/clc: lower temp function/shader variables together
intel/clc: workaround LLVM17 opaque pointers
anv: disable Wa_16013994831
ci/anv: add more testing for optimization paths
intel/ci: bump anv/tgl fraction to 6
intel/nir: only consider ray query variables in lowering
anv: limit depth flush on dynamic render pass suspend
anv: add missing generated file dep
anv: optimize push descriptor updates
anv: add new heap/pool for descriptor buffers
anv: create new helper for small allocations
anv: add a second dynamic state heap for descriptor buffers
anv: move aux-tt to general state pool
anv: allocate slice_hash for descriptor buffer
anv: allocate border colors for descriptor buffers
anv: allocate fsr states for descriptor buffer
anv: implement data write entry points for EXT_descriptor_buffer
anv: compute a sampler hash based on parameters
anv: add embedded sampler parameters in descriptor set layout hash
intel/fs: add plumbing for embedded samplers
nir: add additional flag to resource_intel for embedded samplers
anv: add embedded sampler support
anv: add new helper to update binding table pool offset
anv: add descriptor set layout support for descriptor buffers
anv: add pipeline/shader support for descriptor buffers
anv: handle push descriptor writes with descriptor buffers
anv: implement descriptor buffer binding
anv: disable mutable combined image/sampler in descriptor buffer
anv: expose VK_EXT_descriptor_buffer
anv: fix non matching image/view format attachment resolve
anv: fix helper usage for CmdUpdateBuffer()
anv: remove some wrapping around mmap
anv: add support for VK_EXT_map_memory_placed
anv: delay internal shader upload to when needed
anv: fix companion command buffer initialization
anv: fix incorrect ISL usage in buffer view creation
anv/iris/blorp: use the right MOCS values for each engine
anv: try to keep the pipeline in GPGPU mode when buffer transfer ops
anv: don’t copy the null descriptor from the GPU memory
intel/fs: fixup sampler header message
anv: return unsupported for FSR images on Gfx12.0
intel/fs: remove some unused send helpers
anv: ignore descriptor alignment for inline uniforms
intel/fs: bump max simd size of some messages for xe2
anv: track embedded sampler counts in layouts
anv: allocate pipeline bindings tables dynamically on the heap
anv: avoid partially compiled warning with GPL
blorp: handle a few allocation failure cases
anv: fix invalid border color free
anv: fix block pool allocation failure
anv: fix temporary state pool allocation failures
anv: fix bitfield checks in gfx runtime flushing
anv: fix query clearing with blorp compute operations
blorp: add support for cached dynamic states
anv: reduce blorp dynamic state emissions
anv: optimize emission of dynamic state with blorp
anv: fix protected memory allocations
anv: pull surface state copies for secondary in one loop
anv: disable protected content around surface state copies
anv: disable generated draws in protected command buffers
anv: update protection fault property
anv: fix incorrect blorp dynamic state heap usage
intel/fs: printout a couple of more late compile steps
intel/fs: fixup instruction scheduling last grf write tracking
anv: add missing data flush out of L3 for transform feedback writes
anv: mark descriptors & pipeline dirty after blorp compute
isl: set NullPageCoherencyEnable for depth/stencil sparse surfaces
anv: only check patch_control_points changes in runtime flush
anv: increase maxResourceDescriptorBufferRange on DG2+
anv: reuse vk_common_GetImageSubresourceLayout
anv: move all format props checks to anv_get_image_format_properties()
drirc: rename hasvk only option
vulkan: track compression control flags on vk_image
anv: implement VK_EXT_image_compression_control
anv: disable capture replay with descriptor buffer
anv: remove useless dynamic state allocation for samplers
anv: add capture/replay support for image with descriptor buffers
anv: add capture/replay support for buffer with descriptor buffers
anv: add a new reserved pool for capture/release
anv: enable shader border color capture/replay
anv: enable capture/replay with descriptor buffers
anv: disable dual source blending state if not used in shader
intel/brw: fixup wm_prog_data_barycentric_modes()
anv: fixup alloc failure handling in reserved_array_pool
anv: fix leak of custom border colors
anv: fix ycbcr plane indexing with indirect descriptors
brw: add more condition for reducing sampler simdness
anv: fix push constant subgroup_id location
nir/divergence: add missing load_printf_buffer_address
anv: use weak_ref mode for global pipeline caches
Louis-Francis Ratté-Boulianne (5):
panfrost: factor out method to check whether we can discard resource
panfrost: add copy_resource flag to pan_resource_modifier_convert
panfrost: add can_discard flag to pan_legalize_afbc_format
panfrost: Legalize before updating part of a AFBC-packed texture
panfrost: Add AFBC packing support for RG formats
Luc Ma (1):
gallium/u_blitter: Fix a few uninitialized fb_state
Luca Bacci (1):
meson,windows: Use relative paths in Vulkan ICD manifest files
Lucas Fryzek (19):
egl/wayland/sw: don’t invert y `wl_surface_damage_buffer`
drisw/winsys: Flip y coordinate when creating pipe boxes
drisw: clamp damage region to texture bounds
llvmpipe: explicitly reject (most) yuv formats
gallium: Add dmabuf arg to memory fd allocation API
llvmpipe: Implement dmabuf handling
drisw: reuse kopper image extension vtable if modifiers/dmabuf is supported
llvmpipe: conditionally export PIPE_CAP_DMABUF
lavapipe: support VK_EXTERNAL_MEMORY_HANDLE_TYPE_DMA_BUF_BIT_EXT
lavapipe: EXT_external_memory_dma_buf
llvmpipe: make it possible to import and bind unbacked resources
lavapipe: include drm_fourcc.h
lavapipe: check drm modifier info during image create
lavapipe: EXT_image_drm_format_modifier
venus/ci: Add patch for modifiers test to check import/export bits
lp: Wrap udmabuf usage in HAVE_LIBDRM ifdef
ci/lp: Remove ext buffer YUV tests from fails
llvmpipe: Only return null resource handle when dt is not mapped
llvmpipe: Only use udmabuf if header is found
Lucas Stach (18):
etnaviv: disable 64bpp render/sampler formats
etnaviv: track resource sharing
etnaviv: only add shared resources to implicit flush list
etnaviv: implicitly update shared texture resources
etnaviv: don’t use int filter for depth textures
etnaviv: tex_desc: emit texture comparator
etnaviv: fix fixpoint conversion of negative values
ci/etnaviv: update expectations
etnaviv: fix depth writes without testing
etnaviv: rs: take src dimensions into account when increasing height alignment
etnaviv: use correct blit box sizes when copying resource
etnaviv: fix separate depth/stencil clears
etnaviv: trigger TS derivation after slow clear
etnaviv: split TS and non-TS RS clear commands
etnaviv: ci: update expectation with fixed depth/stencil clears
etnaviv: rs: treat depth-only clear to X8Z24 surfaces as full clear
ci/etnaviv: update expectation after piglit uprev
etnaviv: flip the switch on MSAA support
Luigi Santivetti (1):
pvr: return the OS page size for minMemoryMapAlignment
Lynne (1):
radv/av1: limit profile and bit depth to supported values
M Henning (10):
nvk: Don’t clobber vb0 after repeated blits
nak: Remove assert on nir->info.outputs_written
nvk: Early-out impossible descriptor allocations
nak: Fix ldg/stg/atomg encoding to use globalmem
nak: Set fewer bits in writes_color
nak: Use undef for unset FSOut components
nak: Remove old union_find implementation
nak: Rewrite union_find and use it in repair_ssa
nak: Count GLOBAL_SIZE_OFFSET in bytes, not words
nvk: Don’t use a descriptor cbuf if it’s too large
Manuel Stoeckl (1):
util/disk_cache: try getenv(HOME) before getpwuid->pw_dir
Marcin Ślusarz (1):
intel/compiler/xe2: fix decoding of sampler simd mode
Marek Olšák (244):
gallium/u_vbuf: replace unnecessary dst_index with “i”
gallium: remove unbind_trailing_count from set_vertex_buffers
cso: don’t unbind vertex buffers when enabling/disabling u_vbuf
winsys/amdgpu: merge loops decrementing num_active_ioctls & unreferencing bufs
winsys/amdgpu: cosmetic touchups
winsys/amdgpu: don’t clear buffer list elements after IB submission
winsys/amdgpu: add more fence_reference helpers
winsys/amdgpu: don’t clear fence list elements after IB submission
winsys/amdgpu: remove misplaced duplicated comment
winsys/amdgpu: represent IB_MAX_SUBMIT_DWORDS in bytes
winsys/amdgpu: represent max_ib_size_dw in bytes
winsys/amdgpu: cosmetic touchups around IB sizes
amd: unify NIR options between RADV and radeonsi
ac/nir: don’t write TCS outputs to memory if no_varying is set
ac/nir: rename clipdist_enable_mask -> clip_cull_dist_mask
ac/nir: optimize out multiplications in small line culling
ac/nir: simplify code at the beginning of ac_nir_gs_shader_query
ac,radeonsi: emulate GS primitive pipeline stat on gfx11 because of culling
radeonsi: report more detailed output stats for shader-db
radeonsi: expose shader profiles to other .c files
radeonsi: don’t use staging uploads for buffers & shaders with all VRAM visible
radeonsi: deduplicate gfx10_ngg_get_vertices_per_prim / get_num_vert_per_prim
radeonsi: change GS_STATE_PROVOKING_VTX_INDEX to 1 bit PROVOKING_VTX_FIRST
radeonsi: split si_update_ngg_prim_state_sgpr into 2 functions
radeonsi: pack GS_STATE_ESGS_VERTEX_STRIDE better to save 2 bits
radeonsi: remove no-op additions for viewport0_y_inverted
radeonsi: remove unused preloaded instance_divisor_constbuf
radeonsi: rename *trivial_vs_prolog -> *trivial_vs_inputs
radeonsi/gfx11: clean up MAX_ALLOWED_TILES_IN_WAVE programming
radeonsi/ci: update gfx11 flakes
radeonsi/gfx11: flush DB before Z/S clear to work around dEQP failures
radeonsi: don’t flush CS before and after every blitter invocation
mesa,gallium: move the thread scheduler to src/util
gallium: rename PIPE_.._PIN_THREADS_TO_L3_CACHE -> .._UPDATE_THREAD_SCHEDULING
st/mesa: rename ST_L3_PINNING_DISABLED -> ST_THREAD_SCHEDULER_DISABLED
util: add mesa_pin_threads environment variable that sets a static affinity mask
glthread: apply the thread scheduling policy when the context is created
glthread: apply the thread scheduling policy when a batch executes synchronously
gallium/hud: add “csv” option to print values to stdout as CSV
nir: remove INTERP_MODE_COLOR
nir: relax validation failure for generic TCS outputs with no_varying
nir: remove and replace underused option pack_varying_options
nir: replace lower_io_variables with a GLSL NIR flag
nir: add a lower_mediump_io callback into options
nir: add vertex divergence into nir_divergence_analysis
winsys/amdgpu: fix a race condition when reading ws->num_buffers
winsys/amdgpu: add real buffers of slab entries in the CS thread
winsys/amdgpu: change the signature of amdgpu_add_bo_fences_to_dependencies
winsys/amdgpu: move code out of amdgpu_add_bo_fences_to_dependencies for reuse
winsys/amdgpu: merge 2 loops iterating over slab entries in amdgpu_cs_submit_ib
winsys/amdgpu: merge 2 loops iterating over sparse BOs in amdgpu_cs_submit_ib
winsys/amdgpu: merge 2 loops iterating over real BOs in amdgpu_cs_submit_ib
winsys/amdgpu: skip code checking RADEON_USAGE_SYNCHRONIZED for slabs
winsys/amdgpu: simplify amdgpu_do_add_buffer to remove memset
winsys/amdgpu: don’t ref/unref slab BOs in amdgpu_cs_submit_ib
radeonsi: use num_vertex_buffers instead of ARRAY_SIZE
radeonsi/ci: add gfx11 flakes
gallium: always set vertex elements before setting vertex buffers
gallium/u_blitter: set take_ownership=true for set_vertex_buffers
st/mesa: set take_ownership=true for set_vertex_buffers in st_draw_quad
gallium/util: add take_ownership parameter into util_draw_vertex_buffer
st/mesa: set take_ownership=true for util_draw_vertex_buffer in st_DrawTex
st/mesa: set take_ownership=true for set_vertex_buffers in st_pbo_draw
gallium/hud: set take_ownership=true for set_vertex_buffers
cso: remove CSO_UNBIND_VERTEX_BUFFER0
gallium/u_threaded: remove the count=0 path from tc_call_set_vertex_buffers
gallium/u_threaded: allow drivers to change tc_call_set_vertex_buffers function
gallium: remove take_ownership from set_vertex_buffers, assume it’s true
gallium/noop: don’t leak resources due to take_ownership
radeonsi,aco: remove the VS prolog
gallium/u_threaded: expose helpers for filling set_vertex_buffers externally
st/mesa: rename attribs -> arrays in st_atom_array to indicate non-zero strides
st/mesa: do (inputs_read & enabled_arrays) outside setup_arrays
st/mesa: do (inputs_read & ~enabled_arrays) outside st_setup_current
st/mesa: move a piece of _mesa_draw_array_attrib out of the loop in setup_arrays
st/mesa: cosmetic touchups in st_atom_array.cpp
st/mesa: change the update enum of vertex elements
st/mesa: move st_update_functions into st_context
st/mesa: constify the pipe_draw_info parameter and remove obsolete comments
mesa: inline {Create,Draw}GalliumVertexState callbacks
mesa: inline _mesa_set_vao_immutable
mesa: add gl_vertex_array_object::NonIdentityBufferAttribMapping
util/idalloc: make deleting invalid IDs a no-op
mesa: remove unused _mesa_HashTable code
mesa: clean up unnecessary _mesa_HashTable locked/unlocked wrappers
mesa: re-format main/hash.h, move inlines to the end, some code to main/hash.c
mesa: fold _mesa_HashDeleteAll into _mesa_DeleteHashTable
mesa: remove _mesa_HashTable::InDeleteAll
st/mesa: merge 3 unlikely blocks in _mesa_get_bufferobj_reference
st/mesa: remove !obj checking in _mesa_get_bufferobj_reference when it’s useless
mesa: fix incorrect _mesa_HashInsertLocked parameter in _mesa_EndList
mesa: use util_idalloc_alloc_range for _mesa_HashFindFreeKeyBlock
winsys/amdgpu: convert amdgpu_cs.c to .cpp
winsys/amdgpu: enable unlimited number of parallel queues for VCN
util/idalloc: optimize foreach by tracking the greatest non-zero element
mesa: declare _mesa_HashTable::id_alloc as non-pointer
mesa: declare _mesa_HashTable inside structures instead of as a pointer
mesa: remove isGenName parameter from _mesa_HashInsert
mesa: use util_idalloc_foreach for looping in _mesa_HashTable
mesa: replace _mesa_HashTable::ht with util_sparse_array for faster lookups
d3d12: make DrawTransformFeedback not depend on the vertex buffer offset
mesa: don’t use the slow VAO path except for drivers that want to use it
st/mesa: add VAO fast path C++ template variants for st_update_array callback
st/mesa: optimize st_update_arrays using lots of C++ template variants
glthread: re-enable thread scheduling in st/mesa when glthread is disabled
glthread: use _mesa_glthread_fence_call() instead of duplicating that code
glthread: add no_error variants of glDrawElements*
glthread: add no_error variants of glDrawArrays*
glthread: remove cmd_size from constant-sized calls
glthread: clean up how vertex stride is packed
glthread: pack “size” in Pointer calls as 16 bits
mesa: deduplicate get_index_size_shift code
mesa: deduplicate is_index_type_valid code
glthread: pack the primitive type to 8 bits
glthread: pack the index type to 8 bits
glthread: rewrite glDrawElements call packing
glthread: rewrite glDrawArrays call packing
glapi: fix type names for glthread and handle all types
glthread: sort fixed-sized parameters before returning them
glthread: move global marshal_XML.py functions into class marshal_function
glthread: precompute fixed_params and variable_params lists
gltrhead: merge 3 blocks conditional on marshal_sync in print_async_body
glthread: separate unmarshal function generation into print_unmarshal_func
glthread: separate marshal code generation into print_marshal_async_code
glthread: remove “if True” from print_marshal_async_code
glapi: pass pointer size to python for glthread from meson
glthread: pack glVertexAttribPointer calls better
glthread: fix multi draws with a negative draw count
glthread: pack uploaded user vertex buffers and offsets better
glthread: deduplicate batch finalization code
glthread: don’t check cmd_size for small variable-sized calls
glthread: use marshal_count instead of count for more functions
glthread: rewrite glBindBuffer packing
glthread: add a packed variant of glDrawElements with 16-bit count and indices
glthread: add a packed version of DrawElementsUserBuf
glthread: generate packed versions of gl*Pointer/Offset calls
amd: update addrlib
mesa: deduplicate initialization of gl_pixelstore_attrib
mesa: move struct gl_pixelstore_attrib into glthread.h
glthread: track glPixelStore(GL_UNPACK_*)
glthread: execute small glBitmap asynchronously
glthread: execute small glDrawPixels asynchronously
glthread: invert _mesa_glthread_has_no_{un}pack_buffer by removing the negation
amd/registers: add correct gfx11.x enums for BINNING_MODE
radeonsi: disable binning correctly on gfx11.5
radeonsi/gfx11: fix programming of PA_SC_BINNER_CNTL_1.MAX_ALLOC_COUNT
radeonsi/gfx10.3: add a GPU hang workaround for legacy tess+GS
radeonsi: allocate only one set of tessellation rings per device
radeonsi/gfx11: program the attribute ring right before draws
radeonsi: program tessellation rings right before draws
radeonsi/gfx11: program SAMPLE_MASK_TRACKER_WATERMARK optimally for APUs
ac: use the gfx11 shadowed register tables for gfx11.5
radeonsi/gfx11: add missing DCC_RD_POLICY setting
radeonsi: add radeonsi_cache_rb_gl2 option enabling GL2 caching for CB and DB
nir/divergence_analysis: change function prototypes
nir/divergence_analysis: load_primitive_id is convergent within a primitive
nir/divergence_analysis: load_instance_id is convergent within a primitive
nir/divergence_analysis: handle derefs of system values
nir: print nir_io_semantics::invariant
nir: add nir_block::divergent to indicate a divergent entry condition
ac/llvm: fix SSBO bounds checking by using raw instead of struct opcodes
radeonsi: fix the DMA compute shader
radeonsi: don’t test so many wave limits for AMD_TEST=testdmaperf
nir: add a utility computing post-dominance of SSA uses
nir: add nir_opt_varyings, new pass optimizing and compacting varyings
nir/tests: add tests for nir_opt_varyings
radeonsi: set the lower_mediump_io callback for GLSL
radeonsi: set trivial NIR options for nir_opt_varyings
radeonsi: enable uniform propagation for varyings except VP/Energy
radeonsi: add test failures due to incorrect tests for nir_opt_varyings
st/mesa: get dual slot input info from NIR if IO is lowered
st/mesa: lower sysvals slightly sooner
st/mesa: skip a few NIR passes that don’t work with lowered IO
glsl/linker,st/mesa: enable nir_opt_varyings and lower IO in the linker
amd/ci: update stoney results
r300: port scanout pitch alignment from the DDX to fix DRI3
r300: enable tiling for scanout to fix DRI3 performance
radeonsi/ci: run GLCTS, ESCTS, and dEQP from the glcts directory
radeonsi/ci: update failures
Unbreak Viewperf by reverting “util: use crc32_z instead of crc32 and bump zlib dep to 1.2.9”
gallium: use u_box_3d to initialize pipe_box instead of non-designated initializers
gallium: increase the size of pipe_box y, height fields to allow bigger textures
nir: rename AMD XFB intrinsics to *_gfx11_amd
nir,amd: add nir_intrinsic_load_debug_log_desc_amd and its use
aco: implement aco_is_gpu_supported using switch statement
aco: add a helper printing shader asm by disassembling via LLVM
ac/llvm: remove remnants of gfx10 NGG streamout
radeonsi: implement the shader debug log from ac_nir_store_debug_log_amd
nir/validate: validate interp_mode of load_barycentric_*
nir/lower_io: add nir_io_semantics::interp_explicit_strict
nir/validate: validate more fields of nir_io_semantics
tgsi_to_nir: translate TG4
nir/opt_varyings: don’t generate IO with unsupported bit sizes
nir/opt_varyings: simplify nir_io_semantics::num_slots of directly-indexed slots
nir/opt_varyings: handle load_input_vertex
ac/surface: add radeon_surf::thick_tiling
ac/nir: allow 16-bit results for resinfo
ac/llvm: simplify extracting an element in get_image_coords
ac/llvm: add support for 16-bit coordinates (A16) for image (non-sampler) opcodes
ac/llvm: allow image loads to return less than 4 components, trim DMASK
ac/llvm: remove handling of input and output loads/stores that are lowered
ac/llvm: remove unused fields of ac_shader_abi
ac/llvm: simplify the optimization barrier and apply it to the whole vector
ac: add helper ac_get_ip_type_string to remove duplication
nir: add more build helpers
nir: allow FP16 in nir_format_linear_to_srgb
nir: add nir_intrinsic_optimization_barrier_sgpr_amd
nir: change “user_data_amd” sysval from 4 to 8 components
nir/use_dominance: set the root as post-dominator of unmovable instructions
util: add new format helpers
util: import pipe_box and its helpers
ac/llvm: fix assertions for texture instructions with 16-bit LOD bias
ac/llvm: always trim components of texture instructions, trim DMASK
ac/surface: constify and reindent NIR meta address-from-coord function params
radeonsi/ci: update gfx11 failures
radeonsi/gfx11: don’t prefetch constants in binaries into the instruction cache
radeonsi/gfx11: enable DCC fast clears for 8-bit and 16-bit formats
radeonsi: use the same nir_lower_subgroups_options as RADV
radeonsi: add the radeonsi_optimize_io option into the shader cache key
radeonsi: check has_stable_pstate in the winsys
radeonsi: move TCS epilog key bits to the key->ge.opt section
radeonsi: fix initialization of occlusion query buffers for disabled RBs
radeonsi: don’t expose samples_identical and don’t lower FMASK if it’s disabled
radeonsi: allow input NIR to use descriptors in image opcodes
radeonsi: move blitter resource_copy_region implementation to si_gfx_copy_image
radeonsi: move blitter clear_render_target impl into si_gfx_clear_render_target
radeonsi: preserve NaNs in draw-based resource_copy_region
radeonsi: use simpler UINT fallback formats for draw-based resource_copy_region
radeonsi: remove si_use_compute_copy_for_float_formats
radeonsi: change allow_flat_shading to make it a single condition
radeonsi: don’t call resource_copy_region in pipe->blit
radeonsi/gfx11: implement DCC clear to “single” for fast non-0/1 clears
radeonsi: disable VRS flat shading for selected 8xMSAA and thick tiling cases
radeonsi: don’t use si_get_flush_flags() for flushing images
radeonsi: don’t flush CB in si_launch_grid_internal_images if not needed
radeonsi: don’t flush CB and DB if there have been no draw calls
radeonsi: enable fast FB clears for conditional rendering
radeonsi: make clear_render_target clear DCC directly instead of via pipe->clear()
radeonsi: don’t add whether NIR is used into the shader key
radeonsi: only expose 8 EQAA samples due to shader limitations
radeonsi: always run nir_opt_16bit_tex_image
radeonsi: use ip_type in debug code instead of hardcoding GFX
radeonsi: implement user_data_amd for 5, 6, and 7 components correctly
util: shift the mask in BITSET_TEST_RANGE_INSIDE_WORD to be relative to b
Mark Collins (32):
tu/kgsl: Fix sync_wait’d FD in kgsl_syncobj_wait
tu/a7xx: Update CCU layout logic for A7XX
tu: Allow GMEM on A7XX when TU_DEBUG=gmem
tu: Set A7XX registers in `tu6_tile_render_begin`
tu: Set `CP_THREAD_CONTROL::CONCURRENT_BIN_DISABLE` in A7XX HW init
tu: Only set PC/VFD PWR_CNTL regs on A6XX
tu: Use `CP_SET_PSEUDO_REG` for A7XX VSC stream regs
tu/autotune: Use `CP_EVENT_WRITE7::ZPASS_DONE` on A7XX
tu: Set `RB_UNKNOWN_88E4` for A7XX event blits
freedreno/devices: Update A7XX tile values
tu: Use full size color CCU in sysmem mode
tu: Update CCU layout selection logic for seperate stencil stores
tu: Allow event blit to resolve depth stencil formats
tu: Fix 2D blit path for GMEM stores on A7XX
tu: Use `Z24_UNORM_S8_UINT_AS_R8G8B8A8` for A7XX GMEM D24S8 blits/clear
tu: Disable LRZ properly on A7XX
tu: Set RB_CCU_CNTL during HW init on A7XX
tu: Fix CP_BLIT sync on A7XX
tu: Clear `VSC_UNKNOWN_0D08` on A7XX
tu: Add blit cache flushing for input attachments
tu: Unconditionally enable GMEM on A7XX
fd/replay: Fix wrbuffer name extraction
fd/replay: Dump wrbuf into cwd rather than exe directory
fd/replay: Clamp dumped wrbuf to buffer size
fd/replay: Clear wrbufs after submitting cmdstreams for DRM
fd/replay: Add wrbuf support for KGSL/DXG
fd/replay: Error when VMA AS allocation fails
fd/replay+rddecompiler: Add option to clear wrbufs at start
fd/rddecompiler: Disable IR3 cache for replay context
fd/decode: Build generate_rd executable rather
fd/replay: Use generate_rd as default CS generator
fd/decode: Fix “OPTSIONS” typo in help messages
Mark Janes (18):
hasvk: add missing linker arguments
util: add parson for handling json files
intel/dev: specify struct intel_device_info type details in python
intel/dev: generate declarations for struct intel_device_info
intel/tools: add intel device meson dependencies
intel/dev: implement json serialization for intel_device_info
intel/dev/tools: add json as an output format for intel_dev_info
intel/tools: load json device info in drm_shim
intel/dev: improve meson invocation for intel_device_info gen
intel/compiler: generate a hash function to use with the shader cache
iris: use device info sha in device renderer string
anv: use intel_device_info to set device UUID
intel/tools: move intel_dev_info to intel/tools
intel/tools: add shader compiler hash key to json devinfo format
pan/va: Add missing valhall_enums dep to bifrost_tests
intel/dev: declare workarounds required by ATSM platforms
intel/dev: remove pci revision from shader cache key
intel/compiler: drop unused ray-tracing fields from cache hash
Martell Malone (3):
nine: r500 under 20 fragments cap is a warning
nine: detect emulation fallback of d3d coordinates
nine: update verbiage for enduser device messages
Martin Krastev (5):
svga/ci: land vmware mesa-ci lava farm
svga/ci: workaround vmware farm’s inability to use public DNS 8.8.8.8
svga/ci: re-enable vmware farm
svga/ci: add two new piglit flakes to svga
svga/ci: disable vmware farm
Martin Roukala (né Peres) (10):
radeonsi/ci: update vangogh’s expectations after piglit uprev
zink/ci: update navi31’s expectations after piglit uprev
zink/ci: update polaris10’s expectations after piglit uprev
radv/ci: switch vkcts-polaris10 from mupuf to KWS’ farm
radv/ci: add a vkcts-tahiti job
radv/ci: add a vkd3d-tahiti job
ci/b2c: rename .b2c-test-{vk,gl} to .b2c-x86_64-test-{vk,gl}
ci/b2c: rename .deqp-test-valve into .b2c-deqp-test
ci/b2c: allow setting the DTB to be used
ci/valve: remove the traces runner
Mary Guillemard (37):
nouveau: nvidia_header: Add AMPERE_A in vk_push_print
nouveau: nvidia_header: Add TURING_COMPUTE_A and AMPERE_COMPUTE_A in vk_push_print
nouveau: nvidia_header: Add AMPERE_COMPUTE_B in vk_push_print
nouveau: nvidia-headers: Add compute array parsing to class_parser.py
nouveau: nvidia-headers: Add nv_push_dump tool
nouveau: mme: Add a dumper
agx: Add more bitops in agx_bitop_table
agx: Remove and/or/xor pseudo ops
agx: Fuse not into and/or/xor
agx: Add a bitop optimizer pass
pan/bi: assert indices when offsets are present in bi_emit_tex_valhall
pan/lib: Remove variables in blitter
pan/bi: Rework indices for tex on Valhall
pan/bi: Rework indices for image on Valhall
pan/bi: Rework indices for attributes on Valhall
pan/bi: Lower ubo table in indices for Valhall
panfrost, pan/lib: Move pan_resource_table to panfrost
nvk: Always copy conditional rendering value before compare
drm-shim: Add io region handling in mmap
panfrost: Add support for Panthor in drm-shim
docs: Document Mali-G610 in drm-shim section
panfrost: group up stubbed params in drm-shim
nouveau: Add support for TERT opcodes in vk_push_print
nouveau: Fix NINC TERT handling in vk_push_print
nak: Support unaligned swizzles in 8/16 bits vec srcs
nak: move folding logic to Src::fold_imm
nak: Add F16 and F16v2 sources
nak: Improve copy propagation pass to handle F16
nak: Add 16-bits float operations
nvk: Advertise shaderFloat16
nvk: Allow various alu op to be vectorized for 2xfp16
nak: Allow SHF to use immediate encoding for shift
panvk: Return os_page_size for minMemoryMapAlignment
panvk: Fix driver UUID not being filled
panvk: Move to vk_properties
panvk: Advertise VK_KHR_driver_properties
nak: Pass has_mod to all form of src2 requiring it
Mary Strodl (2):
rusticl: set OCL_ICD_VENDORS as directory, not file
NirShader: don’t fail on null constant_buffer
Matt Turner (5):
util: Add DETECT_ARCH_HPPA macro
util/tests: Disable half-float NaN test on hppa/old-mips
meson: Limit intel_vk_rt to x86_64
anv/drirc: Add option to control implicit sync on external BOs
intel: Build float64 shader only for Vulkan
Matthew Waters (1):
teximage: allow glCopyTex{Sub}Image[123]D into R/RG textures with OpenGL ES 2.0
Max R (11):
d3d10umd: Fix compilation
winsys/gdi: Handle R8G8B8 formats
winsys/gdi: Custom acquisition of hDC
d3d10umd: Use flush_frontbuffer for Present
virgl: Fix compilation on MSVC
virgl: Fix crash when no VE bound
virgl: Implement PIPE_QUERY_GPU_FINISHED
virgl: Allow importing resources without known templ
virgl: Pass cmd_buf to flush_frontbuffer
d3d10umd, meson: Allow naming d3d10umd DLLs
d3d10umd: Rename d3d10sw target to d3d10umd
Michel Dänzer (2):
egl/wayland: Flush after blitting to linear copy
wsi/wayland: Dispatch event queue in wsi_wl_swapchain_queue_present
Mike Blumenkrantz (313):
vk/cmdbuf: add back deleted maint6 workgraph bits
lavapipe: use pushconstants2 for dgc
lavapipe: fix devenv icd filename
zink: fix separate shader patch variable location adjustment
lavapipe: delete extra descriptor buffer layout validation
zink: use local screen variable in surface creation
zink: hook up maint6
zink: use maint6 for multi-layer compressed surface creation
zink: set more dynamic states when using shader objects
lavapipe: KHR_dynamic_rendering_local_read
zink: always map descriptor buffers as COHERENT
zink: fix descriptor buffer unmaps on screen destroy
lavapipe: RM2024 extension promotions
zink: add a tu flake
zink: prune dmabuf export tracking when adding resource binds
zink: fix sparse bo placement
zink: zero allocate resident_defs array in ntv
zink: move sparse lowering up in file
zink: run sparse lowering after all optimization passes
zink: add back (safe) optimizations after sparse lowering
zink: split out sparse_residency_code_and lowering
mesa: plumb errors through to texture allocation
zink: adjust swizzled deref loads by the variable component offset
nir/lower_io: fix handling for compact arrays with indirect derefs
zink: only add arrays to indirect non-tcs variables
zink: promote a conditional on gfx shader destroy
zink: clamp zink_gfx_lib_cache::stages_present for generated tcs
zink: promote gpl libs freeing during shader destroy out of prog loop
zink: don’t add VK_IMAGE_CREATE_2D_ARRAY_COMPATIBLE_BIT for sparse textures
zink: add a ci skip
ci: bump VVL to snapshot-2024wk06
zink: update vvl expectations
mesa: check driver format support for certain GetInternalformativ queries
zink: always enable glsl_correct_derivatives_after_discard
zink: add a750 baseline
zink: delete maxDescriptorBufferBindings checks
zink: flag the use_img as unsync access with buf2img copies, not the swapchain
zink: pre-check formats for samplecount support
zink: validate sample count on image create
zink: add an assert for dummy fb surface creation
zink: compute bo unique_id on use, not creation
zink: avoid infinite recursion on (very) small BAR systems in bo alloc
egl/x11/swrast: deduplicate ANGLE_sync_control_rate enablement
drisw: hook up EXT_buffer_age
drisw/egl: delete unused buffer age handling
vk/wsi/x11/sw: use swapchain depth for putimage
mesa: add more driver support checks for more format queries
zink: add checks/compat for low-spec descriptor buffer implementations
zink: add a second fence disambiguation case
zink: force host-visible allocations for MAP_COHERENT resources
zink: hook up KHR_dynamic_rendering_local_read
zink: use KHR_dynamic_rendering_local_read
ci: make clang-format job warn on failure instead of killing the pipeline
zink: handle stencil_fallback in zink_clear_depth_stencil
zink: don’t destroy the current batch state on context destroy
zink: only scan active batch states for free states if > 1 exist
zink: fix longstanding issue with active batch state recycling
zink: assert that batch_id is valid in zink_screen_check_last_finished()
zink: move flagging rp_changed in zink_update_fbfetch() to caller
zink: don’t pre-init dummy fbfetch surface when missing nullDescriptor feature
zink: also set null fbfetch surfaces when no fb surface is bound
zink: break out null fbfetch init for descriptor buffer
zink: create/resize dummy surfaces on-demand
zink: start out with 256x256 sized dummy surfaces
zink: don’t pre-init null fbfetch info
zink: clamp in_rp clears to fb size
zink: fix (dynamic rendering) execution of scissored clears during flush
zink: fix swapchain readback conditional
zink: lock buffer age when chundering swapchain for readback
zink: flag acquired swapchain image as readback target on acquire, not present
zink: make kopper_swapchain_image::acquired the resource that acquired it
zink: add a swapchain readback case for reading differently-acquired image
zink: make readback attempts count towards ZINK_READBACK_THRESHOLD
zink: update swapchain readback cache on create
zink: set and manage a flag indicating that swapchain readback needs updating
zink: only update swapchain readback cache on create if necessary
zink: only update swapchain readback cache when necessary
zink: use new flag to determine whether swapchain readback cache is usable
zink: update nv blob baseline
zink: add nvk baseline
ci: disable clang-format job
zink: apply all storage memory masks to control barriers if no modes are specified
zink: emit SpvCapabilityImageMSArray for ms arrayed storage images
zink: null out bo usage when allocating from slab
zink: fix unsynchronized read-mapping of device-local buffers
zink: delete unused buffer map conditional
zink: force max buffer alignment on return ptrs for mapped staging buffers
gallium: add a nboxes param to flush_frontbuffer
winsys/sw: propagate nboxes to displaytarget_display()
drisw: plumb through a swapBuffersWithDamage interface
egl/wayland/sw: move swrast_update_buffers() directly into swapbuffers
egl/wayland/sw: move dri2_wl_swrast_commit_backbuffer() directly into swapbuffers
egl/wayland: unify back/current swapping between zink and swrast
egl/wayland/sw: split out surface attach from dri2_wl_swrast_commit_backbuffer()
egl/wayland/sw: call dri2_wl_swrast_attach_backbuffer() before swap
egl/wayland/sw: trigger damage from put_image2
egl/wayland/sw: move partial->full copy promotion to swapbuffers
egl/wayland/sw: fix no-op updating of current backbuffer
egl/wayland/sw: pass damage region through from put_image2 to wl_surface_damage
egl/wayland/sw: clamp putimage geometry to surface size
drisw/xlib: loop over all the boxes in display() hook
drisw/winsys: loop over all the boxes in display()
drisw: pass all frontend swapbuffer damage rects through
egl/kopper: plumb through SwapBuffersWithDamage
egl/kopper: advertise EXT_swap_buffers_with_damage only in non-sw mode
egl/wayland: split out kopper vtable
egl/wayland: add a separate hook for kopper buffer age
egl/wayland: split out kopper swapbuffers functions
egl/kopper: call swrast buffer age query for kopper+swrast
kopper: set drawable buffer age
egl/wayland/kopper: actually call kopper swapbuffer functions
egl/wayland: split out kopper update_buffers
egl/wayland: delete swrast references to zink
zink: fix stencil-only blitting with stencil fallback
zink: make zink_kopper_present_info public
zink: use a slab allocator for zink_kopper_present_info
zink: hook up VK_KHR_incremental_present
zink: use VK_KHR_incremental_present to propagate damage rects
zink: hook up KHR_partial_update
vulkan/dispatch_table: add an uncompacted version of the table
zink: use uncompacted vk_dispatch_table
egl/dri2: use the right egl platform enum
glx: only print zink failure-to-load messages if explicitly requested
zink: stop enabling EXT_conservative_rasterization
lavapipe bump descriptor buffer address space limits
zink: fix PIPE_CAP_MAX_SHADER_PATCH_VARYINGS
zink: call CmdSetRasterizationStreamEXT when using shader objects
nvk: bump NVK_PUSH_MAX_SYNCS to 256
zink: update nvk baseline
util/blitter: iterate samples in stencil_fallback
mesa: fix CopyTexImage format compatibility checks for ES
zink: update nvk baseline with nvk changes
driconf: add radv_zero_vram for Crystal Project (1637730)
zink: update nv baseline
zink: track whether shaders use load_barycentric_at_sample
zink: apply zink_shader::uses_sample to fs variant updating
zink: destroy batch states after copy context
zink: set VkExternalMemoryBufferCreateInfo for opaque fds too
zink: simplify vb masking on bind
mesa: force rendertarget usage on required-renderable formats
zink: try getting sparse page size again without storage bit on fail
u/inlines: constify util_res_sample_count()
zink: only add STORAGE bit for sparse images based on multisample usage
zink: nvk baseline updates
zink: set the sparse format usage flags directly based on queried props
zink: delete faked_e5sparse
zink: rename optimal_key in update_gfx_program_optimal()
zink: use the sanitized key in update_gfx_program_optimal()
zink: always sync and replace separable progs even with ZINK_DEBUG=noopt
zink: add even more strict checks for separate shader usage
zink: be even stricter with shader object usage about blocking invalid usage
zink: remove stale comments for DRLR usage
zink: add a pass to strip out multisample storage image ops
zink: don’t deref swapchain image array with UINT32_MAX
zink: handle image_deref_samples when stripping MS image instrs
zink: iterate all the modes when doing separate shader fixups
mesa/st: add ‘base_serialized_nir’
mesa/st: add is_draw_shader param to st_finalize_nir
mesa/st: when creating draw shader variants, use the base nir and skip driver opts
mesa/st: use sanitized shader keys for feedback draws
zink: do io fixup on patch variables too
zink: defer present barrier to flush if a clear is pending
zink: clamp present region size
zink: clamp swapchain renderarea instead of asserting
zink: set dynamic rendering color attachment layouts
radv: inline radv_device_fault_detection_enabled
ci: bump VVL to v1.3.281
nir/divergence: add zink intrinsics
nir/opt_varyings: update alu type when rewriting src/dest for moved ops
zink: only check that CUBE_COMPATIBLE for images doesn’t subtract flags
zink: don’t use set_foreach_remove with dmabuf_exports
zink: make descriptor pool creation more robust
zink: fix shaderdb pipeline compile
zink: delete some ntv dead code
zink: always sort io variables by location after re-creating them
zink: use outputs_written mask to detect edge flag usage
zink: update xfb info after lower_to_scalar
zink: run scan_nir before variable rework
zink: apply component offset for CLIP/CULL DIST1 location derefs
zink: manually calc clip/cull distance sizes
zink: add a helper to detect clip/cull dist locations
zink: always use shader sizes for clip/cull dist variables
zink: fix generated variable expansion
zink: check for arrayness rather than tess io vars for indirect array vars
zink: track a mask of arrayed io locations on shaders
zink: call gather_info during shader creation
zink: always check patch io during rework_io_vars
zink: don’t clobber indirect array reads with missing components
zink: fix io slot calculation for vertex inputs in add_derefs
zink: fix add_derefs case for compact arrays
zink: only use location_frac for deref array indexing for compact variables
llvmpipe: fix DRAW_USE_LLVM=0
nir/lower_wpos_ytransform: move new value load to start of function, reuse
nir/lower_wpos_ytransform: reuse input zw components for fragcoord rewrite
nir/lower_wpos_ytransform: update comment to reflect variable usage
nir/lower_wpos_ytransform: scalarize emit_wpos_adjustment
nir/lower_wpos_ytransform: fix for lowered io
glsl: handle xfb resources for spirv before running varying opts
mesa: clamp binary pointer in ShaderBinary if length==0
gallium: rework PIPE_CAP_POINT_SIZE_FIXED
zink: delete some maintenance5 psiz pruning
zink: fix add_derefs for partial interp loads of derefs
zink: assert that ntv interp handling isn’t doing implicit component expansion
egl/x11: disable swapbufferswithdamage for zink without kopper
glx/egl: fix LIBGL_KOPPER_DISABLE
glsl: set PSIZ bit in outputs_written when injecting a 1.0 psiz write
nir/lower_clamp_color_outputs: fix use with lowered io
nir/lower_flatshade: break out location checking
nir/lower_flatshade: fix with lowered io
nir/lower_alpha_test: fix use with lowered io
nir/lower_two_sided_color: rework for lowered io
nir/lower_drawpixels: fix for lowered io
nir/lower_clip_disable: fix for lowered io
nir/lower_point_size_mov: rework.
nir/lower_point_size_mov: fix for lowered io
nir/texcoord_replace: fix scalarized io handling
nir/dominance: fix comment
drisw: reorder image extensions
sw_winsys: add displaytarget_create_mapped
winsys/null: implement displaytarget_create_mapped
winsys/drisw: implement displaytarget_create_mapped
winsys/drisw: implement dmabuf handling
lavapipe: add a function for asserting external memory handle types
winsys: add WINSYS_HANDLE_TYPE_UNBACKED
winsys: add more stride members to winsys_handle
lavapipe: EXT_queue_family_foreign
lavapipe: rework mem handle type assert to handle dmabuf
lavapipe: handle drm image format queries
lavapipe: handle drm image imports
docs: update lavapipe features
nir: add compact_arrays to nir_shader_compiler_options
nir/gather_info: fix gathering for compact arrayed builtins
zink: set compact_arrays in compiler options
microsoft/compiler: set compact_arrays in compiler options
lavapipe: don’t clamp index buffer size for null index buffer draws
v3d: set use_clipdist_array=true for lower_clip?
nir/lower_clip: surgerize for lowered io
nir/lower_clip: handle scalarized io
zink: block LA formats with srgb
llvmpipe: clamp 32bit query results to low 32 bits rather than MIN
lavapipe: clamp 32bit query results to low 32 bits rather than MIN
agx: set compact_arrays in compiler options
v3d: set compact_arrays in compiler options
intel: set compact_arrays in compiler options
freedreno: set compact_arrays in compiler options
glsl: stop using PIPE_CAP_NIR_COMPACT_ARRAYS and check compact_arrays
ttn: stop using PIPE_CAP_NIR_COMPACT_ARRAYS and check compact_arrays
glsl: move an assert from st_context over to avoid using PIPE_CAP_NIR_COMPACT_ARRAYS
mesa: delete LowerCombinedClipCullDistance from consts
st/program: stop using PIPE_CAP_NIR_COMPACT_ARRAYS and use compact_arrays
nine: stop checking PIPE_CAP_NIR_COMPACT_ARRAYS and use compact_arrays
gallium: delete PIPE_CAP_NIR_COMPACT_ARRAYS
zink: set indirect io compiler flags
zink: set lower_to_scalar
zink: rework rework_io_vars
zink: set nir_io_glsl_lower_derefs in compiler options
zink: add a pass to fix vertex input locations
zink: enable opt_varyings with ZINK_DEBUG=ioopt
zink: ci updates
nir/remove_unused_io_vars: check all components to determine variable liveness
ci: kill piano trace globally
nir: print i/o variables in location order
lavapipe: disable stencil test if no stencil attachment
egl: fix defines for zink’s dri3 check
egl/android: fix zink loading
egl: use os_get_option for MESA_LOADER_DRIVER_OVERRIDE
zink: disable buffer reordering correctly on shader image binds
nir/print: stop trying to match i/o vars using base/driver_location
zink: add ZINK_DEBUG=nopc to completely disable precompilation
zink: destroy shaderdb pipelines
zink: add VK_PIPELINE_CREATE_CAPTURE_STATISTICS_BIT_KHR for shaderdb
brw/lower_a2c: fix for scalarized fs outputs
zink: copy shader name when copying shader info
zink: run nir_lower_io_to_scalar (mostly) unconditionally and earlier
zink: vectorize io loads/stores when possible
zink: ci updates
zink: prune some piglit cts fails
loader: delete unused param from pipe_loader_vk_probe_dri()
glx: fix some indentation
glx: add an ‘implicit’ param to createScreen
glx: pass implicit load param through allocation
dri: plumb a ‘implicit’ param through createNewScreen interfaces
gbm: plumb an ‘implicit’ param through device creation
frontends/dri: plumb an ‘implicit’ param through screen init
pipe-loader: plumb a flag for implicit driver load through screen creation
zink: don’t print error messages when failing an implicit driver load
glx: silence more implicit-load zink errors
mesa/st: don’t use serialized_nir for cached shaders
zink: make NOREORDER mode context-based
zink: disable command reordering for compute-only contexts
nir: store variable names to io instrs during io lowering
nir/lower_io_to_scalar: preserve variable names when splitting io
nir/clone: preserve intrinsic name field across clones
nir/print: print io instr->name if available
zink: preserve/merge variable names when generating new variables
glthread: check for invalid primitive modes in DrawElementsBaseVertex
zink: reconstruct features pnext after determining extension support
zink: prune zink_shader::programs under lock
zink: fully wait on all program fences during ctx destroy
kopper: fix bufferage/swapinterval handling for non-window swapchains
zink: slightly better swapinterval failure handling
kopper: don’t set drawable buffer age
zink: clean up accidental debug print
egl/x11: disable dri3 with LIBGL_KOPPER_DRI2=1 as expected
zink: add a batch ref for committed sparse resources
u_blitter: stop leaking saved blitter states on no-op blits
freedreno/replay: use inttypes format string for 64bit
frontends/dri: only release pipe when screen init fails
frontends/dri: always init opencl_func_mutex in InitScreen hooks
zink: clean up semaphore arrays on batch state destroy
egl/dri2: fix error returns on dri2_initialize_x11_dri3 fail
nir/lower_aaline: fix for scalarized outputs
nir/linking: fix nir_assign_io_var_locations for scalarized dual blend
Mike Hsieh (1):
amd/vpelib: Add param check for geometric scaling and refactor
Mohamed Ahmed (14):
nil: change image_level_size() to take tiling in account
nil: Add helper function to get tile size in pixels
nil: Add helpers for conversion from pixel values to tiles
nil: Expose tiling_extent_B()
nil: Add support for sparse resident images
nvk: add sparse queries
nvk: enable sparse residency features
nak: wire up shader resource residency intrinsics
nak: wire up sparse image loads
nvk: advertise shader resource residency
nil: Add a nil_image::compressed bit
nil: Add some helpers for DRM format modifiers
nil: Support creating images with DRM modifiers
nvk: enable rendering to DRM_FORMAT_MOD_LINEAR images
Mykhailo Skorokhodov (2):
egl/wayland: Fix sRGB format look up for config
ci/lima: expect fail of window_8888_colorspace_srgb on wayland
Nanley Chery (13):
iris: Don’t memset the extra_aux memory range
iris: Don’t memset CCS on integrated gfx12
iris: Enable pass-through state init for gfx12 CCS
isl: Pick a better initial state for zeroed MCS
iris: Copy main ISL surf when reallocating in place
iris: Report the correct modifier for Tile4 images
iris: Use resource_get_param in resource_get_handle
intel/isl: Remove inconsistency when choosing Tile64
intel/isl: Remove inconsistency when encoding Tile64
intel/isl: Remove a CCS_D check from gfx12+ code
intel/isl: Enable a 64KB alignment WA for flat-CCS
intel/isl: Use Tile64 to align images for CCS WA
intel/isl: Disable miptails to align LODs for CCS WA
Neil Armstrong (1):
freedreno: Add a750 clock gating control related registers
Nikita Popov (1):
Pass no-verify-fixpoint option to instcombine in LLVM 18
Oskar Viljasaar (3):
vulkan/properties: Start looping from the next member in GPDP2
tu: Use common physical device properties infrastructure
compiler/types: Fix glsl_dvec*_type() helpers
Patrick Lerda (16):
glsl/nir: fix gl_nir_cross_validate_outputs_to_inputs() memory leak
r300: fix vertex_buffer related refcnt imbalance
r300: fix r300_destroy_context() related memory leaks
r300: fix memory leaks when register allocation fails
r300: fix constants_remap_table memory leak
radeonsi/gfx10: fix main_shader_part_ngg_es memory leak
r300: enable R400 cos and sin hardware vertex shader opcodes
ac/llvm,radeonsi: fix memory leaks triggered by ac_nir_translate() errors
r300: fix NIR passes regression
r300: fix constants_remap_table memory leak related to the dummy shader path
r300: fix r300_draw_elements() behavior
panfrost: remove panfrost_create_shader_state() related dead code
gallium/auxiliary/vl: fix typo which negatively impacts the src_stride initialization
clover: fix pipe_box update regression
clover: fix memory leak related to optimize
r600: fix vertex state update clover regression
Paul Gofman (3):
glsl: allow out arrays in #110 with allow_glsl_120_subset_in_110
driconf: add a workaround for Joe Danger 2
driconf: add a workaround for Joe Danger
Paulo Zanoni (35):
zink: fix bind size handling in buffer_bo_commit()
anv/sparse: add an extra step before anv_sparse_bind_resource_memory()
anv/sparse: allow binding operations to match the resource size
anv+zink/ci: remove recently fixed tests from the crash list
anv/sparse: don’t issue a single bind operation per vm_bind ioctl
anv/sparse: leave the semaphore waits and signals to the vm_bind ioctl
anv/sparse: don’t use the bind_timeline when doing sparse binding
anv: change the vm_bind-related kmd_backend vfuncs to return VkResult
anv: add an anv_pipe_bits bit to allow invalidating the TLB
anv/trtt: invalidate the TLB after writing TR-TT entries
anv/trtt: update GFX_TRTT_VA_RANGE for LNL
anv: don’t leak device->vma_samplers
anv: set shaderFloat64 to true when fp64_workaround_enabled
driconf/anv: set fp64_workaround_enabled to DIRT 5
anv/xe: don’t leak xe_syncs during trtt submission
anv/xe: don’t overwrite the result from vk_sync_wait()
vulkan: don’t zero-initialize STACK_ARRAY()’s stack array
anv, iris: add missing CS_STALL bit for GPGPU texture invalidation
anv: reduce struct anv_image_memory_range from 32 to 24 bytes
vulkan: reduce struct vk_object_base by 8 bytes
anv/sparse: remove useless isl_surf_get_tile_info() call
anv/sparse: remove unnecessary popcount assertions
anv/sparse: adjust sparse_bind_image_memory debug messages
anv/sparse: remove unused dump_vk_sparse_memory_bind()
anv/sparse: replace device->using_sparse with device->num_sparse_resources
anv/sparse: rework anv_free_sparse_bindings() error handling
anv/xe: extract anv_vm_bind_to_drm_xe_vm_bind()
anv/xe: add a ‘flags’ parameter to the vm_bind() kmd_backend function
anv/xe: slightly improve error handling for the vm_bind ioctl
anv/xe: assert we’re using drm_syncobjs only once
anv/xe: de-duplicate xe_exec_fill_sync()
anv/xe: rename and refactor xe_exec_fill_sync()
anv/sparse: fail the right way in anv_GetDeviceImageSparseMemoryRequirements()
anv: const-correct anv_{image,buffer}_is_sparse()
isl: add ISL_TILING_64_XE2 to isl_tiling_to_name()
Pavel Ondračka (38):
r300: fix reusing of color varying slots for generic ones
r300: skip draw if vertex shader does not write gl_Position
r300/ci: switch to deqp-runner suite
r300/ci: add the KHR gles2 tests
r300/ci: move streaming-texture-leak from fails to skips
r300: fix writemask for nir_intrinsic_load_ubo_vec4
r300: skip backend DCE for vertex shaders
r300: remove R3xx/R4xx backend absulute modifier lowering
r300/ci: add dEQP on RV380
r300: remove backend SLE and SGT support
r300: add r300_is_only_used_as_float helper
r300: optimize out more modifiers produced later
r300: lower comparison ops early in NIR
r300: remove SGE, SNE, SLT, SGE lowering in the backend
r300: remove the remaining of backend constant folding
r300: remove backend support for SUB
r300/ci: update piglit fails
r300: remove compiler tests
r300/ci: add two more observed piglit flakes
r300: fix vs output register indexing
r300: add explicit flrp lowering
ci: install xwayland in x86_64_test-gl
ci: build nine in debian-testing
ci: build nine tests
r300/ci: enable nine tests
r300: explicitly check if sin/cos input is already in correct range
r300: move sin/cos input fixups to finalize_nir
r300: remove some late NIR passes
nir/lower_vec_to_regs: always set cursor before inserting decl_reg
r300: check for the extra restrictions on presubtract swizzles
r300: move presubtract pass later
r300: optimize swizzle for inline constants
r300: inline unoptimized_ra ntr option
r300: get rid of the unused ubo_vec4_max ntr option
r300: remove the ntr lower_cmp option
r300: move lower_fabb option out of the options struct
r300: remove nir_to_rc_options wrapper
r300/ci: failures list update
Peyton Lee (6):
radeonsi/vpe: remove wait source surface fence and while loop
radeonsi/vpe: disable info log
radeonsi/vpe: move flush to si_vpe_processor_end_frame
radeonsi/vpe: support multi-buffer
radeonsi/vpe: pre-allocate stream struceutre
radeonsi/vpe: add support for p010
Philip Rebohle (1):
radv: Remove dead shared variables after optimization loop.
Philipp Zabel (6):
rusticl: work around reference-to-mutable-static warnings
etnaviv: common: Add PIPE_3D feature bit
etnaviv: Avoid duplicate query of ETNA_GPU_FEATURES_0 parameter
etnaviv: hwdb: Add VIP_V7 and NN_XYDP0 feature bits
etnaviv: Add nn_core_version field to etna_specs
etnaviv/nn: Extend post-multiplier for v8 architecture
Pierre-Eric Pelloux-Prayer (21):
radeonsi: compute epitch when modifying surf_pitch
Revert “ci/radeonsi: disable VA-API testing on raven”
radeonsi: emit cache flushes before draw registers
radeonsi: adjust flags for si_compute_shorten_ubyte_buffer
winsys/amdgpu: use syncobj rather than amdgpu fence
ac, radeonsi: remove has_syncobj, has_fence_to_handle
radeonsi: try to disable dcc if compute_blit is the only option
meson: require libelf when radeonsi is built
egl/drm: flush before calling get_back_bo
radv: don’t remove the blit queue from the device queues
winsys/amdgpu: unmap user fence BO before destroy
winsys/amdgpu: remove unused amdgpu_fence_is_syncobj
wsi/wl: flush connection on swapchain failure
mesa: deal with vbo_save_vertex_list::modes being NULL
wsi/wl: check wsi_wl_surface’s validity before use
egl/wayland: use __DRI_IMAGE_PRIME_LINEAR_BUFFER in get_back_bo
winsys/radeon: pass priv instead NULL to radeon_bo_can_reclaim
radeonsi: preserve alpha if needed in kill_ps_outputs_cb
amd: fix addrlib regression
aco: don’t use python 3.7+ feature in aco_opcodes.py
radv: don’t use python 3.9 feature in radv_annotate_layer_gen.py
Qiang Yu (1):
radeonsi: split RADEON_USAGE_NEEDS_IMPLICIT_SYNC into CB and DB flags
Ray Smith (2):
panfrost: Don’t try to set bifrost blendable format on midgard
panfrost: Fix format tables for v4 and v5
Rhys Perry (84):
radv: do nir_shader_gather_info after radv_nir_lower_rt_abi
nir/lower_non_uniform: set non_uniform=false when lowering is not needed
nir/lower_shader_calls: remove CF before nir_opt_if
aco: fix labelling of s_not with constant
aco: add VOPD format
aco: add VOPD statistic
aco: refactor schedule_ilp main loop
aco: implement VOPD scheduler
aco: enable VOPD scheduler
aco: fix >8 byte linear vgpr copies
aco/tests: fix to_hw_instr.swap_linear_vgpr
aco: refactor create_vopd_instruction
aco: swap operands to create VOPD instructions
aco: turn v_mov_b32 into addition to create VOPD instructions
aco: improve printing of VOPD instructions
aco/tests: add tests for VOPD operand swapping
aco/tests: use raw strings in form_hard_clauses.nsa
radv: support minmax filter for more formats
aco/ra: don’t initialize assigned in initializer list
aco/ra: fix GFX9- writelane
aco: don’t combine linear and normal VGPR copies
aco/ra: disable p_start_linear_vgpr allocation hint
aco: allow p_start_linear_vgpr to use multiple operands
aco: require linear vgpr uses to be late kill
aco: only allow linear vgpr kills in top-level blocks
aco/ra: constify various RegisterFile
aco/ra: move parallelcopy creation into helper
aco/ra: change get_reg_bounds() helper
aco/ra: rework linear VGPR allocation
aco/ra: disable live range splitting of linear vgprs
aco/ra: emit linear VGPR parallel copy separately
aco/tests: add tests for linear VGPR register allocation
aco: optimize for purely linear VGPR copies
nir/algebraic: don’t create 64-bit min/max/ior if lowered
nir/algebraic: remove duplicated iand(ien, ine)/ior(ieq, ieq) patterns
nir/algebraic: optimize 64-bit comparisons with zero’d halves to 32-bit
nir/lower_int64: allow 64-bit comparisons when lowering minmax
nir/search: fix nir_replace_instr() debug code
aco: don’t pass constant to is_overwritten_since()
radv: don’t advertise DGC with LLVM
radv: stop using 5/8 component SSBO stores
radv,aco: allow VS prologs to increase VGPR usage
aco: don’t reuse misaligned attribute destination VGPRs in VS prologs
aco/util: add small_vec
radv: use dual_color_blend_by_location with Half-Life Alyx
aco/cssa: reset equal_anc_out if merging fails
aco/cssa: update comments
aco: fix GFX6 buffer_load_dwordx4 opcode number
aco: rename opcode->instruction
aco: refactor VOPC opcode list
aco: use single tuple for all opcode numbers
aco: use op()
aco: move dot/wmma instructions into VOP3P list
aco: unify MIMG opcode lists
aco/gfx11: fix scratch ST mode assembly
aco: split instruction assembly into functions
aco: always emit float mode for merged shaders compiled separately
aco: avoid breaking clauses with waitcnt
nir: add mqsad_4x8, shfr and nir_opt_mqsad
aco: implement mqsad_4x8 and shfr
ac/llvm: implement mqsad_4x8 and shfr
amd: set has_shfr32=true
radv: optimize msad_4x8 to mqsad_4x8
radv: memset radv_pipeline_cache_object data
nir: add nir_remove_after_cf_node helper
aco: remove unreachable merge blocks
aco: ensure loop exits exist in NIR
aco: save/reset/combine has_divergent_continue in uniform branches
nir,aco: add test intrinsics
aco/tests: add isel test helpers
aco/tests: add control flow tests
aco: assume no unreachable blocks
aco: don’t include the clause in VMEM_CLAUSE_MAX_GRAB_DIST
aco: remove occupancy check in dealloc_vgprs()
aco/tests: don’t assume constructor order
aco/tests: remove LLVM 11 code
radv: cache RT stage info
aco: include LDSDIR in latency/etc stats
aco: make store clauses more aggressively
aco: schedule LDSDIR instructions
aco: schedule LDS instructions
aco: split vop3p results
aco/waitcnt: fix DS/VMEM ordered writes when mixed
aco: create lcssa phis for continue_or_break loops when necessary
Rob Clark (31):
freedreno/a6xx: fix comment
freedreno/registers: Pass full args to dump_c()
freedreno/registers: De-duplicate xml_reg_files
freedreno/registers: Don’t re-parse files
freedreno/registers: Generate copyright comment blurb
freedreno/registers: Add basic kernel header support
freedreno/registers: A couple newline changes
tu/drm/virtio: Fix dmabuf import
freedreno/drm: Submit should hold ref to device
freedreno/drm: Fix teardown crash harder
freedreno/decode: Fix prefetch handling for IB1 crash
freedreno: Fix MSAA z/s layout in GMEM
freedreno/crashdec: Find potential fault buffers
tu: Give suballoc bo’s a name
freedreno/a6xx: Add dual_color_blend_by_location
freedreno/a6xx: Fix z/s preserving sysmem clear blit
freedreno/pps: Don’t re-init perfcntrs
freedreno: Add bo usage hints
freedreno/drm: Add perfetto memory tracing
tu: Add perfetto memory tracing
pps: Enable memory traces
pps: Config tweaks to avoid loosing traces
freedreno/registers: Add license header
egl/android: Fix gl_config dereference
freedreno/drm/virtio: Fix deadlock on exit
freedreno+virgl: Add missing driconf
freedreno: Update a618 xfails
ci: Add deqp fix for pipeline_statistics_3 tests
tu: Fix a6xx lineWidthGranularity
egl/android: Fix sRGB visuals
freedreno/ir3: Fix ldg/stg offset
Robert Beckett (1):
vulkan/wsi: fix force_bgra8_unorm_first
Robert Mader (5):
crocus: Support offset query for multi-planar planes
panfrost: Use pipe resource helper
egl: Implement EGL_EXT_config_select_group
egl: Implement EGL_MESA_x11_native_visual_id
egl/x11: Allow all RGB visuals to match 32-bit RGBA EGLConfigs
Robin Kertels (3):
nvk: Enable EXT_nested_command_buffer.
nak: Enable lowering rotate to shuffle.
nvk: Advertise VK_KHR_shader_subgroup_rotate.
Rohan Garg (35):
anv: refactor emit_dynamic_buffer_binding_table_entry
isl,blorp,anv: introduce ISL_TILING_64_XE2 for Xe2+ platforms
anv: untyped data port flush required when a pipeline sets the VK_ACCESS_2_SHADER_STORAGE_READ_BIT
anv: factor out common code for determining surface usage from a VkDescriptorType
anv: cleanup duplicate robustness flag calculations
anv: add a command streamer stall on Xe2+ when switching pipelines
intel/compiler: Xe2+ can do URB load/store with a byte offset
anv: drop duplicated 3DSTATE_SLICE_TABLE_STATE_POINTERS emission
anv, blorp: Set COMPUTE_WALKER Message SIMD field
intel/genxml: update PIPE_CONTROL so that we can decode it on the CCS
iris,anv: WA 1509820217 is no impact for Xe2+
intel/brw: Use the dimensions supplied in the instruction
intel/brw: Cleanup send generation
intel/brw: Update written size depending on the LSC message
intel/brw: Set the right cache control bits for xe2
intel/brw: Adjust src1 length bits for xe2+
anv,blorp: implement restrictions from WA 1406738321
anv: 3D surfaces have fewer layers for higher miplevels
isl: enable CCS for 3D surfaces on gen12.5 and above
intel/brw: account for sources when determining if a operation uses half floats
intel/brw: Xe2+ can do SIMD16 for extended math on HF types
intel/brw: update disassembly for MATH pipe
intel/brw: adjust the copy propgation pass to account for wider GRF’s on Xe2+
intel/brw: minor rework to de duplicate variable assignment
intel/brw: Handle typed surface and atomic messages for xe2+
intel/brw: Lower DWORD scattered read writes to lsc
intel/eu/validate: Allow SIMD16 for mixed mode float operations on xe2+
iris: slow clear higher miplevels on single sampled 8bpp resources that have TILE64
intel/blorp: add fast clear rectangle dimensions for single sampled TILE64 CCS surfaces
isl: allow CCS on single sampled TILE64 surfaces
anv: Enable HiZ on multi-LOD depth buffers.
anv: use u_foreach_bit to iterate over the the view mask like we do for transition_clear_color
anv: formatting fix when printing pipe controls
anv: allocate space for generated indirect draw id’s using the temporary allocation helper
Revert “iris: slow clear higher miplevels on single sampled 8bpp resources that have TILE64”
Roland Scheidegger (2):
auxiliary/draw: fix streamout overflow calculation
auxiliary/rtasm: fix unaligned stores
Romain Naour (1):
glxext: don’t try zink if not enabled in mesa
Ruijing Dong (6):
radeonsi/vcn: data structure av1 enc long term reference.
radeonsi/vcn: vcn4 av1 long term ref support
frontends/va: get av1 encoding ref frame infos for L0.
radeonsi/vcn: add enc surface alignment caps
frontends/va: add surface alignment attribute
radeonsi/vcn: update to use correct padding size.
Ryan Neph (3):
venus: fix shmem leak on vn_ring_destroy
virgl: use PIPE_MAX_SAMPLERS in bind_samplers_states
venus: reclaim signal semaphore feedback resources for wasteful clients
Sagar Ghuge (28):
intel/fs: Track instance id in gs_thread_payload
vulkan/runtime: Track VkSharingMode in vk_image
anv: Disable compression if we have concurrent sharing mode
intel/compiler/xe2: Handle 6-bit message type for Gfx20+
intel/compiler: Add texture operation lowering pass
intel/compiler: Use nir_tex_src_backend1 to pack LOD and array index
nir: Drop intel specific lowering code
intel/compiler: Lower texture operation to combine LOD and AI
intel/dev: Update max_subslices_per_slice comment
intel/compiler: Fix disassembly of URB message descriptor on Xe2+
anv: Drop warnings for engine initialization failure
anv: Set timestampValidBits to 64bits
intel/compiler: Trim vector properly till array index
intel/compiler: Adjust sample_b parameter according to new layout
intel/compiler: Pack LOD/bias and array index on TG4 messages
intel/compiler: Pack texture LOD and offset to a single 32-bit value
intel/compiler: Add helper method to decide if header is required
intel/compiler: Add gather4_i/l/[_c]/b sampler message
intel/compiler: Add texture gather offset LOD/Bias message support
nir: Allow nir_texop_tg4 in implicit derivative
intel/compiler: Enable packing of offset with LOD or Bias
anv: Implement VK_AMD_texture_gather_bias_lod
anv/xe: Consider pat_index while unbinding the bo
anv: Fix typo in DestinationAlphaBlendFactor value
anv: Use appropriate argument format for indirect draw
isl: Update isl_swizzle_supports_rendering comment
isl: Update shader channel select for missing components
intel/compiler: Disassemble mlen/rlen/ex_mlen in units of registers
Saleemkhan Jamadar (1):
radeonsi/vcn: set jpeg reg version for gfx 1151
Samuel Pitoiset (419):
radv: constify stages in radv_rt_fill_group_info()
radv/rt: re-use radv_ray_tracing_stage::sha1 for hashing RT pipelines
radv: correctly return VK_ERROR_OUT_OF_DEVICE_MEMORY when mapping a BO fails
radv/nir: pass radv_shader_stage to some radv_nir_xxx() functions
radv/nir: remove useless struct for nir_shader typedef
radv: remove one unused parameter in radv_fill_shader_info_ngg()
radv: move radv_pipeline_key::mesh_fast_launch_2 to the per-device cache key
radv: add radv_shader_stage_key to radv_shader_stage
radv: use radv_shader_stage_key directly with pre-existing fields
radv: add optimisations_disabled to radv_shader_stage_key
radv: remove unecessary radv_nir_compiler_options::key
radv: remove unused lower_rt_instruction_monolithic_state::key
radv: stop passing the pipeline key when compiling compute/rt shaders
radv: re-organize radv_pipeline_key
radv: add vertex_robustness1 to radv_shader_stage_key
radv: introduce radv_graphics_state_key
zink/ci: skip more arb_shader_image_load_store.* on Polaris10/Navi10
radv: add keep_statistic_info to radv_shader_stage_key
radv: add shader_version to radv_shader_stage_key
radv: pass radv_shader_stage_key to radv_pipeline_stage_init()
radv: make sure to retain shaders key for imported shaders with GPL
radv: cleanup radv_generate_pipeline_key()
radv: add radv_pipeline_get_shader_key()
radv/rt: cleanup radv_parse_rt_stage()
radv: hash radv_shader_stage_key
radv: stop hashing radv_pipeline_key for compute/rt pipelines
radv: remove the pipeline key for compute pipelines
radv: remove the pipeline key for ray tracing pipelines
radv: remove an extra new line in radv_shader.h
radv: pass radv_graphics_state_key to radv_hash_shaders()
radv: remove radv_generate_pipeline_key()
radv: rename radv_pipeline_key to radv_graphics_pipeline_key
radv: delay emitting streamout enable at draw time
aco: silent checking if clrxdisasm is available
radv: fix indirect dispatches on the compute queue on GFX7
radv: fix indirect draws with NULL index buffer on GFX10
radv: remove unused parameter to gather_shader_info_mesh()
radv: add a per-stage key field for mesh shaders with a task shader
vulkan: bump headers/registry to 1.3.276
lavapipe: fix build since vulkan spec update
vulkan: promote VK_EXT_line_rasterization to KHR
vulkan: promote VK_EXT_index_type_uint8 to KHR
radv: add a helper for binding the custom blend mode
radv: add a helper to get the VGT_GS_OUT value
radv: prevent accessing NULL pipelines when emitting VBO with ESO
radv: re-emit the TCS epilog when a new TCS is bound
radv: enable prologs/epilogs in-memory cache for shader objects
radv: add required NV entrypoints for VK_EXT_shader_object
radv: initialize default dynamic state when beginning a new cmdbuf
radv: add radv_shader_stage::next_stage field
radv: add radv_shader_layout::dynamic_offset_count
radv: add support for creating/destroying shader objects
radv: make some pipeline graphics helpers non-static for ESO
radv: add support for binding/emitting shader objects
radv: advertise VK_EXT_shader_object on GFX6-8
radv: advertise VK_KHR_load_store_op_none
radv: promote VK_EXT_line_rasterization to KHR
radv: advertise VK_KHR_line_rasterization
radv: promote VK_EXT_index_type_uint8 to KHR
radv: advertise VK_KHR_index_type_uint8
radv: use device->vk.enabled_features instead of iterating twice
radv: fix segfault when getting device vm fault info
radv/ci: enable RADV_PERFTEST=shader_object for vkcts-polaris10-valve
radv: refactor gfx103_pipeline_emit_vgt_draw_payload_cntl()
radv: refactor gfx103_pipeline_emit_vrs_state()
radv: use the non-emitted graphics pipeline for the needed dynamic states
radv: fix the late scissor emission workaround with ESO on GFX9
radv: set NGG fields in vgt_shader_key for ESO on GFX10+
radv: do not ignore RADV_DYNAMIC_FRAGMENT_SHADING_RATE for ESO on GFX10.3+
radv: emit more default states for ESO on GFX10.3+
radv: export alpha-to-coverage via MRTZ for ESO on GFX11
radv: fix detecting invalid binaries with ESO
radv: fix emitting tess domain origin for merged TES+GS on GFX9
radv: emit required programming for tess on GFX10+ in radv_emit_hw_vs()
radv: rebind mesh/task shaders when restoring meta context
radv: determine next stage for mesh/task with ESO
radv: ignore unneeded dynamic states with mesh shaders and ESO
radv: determine the last VGT api stage with mesh shaders and ESO
radv: bind and emit mesh/task shaders with ESO
radv: prevent crashes when a task shader is compiled unlinked with ESO
radv: init the shader key in radv_shader_stage_init() for ESO
radv: add support for VK_SHADER_CREATE_NO_TASK_SHADER_BIT_EXT
radv: add a helper to know if device fault detection is enabled
radv: refactor dumping GPU hang reports by using chunks
radv: add support for keeping GPU hang reports in memory
radv: export GPU hang reports through VK_EXT_device_fault
radv: enable deviceFaultVendorBinary if RADV_DEBUG=hang is set
radv: remove radv_graphics_state_key::dynamic_patch_control_points
radv: determine the workgroup size for TCS earlier
radv: set the default workgroup size for VS as LS
radv: constify radv_device in radv_emit_shader_pointer()
radv: check active NIR stages before trying to merge shaders on GFX9+
radv: only merge shader info stages if both stages exist on GFX9+
radv: rework shader arguments for separate compilation of VS+TCS on GFX9+
radv: always mark drawid/base_instance used with ESO
radv: force TCS stage for VS as LS compiled separately on GFX9+
radv: always emit PGM_RSRC1_HS when emitting the TCS epilog state
radv: add support for emitting VS+TCS compiled separately on GFX9+
radv: do not allow to enable VK_EXT_shader_object with LLVM
radv: add a workaround for mipmaps and minLOD on GFX6-8
radv/sqtt: fix describing queue submits for RGP
radv: limit maxIndirectCommandsTokenCount to 512
radv: remove one indentation level in radv_fill_shader_info_ngg()
radv: squash GFX10/GFX10.3 NGG restrictions in the same condition
radv: always set GS as NGG if present on GFX11
radv: use next_stage to determine the NGG stage
radv: check for MESA_SHADER_TESS_EVAL in radv_fill_shader_info_ngg()
radv: determine the ES stage earlier when processing binary config
radv: determine the workgroup size for GS non-NGG earlier
radv: set the default workgroup size for VS/TES as ES
radv: change the user SGPR idx of AC_UD_TES_STATE
radv: add a new user SGPR for the ESGS ring item size
radv/nir: lower esgs_vertex_stride for GS compiled separately on GFX9+
radv: rework shader arguments for separate compilation of VS+GS on GFX9+
radv: declare streamout buffers for VS+GS compiled separately on GFX9+
radv: force GS stage for VS as ES compiled separately on GFX9+
radv: add support for emitting VS+GS compiled separately on GFX9+
radv/ci: remove VKD3D_CONFIG=dxr11 for navi21/navi31
radv: remove unused radv_indirect_command_layout::state_offset
radv: only load 3x32-bit elements when emitting draws with mesh shader
docs: fix RADV_DEBUG=nonggc description
radv: add RADV_DEBUG=nongg_gs for GFX10/GFX10.3
radv: add radv_disable_ngg_gs and enable it for Persona 3 Reload
radv: fix RGP barrier reason for RP barriers inserted by the runtime
radv: force GS stage for TES as ES compiled separately on GFX9+
radv: declare streamout buffers for TES+GS compiled separately on GFX9+
radv: declare AC_UD_TES_STATE for separate compilation of GS on GFX9+
radv: bind the vertex input SGPR only for relevant stages
radv: add support for emitting TES+GS compiled separately on GFX9+
radv: allow RADV_PERFTEST=shader_object on GFX9/VEGA10
radv/ci: enable RADV_PERFTEST=shader_object on VEGA10
radv: cleanup radv_shader_combine_cfg_vs_tcs()
radv: fix emitting VS prologs for merged shaders compiled separately on GFX10+
radv: clear RADV_CMD_DIRTY_SHADERS when resetting the shader object state
radv: clear the custom blend mode when resetting gfx pipeline state
radv: fix re-emitting DB_RENDER_CONTROL when resetting gfx pipeline state
radv: make sure to reset the GS copy shader with ESO
radv: fix selecting shader variants with ESO
radv: fix setting the rasterized primitive for ESO
radv: enable GS_FAST_LAUNCH=2 by default for RDNA3 APUs (Phoenix)
radv: only configure {XYZ_DIM,DRAW_INDEX}_REG for mesh shaders if enabled
radv: re-enable GS_FAST_LAUNCH=2 by default on GFX11
radv: stop using conditional rendering internally when preprocessing DGC
radv: disable conditional rendering if enabled when preprocessing DGC
radv: pass the ES stage when emitting geometry shader with ESO
radv: determine the ES stage for merged NGG shaders compiled separately
radv: prefix radv_vgt_shader_key::streamout with ngg
radv: set radv_vgt_shader_key::ngg_streamout for ESO
radv: determine the number of invocations only for VS/TES as NGG
radv: store the number of outputs for VS/TES as NGG
radv: use radv_shader_info for computing NGG LDS layout
radv: fix a compilation warning in radv_bind_graphics_shaders()
spirv: only consider IO variables when adjusting patch locations for TES
radv: move mesh_fast_launch_2 to radv_physical_device
radv: initialize disk cache slightly later when creating a physical device
radv: introduce a per physical device cache key
radv: fix binary shaders compatibility with ESO
radv: fix indirect dispatches on compute queue with conditional rendering on GFX7
radv: remove the union in radv_shader_object
radv: fix a big memleak with VK_EXT_shader_object
radv: free NIR shaders when creating linked shaders with ESO
radv: simplify binding the GS copy shader with ESO
radv: rename radv_emit_shaders() to radv_emit_graphics_shaders()
radv: simplify emitting VGT_ESGS_RING_ITEMSIZE for ESO
radv: re-emit more states when a shader compiled separately is bound
radv: only enable emulated mesh/task shader invocations on GFX10.3
radv: add support for mesh primitives queries on GFX11
radv: add support for task shader invocations queries on GFX11
radv: remove a TODO about adding mesh/task queries on GFX11
radv: store/reset conditional rendering user info in the helpers
radv: add support for conditional rendering on the compute queue with DGC
radv: remove unused parameter in gfx10_get_ngg_query_info()
radv: do not set gs.has_pipeline_stat_query twice for NGG GS
radv: use so.num_outputs to determine if NGG shaders need XFB queries
radv: determine NGG query info before linking shader info
radv: pass gfx10_ngg_info to gfx10_get_ngg_info()
radv: pass radv_shader_info to gfx10_get_ngg_info()
radv: determine NGG culling info before NGG info
radv: compute NGG scratch LDS base in gfx10_get_ngg_info()
radv: compute the total LDS usage in gfx10_get_ngg_info()
radv: disable VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 with minmax filter on GFX6
radv/ci: enable RADV_PERFTEST=shader_object for vkcts-tahiti-valve
radv: clean up MAX_ALLOWED_TILES_IN_WAVE programming
radv: add missing RADV_DEBUG_NO_NGG_GS to the physical device cache key
radv: fix conditional rendering with direct mesh+task draws and multiview
radv: move conditional rendering for compute in radv_cmd_state
radv: get the pipeline layout info from the push constant token with DGC
radv: add a helper to calculate the compute resource limits
radv: add a function to get compute pipeline metadata for DGC
radv: add support for VK_PIPELINE_CREATE_INDIRECT_BINDABLE_BIT_NV
radv: implement vkGetPipelineIndirectXXX() for DGC
radv: implement vkCmdUpdatePipelineIndirectBufferNV()
radv: implement indirect compute pipeline binds with DGC
radv: handle indirect pipeline binds with scratch and DGC
radv: force shader BOs to be local BOS with DGC indirect compute pipelines
radv: enable deviceGeneratedComputePipelines
radv: fix conditional rendering on compute queue on GFX6
radv: add missing conditional rendering for indirect dispatches on GFX6
radv: add a helper to emit PKT3_COND_EXEC
radv: add a new user SGPR for NGG shaders compiled separately with ESO
radv: lower lds_ngg_{gs_out_vertex_base,_scratch_base} with ESO
radv: add support for emitting NGG shaders with ESO
radv: allow RADV_PERFTEST=shader_object on GFX11
radv: enable radv_zero_vram for RAGE2
radv: preserve streamout_buffers user SGPR for VS/TES + GS compiled separately
radv: always use ace_cs for the gang CS variable
radv: refactor emitting the view index for task shaders
radv: allocate a 32-bit value for the MEC fw bug with indirect mesh+task earlier
radv: stop passing radv_cmd_buffer to draw functions with task shaders
radv/ci: remove RT tests from the VANGOGH skip list
radv/ci: remove dEQP-VK.robustness.* from the VANGOGH skip list
radv: disable NGG in more situations with ESO on GFX10/GFX10.3
radv: implement has_vgt_flush_ngg_legacy_bug for ESO
radv: allow RADV_PERFTEST=shader_object on GFX10/GFX10.3
radv,aco: stop duplicating PS/TCS epilog fields
radv: add a helper to emit PS/TCS epilogs
radv/ci: enable RADV_PERFTEST=shader_object for VKCTS jobs on GFX10+
radv/ci: mark dEQP-VK.shader_object.binding.mesh_swap_task as flake on NAVI21
radv: stop using the custom blend mode for PS epilogs
radv: re-emit RB+ state with PS epilogs only when the col format changes
radv: fix emitting default blend state for PS without epilogs and ESO
radv: allow RADV_PERFTEST=shader_object on all GFX9 GPUs
radvi/ci: enable RADV_PERFTEST=shader_object for RENOIR
util/u_debug: fix parsing of “all” again
ci: use Linux kernel 6.6 for RADV
radv/ci: update list of flakes for VKCTS jobs
zink/ci: update list of flakes for RADV jobs
ac/nir: fix exporting NGG streamout outputs with implicit PrimId from VS/TES
radv: fix determining if PrimId is used for merged shaders compiled separately
radv: determine radv_vgt_shader_key::has_ngg_xxx with the last VGT shader
radv: rework generating vgt_shader_key for pipelines
radv: fix wave32 support with ESO
radv: add helpers to bind the GS copy shader and the RT prolog
radv: fix RADV_PERFTEST=dmashaders with ESO
radv: emit VGT_GS_OUT_PRIM_TYPE as part of the dynamic primitive topology
radv: disable binning correctly on GFX11.5
radv: fix programming of PA_SC_BINNER_CNTL_1.MAX_ALLOC_COUNT on GFX11
radv: program SAMPLE_MASK_TRACKER_WATERMARK optimally for GFX11 APUs
radv: add a GPU hang workaround for legacy tess+GS for GFX10.3
radv: fix occlusion queries with MSAA and no attachments
radv: add radv_force_pstate_peak_gfx11_dgpu and enable it for Helldivers 2
zink/ci: enable RADV_PERFTEST=shader_object for polaris10
radv: add a workaround for null IBO on GFX6
zink/ci: update CI lists
radv: always export MRTZ in FS epilogs with ESO on GFX11
radv: trigger a new PS epilog when the framebuffer is dirty with ESO
zink/ci: allow RADV_PERFTEST=shader_object on NAVI31
radv: invalidate L2 metadata for VK_ACCESS_2_MEMORY_READ_BIT
radv: make sure to disable NGG culling with TES when the FS stage is unknown
zink/ci: enable RADV_PERFTEST=shader_object for NAVI10/VANGOGH
radv/rmv: add missing logging when sparse BOs are destroyed
radv/rmv: add missing logging when events are destroyed
radv/rmv: fix logging of per-queue destroyed BOs
radv/rmv: fix logging sparse residency
radv/winsys: move BO size to radeon_winsys_bo
radv/rmv: remove BO size parameter in radv_rmv_log_bo_allocate()
radv: make some create resources helpers static
radv/rmv: remove unnecessary is_internal parameter to some helpers
radv: add radv_bo_{create,destroy}() helpers
radv/rmv: prevent logging BOs allocated in GDS/OA domains
radv/rmv: log allocated/destroyed BOs in radv_buffer_{create,destroy}()
radv: add radv_bo_virtual_bind() helper
radv: fix conditional rendering with mesh+task and multiview (again)
radv: remove useless RADV_DEBUG=nomemorycache
radv: implement alpha-to-one
radv: advertise alphaToOne
radv: advertise extendedDynamicState3AlphaToOneEnable with ACO
docs: add alpha-to-one features for RADV
radv: rename radv_physical_device variables to pdev everywhere
radv/winsys: rename gpu_info to pci_ids in the null winsys
radv: rename radeon_info variables to gpu_info everywhere
radv: rename radv_physical_device::rad_info to info
radv: remove radv_device::physical_device
radv: remove radv_queue::device
radv: remove radv_cmd_buffer::device
radv: remove radv_device::instance
radv: remove radv_physical_device::instance
radv: declare radv_cmd_update_descriptor_xxx() in radv_descriptor_set.h
radv: declare format related functions in radv_formats.h
radv: pass a radv_physical_device to radv_use_llvm_for_stage()
radv: move radv_device_supports_etc() to radv_physical_device.c
radv: move some VK_DEFINE_NONDISP_HANDLE_CASTS to radv_descriptor_set.h
radv: add radv_sampler.h
radv: add radv_event.h
radv: add radv_buffer_view.h
radv: add radv_buffer.h
radv: add radv_video.h
radv: add radv_image.h
radv: add radv_image_view.h
radv: add radv_query.h
radv: add radv_perfcounter.h
radv: add radv_device_generated_commands.h
radv: enable radv_zero_vram for Red Dead Redemption 2
vulkan/debug_utils: add a helper for reporting address binding
radv: implement VK_EXT_device_address_binding_report
radv: advertise VK_EXT_device_address_binding_report
radv: move radv_prim_vertex_count to si_cmd_buffer.c
radv: move radv_userdata_locations to radv_shader_args.h
radv: move radv_shader_{layout,stage} to radv_shader.h
radv: add radv_device_memory.h
radv: add radv_instance.h
radv: add radv_queue.h
radv: add radv_physical_device.h
radv: add radv_rra.h
radv: add radv_device.h
radv: add radv_pipeline_cache.h
radv: add radv_pipeline.h
radv: add radv_pipeline_compute.h
radv: add radv_pipeline_rt.h
radv: add radv_pipeline_graphics.h
radv: add radv_wsi.h
radv: add radv_sqtt.h
radv: add radv_shader_object.h
radv: add radv_spm.h
radv: add radv_cmd_buffer.h
radv: add radv_rmv.h
radv: add radv_cp_reg_shadowing.h
radv: add radv_printf.h
radv: move radv_get_tdr_timeout_for_ip() to radv_query.h
radv: move radv_queue_ring() to radv_queue.c
radv: add radv_nir_to_llvm.h
radv: add radv_android.h
radv: add radv_shader_info.h
radv: move CP DMA related code to radv_cp_dma.c/h
radv: move more cmd buffer related code to radv_cmd_buffer.c
radv: merge radv_write_guardband() with radv_emit_guardband_state()
radv: merge radv_write_scissors() with radv_emit_scissor()
radv: move radv_get_viewport_xform() to radv_pipeline_graphics.c
radv: move radv_create_gfx_config() to radv_device.c
radv: move radv_emit_{compute,graphics}() to radv_queue.c
radv: move code related to sample positions to radv_device.c
radv: rename si_cmd_buffer.c to radv_cs.c
radv: remove unused radv_printflike()
radv: remove pre-declarations needed for WSI entrypoints
radv: remove remaining forward declarations and comments in radv_private.h
radv: replace RADV_FROM_HANDLE by VK_FROM_HANDLE
radv: add missing endif comment for some headers
radv: rename remaining phys_dev occurrences to pdev
radv: replace radv_minify() by u_minify()
radv: replace align_{u32,u64}() by align{64}()
radv: replace align_u32_npot() by ALIGN_NPOT
radv: replace radv_float_to_{u,s}fixed() by util_{un}signed_fixed()
util: add util_is_aligned()
radv: replace radv_is_aligned() by util_is_aligned()
radv: move RADV_SUPPORT_CALIBRATED_TIMESTAMPS to radv_physical_device.c
radv: move RADV_API_VERSION to radv_instance.h
radv: move CLOCK_MONOTONIC_RAW define to radv_physical_device.h
radv: move RADV_USE_WSI_PLATFORM define to radv_wsi.h
radv: remove radv_private.h
radv: make radv_get_vgt_index_size() static
radv: move radv_get_user_sgpr() to radv_shader.c
radv: move radv_queue_family_to_ring() to radv_queue.c
radv: remove old comment in radv_cs.c
radv: move radv_printf_data to radv_printf.h
radv: make sure the heap budget is less than or equal to the heap size
radv: use SPDX-License-Identifier
radv: enable VK_EXT_shader_object by default
aco: use SPDX-License-Identifier
ci: uprev vkd3d-proton to c3b385606a93baed42482d822805e0d9c2f3f603
docs: mark VK_KHR_maintenance6 as DONE for RADV
radv: determine if the cache is disabled at device creation time
radv: add skip_shaders_cache also for compute/rt pipelines
radv: stop using a graphics pipeline for generating the graphics key
radv/rt: constify device in radv_init_rt_stage_hashes()
radv/rt: handle creation feedback like graphics/compute pipelines
radv/rt: stop passing pCreateInfo to radv_ray_tracing_pipeline_cache_search()
radv/rmv: fix missing image bind logging for WSI images
radv: fix missing addr binding report for WSI image binds
radv: fix addr binding report for disjoint image binds
radv/rmv: fix image binds logging for disjoint images
radv: add a helper to set image bindings
radv: fix missing unbind report when an image is destroyed
radv: fix missing unbind report when a buffer is destroyed
radv/rt: remove dead code about intersection shaders in radv_pipeline_get_shader_key()
radv: add a helper for hashing pipelines
radv: rework and add a helper for hashing a compute pipeline
radv: stop ignoring shader stages that don’t need to be imported with GPL
radv: add missing SQTT markers when an indirect indexed draw is used with DGC
radv/rt: use radv_pipeline_hash_shader_stage()
radv/rt: stop computing unused hash for the traversal shader
radv: use canonicalized VA for VM fault reports
radv: simplify importing pipeline layout with GPL
radv: return early when PS is NULL in radv_pipeline_init_blend_state()
radv: simplify checking for PS epilogs in radv_pipeline_init_blend_state()
radv: remove unused parameter in radv_skip_graphics_pipeline_compile()
radv: simplify the check for exporting multiview in the last VGT stage
radv/rt: remove unnecessary pipeline parameter to radv_rt_fill_group_info()
radv/rt: remove unnecessary pipeline parameter to radv_generate_rt_shaders_key()
radv/rt: initialize shader group capture/replay in a separate function
radv/rt: rework handle_from_stages to pass hashes directly
radv/rt: insert shaders to cache right after they are compiled
radv/rt: add radv_rt_pipeline_compile()
radv: clear color attachments without exports before compaction
ci: uprev CTS to vulkan-cts-1.3.8.0
radv/ci: add one more flake since CTS 1.3.8.0 for RENOIR
radv/ci: update lists for TAHITI and Zink/Polaris10
radV/ci: reduce the parallelism of navi21 to 3
radv: fix waiting for occlusion queries on GFX6-8
radv: return per plane requirements for disjoint images
zink/ci: update CI lists since piglit uprev
radv/rt: remove unnecessary param to radv_ray_tracing_pipeline_cache_insert()
radv/rt: move radv_ray_tracing_pipeline::sha1 to radv_pipeline
radv: use radv_pipeline::sha1 for graphics/compute pipelines
radv: rework pipeline cache search helpers
radv: add RADV_DEBUG=psocachestats to report per-pipeline cache hits/misses
vulkan: pass cmdbuf level to vk_command_buffer_ops::create()
radv/amdgpu: do not use IB2 for nested command buffers
radv: track if nested command buffers uses indirect draws
radv: advertise VK_EXT_nested_command_buffer
ac,radeonsi: add helpers to compute the number of tess patches/lds size
radv: rework the number of tess patches computation
ac: allow to use 64K of LDS for tessellation on GFX9+
ci: uprev CTS to 1.3.8.2
radv: fix image format properties with fragment shading rate usage
radv: remove bogus VkShaderCreateInfoEXT::flags being 0 assert for compute
radv: allow 3d views with VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT
radv: mark some formats as unsupported on GFX8/CARRIZO
radv: set image view descriptors as buffer for non-graphics GPU
radv: only set ALPHA_IS_ON_MSB if the image has DCC on GFX6-9
Sathishkumar S (5):
ac/gpu_info: query the number of ip instance
radeonsi/vcn: avoid hard-coding the number of jpeg instance
ac/gpu_info: fix regression in vulkan hw decode
radeonsi/vcn: use num_instances from radeon_info
ac/gpu_info: update multimedia info
Sean Anderson (2):
gallium: lima: Don’t round height when checking alignment
Add Xilinx ZynqMP KMSRO entrypoint
Sebastian Wick (4):
radeonsi: Destroy queues before the aux contexts
util: Add timespec_sub_saturate to avoid negative time for deadlines
loader/wayland: Add fallback wl_display_dispatch_queue_timeout
vulkan/wsi/wayland: Use dispatch_queue_timeout in acquire_next_image
Sergi Blanch Torne (10):
ci: disable Collabora’s farm due to maintance
Revert “ci: disable Collabora’s farm due to maintance”
ci: disable Collabora’s farm due to maintance
Revert “ci: disable Collabora’s farm due to maintance”
ci: Nightly run expectations update
ci: disable Collabora’s farm due to maintance
Revert “ci: disable Collabora’s farm due to maintance”
Uprev Piglit to f7ece74a107a2f99b2f494d978c84f8d51faa703
ci: kernel stored in a different s3 bucket
ci: identify and label S3 buckets
Shih, Jude (4):
amd/vpelib: Need a debug flag to support 2tap downscaling
amd/vpelib: Add VPE prefix on API to avoid naming conflict
amd/vpelib: Solve the downscaling problem for 2 tap
amd/vpelib: Solve link error due to missing static for one function
Sil Vilerino (25):
d3d12: Do not assume multi-subregion support when querying for driver encode support
d3d12: Implement cap for PIPE_VIDEO_CAP_ENC_INTRA_REFRESH
d3d12: Clean up H264 video decode interlaced code path
frontends/va, d3d12: Fix PIPE_VIDEO_SLICE_MODE_MAX_SLICE_SICE -> PIPE_VIDEO_SLICE_MODE_MAX_SLICE_SIZE typo
d3d12: d3d12_video_encoder_negotiate_current_h264_slices_configuration to use correct mode when intra-refresh is on
d3d12: Do not use PIPE_BIND_DISPLAY_TARGET for d3d12_video_buffer
d3d12: AV1 encode - Configure CQP using qp and new qp_inter parameters
d3d12: H264 encode - Update CQP using current frame type as per VA frontend change
d3d12: HEVC encode - Update CQP using current frame type as per VA frontend change
frontend/va: Support media only post proc without compositor using shaders or surfaces
frontend/va: Use get_resources in VaDeriveImage for media only devices without get_surfaces support
d3d12: Add partial media, compute, graphics support with CORE and GENERIC feature levels
d3d12: Refactor graphics functions from context and blit to separate files
d3d12: Add GetDesc wrapper for ID3D12Heap
d3d12: Only check D3D12_FEATURE_DATA_PLACED_RESOURCE_SUPPORT_INFO for D3D_FEATURE_LEVEL_1_0_GENERIC
d3d12: Video Encode - Add driver workaround for rate control reconfiguration
d3d12: Implement PIPE_VIDEO_CAP_ENC_SURFACE_ALIGNMENT
d3d12: Bump directx-headers dependency to v613
d3d12: Support H264 slice L0/L1 active number override
d3d12: Support HEVC slice L0/L1 active number override
d3d12: Fix leak of batch->bos on video-only builds
d3d12: Fix leak dxil_module::serialized_dependency_table
d3d12: Fix util_blitter_destroy destruction ordering
vl_win32_screen_create: Take ownership of winsys injected to created d3d12_screen
d3d12/ci: Add vainfo with appverifier CI check
Simon Ser (2):
egl/wayland: ensure wl_drm is available before use
egl/wayland: explain why implicit modifier downgrade is allowed
Stéphane Cerveau (1):
vulkan/video: hevc: b-frames can be reference or not
Surafel Assefa (1):
radeonsi: Adds return on failure to get plane info
Sviatoslav Peleshko (6):
nir: Use alu source components count in nir_alu_srcs_negative_equal
anv,driconf: Add sampler coordinate precision workaround for AoE 4
driconf: Apply dual color blending workaround to Dying Light
anv: Store host-located copy of NULL surface state for faster memcpy
dri: Flush the context after flush_resource when creating shareable image
anv: Fix descriptor sampler offsets assignment
Tapani Pälli (35):
hasvk: remove cmd_buffer_ray_query_globals function decl
hasvk: remove gfx9 specific code from emit_sample_pattern
hasvk: remove softpin (GFX_VERx10 >= 90) related code
hasvk: remove gfx9 specific cs stall from emit_ps_depth_count
anv: check for wa 16013994831 in emit_so_memcpy_end
iris: expand pre-hiz data cache flush to gfx >= 125
anv: expand pre-hiz data cache flush to gfx >= 125
iris: replace constant cache invalidate with hdc flush
anv: move *bits_for_access_flags to genX_cmd_buffer
anv: use workaround framework for Wa_22018402687
intel/blorp: add a TODO note about stencil buffer resolve
intel: refactor urb configuration, add intel_urb_config
intel/common: provide a helper for urb setup comparison
blorp/crocus: refactor blorp_emit_urb_config
iris: implement Wa_16014912113
anv: implement Wa_16014912113
blorp: implement Wa_16014912113 callback for drivers
anv: flush tile cache independent of format with HIZ-CCS flush
anv: revert cache flushing changes for indirect commands
intel/blorp: disable use of REP16 independent of format
iris: make sure DS and TE are sent in pairs on >= gfx125
iris: make sure aux is disabled for external objects
anv: make sure aux is disabled for memory objects
hasvk: make sure aux is disabled for memory objects
crocus: make sure aux is disabled for memory objects
anv: use workaround framework for Wa_16013000631
anv: setup distribution granularity with Wa_14019166699
iris: refactor function that checks primitive id usage
iris: setup distribution granularity with Wa_14019166699
anv: disable fcv optimization on >= gfx125
intel/blorp: remove unused blorp batch flag
intel/compiler: add assert for Wa_22017182272
anv: add dirty tracking for push constant data
iris: change stream uploader default size to 2MB
anv: skip gfx push constants alloc optimization on gfx9/11
Tatsuyuki Ishi (8):
radv: Recompute max_waves after postprocessing RT config
radv: never set DISABLE_WR_CONFIRM for CP DMA clears and copies
util: Optimize mesa_hex_to_bytes
radv: Add radv_spirv_to_nir_options that summarize early gfx states.
radv: Rename cache_search_nir to cache_lookup_nir_handle.
radv: Re-sort RADV_PERFTEST env vars in docs
radv: Implement NIR caching behind RADV_PERFTEST=nircache.
radv: Remove radv_queue::device again
Teng, Jin Chung (1):
d3d12: HEVC Encode - Query slice config mode based on user slice setting
Thomas H.P. Andersen (6):
nvk: promote VK_EXT_index_type_uint8 to KHR
nvk: promote VK_EXT_line_rasterization to KHR
nvk: promote load_store_op_none to KHR
docs: update features.txt for nvk
nvk: support driconf option force_vk_vendor
driconf: override vendor id for X4 Foundations on NVK
Thong Thai (2):
radeonsi/vcn: remove EFC support for renoir
frontends/va/postproc: do not use efc if image is to be translated
Timothy Arceri (78):
glsl: add nir version of validate_geometry_shader_emissions()
glsl: use nir version of geom stream validation
glsl: remove now unused GLSL IR validate_geometry_shader_emissions()
glsl: don’t tree graft globals
Revert “ci: Enable GALLIUM_DUMP_CPU=true only in the clang job”
glsl: add basic params for AMD_gpu_shader_half_float extension
glsl: add half float support to the parser
glsl: add explicit half float conversion support
glsl: update assert to allow for half float support
glsl: add glsl_type_is_float_16() helper
glsl: add implicit half float conversions
glsl: add ubo packing support for half floats
glsl: skip conversion of half float back to float for GL queries
glsl: add some new helpers for half float builtin functions
glsl: add half float angle and trigonometry functions
glsl: add half float exponential functions
glsl: add f2f16() helper to ir_builder
glsl: add half float support for common functions
glsl: add support for half float packing functions
glsl: add half float geometric functions
glsl: add half float matrix functions
glsl: add half float vector relational functions
glsl: allow half float varyings
glsl: add half float interpolation functions
glsl: add half float derivative functions
glsl: add half float AMD_shader_trinary_minmax functions
compiler/types: Add a contains_32bit helper
gallium: add PIPE_CAP_FP16 for AMD_gpu_shader_half_float
glsl: add missing error check for half float varying
nir: allow gather info to handle nir_deref_type_array_wildcard
glsl: support array wildcards in lower named interface blocks
glsl: split var copies before lowering named interfaces
glsl: fix potential crash in expression flattening
glsl: move some lowering to the compiler
glsl_to_nir: merge function param handling
glsl_to_nir: support conversion of struct/array function params
glsl_to_nir: support conversion of struct/array function returns
glsl_to_nir: support conversion of opaque function params
glsl: don’t inline functions in glsl ir
nir: add some nir_parameter fields
glsl: add missing define to linker_util.h
glsl: add nir version of function recursion detection
glsl: move function inlining out of glsl_to_nir()
glsl: make use of nir recursion detection
glsl: implement nir version of lower discard flow
glsl: make use of nir lower discard flow
glsl: remove now unused glsl ir lower discard pass
glsl: make an explicitly safe version of visit_exec_list()
glsl_to_nir: never convert instructions after jump
glsl: remove unrequired do_lower_jumps() call
glsl: move invariant builtin validation to the nir linker
nir: add max_array_access data field
nir: add implicit_sized_array data field
glsl: add resize_tes_inputs() to the nir linker
nir: add variable field from_ssbo_unsized_array
glsl: don’t remove redefined per vertex block
glsl: add nir implemenation of block validation
glsl: switch to NIR block validation
glsl: call new nir resize_tes_inputs() pass
glsl: remove now unused resize_tes_inputs()
glsl: remove now unused glsl ir block validation
glsl: move some linking calls to gl_nir_link_glsl()
glsl: switch verify_subroutine_associated_funcs() to nir
nir: add subroutine fields to nir_function
glsl: move link_assign_subroutine_types() to the nir linker
glsl: move check_explicit_uniform_locations() to NIR linker
glsl: move mode_string() to helper
glsl: add some data members to nir_variable
glsl: make validate_intrastage_arrays() usable across files
glsl: move cross_validate_uniforms() to the nir linker
glsl: use shader info to store gs verts
glsl: use info from shader when linking
glsl: move validate_{stage}_shader_executable() to the nir linker
glsl: remove now unused do_dead_functions()
glsl: remove FragDepthLayout field
glsl: remove ActiveStreamMask field
glsl: remove UsesEndPrimitive field
glsl: inline _mesa_copy_linked_program_data()
Timur Kristóf (121):
radv: Correctly select SDMA support for PRIME blit.
nir: Fix divergence of reductions.
nir: Fix divergence analysis of load_patch_vertices_in.
nir: Cleanup divergence analysis for mesh shaders.
nir: Clean up divergence analysis for TES patch input loads.
aco: Eliminate SCC copies when possible.
radv: Lower mesh shader draw ID to zero when they have a task shader.
radv: Extract input and output stride info to new functions.
radv: Use mapped driver locations for determining I/O strides.
aco: Allow passing constant operand to is_overwritten_since.
radv/llvm: Remove dead code.
radv: Allow NGG culling with LLVM.
compiler: Add helper for counting tess level components.
ac/nir/tess: Always record tess level info and use it at the end.
ac/nir/tess: Don’t record mapped tess level location.
ac/nir/tess: Split tess factor write into multiple functions.
ac/nir/tess: Emit tess factor output independently of whether it can be passed by registers.
ac/nir/tess: Refactor how the end of HS is emitted.
aco: Use common helper for counting tess level components.
aco: Use tess factors when TCS jumps to epilog.
radv: Declare tess_lvl_in/out args for TCS epilogs.
radv: Always pass tess factors to epilogs in registers.
radv, aco: Delete now dead TCS epilog code.
nir: Add two new AMD specific tess intrinsics.
radeonsi: Implement new intrinsics for monolithic shaders.
radv: Copy TES primitive mode to TCS info.
radv: Implement new tess intrinsics.
radv: Call nir_opt_dead_cf in radv_optimize_nir_algebraic.
ac/nir/tess: Emit tess factor stores based on new intrinsics.
radv: Completely delete TCS epilogs.
radv, aco: Remove the code that jumped to RADV’s TCS epilogs.
ac/llvm, radeonsi: Handle tess_rel_patch_id in common code.
radeonsi: Put HS output count in TCS offchip layout, not patch data offset.
radeonsi: Implement dynamic TCS intrinsics for non-monolithic shaders.
radeonsi: Delete TCS epilogs entirely.
aco: Delete all TCS epilog code.
radeonsi: Add number of VS outputs to TCS output layout.
radeonsi: Remove tess bits from VS state.
radeonsi: Use one more bit for number of patches in TCS offchip layout.
ac/nir/tess: Remove dead code that was meant for epilogs.
radv: Add number of LS and HS outputs to tcs_offchip_layout.
radv: Change input patch size in TCS offchip layout to match RadeonSI.
radv: Change number of patches in TCS offchip layout to match RadeonSI.
radv: Include output patch size in TCS offchip layout.
radv: Reuse TCS offchip layout between TCS and TES.
nir/gather_info: Record per-primitive outputs without variables.
nir: Record per-primitive inputs without variables.
nir/recompute_io_bases: Sort per-primitive PS inputs last.
ac/nir: Introduce ac_nir_calc_io_offset_mapped.
ac/nir/tess: Load tess factors from variable when they are passed in registers.
ac/nir/tess: Clarify when a TCS output is stored in LDS or VRAM.
ac/nir/tess: Return undef when loading an unwritten TCS output.
ac/nir/tess: Map TCS LDS IO locations without gaps.
ac/nir/tess: Calculate reserved LDS outputs based on IO info.
ac/nir/tess: Remove superfluous args for reserved TCS outputs.
ac/nir/tess: Clarify when VS-TCS I/O can use registers.
radv: Only add extra dword to LS-HS stride when there are LS outputs.
radv: Pass key structures to gather intrinsic info.
radv: Extract gather_load_vs_input_info function.
radv: Slightly refactor gather_intrinsic_store_output_info.
radv: Record PS input clip/cull mask instead of number.
radv: Use NIR IO semantics to determine GS output info.
radv: Add helper for determining per-attribute vertex buffer descriptors.
radv: Add helper to determine usage of VS prologs.
radv: Remove unused VS input usage mask.
radv: Use NIR IO semantics to determine VS input info.
radv: Use IO semantic location for shader output info.
aco/optimizer_postRA: Remove a check from SCC no-compare optimization.
radv: Use NIR IO semantics to determine FS input info.
radv: Remove I/O variables after nir_lower_io.
radv: Slightly refactor the determination of max_ps_params.
radv: Increase maximum allowed PS params for enabling NGG culling.
radv: Remove unused gfx_level from gfx10_emit_ge_pc_alloc.
ac/nir/ngg: Don’t create dummy output variable for primitive ID.
ac/nir/ngg: Use IO semantics for determining instance rate inputs.
ac/nir/ngg: Rename confusing driver_location variable in mesh shader lowering.
radv: Use NIR IO semantics for VS input location mapping.
radv: Don’t set driver locations for mesh shaders.
radv: Don’t set driver locations for FS outputs.
radv: Don’t set driver locations for last pre-rasterization stage.
radv: Keep track of TCS outputs that need LDS.
radv: Remove dead code for creating per-patch IO mask.
radv: Add radv_gather_unlinked_io_mask to shader info header.
radv: Always use fixed I/O locations for TCS outputs in VRAM.
radv: Clean up gathering linked I/O info.
nir/print: Print per-primitive and explicit strict IO info.
nir/recompute_io_bases: Fix per-primitive inputs.
nir/gather_info: Clear per-primitive I/O masks at the beginning.
nir/lower_io_to_scalar: Support explicit (and per-vertex) FS inputs.
nir/lower_io_to_scalar: Support per-primitive outputs.
nir/opt_varyings: Allow optimizing primitive ID for MS -> FS.
nir/opt_varyings: Support per-primitive I/O.
nir/opt_varyings: Fix explicit and per-vertex FS inputs.
nir/opt_varyings: Add early return when producer stage is task.
nir/opt_varyings: Only propagate constant MS outputs, not other uniforms.
nir/opt_varyings: Debug print during relocate_slot.
nir/opt_varyings: Fix relocate_slot so it doesn’t mix up 32-bit and 16-bit I/O.
nir/opt_varyings: Add workaround for RADV mesh shader multiview.
ac/nir/ngg: Remove support for loading mesh shader outputs.
ac/nir/ngg: Refactor MS primitive indices for scalarized IO.
ac/nir/ngg: Slightly refactor mesh shader cull flag stores.
ac/nir/ngg: Use just one IO semantics variable in MS output store.
ac/nir/ngg: Refactor update_ms_output_info.
ac/nir/ngg: Refactor MS output store into two functions.
ac/nir/ngg: Split 16-bit MS output stores by components.
ac/nir/ngg: Enable packing 16-bit mesh shader outputs.
radv: Run DCE before deleting I/O variables.
radv: Only consider interpolated inputs as 16-bit float.
radv: Refactor emitting PS input types.
radv: Remove superfluous bool arg from slot_to_ps_input.
radv: Allow using high 16 bits of PS input slots.
radv: Rename per_vertex_shaded_mask to explicit_strict_shaded_mask.
radv: Rename LDS related variables in get_tcs_num_patches.
radv: Calculate VRAM tess patch size independently of LDS size.
ac/nir/tess: Split I/O mapping to two functions.
ac/nir/tess: Use LDS IO mapping when loading tess levels from LDS.
ac/nir/ngg: Implement packed 16-bit VS/TES outputs in non-dedicated slots.
ac/nir/ngg: Implement packed 16-bit GS outputs in non-dedicated slots.
ac/nir/lower_legacy_vs: Implement packed 16-bit VS/TES outputs in non-dedicated slots.
ac/nir/lower_legacy_gs: Implement packed 16-bit GS outputs in non-dedicated slots.
ac/nir/ngg: Fix packing 16-bit MS outputs.
Tomeu Vizoso (20):
ci: disable Igalia farm
gallium/util: Fix pipe_buffer_copy
mesa: Import TensorFlow Lite headers
teflon: Initial commit
etnaviv: Update headers from rnndb
etnaviv: Add a bunch of new params for NPUs
etnaviv: Don’t emit boilerplate for compute only contexts
etnaviv: Use NN cores to accelerate convolutions
etnaviv: Use TP cores to accelerate tensor transformations
teflon: Add table with known supported models to docs
etnaviv: Don’t init the blitter in compute-only contexts
etnaviv/nn: Implement zero run length encoding of weights
teflon: Enable convolutions with number of output channels not divisible by 8
etnaviv/nn: Ensure tile_y is > 0
etnaviv/nn: Fix calculation of remaining out channels
etnaviv/nn: Move unused field to its right place in the struct
etnaviv/nn: Enable image cache
etnaviv/nn: Don’t shortcut ZRL bits calculation
etnaviv/nn: Keep track of the sign bit when decrementing to zero
etnaviv/nn: Make parallel jobs disabled by default
Tranquillity Codes (1):
intel: Skip ioctls for querying device info when hardware is unsupported
Valentine Burley (27):
tu: Promote VK_EXT_index_type_uint8 to KHR
tu: Promote VK_EXT_load_store_op_none to KHR
tu: Promote VK_EXT_line_rasterization to KHR
docs: Update features.txt for anv, nvk and tu
nvk: Enable VK_KHR_shader_subgroup_uniform_control_flow
nvk: Advertise VK_KHR_vertex_attribute_divisor
nvk: Reorder device features
tu: Implement VK_KHR_map_memory2
tu: Advertise VK_KHR_vertex_attribute_divisor
tu: Reorder device features
nvk: Fix missing implementation of creating images from swapchains
nvk: Expose VK_EXT_display_control
nvk: Expose VK_EXT_surface/swapchain_maintenance1
nvk: Expose VK_EXT_swapchain_colorspace
docs/features: Add missing VK_EXT_surface/swapchain_maintenance1 entry
tu/rmv: Remove tu_rmv_DebugMarkerSetObjectNameEXT
nvk: Trivially expose three VK_GOOGLE extensions
tu: Expose VK_KHR_surface_protected_capabilities
tu: Trivially expose three VK_GOOGLE extensions
docs: Update features.txt for tu
docs: Update features.txt and new_features.txt for anv and nvk
nvk: Add support for version 2 of all descriptor binding commands
tu: Move tu_BindImageMemory2() to tu_image.cc
tu: Replace TU_HAS_SURFACE with TU_USE_WSI_PLATFORM
tu: Fix missing implementation of creating images from swapchains
tu: Replace TU_FROM_HANDLE with VK_FROM_HANDLE
drm-shim: Stub syncobj reset ioctl
Vasily Khoruzhick (4):
lima: ppir: always use vec4 for output register
lima: ppir: use dummy program if FS has empty body
lima: gpir: abort compilation if load_uniform instrinsic src isn’t const
lima: update expected CI failures
Vignesh Raman (5):
ci: Add kmod
ci: disable Collabora’s farm due to maintenance
Split debian-build-testing job
ci: Implement support for replaying ANGLE restricted traces
ci: handle missing dri libraries during listing
Vinson Lee (2):
intel/disasm: Remove duplicate variable reg_file
intel/clc: Fix file descriptor leak
Visan, Tiberiu (1):
amd/vpelib: revert SRGB to 709
Vlad Schiller (2):
pvr: Implement VK_EXT_memory_budget
pvr: Implement VK_KHR_index_type_uint8
Yifan Zhang (2):
amd: Add code to enable gfx11.5.1
radv: initialize video decoder for GFX11.5.1
Yiwei Zhang (105):
venus: avoid redundant layout transition for optimal internal layout
venus: populate oom from ring submit alloc failures
vulkan/wsi/wayland: fix returns and avoid leaks for failed swapchain
venus: ensure object id is unique
venus: fix pipeline layout lifetime
venus: drop some redundant comment
venus: fix pipeline derivatives
venus: fix to respect the final pipeline layout
venus: allow tls ring submission to utilize the entire ring shmem
venus: default to enable GPL
venus: force async pipeline create on threads creating descriptor pools
venus: use obj handle instead of id in device memory report
anv: refactor wsi_memory_allocate_info handling
anv: optimize the implicit fencing support of external memory
anv: extend implicit fencing support for case requiring implicit write
vulkan/util: drop redundant code gen from vk_extensions_gen.py
vulkan/runtime: refactor to use DETECT_OS_ANDROID instead of ANDROID
v3dv: refactor to use DETECT_OS_ANDROID instead of ANDROID
venus: refactor to use DETECT_OS_ANDROID instead of ANDROID
hasvk: refactor to use DETECT_OS_ANDROID instead of ANDROID
anv: refactor to use DETECT_OS_ANDROID instead of ANDROID
radv: refactor to use DETECT_OS_ANDROID instead of ANDROID
turnip: refactor to use DETECT_OS_ANDROID instead of ANDROID
egl: refactor to use DETECT_OS_ANDROID instead of ANDROID
gallium: refactor to use DETECT_OS_ANDROID
util: refactor to use DETECT_OS_ANDROID
meson: drop -DANDROID
venus: update tracepoints to align with later optimizations
venus: fix the cmd stride used for qfb recording
venus: rewrite fence feedback interception to minimize batches
venus: refactor to add vn_cached_storage
venus: use vn_cached_storage for vn_queue_submission allocs
venus: misc cleanups for queue submission
venus: simplify feedback types tracking during submission
venus: massive feedback renamings for consistency and clarity
venus: refactor to add vn_queue_submission_setup_batch
venus: simplify to drop the struct vn_feedback_cmds accessor
venus: refactor semaphore feedback
venus: add vn_set_temp_cmd helper to initialize feedback batch cmd
venus: fix to ensure sfb cmds can get recycled
venus: mandate a few venus capsets long required before 1.0
venus: sync protocol for VK_KHR_fragment_shading_rate
venus: add VK_KHR_fragment_shading_rate
vulkan: fix runtime libraries’ dep against generated headers
venus: fix ffb batch prepare for a corner case and avoid a memcpy UB
vulkan: remove unused wsi_common_entrypoints include and dep
vulkan: properly ensure wsi_entrypoints header gen order
vulkan: remove header files from lib source files
vulkan: refactor the runtime header gen order dependency
anv/hasvk: default image_read_without_format to true
venus: qfb to track cmd handle directly
venus: combine query record and reset
venus: massive qfb renamings
venus: minor cmd count related refactors
venus: drop vn_get_temp_cmd_ptr
venus: simplify vn_cmd_reset and apply more code sharing
venus: refactor query record recycle
venus: rewrite qfb vn_feedback helpers
venus: refactor vn_queue_submission_add_query_feedback
venus: add vn_queue_submission_get_resolved_query_records
venus: optimize to further batch query records
venus: roundtrip now belongs to ring
venus: minor naming cleanups
venus: ensure shmem is attached to renderer before use for guest vram
venus: avoid excessive ring notifications
venus: further reduce idle timeout from 5ms to 1ms
venus: add enum vn_relax_reason
venus: avoid constant busy wait for query result waiting
venus: deprecate unused perf env vars
venus: decorate cmd enqueue macro internals with compiler hints
venus: add a more relaxed polling strategy
venus: cleanup 2 TODOs from 1.3 support
venus: remove obsolete TODOs
venus: use STACK_ARRAY to simplify modifier query
venus: use STACK_ARRAY to simplify BindBufferMemory2
venus: use STACK_ARRAY to simplify BindImageMemory2
venus: use STACK_ARRAY to simplify render pass creation
venus: use STACK_ARRAY to simplify physical device enumeration
venus: use STACK_ARRAY to simplify set layout creation
venus: use STACK_ARRAY to simplify sync wait
venus: rely on enum vn_descriptor_type for internal trackings
venus: move async_set_allocation check outside helpers
venus: set alloc to skip earlier for reserved and invalid bindings
venus: optimize mutable state restore
venus: misc set alloc and cleanup refactors
venus: drop vn_should_sanitize_descriptor_set_writes
venus: refactor descriptor set update and push
venus: use STACK_ARRAY to simplify descriptor set update and push
venus: use more relaxed profile for TLS ring seqno wait
venus: avoid the redundant template entry
venus: fix to drop an extra ;
venus: simplify push descriptor update with template
venus: optimize set update template data population
venus: simplify need and ignore rules for desc image info
venus: use STACK_ARRAY to simplify set template update and push
venus: clean up legacy descriptor update template bits
venus: fix swapchain image memory bind
venus: fix VkDeviceGroupSubmitInfo::deviceMask for feedback cmds
venus: avoid client allocators for ring internals
venus: fix to destroy all pipeline handles on early error paths
turnip: msm: clean up iova on error path
turnip: msm: fix racy gem close for re-imported dma-buf
turnip: virtio: fix error path in virtio_bo_init
turnip: virtio: fix iova leak upon found already imported dmabuf
turnip: virtio: fix racy gem close for re-imported dma-buf
Yogesh Mohan Marimuthu (6):
winsys/amdgpu: sws instead of ws for amdgpu_screen_winsys
winsys/amdgpu: rws instead of ws for radeon_winsys
winsys/amdgpu: aws instead of ws for amdgpu_winsys
winsys/amdgpu: use _destroy_locked() for failure to create winsys
winsys/amdgpu: remove tab space
winsys/amdgpu: add more comments for winsys create in header file
Yonggang Luo (37):
util: Add function util_is_power_of_two_nonzero_uintptr and macro IS_POT_NONZERO
asahi,panfrost: Use IS_POT_NONZERO to replace util_is_power_of_two_nonzero for different size
treewide: Use util_is_power_of_two_nonzero{64|_uintptr} when needed
svga: Cleanup duplicate ALIGN macro defines
nouveau: Use align64 instead of ALIGN over input layer_size_B
treewide: Use align64 instead of ALIGN for 64 bit value parameter
util: Update ALIGN prototype to match align
compiler/spirv: The spirv shader is binary, should write in binary mode
compiler/spirv: There is not need unqualify const in function vtn_string_literal
compiler/spirv: vtn_add_printf_string support for handling OpBitcast
zink: Update zink-anv-tgl flakes
treewide: Remove vulkan/runtime vulkan/util prefix in include path
freedreno/vulkan: Use vk_dynamic_graphics_state_init instead of direct assignment
vulkan/runtime: Mark vk_default_dynamic_graphics_state to be private
Revert “meson/vulkan/util: allow venus to drop compiler deps”
vulkan: allow building venus without libcompiler
glx: Remove DEBUG code in xfont.c
panfrost/shared: avoid use gallium helper in pan_minmax_cache.*
panfrost/meson: remove redundant gallium include from meson files
treewide: Replace the invalid usage #if DEBUG with #ifdef DEBUG
util: Cleanup strtod.(h|c) by introduce _mesa_get_locale
meson: Extract with_mesa_debug and with_mesa_ndebug for latter usage
meson: Define MESA_DEBUG for latter usage
treewide: Replace usage of macro DEBUG with MESA_DEBUG when possible
meson: Remove the non-used -DDEBUG manually
intel/meson: Remove redundant inc_gallium
radv: Remove redundant inc_gallium
radv: Rename src/amd/vulkan/vk_format.h to src/amd/vulkan/radv_formats.h
vulkan: Move vk_format_is_alpha and vk_format_is_alpha_on_msb into vk_format.h from pvr
pvr: inline and remove vk_format_get_channel_width
pvr: Merge imagination/vulkan/vk_format.h into imagination/vulkan/pvr_formats.h
pvr: Add pvr_ prefix for vk_format_* functions in pvr_formats.h
util: Fixes futex_wait on win32
util: futex_wait use TIME_MONOTONIC on win32 for consistence with other platform
util: Turn futex_wake parameter to int32_t for consistence across platforms
broadcom/common: Now “util/box.h” is under src, so remove the FIXME
nouveau: Fixes error: unused import: `crate::nvh_classes_cl906f::*`
Yusuf Khan (4):
nvk: remove some dead code files
nvk: fix valve segfault from setting a descriptor set from NULL
crocus: fix potential null pointer dereference if transfer_mapping fails
nouveau: Fix crash when destination or source screen fences are null
Zack Rusin (1):
svga: Fix instanced draw detection
Zan Dobersek (11):
freedreno: add fd_rd_output facilities for gzip-compressed RD dumps
tu/msm: fix RD_CHIP_ID size used when dumping RD
tu: tu_device should clean up its global bo
vulkan/rmv: enable logging miscellaneous internal resources
tu: add RMV support
freedreno/fdl: avoid overflow in layout size computations
tu: fix memory leaks in tu_shader
fd: enable prefixing the RD output filename
tu/autotune: use SAMPLE_COUNT_END_OFFSET when writing the ending sample count
tu: RB_SAMPLE_COUNT_ADDR is also used on a7xx
tu/query: improve CP_EVENT_WRITE7::ZPASS_DONE usage
antonino (1):
zink: plug leak in `zink_create_quads_emulation_gs`
chyyran (1):
util/format/fxt1: include “u_format_pack.h” instead of “util/format/u_format_pack.h”
daoxiang.gong (1):
zink - Fix for minLod and maxLod when mipmap filter is disabled
duncan.hopkins (19):
compiler/clc: fix compiler issue on MacOS with st_mtim[e] in stat.
egl: MacOS platform guard around pthread_condattr_setclock()
egl: Added DRI3 code guards.
egl: Changed EGLNativeDisplayType size check to make sure it is big enough instead of exactly the same size.
gallium/dri: Switch xf86drm.h for util/libdrm.h to allow for the no-op shim to be used.
gallium/dri: Added XCB dependency to frontends/dri/libdrm build. Fix header issues with xcb.h being used.
util: Updated util/libdrm.h stubs with drmGetMagic()
dri: guarded DRI code.
glx: Switched DRI2 functions over to use Apple specific alternatives and extension name.
meson: relaxed some meson restrictions on MacOS/Apple allowing for wider build support.
apple: Extended Apple feature support using GLX_USE_APPLE.
apple: Meson defines GLX_USE_APPLE to allow for Gallium drivers to work on MacOS.
zink: Fixed header location and compiling issue with [[deprecated]] from newer MoltenVK versions.
zink: use portability EXT on Apple.
zink: stopped the use of VkFormatProperties3 if the reported API is less than 1.3 or VK_KHR_format_feature_flags2 not present.
zink: removed `MESA_PRIM_QUADS` from the supported `PIPE_CAP_SUPPORTED_PRIM_MODES`.
zink: Avoid issues when kopper tries using XCB WSI on Apple.
zink/apple: added `moltenvk-dir` search to allow MoltenVK to be sourced from brew.
zink/apple: update docs to reflect the current status of Zink on macOS.
nyanmisaka (2):
frontends/va: Report vendor and device ID through VADisplayPCIID
radeonsi/uvd_enc: update to use correct padding size
qbojj (1):
vulkan: Fix calculation of flags in vk_graphics_pipeline_state_fill
thfrwn (1):
mesa: fix off-by-one for newblock allocation in dlist_alloc