17ec681f3SmrgMesa 21.3.0 Release Notes / 2021-11-17 27ec681f3Smrg====================================== 37ec681f3Smrg 47ec681f3SmrgMesa 21.3.0 is a new development release. People who are concerned 57ec681f3Smrgwith stability and reliability should stick with a previous release or 67ec681f3Smrgwait for Mesa 21.3.1. 77ec681f3Smrg 87ec681f3SmrgMesa 21.3.0 implements the OpenGL 4.6 API, but the version reported by 97ec681f3SmrgglGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / 107ec681f3SmrgglGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. 117ec681f3SmrgSome drivers don't support all the features required in OpenGL 4.6. OpenGL 127ec681f3Smrg4.6 is **only** available if requested at context creation. 137ec681f3SmrgCompatibility contexts may report a lower version depending on each driver. 147ec681f3Smrg 157ec681f3SmrgMesa 21.3.0 implements the Vulkan 1.2 API, but the version reported by 167ec681f3Smrgthe apiVersion property of the VkPhysicalDeviceProperties struct 177ec681f3Smrgdepends on the particular driver being used. 187ec681f3Smrg 197ec681f3SmrgSHA256 checksum 207ec681f3Smrg--------------- 217ec681f3Smrg 227ec681f3Smrg:: 237ec681f3Smrg 247ec681f3Smrg a2753c09deef0ba14d35ae8a2ceff3fe5cd13698928c7bb62c2ec8736eb09ce1 mesa-21.3.0.tar.xz 257ec681f3Smrg 267ec681f3Smrg 277ec681f3SmrgNew features 287ec681f3Smrg------------ 297ec681f3Smrg 307ec681f3Smrg- VK_EXT_color_write_enable on lavapipe 317ec681f3Smrg- GL_ARB_texture_filter_anisotropic in llvmpipe 327ec681f3Smrg- Anisotropic texture filtering in lavapipe 337ec681f3Smrg- VK_EXT_shader_atomic_float2 on Intel and RADV. 347ec681f3Smrg- VK_EXT_vertex_input_dynamic_state on RADV. 357ec681f3Smrg- VK_KHR_timeline_semaphore on lavapipe 367ec681f3Smrg- VK_EXT_external_memory_host on lavapipe 377ec681f3Smrg- GL_AMD_pinned_memory on llvmpipe 387ec681f3Smrg- GL 4.5 compatibility on llvmpipe 397ec681f3Smrg- VK_EXT_primitive_topology_list_restart on RADV and lavapipe. 407ec681f3Smrg- ES 3.2 on zink 417ec681f3Smrg- VK_KHR_depth_stencil_resolve on lavapipe 427ec681f3Smrg- VK_KHR_shader_integer_dot_product on RADV. 437ec681f3Smrg- OpenGL FP16 support on llvmpipe 447ec681f3Smrg- VK_KHR_shader_float16_int8 on lavapipe 457ec681f3Smrg- VK_KHR_shader_subgroup_extended_types on lavapipe 467ec681f3Smrg- VK_KHR_spirv_1_4 on lavapipe 477ec681f3Smrg- Experimental raytracing support on RADV 487ec681f3Smrg- VK_KHR_synchronization2 on Intel 497ec681f3Smrg- NGG shader based culling is now enabled by default on GFX10.3 on RADV. 507ec681f3Smrg- VK_KHR_maintenance4 on RADV 517ec681f3Smrg- VK_KHR_format_feature_flags2 on RADV. 527ec681f3Smrg- EGL_EXT_present_opaque on wayland 537ec681f3Smrg 547ec681f3Smrg 557ec681f3SmrgBug fixes 567ec681f3Smrg--------- 577ec681f3Smrg 587ec681f3Smrg- RADV/ACO: Monster Hunter Rise Demo renders wrong results 597ec681f3Smrg- radv: Odd lack of implicit host memory invalidation 607ec681f3Smrg- Regression/Bisected: Crash in Godot games when steam overlay enabled 617ec681f3Smrg- RADV: IsHelperInvocationEXT query is not considered volatile in ACO 627ec681f3Smrg- ANV: error submitting with the same semaphore for wait and signal - regression? 637ec681f3Smrg- [TGL+] anv: some dEQP-VK.drm_format_modifiers.* fails 647ec681f3Smrg- Mesa 21.3rc3 - compile failure 657ec681f3Smrg- iris: subslice assertion failure on some hardware during initialization 667ec681f3Smrg- Final Fantasy V (Old version): Random text characters are not displayed 677ec681f3Smrg- Diagonal rendering artifacts in Tomb Raider 687ec681f3Smrg- dota2 black squares corruption 697ec681f3Smrg- [hsw][bisected][regression] dEQP-VK.reconvergence.*nesting* failures 707ec681f3Smrg- anv: dEQP-VK.wsi.wayland.<various> failures 717ec681f3Smrg- radv_android.c: build errors due to commit 49c3a88 727ec681f3Smrg- dEQP-EGL.functional.sharing.gles2.multithread.* regression with Iris 737ec681f3Smrg- [radeonsi] Euro Truck Simulator 2: broken mimimap 747ec681f3Smrg- [regression][bisected] Launching Valheim OpenGL game leads to GPU Hang 757ec681f3Smrg- Android Meson build regression: hardware/system information apps crash on Raspberry Pi 4 767ec681f3Smrg- radv: format properties are broken with modifiers 777ec681f3Smrg- anv: dEQP-VK.graphicsfuzz.cov-multiple-one-iteration-loops-global-counter-write-matrices fails 787ec681f3Smrg- iris: CCS modifier tests failing with suballocation 797ec681f3Smrg- [RADV] For the game "World War Z: Aftermath" (Vulkan API) should used RADV_DEBUG=invariantgeom param 807ec681f3Smrg- RADV: Resident Evil Village needs invariantgeom when NGG culling is enabled 817ec681f3Smrg- radv: VK_EXT_vertex_input_dynamic_state 827ec681f3Smrg- anv: dynamic state emission is busted 837ec681f3Smrg- radeonsi: out of bounds access/compiler warning 847ec681f3Smrg- RADV: Rendering issues in Resident Evil 2 with NGGC 857ec681f3Smrg- GPU Hang/reset/forced reboot - latest mesa - mesa-demos/gloss 867ec681f3Smrg- crocus: Incorrect stride when used through prime 877ec681f3Smrg- radv: Vulkan games and demo apps are broken since "use DCC compatible with image stores for < 4K resolutions" 887ec681f3Smrg- anv: descriptorBindingUniformBufferUpdateAfterBind feature is not supported 897ec681f3Smrg- Cheza board reboots into another image on retry 907ec681f3Smrg- freedreno: several regressions in org.skia.skqp.SkQPRunner 917ec681f3Smrg- android: radv_android.c building errors after commits 9fc16b6 and 48cae11 927ec681f3Smrg- iris: Implement memory sub-allocation 937ec681f3Smrg- Assault Android Cactus ( STEAM AppID 250110) - Black triangles on Main menu character 947ec681f3Smrg- World War Z - Renders red if FSR is enabled 957ec681f3Smrg- Significant performance drop on Radeon HD 8400 967ec681f3Smrg- turnip/a650: most VK_EXT_filter_cubic tests in dEQP-VK.texture.filtering.* fail 977ec681f3Smrg- Ender Lilies: Turnip: Fails to render in-game 987ec681f3Smrg- [nir][radv] Out of range shift when compiling Resident Evil Village shaders 997ec681f3Smrg- [nir][radv] Out of range shift when compiling Resident Evil Village shaders 1007ec681f3Smrg- GL_EXT_disjoint_timer_query glGetInteger64v GL_TIMESTAMP failing with GL_INVALID_ENUM 1017ec681f3Smrg- Valgrind errors in VBO display list code since vertex store rework 1027ec681f3Smrg- Issue with Turnip compilation on Oneplus 8 1037ec681f3Smrg- freedreno: primtype_mask 1047ec681f3Smrg- [radv] bufferImageGranularity is 64 1057ec681f3Smrg- ../mesa-9999/src/amd/llvm/ac_llvm_helper.cpp:63:14: error: 'class llvm::AttributeList' has no member named 'hasAttribute'; did you mean 'getAttributes'? 1067ec681f3Smrg- GPU Reset POLARIS with Unigine Heaven and X4 1077ec681f3Smrg- RADV: consistent crash in Splitgate 1087ec681f3Smrg- llvmpipe doesn't compile a shader with an inner scope in a for loop 1097ec681f3Smrg- llvmpipe doesn't compile the increment of a for a loop 1107ec681f3Smrg- Mesa 21.2.1 implementation error: unexpected state[0] in make_state_flags() 1117ec681f3Smrg- freedreno: regression in org.skia.skqp.SkQPRunner#gles_localmatriximagefilter 1127ec681f3Smrg- [Radeonsi] VA-API Encoding no longer works on AMD PITCAIRN 1137ec681f3Smrg- turnip: Geometry flickering in Genshin Impact after 83e9a7fbcf53b90d0de66985dbbf91986fc7b05d 1147ec681f3Smrg- i915g: Need to link fail on non-unrolled loops 1157ec681f3Smrg- spirv2dxil.c:128:22: error: passing argument 7 of ‘spirv_to_dxil’ from incompatible pointer type [-Werror=incompatible-pointer-types] 1167ec681f3Smrg- OSMesa problem resizing 1177ec681f3Smrg- iris: Perform busy tracking for resources without GEM_BUSY/GEM_WAIT 1187ec681f3Smrg- [RADV] The game "Aliens: Fireteam Elite" start crashing after commit 2e56e2342094e8ec90afa5265b1c43503f662939 1197ec681f3Smrg- radeonsi: Smart Access Memory not being enabled by default? 1207ec681f3Smrg- Memory leak: si_get_shader_binary_size is missing a call to ac_rtld_close 1217ec681f3Smrg- dEQP-GLES3.stress.draw.unaligned_data.random.4 segfault 1227ec681f3Smrg- gl_DrawID is incorrect for glMultiDrawElementsBaseVertex/glMultiDrawElementsIndirect 1237ec681f3Smrg- iris: Scanout buffers now mapped WB cause glitches on screen 1247ec681f3Smrg- turnip: dEQP-VK.spirv_assembly.instruction.graphics.spirv_ids_abuse.lots_ids_* fails 1257ec681f3Smrg- i915g: nir_to_tgsi: Error : CONST[0]: The same register declared more than once 1267ec681f3Smrg- i915: GPU hang when doing FB fetch and gl_FragDepth write in one shader 1277ec681f3Smrg- ../mesa-9999/src/amd/compiler/aco_instruction_selection.cpp:10009:30: error: 'exchange' is not a member of 'std' 1287ec681f3Smrg- radv: disable DCC for displayable images with storage on navi12/14 1297ec681f3Smrg- RADV: Menu static/artifacts in Doom Eternal 1307ec681f3Smrg- Crash happens when testing GL_PIXEL_PACK_BUFFER 1317ec681f3Smrg- Possible miscompilation of an integer division with vulkan 1327ec681f3Smrg- panfrost G31 - Cathedral crash- opengl 2.1 game (I guess) 1337ec681f3Smrg- freedreno C++14 build error 1347ec681f3Smrg- panfrost / armv7 - crash with mesa newer than 21.0.3 1357ec681f3Smrg- iris: recursive mutex acquire when re-using BO with aux map 1367ec681f3Smrg- llvmpipe doesn't compile a valid shader with an useless switch 1377ec681f3Smrg- i915g: dEQP-GLES2.functional.fbo.completeness.renderable.texture.color0.rgb10_a2 failure 1387ec681f3Smrg- i915g: polygon offset CTS failures 1397ec681f3Smrg- GetFragDataLocation(prog, "gl_FragColor") generates INVALID_OPERATION, but specs don't say it should 1407ec681f3Smrg- anv: VK_EXT_memory_budget doesn't know about device local memory 1417ec681f3Smrg- turnip: dEQP-VK.api.version_check.entry_points regression 1427ec681f3Smrg- Possible miscompilation of a comparison with unsigned zero 1437ec681f3Smrg- i915g: FXT1 support 1447ec681f3Smrg- dEQP-VK.wsi.android.swapchain.create#image_swapchain_create_info crash on Android R 1457ec681f3Smrg- Nine Regression with util: Switch the non-block formats to unpacking rgba rows instead of rects. 1467ec681f3Smrg- Add an Intel NDK Android build job 1477ec681f3Smrg- android: anv building error after commit e08370d 1487ec681f3Smrg- panfrost G31 Unreal Tournament - various glitches (apitrace) 1497ec681f3Smrg- Miscompilation of a switch case 1507ec681f3Smrg- ci/virgl: "dEQP error: waiting got error - 16, slow gpu or hang?" flakes 1517ec681f3Smrg- [radeonsi][regression] CPU is being used ~10 times more than usual after c5478f9067f. 1527ec681f3Smrg- i915g: cos/sin accuracy 1537ec681f3Smrg- glGetTexImage with PBO is not accelerated on Gallium 1547ec681f3Smrg- radeonsi: bad performance on PBO packs 1557ec681f3Smrg- dEQP-VK.wsi.android.swapchain.create#image_swapchain_create_info crash on Android R 1567ec681f3Smrg- [kbl] GPU hang launching UE4Editor (unreal engine) 1577ec681f3Smrg- turnip: A few dEQP-VK.pipeline.framebuffer_attachment.* tests failing due to "FINISHME: unaligned store of msaa attachment" 1587ec681f3Smrg- ci: new freedreno trace job running for lavapipe 1597ec681f3Smrg- i915g: Emit TXP 1607ec681f3Smrg- The image is distorted while use iGPU(Intel GPU) rendering and output via dGPU (AMD GPU) 1617ec681f3Smrg- Radeon 5700XT: Small render glitches around "heat balls" in dhewm3 (Doom 3) 1627ec681f3Smrg- lima: regression in plbu scissors cmd 1637ec681f3Smrg- freedreno: regression in org.skia.skqp.SkQPRunner#gles_multipicturedraw_*_tiled 1647ec681f3Smrg- Incorrect rendering 1657ec681f3Smrg- intel/isl: Wrong surface format name in batch 1667ec681f3Smrg- Unused graph areas created for device and format in VK_LAYER_MESA_overlay 1677ec681f3Smrg- [RADV] FSR in Resident Evil: Village looks very pixelated on Polaris 1687ec681f3Smrg- iris: regression in yuzu 1697ec681f3Smrg- 21.2.0rc1 Build Failure - GCC6.3 1707ec681f3Smrg- Crash in update_buffers after closing KDE "splash screen" downloader 1717ec681f3Smrg- Firefox (wayland) crash in wayland_platform 1727ec681f3Smrg- Crash in update_buffers after closing KDE "splash screen" downloader 1737ec681f3Smrg- Firefox (wayland) crash in wayland_platform 1747ec681f3Smrg- radeonsi: persistent, read-only buffer maps are slow to read 1757ec681f3Smrg- substance painter flickering with jagged texture and masks shown black 1767ec681f3Smrg- radv: FP16 mode in FidelityFX FSR doesn't look right 1777ec681f3Smrg- Regression, ACO: DOOM Eternal hangs with ACO 1787ec681f3Smrg- Regression in Turnip with KGSL and Zink running opengl in proot 1797ec681f3Smrg- [bsw][i965][bisected][regression] waffle crashing after patch 1807ec681f3Smrg- Validation crash on wlroots after wl_shm appeared 1817ec681f3Smrg- [RADV] Blocky corruption in Scarlet Nexus and vkd3d-proton 2.4 1827ec681f3Smrg 1837ec681f3Smrg 1847ec681f3SmrgChanges 1857ec681f3Smrg------- 1867ec681f3Smrg 1877ec681f3SmrgAdam Jackson (18): 1887ec681f3Smrg 1897ec681f3Smrg- glx/drisw: Nerf PutImage when loaderPrivate == NULL 1907ec681f3Smrg- mesa: (correctly) flush more in _mesa_make_current 1917ec681f3Smrg- egl/dri2: Stop disabling pbuffer support on msaa configs 1927ec681f3Smrg- dri: Reformat DRI context attribute #defines 1937ec681f3Smrg- glx: Fix and simplify the share context compatibility check 1947ec681f3Smrg- glx: Store the context vtable on the glx screen 1957ec681f3Smrg- glx/dri2: Require the driver to support v4 of __DRI_DRI2 1967ec681f3Smrg- glx/drisw: Remove some misplaced error checks 1977ec681f3Smrg- glx/dri: Collect the GLX context attributes in a struct 1987ec681f3Smrg- glx: Simplify context API profile computation 1997ec681f3Smrg- glx: Remove some unused declarations from glxclient.h 2007ec681f3Smrg- glx: Move __glFreeAttributeState next to its one caller 2017ec681f3Smrg- glx: Clarify a debug message 2027ec681f3Smrg- glx: Don't strip off window/pixmap support from float fbconfigs 2037ec681f3Smrg- wsi/x11: Fix a misunderstanding about how xcb_get_geometry works 2047ec681f3Smrg- wsi/x11: Fetch and discard the SYNC extension info 2057ec681f3Smrg- dri: Remove the allow_fp16_configs option, always allow them 2067ec681f3Smrg- egl/dri: Enable FP16 for EGL_EXT_platform_device 2077ec681f3Smrg 2087ec681f3SmrgAdrian Bunk (1): 2097ec681f3Smrg 2107ec681f3Smrg- util/format: NEON is not available with the soft-float ABI 2117ec681f3Smrg 2127ec681f3SmrgAlejandro Piñeiro (12): 2137ec681f3Smrg 2147ec681f3Smrg- broadcom: don't define internal BPP values twice 2157ec681f3Smrg- vulkan: add vk_spec_info_to_nir_spirv util method 2167ec681f3Smrg- spirv: set medium precision with RelaxedPrecision decorator 2177ec681f3Smrg- broadcom/qpu: update/remove comments 2187ec681f3Smrg- broadcom/qpu: add new lookup opcode description helper 2197ec681f3Smrg- broadcom/qpu: use and expand version info at opcode description 2207ec681f3Smrg- broadcom/compiler: remove commented out vir_LOAD_IMM methods 2217ec681f3Smrg- broadcom/compiler: remove qpu_acc helper 2227ec681f3Smrg- broadcom/common: remove unused debug helper 2237ec681f3Smrg- v3d/v3dv: add unlikely for any V3D_DEBUG check 2247ec681f3Smrg- v3dv: use NULL for vk_error on initialization failures 2257ec681f3Smrg- v3dv/pipeline: don't clone the nir shader at pipeline_state_create_binning 2267ec681f3Smrg 2277ec681f3SmrgAlyssa Rosenzweig (243): 2287ec681f3Smrg 2297ec681f3Smrg- panfrost: Add perf_debug macros 2307ec681f3Smrg- panfrost: Warn on software conditional rendering 2317ec681f3Smrg- panfrost: Warn on going out of AFBC 2327ec681f3Smrg- panfrost: Log reasons for flushes 2337ec681f3Smrg- panfrost: Warn on get_fresh_batch_for_fbo 2347ec681f3Smrg- panfrost: Warn on get_fresh_batch 2357ec681f3Smrg- panfrost: Warn on transitions to linear 2367ec681f3Smrg- pan/bi: Copy liveness routines back 2377ec681f3Smrg- pan/bi: Copy back add_successor 2387ec681f3Smrg- pan/bi: Copy back bi_foreach_successor 2397ec681f3Smrg- pan/bi: Copy block bi_block 2407ec681f3Smrg- pan/bi: Clean up useless casts 2417ec681f3Smrg- pan/bi: Clean up liveness freeing 2427ec681f3Smrg- pan/bi: Shrink live array to 8-bits 2437ec681f3Smrg- meson: Build panfrost with tools=panfrost 2447ec681f3Smrg- panfrost: Remove unnecessary bifrost_compiler deps 2457ec681f3Smrg- panfrost: Only build libpanfrost with GL/VK 2467ec681f3Smrg- pan/bi: Add explicit cast for lod_or_mode 2477ec681f3Smrg- pan/bi: Remove duplicate NIR compiler options 2487ec681f3Smrg- pan/bi: Mark mod to string as maybe unused 2497ec681f3Smrg- panfrost,panvk: Remove broken v4 spilling code 2507ec681f3Smrg- targets/graw-xlib: Add missing dep_x11 2517ec681f3Smrg- pan/mdg: Garbage collect silly quirk 2527ec681f3Smrg- panfrost: Move context initalization to the vtable 2537ec681f3Smrg- panfrost: Make sampler view creation private 2547ec681f3Smrg- panfrost: Move sysval analysis out of per-gen 2557ec681f3Smrg- panfrost: Compile pan_cmdstream per-gen 2567ec681f3Smrg- panfrost: Statically determine uses_clamp 2577ec681f3Smrg- panfrost: Don't make get_index_buffer_bounded per-gen 2587ec681f3Smrg- panfrost: Match sampler "nearest" names 2597ec681f3Smrg- panfrost: Share sampler code across archs 2607ec681f3Smrg- panfrost: Share blend code across architectures 2617ec681f3Smrg- panfrost: #ifdef pan_merge_empty_fs 2627ec681f3Smrg- panfrost: #ifdef fragment RSD packing 2637ec681f3Smrg- panfrost: Add a concatenation macro for genxml 2647ec681f3Smrg- panfrost: Use PAN_ARCH for the rest of pan_cmdstream 2657ec681f3Smrg- panfrost: Move init_batch to GenXML vtbl 2667ec681f3Smrg- panfrost: Make panfrost_batch_get_bifrost_tiler per-gen 2677ec681f3Smrg- panvk: Fix sampler filter modes on Bifrost 2687ec681f3Smrg- asahi: Identify texture address field 2697ec681f3Smrg- asahi: Fix sampler filtering flag 2707ec681f3Smrg- asahi: Identify texture dimension field 2717ec681f3Smrg- asahi: Set texture dimension field 2727ec681f3Smrg- asahi: Calculate cube map stride 2737ec681f3Smrg- asahi: Calculate resource offsets for cube maps 2747ec681f3Smrg- asahi: Implement cube map tiling transfers 2757ec681f3Smrg- asahi: Use agx_rsrc_offset for linear transfer_map 2767ec681f3Smrg- asahi: Allow tiled cube maps 2777ec681f3Smrg- asahi: Simplify can_tile type signature 2787ec681f3Smrg- asahi: Require tiling for cube maps 2797ec681f3Smrg- asahi: Assert texture layer is nonzero 2807ec681f3Smrg- agx: Don't set helper invocation kill bit 2817ec681f3Smrg- agx: Fix mismatched units in load_ubo 2827ec681f3Smrg- agx: Dump register file when failing to allocate 2837ec681f3Smrg- agx: Use consistent ncomps 2847ec681f3Smrg- agx: Plug memory leak in register allocator 2857ec681f3Smrg- asahi: Enable instancing 2867ec681f3Smrg- agx: Drop dated /* TODO: RA \*/ 2877ec681f3Smrg- agx: Handle load_instance_id 2887ec681f3Smrg- agx: Add agx_ushr helper 2897ec681f3Smrg- agx: Add udiv-by-constant routine 2907ec681f3Smrg- agx: Include divisors in the vertex shader key 2917ec681f3Smrg- agx: Implement instanced arrays 2927ec681f3Smrg- agx: Define p_extract for type converts 2937ec681f3Smrg- asahi: Pass instance_divisor to the compiler 2947ec681f3Smrg- agx: Add agx_format_shift routine 2957ec681f3Smrg- agx: Shift vertex buffer stride in the compiler 2967ec681f3Smrg- asahi: Add integers to agx_vertex_formats 2977ec681f3Smrg- asahi: Generalize src_offset for non-4byte formats 2987ec681f3Smrg- pan/va: Add initial ISA.xml for Valhall 2997ec681f3Smrg- pan/va: Add ISA.xml parser and support code 3007ec681f3Smrg- pan/va: Assert no instructions are duplicated 3017ec681f3Smrg- pan/va: Add Valhall assembler 3027ec681f3Smrg- pan/va: Check for FAU conflicts in the assembler 3037ec681f3Smrg- pan/va: Add disassembler generator 3047ec681f3Smrg- pan/va: Add dis/assembler test cases 3057ec681f3Smrg- pan/va: Add negative test cases for the assembler 3067ec681f3Smrg- pan/va: Add assembler test harness 3077ec681f3Smrg- pan/va: Add disassembler test harness 3087ec681f3Smrg- pan/va: Integrate the tests into meson test 3097ec681f3Smrg- pan/bi: Remove unused pointer from bi_instr 3107ec681f3Smrg- pan/bi: Remove unused option 3117ec681f3Smrg- pan/bi: Parse file names in standalone compiler 3127ec681f3Smrg- pan/bi: Zero initialize shader_info 3137ec681f3Smrg- pan/bi: Do more mesa/st stuff in standalone compiler 3147ec681f3Smrg- pan/bi: Add quirks for Mali G78 3157ec681f3Smrg- pan/bi: Only call clause code on Bifrost 3167ec681f3Smrg- pan/bi: Output binaries from standalone compiler 3177ec681f3Smrg- pan/bi: Add helpers for unit testing 3187ec681f3Smrg- pan/bi: Add instruction equality helper 3197ec681f3Smrg- pan/bi: Add instruction unit test macro 3207ec681f3Smrg- pan/bi: Remove redundant check in clamp fusing 3217ec681f3Smrg- pan/bi: Constify BIR manipulation 3227ec681f3Smrg- pan/bi: DCE after bifrost_nir_lower_algebraic_late 3237ec681f3Smrg- pan/bi: Add discard flag to bi_index 3247ec681f3Smrg- pan/bi: Remove unused BIR_FAU_HI 3257ec681f3Smrg- pan/bi: Model \*ADD_IMM instructions in IR 3267ec681f3Smrg- pan/bi: Model RSCALE for Valhall 3277ec681f3Smrg- pan/bi: Model Valhall special values as FAU 3287ec681f3Smrg- pan/bi: Fix typo in FAU enum 3297ec681f3Smrg- pan/bi: Rename NOP.i32 to NOP 3307ec681f3Smrg- pan/bi: Rename CLPER_V7 back to CLPER 3317ec681f3Smrg- pan/bi: Add strip_index helper 3327ec681f3Smrg- pan/bi: Add helper to swizzle a constant 3337ec681f3Smrg- pan/bi: Use bi_apply_swizzle in constant folding 3347ec681f3Smrg- pan/bi: Refactor constant folding for testability 3357ec681f3Smrg- pan/bi: Add constant folding unit test 3367ec681f3Smrg- pan/bi: Fix UBO push with nir_opt_shrink_vectors 3377ec681f3Smrg- pan/bi: Garbage collect stuff in bi_layout.c 3387ec681f3Smrg- pan/bi: Add branch_offset immediate 3397ec681f3Smrg- pan/bi: Clean up and export bi_reconverge_branches 3407ec681f3Smrg- pan/bi: Clarify the logic of bi_reconverge_branches 3417ec681f3Smrg- pan/bi: Align staging registers on Valhall 3427ec681f3Smrg- pan/va: Allow floating-point swizzles on ATEST 3437ec681f3Smrg- gallium/tests: Fix warning calculating absdiff 3447ec681f3Smrg- pan/bi: Inline away bi_must_last 3457ec681f3Smrg- pan/bi: Remove dated ASSERTED properties 3467ec681f3Smrg- pan/bi: Expose unit tested scheduler predicates 3477ec681f3Smrg- pan/bi: Add BIT_ASSERT helper for unit testing 3487ec681f3Smrg- pan/bi: Teach meson about scheduler predicate test 3497ec681f3Smrg- pan/bi: Teach meson about Bifrost packing test 3507ec681f3Smrg- pan/bi: Teach meson about format pack tests 3517ec681f3Smrg- glsl/standalone: Lower COMPUTE shader precision 3527ec681f3Smrg- pan/bi: Restrict swizzles on same cycle temporaries 3537ec681f3Smrg- pan/bi: Test restrictions on same-cycle temporaries 3547ec681f3Smrg- pan/bi: Remove incorrect errata workaround 3557ec681f3Smrg- pan/bi: Use getopt for bifrost_compiler 3567ec681f3Smrg- pan/bi: Lower fragment output with <4 components 3577ec681f3Smrg- pan/bi: Add bi_entry_block helper 3587ec681f3Smrg- pan/bi: Handle asymmetric staging in bi_count_read_registers 3597ec681f3Smrg- pan/bi: Stub 64-bit in count_write_registers 3607ec681f3Smrg- pan/bi: Validate the live set starts empty 3617ec681f3Smrg- nir/lower_mediump_io: Don't remap base unless needed 3627ec681f3Smrg- nir/lower_mediump: Fix metadata in all passes 3637ec681f3Smrg- pan/bi: Make bi_opt_push_ubo optional 3647ec681f3Smrg- pan/bi: Add a noopt debug option 3657ec681f3Smrg- panfrost: Add LINEAR debug option 3667ec681f3Smrg- panfrost: Remove unused #defines 3677ec681f3Smrg- panfrost: Use _PU for non-dithered formats 3687ec681f3Smrg- panfrost: Add blend helper packing the equation 3697ec681f3Smrg- panfrost: Fix is_opaque when blend_enable=false 3707ec681f3Smrg- panfrost: Simplify blend_factor_constant_mask 3717ec681f3Smrg- panfrost: Add basic fixed-function blending tests 3727ec681f3Smrg- panfrost: Leverage Bifrost's 2*src blend factor 3737ec681f3Smrg- panfrost: Test src*dst + dst*src blending 3747ec681f3Smrg- pan/va: Document IEEE 754 conformance of clamps 3757ec681f3Smrg- pan/bi: Constant fold texturing lowerings 3767ec681f3Smrg- pan/bi: Unit test new constant folding patterns 3777ec681f3Smrg- pan/bi: Simplify bi_compose_clamp 3787ec681f3Smrg- pan/bi: Fuse abs/neg more on Valhall 3797ec681f3Smrg- pan/bi: Add shader equality helper for unit tests 3807ec681f3Smrg- pan/bi: Use FABSNEG pseudo ops for modifier prop 3817ec681f3Smrg- pan/bi: Add optimizer unit tests 3827ec681f3Smrg- pan/bi: Use FCLAMP pseudo op for clamp prop 3837ec681f3Smrg- pan/bi: Add fclamp unit tests 3847ec681f3Smrg- pan/bi: Fuse DISCARD with conditions 3857ec681f3Smrg- pan/bi: Unit test DISCARD+FCMP fusing 3867ec681f3Smrg- docs/panfrost: Update llvm option 3877ec681f3Smrg- drm-shim: Support kernels with >4k pages 3887ec681f3Smrg- panfrost: Fix leak of render node fd 3897ec681f3Smrg- panfrost: Rewrite the clear colour packing code 3907ec681f3Smrg- panvk: Use pan_pack_color 3917ec681f3Smrg- panfrost: Mark R5G6B5 as blendable 3927ec681f3Smrg- panfrost: Unit test clear colour packing 3937ec681f3Smrg- panfrost: Add dither state to the clear colour tests 3947ec681f3Smrg- panfrost: Handle non-dithered clear colours 3957ec681f3Smrg- panfrost: Add unit tests for non-dithered clears 3967ec681f3Smrg- panfrost: Disable shader-assisted indirect draws 3977ec681f3Smrg- pan/bi: Set eldest_colour dependency for ST_TILE 3987ec681f3Smrg- pan/bi: Don't set td in blend shaders 3997ec681f3Smrg- pan/bi: Correct the sr_count on +ST_TILE 4007ec681f3Smrg- pan/bi: Extract load_sample_id to a helper 4017ec681f3Smrg- pan/bi: Set the sample ID for blend shader LD_TILE 4027ec681f3Smrg- panfrost: Evaluate blend shaders per-sample 4037ec681f3Smrg- pan/bi: Use ST_TILE for multisampled blend output 4047ec681f3Smrg- pan/bi: Use CLPER_V6 on Mali G31 4057ec681f3Smrg- panfrost: Remove unneeded quirks from T760 4067ec681f3Smrg- panfrost: Fix UNORM 10 sizes 4077ec681f3Smrg- panfrost: Use blendable check for tib read check 4087ec681f3Smrg- panfrost: Delete unpacks for blendable formats 4097ec681f3Smrg- pan/mdg: Insert moves before writeout when needed 4107ec681f3Smrg- pan/lower_framebuffer: Don't replicate so much 4117ec681f3Smrg- pan/lower_framebuffer: Use fmul_imm 4127ec681f3Smrg- pan/lower_framebuffer: Unify UNORM handling 4137ec681f3Smrg- pan/lower_framebuffer: Don't treat UNORM 4 special 4147ec681f3Smrg- pan/lower_framebuffer: Don't open-code pad_vec4 4157ec681f3Smrg- pan/lower_framebuffer: Don't open-code pan_unpacked_type_for_format 4167ec681f3Smrg- pan/mdg: Handle swapped 565 and 1010102 unorm 4177ec681f3Smrg- panfrost: Zero initialize blend_shaders 4187ec681f3Smrg- panfrost: Port v5 blend shader issue to blitter 4197ec681f3Smrg- panfrost: Fix NULL dereference in allowlist code 4207ec681f3Smrg- panfrost: Rip out primconvert code 4217ec681f3Smrg- panfrost/ci: Switch to suite support 4227ec681f3Smrg- panfrost/ci: Don't skip matrix inverse tests 4237ec681f3Smrg- panfrost: Protect the variants array with a lock 4247ec681f3Smrg- panfrost: Remove null check in batch_cleanup 4257ec681f3Smrg- panfrost: Simplify get_fresh_batch_for_fbo 4267ec681f3Smrg- panfrost: Don't use ralloc for resources 4277ec681f3Smrg- panfrost: Move bo->label assignment into the lock 4287ec681f3Smrg- panfrost: Remove get_fresh_batch 4297ec681f3Smrg- panfrost: Inline add_fbo_bos 4307ec681f3Smrg- panfrost: Switch resources from an array to a set 4317ec681f3Smrg- panfrost: Cache number of users of a resource 4327ec681f3Smrg- panfrost: Maintain a bitmap of active batches 4337ec681f3Smrg- panfrost: Add foreach_batch iterator 4347ec681f3Smrg- panfrost: Prefer batch->resources to rsrc->users 4357ec681f3Smrg- panfrost: Remove rsrc->track.users 4367ec681f3Smrg- panfrost: Remove writer = NULL assignments 4377ec681f3Smrg- panfrost: Replace writers pointer with hash table 4387ec681f3Smrg- panfrost: Take a ctx when submitting/destroying 4397ec681f3Smrg- panfrost: Raise maximum texture size 4407ec681f3Smrg- panfrost: Remove CACHE_LINE_SIZE #define 4417ec681f3Smrg- panfrost: Remove stale TODOs and XXXs 4427ec681f3Smrg- panfrost: Remove unused functions 4437ec681f3Smrg- pan/bi: Simplify condition 4447ec681f3Smrg- pan/bi: Assert l != NULL in bi_ra 4457ec681f3Smrg- pan/bi: Remove unused clause_start field 4467ec681f3Smrg- pan/bi: Fix format specifiers in disassembler 4477ec681f3Smrg- docs/panfrost: Remove obsolete note on Android.mk 4487ec681f3Smrg- docs/panfrost: We're conformant now! 4497ec681f3Smrg- docs/panfrost: Add web chat link 4507ec681f3Smrg- panfrost: Fix incorrect test condition 4517ec681f3Smrg- panfrost: Add ASTC stretch factor enums 4527ec681f3Smrg- panfrost: Assert ASTC/AFBC are not used on v4 4537ec681f3Smrg- panfrost: Use ASTC 2D enums 4547ec681f3Smrg- panfrost: Encode 3D ASTC dimensions 4557ec681f3Smrg- panfrost: Move special_varying to compiler definitions 4567ec681f3Smrg- panfrost: Fix off-by-one in varying count assert 4577ec681f3Smrg- panfrost: Introduce PAN_MAX_VARYINGS define 4587ec681f3Smrg- panfrost: Don't set CAP_TGSI_FS_COORD_PIXEL_CENTER_INTEGER 4597ec681f3Smrg- panfrost: Fix PAN_MESA_DEBUG=sync with INTEL_blackhole_render 4607ec681f3Smrg- nir: Add Mali-specific derivative opcodes 4617ec681f3Smrg- pan/bi: Optimize abs(derivative) 4627ec681f3Smrg- panfrost: Don't allow rendering/texturing 48-bit 4637ec681f3Smrg- panfrost: Detect implementations support AFBC 4647ec681f3Smrg- panfrost,panvk: Use dev->has_afbc instead of quirks 4657ec681f3Smrg- panfrost: Fix gl_FragColor lowering 4667ec681f3Smrg- panfrost: Workaround ISSUE_TSIX_2033 4677ec681f3Smrg- panfrost: Add internal afbc_formats 4687ec681f3Smrg- panfrost: Decompress for incompatible AFBC formats 4697ec681f3Smrg- panfrost: Enable AFBC on v7 4707ec681f3Smrg- mesa: Require MRT support for GL3/ES3 4717ec681f3Smrg- nir/lower_pntc_ytransform: Support PointCoordIsSysval 4727ec681f3Smrg 4737ec681f3SmrgAndreas Baierl (5): 4747ec681f3Smrg 4757ec681f3Smrg- lima: CI: Enable GL_R8 and GL_RG8 texture formats 4767ec681f3Smrg- lima: Expose GL_EXT_clip_control 4777ec681f3Smrg- lima: Remove depth near/far workaround 4787ec681f3Smrg- lima: Fix glFrontFace handling 4797ec681f3Smrg- lima/parser: add shader disassembly to dump 4807ec681f3Smrg 4817ec681f3SmrgAndreas Bergmeier (1): 4827ec681f3Smrg 4837ec681f3Smrg- v3dv: implement VK_EXT_physical_device_drm 4847ec681f3Smrg 4857ec681f3SmrgAntonio Caggiano (3): 4867ec681f3Smrg 4877ec681f3Smrg- ci/freedreno: Test with non-redistributable traces 4887ec681f3Smrg- freedreno/ci: Add a manual job for tracking performance 4897ec681f3Smrg- pps: Restore documentation 4907ec681f3Smrg 4917ec681f3SmrgAnuj Phogat (1): 4927ec681f3Smrg 4937ec681f3Smrg- intel/dg2: Add L3 configuration 4947ec681f3Smrg 4957ec681f3SmrgArvind Yadav (1): 4967ec681f3Smrg 4977ec681f3Smrg- radeonsi: remove the use of PKT3_CONTEXT_REG_RMW 4987ec681f3Smrg 4997ec681f3SmrgAxel Davy (1): 5007ec681f3Smrg 5017ec681f3Smrg- util: Fix translate from block compressed to rgba 5027ec681f3Smrg 5037ec681f3SmrgBas Nieuwenhuizen (72): 5047ec681f3Smrg 5057ec681f3Smrg- zink: set dedicated allocation when needed 5067ec681f3Smrg- util/fossilize_db: Update parsed_offset correctly. 5077ec681f3Smrg- util/fossilize_db: Reset file position to parsed_offset on cache_offset read failure. 5087ec681f3Smrg- util/fossilize_db: Flush files after header write. 5097ec681f3Smrg- util/fossilize_db: Be conservative about header length check for locking. 5107ec681f3Smrg- util/fossilize_db: Only allocate entries after full read. 5117ec681f3Smrg- util/fossilize_db: Use uint64_t for file size. 5127ec681f3Smrg- util/fossilize_db: Unlock the cache file if the entry already exists. 5137ec681f3Smrg- util/fossilize_db: Add extra flock mutex. 5147ec681f3Smrg- radv: Use correct signedness in misalign test. 5157ec681f3Smrg- radv: Allocate space for inline push constants. 5167ec681f3Smrg- nir/lower_scratch: Ensure we don't lower vars with unsupported usage. 5177ec681f3Smrg- nir/inline_functions: Handle halting functions. 5187ec681f3Smrg- radv: Check format before calling depth_only/stencil_only. 5197ec681f3Smrg- util/fossilize_db: Don't corrupt keys during entry read. 5207ec681f3Smrg- nir: Avoid visiting instructions multiple times in nir_instr_free_and_dce. 5217ec681f3Smrg- radv: Expose a bufferImageGranularity of 1. 5227ec681f3Smrg- radv: Fix CPU AABB build. 5237ec681f3Smrg- radv: Fix arrayOfPointers for instances in accel struct build. 5247ec681f3Smrg- radv: Add accel struct build support for the object-to-world matrix. 5257ec681f3Smrg- radv: Add more acceleration structure formats. 5267ec681f3Smrg- radv: Add optimized CPU BVH builds. 5277ec681f3Smrg- radv: Add bvh node definitions to a header. 5287ec681f3Smrg- radv: Modify load_sbt_amd intrinsic to get the descriptor. 5297ec681f3Smrg- aco: Implement call scope. 5307ec681f3Smrg- radv: Refactor some nir_channels usage to use nir_channel. 5317ec681f3Smrg- radv: Do more meta shader lowering. 5327ec681f3Smrg- radv: Implement NULL accel struct descriptor write. 5337ec681f3Smrg- nir: Add AMD rt intrinsics. 5347ec681f3Smrg- radv: Add support for ray launch size. 5357ec681f3Smrg- aco: Add support for ray launch size. 5367ec681f3Smrg- nir: Support ray launch size in divergence analysis. 5377ec681f3Smrg- radv: Support nir_intrinsic_load_global_constant. 5387ec681f3Smrg- radv: Add RT cache flushes. 5397ec681f3Smrg- radv: Add pipeline type. 5407ec681f3Smrg- radv: Add group info to pipeline. 5417ec681f3Smrg- radv: Add raytracing pipeline properties. 5427ec681f3Smrg- radv: Make some pipeline functions non-static. 5437ec681f3Smrg- radv: Add scaffolding for RT pipeline compilation incl libraries. 5447ec681f3Smrg- radv: Add main loop variables. 5457ec681f3Smrg- radv: Add helper to inline shaders into the main shader. 5467ec681f3Smrg- radv: Add helper to parse raytracing stages. 5477ec681f3Smrg- radv: Add pass to lower anyhit shader into an intersection shader. 5487ec681f3Smrg- radv: Add ray traversal loop. 5497ec681f3Smrg- radv: Combine all the parts together with a main loop for an RT pipeline. 5507ec681f3Smrg- radv: Add support for setting a dynamic stack size. 5517ec681f3Smrg- radv: Add caching for RT pipelines. 5527ec681f3Smrg- radv: Experimentally enable RT extensions. 5537ec681f3Smrg- radv: Add DMA buffer update function for internal use. 5547ec681f3Smrg- radv: Add an internal indirect dispatch command. 5557ec681f3Smrg- radv: Add an indirect dispatch struct to the header. 5567ec681f3Smrg- radv: Add copy/serialization info to accel struct headers. 5577ec681f3Smrg- radv: Add acceleration structure queries. 5587ec681f3Smrg- radv: Add GPU copy/serialization/deserialization shader. 5597ec681f3Smrg- radv: Add CPU copying of acceleration structures. 5607ec681f3Smrg- radv: Add GPU copying of acceleration structures. 5617ec681f3Smrg- radv: Add CPU serialization of acceleration structures. 5627ec681f3Smrg- radv: Add GPU serialization of acceleration structures. 5637ec681f3Smrg- radv: Fix Android build for common functions. 5647ec681f3Smrg- radv: Don't invalidate VCACHE after clear_htile_mask. 5657ec681f3Smrg- radv: Add VK_FORMAT_R16G16B16A16_UNORM for accel. structures. 5667ec681f3Smrg- radv: Handle copying zero queries. 5677ec681f3Smrg- amd/common: Add fallback for misreported clocks for RGP. 5687ec681f3Smrg- radv: Document cache coherency rules. 5697ec681f3Smrg- radv: Add hooks after in-renderpass meta operations. 5707ec681f3Smrg- radv: Try to do a better job of dealing with L2 coherent images. 5717ec681f3Smrg- radv: Fix modifier property query. 5727ec681f3Smrg- radv: Add bufferDeviceAddressMultiDevice support. 5737ec681f3Smrg- radv: Disable coherent L2 optimization on cards with noncoherent L2. 5747ec681f3Smrg- meson: Check arguments before adding. 5757ec681f3Smrg- util: Add support for clang::fallthrough. 5767ec681f3Smrg- radv: Fix memory corruption loading RT pipeline cache entries. 5777ec681f3Smrg 5787ec681f3SmrgBoris Brezillon (137): 5797ec681f3Smrg 5807ec681f3Smrg- panfrost: Fix pan_blitter_emit_bifrost_blend() 5817ec681f3Smrg- panfrost: Add explicit padding to pan_blend_shader_key 5827ec681f3Smrg- pan/gen_pack: Generalize the PREFIX() trick 5837ec681f3Smrg- panvk: Add missing midgard_pack dependency 5847ec681f3Smrg- pan/gen_pack: Add pan_size() and pan_align() macros 5857ec681f3Smrg- panfrost: Move the polygon list init logic to pan_cmdstream.c 5867ec681f3Smrg- pan/gen_macros: Move the TEXTURE definition to gen_macros.h 5877ec681f3Smrg- pan/gen_macros: Map {TEXTURE,SAMPLER} to the arch-specific descriptor 5887ec681f3Smrg- pan/gen_macros: Include midgard_pack.h from gen_macros.h 5897ec681f3Smrg- panfrost: Stop including midgard_pack.h directly 5907ec681f3Smrg- panfrost: s/[idep\_]midgard_pack/[idep\_]pan_packers/ 5917ec681f3Smrg- panfrost: Get rid of the mali_xxx enum redefinitions 5927ec681f3Smrg- panfrost: Add generic mappings for the gen-specific tiler descriptor macros 5937ec681f3Smrg- pan/gen_pack: Add parens around packed1/2 vars in pan_merge() 5947ec681f3Smrg- panfrost: Get rid of all _packed structs in pan_context.h 5957ec681f3Smrg- panfrost: Move panfrost_modifier_to_layout() to pan_texture.c 5967ec681f3Smrg- panfrost: Only emit special attribute buffer entries on pre-v6 hardware 5977ec681f3Smrg- panvk: Prepare per-gen split 5987ec681f3Smrg- panfrost: Prepare indirect dispatch helpers to per-gen XML 5997ec681f3Smrg- panfrost: Prepare indirect draw helpers to per-gen XML 6007ec681f3Smrg- panfrost: Fix pan_blit_ctx_init() when start > end 6017ec681f3Smrg- panfrost: Make pan_blit() return the tiler job pointer 6027ec681f3Smrg- panfrost: v7 does not support RGB32_UNORM textures 6037ec681f3Smrg- panvk: Make the per-arch static lib depend on panvk_entrypoints.h 6047ec681f3Smrg- panvk: Fix panvk_copy_fb_desc() 6057ec681f3Smrg- panvk: Don't use pan_is_bifrost() 6067ec681f3Smrg- panvk: Fix blend descriptor emission 6077ec681f3Smrg- panvk: Only advertise MSAA-4 6087ec681f3Smrg- panvk: We don't support linear filtering on integer formats 6097ec681f3Smrg- panvk: Don't advertise min/max filter 6107ec681f3Smrg- panvk: Fix chan_size calculation in panvk_emit_blend() 6117ec681f3Smrg- panvk: Narrow the allow-forward-pixel-kill condition 6127ec681f3Smrg- panvk: Clamp blend constants before copying them to the cmdbuf state 6137ec681f3Smrg- panvk: Don't allocate an array of blend constants 6147ec681f3Smrg- panvk: Close the panfrost device in the panvk_physical_device_init() error path 6157ec681f3Smrg- panvk: Reset panvk_pool->transient_bo in panvk_pool_reset() 6167ec681f3Smrg- panvk: Fix a BO leak in panvk_pool_alloc_backing() 6177ec681f3Smrg- panvk: Initialize clear values to zero when load_op != OP_CLEAR 6187ec681f3Smrg- panvk: Don't take a BO reference when binding memory to an image 6197ec681f3Smrg- panvk: Only set PAN_DBG_TRACE if PANVK_DEBUG_TRACE is set 6207ec681f3Smrg- panvk: Disable the BO cache 6217ec681f3Smrg- panfrost: Patch Z32_S8X24 format when creating a sampler view 6227ec681f3Smrg- panfrost: Fix the Z32_S8X24 and X32_S8X24 definitions 6237ec681f3Smrg- panfrost: RGB10_A2_SNORM is not a valid texture format on v6+ 6247ec681f3Smrg- panfrost: Drop the R and T flags on SCALED formats 6257ec681f3Smrg- panfrost: RGB332_UNORM is not a valid texture format on v6+ 6267ec681f3Smrg- panfrost: Prepare blitter helpers to per-gen XML 6277ec681f3Smrg- panfrost: Prepare blend helpers to per-gen XML 6287ec681f3Smrg- panfrost: Prepare pan_cs helpers to per-gen XML 6297ec681f3Smrg- panfrost: Move panfrost_major_version() to gen_macros.h 6307ec681f3Smrg- panfrost: Prepare pandecode to per-gen XML 6317ec681f3Smrg- panfrost: Prepare scoreboard helpers to per-gen XML 6327ec681f3Smrg- panfrost: Prepare pan_encoder.h to per-gen XML 6337ec681f3Smrg- panfrost: Prepare texture helpers to per-gen XML 6347ec681f3Smrg- panfrost: Prepare shader helpers to per-gen XML 6357ec681f3Smrg- panfrost: Fix indirect draws when vertex or instance count is 0 6367ec681f3Smrg- panfrost: Fix collision in the indirect draw shader table 6377ec681f3Smrg- panfrost/ci: Skip the indirect_draw+XFB tests 6387ec681f3Smrg- pan/bi: Relax check on 8bit swizzles 6397ec681f3Smrg- pan/bi: Allow passing RT conversion descriptors to fragment shaders 6407ec681f3Smrg- pan/blit: Fix a NULL dereference in the preload path 6417ec681f3Smrg- pan/blit: Extend pan_preload_fb() to return emitted jobs 6427ec681f3Smrg- panvk: Initialize the blend shader logic 6437ec681f3Smrg- panvk: Preload FB attachments when required 6447ec681f3Smrg- panvk: Merge identical BO entries before submitting a job 6457ec681f3Smrg- panvk: Move copy stubs to a separate file 6467ec681f3Smrg- panvk: Move blit/resolve stubs to a separate file 6477ec681f3Smrg- panvk: Get rid of panvk_emit_fragment_job() 6487ec681f3Smrg- panvk: Don't use the subpass to calculate the FB descriptor size 6497ec681f3Smrg- panvk: Don't check the bind_point in panvk_cmd_prepare_fragment_job() 6507ec681f3Smrg- panvk: Make panvk_cmd_alloc_tls_desc() more generic 6517ec681f3Smrg- panvk: Add a panvk_cmd_prepare_tiler_context() helper 6527ec681f3Smrg- panvk: Stop dereferencing the subpass in panvk_cmd_close_batch() 6537ec681f3Smrg- panvk: Issue a fragment job if at least one target is cleared 6547ec681f3Smrg- panvk: Implement vkCmdClear{DepthStencil,Color}Image() 6557ec681f3Smrg- panvk: Implement vkCmdCopyImage() 6567ec681f3Smrg- panvk: Implement vkCmdCopyBufferToImage() 6577ec681f3Smrg- panvk: Implement vkCmdCopyImageToBuffer() 6587ec681f3Smrg- panvk: Implement vkCmdCopyBuffer() 6597ec681f3Smrg- panvk: Implement vkCmdFillBuffer() 6607ec681f3Smrg- panvk: Implement vkCmdUpdateBuffer() 6617ec681f3Smrg- pan/decode: Fix DCD size in Pre frame decoding 6627ec681f3Smrg- pan/blit: Let the caller offset the start/end coords passed to the blitter 6637ec681f3Smrg- pan/blit: Fix 3D blittering 6647ec681f3Smrg- panvk: Implement vkCmdBlitImage() 6657ec681f3Smrg- panvk: Always allocate at least one BLEND descriptor for fragment shaders 6667ec681f3Smrg- panvk: Fix the static scissor/viewport case 6677ec681f3Smrg- panvk: Fix TLS initialization for multi-draw batches 6687ec681f3Smrg- panvk: Extend panvk_cmd_close_batch() to handle current_batch == NULL 6697ec681f3Smrg- panvk: Make panvk_cmd_open_batch() return the new batch 6707ec681f3Smrg- panvk: Use the local batch variable when we have one 6717ec681f3Smrg- panvk: Don't invalidate the vertex attributes when binding a new pipeline 6727ec681f3Smrg- panvk: Fix the pipeline binding logic 6737ec681f3Smrg- panvk: Fix panvk_pipeline_builder_upload_sysval() 6747ec681f3Smrg- panvk: Fix multisample image copies 6757ec681f3Smrg- panvk: Avoid allocating sysvals UBOs when the pipeline has one 6767ec681f3Smrg- panvk: Handle input varyings without previous writes 6777ec681f3Smrg- panvk: Fix an overflow on cmdbuf->state.clear 6787ec681f3Smrg- panvk: Don't expect subpasses to use all RTs 6797ec681f3Smrg- panvk: Only prepare texture descriptors when the image is sampled 6807ec681f3Smrg- panvk: Fix 1DArray image to buffer copy 6817ec681f3Smrg- panvk: Fix size overflow in GetBufferMemoryRequirements() 6827ec681f3Smrg- panvk: Fix stencil clear assignment in panvk_cmd_fb_info_set_subpass() 6837ec681f3Smrg- panvk: Handle VK_REMAINING_{MIP_LEVELS,ARRAY_LAYERS) when creating image views 6847ec681f3Smrg- panvk: Split var copies before lowering them 6857ec681f3Smrg- panvk/ci: Trigger bifrost jobs on vulkan changes 6867ec681f3Smrg- pan/bi: Fix 1DArray image coordinate retrieval 6877ec681f3Smrg- pan/lower_fb: Support SNORM8 unpacking 6887ec681f3Smrg- pan/lower_fb: Re-order components when dealing with raw formats 6897ec681f3Smrg- pan/lower_fb: Add support for B10G10R10A2_UINT variants 6907ec681f3Smrg- pan/lower_fb: Add support for rgb10a2 _SINT variants 6917ec681f3Smrg- panfrost: Use an identity swizzle for RAW formats 6927ec681f3Smrg- panfrost: Add a common genxml file so we can share a few definitions 6937ec681f3Smrg- panfrost: Split command stream descriptor definitions per-gen 6947ec681f3Smrg- panfrost: Move genxml related files to a subdir 6957ec681f3Smrg- nir: Make sure src->num_components < dst->num_components in nir_ssa_for_src() 6967ec681f3Smrg- nir/lower_blend: Pad src to a 4-component vector 6977ec681f3Smrg- nir/lower_blend: Don't lower RTs whose format is set to NONE 6987ec681f3Smrg- nir/lower_blend: Make sure we're not passed scaled formats 6997ec681f3Smrg- nir/lower_blend: Shrink blended result if needed 7007ec681f3Smrg- pan/blend: Allow passing blend constants through a sysval 7017ec681f3Smrg- panvk: Fill the blend constants sysval 7027ec681f3Smrg- panvk: Lower blend operations when needed 7037ec681f3Smrg- panvk/ci: Enable blend tests 7047ec681f3Smrg- panvk: Fix allocation of BOs bigger than the slab size 7057ec681f3Smrg- panvk: Don't use panfrost_get_default_swizzle() on v7+ 7067ec681f3Smrg- panvk: Fix wls_size retrieval 7077ec681f3Smrg- panvk: Pass the render target index to panvk_meta_clear_attachment() 7087ec681f3Smrg- panvk: Allow clear_attachment of RTs > 0 7097ec681f3Smrg- panvk: Support clearing ZS attachments 7107ec681f3Smrg- nir: Add a nir_sysvals_to_varyings() helper 7117ec681f3Smrg- spirv: Let spirv_to_nir() users turn sysvals into input varyings 7127ec681f3Smrg- spirv: Always declare FragCoord as a sysval 7137ec681f3Smrg- spirv: Declare PointCoord as a sysval 7147ec681f3Smrg- vulkan: Fix weak symbol emulation when compiling with MSVC 7157ec681f3Smrg- vulkan: Set unused entrypoints to vk_entrypoint_stub when compiling with MSVC 7167ec681f3Smrg- vulkan: Fix entrypoint generation when compiling for x86 with MSVC 7177ec681f3Smrg 7187ec681f3SmrgBoyuan Zhang (5): 7197ec681f3Smrg 7207ec681f3Smrg- radeon/vcn: initilize num_temporal_layers for hevc 7217ec681f3Smrg- radeon/vcn: track width and height of the last frame 7227ec681f3Smrg- radeon/vcn: check frame size change for vp9 header flags 7237ec681f3Smrg- radeon/vcn: set min value for num_temporal_layers 7247ec681f3Smrg- frontends/va: add num_temporal_layers check 7257ec681f3Smrg 7267ec681f3SmrgCaio Marcelo de Oliveira Filho (27): 7277ec681f3Smrg 7287ec681f3Smrg- vulkan/util: Add and use vk_multialloc_zalloc variants 7297ec681f3Smrg- anv: Zero initialize pipeline structs 7307ec681f3Smrg- spirv: Implement SPV_EXT_shader_atomic_float16_add 7317ec681f3Smrg- vulkan: Update XML and headers to 1.2.185 7327ec681f3Smrg- anv: Advertise support for VK_EXT_shader_atomic_float2 7337ec681f3Smrg- nir/dead_cf: Do not remove loops with loads that can't be reordered 7347ec681f3Smrg- nir: Update documentation for location to mention Task/Mesh 7357ec681f3Smrg- nir: Add a way to identify per-primitive variables 7367ec681f3Smrg- nir: Add per-primitive I/O intrinsics 7377ec681f3Smrg- compiler: Add new non-Multiview Task/Mesh builtins 7387ec681f3Smrg- compiler: Add Task/Mesh to shader_info 7397ec681f3Smrg- nir/lower_io: Identify Mesh output as arrayed 7407ec681f3Smrg- nir/divergence_analysis: Handle Task/Mesh shaders 7417ec681f3Smrg- nir: Don't lower Task/Mesh I/O to temporaries 7427ec681f3Smrg- nir: Allow Task/Mesh to lower compute system values 7437ec681f3Smrg- spirv: Implement non-Multiview parts of SPV_NV_mesh_shader 7447ec681f3Smrg- anv: Simplify subgroup_size_type rules for compute shaders 7457ec681f3Smrg- anv: Refactor subgroup_size_type rules into a single function 7467ec681f3Smrg- spirv: Identify non-temporal memory access 7477ec681f3Smrg- nir/lower_io_to_vector: Allow Task/Mesh to load from outputs 7487ec681f3Smrg- intel: Add and use max_constant_urb_size_kb 7497ec681f3Smrg- iris: Document push constants allocation 7507ec681f3Smrg- anv: Validate vertex related states only when VS is present 7517ec681f3Smrg- anv: Move together primitive pipeline emit calls 7527ec681f3Smrg- anv: Identify code paths specific to graphics primitive pipeline 7537ec681f3Smrg- intel/compiler: Convert test_eu_compact to use gtest 7547ec681f3Smrg- intel/compiler: Remove unused \`ret` declaration 7557ec681f3Smrg 7567ec681f3SmrgCaio Oliveira (1): 7577ec681f3Smrg 7587ec681f3Smrg- util/ra: Fix deserialization of register sets 7597ec681f3Smrg 7607ec681f3SmrgCarsten Haitzler (1): 7617ec681f3Smrg 7627ec681f3Smrg- panfrost: tidy up GPU naming to be in line with official names 7637ec681f3Smrg 7647ec681f3SmrgCharlie Turner (5): 7657ec681f3Smrg 7667ec681f3Smrg- ci: Build libdrm earlier for x86_test-vk 7677ec681f3Smrg- ci: Fix syntax error in radv fails files 7687ec681f3Smrg- ci: Support per-driver skip lists. 7697ec681f3Smrg- radv/ci: Remove duplication in dEQP skip lists. 7707ec681f3Smrg- radv/ci: Fix the GPU_VERSION for polaris10 7717ec681f3Smrg 7727ec681f3SmrgCharmaine Lee (2): 7737ec681f3Smrg 7747ec681f3Smrg- aux/draw: Check for preferred IR to take nir-to-tgsi path in draw module 7757ec681f3Smrg- svga: fix render target views leak 7767ec681f3Smrg 7777ec681f3SmrgChia-I Wu (43): 7787ec681f3Smrg 7797ec681f3Smrg- venus: refactor vn_EndCommandBuffer 7807ec681f3Smrg- egl/surfaceless: try kms_swrast before swrast 7817ec681f3Smrg- meson: allow egl_native_platform to be specified 7827ec681f3Smrg- vulkan/wsi: replace prime_blit_buffer by a bool 7837ec681f3Smrg- venus: clean up vn_AllocateMemory 7847ec681f3Smrg- venus: suballocate memory in more cases 7857ec681f3Smrg- venus: log more WSI messages 7867ec681f3Smrg- vulkan/wsi/x11: do not inherit last_present_mode 7877ec681f3Smrg- venus: print warnings when stuck in busy waits 7887ec681f3Smrg- iris, crocus: add idep_genxml to per_hw target dependencies 7897ec681f3Smrg- venus: update venus-protocol headers 7907ec681f3Smrg- venus: break up vn_device.h 7917ec681f3Smrg- venus: break up vn_device.c 7927ec681f3Smrg- venus: free queues after vkDestroyDevice is emitted 7937ec681f3Smrg- venus: use uint32_t in vn_ring_submit 7947ec681f3Smrg- venus: minor cleanup to physical device init loop 7957ec681f3Smrg- venus: pre-initialize device groups 7967ec681f3Smrg- venus: fix device group enumeration with unsupported devices 7977ec681f3Smrg- venus: group physical device fields with a struct 7987ec681f3Smrg- venus: no supported device is not an error 7997ec681f3Smrg- venus: initialize physical devices once 8007ec681f3Smrg- venus: reorder version fields in vn_instance 8017ec681f3Smrg- venus: init roundtrip fields in vn_instance later 8027ec681f3Smrg- venus: add vn_renderer_submit_simple_sync 8037ec681f3Smrg- venus: support reply shmem without ring 8047ec681f3Smrg- venus: init experimental features before the ring 8057ec681f3Smrg- venus: add and use VN_CS_ENCODER_INITIALIZER 8067ec681f3Smrg- venus: rework vn_instance_submission 8077ec681f3Smrg- venus: make ring buffer size configurable 8087ec681f3Smrg- venus: update venus-protocol headers 8097ec681f3Smrg- venus: raise the ring buffer size to 64KB 8107ec681f3Smrg- venus: refactor vn_instance_enumerate_physical_devices 8117ec681f3Smrg- venus: separate physical device init and filter 8127ec681f3Smrg- venus: copy VkPhysicalDeviceImageDrmFormatModifierInfoEXT 8137ec681f3Smrg- venus: add vn_refcount 8147ec681f3Smrg- venus: convert bo and shmem to use vn_refcount 8157ec681f3Smrg- venus: add a helper to destroy vn_descriptor_set 8167ec681f3Smrg- venus: add vn_refcount to vn_descriptor_set_layout 8177ec681f3Smrg- venus: keep layouts of descriptor sets alive 8187ec681f3Smrg- radv: plug leaks in radv_device_init_accel_struct_build_state 8197ec681f3Smrg- vulkan/wsi/wayland: fix an invalid u_vector_init call 8207ec681f3Smrg- util/vector: make util_vector_init harder to misuse 8217ec681f3Smrg- venus: add atrace support 8227ec681f3Smrg 8237ec681f3SmrgChristian Gmeiner (46): 8247ec681f3Smrg 8257ec681f3Smrg- etnaviv: export supported prim types 8267ec681f3Smrg- etnaviv: remove primconvert 8277ec681f3Smrg- ci: include etnaviv support in ARMHF container. 8287ec681f3Smrg- ci: update kernel 8297ec681f3Smrg- ci/bare-metal: add telnet based serial 8307ec681f3Smrg- ci/bare-metal: add support for eth008 power relay 8317ec681f3Smrg- ci/bare-metal: add etnaviv 8327ec681f3Smrg- lima: fix leak of the screen hash table 8337ec681f3Smrg- util/tests: rename bitset test names 8347ec681f3Smrg- util/bitset: add bitwise AND, OR and NOT 8357ec681f3Smrg- util/tests: add bitwise AND, OR and NOT tests 8367ec681f3Smrg- util/bitset: add right shift 8377ec681f3Smrg- util/tests: add bitset SHR tests 8387ec681f3Smrg- util/bitset: add left shift 8397ec681f3Smrg- util/tests: add bitset SHL tests 8407ec681f3Smrg- util/bitset: s/BITSET_SET_RANGE/BITSET_SET_RANGE_INSIDE_WORD 8417ec681f3Smrg- util/bitset: add BITSET_SET_RANGE(..) 8427ec681f3Smrg- util/tests: add set bit range test 8437ec681f3Smrg- freedreno/isa: add leading zero's 8447ec681f3Smrg- freedreno/isa: simplify custom_target 8457ec681f3Smrg- freedreno/isa: add next_instruction(..) 8467ec681f3Smrg- freedreno/isa: add defines for fprintf(..) usage 8477ec681f3Smrg- freedreno/isa: store max size for needed bitset 8487ec681f3Smrg- freedreno/isa: generate ir3-isa.h 8497ec681f3Smrg- freedreno/isa: generate isaspec-decode.h 8507ec681f3Smrg- freedreno/isa: add bitmask_t to encode.py 8517ec681f3Smrg- freedreno/isa: add bitmask to/from uint64_t helper 8527ec681f3Smrg- freedreno/isa: add BITMASK_WORDS define 8537ec681f3Smrg- freedreno/isa: add store_instruction(..) 8547ec681f3Smrg- freedreno/isa: generate marcos used for printf(..) 8557ec681f3Smrg- freedreno/isa: add split_bits(..) methods 8567ec681f3Smrg- freedreno/isa: decode: switch bitmask_t to BITSET_WORD's 8577ec681f3Smrg- freedreno/isa: encode: switch bitmask_t to BITSET_WORD's 8587ec681f3Smrg- freedreno/isa: update documentation 8597ec681f3Smrg- freedreno/isa: add shbang and make executable 8607ec681f3Smrg- freedreno/isa: move isaspec to a new home 8617ec681f3Smrg- compiler/isaspec: add print(..) helper 8627ec681f3Smrg- compiler/isaspec: keep track of written data 8637ec681f3Smrg- compiler/isaspec: add alignment support 8647ec681f3Smrg- etnaviv: use better name for fd hash table 8657ec681f3Smrg- etnaviv: fix leak of the screen hash table 8667ec681f3Smrg- etnaviv: fix indentation 8677ec681f3Smrg- etnaviv: move drm version readout to drm layer 8687ec681f3Smrg- etnaviv: allow screen creation with NULL renderonly object 8697ec681f3Smrg- etnaviv: extend screen_create(..) with gpu_fd 8707ec681f3Smrg- etnaviv: add etna_lookup_or_create_screen(..) 8717ec681f3Smrg 8727ec681f3SmrgClayton Craft (1): 8737ec681f3Smrg 8747ec681f3Smrg- anv: don't advertise vk conformance on GPUs that aren't conformant 8757ec681f3Smrg 8767ec681f3SmrgConnor Abbott (81): 8777ec681f3Smrg 8787ec681f3Smrg- tu: Triage some CTS failures 8797ec681f3Smrg- ir3: Preserve gl_ViewportIndex in the binning shader 8807ec681f3Smrg- tu: Use NIR for clear/blit shaders 8817ec681f3Smrg- ir3: Delete old packed struct encoding 8827ec681f3Smrg- tu: Handle multisample vkCmdCopyColorImage() 8837ec681f3Smrg- tu: Make tile stores use a dedicated CS 8847ec681f3Smrg- tu: Implement non-aligned multisample GMEM STORE_OP_STORE 8857ec681f3Smrg- freedreno: Rename and document tess primid-related sysvals 8867ec681f3Smrg- tu, freedreno/a6xx, ir3: Rewrite tess PrimID handling 8877ec681f3Smrg- tu, freedreno/a6xx: Fix setting PC_XS_OUT_CNTL::PRIMITVE_ID 8887ec681f3Smrg- ir3: Document RA-related register flags better 8897ec681f3Smrg- tu: Read some input attachments directly 8907ec681f3Smrg- freedreno/a6xx: Add new register fields 8917ec681f3Smrg- freedreno, tu: Stop asking for foveation quality 8927ec681f3Smrg- freedreno, tu: Set GRAS_LRZ_PS_INPUT_CNTL::SAMPLEID 8937ec681f3Smrg- freedreno/a6xx: Document GRAS_SC_CNTL::SINGLE_PRIM_MODE 8947ec681f3Smrg- tu: Fix feedback loops in sysmem mode 8957ec681f3Smrg- tu: Fix xfb when there is a hole at the end 8967ec681f3Smrg- freedreno: Decode a650+ CP_START_BIN/CP_END_BIN packets 8977ec681f3Smrg- tu: Fix logic errors with subpass implicit dependencies 8987ec681f3Smrg- tu: Consider depth/stencil for implicit dependencies 8997ec681f3Smrg- ir3: Add pass to remove unreachable blocks 9007ec681f3Smrg- ir3/ra: Remove logical_unreachable 9017ec681f3Smrg- ir3: Copy-propagate single-source phis 9027ec681f3Smrg- ir3: Print physical successors/predecessors 9037ec681f3Smrg- ir3/print: Use mesa_stream_log_printf for (kill) 9047ec681f3Smrg- ir3/merge_regs: Set wrmask for pcopy destinations 9057ec681f3Smrg- ir3/ra: Reinitialize interval when inserting 9067ec681f3Smrg- ir3/ra: Fix available bitset for live-through collect srcs 9077ec681f3Smrg- ir3/ra: Handle huge merge sets 9087ec681f3Smrg- ir3/ra: Make ir3_reg_interval_remove_all() useful for spilling 9097ec681f3Smrg- ir3: Add loop depth to ir3_block 9107ec681f3Smrg- ir3: Add ra_foreach_src_n/ra_foreach_dst_n 9117ec681f3Smrg- ir3: Fix RA debug printing 9127ec681f3Smrg- ir3: Properly validate pcopy reg sizes 9137ec681f3Smrg- ir3: Fix compress_regs_left accounting for half-regs 9147ec681f3Smrg- ir3: Initial support for spilling non-shared registers 9157ec681f3Smrg- ir3: Fix getting stp/ldp components in ir3_info 9167ec681f3Smrg- ir3, turnip, freedreno: Report stp/ldp in shader stats 9177ec681f3Smrg- freedreno/ci: Add spillall tests 9187ec681f3Smrg- tu: Properly handle waiting on an earlier pipeline stage 9197ec681f3Smrg- tu: Add a650-specific CCU flush workaround 9207ec681f3Smrg- tu: Remove some stale bypass xfails 9217ec681f3Smrg- ir3: Remove ir3_instr::name 9227ec681f3Smrg- ir3: Make instruction IP 32 bits 9237ec681f3Smrg- ir3: Make ir3_register::name 32-bits 9247ec681f3Smrg- ir3/ra: Fix type mismatch when comparing intervals 9257ec681f3Smrg- lima: Add a NIR load duplicating pass 9267ec681f3Smrg- lima/gpir: Rewrite register allocation for value registers 9277ec681f3Smrg- freedreno/computerator: Add support for pvtmem 9287ec681f3Smrg- ir3/lower_pcopy: Use right flags for src const/immed 9297ec681f3Smrg- ir3/lower_pcopy: Set entry->done in the swap loop 9307ec681f3Smrg- tu: Fix VS primid with tess + GS 9317ec681f3Smrg- freedreno/a6xx: Fix VS primid with tess + GS. 9327ec681f3Smrg- ir3: Add bar to beginning of HS with tess_use_shared 9337ec681f3Smrg- freedreno, turnip: Disable 8bpp UBWC on a650 9347ec681f3Smrg- ir3: Make trig replacement expression exact 9357ec681f3Smrg- freedreno/a6xx: Name TPL1_DBG_ECO_CNTL 9367ec681f3Smrg- freedreno, turnip: Set TPL1_DBG_ECO_CNTL better 9377ec681f3Smrg- ir3: Use source in ir3_output_conv_src_type() 9387ec681f3Smrg- tu/clear_blit: Constify some image views 9397ec681f3Smrg- tu: Implement VK_KHR_imageless_framebuffer 9407ec681f3Smrg- ir3/lower_subgroups: Support 16-bit READ_* sources 9417ec681f3Smrg- ir3: Skip src size validation for cat1 9427ec681f3Smrg- tu: Expose VK_KHR_shader_subgroup_extended_types 9437ec681f3Smrg- ir3: Initialize local size earlier 9447ec681f3Smrg- ir3/ra: Don't reset round-robin start for each block 9457ec681f3Smrg- ir3/ra: Use killed sources in register eviction 9467ec681f3Smrg- ir3/cp: Add missing const promotion check 9477ec681f3Smrg- ir3/cp: Fix inlining 32->16 const into meta instructions 9487ec681f3Smrg- nir/lower_ubo_vec4: Fix align_mul=8 special case 9497ec681f3Smrg- ir3: Fix printing branch type 9507ec681f3Smrg- ir3: Make ir3_create_collect() take a block 9517ec681f3Smrg- ir3: Always create barycentrics in the input block 9527ec681f3Smrg- ir3: Remove separate regmask.h 9537ec681f3Smrg- ir3: Handle special regs in regmask 9547ec681f3Smrg- ir3/legalize: handle WAR for special regs 9557ec681f3Smrg- ir3: Fix check for immediate range 9567ec681f3Smrg- ir3: Fix handling cat6 immediates 9577ec681f3Smrg- ir3: Fold ldc src immediates 9587ec681f3Smrg- ir3/spill: Mark root as non-spillable after inserting 9597ec681f3Smrg 9607ec681f3SmrgCorentin Noël (8): 9617ec681f3Smrg 9627ec681f3Smrg- ci: actually run piglit tests with virgl 9637ec681f3Smrg- ci: Re-enable piglit trace for virgl 9647ec681f3Smrg- ci: Disable llvmpipe optimizations when running virgl CI 9657ec681f3Smrg- ci: Increase the default Rust toolchain version 9667ec681f3Smrg- ci: Increase crosvm version 9677ec681f3Smrg- ci: Use crosvm to run dEQP tests for virgl 9687ec681f3Smrg- glx: Prevent crashes when an extension isn't found 9697ec681f3Smrg- virgl: Set GL_QUADS_FOLLOW_PROVOKING_VERTEX_CONVENTION to 1 9707ec681f3Smrg 9717ec681f3SmrgDaniel Schürmann (54): 9727ec681f3Smrg 9737ec681f3Smrg- aco/optimizer: ensure to not erase high bits when propagating packed constants 9747ec681f3Smrg- aco/ra: don't allocate vector space for MIMG NSA operands 9757ec681f3Smrg- aco: include <cstddef> in aco_util.h 9767ec681f3Smrg- nir/lower_alu_to_scalar: don't skip gaps in write_mask 9777ec681f3Smrg- nir/opt_shrink_vectors: don't shrink vectors used by intrinsics 9787ec681f3Smrg- nir: consider write_mask in nir_ssa_def_components_read() 9797ec681f3Smrg- nir/opt_shrink_vectors: reverse iteration order 9807ec681f3Smrg- nir/shrink_vectors: shrink ALU properly 9817ec681f3Smrg- nir/shrink_vectors: shrink vecN properly 9827ec681f3Smrg- nir: return false for loops in contains_other_jump() 9837ec681f3Smrg- aco/print_ir: fix printing of VOPC_SDWA definitions 9847ec681f3Smrg- aco: use VOPC_SDWA on GFX9+ 9857ec681f3Smrg- aco: add instr_is_16bit() helper function 9867ec681f3Smrg- aco/ra: refactor subdword definition info 9877ec681f3Smrg- aco/ra: refactor subdword operand stride 9887ec681f3Smrg- aco/validate: simplify get_subdword_bytes_written() 9897ec681f3Smrg- aco/opcodes: remove definition_size[] 9907ec681f3Smrg- aco: add more validation rules for SDWA operands 9917ec681f3Smrg- nir/loop_analyze: consider instruction cost of nir_op_flrp 9927ec681f3Smrg- nir/opt_algebraic: optimize flrp(fadd, fadd, x) only if fadd are used_once 9937ec681f3Smrg- radv: call nir_lower_flrp() after the first radv_optimize_nir() 9947ec681f3Smrg- aco: remove redundant s_and exec after nir_op_inot 9957ec681f3Smrg- aco: only apply extract if not used more than 4 times 9967ec681f3Smrg- aco: refactor nir_op_imul selection 9977ec681f3Smrg- aco/optimizer: combine v_mul_lo_u16 + v_add_u16 -> v_mad_u16 9987ec681f3Smrg- aco/optimizer: fuse v_mul_f64 + v_add_f64 -> v_fma_f64 9997ec681f3Smrg- aco/optimizer: combine v_pk_mul_u16 + v_pk_add_u16 -> v_pk_mad_u16 10007ec681f3Smrg- aco: fix init_any_pred_defined() for loop header phis 10017ec681f3Smrg- aco: refactor lower_phis() 10027ec681f3Smrg- aco/lower_bool_phis: avoid creating trivial phis 10037ec681f3Smrg- aco/lower_phis: propagate constants before emitting merge code 10047ec681f3Smrg- aco/lower_phis: optimize loop exit phis 10057ec681f3Smrg- aco: fix p_insert lowering with 16bit sources 10067ec681f3Smrg- aco: rewrite SDWA selector 10077ec681f3Smrg- aco: remove explicit dst_preserve flag 10087ec681f3Smrg- aco/print_ir: always print SDWA dst & src selections 10097ec681f3Smrg- aco: preserve subdword RC when lowering p_insert/p_extract 10107ec681f3Smrg- aco/ra: Fix potential out-of-bounds array accesses. 10117ec681f3Smrg- aco/ra: don't copy linear VGPRs within CF in get_reg_create_vector() 10127ec681f3Smrg- aco: stop scheduling if clause-forming fails 10137ec681f3Smrg- aco: make clause-forming depend on the number of moved instructions 10147ec681f3Smrg- aco: try forming clauses even if reg_pressure exceeds 10157ec681f3Smrg- aco: clang-format 10167ec681f3Smrg- aco/ra: fix intersects() 10177ec681f3Smrg- aco/ra: refactor affinities into assignment struct 10187ec681f3Smrg- aco/ra: remove some redundant code 10197ec681f3Smrg- aco/ra: split register assignment for phis into separate function 10207ec681f3Smrg- aco/ra: try more aggressive to assign phi defs the same register 10217ec681f3Smrg- aco/ra: for phis try to find an operand-matching register earlier 10227ec681f3Smrg- aco/ra: don't set affinities for ssa-repair phis 10237ec681f3Smrg- aco/ra: create affinities between nested phis 10247ec681f3Smrg- aco/ra: create nested affinities for loop header phis 10257ec681f3Smrg- aco/ra: don't rewrite affinities for phi operands after register assignment 10267ec681f3Smrg- driconf: set vk_x11_strict_image_count for Wolfenstein: Youngblood 10277ec681f3Smrg 10287ec681f3SmrgDaniel Stone (7): 10297ec681f3Smrg 10307ec681f3Smrg- vulkan/wsi/wayland: Cosmetic alignment fix 10317ec681f3Smrg- vulkan/wsi/wayland: Initialise wl_shm pointer in VkImage 10327ec681f3Smrg- egl/wayland: Error on invalid native window 10337ec681f3Smrg- egl/wayland: Allow EGLSurface to outlive wl_egl_window 10347ec681f3Smrg- CI: Disable LAVA devices 10357ec681f3Smrg- Revert "CI: Disable LAVA devices" 10367ec681f3Smrg- fdno/resource: Rewrite layout selection for allocation 10377ec681f3Smrg 10387ec681f3SmrgDanylo Piliaiev (39): 10397ec681f3Smrg 10407ec681f3Smrg- freedreno: fix wrong tile aligment for 3 CCU gpu 10417ec681f3Smrg- tu: handle half-reg fs outputs 10427ec681f3Smrg- tu: delay decision of forcing sysmem due to subpass self-dependencies 10437ec681f3Smrg- turnip: reduce maxComputeWorkGroupSize 10447ec681f3Smrg- tu: disable gmem in primary cmdbuffer if secondary has it disabled 10457ec681f3Smrg- tu: add "flushall" and "syncdraw" debug options 10467ec681f3Smrg- freedreno/decode: print estimated crash location without colored output 10477ec681f3Smrg- tu: declare VK_EXT_extended_dynamic_state2 but leave it disabled 10487ec681f3Smrg- tu: implement dynamic depth bias enable 10497ec681f3Smrg- tu: implement dynamic primitive restart enable 10507ec681f3Smrg- tu: implement dynamic rasterizer discard enable 10517ec681f3Smrg- tu: enable VK_EXT_extended_dynamic_state2 10527ec681f3Smrg- turnip: provide dummy CmdSetLogicOpEXT and CmdSetPatchControlPointsEXT 10537ec681f3Smrg- freedreno: rename Z_TEST_ENABLE->Z_READ_ENABLE, Z_ENABLE->Z_TEST_ENABLE 10547ec681f3Smrg- turnip: apply workaround for depth bounds test without depth test 10557ec681f3Smrg- ir3: prohibit folding of half->full conversion into mul.s24/u24 10567ec681f3Smrg- ir3/a6xx,freedreno: account for resinfo return size dependency on IBO_0_FMT 10577ec681f3Smrg- turnip: consider shader's immediates size for sub-stream allocation 10587ec681f3Smrg- turnip: re-emit vertex params after they are invalidated 10597ec681f3Smrg- util/u_trace: make u_trace usable for other than gallium drivers 10607ec681f3Smrg- util/u_trace: auto-generation of serialization funcs for tracepoints 10617ec681f3Smrg- turnip: implement basic perfetto support 10627ec681f3Smrg- u_trace: helpers for tracing tiling GPUs and re-usable VK cmdbuffers 10637ec681f3Smrg- turnip/perfetto: reusable command buffers support 10647ec681f3Smrg- u_trace: pass command stream through tracing functions 10657ec681f3Smrg- turnip: support tracing of gmem/sysmem load/store/clears 10667ec681f3Smrg- turnip/kgsl: fix compilation after perfetto introduction 10677ec681f3Smrg- turnip: consider multiview_mask when clearing depth-stencil attachment 10687ec681f3Smrg- turnip: Move to common DEFINE_HANDLE_CASTS casting macro 10697ec681f3Smrg- turnip: clamp per-tile scissors to max viewport size in binning pass 10707ec681f3Smrg- turnip: fix vbs emission when there are holes in bindings 10717ec681f3Smrg- ir3: remove obsolete assert for intrinsic_store_output in tess 10727ec681f3Smrg- turnip: do nothing on dispatch with zero total workgroups 10737ec681f3Smrg- ir3: support source modes for resinfo.b 10747ec681f3Smrg- ir3/freedreno: handle non-uniform resinfo 10757ec681f3Smrg- ir3/freedreno: handle non-uniform a1en instructions 10767ec681f3Smrg- turnip: fix streamout buffer offset calculations 10777ec681f3Smrg- ir3/ra: Check register file upper bound when updating preferred_reg 10787ec681f3Smrg- tu: fix rast state allocation size on a6xx gen4 10797ec681f3Smrg 10807ec681f3SmrgDave Airlie (134): 10817ec681f3Smrg 10827ec681f3Smrg- lvp: fixup multi draw memcpys 10837ec681f3Smrg- lavapipe: fix multi-draw regression in shader parameters test 10847ec681f3Smrg- lavapipe: fix indexed multi draw draw_id increment 10857ec681f3Smrg- draw: handle resetting draw_id between instances. 10867ec681f3Smrg- softpipe/aniso: move DDQ calculation to after scaling. 10877ec681f3Smrg- wl/shm: don't fetch formats if not requested. 10887ec681f3Smrg- clover/il: return IL only for spirv and correct length 10897ec681f3Smrg- gallivm: add anisotropic filter weight table. 10907ec681f3Smrg- draw: add shader access to aniso filter table. 10917ec681f3Smrg- llvmpipe: add filter table shader accessor 10927ec681f3Smrg- gallivm: add support for anisotropic sampling. 10937ec681f3Smrg- llvmpipe: add support for max aniso query. 10947ec681f3Smrg- draw: add sampler max_aniso query. 10957ec681f3Smrg- llvmpipe: enable GL_ARB_texture_filter_anisotropic 10967ec681f3Smrg- llvmpipe/virgl/ci: update traces for aniso 10977ec681f3Smrg- docs: update anisotropic info for softpipe/llvmpipe/lavapipe 10987ec681f3Smrg- crocus/gen4-5: fix ff gs emit on VS vue map change. 10997ec681f3Smrg- llvmpipe/linear: fix ppc64/s390 build 11007ec681f3Smrg- llvmpipe: add some extra linear rast checks. 11017ec681f3Smrg- llvmpipe: add support for time elapsed queries. 11027ec681f3Smrg- llvmpipe: rework query fence signalling for get_query_result_resource 11037ec681f3Smrg- gallivm/img: use uint for image coord builder. 11047ec681f3Smrg- draw/llvmpipe: multiply polygon offset units by 2 11057ec681f3Smrg- teximage: return correct desktop GL error for compressedteximage 11067ec681f3Smrg- crocus/gen4: restrict memcpy mapping to gen5 11077ec681f3Smrg- intel/fs: restrict max push length on older GPUs to a smaller amount 11087ec681f3Smrg- intel/decode: add gfx4 constant buffer decode 11097ec681f3Smrg- intel/decode: add gfx4 vertex shader decode 11107ec681f3Smrg- crocus/gen45: fix mapping compressed textures 11117ec681f3Smrg- intel/genxml: fix raster operation field in blt genxml 11127ec681f3Smrg- crocus: add support for set alpha to one with blt. 11137ec681f3Smrg- virgl: disable anisotropic filtering. 11147ec681f3Smrg- virgl: add support for anisotropic texture filtering 11157ec681f3Smrg- ci: bump to latest virglrenderer for anisotropic support 11167ec681f3Smrg- clover/llvm: turn off optional CL 3 features. 11177ec681f3Smrg- nir/libclc: handle null callee name when lowering 11187ec681f3Smrg- vtn: add support for atomic flag test/set/clear 11197ec681f3Smrg- nir: add 32-bit bool of fisfinite 11207ec681f3Smrg- nir: add fisnormal lowering 11217ec681f3Smrg- gallivm: handle fisfinite/fisnormal 11227ec681f3Smrg- clover: fix api zero sized enqueue 11237ec681f3Smrg- clover: return CL_INVALID_PLATFORM properly. 11247ec681f3Smrg- clover: add kernel attributes support for SPIR-V 11257ec681f3Smrg- clover: fix compilation with clang + llvm 12. 11267ec681f3Smrg- clover/nir: don't convert to NIR on library link 11277ec681f3Smrg- clover: only return CLC version as 1.2 (even for 3.0) 11287ec681f3Smrg- llvmpipe: add support for user memory pointers 11297ec681f3Smrg- lavapipe: add host ptr support. 11307ec681f3Smrg- docs: add llvmpipe host memory extensions 11317ec681f3Smrg- crocus/blt: add pitch/offset checks to fix blt corruption 11327ec681f3Smrg- crocus: align staging resource pitch on gen4/5 to allow BLT usage. 11337ec681f3Smrg- intel/vec4: sel.cond writes the flags on Gfx4 and Gfx5 11347ec681f3Smrg- draw: handle primitive ID for quads/quad strips. 11357ec681f3Smrg- draw/gs: add clipvertex support for compatibility 11367ec681f3Smrg- draw/tess: add clipvertex support for compatibility 11377ec681f3Smrg- draw: add vertex color clamping to gs/tes 11387ec681f3Smrg- llvmpipe: enable GL compatibility profiles 11397ec681f3Smrg- gallivm: don't lower local invocation index in frontend 11407ec681f3Smrg- llvmpipe/cl: limit kernel input size. 11417ec681f3Smrg- gallivm: fix idiv/irem for 8/16/64-bit and 32-bit INT_MIN/-1 11427ec681f3Smrg- gallivm: fix non-32 bit popcounts. 11437ec681f3Smrg- llvmpipe: init renderer string once to avoid races. 11447ec681f3Smrg- vulkan/wsi/sw: wait for image fence before submitting to queue 11457ec681f3Smrg- crocus: copy views before adjusting 11467ec681f3Smrg- crocus: drop u_primconvert header. 11477ec681f3Smrg- crocus: add missing line smooth bits. 11487ec681f3Smrg- crocus: add missing fs dirty on reduced prim change. 11497ec681f3Smrg- vulkan/wsi: add support for detecting mit-shm pixmaps. 11507ec681f3Smrg- vulkan/wsi/sw: add support for using host_ptr for shm pixmaps. 11517ec681f3Smrg- vulkan/wsi/sw: add mit-shm support for pixmap allocation 11527ec681f3Smrg- meson: fix regression finding shm dep 11537ec681f3Smrg- llvmpipe/fs: fix multisample depth/stencil fs writes. 11547ec681f3Smrg- llvmpipe: consolidate scissor plane code between line/tri 11557ec681f3Smrg- llvmpipe/scissor: rewrite scissor planes interaction. 11567ec681f3Smrg- llvmpipe: adjust scissor planes for multisample. 11577ec681f3Smrg- gallium: add a sample0 only option to blitter. 11587ec681f3Smrg- u_blitter: add support for sample0 only resolves. 11597ec681f3Smrg- lavapipe: VK_KHR_depth_stencil_resolve support 11607ec681f3Smrg- crocus/gen7: add missing IVB/GT2 geom shader workaround. 11617ec681f3Smrg- intel/decode/gfx6: add support for gfx6 CC/VIEWPORT pointers. 11627ec681f3Smrg- gallivm/ssbo: fix up dynamic indexed ssbo load/stores/atomics 11637ec681f3Smrg- gallivm/ssbo: cast ssbo index to int type. 11647ec681f3Smrg- lavapipe: enable dynamic index ubo/ssbo 11657ec681f3Smrg- llvmpipe/cs: rework thread pool for avoid mtx locking 11667ec681f3Smrg- gallivm/coro: use a phi instead of alloca 11677ec681f3Smrg- llvmpipe: shorten hold time on the screen mutex 11687ec681f3Smrg- llvmpipe/cs: rework coroutine context handling (v2) 11697ec681f3Smrg- gallivm: add initial support for 16-bit float builder. 11707ec681f3Smrg- gallivm/nir: handle conversion to 16-bit texel fetch 11717ec681f3Smrg- gallivm/nir: fix f2b32 11727ec681f3Smrg- gallivvm/nir: handle non-32bit mask scatter stores 11737ec681f3Smrg- gallivm: add 16-bit sin/cos via llvm intrinsic 11747ec681f3Smrg- llvmpipe: lower_flrp16 11757ec681f3Smrg- gallivm/nir: handle 16-bit exp/lod using intrinsics. 11767ec681f3Smrg- gallivm/nir: call pow with correct flt builder 11777ec681f3Smrg- gallivm/nir: pass the correct float builder to ddx/y 11787ec681f3Smrg- gallivm: increase tgsi nesting call stack size 11797ec681f3Smrg- gallivm: use llvm intrinsics for 16-bit round/trunc/roundeven 11807ec681f3Smrg- llvmpipe: enable FP16 and update CL + traces piglit results. 11817ec681f3Smrg- lavapipe: enable KHR_shader_float16_int8 11827ec681f3Smrg- gallivm/nir: handle subgroup reduction across all types 11837ec681f3Smrg- lavapipe: enable KHR_shader_subgroup_extended_types 11847ec681f3Smrg- docs: update docs for new llvmpipe/lavapipe features 11857ec681f3Smrg- lavapipe: enable KHR_spirv_1_4 11867ec681f3Smrg- lavapipe: fix vertex attributes/descriptor binding 11877ec681f3Smrg- lavapipe: don't access pColorBlendState when not legal 11887ec681f3Smrg- gallium/format: move two vertex formats into the proper place. 11897ec681f3Smrg- lavapipe/ci: drop some fails I fixed recently 11907ec681f3Smrg- lavapipe: move to 1.2 features/properties structs. 11917ec681f3Smrg- gallivm/nir: fix subgroup invocation read. 11927ec681f3Smrg- lavapipe: enable vulkan 1.2 support. 11937ec681f3Smrg- lavapipe: move to new shared features/properties 11947ec681f3Smrg- lavapipe: cleanup image create function. 11957ec681f3Smrg- lavapipe: fixup image binding flags. 11967ec681f3Smrg- llvmpipe: overhaul fs/cs variant keys to be simpler. 11977ec681f3Smrg- gallivm: use pmulhrsw to make aos sampling more accurate. 11987ec681f3Smrg- crocus/gen6: don't reemit the svbi when debugging 11997ec681f3Smrg- crocus/query: don't loop on ready status after gpu hang. 12007ec681f3Smrg- gallivm/format: clamp SINT conversion rather than truncate. 12017ec681f3Smrg- llvmpipe/cs: change submission pattern for threadpool 12027ec681f3Smrg- llvmpipe: fix 4-bit output scaling. 12037ec681f3Smrg- lvp/fence: quick fix to previous commit. 12047ec681f3Smrg- device_select: close dri3 fd after using it. 12057ec681f3Smrg- wsi/x11: cleanup properly after mit shm paths are used. 12067ec681f3Smrg- Revert "lvp/fence: quick fix to previous commit." 12077ec681f3Smrg- lavapipe: fix fence handling around wsi submission 12087ec681f3Smrg- crocus: Honor scanout requirement from DRI 12097ec681f3Smrg- crocus/gen5: reemit shaders on gen5 after new program cache bo. 12107ec681f3Smrg- crocus/gen5: add dirty flags for urb fences. 12117ec681f3Smrg- llvmpipe: fix userptr for texture resources. 12127ec681f3Smrg- lavapipe: drop EXT_acquire_xlib_display 12137ec681f3Smrg- vulkan/wsi: set correct bits for host allocations/exports for images. 12147ec681f3Smrg- llvmpipe: disable 64-bit integer textures. 12157ec681f3Smrg- llvmpipe: fix compressed image sizes. 12167ec681f3Smrg 12177ec681f3SmrgDerek Foreman (2): 12187ec681f3Smrg 12197ec681f3Smrg- egl/wayland: Support RGBA ordered formats 12207ec681f3Smrg- egl/wayland: Properly clear stale buffers on resize 12217ec681f3Smrg 12227ec681f3SmrgDmitry Baryshkov (1): 12237ec681f3Smrg 12247ec681f3Smrg- freedreno/regs: add bit to control continuous clock with 7nm PHYs 12257ec681f3Smrg 12267ec681f3SmrgDylan Baker (19): 12277ec681f3Smrg 12287ec681f3Smrg- VERSION: bump version for 21.3 development cycle 12297ec681f3Smrg- docs/relnotes/new_features: empty for next release cycle 12307ec681f3Smrg- docs: update calendar for 21.2.0-rc1 12317ec681f3Smrg- docs: mark mesa 21.0 as done 12327ec681f3Smrg- freedreno/ir3: Add build id to the disassembler test 12337ec681f3Smrg- docs: add release notes for 21.2.0 12347ec681f3Smrg- docs: update calendar for 21.2.0-rc2 12357ec681f3Smrg- docs: update calendar for 21.2.0-rc3 12367ec681f3Smrg- docs: update calendar and link releases notes for 21.2.0 12377ec681f3Smrg- docs: Add calendar entries for 21.2 release. 12387ec681f3Smrg- bin/gen_release_notes: Add basic tests for parsing issues 12397ec681f3Smrg- bin/gen_release_notes: Don't consider issues for other projects 12407ec681f3Smrg- bin/gen_release_notes: Fix commits with multiple Closes: 12417ec681f3Smrg- docs: add release notes for 21.2.2 12427ec681f3Smrg- docs/relnotes/21.2.2: Add SHA256 sum 12437ec681f3Smrg- docs: update calendar and link releases notes for 21.2.2 12447ec681f3Smrg- docs: add release notes for 21.2.3 12457ec681f3Smrg- docs" Add SHA256 sum for mesa 21.2.3 12467ec681f3Smrg- docs: update calendar and link releases notes for 21.2.3 12477ec681f3Smrg 12487ec681f3SmrgEd Baker (1): 12497ec681f3Smrg 12507ec681f3Smrg- frontends/va: Fix test_va_api VAAPIDisplayAttribs tests 12517ec681f3Smrg 12527ec681f3SmrgEd Martin (1): 12537ec681f3Smrg 12547ec681f3Smrg- winsys/radeonsi: Set vce_encode = true when VCE found 12557ec681f3Smrg 12567ec681f3SmrgEduardo Lima Mitev (1): 12577ec681f3Smrg 12587ec681f3Smrg- turnip: Add support for VK_VALVE_mutable_descriptor_type 12597ec681f3Smrg 12607ec681f3SmrgElla-0 (13): 12617ec681f3Smrg 12627ec681f3Smrg- v3dv: Add is_unorm, is_snorm and is_float format functions 12637ec681f3Smrg- v3dv: Implement VK_EXT_custom_border_color 12647ec681f3Smrg- v3dv: implement VK_EXT_color_write_enable 12657ec681f3Smrg- v3dv: Implement VK_EXT_pipeline_creation_cache_control 12667ec681f3Smrg- v3dv: Implement VK_EXT_provoking_vertex 12677ec681f3Smrg- v3dv: Implement VK_EXT_pipeline_creation_feedback 12687ec681f3Smrg- v3d/compiler: Handle point_coord_upper_left 12697ec681f3Smrg- v3d: Don't handle PIPE_SPRITE_COORD_UPPER_LEFT twice 12707ec681f3Smrg- v3dv: Expose correct point size granularity 12717ec681f3Smrg- v3dv: Implement VK_EXT_vertex_attribute_divisor 12727ec681f3Smrg- ci/v3dv: Update fails with multiview failing with points 12737ec681f3Smrg- v3d: add R10G10B10X2_UNORM to format table 12747ec681f3Smrg- v3dv: enable VK_KHR_surface_protected_capabilities 12757ec681f3Smrg 12767ec681f3SmrgEmma Anholt (233): 12777ec681f3Smrg 12787ec681f3Smrg- nir: Validate after deserialization. 12797ec681f3Smrg- nir_to_tgsi: Fix image declarations. 12807ec681f3Smrg- gallium/ttn: Add a debug flag for dumping the shaders. 12817ec681f3Smrg- freedreno/ir3: Reduce choose_instr_dec() and _inc() overhead. 12827ec681f3Smrg- gallium/ureg: Sort the output decls. 12837ec681f3Smrg- freedreno: Lock access to msm_pipe for RB object suballocation. 12847ec681f3Smrg- ci/freedreno: Enable the MSAA deqp tests. 12857ec681f3Smrg- gallivm: Default brilinear filtering to off. 12867ec681f3Smrg- gallivm: Always take the per-pixel LOD path for cubemaps. 12877ec681f3Smrg- i915g: Add support for shader-db. 12887ec681f3Smrg- nir_to_tgsi: Pack our tex coords into vec4 nir_tex_src_backend[12]. 12897ec681f3Smrg- nir_to_tgsi: Add support for TXP. 12907ec681f3Smrg- nir_to_tgsi: Add support for HW atomics. 12917ec681f3Smrg- nir_to_tgsi: Declare buffers for all of num_ssbos. 12927ec681f3Smrg- nir_to_tgsi: Add support for nir_intrinsic_load_sample_pos. 12937ec681f3Smrg- turnip: Fix assertions on checking mutable combined samplers support. 12947ec681f3Smrg- gallium/dri2: Make dri_init_options just init DRI options. 12957ec681f3Smrg- gallium/driconf: Allow the driver to parse the driconf options. 12967ec681f3Smrg- ci: Stop disabling filter hacks for llvmpipe. 12977ec681f3Smrg- ci/i915: Update deqp expectations for another test passing. 12987ec681f3Smrg- ci: Uprev deqp-runner and use "suite" support to merge softpipe runs. 12997ec681f3Smrg- ci/llvmpipe: Use the deqp-runner suite support to consolidate jobs. 13007ec681f3Smrg- ci/i915g: Merge the two dEQP runs together. 13017ec681f3Smrg- ci: Save dEQP results on all tests. 13027ec681f3Smrg- ci/virgl: Use deqp-runner suite support to reduce CI job count. 13037ec681f3Smrg- ci/zink: Use deqp-runner suite support to reduce the CI job count. 13047ec681f3Smrg- ci: Update piglit to 4545a28cd8fea03fbab0e5f90bfbd812c32f3be1 13057ec681f3Smrg- ci/freedreno: Clear out TF API errors xfails. 13067ec681f3Smrg- freedreno/a5xx: Disable TF when pausing or transitioning to non-TF. 13077ec681f3Smrg- freedreno/a5xx: Don't try to emit FS images in binning command streams. 13087ec681f3Smrg- ci/freedreno: Mark border_color as passing on a5xx. 13097ec681f3Smrg- ci/a5xx: Skip some piglit stress tests that destabilize CI. 13107ec681f3Smrg- ci/freedreno: Organize, fill out, and document our VK xfails. 13117ec681f3Smrg- ci/freedreno: Generalize the spirv_ids_abuse skips. 13127ec681f3Smrg- ci/freedreno: Clean up and fill out the tess timeout annotations. 13137ec681f3Smrg- ci/freedreno: Skip the slow dEQP-VK.ubo.random.all_shared_buffer.48 in CI. 13147ec681f3Smrg- ci/freedreno: Add jobs to manually do a full VK on freedreno. 13157ec681f3Smrg- i915g: Use the devmaster quadratic approximation for sin/cos. 13167ec681f3Smrg- i915g: Reapply clang-format. 13177ec681f3Smrg- nir: Move phi src setup to a helper. 13187ec681f3Smrg- i915g: Make the 1D workaround keep TXP's .w channel in the right spot. 13197ec681f3Smrg- i915g: Add support for blitting compressed textures. 13207ec681f3Smrg- i915g: Add missing support for sRGB S3TC. 13217ec681f3Smrg- i915g: Fix up the format mapping for DXT1_*RGB 13227ec681f3Smrg- i915g: Add support for FXT1. 13237ec681f3Smrg- i915g: Fix 3D texture layouts for width != height. 13247ec681f3Smrg- i915g: Implement cube/3d texture_subdata() as a series of per-layer maps. 13257ec681f3Smrg- ci/turnip: Add a new flake from running more of the CTS. 13267ec681f3Smrg- ci/freedreno: Move freedreno's deqp testing to suite support. 13277ec681f3Smrg- freedreno/a6xx: Apply the cube image size lowering to GL, too. 13287ec681f3Smrg- freedreno/ir3: Only lower cube image sizes once. 13297ec681f3Smrg- freedreno/ir3: Use the resinfo path for ssbo sizes on GL, too. 13307ec681f3Smrg- freedreno/ir3: Move a6xx's get_ssbo_size shl to NIR. 13317ec681f3Smrg- freedreno/a6xx: Skip setting up image dims constants. 13327ec681f3Smrg- freedreno/a5xx: Use ST4\_ constants for SSBO/image state types. 13337ec681f3Smrg- freedreno/a5xx: Reduce packet emits for SSBO state. 13347ec681f3Smrg- ci/freedreno: Mark a new flaky SSBO length test. 13357ec681f3Smrg- ci/freedreno: Flake the rest of the pbuffer/window dEQP-EGL tests. 13367ec681f3Smrg- i915g: Fix polygon offset by telling draw the Z format. 13377ec681f3Smrg- i915g: Correct PIPE_SHADER_CAP_MAX_TEMPS. 13387ec681f3Smrg- i915g: Reduce ARB_fp max tex indirections to match i915c. 13397ec681f3Smrg- i915g: Clear some xfails that are now skips. 13407ec681f3Smrg- i915g: Add comments explaining various xfails. 13417ec681f3Smrg- i915g: clang-format fixup. 13427ec681f3Smrg- freedreno/ir3: Apply the a6xx samgq workaround to TES/TCS/GS as well. 13437ec681f3Smrg- freedreno/ir3: Align driver param upload size/offset for indirect uploads. 13447ec681f3Smrg- freedreno/a6xx: Sync TFB BO access against prior TFB writes. 13457ec681f3Smrg- ci/lavapipe: Add a fractional run with ASan 13467ec681f3Smrg- ci/llvmpipe: Add a fractional ASan run. 13477ec681f3Smrg- nir: Set .driver_location for GLSL UBO/SSBOs when we lower to block indices. 13487ec681f3Smrg- nir/nir_lower_uniforms_to_ubo: Set the explicit stride of the UBO 0 uniform. 13497ec681f3Smrg- nir_to_tgsi: Use explicit sizes of NIR variables for UBO declarations. 13507ec681f3Smrg- ci/freedreno: Annotate a bunch of piglit fails/crashes. 13517ec681f3Smrg- ci/freedreno: Add a bunch of recent a530 and a630 flakes. 13527ec681f3Smrg- ci/v3dv: generalize the buffer_access.through_pointers flakes. 13537ec681f3Smrg- ci/freedreno: Fix xfail update for arb_draw_indirect. 13547ec681f3Smrg- freedreno/ir3: Don't use isam for coherent image loads on a6xx. 13557ec681f3Smrg- freedreno/ir3: Clarify what's going on in a4xx SSBO atomics. 13567ec681f3Smrg- freedreno/ir3: Refactor a3xx ibo/ssbo load/store instruction XML. 13577ec681f3Smrg- freedreno/ir3: Add encode/decode support for a5xx's LDIB. 13587ec681f3Smrg- freedreno/ir3: Use LDIB for coherent image loads on a5xx. 13597ec681f3Smrg- osmesa: Add a unit test for resizing buffers. 13607ec681f3Smrg- cso: Revert using FS sampler count for other stages at context unbind. 13617ec681f3Smrg- mesa/st: Add an assertion for finalize_nir versus PIPE_CAP_TEXCOORD. 13627ec681f3Smrg- i915g: Simplify the process of texcoord mapping to TGSI semantics. 13637ec681f3Smrg- i915g: Expose PIPE_CAP_TGSI_TEXCOORD. 13647ec681f3Smrg- i915g: Add finalize_nir. 13657ec681f3Smrg- mesa/st: Add an optional GLSL link fail msg to finalize_nir. 13667ec681f3Smrg- i915g: Reject non-unrolled loops or non-flattend IFs at link time. 13677ec681f3Smrg- ci/iris: Mark create_context-no_error as failing. 13687ec681f3Smrg- ci/iris: Unmark dma_buf_import_export tests as failing. 13697ec681f3Smrg- ci/iris: Consistently use .test-manual-mr for our unstable hardware. 13707ec681f3Smrg- ci/iris: Switch GL/GLES testing to suites. 13717ec681f3Smrg- freedreno/a6xx: Emit a WFI after event writes flushing CCU. 13727ec681f3Smrg- ci/freedreno: Fix typo in glx-tfp flake annotation. 13737ec681f3Smrg- ci/freedreno: Mark a630 basic-glsl-misc-fs as flaky. 13747ec681f3Smrg- ci/freedreno: Skip slow SizedDeclarationsPrimitive in CI. 13757ec681f3Smrg- llvmpipe: Free CS shader images on context destroy. 13767ec681f3Smrg- llvmpipe: Fix leak of CS local memory with 0 threads. 13777ec681f3Smrg- llvmpipe: memcpy user_buffers at set_constant_buffer time. 13787ec681f3Smrg- nir_to_tgsi: Fix indirect addressing of atomic counters. 13797ec681f3Smrg- nir_to_tgsi: Don't forget to add sampler views with our samplers. 13807ec681f3Smrg- nir_to_tgsi: Add support for memory_barrier_tcs_patch. 13817ec681f3Smrg- nir_to_tgsi: Clean up some unnecessary pointers-to-uregs. 13827ec681f3Smrg- nir_to_tgsi: Switch ssa_temp[] to be a ureg_src. 13837ec681f3Smrg- nir_to_tgsi: Allow SSA defs to include swizzles, abs, and neg. 13847ec681f3Smrg- mesa: Move the advanced blend bitmask to shader_info. 13857ec681f3Smrg- nir: Add a nir_instr_free() to replace ralloc_free(instr). 13867ec681f3Smrg- nir: Pull the instr list free function out to a helper. 13877ec681f3Smrg- nir/from_ssa: Use nir_instr_free() to free instrs instead of ralloc. 13887ec681f3Smrg- nir: Consistently pass the shader to the shader arg of instr creation. 13897ec681f3Smrg- nir: Consistently pass the instr to nir_src_copy(). 13907ec681f3Smrg- nir: Add all allocated instructions to a GC list. 13917ec681f3Smrg- nir/lower_phis_to_scalar: Use nir_instr_free() to free instrs. 13927ec681f3Smrg- nir/tests: Fix transmuting an SSA dest to be non-SSA 13937ec681f3Smrg- nir: Switch from ralloc to malloc for NIR instructions. 13947ec681f3Smrg- nir: Drop the unused instr arg for src/dest copy functions. 13957ec681f3Smrg- ci/freedreno: Drop minetest from a3xx trace testing. 13967ec681f3Smrg- freedreno: Precompute resource pointer hash values. 13977ec681f3Smrg- freedreno: Use TC's flag for whether get_query is in the driver thread. 13987ec681f3Smrg- freedreno: Move the batch cache to the context. 13997ec681f3Smrg- freedreno: Remove the submit lock locking. 14007ec681f3Smrg- freedreno: Use a BO bitset for faster checks for resource referenced. 14017ec681f3Smrg- freedreno: Remove dead fd_batch_reset(). 14027ec681f3Smrg- ci/i915g: Clarify failure happening in fbo-fragcoord2. 14037ec681f3Smrg- mesa/st: Allow loops in GLSL when NIR is enabled, even if the HW can't. 14047ec681f3Smrg- freedreno: Fix autotune regression since batch-cache rework. 14057ec681f3Smrg- freedreno: Assert to check for the previous regression. 14067ec681f3Smrg- ci/freedreno: Add some cubearray piglit flakes on a630 I noticed. 14077ec681f3Smrg- ci/baremetal: Retry if our network device spontaneously fails. 14087ec681f3Smrg- ci/freedreno: Update restricted trace sha1s. 14097ec681f3Smrg- nir_to_tgsi: Remove the abs on fcsel's bool src. 14107ec681f3Smrg- freedreno/a5xx+: Rename GRAS_CNTL/RB_RENDER_CONTROL0 IJ_LINEAR_* bits. 14117ec681f3Smrg- freedreno/a5xx+: Set the IJ_LINEAR_* request bits if we need the regs. 14127ec681f3Smrg- tu: Move core features definitions to a helper function. 14137ec681f3Smrg- tu: Deduplicate extension/core feature flags. 14147ec681f3Smrg- tu: Add GetPhysicalDeviceFeatures2() support for more VK 1.2 core features. 14157ec681f3Smrg- tu: Move VK 1.1 core properties to a helper function and use macros for exts. 14167ec681f3Smrg- tu: Support VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_PROTECTED_MEMORY_PROPERTIES. 14177ec681f3Smrg- turnip: Move physical device 1.2 properties to a helper function. 14187ec681f3Smrg- mesa: Throw an error for compressed glGenerateMipmap on GLES2 contexts. 14197ec681f3Smrg- mesa: Prioritize checking for GLES2's uniform transpose error. 14207ec681f3Smrg- mesa: Fix missing CopyTexImage formats for OES_required_internalformat. 14217ec681f3Smrg- ci/vc4,i915g: Add links to VK-GL-CTS issues for some of our xfails. 14227ec681f3Smrg- vulkan: Add helpers for filling exts for core features and properties. 14237ec681f3Smrg- vulkan: Support PHYSICAL_DEVICE_1_n\_ features/properties in the helpers. 14247ec681f3Smrg- turnip: Use the shared now-in-core feature/prop extension helper functions. 14257ec681f3Smrg- anv: Use the shared now-in-core feature/prop extension helper functions. 14267ec681f3Smrg- radv: Use the shared now-in-core feature/prop extension helper functions. 14277ec681f3Smrg- vulkan: Update the XML and headers to 1.2.193 14287ec681f3Smrg- turnip: Set the VK_DRIVER_ID to our new enum. 14297ec681f3Smrg- turnip: Swizzle in 0, 1 for D24S8 STENCIL_ASPECT sampling. 14307ec681f3Smrg- turnip: Disable VK_EXT_display_control. 14317ec681f3Smrg- i915g: Improve debug output for the fresh-batch overflow case. 14327ec681f3Smrg- i915g: Remove dead VBUF_USE_POOL code. 14337ec681f3Smrg- i915g: Unifdef VBUF_MAP_BUFFER. 14347ec681f3Smrg- i915g: Use the non-vbuf code path by default to fix index overflows. 14357ec681f3Smrg- ci/freedreno: Disable flaky a530 for now. 14367ec681f3Smrg- gallium/dri: Make YUV formats we're going to emulate external-only. 14377ec681f3Smrg- turnip: Match the blob's format for vendorID and deviceID. 14387ec681f3Smrg- turnip: Expose a device name similar to the blob. 14397ec681f3Smrg- freedreno/rnndec: Fix use of undefined value_orig in the !ti case. 14407ec681f3Smrg- freedreno/rnndec: Avoid making 0-length variable length arrays. 14417ec681f3Smrg- freedreno/afuc: Avoid ubsan warns about shifting to the top bit of 'int' 14427ec681f3Smrg- freedreno: Fix UBSan failures in cffdec's (uint8_t)x << 24 14437ec681f3Smrg- freedreno: Reuse u_math.h instead of open coding ALIGN/ARRAY_SIZE. 14447ec681f3Smrg- freedreno: Reuse u_math.h instead of open coding uif(). 14457ec681f3Smrg- freedreno: Move afuc tests to meson unit tests. 14467ec681f3Smrg- freedreno: Move crashdec/cffdec tests to be meson unit tests. 14477ec681f3Smrg- freedreno: Move the headergen2 test to be meson unit tests. 14487ec681f3Smrg- panfrost: Disable flaky piglit job for now. 14497ec681f3Smrg- ci/freedreno: Restart the run if cheza spontenously reboots. 14507ec681f3Smrg- freedreno/tools: Fix build failure when cffdump isn't built but tests are. 14517ec681f3Smrg- freedreno/a6xx: Move the format table to common code. 14527ec681f3Smrg- freedreno/a6xx: Add int/scaled/snorm vertex formats to match turnip. 14537ec681f3Smrg- freedreno/a6xx: disable vertex fetch support flag for b8g8r8a8_srgb. 14547ec681f3Smrg- freedreno/a6xx: Add support for EXT_texture_sRGB_R8/RG8. 14557ec681f3Smrg- freedreno/a6xx: Drop texturing support from other scaled formats. 14567ec681f3Smrg- freedreno/a6xx: Add some more 16-bit rgb/rgba swaps to our format tables. 14577ec681f3Smrg- freedreno/a6xx+: Add support for the R8G8_R8B8 and G8R8_B8R8 formats. 14587ec681f3Smrg- util/format: Add an RGB planar format for YV12, like we have for NV12. 14597ec681f3Smrg- freedreno/a6xx: Put R8_G8_B8_420_UNORM in the format table. 14607ec681f3Smrg- freedreno/a6xx: Use fd6_pipe2tex() for the 2D src format. 14617ec681f3Smrg- freedreno/a6xx: Make the format table const. 14627ec681f3Smrg- freedreno/a6xx: Rewrite the format table format/swap helpers. 14637ec681f3Smrg- freedreno/a6xx: Add support for A/XRGB1555 formats. 14647ec681f3Smrg- freedreno/a6xx: Enable UBWC for RGBA5551 (and 1555) textures. 14657ec681f3Smrg- turnip: Give D32_SFLOAT_S8_UINT a native format. 14667ec681f3Smrg- turnip: Switch tu_format internals to using pipe_format more. 14677ec681f3Smrg- turnip: Do format lookups from the fd6 format table and cross-check. 14687ec681f3Smrg- turnip: Replace our format table with fd6_format_table. 14697ec681f3Smrg- i915g: Check for the scanout-layout conditions before setting level info. 14707ec681f3Smrg- mesa/st: Don't bump locations of patch vars for !PIPE_CAP_TEXCOORD. 14717ec681f3Smrg- nir_to_tgsi: Include txf_ms's sample index. 14727ec681f3Smrg- nir_to_tgsi: Add support for load_output/load_per_vertex_output. 14737ec681f3Smrg- gallium/ureg: Sort the input decls, too. 14747ec681f3Smrg- nir_to_tgsi: Add support for declaring image arrays. 14757ec681f3Smrg- nir_to_tgsi: Add support for load_barycentric_sample. 14767ec681f3Smrg- nir_to_tgsi: Add support for nir_intrinsic_load_barycentric_at_sample. 14777ec681f3Smrg- nir_to_tgsi: Turn GS PRIMID into an input instead of a sysval. 14787ec681f3Smrg- nir-to-tgsi: Avoid emitting TXL just for lod 0 on non-vertex shaders. 14797ec681f3Smrg- nir_to_tgsi: Sort FS output declarations to avoid virglrenderer bugs. 14807ec681f3Smrg- nir_to_tgsi: Add a workaround for virgl UBO array dynamic indexing. 14817ec681f3Smrg- nir_to_tgsi: Force the TXQ LOD argument to be scalar. 14827ec681f3Smrg- virgl: Add support for NIR shaders when VIRGL_DEBUG=nir. 14837ec681f3Smrg- turnip: Plug the vendor/device ID into the pipeline cache fields, too. 14847ec681f3Smrg- turnip: Fix allocation failure handling around device->name. 14857ec681f3Smrg- turnip: Free disk cache on pdev init failure. 14867ec681f3Smrg- ci/freedreno: Move the other a530 test jobs to test-manual-mr. 14877ec681f3Smrg- ci/freedreno: try to fix the a630 cubearray flake's regex. 14887ec681f3Smrg- ci/freedreno: Disable the minetest trace due to flaky shader code. 14897ec681f3Smrg- ci: Update deqp to vulkan-cts-1.2.7.1. 14907ec681f3Smrg- ci: Update piglit to 7d7dd2688c214e1b3c00f37226500cbec4a58efb. 14917ec681f3Smrg- radeonsi: Fix leak of screen->perfcounters. 14927ec681f3Smrg- Revert "ci: Add osmesa to Windows GitLab CI" 14937ec681f3Smrg- ci/deqp-runner: Drop SUMMARY_LIMIT env var. 14947ec681f3Smrg- ci/deqp-runner: Simplify the --jobs argument setup. 14957ec681f3Smrg- ci/deqp-runner: Use new deqp-runner's built-in renderer/version checks. 14967ec681f3Smrg- ci/deqp-runner: Drop silly CSV env vars. 14977ec681f3Smrg- ci/deqp-runner: Move remaining asan runs to --env LD_PRELOAD= 14987ec681f3Smrg- ci/deqp-runner: Drop LD_LIBRARY_PATH=/usr/local for libkms workaround. 14997ec681f3Smrg- ci/deqp-runner: Don't start GPU hang detection for making junit results. 15007ec681f3Smrg- ci/deqp-runner: Move more non-suite logic under the non-suite 'if'. 15017ec681f3Smrg- ci/piglit-runner: Fix funny indentation of the piglit-runner command. 15027ec681f3Smrg- ci/deqp-runner: Rename the deqp-drivername-\*.txt files to drivername-*.txt 15037ec681f3Smrg- ci/piglit-runner: Merge piglit-driver-\*.txt files into driver-*.txt. 15047ec681f3Smrg- ci: Enable testing radeonsi's libva using libva-util unit tests. 15057ec681f3Smrg- freedreno: Fix gmem invalidating the depth or stencil of packed d/s. 15067ec681f3Smrg- freedreno/a6xx: Fix partial z/s clears with sysmem. 15077ec681f3Smrg- freedreno/a6xx: Don't try to generate mipmaps for SNORM with our blitter. 15087ec681f3Smrg- freedreno/ir3: Fix off-by-one in prefetch safety assert. 15097ec681f3Smrg- freedreno/a6xx: Emit a null descriptor for unoccupied IBO slots. 15107ec681f3Smrg- mesa/st: Disable NV_copy_depth_to_color on non-doubles-capable HW. 15117ec681f3Smrg 15127ec681f3SmrgEmmanuel Gil Peyrot (3): 15137ec681f3Smrg 15147ec681f3Smrg- radv: Support device initialization without LLVM dependencies 15157ec681f3Smrg- radv: Support shader compilation without LLVM dependencies 15167ec681f3Smrg- radv: Allow building when LLVM isn’t enabled 15177ec681f3Smrg 15187ec681f3SmrgEnrico Galli (11): 15197ec681f3Smrg 15207ec681f3Smrg- microsoft/spirv_to_dxil: Adding continue opt pass to fix DXIL loop gen 15217ec681f3Smrg- nir_lower_readonly_images_to_tex: Fix typeo on image arrays 15227ec681f3Smrg- microsoft/compiler: Add support for arrays to image_store 15237ec681f3Smrg- microsoft/compiler: Correctly flag when using raw buffers 15247ec681f3Smrg- microsoft/spirv_to_dxil: Enable support for shared memory 15257ec681f3Smrg- microsoft/compiler: Add support for local_invocation_index 15267ec681f3Smrg- spirv_to_dxil: Convert out parameters to a single object 15277ec681f3Smrg- nir: Add CAN_REORDER to load_ubo_dxil 15287ec681f3Smrg- spirv_to_dxil: Add support for nir_intrinsic_load_num_workgroups 15297ec681f3Smrg- spirv_to_dxil: Add support for non-zero vertex and instance indices 15307ec681f3Smrg- nir_to_dxil: Add tagging raw SRVs in shader flags 15317ec681f3Smrg 15327ec681f3SmrgEric Engestrom (45): 15337ec681f3Smrg 15347ec681f3Smrg- docs: add release notes for 21.1.5 15357ec681f3Smrg- docs: update calendar and link releases notes for 21.1.5 15367ec681f3Smrg- docs: drop duplicate \`21.1` branch name from release calendar 15377ec681f3Smrg- docs: add release notes for 21.1.6 15387ec681f3Smrg- docs: update calendar and link releases notes for 21.1.6 15397ec681f3Smrg- pick-ui: drop assert that optional argument is passed 15407ec681f3Smrg- pick-ui: show nomination type in the UI 15417ec681f3Smrg- pick-ui: show commit date 15427ec681f3Smrg- docs: add release notes for 21.1.7 15437ec681f3Smrg- docs: update calendar and link releases notes for 21.1.7 15447ec681f3Smrg- python: explicitly require python3 15457ec681f3Smrg- gitlab-ci: stop installing python-is-python3 package 15467ec681f3Smrg- python: drop python2 support 15477ec681f3Smrg- Revert "python: Explicitly add the 'L' suffix on Python 3" 15487ec681f3Smrg- isl: drop comment about "python 2 vs 3" as it doesn't apply anymore 15497ec681f3Smrg- isl: drop left-over comment 15507ec681f3Smrg- glsl/tests: remove some dead code 15517ec681f3Smrg- python: drop explicit output_encoding='utf-8' in mako templates 15527ec681f3Smrg- docs: add release notes for 21.1.8 15537ec681f3Smrg- docs: update calendar and link releases notes for 21.1.8 15547ec681f3Smrg- docs: add plan for 21.3.x release cycle 15557ec681f3Smrg- docs: shorten "last release" note to fit on the website without horizontal scrolling 15567ec681f3Smrg- bin/khronos-update.py: update the branch name (s/master/main/) 15577ec681f3Smrg- bin/khronos-update.py: add upstream for vulkan_directfb.h & vulkan_screen.h 15587ec681f3Smrg- gitlab: convert old REVIEWERS into GitLab's CODEOWNERS 15597ec681f3Smrg- CODEOWNERS: add SWR maintainers 15607ec681f3Smrg- CODEOWNERS: add intel group 15617ec681f3Smrg- CODEOWNERS: add android build system 15627ec681f3Smrg- CODEOWNERS: add @alyssa for Asahi and Panfrost 15637ec681f3Smrg- CODEOWNERS: add @bbrezillon for src/panfrost/vulkan/ 15647ec681f3Smrg- CODEOWNERS: add @jenatali for Microsoft & D3D12 15657ec681f3Smrg- egl: sync eglext.h & egl.xml from Khronos 15667ec681f3Smrg- egl: implement EGL_EXT_present_opaque on wayland 15677ec681f3Smrg- VERSION: bump for 21.3.0-rc1 15687ec681f3Smrg- .pick_status.json: Update to 86b3d8c66ce17ddcaefa5bdea68882cc03a57f15 15697ec681f3Smrg- .pick_status.json: Mark 7a2e40df5e8490de739c66865f90fa6804e41f6d as denominated 15707ec681f3Smrg- VERSION: bump for 21.3.0-rc2 15717ec681f3Smrg- .pick_status.json: Update to 4856586ac605e89ee6c128b1a190f000311b49ba 15727ec681f3Smrg- VERSION: bump for 21.3.0-rc3 15737ec681f3Smrg- .pick_status.json: Update to c356f3cfce9459dc1341b6a2a0fd5336a9bdcc3c 15747ec681f3Smrg- VERSION: bump for 21.3.0-rc4 15757ec681f3Smrg- .pick_status.json: Update to 549924d53e359c04d7c14b12990178c86d3aad2d 15767ec681f3Smrg- meson: drop duplicate addition of surfaceless & drm to the list of platforms 15777ec681f3Smrg- VERSION: bump for 21.3.0-rc5 15787ec681f3Smrg- .pick_status.json: Update to ba6d389fa7a0ac512cb9d4cdd21efde990f041b1 15797ec681f3Smrg 15807ec681f3SmrgErico Nunes (2): 15817ec681f3Smrg 15827ec681f3Smrg- lima: avoid crash with negative viewport values 15837ec681f3Smrg- ci: enable CI for lima again 15847ec681f3Smrg 15857ec681f3SmrgErik Faye-Lund (52): 15867ec681f3Smrg 15877ec681f3Smrg- dxil: Set coord_components on the txf in lower_int_sampler 15887ec681f3Smrg- lavapipe: do not assert on more than 32 samplers 15897ec681f3Smrg- lavapipe: do not mark unsupported tests as crashing 15907ec681f3Smrg- gallivm: let nir_lower_tex handle projectors 15917ec681f3Smrg- gallivm: make rho-approximation opt-in instead of opt-out 15927ec681f3Smrg- gallivm: remove pointless no_filter_hacks flag 15937ec681f3Smrg- d3d12: split up root parameter update and set 15947ec681f3Smrg- microsoft/compiler: fix psv-output calculation 15957ec681f3Smrg- microsoft/compiler: harmonize num_psv_inputs with outputs 15967ec681f3Smrg- gallivm: use lp_build_log2_safe for pow 15977ec681f3Smrg- lavapipe: remove stale xfails 15987ec681f3Smrg- lavapipe: remove duplicate xfail with typo 15997ec681f3Smrg- lavapipe: lower mipmapPrecisionBits to 4 16007ec681f3Smrg- gallivm: remove code to force nearest s/t interpolation 16017ec681f3Smrg- llvmpipe: take intersection with bbox for non-legacy points 16027ec681f3Smrg- st/mesa: correct point_tri_clip for gles2 16037ec681f3Smrg- gallivm: fix texture-mapping with 16-bit result 16047ec681f3Smrg- draw: fix stippling of fractional lines 16057ec681f3Smrg- gallium/nir/tgsi: fixup indentation 16067ec681f3Smrg- gallium/nir/tgsi: initialize file_max for inputs 16077ec681f3Smrg- draw: improve numerical stability in clipper 16087ec681f3Smrg- llvmpipe: use preferred attribute interpolation for wide lines 16097ec681f3Smrg- llvmpipe: clamp z to 0..1 range when using polygon offset 16107ec681f3Smrg- llvmpipe: split coefficient calculation and store 16117ec681f3Smrg- llvmpipe: improve polygon-offset precision 16127ec681f3Smrg- lavapipe: fix reported subpixel precision for lines 16137ec681f3Smrg- draw/llvmpipe: correct exponent calculation for negative z 16147ec681f3Smrg- gallium/tgsi: remove unused helper 16157ec681f3Smrg- gallium/tgsi: rip out cylindrical wrap from ureg 16167ec681f3Smrg- gallium/tgsi: rip out cylindrical wrap support 16177ec681f3Smrg- softpipe: rip out cylindrical wrap support 16187ec681f3Smrg- llvmpipe: rip out cylindrical wrap support 16197ec681f3Smrg- microsoft/compiler: remove needless error-returns 16207ec681f3Smrg- microsoft/compiler: return errors from get_n_src 16217ec681f3Smrg- microsoft/compiler: trivial fixes to error-handling 16227ec681f3Smrg- Revert "zink: always init bordercolor value for sampler" 16237ec681f3Smrg- zink: do not warn about rare features until used 16247ec681f3Smrg- zink: initialize pQueueFamilyIndices 16257ec681f3Smrg- zink: avoid overflow when calculating size 16267ec681f3Smrg- zink: do not try to dereference null-key 16277ec681f3Smrg- zink: do not dereference null-pointer 16287ec681f3Smrg- zink: pctx can't be null here 16297ec681f3Smrg- zink: return false on failure 16307ec681f3Smrg- zink: remove needless NULL-check 16317ec681f3Smrg- zink: avoid memcmping null pointers 16327ec681f3Smrg- zink: avoid checking if src is const twice 16337ec681f3Smrg- zink: give each major intrinsic it's own emit function 16347ec681f3Smrg- zink: remove needless scope 16357ec681f3Smrg- zink: remove incorrect ASSERTED macro 16367ec681f3Smrg- zink: clean up const-value handling for get_ssbo_size 16377ec681f3Smrg- zink: reduce scope of version-struct hack 16387ec681f3Smrg- zink: avoid generating nonsensical code 16397ec681f3Smrg 16407ec681f3SmrgEsme Xuan Lim (1): 16417ec681f3Smrg 16427ec681f3Smrg- docs/panfrost: Fix link to use rst syntax 16437ec681f3Smrg 16447ec681f3SmrgFelix DeGrood (2): 16457ec681f3Smrg 16467ec681f3Smrg- iris: add tile cache flush to iris_copy_region 16477ec681f3Smrg- anv: dirty only state impacted by blorp_exec 16487ec681f3Smrg 16497ec681f3SmrgFilip Gawin (18): 16507ec681f3Smrg 16517ec681f3Smrg- docs: make most important part of bugs.rst easier to find 16527ec681f3Smrg- radeonsi: improve rounding of zmin 16537ec681f3Smrg- radv: improve rounding of zmin 16547ec681f3Smrg- nir: fix shadowed variable in nir_lower_bit_size.c 16557ec681f3Smrg- nir: fix ifind_msb_rev by using appropriate type 16567ec681f3Smrg- meson: add crocus to default group of drivers for x86/x86_64 16577ec681f3Smrg- nouveau: fix forward declaration of struct 16587ec681f3Smrg- nouveau: use bool literals instead of integers 16597ec681f3Smrg- glsl: use bool literals instead of integers 16607ec681f3Smrg- r300: fix usage of COVERED_PTR_MASKING_ENABLE for r500 16617ec681f3Smrg- r300: make global variables const (if possible) 16627ec681f3Smrg- r300: assert that array in translate_vertex_program is initialized 16637ec681f3Smrg- aco: cleanup assignment of unique_ptrs 16647ec681f3Smrg- r300: implement forgotten tgsi's cases of textures 16657ec681f3Smrg- r300: fix UB caused by 1 << 31 and 2 << 30 16667ec681f3Smrg- r300: avoid searching for temp variable twice 16677ec681f3Smrg- nir: avoiding reading unitialized memory when using nir_dest_copy 16687ec681f3Smrg- r300: fixes for UB caused by left shifts 16697ec681f3Smrg 16707ec681f3SmrgFrancisco Jerez (12): 16717ec681f3Smrg 16727ec681f3Smrg- iris: Add read-only domain for VF cache. 16737ec681f3Smrg- iris: Annotate all BO uses through VF cache domain. 16747ec681f3Smrg- iris: Insert buffer-local memory barriers for VF reads. 16757ec681f3Smrg- iris: Add separate dirty bit for VBO flushes. 16767ec681f3Smrg- iris: Insert buffer-local memory barriers for indirect draw parameters. 16777ec681f3Smrg- iris: Add read-write domain for data cache. 16787ec681f3Smrg- iris: Use DATA domain barrier for shader images instead of OTHER domain. 16797ec681f3Smrg- iris: Insert buffer-local memory barriers for SSBO reads and writes. 16807ec681f3Smrg- iris: Insert buffer-local memory barriers for UBO reads. 16817ec681f3Smrg- iris: Use separate dirty bits for UBO and SSBO flushes. 16827ec681f3Smrg- iris: Track dirty UBOs per-stage for more targeted flushing. 16837ec681f3Smrg- iris: Make sure a bound resource is flushed after iris_dirty_for_history. 16847ec681f3Smrg 16857ec681f3SmrgGeorg Lehmann (3): 16867ec681f3Smrg 16877ec681f3Smrg- radv: Use c_msvc_compat_args. 16887ec681f3Smrg- aco: Use cpp_msvc_compat_args. 16897ec681f3Smrg- radv: Remove dead min waves code. 16907ec681f3Smrg 16917ec681f3SmrgGert Wollny (3): 16927ec681f3Smrg 16937ec681f3Smrg- mesa: Add support for EXT_clear_texture 16947ec681f3Smrg- mesa: Add EXT_texture_mirror_clamp_to_edge to extension table 16957ec681f3Smrg- mesa: signal driver when buffer is bound to different texture format 16967ec681f3Smrg 16977ec681f3SmrgGreg V (1): 16987ec681f3Smrg 16997ec681f3Smrg- util: make util_get_process_exec_path work on FreeBSD w/o procfs 17007ec681f3Smrg 17017ec681f3SmrgGuilherme Gallo (9): 17027ec681f3Smrg 17037ec681f3Smrg- gitlab-ci: enable testing on Intel Whiskey Lake (experimental) 17047ec681f3Smrg- gitlab-ci: enable testing on Intel Comet Lake (experimental) 17057ec681f3Smrg- gitlab-ci: Fix trace expectations for iris devices 17067ec681f3Smrg- gitlab-ci: Fix octopus device type and tag 17077ec681f3Smrg- gitlab-ci: Add sleep for every \`scheduler.jobs.logs` call 17087ec681f3Smrg- gitlab-ci: Implement a simple timeout detection for LAVA jobs 17097ec681f3Smrg- gitlab-ci: refactor timeout constants and tweak timeout values 17107ec681f3Smrg- ci: Uprev deqp-runner to 0.9.0 17117ec681f3Smrg- ci: Update linux kernel to v5.15 17127ec681f3Smrg 17137ec681f3SmrgGurchetan Singh (3): 17147ec681f3Smrg 17157ec681f3Smrg- drm-uapi: virtgpu_drm.h: context init feature 17167ec681f3Smrg- virgl/drm: query for context init ioctl and supported capset ids 17177ec681f3Smrg- virgl/drm: explicit context initialization 17187ec681f3Smrg 17197ec681f3SmrgHoe Hao Cheng (2): 17207ec681f3Smrg 17217ec681f3Smrg- zink: make codegen compatible with python 3.5 17227ec681f3Smrg- zink/codegen: do not enable extensions based on vulkan version 17237ec681f3Smrg 17247ec681f3SmrgHyunjun Ko (4): 17257ec681f3Smrg 17267ec681f3Smrg- tu: allow dynamic primitive topology with tessellation 17277ec681f3Smrg- freedreno/a5xx,a6xx: rename MSAA_ENABLE to LINE_MODE in GRAS_SU_CNTL 17287ec681f3Smrg- turnip: enable VK_EXT_line_rasterization 17297ec681f3Smrg- turnip: enable strictLines 17307ec681f3Smrg 17317ec681f3SmrgIago Toral Quiroga (40): 17327ec681f3Smrg 17337ec681f3Smrg- ci: disable Broadcom CI 17347ec681f3Smrg- v3dv: remove more dead clearing code 17357ec681f3Smrg- v3dv: refactor meta copy/clear code 17367ec681f3Smrg- v3dv: remove unused layer field from struct rcl_clear_info 17377ec681f3Smrg- v3dv: improve TLB layered image clears 17387ec681f3Smrg- v3dv: allow limiting amount of tile state allocated 17397ec681f3Smrg- v3dv: don't overallocate tile state for meta TLB operations 17407ec681f3Smrg- v3dv: don't emit frame setup more than once for multilayered framebuffers 17417ec681f3Smrg- v3dv: fix I/O lowering for GS 17427ec681f3Smrg- v3dv: drop unused parameters 17437ec681f3Smrg- v3dv: store multiview info in our render pass data 17447ec681f3Smrg- v3dv: move all our NIR pre-processing to preprocess_nir 17457ec681f3Smrg- v3dv: inject a custom passthrough geometry shader for multiview pipelines 17467ec681f3Smrg- broadcom/compiler: implement nir_intrinsic_load_view_index 17477ec681f3Smrg- v3dv: broadcast multiview draw commands 17487ec681f3Smrg- v3dv: don't merge subpasses with different view masks 17497ec681f3Smrg- v3dv: use correct number of layers for multiview 17507ec681f3Smrg- v3dv: skip processing tiles for layers that are not in the view mask 17517ec681f3Smrg- v3dv: track first and last subpass that use a view index 17527ec681f3Smrg- v3dv: fix query error handling 17537ec681f3Smrg- v3dv: implement interaction of queries with multiview 17547ec681f3Smrg- v3dv: expose VK_KHR_multiview 17557ec681f3Smrg- v3dv: fill in drmFormatModifierTilingFeatures 17567ec681f3Smrg- v3dv: handle IMAGE_DRM_FORMAT_MODIFIER_EXPLICIT_CREATE_INFO_EXT 17577ec681f3Smrg- docs: flag VK_KHR_multiview as implemented for v3dv 17587ec681f3Smrg- broadcom/compiler: add a vir_get_cond helper 17597ec681f3Smrg- broadcom/compiler: Flags are per-thread state in V3D 4.2+ 17607ec681f3Smrg- broadcom/compiler: make spills of conditional writes also conditional 17617ec681f3Smrg- broadcom/compiler: rewrite partial update liveness tracking 17627ec681f3Smrg- v3d,v3dv: add options to force 32-bit or 16-bit TMU precision 17637ec681f3Smrg- v3dv: don't try to access pColorBlendState if rasterization is disabled 17647ec681f3Smrg- v3dv: add API entry points for sampler Ycbcr conversions 17657ec681f3Smrg- vulkan: allow creating color views from depth/stencil images 17667ec681f3Smrg- v3dv: make v3dv_image derive from vk_image 17677ec681f3Smrg- v3dv: use subresource helpers in more places 17687ec681f3Smrg- v3dv: make v3dv_image_view derive from vk_image_view 17697ec681f3Smrg- v3dv: honor VkPhysicalDeviceFeatures2 in pNext chain of VkDeviceCreateInfo 17707ec681f3Smrg- broadcom/compiler: don't enable early fragment tests if shader writes Z 17717ec681f3Smrg- v3dv: start using Broadcom's device identifiers 17727ec681f3Smrg- broadcom/compiler: fix assert that current instruction must be in current block 17737ec681f3Smrg 17747ec681f3SmrgIan Romanick (65): 17757ec681f3Smrg 17767ec681f3Smrg- nir/gcm: Clear out pass_flags before starting 17777ec681f3Smrg- util/queue: Don't crash in util_queue_destroy when init failed 17787ec681f3Smrg- iris: Add a comment for iris_uncompiled_shader::nir 17797ec681f3Smrg- iris: Fix return type of iris_compile_* 17807ec681f3Smrg- iris: Unify iris_delete_[shader stage]_state functions 17817ec681f3Smrg- iris: Unify iris_create_[shader stage]_state functions 17827ec681f3Smrg- iris: Merge iris_create_[shader stage]_state funcs into iris_create_shader_state 17837ec681f3Smrg- iris: Ref count the uncompiled shaders 17847ec681f3Smrg- iris: Extract allocation bits from iris_upload_shader to iris_create_shader_variant 17857ec681f3Smrg- iris: Allocate shader variant in caller of iris_upload_shader 17867ec681f3Smrg- iris: Add the variant to the list as early as possible 17877ec681f3Smrg- iris: Don't pass the shader key to iris_compile_[shader stage] 17887ec681f3Smrg- iris: add sync_compile option 17897ec681f3Smrg- iris: Enable threaded shader compilation 17907ec681f3Smrg- iris: Split iris_upload_shader in two 17917ec681f3Smrg- intel/compiler: Add id parameter to shader_debug_log callback 17927ec681f3Smrg- intel/compiler: Add id parameter to shader_perf_log callback 17937ec681f3Smrg- mesa: Fix tiny race condition in _mesa_debug_get_id 17947ec681f3Smrg- util: Add and use functions to calculate min and max int for a size 17957ec681f3Smrg- isl: Use CLAMP macro instead of MIN of MAX 17967ec681f3Smrg- nir/opcodes: Use u_intN_(min|max) 17977ec681f3Smrg- Revert "nir/algebraic: Convert some f2u to f2i" 17987ec681f3Smrg- intel/fs: sel.cond writes the flags on Gfx4 and Gfx5 17997ec681f3Smrg- gallium: Remove "optimize" parameter from pipe_screen::finalize_nir 18007ec681f3Smrg- intel/compiler: Document and assert some aspects of 8-bit integer lowering 18017ec681f3Smrg- nir/algebraic: Optimize some extract forms resulting from 8-bit lowering 18027ec681f3Smrg- intel/fs: Allow copy propagation between MOVs of mixed sizes 18037ec681f3Smrg- intel/fs: Emit better code for u2u of extract 18047ec681f3Smrg- nir/algebraic: Remove spurious conversions from inside logic ops 18057ec681f3Smrg- nir: intel/compiler: Add and use nir_op_pack_32_4x8_split 18067ec681f3Smrg- intel/compiler: Lower 8-bit ops to 16-bit in NIR on all platforms 18077ec681f3Smrg- util/xmlconfig: Make unit tests more resilient against user env settings 18087ec681f3Smrg- util/xmlconfig: Test values set via the environment 18097ec681f3Smrg- nir/lower_bit_size: Support add_sat and sub_sat 18107ec681f3Smrg- nir/opcodes: Add integer dot-product opcodes 18117ec681f3Smrg- nir/algebraic: Basic patterns for dot_4x8 18127ec681f3Smrg- intel/compiler: Basic support for DP4A instruction 18137ec681f3Smrg- nir/algebraic: Add lowering for dot_4x8 instructions 18147ec681f3Smrg- nir/algebraic: Add some extract optimizations 18157ec681f3Smrg- spirv: Update headers and metadata from latest Khronos commit 18167ec681f3Smrg- spirv: Add support for SPV_KHR_integer_dot_product 18177ec681f3Smrg- intel/fs: Refactor some cmod propagation tests 18187ec681f3Smrg- intel/fs: Remove redundant inst->opcode checks in cmod prop 18197ec681f3Smrg- intel/fs: Add many cmod propagation tests involving MOV instructions 18207ec681f3Smrg- intel/fs: Fix a cmod prop bug when the source type of a mov doesn't match the dest type of scan_inst 18217ec681f3Smrg- intel/compiler: Move type_is_unsigned_int to brw_reg_type.h 18227ec681f3Smrg- intel/fs: cmod propagate from MOV with any condition 18237ec681f3Smrg- intel/fs: Remove condition-based restriction for cmod propagation to saturated operations 18247ec681f3Smrg- intel/fs: Remove after parameter from test_saturate_prop 18257ec681f3Smrg- intel/fs: Remove type-based restriction for cmod propagation to saturated operations 18267ec681f3Smrg- anv: Enable KHR_shader_integer_dot_product 18277ec681f3Smrg- nir/lower_gs_intrinsics: Return progress if append_set_vertex_and_primitive_count makes progress 18287ec681f3Smrg- nir/lower_gs_intrinsics: Make nir_lower_gs_intrinsics be idempotent 18297ec681f3Smrg- iris: crocus: Use shader_info::is_arb_asm flag 18307ec681f3Smrg- iris: Calculate uses_atomic_load_store after all lowering 18317ec681f3Smrg- nir/edgeflags: Add a flag to indicate the edge flag input is needed 18327ec681f3Smrg- iris: Eliminate iris_uncompiled_shader::needs_edge_flag 18337ec681f3Smrg- iris: Move iris_set_max_shader_compiler_threads and iris_is_parallel_shader_compilation_finished 18347ec681f3Smrg- iris: Add finalize_nir 18357ec681f3Smrg- spirv: Silence unused parameter warnings in vtn_alu.c 18367ec681f3Smrg- spirv: Minor cleanup in SpvOpFOrdNotEqual 18377ec681f3Smrg- spirv: SpvOpFUnordNotEqual doesn't need special treatment 18387ec681f3Smrg- spirv: Generate shorter code for SpvOpFUnord comparisons 18397ec681f3Smrg- nir/algebraic: Small optimizations for SpvOpFOrdNotEqual and SpvOpFUnordEqual 18407ec681f3Smrg- nir/loop_unroll: Always unroll loops that iterate at most once 18417ec681f3Smrg 18427ec681f3SmrgIcecream95 (26): 18437ec681f3Smrg 18447ec681f3Smrg- pan/decode: Avoid undefined behaviour on shift in bits() 18457ec681f3Smrg- pan/gen_pack: Use 1U for unpacking log2 to avoid undefined behaviour 18467ec681f3Smrg- pan/bi: Print the clause of branch targets 18477ec681f3Smrg- pan/bi: Use padding bytes for checking whether to stop disassembly 18487ec681f3Smrg- pan/bi: Fix infinite loop parsing arguments for bifrost_compiler 18497ec681f3Smrg- pan/mdg: Analyze helper termination after scheduling 18507ec681f3Smrg- pan/bi: Use the computed scale for fexp NaN propagation 18517ec681f3Smrg- panfrost: Call primconvert and u_transfer_helper destroy functions 18527ec681f3Smrg- pan/bi,pan/mdg: Fix memory leak of hash tables 18537ec681f3Smrg- panfrost: Fix memory leaks for compute state 18547ec681f3Smrg- panfrost: Free TGSI tokens 18557ec681f3Smrg- panfrost: Free NIR when deleting shader state 18567ec681f3Smrg- pan/mdg: Reduce size of tex_opcode_props 18577ec681f3Smrg- panfrost: Fill tiler job padding again 18587ec681f3Smrg- panfrost: Add nocache debug flag for disabling the BO cache 18597ec681f3Smrg- panfrost: Only allow colour blit shaders to be killed 18607ec681f3Smrg- panfrost: drm-shim support 18617ec681f3Smrg- pan/bi: Extend bi_add_nop_for_atest for tilebuffer loads 18627ec681f3Smrg- lima: Enable PIPE_CAP_VERTEX_COLOR_UNCLAMPED 18637ec681f3Smrg- lima: Fix crashes for GPUs with more than four cores 18647ec681f3Smrg- lima: Improve error messages for unsupported GP operations 18657ec681f3Smrg- lima: Add a noop drm-shim 18667ec681f3Smrg- pan/bi: Don't set dependencies for +BLEND in blend shaders 18677ec681f3Smrg- pan/mdg: Remove use of global variables in disassembler 18687ec681f3Smrg- panfrost: Add ASTC 3D texture format entries 18697ec681f3Smrg- pan/mdg: Use the correct swizzle for condition moves 18707ec681f3Smrg 18717ec681f3SmrgIlia Mirkin (7): 18727ec681f3Smrg 18737ec681f3Smrg- st/mesa: fix pbo download store image type 18747ec681f3Smrg- mesa: don't return errors for gl_* GetFragData* queries 18757ec681f3Smrg- mesa: rgb10_a2 is never color-renderable in gles2 18767ec681f3Smrg- glsl: fix explicit-location ifc matching in presence of array types 18777ec681f3Smrg- freedreno: use OUT_WFI for emit_marker 18787ec681f3Smrg- a4xx: add some better documentation for compute registers 18797ec681f3Smrg- a4xx/computerator: add initial backend 18807ec681f3Smrg 18817ec681f3SmrgItalo Nicola (6): 18827ec681f3Smrg 18837ec681f3Smrg- ci: skip minio login if PIGLIT_REPLAY_UPLOAD_TO_MINIO is not set 18847ec681f3Smrg- virgl/ci: switch glmark2 traces from .rdc to .trace 18857ec681f3Smrg- virgl/ci: stop overriding GL version when running traces 18867ec681f3Smrg- virgl/ci: enable some traces that were previously crashing 18877ec681f3Smrg- main: don't always clamp pixels read from snorm buffers 18887ec681f3Smrg- panfrost: fix null deref when no color buffer is attached 18897ec681f3Smrg 18907ec681f3SmrgIván Briano (8): 18917ec681f3Smrg 18927ec681f3Smrg- anv: Don't advertise unsupported shader stages 18937ec681f3Smrg- anv: fix some multisample lines_wide CTS tests 18947ec681f3Smrg- anv: Unbreak wide lines on HSW/BDW 18957ec681f3Smrg- anv: fix feature/property/sizes reported for fragment shading rate 18967ec681f3Smrg- anv: Allow unused VkSpecializationMapEntries 18977ec681f3Smrg- anv: Don't copy the lineStipple values if lineStipple is not enabled 18987ec681f3Smrg- vulkan: fix handling of aliases in enum members 18997ec681f3Smrg- vulkan: Generate defines for aliases of promoted enums 19007ec681f3Smrg 19017ec681f3SmrgJames Park (1): 19027ec681f3Smrg 19037ec681f3Smrg- aco: Work around MSVC restrict in c99_compat.h 19047ec681f3Smrg 19057ec681f3SmrgJan Beich (1): 19067ec681f3Smrg 19077ec681f3Smrg- meson: disable -Werror=thread-safety on FreeBSD 19087ec681f3Smrg 19097ec681f3SmrgJason Ekstrand (192): 19107ec681f3Smrg 19117ec681f3Smrg- intel/dev: Handle CHV CS thread weirdness in get_device_info_from_fd 19127ec681f3Smrg- intel/dev: Put the device name in intel_device_info 19137ec681f3Smrg- intel/dev: Handle BSW naming issues 19147ec681f3Smrg- intel/dev: Add a max_cs_workgroup_threads field 19157ec681f3Smrg- intel/dev: Drop a bogus assert 19167ec681f3Smrg- nir: Better document the Boissinot algorithm in nir_from_ssa() 19177ec681f3Smrg- iris: Re-emit MEDIA_VFE_STATE for variable group size shaders 19187ec681f3Smrg- anv: Handle errors properly in anv_i915_query 19197ec681f3Smrg- intel: Pull anv_i915_query into common code 19207ec681f3Smrg- anv: Use intel_i915_query_alloc for memory regions 19217ec681f3Smrg- iris: Use intel_i915_query for meminfo 19227ec681f3Smrg- intel/dev: Use intel_i915_query_alloc in query_topology 19237ec681f3Smrg- intel/perf: Use intel_i915_query_flags instead of hand-rolling it 19247ec681f3Smrg- intel/eu: Start validating LSC message descriptors 19257ec681f3Smrg- anv: Assume syncobj support 19267ec681f3Smrg- anv: Drop unused sync_file and BO semaphore code 19277ec681f3Smrg- anv: Stop reference counting semaphores 19287ec681f3Smrg- glsl/nir: Use nir_ssa_undef() from nir_builder 19297ec681f3Smrg- nir: Set IMAGE_DIM and IMAGE_ARRAY on deref intrinsics 19307ec681f3Smrg- nir: Set src_components = -1 for image intrinsic deref sources 19317ec681f3Smrg- nir: Add a format field to _deref image intrinsics 19327ec681f3Smrg- nir/lower_subgroups: Handle down-casts in uint_to_ballot_type 19337ec681f3Smrg- nir/lower_image: Handle index and bindless image_size 19347ec681f3Smrg- nir/lower_tex: Add a lower_txs_cube_array option 19357ec681f3Smrg- radv,radeonsi: Do cube size divide-by-6 lowering in NIR 19367ec681f3Smrg- turnip: Replace tu_lower_image_size with nir_lower_image 19377ec681f3Smrg- intel/eu: Don't validate LSC transpose on ops that don't have it 19387ec681f3Smrg- ttn: Don't handle texop_txf_ms_mcs 19397ec681f3Smrg- amd: Don't handle nir_tex_src_ms_mcs 19407ec681f3Smrg- panfrost: Don't handle nir_texop_txf_ms_mcs 19417ec681f3Smrg- nir: Suffix all the MCS texture stuff _intel 19427ec681f3Smrg- docs,nir: Document NIR texture instructions 19437ec681f3Smrg- intel/blorp: Use nir_texop_txl 19447ec681f3Smrg- nir/lower_tex: Rework invalid implicit LOD lowering 19457ec681f3Smrg- nir: Validate newly documented texture restrictions 19467ec681f3Smrg- anv/android: Rework our handling of AHardwareBuffer imports 19477ec681f3Smrg- nir: Removing uses of SSA defs destroys SSA liveness 19487ec681f3Smrg- nouveau: Use nir_lower_tex for projectors 19497ec681f3Smrg- anv/blorp: Drop some can_ycbcr checks 19507ec681f3Smrg- anv/blorp: Use the isl_surf for computing level_width/height in anv_image_ccs_op 19517ec681f3Smrg- anv: Rename anv_get_format_plane to anv_get_format_aspect 19527ec681f3Smrg- anv: Rework depth/stencil early return in anv_get_format_plane 19537ec681f3Smrg- anv: Add a get_format_plane helper and use it in image setup 19547ec681f3Smrg- anv: Use anv_get_format_plane in anv_get_image_format_features 19557ec681f3Smrg- anv: Use anv_get_format_plane for color image view setup 19567ec681f3Smrg- anv: Stop assuming planes are in aspect-bit-order 19577ec681f3Smrg- anv/image: Rework YCbCr image aspects 19587ec681f3Smrg- anv: Rework our aspect/plane helpers 19597ec681f3Smrg- anv: Make anv_image_aspect_to_plane take an anv_image* 19607ec681f3Smrg- intel/eu: Set scope to TILE for TGM flushes 19617ec681f3Smrg- meson/intel: Don't build genxml tests on Android 19627ec681f3Smrg- meson: Intel drivers don't require expat on Android 19637ec681f3Smrg- meson/glsl: Only run GLSL tests if can_run_host_binaries() 19647ec681f3Smrg- intel/vec4: Don't override emit_urb_write_opcode for SNB GS 19657ec681f3Smrg- intel/perf: Use a char array for OA perf query data 19667ec681f3Smrg- anv/android: Pass the correct pointer type to vk_errorf 19677ec681f3Smrg- anv/android: Drop unused device variables 19687ec681f3Smrg- ci: Build ANV on Android 19697ec681f3Smrg- include/drm-uapi: Bump headers 19707ec681f3Smrg- anv: Use I915_MMAP_OFFSET_FIXED for LMEM platforms 19717ec681f3Smrg- iris: SMEM buffers on discrete platforms are coherent 19727ec681f3Smrg- iris: Use a tiny table to map mmap modes to offsets 19737ec681f3Smrg- iris: Add an assert to iris_bo_gem_mmap_legacy() 19747ec681f3Smrg- iris: Add a new IRIS_MMAP_NONE map type 19757ec681f3Smrg- iris: Use I915_MMAP_OFFSET_FIXED for LMEM platforms 19767ec681f3Smrg- anv: Use I915_USERPTR_PROBE when available 19777ec681f3Smrg- intel/isl: Explicitly set offset_B = 0 in get_uncomp_surf for arrays 19787ec681f3Smrg- intel/isl: Add units to view dimensions in isl_surf_get_uncompressed_surf 19797ec681f3Smrg- intel/isl: Better document isl_tiling_get_intratile_offset_* 19807ec681f3Smrg- intel/isl: Add a missing assert in isl_tiling_get_intratile_offset_sa 19817ec681f3Smrg- intel/isl: Use uint64_t for computed byte offsets 19827ec681f3Smrg- anv/image: Use planes[i]->primary_surface.isl.format in check_drm_format_mod 19837ec681f3Smrg- anv: Delete anv_image::format 19847ec681f3Smrg- vulkan: Add a vk_image struct 19857ec681f3Smrg- anv: Make anv_image derive from vk_image 19867ec681f3Smrg- anv,vulkan: Move anv_image_expand_aspects to common code 19877ec681f3Smrg- anv,vulkan: Move VkImageSubresource* helpers from ANV 19887ec681f3Smrg- vulkan: Refactor and better document vk_image_expand_aspect_mask 19897ec681f3Smrg- radv: Add asserts to vk_format_depth/stencil_only 19907ec681f3Smrg- vulkan,radv: Move vk_format_depth/stencil_only to common code 19917ec681f3Smrg- vulkan: Add a vk_image_view struct 19927ec681f3Smrg- anv: Make anv_image_view derive from vk_image_view 19937ec681f3Smrg- anv,vulkan: Move ANV image layout helpers to common code 19947ec681f3Smrg- anv,vulkan: Move drm_format_mod to vk_image 19957ec681f3Smrg- anv,vulkan: Add a vk_image::wsi_legacy_scanout bit 19967ec681f3Smrg- anv: Move compute_heap_size lower in the file 19977ec681f3Smrg- anv: Rework init_meminfo 19987ec681f3Smrg- anv: compute available memory in anv_init_meminfo 19997ec681f3Smrg- anv: Set CONTEXT_PARAM_RECOVERABLE to false 20007ec681f3Smrg- intel/compiler: Add unified barrier support for CS 20017ec681f3Smrg- intel/isl: Add more parameters to isl_tiling_get_info 20027ec681f3Smrg- isl/docs/tiling: Add Tile4 docs 20037ec681f3Smrg- intel/fs: Add support for atomic_fadd 20047ec681f3Smrg- anv: Advertise support for shaderBufferFloat32AtomicAdd 20057ec681f3Smrg- nir: Properly clean up nir_src/dest indirects 20067ec681f3Smrg- nir: Stop sweeping indirects 20077ec681f3Smrg- spirv: Handle the SubgroupSize execution mode 20087ec681f3Smrg- intel/fs: Handle required subgroup sizes specified in the SPIR-V 20097ec681f3Smrg- iris: Handle states=NULL in iris_bind_sampler_states 20107ec681f3Smrg- iris: Return 1 for PIPE_COMPUTE_CAP_IMAGES_SUPPORTED 20117ec681f3Smrg- panvk: Use vk_queue 20127ec681f3Smrg- panvk: Use vk_command_buffer 20137ec681f3Smrg- vulkan: Add the pCreateInfo to vk_queue_init() 20147ec681f3Smrg- anv: Drop anv_queue::flags 20157ec681f3Smrg- radv: Drop radv_queue::flags/queue_family_index/queue_idx 20167ec681f3Smrg- lavapipe: Drop lvp_queue::flags 20177ec681f3Smrg- turnip: Drop tu_queue::flags/queue_family_index/queue_idx 20187ec681f3Smrg- v3dv: Drop v3dv_queue::flags 20197ec681f3Smrg- panvk: Drop panvk_queue::flags/queue_family_index 20207ec681f3Smrg- vulkan/device: Add a common GetDeviceQueue2 implementation 20217ec681f3Smrg- vulkan/device: Add a common DeviceWaitIdle implementation 20227ec681f3Smrg- anv: Switch to common GetDeviceQueues2 and DeviceWaitIdle 20237ec681f3Smrg- radv: Switch to common GetDeviceQueues2 and DeviceWaitIdle 20247ec681f3Smrg- turnip: Switch to common GetDeviceQueues2 and DeviceWaitIdle 20257ec681f3Smrg- v3dv: Use the common GetDeviceQueue implementation 20267ec681f3Smrg- lavapipe: Simplify DeviceWaitIdle 20277ec681f3Smrg- lavapipe: Switch to common GetDeviceQueue and DeviceWaitIdle 20287ec681f3Smrg- panvk: Switch to common GetDeviceQueue and DeviceWaitIdle 20297ec681f3Smrg- intel/fs: Rework fence handling in brw_fs_nir.cpp 20307ec681f3Smrg- intel/fs: Ignore SLM fences if shared is unused 20317ec681f3Smrg- intel/fs: Add the URB fence message 20327ec681f3Smrg- intel/fs: Emit URB fences when we have LSC 20337ec681f3Smrg- vulkan/shader_module: Fix the lifetime of temporary shader modules 20347ec681f3Smrg- v3dv: Use VK_DEFINE_*HANDLE_CASTS instead of rolling our own 20357ec681f3Smrg- st/texture: Dedent surface setup in CompressedTexSubImage 20367ec681f3Smrg- st/texture: Fall back to single-slice uploads in st_CompressedTexSubImage 20377ec681f3Smrg- Move a bunch of the CLC stuff from src/microsoft to common code 20387ec681f3Smrg- compiler/clc: Clean ups 20397ec681f3Smrg- compiler/clc: grab opencl-c.h from the system path by default 20407ec681f3Smrg- anv,iris,genxml: Use NumberOfBarriers on XeHP 20417ec681f3Smrg- vulkan/physical_device_features: Drop some unnecessary dependencies 20427ec681f3Smrg- vulkan/physical_device_features: Stop generating a header 20437ec681f3Smrg- radv: Use VK_DEFINE_*HANDLE_CASTS instead of rolling our own 20447ec681f3Smrg- vulkan: Update the XML and headers to 1.2.195 20457ec681f3Smrg- anv: Add an anv_image_get_memory_requirements helper 20467ec681f3Smrg- intel/isl: Add a max_buffer_size limit to isl_device 20477ec681f3Smrg- intel/isl: Simplify isl_format_supports_filtering 20487ec681f3Smrg- intel/isl: Stop claiming ASTC works on Cherry View 20497ec681f3Smrg- anv: Ask ISL about ASTC support 20507ec681f3Smrg- intel/isl: ASTC support was removed on Gfx12.5 20517ec681f3Smrg- genxml: Drop bit 27 from RENDER_SURFACE_STATE::Surface Format 20527ec681f3Smrg- nir/algebraic: Lower fisfinite 20537ec681f3Smrg- nir/algebraic: Add some boolean optimizations 20547ec681f3Smrg- nir/algebraic: Add some opts for comparisons of comparisons 20557ec681f3Smrg- vulkan: Drop vk_object_base_reset 20567ec681f3Smrg- vulkan: Track which objects are client-visible 20577ec681f3Smrg- vulkan/log: Assert if the driver logs a client-invisible object 20587ec681f3Smrg- vulkan/log: Log to instance messages during instance construction 20597ec681f3Smrg- anv: drop a misplaced and wrong comment 20607ec681f3Smrg- anv: Stop printing descriptor pool allocation failures 20617ec681f3Smrg- anv: s/vk_error/anv_error/g 20627ec681f3Smrg- vulkan/log: Handle logging to a physical device 20637ec681f3Smrg- vulkan/log: Add common vk_error and vk_errorf helpers 20647ec681f3Smrg- anv: Drop unused logging helpers 20657ec681f3Smrg- anv/queue: Plumb the queue through all the queue_submit calls 20667ec681f3Smrg- anv: Use the common vk_error and vk_errorf helpers 20677ec681f3Smrg- radv: Stop printing descriptor pool allocation failures 20687ec681f3Smrg- radv: Switch to the new common vk_error helpers 20697ec681f3Smrg- lavapipe: Switch to the new vk_error helpers 20707ec681f3Smrg- panvk: Switch to the new vk_error helpers 20717ec681f3Smrg- v3dv: Switch to the new vk_error helpers 20727ec681f3Smrg- turnip: Plumb non-startup errors through the new vk_error helpers 20737ec681f3Smrg- vulkan/log: Drop _impl from the log helper names 20747ec681f3Smrg- vulkan/instance: Use vk_error in vk_instance_init 20757ec681f3Smrg- vulkan/device: Use vk_error 20767ec681f3Smrg- vulkan/device: Use vk_errorf to report missing features 20777ec681f3Smrg- Revert "mesa: use simple_mtx_t for TexMutex" 20787ec681f3Smrg- nir/lower_discard_or_demote: Fix metadata 20797ec681f3Smrg- vulkan: Generate flag #defines based on bitwidth 20807ec681f3Smrg- vulkan: Generate #defines with every bit in a given bitfield 20817ec681f3Smrg- anv: Use the common wrapper for GetPhysicalDeviceFormatProperties 20827ec681f3Smrg- anv: Flip around the way we reason about storage image lowering 20837ec681f3Smrg- meson: Add and use an idep for Vulkan WSI 20847ec681f3Smrg- vulkan/wsi: Add a dispatch table for WSI entrypoints 20857ec681f3Smrg- vulkan/wsi: Add common wrappers for most entrypoints 20867ec681f3Smrg- anv: Use the common WSI wrappers 20877ec681f3Smrg- radv: Use the common WSI wrappers 20887ec681f3Smrg- turnip: Use the common WSI wrappers 20897ec681f3Smrg- v3dv: Use the common WSI wrappers 20907ec681f3Smrg- panvk: Use the common WSI wrappers 20917ec681f3Smrg- lavapipe: Use the common WSI wrappers 20927ec681f3Smrg- venus: Use the common WSI wrappers 20937ec681f3Smrg- vulkan/wsi/common: Delete the wrapper entrypoints 20947ec681f3Smrg- vulkan/wsi/x11: Delete the wrapper entrypoints 20957ec681f3Smrg- vulkan/wsi/wayland: Delete the wrapper entrypoints 20967ec681f3Smrg- vulkan/wsi/display: Delete the wrapper entrypoints 20977ec681f3Smrg- vulkan/log: Tweak our handling of a couple error enums 20987ec681f3Smrg- i965: Emit a NULL surface for buffer textures with no buffer 20997ec681f3Smrg- lavapipe: Don't wrap errors returned from vk_device_init in vk_error 21007ec681f3Smrg- anv: Fix FlushMappedMemoryRanges for odd mmap offsets 21017ec681f3Smrg- anv: Also disallow CCS_E for multi-LOD images 21027ec681f3Smrg- vulkan/util: Include stdlib.h 21037ec681f3Smrg 21047ec681f3SmrgJeremy Newton (1): 21057ec681f3Smrg 21067ec681f3Smrg- Fix building AMD MM/GL with EL7 21077ec681f3Smrg 21087ec681f3SmrgJesse Natalie (62): 21097ec681f3Smrg 21107ec681f3Smrg- mesa/main: Check for fbo attachments when importing EGL images to textures 21117ec681f3Smrg- microsoft/compiler: Implement texture loads from UAVs 21127ec681f3Smrg- microsoft/clc: Add a test for compiling a kernel with a read-write image 21137ec681f3Smrg- gallium/dri: Move driConf -> st option processing to aux/util 21147ec681f3Smrg- xmlconfig: Use static inline for regex fallback to prevent -O0 issues 21157ec681f3Smrg- wgl: Parse driconf options 21167ec681f3Smrg- wgl: Add a driver name for driconf 21177ec681f3Smrg- u_driconf: Use a macro to avoid repeating option names 21187ec681f3Smrg- CI: Update Windows quick_gl baseline for mysterious new passes 21197ec681f3Smrg- spirv2dxil: Fix build after spirv_to_dxil signature change 21207ec681f3Smrg- ci/windows: Build spirv-to-dxil 21217ec681f3Smrg- llvmpipe: Don't wait for already-terminated threads on Windows 21227ec681f3Smrg- mapi: Fix shared-glapi build with MSVC 21237ec681f3Smrg- wgl: Fix unit test when using shared glapi 21247ec681f3Smrg- static-glapi: Fix MSVC preprocessor definitions 21257ec681f3Smrg- wgl: Don't use BUILD_GL32 for wgl frontend 21267ec681f3Smrg- wgl: Move opengl32.def to target instead of frontend 21277ec681f3Smrg- wgl: Move wgl* non-extension definitions to libgl-gdi 21287ec681f3Smrg- wgl: Make overridden entrypoints local to stw_ext_context 21297ec681f3Smrg- wgl: Refactor drivers to a libgallium_wgl.dll 21307ec681f3Smrg- docs: Update Windows llvmpipe doc for driver split 21317ec681f3Smrg- gl.h: Remove dllimport 21327ec681f3Smrg- wgl: Create contexts and DHGLRCs separately 21337ec681f3Smrg- wgl: Pass share context as pointer instead of DHGLRC 21347ec681f3Smrg- wgl: Make contexts current with pointer instead of DHGLRC 21357ec681f3Smrg- wgl: Allow creating framebuffers that aren't in the global window list 21367ec681f3Smrg- wgl: Make contexts current with framebuffers instead of HDCs 21377ec681f3Smrg- wgl: Split DrvReleaseContext to support unbind via pointer 21387ec681f3Smrg- wgl: Add iPixelFormat to stw_pixelformat_info 21397ec681f3Smrg- wgl: Un-inline helpers which use stw_own_mutex 21407ec681f3Smrg- wgl: Add an explicit iPixelFormat for context creation 21417ec681f3Smrg- wgl: Use HWND instead of HDC as primary framebuffer handle 21427ec681f3Smrg- wgl: Add a stw_dev getter 21437ec681f3Smrg- wgl: Swap buffers via pointer instead of HDC 21447ec681f3Smrg- wgl: Add stw_* DLL exports for EGL support 21457ec681f3Smrg- meson: Include EGL after gallium 21467ec681f3Smrg- meson, egl: Support building for the Windows platform 21477ec681f3Smrg- egl: Add wgl/gallium dependencies for Windows platform 21487ec681f3Smrg- egl: Use the .def file for Windows 21497ec681f3Smrg- egl: Don't try to dereference native displays unless there's a detectable platform 21507ec681f3Smrg- egl: Detect Windows platform using GDI 21517ec681f3Smrg- egl: Add a basic Windows driver 21527ec681f3Smrg- symbols-check: Fix symbol demangling for Windows 21537ec681f3Smrg- egl: Update Windows .def to include missing exports 21547ec681f3Smrg- meson: Set /Zc:__cplusplus for MSVC 21557ec681f3Smrg- CI/windows: Build shared-glapi, EGL, gles2 21567ec681f3Smrg- microsoft/clc: Rename compiler DLL to clon12compiler 21577ec681f3Smrg- microsoft/clc: Clean up clc_context 21587ec681f3Smrg- microsoft/clc: Stop heap-allocating tiny fixed-size transparent structs 21597ec681f3Smrg- microsoft/clc: Split clc_object and rename entrypoints 21607ec681f3Smrg- microsoft/clc: Support SPIR intermediates in the compilation APIs 21617ec681f3Smrg- microsoft/clc: Parse SPIR-V specialization consts into metadata 21627ec681f3Smrg- microsoft/clc: Support passing specialization consts to spirv_to_nir 21637ec681f3Smrg- microsoft/clc: Add API to independently specialize SPIR-V 21647ec681f3Smrg- microsoft/clc: Add a test for specializing via SPIRV-Tools 21657ec681f3Smrg- clover: std::result_of is deprecated in c++17 and removed in c++20 21667ec681f3Smrg- clover: Delete unused 'e' exception reference vars 21677ec681f3Smrg- clover: Rename module -> binary, because C++20 makes module a keyword 21687ec681f3Smrg- compiler/clc: Null extensions should mean all supported, not all 21697ec681f3Smrg- compiler/clc: Preserve OCL kernel arg type metadata on LLVM13 21707ec681f3Smrg- util/hash_table: Clear special 0/1 entries for u64 hash table too 21717ec681f3Smrg- d3d12: Fix Linux fence wait return value 21727ec681f3Smrg 21737ec681f3SmrgJonathan Marek (1): 21747ec681f3Smrg 21757ec681f3Smrg- freedreno/registers: add a6xx media formats 21767ec681f3Smrg 21777ec681f3SmrgJordan Justen (51): 21787ec681f3Smrg 21797ec681f3Smrg- nir: Add nir_lower_image() to lower cube image sizes 21807ec681f3Smrg- intel/compiler: Rename brw_nir_lower_image_load_store to brw_nir_lower_storage_image 21817ec681f3Smrg- intel/compiler: Lower cube image sizes using nir_lower_image() 21827ec681f3Smrg- intel/compiler: Remove cube array size lowering in compiler backend 21837ec681f3Smrg- meson: Search for python3 before python for bin/meson_get_version.py 21847ec681f3Smrg- meson: Check that bin/meson_get_version.py ran without an error 21857ec681f3Smrg- intel/pci-ids: Re-enable DG1 and add SG1 21867ec681f3Smrg- intel/compiler: Regroup TCS barrier code paths 21877ec681f3Smrg- intel/compiler: Add unified barrier support for TCS 21887ec681f3Smrg- iris: Disable the Y-tiled modifiers on XeHP+ 21897ec681f3Smrg- intel: Move subslice_total into devinfo 21907ec681f3Smrg- intel/devinfo: Add devinfo->max_scratch_ids 21917ec681f3Smrg- intel/dev: Add is_dg2 to devinfo 21927ec681f3Smrg- intel/isl: Enable MOCS 61 for external surfaces on TGL 21937ec681f3Smrg- intel/dev: Add display_ver and set adl-p to 13 21947ec681f3Smrg- iris: Disable I915_FORMAT_MOD_Y_TILED_GEN12* on adl-p/display 13 21957ec681f3Smrg- Revert "iris: Disable I915_FORMAT_MOD_Y_TILED_GEN12* on adl-p/display 13" 21967ec681f3Smrg- Revert "intel/dev: Add display_ver and set adl-p to 13" 21977ec681f3Smrg- intel/dev: Add display_ver and set adl-p to 13 21987ec681f3Smrg- iris: Disable I915_FORMAT_MOD_Y_TILED_GEN12* on adl-p/display 13 21997ec681f3Smrg- intel/blorp: Move most of BLORP_CREATE_NIR_INPUT into a function 22007ec681f3Smrg- intel/blorp: Add compute support to BLORP_CREATE_NIR_INPUT 22017ec681f3Smrg- intel/blorp: Add shader_pipeline to brw_blorp_base_key 22027ec681f3Smrg- intel/blorp: Add brw_blorp_init_cs_prog_key 22037ec681f3Smrg- intel/compiler: Use INTEL_DEBUG=blorp to dump blorp compute shaders 22047ec681f3Smrg- intel/blorp: Add subgroup_id input for compute programs 22057ec681f3Smrg- intel/blorp: Add blorp_compile_cs 22067ec681f3Smrg- intel/blorp: Split out ps specific sampler state into a separate function 22077ec681f3Smrg- intel/blorp: Split out surface setup from state emission 22087ec681f3Smrg- blorp: Add blorp_alloc_general_state 22097ec681f3Smrg- intel/blorp: Emit compute program based on BLORP_BATCH_USE_COMPUTE 22107ec681f3Smrg- intel/gfx7: Change GPGPU Mode to bool 22117ec681f3Smrg- intel/blorp: Add blorp_get_cs_local_y, blorp_set_cs_dims 22127ec681f3Smrg- intel/blorp: Change discard terminology to bounds 22137ec681f3Smrg- intel/blorp: Add blorp_check_in_bounds() 22147ec681f3Smrg- intel/blorp: Use blorp_check_in_bounds for discards 22157ec681f3Smrg- blorp: Set view usage to ISL_SURF_USAGE_STORAGE_BIT for compute 22167ec681f3Smrg- blorp/clear: Simplify rbg-as-red channel packing 22177ec681f3Smrg- intel/blorp: Convert blorp_clear color_write_disable to a bitmask 22187ec681f3Smrg- intel/blorp: Support compute for slow clears 22197ec681f3Smrg- intel/blorp/blit: Rename wm_prog_key and prog_key to key 22207ec681f3Smrg- intel/blorp: Support some image/buffer blit operations using compute 22217ec681f3Smrg- anv: Store anv_queue_family type in cmd-pool 22227ec681f3Smrg- anv: Prevent starting a render pass on compute queues 22237ec681f3Smrg- anv/blorp: Make sure blorp type is supported by the queue 22247ec681f3Smrg- anv/blorp: Select pipeline based on BLORP_BATCH_USE_COMPUTE 22257ec681f3Smrg- anv/blorp: Add anv_blorp_batch_init, anv_blorp_batch_finish 22267ec681f3Smrg- anv/blorp: Force compute blorp on compute-only queues 22277ec681f3Smrg- anv/slice_hash: Don't allocate more than once with multiple queues 22287ec681f3Smrg- intel/isl: Add mocs settings for DG2 22297ec681f3Smrg- Revert "iris: Disable I915_FORMAT_MOD_Y_TILED_GEN12* on adl-p/display 13" 22307ec681f3Smrg 22317ec681f3SmrgJose Maria Casanova Crespo (8): 22327ec681f3Smrg 22337ec681f3Smrg- Revert "ci: disable Broadcom CI" 22347ec681f3Smrg- v3d/driconf: Expose non-MSAA texture limits for mutter and gnome-shell 22357ec681f3Smrg- v3d: export supported prim types by v3d 22367ec681f3Smrg- v3d: remove primconvert 22377ec681f3Smrg- vc4: export supported prim types by vc4 22387ec681f3Smrg- vc4: remove primconvert 22397ec681f3Smrg- v3d: Enable PIPE_CAP_PRIMITIVE_RESTART 22407ec681f3Smrg- v3d: Enable PIPE_CAP_TEXTURE_MIRROR_CLAMP_TO_EDGE 22417ec681f3Smrg 22427ec681f3SmrgJoshua Ashton (26): 22437ec681f3Smrg 22447ec681f3Smrg- lavapipe: Use common Vulkan format helpers 22457ec681f3Smrg- radv: Fix DCC image store check 22467ec681f3Smrg- radv: Disable DCC on storage images that cannot support DCC image stores 22477ec681f3Smrg- ac/surface: Add modifiers capable of DCC image stores 22487ec681f3Smrg- ac/surface: Add ac_modifier_supports_dcc_image_stores helper 22497ec681f3Smrg- radv: Expose modifiers that support DCC image stores with STORAGE_IMAGE_BIT 22507ec681f3Smrg- radv: Push box traversal results onto stack in correct order 22517ec681f3Smrg- radv: Add noatocdithering option to RADV_DEBUG 22527ec681f3Smrg- vulkan/util: Cast vk_alloc pointers 22537ec681f3Smrg- radv: Rename radv_subpass_barrier function to radv_emit_subpass_barrier 22547ec681f3Smrg- radv: Define extern "C" linkage if C++ 22557ec681f3Smrg- ac/surface: Add helper for checking if a surface supports DCC Image stores 22567ec681f3Smrg- radv: Use common DCC image store check 22577ec681f3Smrg- radeonsi: Use common DCC image store check 22587ec681f3Smrg- radv: Remove assert in radv_rt_bind_tables 22597ec681f3Smrg- radv: Do not pass result to insert_traversal_aabb_case 22607ec681f3Smrg- radv: Implement build_node_to_addr for GFX8 and below 22617ec681f3Smrg- radv: Implement software emulation for intersect_ray 22627ec681f3Smrg- radv: Enable raytracing extensions on older generations 22637ec681f3Smrg- radv: Add force_emulate_rt perftest option 22647ec681f3Smrg- ac/surface: Use 64 && 128 for GFX10_3 on non-modifier path 22657ec681f3Smrg- ac/surface: Add ac_modifier_max_extent 22667ec681f3Smrg- radeonsi: Check if modifier supports the image extent 22677ec681f3Smrg- radv: Respect max extent for modifiers 22687ec681f3Smrg- ac/surface: Expose modifiers capable of DCC image stores first 22697ec681f3Smrg- radv: Do early and late tests for fast clears 22707ec681f3Smrg 22717ec681f3SmrgJoshua Watt (1): 22727ec681f3Smrg 22737ec681f3Smrg- v3d, vc4: Fix dmabuf import for non-scanout buffers 22747ec681f3Smrg 22757ec681f3SmrgJosé Fonseca (1): 22767ec681f3Smrg 22777ec681f3Smrg- llvmpipe: Add a linear rasterizer optimized for 2D rendering. 22787ec681f3Smrg 22797ec681f3SmrgJuan A. Suarez Romero (35): 22807ec681f3Smrg 22817ec681f3Smrg- broadcom/compiler: emit TMU flush before a jump 22827ec681f3Smrg- ci/v3dv: update expected results 22837ec681f3Smrg- ci/v3d: add piglit flake test 22847ec681f3Smrg- v3d: handle debug options with debug_named_value 22857ec681f3Smrg- v3dv: assert job->cmd_buffer is valid 22867ec681f3Smrg- ci/v3dv: update vulkan expected results 22877ec681f3Smrg- broadcom: remove v3dv3 from neon library 22887ec681f3Smrg- ci: update to VK-GL-CTS 1.2.7.0 22897ec681f3Smrg- drm-uapi: add v3d performance counters 22907ec681f3Smrg- v3d: check if device supports performance monitors 22917ec681f3Smrg- v3d: attach performance monitor to jobs 22927ec681f3Smrg- v3d: move queries to pipe queries 22937ec681f3Smrg- v3d: add fence wait function 22947ec681f3Smrg- v3d: implement performance counter queries 22957ec681f3Smrg- v3d/simulator: implement performance counters 22967ec681f3Smrg- gallium/hud: initialize query 22977ec681f3Smrg- ci/v3dv: update expected results 22987ec681f3Smrg- broadcom/compiler: change current block on setting spill base 22997ec681f3Smrg- v3d: print error on perfmon destroy error 23007ec681f3Smrg- ci/vc4: update piglit expected results 23017ec681f3Smrg- broadcom/compiler: set current block on incrementing unifa 23027ec681f3Smrg- ci/v3dv: update flakes 23037ec681f3Smrg- v3dv: initialize CL submission structure 23047ec681f3Smrg- v3d/ci: add piglit flake 23057ec681f3Smrg- broadcom/ci: use deqp-runner suites for gles 23067ec681f3Smrg- broadcom/qpu: remove duplicated opcode variable 23077ec681f3Smrg- broadcom/compiler: check instruction belongs to current block 23087ec681f3Smrg- mesa: fix default texture buffer format 23097ec681f3Smrg- broadcom: make vir_emit_last_thrsw() private 23107ec681f3Smrg- broadcom/compiler: force a last thrsw for spilling 23117ec681f3Smrg- broadcom/compiler: add V3D_DEBUG_NO_LOOP_UNROLL debug option 23127ec681f3Smrg- broadcom: add cl_nobin debug option 23137ec681f3Smrg- ci/v3dv: update flakes 23147ec681f3Smrg- ci/v3d: add piglit flake 23157ec681f3Smrg- ci/vc4: add piglit timeout 23167ec681f3Smrg 23177ec681f3SmrgKai Wasserbäch (3): 23187ec681f3Smrg 23197ec681f3Smrg- gallivm: add new wrapper around Module::setOverrideStackAlignment() 23207ec681f3Smrg- gallivm: fix FTBFS on i386 with LLVM >= 13, StackAlignmentOverride is gone 23217ec681f3Smrg- fix(clover/llvm): update code to build with recent versions of LLVM 14 (Git) 23227ec681f3Smrg 23237ec681f3SmrgKarol Herbst (4): 23247ec681f3Smrg 23257ec681f3Smrg- nv50/ir/nir: fix smem size for GL 23267ec681f3Smrg- nv30: fix emulated vertex index buffers 23277ec681f3Smrg- clover: Local memory needs to be aligned. 23287ec681f3Smrg- spirv: Don't add 0.5 to array indicies for OpImageSampleExplicitLod 23297ec681f3Smrg 23307ec681f3SmrgKeith Packard (1): 23317ec681f3Smrg 23327ec681f3Smrg- iris: Map scanout buffers WC instead of WB [v2] 23337ec681f3Smrg 23347ec681f3SmrgKenneth Graunke (29): 23357ec681f3Smrg 23367ec681f3Smrg- gallium: Remove dead pb_malloc_buffer_create function prototype 23377ec681f3Smrg- iris: Rename bo->gtt_offset to bo->address 23387ec681f3Smrg- iris: Improve the memory layout of iris_bo by fixing pahole issues 23397ec681f3Smrg- iris: Drop dead drm_ioctl prototype 23407ec681f3Smrg- iris: Don't try to CPU read imported clear color BOs 23417ec681f3Smrg- iris: Use the new I915_USERPTR_PROBE API 23427ec681f3Smrg- iris: Allow SET_DOMAIN to fail when allocating new GEM objects 23437ec681f3Smrg- iris: Stop using SET_DOMAIN on discrete GPUs altogether 23447ec681f3Smrg- iris: Bypass the BO cache when allocating buffers for aux map tables 23457ec681f3Smrg- iris: Mark the aux table buffers with EXEC_OBJECT_CAPTURE. 23467ec681f3Smrg- i965: Only call lower_blend_equation_advanced for fragment shaders 23477ec681f3Smrg- glsl: Assert that lower_blend_equation_advanced is only called for FS 23487ec681f3Smrg- iris: Rewrite bo->index comment to refer to exec_bos[] 23497ec681f3Smrg- iris: Track written BOs via a bitfield rather than exec_object2 entries 23507ec681f3Smrg- iris: Defer construction of the validation (exec_object2) list 23517ec681f3Smrg- iris: Add some accessor wrappers for a few fields. 23527ec681f3Smrg- intel: Finish off the last scraps of bacon 23537ec681f3Smrg- iris: Move some iris_bo entries into a union 23547ec681f3Smrg- iris: Handle multiple BOs backed by the same GEM object in execbuf code 23557ec681f3Smrg- iris: Begin handling slab-allocated wrapper BOs in various places 23567ec681f3Smrg- iris: Introduce a BO_ALLOC_NO_SUBALLOC flag and set it in a few places 23577ec681f3Smrg- iris: Change the validation list debug code to print the BO list instead 23587ec681f3Smrg- iris: Move suballocated resources to a dedicated allocation on export 23597ec681f3Smrg- iris: Suballocate BO using the Gallium pb_slab mechanism 23607ec681f3Smrg- iris: Delete the MI_COPY_MEM_MEM resource_copy_region implementation. 23617ec681f3Smrg- iris: Require a 4K alignment for extra clear color BOs. 23627ec681f3Smrg- iris: Fix MOCS for buffer copies 23637ec681f3Smrg- iris: Fix parameters to iris_copy_region in reallocate_resource_inplace 23647ec681f3Smrg- intel/genxml: Fix MI_FLUSH_DW to actually specify the length properly 23657ec681f3Smrg 23667ec681f3SmrgKostiantyn Lazukin (1): 23677ec681f3Smrg 23687ec681f3Smrg- util/u_trace: Replace Flag with IntEnum to support python3.5 23697ec681f3Smrg 23707ec681f3SmrgKyle Brenneman (2): 23717ec681f3Smrg 23727ec681f3Smrg- Add copyright comments to the GLVND-related files. 23737ec681f3Smrg- Remove the shebang from eglFunctionList.py. 23747ec681f3Smrg 23757ec681f3SmrgLeandro Ribeiro (8): 23767ec681f3Smrg 23777ec681f3Smrg- vulkan/wsi/wayland: check directly if we got globals successfully 23787ec681f3Smrg- vulkan/wsi/wayland: do not perform roundtrip when not querying formats 23797ec681f3Smrg- vulkan/wsi/wayland: fix crash when force_bgra8_unorm_first is true 23807ec681f3Smrg- vulkan/wsi/wayland: fold wsi_wl_display_swrast and wsi_wl_display_dmabuf into parent 23817ec681f3Smrg- vulkan/wsi/wayland: always initialize format vector 23827ec681f3Smrg- vulkan/wsi/wayland: add helper function find_format() 23837ec681f3Smrg- vulkan/wsi/wayland: create swapchain using vk_zalloc() 23847ec681f3Smrg- vulkan/wsi/wayland: memset members of image to zero 23857ec681f3Smrg 23867ec681f3SmrgLeo Liu (8): 23877ec681f3Smrg 23887ec681f3Smrg- frontends/va: Add AV1 picture description 23897ec681f3Smrg- frontends/va: Add AV1 parameter buffers functions 23907ec681f3Smrg- frontends/va: Place AV1 picture and slice parameter buffers functions 23917ec681f3Smrg- frontends/va: Add AV1 profile main to the config 23927ec681f3Smrg- radeon/vcn: Enable the AV1 decode p010 mode 23937ec681f3Smrg- frontends/va: Reallocate p010 buffer for AV1 10 bits decode 23947ec681f3Smrg- radeon/vcn: reuse the dpb buffers when with the same size. 23957ec681f3Smrg- radeon/vcn: add a handling of error for incorrect reference lists 23967ec681f3Smrg 23977ec681f3SmrgLepton Wu (3): 23987ec681f3Smrg 23997ec681f3Smrg- gallium: Reset {d,r}Priv in dri_unbind_context 24007ec681f3Smrg- i965: Enable RGBX8888_SRGB format. 24017ec681f3Smrg- virgl: Add an option to disable coherent 24027ec681f3Smrg 24037ec681f3SmrgLionel Landwerlin (67): 24047ec681f3Smrg 24057ec681f3Smrg- isl: fix mapping of format->stringname 24067ec681f3Smrg- loader/dri3: create linear buffer with scanout support 24077ec681f3Smrg- nir/lower_shader_calls: adding missing stack offset alignment 24087ec681f3Smrg- anv: fix submission batching with perf queries 24097ec681f3Smrg- drm-shim: implement stat/fstat when xstat variants are not there 24107ec681f3Smrg- intel/disasm: fix missing oword index decoding 24117ec681f3Smrg- anv: don't try to access Android swapchains 24127ec681f3Smrg- nir/lower_shader_calls: remove empty phis 24137ec681f3Smrg- anv/android: handle image bindings from gralloc buffers 24147ec681f3Smrg- genxml: add more INSTDONE registers for Gfx12.5 24157ec681f3Smrg- intel/error-decode: printout more registers 24167ec681f3Smrg- nir: prevent peephole from generating invalid NIR 24177ec681f3Smrg- intel/fs: fix framebuffer reads 24187ec681f3Smrg- microsoft/clc: small compile fix on Linux 24197ec681f3Smrg- clc: use the defined version for the parser 24207ec681f3Smrg- spirv: don't fail on CapabilitySubgroupDispatch if supported 24217ec681f3Smrg- spirv: avoid shadowing local variable 24227ec681f3Smrg- spirv: workaround LLVM-SPIRV Undef variable initializers 24237ec681f3Smrg- spirv: don't bother initializing variables to Undef 24247ec681f3Smrg- microsoft/clc: drop LLVM dependency to version < 12 24257ec681f3Smrg- nir: fix opt_memcpy src/dst mixup 24267ec681f3Smrg- spirv: switch Groups capability to non AMD specific field 24277ec681f3Smrg- microsoft/clc: drop MSVC specific function 24287ec681f3Smrg- microsoft/clc: fix compiler warning on uninitiailzed variable use 24297ec681f3Smrg- meson: extract libversion checks from clc & clover 24307ec681f3Smrg- anv: honor INTEL_DEBUG=sync 24317ec681f3Smrg- clc: add allowed extension for compile parameter 24327ec681f3Smrg- clc: print warnings/errors on their own line 24337ec681f3Smrg- clc: let user specify the targetted SPIRV version 24347ec681f3Smrg- anv: enable UBO indexing 24357ec681f3Smrg- intel/compiler: add missing line returns to logs 24367ec681f3Smrg- anv: remove redundant VertexURBEntryReadLength setting 24377ec681f3Smrg- nir/lower_io: preserve all metadata when no progress 24387ec681f3Smrg- anv: move GetBufferMemoryRequirement with other buffer functions 24397ec681f3Smrg- anv: implement vkGetDeviceBufferMemoryRequirementsKHR 24407ec681f3Smrg- anv: remove unused function 24417ec681f3Smrg- anv: move VkImage object allocation to anv_CreateImage 24427ec681f3Smrg- anv: implement vkGetDeviceImageMemoryRequirementsKHR 24437ec681f3Smrg- anv: implement vkGetDeviceImageSparseMemoryRequirementsKHR 24447ec681f3Smrg- anv: enable VK_KHR_maintenance4 24457ec681f3Smrg- vulkan: put generated defines into their own header 24467ec681f3Smrg- vulkan: handle new VK_KHR_synchronization2 image layouts 24477ec681f3Smrg- vulkan: remove unused VkCommand 24487ec681f3Smrg- vulkan/util: generate define for a selected few enums 24497ec681f3Smrg- vulkan: implement legacy entrypoints on top of VK_KHR_synchronization2 24507ec681f3Smrg- anv: add missing transition handling bits 24517ec681f3Smrg- anv: make semaphore helper work on a single object 24527ec681f3Smrg- anv: improve readability of pipelined states 24537ec681f3Smrg- anv: implement VK_KHR_synchronization2 24547ec681f3Smrg- spirv: deal with null pointers 24557ec681f3Smrg- anv: switch to use VkFormatFeatureFlags2KHR internally 24567ec681f3Smrg- intel/nir: allow unknown format in lowering of storage images 24577ec681f3Smrg- anv: start computing KHR_format_features2 flags for storage images 24587ec681f3Smrg- anv: implement VK_KHR_format_feature_flags2 24597ec681f3Smrg- anv: fill correct surface state for lowered storage image 24607ec681f3Smrg- isl: only bump the min row pitch for display when not specified 24617ec681f3Smrg- vulkan/wsi/wayland: don't expose surface formats not fully supported 24627ec681f3Smrg- anv: fix push constant lowering with bindless shaders 24637ec681f3Smrg- intel/dev: fix HSW GT3 number of subslices in slice1 24647ec681f3Smrg- intel/dev: don't forget to set max_eu_per_subslice in generated topology 24657ec681f3Smrg- intel/dev: reuse internal functions to set mask 24667ec681f3Smrg- intel/dev: fix subslice/eu total computations with some fused configurations 24677ec681f3Smrg- intel/perf: fix perf equation subslice mask generation for gfx12+ 24687ec681f3Smrg- intel/devinfo: fix wrong offset computation 24697ec681f3Smrg- intel: remove 2 preproduction pci-id for ADLS 24707ec681f3Smrg- anv: don't forget to add scratch buffer to BO list 24717ec681f3Smrg- anv: fix multiple wait/signal on same binary semaphore 24727ec681f3Smrg 24737ec681f3SmrgLiviu Prodea (1): 24747ec681f3Smrg 24757ec681f3Smrg- ci: Add osmesa to Windows GitLab CI 24767ec681f3Smrg 24777ec681f3SmrgLone_Wolf (1): 24787ec681f3Smrg 24797ec681f3Smrg- clover: TargetRegistry.h was moved to another folder 24807ec681f3Smrg 24817ec681f3SmrgLucas Stach (2): 24827ec681f3Smrg 24837ec681f3Smrg- renderonly: don't complain when GPU import fails 24847ec681f3Smrg- etnaviv: always try to create KMS side handles for imported resources 24857ec681f3Smrg 24867ec681f3SmrgLuis Felipe Strano Moraes (2): 24877ec681f3Smrg 24887ec681f3Smrg- docs: Clean up environment variable docs for Intel drivers. 24897ec681f3Smrg- docs: Add documentation regarding INTEL_MEASURE to envvars doc. 24907ec681f3Smrg 24917ec681f3SmrgM Henning (1): 24927ec681f3Smrg 24937ec681f3Smrg- nouveau: Support nir_intrinsic_*_atomic_fadd 24947ec681f3Smrg 24957ec681f3SmrgManiraj D (1): 24967ec681f3Smrg 24977ec681f3Smrg- egl: set TSD as NULL after deinit 24987ec681f3Smrg 24997ec681f3SmrgMao, Marc (1): 25007ec681f3Smrg 25017ec681f3Smrg- iris: declare padding for iris_vue_prog_key 25027ec681f3Smrg 25037ec681f3SmrgMarcin Ślusarz (51): 25047ec681f3Smrg 25057ec681f3Smrg- intel/tools/aubinator_error_decode: tag hanging instruction 25067ec681f3Smrg- anv: share some code between vkCmdDrawIndirectCount and vkCmdDrawIndexedIndirectCount 25077ec681f3Smrg- glsl: evaluate switch expression once 25087ec681f3Smrg- nir/builder: invalidate metadata per function 25097ec681f3Smrg- intel/compiler: use nir_shader_instructions_pass in brw_nir_apply_attribute_workarounds 25107ec681f3Smrg- d3d12: use nir_metadata_none instead of its value 25117ec681f3Smrg- microsoft/clc: preserve only valid metadata in clc_lower_printf_base 25127ec681f3Smrg- microsoft/clc: use nir_shader_instructions_pass in clc_nir_dedupe_const_samplers 25137ec681f3Smrg- microsoft/compiler: preserve all metadata when upcast_phi doesn't make progress 25147ec681f3Smrg- microsoft/compiler: use nir_shader_instructions_pass in dxil_nir_split_clip_cull_distance 25157ec681f3Smrg- microsoft/compiler: use nir_shader_instructions_pass in dxil_nir_lower_double_math 25167ec681f3Smrg- zink: use nir_shader_instructions_pass in lower_discard_if 25177ec681f3Smrg- zink: use nir_shader_instructions_pass in nir_lower_dynamic_bo_access 25187ec681f3Smrg- genxml: add INSTDONE_GEOM register for Gfx12.5 25197ec681f3Smrg- intel/error-decode: printout INSTDONE_GEOM register for Gfx12.5 25207ec681f3Smrg- glsl/opt_algebraic: disable invalid optimization 25217ec681f3Smrg- glsl: refactor code to avoid static analyzer noise 25227ec681f3Smrg- freedreno/ir3: use nir_metadata_none instead of its value 25237ec681f3Smrg- r600: use nir_shader_instructions_pass in r600_nir_lower_atomics 25247ec681f3Smrg- r600: preserve all metadata when passes don't make progress 25257ec681f3Smrg- turnip: use nir_shader_instructions_pass in tu_lower_io 25267ec681f3Smrg- intel/compiler: INT DIV function does not support source modifiers 25277ec681f3Smrg- vulkan/wsi/x11: fix shm allocation control flow issue 25287ec681f3Smrg- glsl: propagate errors from \*=, /=, +=, -= operators 25297ec681f3Smrg- glsl: break out early if compound assignment's operand errored out 25307ec681f3Smrg- crocus: drop redundant unlikely's around INTEL_DEBUG 25317ec681f3Smrg- intel/compiler: drop redundant likely's around INTEL_DEBUG 25327ec681f3Smrg- anv: drop redundant unlikely's around INTEL_DEBUG 25337ec681f3Smrg- lima: use nir_shader_instructions_pass in lima_nir_split_load_input 25347ec681f3Smrg- anv: Set graphics pipeline active_stages earlier 25357ec681f3Smrg- anv: Use input assembly state only when pipeline has vertex stage 25367ec681f3Smrg- intel/compiler: use nir_shader_instructions_pass in brw_nir_demote_sample_qualifiers 25377ec681f3Smrg- intel/compiler: use nir_shader_instructions_pass in brw_nir_clamp_image_1d_2d_array_sizes 25387ec681f3Smrg- intel/compiler: use nir_shader_instructions_pass in brw_nir_lower_conversions 25397ec681f3Smrg- intel/compiler: use nir_shader_instructions_pass in brw_nir_lower_mem_access_bit_sizes 25407ec681f3Smrg- intel/compiler: use nir_shader_instructions_pass in brw_nir_lower_scoped_barriers 25417ec681f3Smrg- intel/compiler: use nir_shader_instructions_pass in brw_nir_lower_storage_image 25427ec681f3Smrg- intel/compiler: use nir_shader_instructions_pass in brw_nir_opt_peephole_ffma 25437ec681f3Smrg- intel/compiler: use nir_metadata_none instead of its value 25447ec681f3Smrg- anv: use nir_shader_instructions_pass in anv_nir_add_base_work_group_id 25457ec681f3Smrg- anv: use nir_shader_instructions_pass in anv_nir_lower_ycbcr_textures 25467ec681f3Smrg- anv: preserve all metadata when anv_nir_lower_multiview doesn't make progress 25477ec681f3Smrg- glsl: preserve all metadata when lower_buffer_interface_derefs doesn't make progress 25487ec681f3Smrg- nir: preserve all metadata when nir_lower_int_to_float doesn't make progress 25497ec681f3Smrg- nir: preserve all metadata when nir_propagate_invariant doesn't make progress 25507ec681f3Smrg- nir: preserve all metadata when nir_opt_vectorize doesn't make progress 25517ec681f3Smrg- anv: allocate zeroed device object 25527ec681f3Smrg- nir/print: pad 64-bit constants with zeroes 25537ec681f3Smrg- anv: fix potential integer overflow 25547ec681f3Smrg- iris: fix scratch address patching for TESS_EVAL stage 25557ec681f3Smrg- intel: fix INTEL_DEBUG environment variable on 32-bit systems 25567ec681f3Smrg 25577ec681f3SmrgMarek Olšák (211): 25587ec681f3Smrg 25597ec681f3Smrg- radeonsi: don't expose no-attachment MSAA 16x on all 1 RB chips due to issues 25607ec681f3Smrg- radeonsi: document a missing synchronization for bindless textures 25617ec681f3Smrg- st/mesa: inline st_setup_arrays on MSVC too by adding a wrapper 25627ec681f3Smrg- mesa: remove unused drawid_offset parameter from DrawGalliumMultiMode 25637ec681f3Smrg- mesa: fix incorrect comment in draw_gallium_multimode 25647ec681f3Smrg- st/mesa: always use PIPE_USAGE_STAGING for GL_MAP_READ_BIT usage 25657ec681f3Smrg- shader_enums,mesa: move VERT_ATTRIB_EDGEFLAG to slot 31 for st/mesa 25667ec681f3Smrg- gallium: change pipe_vertex_element::src_format to uint8_t 25677ec681f3Smrg- gallium: add multi-component 64-bit UINT formats for raw double vertex attribs 25687ec681f3Smrg- gallium: add pipe_vertex_element::dual_slot to move lowering to CSO creation 25697ec681f3Smrg- gallium: lower raw 64-bit vertex formats in cso/vbuf instead of st/mesa 25707ec681f3Smrg- st/mesa: remove lowering of 64-bit vertex attribs to 32 bits 25717ec681f3Smrg- st/mesa: remove st_vertex_program::index_to_input 25727ec681f3Smrg- st/mesa: remove st_vertex_program::input_to_index 25737ec681f3Smrg- radeonsi: improve viewperf snx performance by forcing staging for VRAM buffers 25747ec681f3Smrg- gallium: simplify VRAM uploads by adding PIPE_RESOURCE_FLAG_DONT_MAP_DIRECTLY 25757ec681f3Smrg- gallium/noop: implement fences 25767ec681f3Smrg- gallium/noop: implement shader buffers and shader images 25777ec681f3Smrg- gallium/noop: use threaded_query 25787ec681f3Smrg- gallium/noop: use threaded_resource 25797ec681f3Smrg- gallium/noop: use threaded_transfer 25807ec681f3Smrg- gallium/noop: enable threaded_context to test TC overhead without a driver 25817ec681f3Smrg- gallium/noop: update pipe_screen::num_contexts 25827ec681f3Smrg- gallium/noop: implement a lot of missing screen functions 25837ec681f3Smrg- gallium/noop: implement a lot of missing context functions 25847ec681f3Smrg- radeonsi: allow arbitrary swizzle modes for displayable DCC 25857ec681f3Smrg- radv: allow arbitrary swizzle modes for displayable DCC 25867ec681f3Smrg- ac/surface: allow arbitrary swizzle modes for displayable DCC 25877ec681f3Smrg- gallium: add take_ownership into set_sampler_views to skip reference counting 25887ec681f3Smrg- st/mesa: set take_ownership = true in set_sampler_views 25897ec681f3Smrg- st/mesa: move handling CubeMapSeamless into st_convert_sampler where it belongs 25907ec681f3Smrg- gallium: remove vertices_per_patch, add pipe_context::set_patch_vertices 25917ec681f3Smrg- radeonsi: remove vertices_per_patch parameter from draw-related functions 25927ec681f3Smrg- frontend/dri: add environment variable DRI_NO_MSAA for performance comparisons 25937ec681f3Smrg- gallium: use a packed enum to make pipe_prim_mode 1-byte large with __GNUC__ 25947ec681f3Smrg- gallium: change pipe_draw_info::mode to uint8_t on MSVC to make it 1 byte large 25957ec681f3Smrg- glthread: implement glGetUniformLocation without syncing 25967ec681f3Smrg- meson: add missing custom target to generate shader_replacement.h 25977ec681f3Smrg- mesa: add environment variable MESA_NO_SHADER_REPLACEMENT 25987ec681f3Smrg- util/cpu_detect: print num_L3_caches and num_cpu_mask_bits 25997ec681f3Smrg- util/cpu_detect: add/guess support for next Zen CPUs 26007ec681f3Smrg- vbo: merge draws with GL_LINES regardless of line stippling 26017ec681f3Smrg- vbo: check more GL errors when drawing via glCallList 26027ec681f3Smrg- mesa: remove unused indices parameter from validate functions 26037ec681f3Smrg- mesa: fix gl_DrawID with indirect multi draws using user indirect buffer 26047ec681f3Smrg- mesa: skip draw calls with unaligned indices 26057ec681f3Smrg- radeonsi: remove unused depth_clamp_any 26067ec681f3Smrg- radeonsi: remove instancing support from the prim discard compute shader 26077ec681f3Smrg- radeonsi: remove stages_key parameter from si_shader_selector_key 26087ec681f3Smrg- radeonsi: move si_vgt_stages_key determination into si_update_vgt_shader_config 26097ec681f3Smrg- radeonsi: move as_ls/es/ngg setting out of si_shader_selector_key 26107ec681f3Smrg- radeonsi: inline si_get_alpha_test_func 26117ec681f3Smrg- radeonsi: stop using AC_EXP_PARAM_UNDEFINED because it's not useful 26127ec681f3Smrg- radeonsi: use memcmp and radeon_emit_array in radeon_opt_set_context_regn 26137ec681f3Smrg- radeonsi: correctly use cs instead of gfx_cs in build pm4 helpers 26147ec681f3Smrg- radeonsi: simplify memory usage checking by merging vram and gtt counters 26157ec681f3Smrg- radeonsi: inline remaining big functions in draw_vbo for better snx perf 26167ec681f3Smrg- radeonsi: simplify si_need_gfx_cs_space 26177ec681f3Smrg- winsys/amdgpu: clean up amdgpu_cs_check_space 26187ec681f3Smrg- radeonsi: inline si_need_gfx_cs_space 26197ec681f3Smrg- radeonsi: don't use SQ_NON_EVENT before GE_PC_ALLOC for better perf on Navi1x 26207ec681f3Smrg- radeonsi: add si_print_current_ib function for debugging 26217ec681f3Smrg- ac/debug: add an option to disable colors for printed IBs 26227ec681f3Smrg- radeonsi: fix a memory leak in si_get_shader_binary_size 26237ec681f3Smrg- radeonsi: set gfx10 registers better in si_emit_initial_compute_regs 26247ec681f3Smrg- ac/gpu_info: fix detection of smart access memory 26257ec681f3Smrg- radeonsi: disable DCC stores on Navi12-14 for displayable DCC to fix corruption 26267ec681f3Smrg- radeonsi: enable DCC stores for clear_render_target on gfx10 26277ec681f3Smrg- radeonsi: add missing make_CB_shader_coherent for DCC stores into copy_image 26287ec681f3Smrg- radeonsi: handle pipe_aligned in compute_expand_fmask 26297ec681f3Smrg- radeonsi: rename DCC_WRITE -> ALLOW_DCC_STORE 26307ec681f3Smrg- radeonsi: track displayable_dcc_dirty for non-compute shaders 26317ec681f3Smrg- radeonsi: enable DCC stores on gfx10.3 APUs for better performance 26327ec681f3Smrg- radeonsi: clean up typecasts in compute_copy_image 26337ec681f3Smrg- ac/llvm: remove load_tess_coord callback 26347ec681f3Smrg- ac/llvm: implement a bunch of NIR AMD intrinsics for NGG 26357ec681f3Smrg- ac: remove needless parameters from ac_shader_abi::emit_outputs 26367ec681f3Smrg- ac: make ac_shader_abi::inputs an array instead of a pointer 26377ec681f3Smrg- ac/llvm: implement nir_intrinsic_overwrite_*_arguments_amd 26387ec681f3Smrg- ac/llvm: implement nir_intrinsic_elect 26397ec681f3Smrg- ac,radeonsi: load VS inputs at the call site of nir_intrinsic_load_input 26407ec681f3Smrg- ac,radv: remove unused inputs array and VS input code 26417ec681f3Smrg- radeonsi: don't set prefer_mono for fetched instance divisors 26427ec681f3Smrg- radeonsi: ignore the vertex element count in si_shader_selector_key_vs 26437ec681f3Smrg- radeonsi: accurately check if instance divisors need a VS update 26447ec681f3Smrg- radeonsi: don't update shaders if only the vertex element count changes 26457ec681f3Smrg- radeonsi: correct index_bias_varies usage 26467ec681f3Smrg- radeonsi: remove the primitive discard compute shader 26477ec681f3Smrg- winsys/amdgpu: precompute amdgpu_ib_max_submit_dwords 26487ec681f3Smrg- radeonsi: reduce the frequency of switching GS fast launch on/off 26497ec681f3Smrg- radeonsi: strengthen the VGT_FLUSH condition in begin_new_gfx_cs 26507ec681f3Smrg- radeonsi: skip setting some PGM_HI registers by switching to 32-bit addresses 26517ec681f3Smrg- winsys/amdgpu: include CS ioctl overhead in RADEON_NOOP 26527ec681f3Smrg- radeonsi: enable shader-based prim culling with polygon mode 26537ec681f3Smrg- radeonsi: remove a few fields from si_state_rasterizer 26547ec681f3Smrg- radeonsi: don't emit PA_SU_POLY_OFFSET_CLAMP if it has no effect 26557ec681f3Smrg- radeonsi: add AMD_DEBUG=ib to print IBs 26567ec681f3Smrg- radeonsi: don't use NGG passthrough if culling is possible for better perf 26577ec681f3Smrg- radeonsi: fix DCC image stores with display DCC 26587ec681f3Smrg- radeonsi: copy a few nir_shader_compiler_options from RADV 26597ec681f3Smrg- driconf: remove leftover code for allow_incorrect_primitive_id 26607ec681f3Smrg- radeonsi: fix DCC image stores with image descriptors in user SGPRs 26617ec681f3Smrg- radeonsi: add const to the key parameter in si_shader_select_with_key 26627ec681f3Smrg- radeonsi: handle NO_OPT_VARIANT in si_shader_select_with_key 26637ec681f3Smrg- radeonsi: sink memsets and disable uniform inlining in si_shader_selector_key 26647ec681f3Smrg- radeonsi: move PS shader key code into a separate function 26657ec681f3Smrg- radeonsi: don't memset mono and opt in si_update_ps_shader_key 26667ec681f3Smrg- radeonsi: don't memset part in si_update_ps_shader_key 26677ec681f3Smrg- radeonsi: divide si_update_ps_shader_key into many separate functions 26687ec681f3Smrg- radeonsi: ignore blitter when computing the PS shader key 26697ec681f3Smrg- radeonsi: update most of the PS shader key in set & bind functions 26707ec681f3Smrg- radeonsi: clean up and clear VS shader key fields related to outputs 26717ec681f3Smrg- radeonsi: update the VS shader key in set & bind functions and remove memsets 26727ec681f3Smrg- radeonsi: rewrite inlinable uniform states for shader keys in si_context 26737ec681f3Smrg- radeonsi: move si_shader_io_get_unique_index calls out of si_get_vs_key_outputs 26747ec681f3Smrg- radeonsi: move PS inputs_read computation out of si_get_vs_key_outputs 26757ec681f3Smrg- radeonsi: unset SI_PREFETCH_* only when we unbind pm4 shader states 26767ec681f3Smrg- radeonsi: make si_update_shaders a C++ template in si_state_draw.cpp 26777ec681f3Smrg- radeonsi: optimize scratch buffer size updates using C++ template arguments 26787ec681f3Smrg- radeonsi: check flatshade and sprite_coord_enable for spi_map in bind_rs_state 26797ec681f3Smrg- radeonsi: move DB_SHADER_CONTROL update for PS out of si_update_shaders 26807ec681f3Smrg- radeonsi: move flat shading VRS enablement out of si_update_shaders 26817ec681f3Smrg- radeonsi: precompute si_vgt_stages_key for NGG in si_shader 26827ec681f3Smrg- radeonsi: deduplicate si_compiler_ctx_state initialization 26837ec681f3Smrg- radeonsi: determine num_vbos_in_user_sgprs from template arguments in draw_vbo 26847ec681f3Smrg- radeonsi: eliminate a not-found conditional for PrimID in si_get_ps_input_cntl 26857ec681f3Smrg- radeonsi: force flat for PrimID early in si_nir_scan_shader 26867ec681f3Smrg- radeonsi: restructure si_get_ps_input_cntl for future refactoring 26877ec681f3Smrg- radeonsi: interleave si_shader_info::input_* in memory for faster emit_spi_map 26887ec681f3Smrg- radeonsi: precompute num_interp for si_emit_spi_map 26897ec681f3Smrg- radeonsi: simplify si_emit_spi_map for back-face colors 26907ec681f3Smrg- radeonsi: inline si_get_ps_input_cntl because it has only one use 26917ec681f3Smrg- radeonsi: unroll loops in si_emit_spi_map using 33 C++ template instantiations 26927ec681f3Smrg- radeonsi: precompute more spi_map code 26937ec681f3Smrg- radeonsi: set prefer_mono outside of si_shader_selector_key 26947ec681f3Smrg- radeonsi: move setting most TCS shader key fields out of si_shader_selector_key 26957ec681f3Smrg- radeonsi: move setting one GS shader key field out of si_shader_selector_key 26967ec681f3Smrg- radeonsi: put si_pm4_state at the beginning of si_shader 26977ec681f3Smrg- radeonsi: eliminate redundant SPI_SHADER_PGM_RSRC3/4_GS register writes 26987ec681f3Smrg- radeonsi: convert gfx10_emit_ge_pc_alloc to radeon_opt_set_uconfig_reg 26997ec681f3Smrg- radeonsi: use a trick to extract and pack edgeflags using fewer instructions 27007ec681f3Smrg- radeonsi: don't set edgeflags for TES and blit VS 27017ec681f3Smrg- radeonsi: fix incorrect comments about VGT_SHADER_STAGES_EN 27027ec681f3Smrg- radeonsi: enable NGG passthrough when LDS is used, document the real constraints 27037ec681f3Smrg- radeonsi: remove the unused cs parameter from radeon_emit 27047ec681f3Smrg- radeonsi: remove the unused cs parameter from radeon_emit_array 27057ec681f3Smrg- radeonsi: remove the unused cs parameter from radeon_set_(config|context)_reg 27067ec681f3Smrg- radeonsi: remove the unused cs parameter from radeon_set_sh_reg 27077ec681f3Smrg- radeonsi: remove the unused cs parameter from radeon_set_uconfig_reg 27087ec681f3Smrg- radeonsi: remove the unused cs parameter from remaining packet functions 27097ec681f3Smrg- ac/surface: use DCC compatible with image stores for < 4K resolutions 27107ec681f3Smrg- ac/surface: correct a comment about DCC image stores 27117ec681f3Smrg- radeonsi: fix a depth texturing performance regression on gfx6-7 27127ec681f3Smrg- radeonsi: change the units of oversub_pc_factor to integer multiples of 1/4 27137ec681f3Smrg- radeonsi: decrease vertex count threshold for shader culling to 128 27147ec681f3Smrg- radeonsi: set vs_uses_base_instance using C++ template arguments 27157ec681f3Smrg- radeonsi: use the optimal draw packet sequence for VGT_FLUSH 27167ec681f3Smrg- radeonsi: reduce NGG culling on/off transitions by keeping it enabled 27177ec681f3Smrg- radeonsi: clean prefer_mono for the blit VS 27187ec681f3Smrg- radeonsi: don't check ngg_culling != 0 for fast launch because it's tautology 27197ec681f3Smrg- ac/gpu_info: fix the comment for the NGG->legacy transition bug 27207ec681f3Smrg- radeonsi: strenthen the ngg->legacy hw workaround, fix fast launch hangs too 27217ec681f3Smrg- radeonsi: fix clearing index_size for NGG fast launch 27227ec681f3Smrg- radeonsi: disallow NGG fast launch on Navi1x because VGT_FLUSH makes it slower 27237ec681f3Smrg- ac/llvm: pass cull options into cull_bbox directly 27247ec681f3Smrg- radeonsi: always use the correct number of vertices in NGG shader code 27257ec681f3Smrg- radeonsi: add gfx10 helpers for determining whether edgeflags are enabled 27267ec681f3Smrg- ac/llvm: rename ac_cull_triangle -> ac_cull_primitive 27277ec681f3Smrg- radeonsi: implement shader-based culling for lines 27287ec681f3Smrg- radeonsi: don't set DX10_DIAMOND_TEST_ENA for better performance 27297ec681f3Smrg- util: add util_popcnt_inline_asm 27307ec681f3Smrg- util: import u_debug_refcnt, u_hash_table, u_debug_describe from gallium 27317ec681f3Smrg- gallium/util: make pipe_vertex_buffer_reference safe for hashing dst 27327ec681f3Smrg- gallium: add pipe_vertex_state and draw_vertex_state for display lists 27337ec681f3Smrg- gallium/u_threaded: implement draw_vertex_state 27347ec681f3Smrg- gallium/trace: add pipe_vertex_state support 27357ec681f3Smrg- gallium/util: add util_vertex_state_cache for deduplicating the states 27367ec681f3Smrg- st/mesa: add ST_PIPELINE_RENDER_NO_VARRAYS, for future display list support 27377ec681f3Smrg- st/mesa: make setup_arrays more reusable for future display list support 27387ec681f3Smrg- mesa: use pipe_vertex_state in vbo and st/mesa for lower display list overhead 27397ec681f3Smrg- radeonsi: separate VBO descriptor code into a new function (for future work) 27407ec681f3Smrg- radeonsi: implement draw_vertex_state for lower display list overhead 27417ec681f3Smrg- ac/surface: don't overwrite DCC settings for imported buffers 27427ec681f3Smrg- ac/surface: enable DCC image stores for all displayable DCC on gfx10.3 27437ec681f3Smrg- mesa: add missing unlock_texture into generate_texture_mipmap 27447ec681f3Smrg- util/slab: use simple_mtx_t 27457ec681f3Smrg- util/queue: use simple_mtx_t for finish_lock 27467ec681f3Smrg- gallium/pb_cache: use simple_mtx_t 27477ec681f3Smrg- gallium/pb_slab: use simple_mtx_t 27487ec681f3Smrg- mesa: use simple_mtx_t for TexMutex 27497ec681f3Smrg- mesa: use simple_mtx_t for ShaderIncludeMutex 27507ec681f3Smrg- gallium/u_threaded: fix draw_vertex_state with multi draws 27517ec681f3Smrg- radeonsi: fix a leak in draw_vertex_state if threaded_context is disabled 27527ec681f3Smrg- radeonsi: remove duplicate partial_count variable 27537ec681f3Smrg- radeonsi: add back a workaround for DCC MSAA on gfx9 due to conformance issues 27547ec681f3Smrg- radeonsi: remove GS fast launch 27557ec681f3Smrg- util,gallium: put count in pipe_resource & sampler_view on its own cache line 27567ec681f3Smrg- radeonsi: align pipe_resource & sampler_view allocations to a cache line 27577ec681f3Smrg- radeonsi: fix an out-of-bounds access in si_create_vertex_state 27587ec681f3Smrg- ac/surface: always use suboptimal display DCC with DRM <= 3.43.0 27597ec681f3Smrg- ac/surface: disallow display DCC for big resolutions 27607ec681f3Smrg- ac/surface: enable better display DCC for chips newer than Yellow Carp 27617ec681f3Smrg- radeonsi: simplify how VS_OUT_CCDIST is set 27627ec681f3Smrg- radeonsi: simplify write_psize code in si_get_vs_out_cntl 27637ec681f3Smrg- mesa: fix crashes in the no_error path of glUniform 27647ec681f3Smrg- st/mesa: don't crash when draw indirect buffer has no storage 27657ec681f3Smrg- radeonsi: enable shader culling for indirect draws 27667ec681f3Smrg- radeonsi: print the border color error message only once 27677ec681f3Smrg- radeonsi: fix 2 issues with depth_cleared_level_mask 27687ec681f3Smrg- radeonsi: fix a typo preventing a fast depth-stencil clear 27697ec681f3Smrg- driconf: disallow 10-bit pbuffers for viewperf2020/maya due to X errors 27707ec681f3Smrg 27717ec681f3SmrgMarek Vasut (2): 27727ec681f3Smrg 27737ec681f3Smrg- freedreno: a2xx: Handle samplerExternalOES like sampler2D 27747ec681f3Smrg- freedreno: Handle timeout == PIPE_TIMEOUT_INFINITE and rollover 27757ec681f3Smrg 27767ec681f3SmrgMarijn Suijten (1): 27777ec681f3Smrg 27787ec681f3Smrg- freedreno: Enable Adreno 508, 509 and 512 27797ec681f3Smrg 27807ec681f3SmrgMark Janes (3): 27817ec681f3Smrg 27827ec681f3Smrg- anv: Use local memory for block pool BO 27837ec681f3Smrg- anv: Allocate workaround buffer in local memory if present 27847ec681f3Smrg- anv: warn if system memory is used 27857ec681f3Smrg 27867ec681f3SmrgMartin Krastev (2): 27877ec681f3Smrg 27887ec681f3Smrg- svga: enable DRM mks-stats via hooking to the corresponding DRM ioctls 27897ec681f3Smrg- meson: introduce option vmware-mks-stats controlling the instrumentations of gallium svga driver 27907ec681f3Smrg 27917ec681f3SmrgMartin Roukala (néé Peres) (1): 27927ec681f3Smrg 27937ec681f3Smrg- radv/ci: mark some tests as flaky on gfx9 27947ec681f3Smrg 27957ec681f3SmrgMatt Turner (5): 27967ec681f3Smrg 27977ec681f3Smrg- tu: Raise maxDescriptorSetUpdateAfterBindUniformBuffersDynamic to 16 27987ec681f3Smrg- util: Add unit tests for dag 27997ec681f3Smrg- util: Replace recursive DFS with iterative implementation 28007ec681f3Smrg- tu: Free device->bo_idx and device->bo_list on init failure 28017ec681f3Smrg- tu: Enable VK_KHR_uniform_buffer_standard_layout 28027ec681f3Smrg 28037ec681f3SmrgMichael Tang (11): 28047ec681f3Smrg 28057ec681f3Smrg- spirv_to_dxil: expose version number 28067ec681f3Smrg- spirv_to_dxil: Run nir_lower_tex during compilation 28077ec681f3Smrg- microsoft/compiler: Add support for SV_SampleIndex intrinsic 28087ec681f3Smrg- microsoft/compiler: More robustly handle setting Register=-1 28097ec681f3Smrg- microsoft/compiler: Set the SampleFrequency runtime metadata 28107ec681f3Smrg- microsoft/compiler: Emit a flat interpolation method for SV_SampleIndex 28117ec681f3Smrg- microsoft/compiler: Miscellaneous fixes from running clang-format 28127ec681f3Smrg- microsoft/spirv_to_dxil: Add \`install : true` to spirv_to_dxil library. 28137ec681f3Smrg- gallium/d3d12: move d3d12_lower_bool_input to microsoft/compiler 28147ec681f3Smrg- microsoft/spirv_to_dxil: use dxil_nir_lower_bool_input pass 28157ec681f3Smrg- microsoft/spirv_to_dxil: turn sysvals into input varyings 28167ec681f3Smrg 28177ec681f3SmrgMichel Dänzer (2): 28187ec681f3Smrg 28197ec681f3Smrg- ci: Drop "success" job 28207ec681f3Smrg- ci: Put all container related jobs in a single stage 28217ec681f3Smrg 28227ec681f3SmrgMichel Zou (6): 28237ec681f3Smrg 28247ec681f3Smrg- zink: Fix unused-variable warning 28257ec681f3Smrg- meson: dont use missing dumpbin path 28267ec681f3Smrg- radv: fix build with mingw 28277ec681f3Smrg- lavapipe: fix missing VKAPI_CALL attribute 28287ec681f3Smrg- wgl: fix 32 bits mingw exports 28297ec681f3Smrg- docs: mark off missing lavapipe exts 28307ec681f3Smrg 28317ec681f3SmrgMike Blumenkrantz (480): 28327ec681f3Smrg 28337ec681f3Smrg- zink: improve detection for broken drawids 28347ec681f3Smrg- lavapipe: increment drawid for multidraws 28357ec681f3Smrg- radv: merge si_write_viewport into radv_emit_viewport 28367ec681f3Smrg- radv: pre-calculate viewport transforms 28377ec681f3Smrg- radv: remove unused variable from radv_emit_viewport 28387ec681f3Smrg- lavapipe: don't read line stipple info in pipeline creation if stipple is disabled 28397ec681f3Smrg- util/tc: make clear calls async 28407ec681f3Smrg- util/foz: stop crashing on destroy if prepare hasn't been called 28417ec681f3Smrg- lavapipe: add a padding member to rendering_state 28427ec681f3Smrg- lavapipe: implement VK_EXT_color_write_enable 28437ec681f3Smrg- features: VK_EXT_color_write_enable for lavapipe 28447ec681f3Smrg- zink: check for dedicated allocation requirements during image alloc 28457ec681f3Smrg- zink: hook up VK_KHR_dedicated_allocation 28467ec681f3Smrg- zink: optimize shader recalc 28477ec681f3Smrg- zink: ifdef out some context prototypes/inlines for c++ compile 28487ec681f3Smrg- zink: start adding C++ draw templates 28497ec681f3Smrg- zink: add draw template for dynamic state 28507ec681f3Smrg- zink: make descriptors_update hook return a bool if a flush occurred 28517ec681f3Smrg- zink: if descriptor updating flushes, re-call draw/compute 28527ec681f3Smrg- zink: add template for starting new cmdbuf 28537ec681f3Smrg- zink: split pipeline_changed to use template value separately 28547ec681f3Smrg- zink: stop flagging pipeline dirty for line width changes 28557ec681f3Smrg- zink: don't rebind vertex buffers if pipeline changes 28567ec681f3Smrg- zink: add a ctx flag for drawid reading 28577ec681f3Smrg- zink: flatten descriptor_refs_dirty into BATCH_CHANGED template 28587ec681f3Smrg- zink: use drawid_offset directly during draw 28597ec681f3Smrg- zink: add a ctx flag for shader reading basevertex 28607ec681f3Smrg- zink: remove screen info stuff from draw templates 28617ec681f3Smrg- zink: add changed flag for blend states 28627ec681f3Smrg- util/tc: add a util function for setting bytes_mapped_limit 28637ec681f3Smrg- radeonsi: use new tc util for setting bytes_mapped_limit 28647ec681f3Smrg- zink: use new tc util for setting bytes_mapped_limit 28657ec681f3Smrg- freedreno: use new tc util for setting bytes_mapped_limit 28667ec681f3Smrg- nir/lower_point_size_mov: zero nir_state_slot::swizzle in new variable 28677ec681f3Smrg- gallium: add pipe_sampler_state::pad member 28687ec681f3Smrg- lavapipe: add support for anisotropic texturing 28697ec681f3Smrg- nir: add nir_imm_ivec3 builder 28707ec681f3Smrg- zink: add mechanism for generating VkBuffers for rebinding 28717ec681f3Smrg- zink: change vbo_bind_count to a mask of slots 28727ec681f3Smrg- zink: handle vertex buffer offset overflows 28737ec681f3Smrg- zink: split and move maybe_flush_or_stall mechanic 28747ec681f3Smrg- zink: split draw_count checking to local variable 28757ec681f3Smrg- zink: make zink_end_render_pass public 28767ec681f3Smrg- zink: make batch_rp and norp static inlines 28777ec681f3Smrg- zink: use a local var for draw mode during draw 28787ec681f3Smrg- zink: add a param to check_batch_completion for toggling lock-taking 28797ec681f3Smrg- zink: rework oom flushing 28807ec681f3Smrg- zink: move mem cache to sub-struct 28817ec681f3Smrg- zink: inline mem cache hash table 28827ec681f3Smrg- zink: split mem cache per type 28837ec681f3Smrg- zink: clamp descriptor allocation bucket sizing to defined limit 28847ec681f3Smrg- zink: add define for descriptor alloc clamping 28857ec681f3Smrg- zink: improve lazy descriptor pool handling 28867ec681f3Smrg- zink: fix cached descriptor allocation clamping 28877ec681f3Smrg- nir/validate: refactor validate_assert to have a return value 28887ec681f3Smrg- zink: use array size in spirv bo length calculations 28897ec681f3Smrg- zink: add screen function for checking usage completion 28907ec681f3Smrg- zink: force batch completion check on query result 28917ec681f3Smrg- zink: add some resource util functions for batch usage 28927ec681f3Smrg- zink: collapse a conditional in zink_batch_resource_usage_set() 28937ec681f3Smrg- zink: use resource batch usage helpers in invalidate_buffer() 28947ec681f3Smrg- zink: simplify some dumb code in invalidate_buffer 28957ec681f3Smrg- zink: use new resource batch usage utils for is_resource_busy 28967ec681f3Smrg- zink: replace some direct batch_usage calls with resource abstractions 28977ec681f3Smrg- zink: remove no longer used internal resource function 28987ec681f3Smrg- zink: more explicitly check shader stages during compile 28997ec681f3Smrg- zink: merge draw_count and compute_count, move to batch struct 29007ec681f3Smrg- zink: improve oom flushing 29017ec681f3Smrg- zink: EXT_vertex_input_dynamic_state 29027ec681f3Smrg- zink: change descriptor flushing to assert 29037ec681f3Smrg- zink: lower subgroup ballot instructions 29047ec681f3Smrg- zink: implement compiler handling for subgroup ballot builtins/intrinsics 29057ec681f3Smrg- zink: remove VK_EXT_shader_subgroup_ballot from device info 29067ec681f3Smrg- zink: export PIPE_CAP_TGSI_BALLOT 29077ec681f3Smrg- zink: add env var to disable timelines 29087ec681f3Smrg- ci: add another zink job with timelines disabled 29097ec681f3Smrg- zink: use dynamic line stipple 29107ec681f3Smrg- zink: use MAP_ONCE for qbo readback 29117ec681f3Smrg- zink: rework buffer mapping 29127ec681f3Smrg- mesa/st: break up st_GetTexSubImage 29137ec681f3Smrg- mesa/st: break up st_choose_matching_format() 29147ec681f3Smrg- mesa/st: enable calling st_choose_format() purely for translation 29157ec681f3Smrg- mesa/st: add format-finding capabilities to pbo get_dst_format() 29167ec681f3Smrg- st/texture: refactor get_src_format() to be more useful 29177ec681f3Smrg- zink: never use staging buffer for unsynchronized buffer maps 29187ec681f3Smrg- zink: force threadsafe mapping for query results when necessary 29197ec681f3Smrg- Revert "zink: simplify some dumb code in invalidate_buffer" 29207ec681f3Smrg- zink: simplify some dumb code in invalidate_buffer (v2) 29217ec681f3Smrg- lavapipe: rework queue to use u_queue 29227ec681f3Smrg- lavapipe: use consistent semaphore variable naming 29237ec681f3Smrg- lavapipe: implement timeline semaphores 29247ec681f3Smrg- features: mark off timelines for lavapipe 29257ec681f3Smrg- zink: add locking for zink_shader::programs 29267ec681f3Smrg- zink: sum available memory heaps instead of assigning 29277ec681f3Smrg- zink: simplify else clause for mem info gathering 29287ec681f3Smrg- nine: don't memset sampler state during conversion 29297ec681f3Smrg- nine: set CSO_NO_USER_VERTEX_BUFFERS for main cso context 29307ec681f3Smrg- nine: optimize texture binds a bit 29317ec681f3Smrg- nine: split enabled/dummy texture binds into separate iterators 29327ec681f3Smrg- nine: update bound sampler mask directly during texture updates 29337ec681f3Smrg- nine: track bound sampler count to optimize unbinds 29347ec681f3Smrg- nine: enable tc 29357ec681f3Smrg- nir: add imm_vec3 to round these out 29367ec681f3Smrg- nine: init take_index_buffer_ownership for draws 29377ec681f3Smrg- nine: init more draw info members 29387ec681f3Smrg- zink: add a suballocator 29397ec681f3Smrg- zink: repack zink_resource_object struct 29407ec681f3Smrg- zink: stop zeroing structs during resource allocation 29417ec681f3Smrg- zink: split transfer_unmap for images and buffers 29427ec681f3Smrg- zink: split mem unmap logic for images and buffers 29437ec681f3Smrg- zink: make map_count useful for dedicated image allocations 29447ec681f3Smrg- zink: remove PIPE_MAP_ONCE from subdata 29457ec681f3Smrg- zink: rejigger PIPE_MAP_ONCE for internal qbo reads 29467ec681f3Smrg- zink: flake out some tests for now 29477ec681f3Smrg- zink: collapse 'dedicated' allocation into zink_bo 29487ec681f3Smrg- zink: remove duplicated zink_resource_object::mem member 29497ec681f3Smrg- zink: split out zink_transfer allocation 29507ec681f3Smrg- zink: split buffer and image map functions 29517ec681f3Smrg- zink: remove unused variable from image map 29527ec681f3Smrg- zink: break out transfer map destroy 29537ec681f3Smrg- zink: handle map failures more effectively 29547ec681f3Smrg- zink: enable compat contexts 29557ec681f3Smrg- zink: ci updates 29567ec681f3Smrg- nir/lower_vectorize_tess_levels: set num_components for vectorized loads 29577ec681f3Smrg- softpipe: fix ci rule ordering to avoid unnecessarily running jobs 29587ec681f3Smrg- zink: simplify get_descriptor_set_lazy params 29597ec681f3Smrg- zink: remove redundant asserts from lazy descriptor set populate 29607ec681f3Smrg- zink: remove repeated lazy batch dd casts 29617ec681f3Smrg- zink: flag the gfx pipeline dirty and unset pipeline shader module on shader change 29627ec681f3Smrg- zink: do compute shader change on bind 29637ec681f3Smrg- zink: clear current gfx/compute program upon unbinding its shaders 29647ec681f3Smrg- zink: clear out all ubo rebinds first if they exist 29657ec681f3Smrg- zink: make descriptor update functions return the updated resource 29667ec681f3Smrg- zink: split out buffer rebinds to helper functions 29677ec681f3Smrg- zink: add bind counts for so bindings 29687ec681f3Smrg- zink: count streamout rebinds when doing buffer rebinds 29697ec681f3Smrg- zink: rebind all buffers on replacement 29707ec681f3Smrg- zink: only force all buffer rebinds if rebinds exist on other contexts 29717ec681f3Smrg- zink: defer deletion of no-attachment framebuffers 29727ec681f3Smrg- zink: stop referencing framebuffers 29737ec681f3Smrg- nine: replace unnecessary dynamic-sized array with bitfield 29747ec681f3Smrg- zink: move void format detection function to zink_format 29757ec681f3Smrg- zink: make component mapping function a static inline 29767ec681f3Smrg- zink: make void swizzle clamping util public 29777ec681f3Smrg- zink: add better TODO note for surface swizzles 29787ec681f3Smrg- zink: fix program init flag 29797ec681f3Smrg- zink: fix pipeline caching 29807ec681f3Smrg- zink: verify program key sizes before checking for default variant 29817ec681f3Smrg- zink: return early when getting resource modifer if no modifier is used 29827ec681f3Smrg- zink: inline program cache structs 29837ec681f3Smrg- zink: track mask of bound gfx shader stages 29847ec681f3Smrg- zink: split gfx shader cache based on stages present 29857ec681f3Smrg- zink: avoid hashing shader stages multiple times for new gfx programs 29867ec681f3Smrg- zink: create compute programs on bind 29877ec681f3Smrg- zink: simplify a bitmask init 29887ec681f3Smrg- zink: stop using dirty_shader_stages for shader binds 29897ec681f3Smrg- zink: add some null checks for shader variant key generation 29907ec681f3Smrg- zink: set inlinable_uniforms_mask first when binding a shader 29917ec681f3Smrg- zink: only remove programs from hash tables on shader deletion if needed 29927ec681f3Smrg- zink: implement PIPE_QUERY_GPU_FINISHED 29937ec681f3Smrg- zink: always init bordercolor value for sampler 29947ec681f3Smrg- zink: require occlusionQueryPrecise for occlusion queries 29957ec681f3Smrg- zink: assert precise queries are occlusion queries 29967ec681f3Smrg- zink: declare ctx var during blend state bind 29977ec681f3Smrg- zink: remove attachment count from pipeline hash 29987ec681f3Smrg- zink: pass current program's shader array, not ctx array 29997ec681f3Smrg- zink: remove extra unsetting of ctx->vertex_state_changed 30007ec681f3Smrg- zink: reorder gfx program/pipeline/descriptor binds if dynamic state is present 30017ec681f3Smrg- zink: init ctx->gfx_prim_mode to nonzero value to trigger pipeline changes 30027ec681f3Smrg- zink: use ctx gfx prim mode for draw comparisons 30037ec681f3Smrg- zink: remove query flush from memory barrier hook 30047ec681f3Smrg- zink: slim down streamout component of mem barrier hook 30057ec681f3Smrg- zink: batch mem barrier hooks 30067ec681f3Smrg- zink: use dynamic prim type 30077ec681f3Smrg- zink: consolidate pipeline hash tables 30087ec681f3Smrg- zink: no-op prim changes for pipeline recalc 30097ec681f3Smrg- zink: hook up VK_EXT_extended_dynamic_state2 30107ec681f3Smrg- zink: template for VK_EXT_extended_dynamic_state2 30117ec681f3Smrg- zink: bump dynamic pipeline state count 30127ec681f3Smrg- zink: set primitive restart with extended dynamic state2 30137ec681f3Smrg- zink: move dynamic state1 pipeline members into substruct 30147ec681f3Smrg- zink: move viewport count into dynamic state1 part of pipeline hash 30157ec681f3Smrg- zink: zero viewport and scissor count in pipeline with dynamic state1 30167ec681f3Smrg- zink: repack zink_rasterizer_hw_state 30177ec681f3Smrg- zink: add clip_halfz to rasterizer hw state 30187ec681f3Smrg- zink: steal a bit from rast_samples in pipeline state 30197ec681f3Smrg- zink: convert rasterizer pipeline components to bitfield 30207ec681f3Smrg- zink: repack zink_gfx_pipeline_state 30217ec681f3Smrg- zink: make zink_gfx_pipeline_state::vertices_per_patch a bitfield 30227ec681f3Smrg- zink: improve threadsafe qbo access 30237ec681f3Smrg- zink: move time query ending out to zink_end_query 30247ec681f3Smrg- zink: don't try to sync previous timestamp query qbo values 30257ec681f3Smrg- zink: more effectively utilize batch_usage for query destruction 30267ec681f3Smrg- zink: avoid pulling in unused push descriptors for cached ubo0 30277ec681f3Smrg- zink: remove extra program ref from cached descriptor updates 30287ec681f3Smrg- freedreno: export supported primtypes 30297ec681f3Smrg- freedreno: remove primconvert 30307ec681f3Smrg- freedreno: ci updates 30317ec681f3Smrg- zink: only update inlinable constants when they change 30327ec681f3Smrg- zink: determine whether the gpu has a resizable BAR at startup 30337ec681f3Smrg- zink: implement PIPE_RESOURCE_FLAG_DONT_MAP_DIRECTLY when resizable bar not present 30347ec681f3Smrg- radv: use pool stride when copying single query results 30357ec681f3Smrg- radv: ignore dynamic line stipple if line stipple isn't enabled 30367ec681f3Smrg- zink: free local shader nirs on program free 30377ec681f3Smrg- zink: use VK_WHOLE_SIZE for full-sized bufferviews 30387ec681f3Smrg- zink: explicitly end renderpass before running dispatch 30397ec681f3Smrg- zink: move alphaToOne warning to a dynamic warning 30407ec681f3Smrg- zink: add input attachment thingy for spirv builder 30417ec681f3Smrg- zink: emit fbfetch variables as ntv input attachments 30427ec681f3Smrg- zink: add a compiler pass to translate fbfetch -> input attachments 30437ec681f3Smrg- zink: refactor descriptor layout/template creation a little 30447ec681f3Smrg- zink: track fbfetch info on context, update as needed 30457ec681f3Smrg- zink: flag color attachment images as input attachments at creation 30467ec681f3Smrg- zink: add an input attachment to the gfx push set layout to handle fbfetch 30477ec681f3Smrg- zink: fix lazy descriptor deinit 30487ec681f3Smrg- zink: add an input attachment to the gfx push set layout to handle fbfetch 30497ec681f3Smrg- zink: update push descriptor set anytime fbfetch changes 30507ec681f3Smrg- zink: add a renderpass flag for input attachment layout handling 30517ec681f3Smrg- zink: enable fbfetch pipe cap 30527ec681f3Smrg- docs: mark off ES 3.2 for zink 30537ec681f3Smrg- zink: ci updates 30547ec681f3Smrg- zink: destroy shader modules on program free to avoid leaking 30557ec681f3Smrg- aux/cso: always restore states in atom order 30567ec681f3Smrg- gallium/cso: add unbind mask for cso restore 30577ec681f3Smrg- zink: directly pass resource pointer to descriptor state updates 30587ec681f3Smrg- zink: use tc rebind info for buffer replacements 30597ec681f3Smrg- zink: split out stalling from fence-waiting function 30607ec681f3Smrg- zink: remove refcounting from batch states 30617ec681f3Smrg- zink: ensure gfx shader module states are updated when doing a partial recalc 30627ec681f3Smrg- zink: create inner scanout object without scanout binds 30637ec681f3Smrg- zink: dynamic vertex input template 30647ec681f3Smrg- zink: don't use dynamic vertex stride with dynamic vertex input 30657ec681f3Smrg- zink: incrementally hash gfx shader stages 30667ec681f3Smrg- zink: incrementally hash module variants in pipeline 30677ec681f3Smrg- zink: incrementally hash vertex state into pipeline hash 30687ec681f3Smrg- zink: incrementally hash all pipeline component hashes 30697ec681f3Smrg- zink: inline gfx pipeline hash table 30707ec681f3Smrg- zink: track compatible render passes 30717ec681f3Smrg- zink: use compatible renderpass state in pipeline hash 30727ec681f3Smrg- zink: clamp lazy pools to 500 descriptors and allocate more slowly 30737ec681f3Smrg- zink: remove ZINK_HEAP_HOST_VISIBLE_ANY 30747ec681f3Smrg- mesa/st: create new surfaces before destroying old ones when updating attachments 30757ec681f3Smrg- radv: just use UINT64_MAX when getting absolute timeout for that value 30767ec681f3Smrg- radv: add some asserts for descriptor updating 30777ec681f3Smrg- lavapipe: support EXT_primitive_topology_list_restart 30787ec681f3Smrg- docs: update features for lavapipe 30797ec681f3Smrg- lavapipe: unbreak imageless framebuffer 30807ec681f3Smrg- zink: move get_framebuffer() to zink_framebuffer.c 30817ec681f3Smrg- zink: store some image creation metadata to object struct 30827ec681f3Smrg- zink: store some surface metadata to struct during creation 30837ec681f3Smrg- zink: use imageless framebuffers 30847ec681f3Smrg- lavapipe: unbreak push descriptor templates 30857ec681f3Smrg- zink: add a piglit ci job for lazy descriptors 30867ec681f3Smrg- tgsi_to_nir: force int type for LAYER output 30877ec681f3Smrg- zink: hash blend state pointers on creation 30887ec681f3Smrg- zink: remove tcs shader keys 30897ec681f3Smrg- zink: move sample part of fs key to renderpass 30907ec681f3Smrg- zink: add pipeline state flag for determining if output type is points 30917ec681f3Smrg- zink: move point sprite rasterizer bits to unhashed pipeline state 30927ec681f3Smrg- zink: move drawid_broken to unhashed pipeline state 30937ec681f3Smrg- zink: always emit sample id 0 for non-msaa texel pointers in ntv 30947ec681f3Smrg- zink: fix PIPE_CAP_DRAW_PARAMETERS export 30957ec681f3Smrg- zink: add 8bit alu handling 30967ec681f3Smrg- zink: hook up 8/16bit storage exts 30977ec681f3Smrg- zink: lower 32_2x16_split pack/unpack instructions 30987ec681f3Smrg- zink: implement nir_op_pack_half_2x16_split 30997ec681f3Smrg- zink: handle 8/16bit ssbo storage 31007ec681f3Smrg- zink: handle bo struct types that are just a runtime array 31017ec681f3Smrg- zink: fix PIPE_SHADER_CAP_FP16_DERIVATIVES handling 31027ec681f3Smrg- zink: clamp query results to 500 per qbo on 32bit 31037ec681f3Smrg- util/primconvert: force restart rewrites if original primtype wasn't supported 31047ec681f3Smrg- lavapipe: fix primitive restart with indexed indirect draws 31057ec681f3Smrg- zink: hook up VK_EXT_primitive_topology_list_restart 31067ec681f3Smrg- zink: use EXT_primitive_topology_list_restart where available 31077ec681f3Smrg- zink: use dispatch table for (almost) all vulkan calls 31087ec681f3Smrg- zink: fix some pipe caps for max instructions 31097ec681f3Smrg- mesa/st: use uint for instance_divisor instead of int 31107ec681f3Smrg- aux/trace: dump more pipe_vertex_element members 31117ec681f3Smrg- mesa: skip fallback draw call if no primitives are being drawn 31127ec681f3Smrg- aux/trace: use private refcounts for samplerviews 31137ec681f3Smrg- zink: reorganize cached descriptor updating a bit 31147ec681f3Smrg- zink: split out lazy set updating 31157ec681f3Smrg- zink: fall back to lazy descriptors if too many cache misses in a row 31167ec681f3Smrg- zink: add "nofallback" descriptor mode 31177ec681f3Smrg- zink: document ZINK_DESCRIPTORS env var 31187ec681f3Smrg- zink: ci updates 31197ec681f3Smrg- zink: move resource unrefs to flush thread 31207ec681f3Smrg- zink: remove batch params from renderpass functions 31217ec681f3Smrg- zink: remove batch params from resource copy functions 31227ec681f3Smrg- zink: remove unused barrier function 31237ec681f3Smrg- zink: remove batch params from barrier functions 31247ec681f3Smrg- zink: clamp instance divisors to max value 31257ec681f3Smrg- zink: add 8/16bit ubo handling 31267ec681f3Smrg- zink: export PIPE_SHADER_CAP_FP16_CONST_BUFFERS 31277ec681f3Smrg- zink: initialize zink_descriptor_layout_key::use_count on create 31287ec681f3Smrg- Revert "zink: ci updates" 31297ec681f3Smrg- zink: set vbo resource usage on bind 31307ec681f3Smrg- zink: add inline for checking whether a resource has any binds 31317ec681f3Smrg- zink: replace a couple checks for bind counts with new inline 31327ec681f3Smrg- zink: add some asserts for buffer replacement 31337ec681f3Smrg- zink: add a batch ref when replacing a buffer that has binds and usage 31347ec681f3Smrg- zink: move batch ref when possible during buffer replacement 31357ec681f3Smrg- zink: make a local screen var for buffer replace 31367ec681f3Smrg- zink: use better check for determining bufferview rebinds 31377ec681f3Smrg- zink: remove ZINK_RESOURCE_USAGE_STREAMOUT 31387ec681f3Smrg- zink: use bind_stages for pipeline barrier generation 31397ec681f3Smrg- zink: don't generate more pipeline stages if vertex bit is already set 31407ec681f3Smrg- zink: use more accurate generation for buffer barrier pipeline stages 31417ec681f3Smrg- zink: remove bind_stages and bind_history from zink_resource 31427ec681f3Smrg- zink: remove zink_get_resource_for_descriptor() 31437ec681f3Smrg- zink: use descriptor info for ubo hashing 31447ec681f3Smrg- zink: fix ZINK_MAX_DESCRIPTORS_PER_TYPE to stop exploding the stack 31457ec681f3Smrg- zink: add function for decomposing vertex format to single component 31467ec681f3Smrg- zink: decompose vertex attribs into single components when not supported 31477ec681f3Smrg- zink: use smallest int type possible for decompose shader key 31487ec681f3Smrg- zink: hook up dmabuf ext 31497ec681f3Smrg- zink: add dmabuf modifier query hooks for screen 31507ec681f3Smrg- zink: hook up VK_EXT_queue_family_foreign 31517ec681f3Smrg- zink: split import and export fd handle types 31527ec681f3Smrg- zink: set a flag for dmabuf init 31537ec681f3Smrg- zink: handle image creation for dmabufs 31547ec681f3Smrg- zink: fix import pNext attachment during image creation 31557ec681f3Smrg- zink: use foreign queue import for dmabufs 31567ec681f3Smrg- zink: add dmabuf fd handling 31577ec681f3Smrg- zink: fix dmabuf cap export 31587ec681f3Smrg- zink: unconditionally support conditional rendering 31597ec681f3Smrg- zink: fix some return values 31607ec681f3Smrg- zink: add return values for resource usage unsetting 31617ec681f3Smrg- zink: move barrier info to resource object struct 31627ec681f3Smrg- zink: unset barrier info if resource object no longer has usage after reset 31637ec681f3Smrg- zink: unset src access in barriers if there's no src pipeline stages 31647ec681f3Smrg- zink: assert surface geometry 31657ec681f3Smrg- zink: add a resource reference for bufferviews 31667ec681f3Smrg- zink: move surface and bufferview caches onto resources 31677ec681f3Smrg- zink: wrap framebuffer surfaces to preserve gallium expectations 31687ec681f3Smrg- zink: be smarter about fb surface rebinds 31697ec681f3Smrg- zink: force imageless fb rebind if rebinding an attachment 31707ec681f3Smrg- zink: update surface info when rebinding to storage 31717ec681f3Smrg- zink: add some debug asserts to validate imageless framebuffer correctness 31727ec681f3Smrg- compiler/spirv: add a fail if tex instr coord components aren't dimensional enough 31737ec681f3Smrg- zink: don't copy inner surface refcount 31747ec681f3Smrg- zink: stop setting nr_samples for null surfaces 31757ec681f3Smrg- zink: fix enabled vertex buffer mask calculation 31767ec681f3Smrg- zink: move pending prim type to gfx pipeline struct 31777ec681f3Smrg- zink: make tcs shader generation take screen param 31787ec681f3Smrg- zink: remove ctx references from shader compile path 31797ec681f3Smrg- zink: remove some ctx references from shader/pipeline compile 31807ec681f3Smrg- zink: only update gfx pipeline cache after creating a real pipeline 31817ec681f3Smrg- zink: simplify flagging last vertex stage for updating 31827ec681f3Smrg- zink: move xfb updates to just before draw 31837ec681f3Smrg- zink: move shader keys to be persistent on pipeline state 31847ec681f3Smrg- zink: move uniform size calc for shader keys into keybox 31857ec681f3Smrg- zink: store shader key to shader module 31867ec681f3Smrg- zink: stop using hash table for compute programs 31877ec681f3Smrg- zink: move shader cache to gfx program struct 31887ec681f3Smrg- zink: replace shader module hash table with a list 31897ec681f3Smrg- zink: remove default_variants storage in program struct 31907ec681f3Smrg- zink: split out inlined uniform shader variants into separate cache 31917ec681f3Smrg- zink: simplify shader variant update loop 31927ec681f3Smrg- zink: cap max shader variants with inlined uniforms 31937ec681f3Smrg- zink: store drm fd to screen 31947ec681f3Smrg- zink: unbreak dmabuf handling 31957ec681f3Smrg- zink: pre-filter multi-plane modifiers 31967ec681f3Smrg- zink: pass all modifiers through to image creation 31977ec681f3Smrg- zink: zero VkImageCreateInfo::queueFamilyIndexCount on creation 31987ec681f3Smrg- features: fix listing for GL_ARB_parallel_shader_compile 31997ec681f3Smrg- util/tc: rename tc_replace_buffer_storage_func::num_rebinds and document 32007ec681f3Smrg- zink: don't leak drm fd on drmPrimeFDToHandle failure 32017ec681f3Smrg- zink: disable miplevel tests in ci completely for now 32027ec681f3Smrg- zink: fix regex syntax from previous ci commit 32037ec681f3Smrg- build: fix nine compilation with only zink enabled as a gallium driver 32047ec681f3Smrg- zink: always use type size for query result copy stride 32057ec681f3Smrg- zink: fix ci skips 32067ec681f3Smrg- zink: don't use legacy scanout with modifiers 32077ec681f3Smrg- zink: clean up texture_barrier hook a little 32087ec681f3Smrg- zink: check for pending memory barrier before trying to flush it 32097ec681f3Smrg- zink: enable timeline ext features 32107ec681f3Smrg- zink: split vk debug logging into separate functions 32117ec681f3Smrg- zink: repack zink_render_pass_state 32127ec681f3Smrg- zink: add ZINK_HEAP_DEVICE_LOCAL_LAZY 32137ec681f3Smrg- zink: add ZINK_BIND_TRANSIENT 32147ec681f3Smrg- zink: improve handling of buffer rebinds using tc info 32157ec681f3Smrg- zink: reorder draw state updates 32167ec681f3Smrg- zink: remove fbfetch layout thingy from zs renderpass init 32177ec681f3Smrg- zink: move fb attachment init to new function 32187ec681f3Smrg- zink: stop setting nr_samples for shader image surface creation 32197ec681f3Smrg- zink: implement GL_EXT_multisampled_render_to_texture 32207ec681f3Smrg- docs: mark off GL_EXT_multisampled_render_to_texture for zink 32217ec681f3Smrg- zink: remove duplicated struct member set 32227ec681f3Smrg- zink: force lazy descriptor set rebinds if pipeline compatibility changes 32237ec681f3Smrg- zink: split out bvci creation from object creation 32247ec681f3Smrg- zink: don't add resource to pending barrier set if no barrier will be generated 32257ec681f3Smrg- zink: refactor some shader image code to make it reusable 32267ec681f3Smrg- zink: handle bindless images and samplers in ntv 32277ec681f3Smrg- zink: hook up VK_EXT_descriptor_indexing 32287ec681f3Smrg- zink: implement bindless textures 32297ec681f3Smrg- zink: export PIPE_CAP_BINDLESS_TEXTURE 32307ec681f3Smrg- features: mark off bindless texture for zink 32317ec681f3Smrg- lavapipe: add support for KHR_shader_float_controls 32327ec681f3Smrg- anv: assert that legacy_scanout isn't used with explicit modifiers 32337ec681f3Smrg- wsi/x11: fix uninit value by using zalloc for swapchain 32347ec681f3Smrg- zink: make a local resource var in fb_clears_apply_internal 32357ec681f3Smrg- zink: break out surface info init to helper function 32367ec681f3Smrg- anv: support EXT_primitive_topology_list_restart 32377ec681f3Smrg- zink: stop using VK_COMMAND_POOL_CREATE_RESET_COMMAND_BUFFER_BIT 32387ec681f3Smrg- zink: ensure fences are released before reusing them 32397ec681f3Smrg- zink: support 16bit rgbx formats 32407ec681f3Smrg- ci: updates 32417ec681f3Smrg- lavapipe: inherit from vk_image 32427ec681f3Smrg- lavapipe: EXT_4444_formats support 32437ec681f3Smrg- lavapipe: remove display extension support 32447ec681f3Smrg- build: unify vulkan cpp platform args 32457ec681f3Smrg- build: also remove wayland wsi flags from c++ build 32467ec681f3Smrg- features: be explicit about EXT_color_buffer_half_float support 32477ec681f3Smrg- zink: wait on thread queue before destroying context 32487ec681f3Smrg- zink: split out fb state updating to helper function 32497ec681f3Smrg- zink: wait in the flush thread when ETOOMANY batches are out 32507ec681f3Smrg- zink: move semaphore reset handling to submit 32517ec681f3Smrg- zink: remove zink_context::curr_batch 32527ec681f3Smrg- zink: stop leaking buffers on replacement 32537ec681f3Smrg- zink: switch remaining direct access of zink_resource_object::(reads|writes) to util 32547ec681f3Smrg- zink: remove reads/writes members from zink_resource_object 32557ec681f3Smrg- zink: stop leaking resource surface cache hash tables 32567ec681f3Smrg- zink: rework in-use batch states hash table to be a singly-linked list 32577ec681f3Smrg- zink: ci updates 32587ec681f3Smrg- zink: move glx@glx-multi-window-single-context to flakes 32597ec681f3Smrg- radv: don't use invalid stride for triggering vertex state change 32607ec681f3Smrg- radv: dynamically calculate misaligned_mask for dynamic vertex input 32617ec681f3Smrg- radv: pre-calc "simple" dynamic vertex input values 32627ec681f3Smrg- radv: add a mask of bound descriptor buffers for dynamic vertex input 32637ec681f3Smrg- radv: move alpha_adjust into conditional during vertex input updating 32647ec681f3Smrg- aux/pb: add a tolerance for reclaim failure 32657ec681f3Smrg- aux/pb: more correctly check number of reclaims 32667ec681f3Smrg- zink: use static array for detecting VK_TIME_DOMAIN_DEVICE_EXT 32677ec681f3Smrg- zink: add a read barrier for indirect dispatch 32687ec681f3Smrg- zink: fully zero surface creation struct 32697ec681f3Smrg- zink: rescue surfaces/bufferviews for cache hits during deletion 32707ec681f3Smrg- zink: clear descriptor refs on buffer replacement 32717ec681f3Smrg- zink: assert compute descriptor key is valid before hashing it 32727ec681f3Smrg- zink: don't update lazy descriptor states in hybrid mode 32737ec681f3Smrg- zink: move push descriptor updating into lazy-only codepath 32747ec681f3Smrg- zink: add an early return for zink_descriptors_update_lazy_masked() 32757ec681f3Smrg- zink: move last of lazy descriptor state updating back to lazy-only code 32767ec681f3Smrg- zink: detect prim type more accurately for tess/gs lines 32777ec681f3Smrg- zink: don't break early when applying fb clears 32787ec681f3Smrg- zink: only reset zink_resource::so_valid on buffer rebind 32797ec681f3Smrg- zink: don't check rebind count outside of buffer/image rebind function 32807ec681f3Smrg- zink: stop exporting PIPE_SHADER_CAP_FP16_DERIVATIVES 32817ec681f3Smrg- zink: don't add dynamic vertex pipeline states if no attribs are used 32827ec681f3Smrg- zink: fix gl_SampleMaskIn spirv generation 32837ec681f3Smrg- zink: more accurately update samplemask for fs shader keys 32847ec681f3Smrg- nir/lower_samplers_as_deref: rewrite more image intrinsics 32857ec681f3Smrg- zink: add better handling for CUBE_COMPATIBLE bit 32867ec681f3Smrg- zink: use align64 for allocation sizes 32877ec681f3Smrg- zink: set aspectMask for renderpass2 VkAttachmentReference2 structs 32887ec681f3Smrg- zink: always use explicit lod for texture() when legal in non-fragment stages 32897ec681f3Smrg- zink: be more permissive for injecting LOD into texture() instructions 32907ec681f3Smrg- zink: inject LOD for sampler version of OpImageQuerySize 32917ec681f3Smrg- zink: flag renderpass change when toggling fbfetch 32927ec681f3Smrg- zink: don't clamp cube array surfacess to cubes 32937ec681f3Smrg- zink: don't clamp 2D_ARRAY surfaces to 2D 32947ec681f3Smrg- zink: error when trying to allocate a bo larger than heap size 32957ec681f3Smrg- zink: clamp max buffer sizes to smallest buffer heap size 32967ec681f3Smrg- zink: explicitly enable VK_EXT_shader_subgroup_ballot 32977ec681f3Smrg- zink: add more int/float types to cast switching in ntv 32987ec681f3Smrg- zink: force float dest types on some alu results 32997ec681f3Smrg- zink: stop double printing validation messages 33007ec681f3Smrg- zink: add SpvCapabilityStorageImageMultisample for multisampled storage images 33017ec681f3Smrg- zink: reject all storage multisampling if the feature is unsupported 33027ec681f3Smrg- zink: add queue locking 33037ec681f3Smrg- build: add sha1_h to llvmpipe build 33047ec681f3Smrg- zink: set fbfetch state on lazy batch data when enabling it 33057ec681f3Smrg- zink: always use lazy (non-push) updating for fbfetch descriptors 33067ec681f3Smrg- zink: clamp PIPE_SHADER_CAP_MAX_INPUTS for xfb 33077ec681f3Smrg- aux/primconvert: handle singular incomplete restarts 33087ec681f3Smrg- zink: rework cached fbfetch descriptor fallback 33097ec681f3Smrg- aux/trace: fix vertex state tracing 33107ec681f3Smrg- zink: be more consistent about applying module hash for gfx pipeline 33117ec681f3Smrg- zink: update gfx pipeline shader module pointer even if the program is unchanged 33127ec681f3Smrg- zink: always add VK_IMAGE_CREATE_2D_ARRAY_COMPATIBLE_BIT for 3D images 33137ec681f3Smrg 33147ec681f3SmrgMykhailo Skorokhodov (3): 33157ec681f3Smrg 33167ec681f3Smrg- iris: Fix compute shader leak 33177ec681f3Smrg- iris: Add missed tile flush flag 33187ec681f3Smrg- Revert "iris: add tile cache flush to iris_copy_region" 33197ec681f3Smrg 33207ec681f3SmrgNanley Chery (41): 33217ec681f3Smrg 33227ec681f3Smrg- anv: Add genX(cmd_buffer_emit_gfx12_depth_wa) 33237ec681f3Smrg- iris: Add genX(emit_depth_state_workarounds) 33247ec681f3Smrg- iris: Update the clear value in cso_z->packets 33257ec681f3Smrg- iris: Emit clear_params as part of cso_z->packets 33267ec681f3Smrg- iris: Update clear_params only when HiZ is enabled 33277ec681f3Smrg- intel: Move the D16 workarounds out of ISL 33287ec681f3Smrg- iris: Use constants for emitting cso_z->packets 33297ec681f3Smrg- iris: Optimize genX(emit_depth_state_workarounds) 33307ec681f3Smrg- anv: Optimize genX(cmd_buffer_emit_gfx12_depth_wa) 33317ec681f3Smrg- intel: Use env_var_as_boolean for INTEL_NO_HW 33327ec681f3Smrg- intel: Parse INTEL_NO_HW for devinfo construction 33337ec681f3Smrg- intel/isl: Add msaa_layout param to isl_tiling_get_info 33347ec681f3Smrg- intel/isl: Define ISL_TILING_4/64 for XeHP 33357ec681f3Smrg- intel/isl: Update image alignments on XeHP 33367ec681f3Smrg- intel/isl: Size Tile64 surfaces with 4 dimensions 33377ec681f3Smrg- intel/isl: Drop extra assert on array_pitch_el_rows 33387ec681f3Smrg- intel/isl: Drop ISL_SURF_USAGE_DISPLAY_*_BIT 33397ec681f3Smrg- intel/isl: Use an allow-list in gfx6_filter_tiling 33407ec681f3Smrg- intel/isl: Update tiling filter functions for XeHP 33417ec681f3Smrg- intel: Support Tile4/64 in depth/stencil state 33427ec681f3Smrg- intel: Support Tile4/64 in surface states 33437ec681f3Smrg- intel/blorp: Fix faked RGB image alignment on XeHP 33447ec681f3Smrg- intel/blorp: Fix Gfx7 stencil surface state valign 33457ec681f3Smrg- intel/isl: Fix halign/valign of uncompressed views 33467ec681f3Smrg- intel/isl: Use a switch for HALIGN/VALIGN encoding 33477ec681f3Smrg- intel: Update surface states for XeHP alignments 33487ec681f3Smrg- intel: Add underscores to HALIGN and VALIGN enums 33497ec681f3Smrg- intel/isl: Disable I915_FORMAT_MOD_Y_TILED on XeHP+ 33507ec681f3Smrg- iris: Disable tiled memcpy for Tile4 33517ec681f3Smrg- anv/image: Don't assert that HiZ can be added 33527ec681f3Smrg- iris: Delete iris_resource_get_clear_color 33537ec681f3Smrg- iris: Support NULL aux BOs in fill_surface_state 33547ec681f3Smrg- iris: Split clear color and aux BO checks 33557ec681f3Smrg- iris: Simplify an iris_use_pinned_bo call 33567ec681f3Smrg- iris: Allow NULL aux BOs in aux-state functions 33577ec681f3Smrg- iris: Don't add a clear color BO for MC_CCS 33587ec681f3Smrg- iris: Add and use get_num_planes 33597ec681f3Smrg- iris: Finish aux import in iris_resource_from_handle 33607ec681f3Smrg- anv: Allow HIZ_CCS_WT with subpass self-dependencies 33617ec681f3Smrg- anv: Tile cache flush for depth before fast clear 33627ec681f3Smrg- iris: Tile cache flush for depth before fast clear 33637ec681f3Smrg 33647ec681f3SmrgNeha Bhende (4): 33657ec681f3Smrg 33667ec681f3Smrg- aux/draw: use nir_to_tgsi for draw shader in llvm path 33677ec681f3Smrg- svga/drm: use pb_usage_flags instead of pipe_map_flags in vmw_svga_winsys_buffer_map 33687ec681f3Smrg- auxiliary/indices: convert primitive type PIPE_PRIM_PATCHES 33697ec681f3Smrg- st: Fix 64-bit vertex attrib index for TGSI path 33707ec681f3Smrg 33717ec681f3SmrgNeil Roberts (1): 33727ec681f3Smrg 33737ec681f3Smrg- v3d: Update prim_counts when prims generated query in flight without TF 33747ec681f3Smrg 33757ec681f3SmrgOlivier Fourdan (1): 33767ec681f3Smrg 33777ec681f3Smrg- radeonsi: Check aux_context on si_destroy_screen() 33787ec681f3Smrg 33797ec681f3SmrgPaulo Zanoni (10): 33807ec681f3Smrg 33817ec681f3Smrg- iris: mark the workaround_bo as asynchronous 33827ec681f3Smrg- iris: don't bump the seqno for the workaround_bo 33837ec681f3Smrg- iris: assign bo->index to the aux map BOs too 33847ec681f3Smrg- iris: extract the code that adds BOs to the batch lists 33857ec681f3Smrg- iris: add the workaround_bo directly to the batch 33867ec681f3Smrg- iris: use add_bo_to_batch() when adding batch->bo 33877ec681f3Smrg- iris: syncobjs are now owned by bufmgr instead of screen 33887ec681f3Smrg- iris: give each screen of a bufmgr a unique ID 33897ec681f3Smrg- iris: switch to explicit busy tracking 33907ec681f3Smrg- iris: signal the syncobj after a failed batch 33917ec681f3Smrg 33927ec681f3SmrgPavel Asyutchenko (3): 33937ec681f3Smrg 33947ec681f3Smrg- vulkan/overlay: Fix violation of VUID-VkMappedMemoryRange-size-01389 33957ec681f3Smrg- llvmpipe: fix crash when doing FB fetch + gl_FragDepth write in one shader 33967ec681f3Smrg- lavapipe: Fix vkWaitForFences for initially-signalled fences 33977ec681f3Smrg 33987ec681f3SmrgPhilipp Zabel (3): 33997ec681f3Smrg 34007ec681f3Smrg- etnaviv: fix gbm_bo_get_handle_for_plane for multiplanar images 34017ec681f3Smrg- etnaviv: fix dirty bit check for baselod emission 34027ec681f3Smrg- etnaviv: add mov for direct depth store output from load input 34037ec681f3Smrg 34047ec681f3SmrgPierre Moreau (5): 34057ec681f3Smrg 34067ec681f3Smrg- clover: Do not advertise OpenCL x.y when unsupported 34077ec681f3Smrg- clover/spirv: Increase max amount of function args 34087ec681f3Smrg- clover/spirv: Properly size 3-component vector args 34097ec681f3Smrg- clover/api: Interleave details in dispatch table 34107ec681f3Smrg- clover/nir: Set constant buffer pointer size to host 34117ec681f3Smrg 34127ec681f3SmrgPierre-Eric Pelloux-Prayer (78): 34137ec681f3Smrg 34147ec681f3Smrg- mesa: fix bindless uniform samplers update 34157ec681f3Smrg- dlist: don't handle unmerged draws as merged 34167ec681f3Smrg- mesa: move gl_program::is_arb_asm to shader_info 34177ec681f3Smrg- radeonsi: preserve derivatives after discards for ARB shaders 34187ec681f3Smrg- gallium/va: don't use key=NULL in hash tables 34197ec681f3Smrg- amd/registers: fix fields conflict detection 34207ec681f3Smrg- dlist: upload vertices in compile_vertex_list 34217ec681f3Smrg- dlist: implement vertices deduplication 34227ec681f3Smrg- radeonsi: add a script to run piglit/glcts/deqp tests 34237ec681f3Smrg- radeonsi: add expected tests results for Navi10 GPU 34247ec681f3Smrg- st/pbo: only use x coord when reading a PIPE_TEXTURE_1D 34257ec681f3Smrg- st/pbo: set nir_tex_instr::is_array field 34267ec681f3Smrg- st/pbo: add a fast pbo download code-path 34277ec681f3Smrg- radeonsi: fix test script's output 34287ec681f3Smrg- radeonsi: add -t option to the test script 34297ec681f3Smrg- radeonsi: don't create an infinite number of variants 34307ec681f3Smrg- nir: add a pass to optimize "gl_FragDepth = gl_FragCoord.z" away 34317ec681f3Smrg- radeonsi/test: fix test script args handling 34327ec681f3Smrg- radeonsi/test: format radeonsi-run-test.py with black 34337ec681f3Smrg- radeonsi/test: allow to pass a filename as a test filter value 34347ec681f3Smrg- radeonsi/test: prettier output 34357ec681f3Smrg- radeonsi/test: add Sienna Cichlid expected results 34367ec681f3Smrg- vbo/dlist: simplify add_vertex function 34377ec681f3Smrg- vbo/dlist: apply start_offset after indices construction 34387ec681f3Smrg- vbo/dlist: move VAO update at the end 34397ec681f3Smrg- vbo/dlist: use buffer_in_ram_size 34407ec681f3Smrg- vbo/dlist: use a single buffer object 34417ec681f3Smrg- vbo/dlist: remove vbo_save_vertex_store::bufferobj 34427ec681f3Smrg- vbo/dlist: don't store prim_store 34437ec681f3Smrg- vbo/dlist: use prim_store directly 34447ec681f3Smrg- vbo/dlist: realloc prims array instead of free/malloc 34457ec681f3Smrg- vbo/dlist: don't force list compilation if out of prim space 34467ec681f3Smrg- vbo/dlist: remove vbo_save_context::buffer_ptr 34477ec681f3Smrg- vbo/dlist: reset vertex_store::used in reset_counters 34487ec681f3Smrg- vbo/dlist: remove vbo_save_context::buffer_map 34497ec681f3Smrg- vbo/dlist: realloc vertex stores 34507ec681f3Smrg- vbo/dlist: remove vbo_save_context::max_vert 34517ec681f3Smrg- vbo/dlist: limit allocation sizes 34527ec681f3Smrg- vbo/dlist: don't force list compilation if out of vertex space 34537ec681f3Smrg- vbo/dlist: rework out of memory 34547ec681f3Smrg- vbo/dlist: fix max_index_count value 34557ec681f3Smrg- vbo/dlist: remove vbo_save_copied_vtx 34567ec681f3Smrg- vbo/dlist: remove vbo_save_context::vert_count 34577ec681f3Smrg- vbo/dlist: add documentation 34587ec681f3Smrg- vbo/dlist: remove unused functions 34597ec681f3Smrg- vbo/dlist: rework buffer sizes 34607ec681f3Smrg- vbo/dlist: rework primitive store handling 34617ec681f3Smrg- vbo/dlist: rework vertex_store management 34627ec681f3Smrg- vbo/dlist: fix indentation in vbo_save_api.c 34637ec681f3Smrg- vbo/dlist: reallocate the vertex buffer on vertex upgrade 34647ec681f3Smrg- Revert "ci/v3d: add piglit flake" 34657ec681f3Smrg- radeonsi/test: fix typo in the test script 34667ec681f3Smrg- radeonsi/test: update expected results 34677ec681f3Smrg- radeonsi/sqtt: export wave size and scratch size 34687ec681f3Smrg- radeonsi/sqtt: add si_se_is_disabled 34697ec681f3Smrg- radeonsi/test: don't require a folder name 34707ec681f3Smrg- radeonsi/test: use -t for deqp tests 34717ec681f3Smrg- radeonsi/test: print default values in help 34727ec681f3Smrg- radeonsi/test: allow to specify a baseline folder 34737ec681f3Smrg- radeonsi/test: sanitize output_folder 34747ec681f3Smrg- radeonsi/test: add --gpu to select the GPU to test 34757ec681f3Smrg- radeonsi/test: add Raven expected results 34767ec681f3Smrg- radeonsi/test: add sanity checks 34777ec681f3Smrg- gallium: add PIPE_CAP_PREFER_BACK_BUFFER_REUSE 34787ec681f3Smrg- loader/dri3: avoid reusing the same back buffer with DRI_PRIME 34797ec681f3Smrg- radeonsi: disable PIPE_CAP_PREFER_BACK_BUFFER_REUSE 34807ec681f3Smrg- radeonsi: don't clear G_028644_OFFSET 34817ec681f3Smrg- radeonsi: implement si_sdma_copy_image for gfx7+ 34827ec681f3Smrg- radeonsi: add an async compute context 34837ec681f3Smrg- gallium: add a is_dri_blit_image bool to pipe_blit_info 34847ec681f3Smrg- radeonsi: make the DRI_PRIME dGPU -> iGPU copy async 34857ec681f3Smrg- radeonsi: use viewport offset in quant_mode determination 34867ec681f3Smrg- radeonsi: treat nir_intrinsic_load_constant as a VMEM operation 34877ec681f3Smrg- radeonsi/sdma: fix bogus assert 34887ec681f3Smrg- ac/surface: don't validate DCC settings if DCC isn't possible 34897ec681f3Smrg- vbo/dlist: free copied.buffer if no vertices were copied 34907ec681f3Smrg- mesa: always call _mesa_update_pixel 34917ec681f3Smrg- radeonsi/sqtt: fix shader stage values 34927ec681f3Smrg 34937ec681f3SmrgQiang Yu (20): 34947ec681f3Smrg 34957ec681f3Smrg- nir/inline_uniforms: add uniforms in condition atomically 34967ec681f3Smrg- nir/inline_uniforms: support vector uniform 34977ec681f3Smrg- nir/loop_analyze: move nir_is_supported_terminator_condition() to header 34987ec681f3Smrg- nir/loop_analyze: record induction variables for each loop 34997ec681f3Smrg- nir/loop_analyze: skip unsupported induction variable early 35007ec681f3Smrg- nir/inline_uniforms: support loop 35017ec681f3Smrg- egl/dri2: seperate EGLImage validate and lookup 35027ec681f3Smrg- gbm/dri: implement image lookup extension version 2 35037ec681f3Smrg- gallium/dri: add dri_screen egl image validate hooks 35047ec681f3Smrg- gallium/api: add validate_egl_image interface 35057ec681f3Smrg- mesa: add ValidateEGLImage driver callback 35067ec681f3Smrg- mesa: fix glthread deadlock when EGL multi thread shared context 35077ec681f3Smrg- nir/lower_io_to_vector: check centroid & sample when merge variable 35087ec681f3Smrg- nir/linker: pack varyings with different interpolation qualifier 35097ec681f3Smrg- radeonsi: enable nir option pack_varying_options 35107ec681f3Smrg- radeonsi: fix ps SI_PARAM_LINE_STIPPLE_TEX arg 35117ec681f3Smrg- loader/dri3: fix swap out of order when changing swap interval 35127ec681f3Smrg- mesa/st: delay nir spirv link 35137ec681f3Smrg- nir/linker: support uniform when optimizing varying 35147ec681f3Smrg- nir/linker: rename replace_constant_input to replace_varying_input_by_constant_load 35157ec681f3Smrg 35167ec681f3SmrgQuantum (1): 35177ec681f3Smrg 35187ec681f3Smrg- main: allow all external textures for BindImageTexture 35197ec681f3Smrg 35207ec681f3SmrgRhys Perry (108): 35217ec681f3Smrg 35227ec681f3Smrg- aco: don't create v_madmk_f32/v_madak_f32 from v_fma_legacy_f16 35237ec681f3Smrg- ac/llvm: implement v2f16 fsat 35247ec681f3Smrg- radv: set image_dim and image_array intrinsic indices 35257ec681f3Smrg- aco: use image_dim and image_array intrinsic indices 35267ec681f3Smrg- aco: calculate correct register demand for branch instructions 35277ec681f3Smrg- nir/algebraic: fix imod by negative power-of-two 35287ec681f3Smrg- nir/algebraic: don't optimize umod/imod/irem if lower_bitops=true 35297ec681f3Smrg- nir/algebraic: add optimizations for imul(a, INT_MIN) 35307ec681f3Smrg- nir/search: don't consider INT_MIN a negative power-of-two 35317ec681f3Smrg- nir/algebraic: improve irem by power-of-two optimization 35327ec681f3Smrg- nir/idiv_const: improve idiv(n, INT_MIN) 35337ec681f3Smrg- nir/idiv_const: optimize imod/irem 35347ec681f3Smrg- nir: fix signed overflow for iadd constant folding 35357ec681f3Smrg- nir/tests: add tests for umod/imod/irem optimizations 35367ec681f3Smrg- radv: enable DCC with signedness reinterpretation 35377ec681f3Smrg- nir: remove src/compiler/nir/nir_control_flow 35387ec681f3Smrg- nir: swap fadd operands in nir_atan() 35397ec681f3Smrg- spirv: swap fadd operands in build_asin() and matrix_multiply() 35407ec681f3Smrg- nir/algebraic: add various ffma optimizations 35417ec681f3Smrg- nir/algebraic: reassociate add chains for more MAD/FMA-friendly code 35427ec681f3Smrg- nir/algebraic: add is_used_once to dot product reassociation optimization 35437ec681f3Smrg- nir: add ffma creation helpers 35447ec681f3Smrg- nir: create ffma from builders more often 35457ec681f3Smrg- nir: lower fdot to ffma if lower_ffma=false 35467ec681f3Smrg- spirv: create ffma more often 35477ec681f3Smrg- nir,glsl_to_nir: use nir_fdot() 35487ec681f3Smrg- ci: update trace hashes 35497ec681f3Smrg- aco: fix validation of DPP v_cndmask_b32/v_addc_co_u32 35507ec681f3Smrg- aco: add can_use_DPP() and convert_to_DPP() 35517ec681f3Smrg- aco: move a bunch of helpers into aco_ir.h/aco_ir.cpp 35527ec681f3Smrg- aco: make optimize_postRA() work across blocks 35537ec681f3Smrg- aco: handle DPP in the optimizer 35547ec681f3Smrg- aco: combine DPP into VALU before RA 35557ec681f3Smrg- aco: combine DPP into VALU after RA 35567ec681f3Smrg- aco/tests: add tests for pre-RA DPP combining 35577ec681f3Smrg- aco/tests: add tests for post-RA DPP combining 35587ec681f3Smrg- aco: fix vectorized 16-bit load_input/load_interpolated_input 35597ec681f3Smrg- aco: remove label_extract if the extract is used by a non-VALU 35607ec681f3Smrg- aco/scheduler: allow moving down VMEM stores to below VMEM loads 35617ec681f3Smrg- nir/lower_io: use nir_vector_insert_imm() 35627ec681f3Smrg- radv: use nir_vector_insert_imm in lower_intrinsics 35637ec681f3Smrg- nir: consider push constant loads as always dynamically uniform 35647ec681f3Smrg- nir/gcm: pin some instructions which require uniform sources 35657ec681f3Smrg- aco: include utility in isel 35667ec681f3Smrg- aco: don't constant propagate to DPP instructions 35677ec681f3Smrg- aco/tests: test copy propagation with DPP instructions 35687ec681f3Smrg- aco: remove DPP when applying constants/literals/sgprs 35697ec681f3Smrg- aco: don't coalesce constant copies into non-power-of-two sizes 35707ec681f3Smrg- aco/spill: add temporary operands of exec phis to next_use_distances_end 35717ec681f3Smrg- nir: separate lower_add_sat 35727ec681f3Smrg- nir: add sdot_2x16 and udot_2x16 opcodes 35737ec681f3Smrg- spirv: use sdot_2x16 and udot_2x16 opcodes 35747ec681f3Smrg- ac/gpu_info: add has_accelerated_dot_product 35757ec681f3Smrg- ac/llvm: implement nir_op_pack_32_4x8 35767ec681f3Smrg- ac/llvm,radv: implement uadd_sat/iadd_sat 35777ec681f3Smrg- ac/llvm: implement udot_4x8/sdot_4x8/udot_2x16/sdot_2x16 opcodes 35787ec681f3Smrg- radv: refactor handling of nir_options 35797ec681f3Smrg- radv,aco: implement iadd_sat 35807ec681f3Smrg- aco: implement nir_op_pack_32_4x8 35817ec681f3Smrg- aco: implement udot_4x8/sdot_4x8/udot_2x16/sdot_2x16 opcodes 35827ec681f3Smrg- aco/ra: allow v1b operands with 16-bit instructions 35837ec681f3Smrg- radv: expose VK_KHR_shader_integer_dot_product 35847ec681f3Smrg- aco/ra: don't use ds_write_b8_d16_hi/ds_write_b16_d16_hi on GFX8 35857ec681f3Smrg- nir: fix serialization of loop/if control 35867ec681f3Smrg- radv: fix pipeline caching with robust buffer access 35877ec681f3Smrg- aco: add RegClass::is_linear_vgpr helper 35887ec681f3Smrg- aco: add and use RegClass::resize helper 35897ec681f3Smrg- aco: rewrite print_reg_class() 35907ec681f3Smrg- aco: find a scratch register for sub-dword copies on GFX7 if scc is empty 35917ec681f3Smrg- aco: find scratch reg for sub-dword psuedo instructions which read sgprs 35927ec681f3Smrg- aco/tests: fix finish_ra_test() 35937ec681f3Smrg- aco/tests: add regalloc.scratch_sgpr.create_vector 35947ec681f3Smrg- aco: implement linear vgpr copies 35957ec681f3Smrg- aco: allow live-range splits of linear vgprs in top-level blocks 35967ec681f3Smrg- aco/nops: use up-to-date mask_size 35977ec681f3Smrg- aco/nops: create handle_raw_hazard_instr helper 35987ec681f3Smrg- aco/nops: add State 35997ec681f3Smrg- aco/nops: fix handle_raw_hazard_internal when visiting the current block 36007ec681f3Smrg- nir/algebraic: distribute fmul(fadd(a, b), c) when b and c are constants 36017ec681f3Smrg- aco/tests: add idep_amdgfxregs_h 36027ec681f3Smrg- nir: add nir_src_components_read() 36037ec681f3Smrg- nir/opt_if: add opt_if_rewrite_uniform_uses 36047ec681f3Smrg- radv: don't require a GS copy shader to use the cache with NGG VS+GS 36057ec681f3Smrg- radv: workaround incorrect image format with World War Z 36067ec681f3Smrg- radv: move ngg culling determination earlier 36077ec681f3Smrg- nir: add _amd suffix to fragment_mask_fetch and fragment_fetch texops 36087ec681f3Smrg- nir/lower_tex: add lower_to_fragment_fetch_amd 36097ec681f3Smrg- radv: don't create blit pipelines for multisampled 3D images 36107ec681f3Smrg- aco: return 0x76543210 for NULL FMASK fetch 36117ec681f3Smrg- ac/nir: return 0x76543210 for NULL FMASK fetch 36127ec681f3Smrg- aco: use correct dim for FMASK fetches 36137ec681f3Smrg- radv,aco: use lower_to_fragment_fetch 36147ec681f3Smrg- radv,aco: don't include FMASK in the storage descriptor 36157ec681f3Smrg- ac/llvm: fix image_samples with null descriptors 36167ec681f3Smrg- radv/llvm: fix parameter index for layer exports 36177ec681f3Smrg- aco: fix vadd32() when b is neither a constant nor temporary 36187ec681f3Smrg- radv: add and use radv_vs_input_alpha_adjust 36197ec681f3Smrg- radv: add radv_translate_vertex_format() 36207ec681f3Smrg- radv: add radv_shader_variant_get_va and radv_find_shader_variant helpers 36217ec681f3Smrg- radv: add segregated fit shader memory allocator 36227ec681f3Smrg- radv: move VS specific input SGPRs first 36237ec681f3Smrg- radv: implement dynamic vertex input state using vertex shader prologs 36247ec681f3Smrg- radv: add pre-compiled vertex shader prologs for common states 36257ec681f3Smrg- aco: implement aco_compile_vs_prolog 36267ec681f3Smrg- aco: implement VS input loads with prologs 36277ec681f3Smrg- radv: implement VK_EXT_vertex_input_dynamic_state 36287ec681f3Smrg- radv: enable VK_EXT_vertex_input_dynamic_state 36297ec681f3Smrg- aco: consider pseudo-instructions reading exec in needs_exec_mask() 36307ec681f3Smrg 36317ec681f3SmrgRob Clark (81): 36327ec681f3Smrg 36337ec681f3Smrg- freedreno/registers: update dsi registers to support tpg 36347ec681f3Smrg- freedreno/a6xx: Add missing PC_CCU_INVALIDATE_x 36357ec681f3Smrg- driconfig: Add support for device specific config 36367ec681f3Smrg- driconf: Add force_gl_renderer override 36377ec681f3Smrg- freedreno: Support per-device driconf overrides 36387ec681f3Smrg- freedreno: Unleash the dragon! 36397ec681f3Smrg- freedreno: Move generated device table to .h 36407ec681f3Smrg- freedreno: Drop device_id 36417ec681f3Smrg- freedreno: Reduce use of screen->gpu_id 36427ec681f3Smrg- freedreno/ir3: Reduce use of compiler->gpu_id 36437ec681f3Smrg- freedreno/ir3/lower_io_offsets: Drop gpu_id param 36447ec681f3Smrg- freedreno/all: Introduce fd_dev_id 36457ec681f3Smrg- freedreno: Make chip_id 64b 36467ec681f3Smrg- freedreno: Device matching based on chip_id 36477ec681f3Smrg- freedreno: Use correct key for binning pass shader 36487ec681f3Smrg- freedreno: Add a680 support 36497ec681f3Smrg- freedreno/cffdec: Fix indentation 36507ec681f3Smrg- freedreno/cffdec: Fix gpuaddr comparision 36517ec681f3Smrg- freedreno/crashdec: Decode full RB in verbose mode 36527ec681f3Smrg- freedreno/crashdec: Quiet spammy print in query mode 36537ec681f3Smrg- freedreno/common: Fix comment typo 36547ec681f3Smrg- freedreno/a6xx: Set type for PC_HS_INPUT_SIZE 36557ec681f3Smrg- freedreno/a6xx: Register updates for a6xx gen3 36567ec681f3Smrg- freedreno/a6xx: Rast updates for a6xx gen3 36577ec681f3Smrg- freedreno/a6xx: Fix streamout with tess_use_shared 36587ec681f3Smrg- freedreno/a6xx: Updates for tess_use_shared 36597ec681f3Smrg- freedreno/a6xx: Register updates for a6xx gen4 36607ec681f3Smrg- freedreno/a6xx: Fix a6xx gen4 compute shaders 36617ec681f3Smrg- freedreno/ci: Add a status variable for CI farm 36627ec681f3Smrg- freedreno/ci: Take fd farm offline for moving day 36637ec681f3Smrg- freedreno/ci: Bring fd farm back online after move 36647ec681f3Smrg- clover: Don't remove sampler/image uniforms 36657ec681f3Smrg- nir/lower_amul: Handle load/store_global 36667ec681f3Smrg- nir/lower_amul: Fix usage of nir_foreach_src() 36677ec681f3Smrg- freedreno/ir3: Update physical_successors after retargetting jumps 36687ec681f3Smrg- freedreno/ir3: Fix physical successors for break out of loop 36697ec681f3Smrg- freedreno/ir3: Fix double printing of branch suffix 36707ec681f3Smrg- freedreno/ir3: Validate physical successors 36717ec681f3Smrg- freedreno/ir3: Improve error msg for block level validation 36727ec681f3Smrg- freedreno/ir3: Update physical_predecessors for streamout block 36737ec681f3Smrg- freedreno: Remove unused function 36747ec681f3Smrg- freedreno: Cleanup primtypes/primtypes_mask 36757ec681f3Smrg- freedreno: Move a6xx specific screen init 36767ec681f3Smrg- freedreno/drm: Garbage collect unused bo_cache 36777ec681f3Smrg- freedreno/drm: Rename bo->flags to bo->reloc_flags 36787ec681f3Smrg- freedreno/drm: Consider allocation flags in bo-cache 36797ec681f3Smrg- freedreno/drm: Don't return shared/control bo's to cache 36807ec681f3Smrg- freedreno/drm: Add cached-coherent bo support 36817ec681f3Smrg- freedreno/drm: Use cached-coherent cmdstream buffers 36827ec681f3Smrg- freedreno/drm: Use cached-coherent for control bo 36837ec681f3Smrg- freedreno: Used cached coherent for staging resources 36847ec681f3Smrg- freedreno: Add perf warning for WC readback 36857ec681f3Smrg- freedreno/a6xx: Pre-bake SO-disable stateobj 36867ec681f3Smrg- freedreno/ir3: Fix sched debug msgs 36877ec681f3Smrg- freedreno/ir3: Cleanup liveness lifetime 36887ec681f3Smrg- freedreno/ir3: Fix generation check 36897ec681f3Smrg- freedreno/computerator/a4xx: Fix enum mismatch warning 36907ec681f3Smrg- freedreno: Add info->a6xx.has_shading_rate 36917ec681f3Smrg- turnip: Fix unitialized cs->device 36927ec681f3Smrg- turnip: Rast updates for a6xx gen4 36937ec681f3Smrg- turnip: Fix a6xx gen4 compute shaders 36947ec681f3Smrg- isaspec: Remove unused leftovers 36957ec681f3Smrg- isaspec: Fix comment 36967ec681f3Smrg- isaspec: Split encode_bitset() into it's own template 36977ec681f3Smrg- isaspec: De-duplicate bitset encoding 36987ec681f3Smrg- freedreno: Get shader variant msgs in perf debug output 36997ec681f3Smrg- freedreno: Optimize no-op submits 37007ec681f3Smrg- freedreno: Fix some indentation 37017ec681f3Smrg- freedreno/ir3: Remove used unused 37027ec681f3Smrg- freedreno: Handle cso==NULL in bind_sampler_states 37037ec681f3Smrg- freedreno: Handle PIPE_FORMAT_NONE buffers 37047ec681f3Smrg- gallium/u_threaded: Get reset status without sync 37057ec681f3Smrg- freedreno: Disable TC syncs for get_device_reset_status() 37067ec681f3Smrg- zink: Disable TC syncs for get_device_reset_status() 37077ec681f3Smrg- Revert "freedreno: Fix autotune regression since batch-cache rework." 37087ec681f3Smrg- Revert "freedreno: Remove dead fd_batch_reset()." 37097ec681f3Smrg- Revert "freedreno: Use a BO bitset for faster checks for resource referenced." 37107ec681f3Smrg- Revert "freedreno: Remove the submit lock locking." 37117ec681f3Smrg- Revert "freedreno: Move the batch cache to the context." 37127ec681f3Smrg- gallium/u_threaded: Split out options struct 37137ec681f3Smrg- freedreno/drm: Move pipe unref after fence removal 37147ec681f3Smrg 37157ec681f3SmrgRohan Garg (7): 37167ec681f3Smrg 37177ec681f3Smrg- virgl: Add more meta data to cached resources 37187ec681f3Smrg- Revert "Revert "virgl: Cache depth and stencil buffers"" 37197ec681f3Smrg- virgl: Enable caching for sampler views and render targets 37207ec681f3Smrg- i965: Take into account the offset when marking a valid data region 37217ec681f3Smrg- i965: Write a custom allocator for the intel memobj struct 37227ec681f3Smrg- ci: Fix a minor issue in prepare-artifacts.sh script 37237ec681f3Smrg- ci: Use FDO_DISTRIBUTION_TAG where possible 37247ec681f3Smrg 37257ec681f3SmrgRoland Scheidegger (7): 37267ec681f3Smrg 37277ec681f3Smrg- llvmpipe/linear: don't try to use tgsi analysis for nir shaders 37287ec681f3Smrg- llvmpipe: always use draw_regions intersection 37297ec681f3Smrg- llvmpipe: fix nir dot products (fsum op) 37307ec681f3Smrg- aux/cso: try harder to keep cso state in sync on cso context unbind 37317ec681f3Smrg- gallium: add rasterizer depth_clamp enable bit 37327ec681f3Smrg- lavapipe: implement VK_EXT_depth_clip_enable 37337ec681f3Smrg- lavapipe: Fix crashes with transform feedback when using VK_WHOLE_SIZE 37347ec681f3Smrg 37357ec681f3SmrgRoman Stratiienko (7): 37367ec681f3Smrg 37377ec681f3Smrg- kmsro: Add 'kirin' driver support 37387ec681f3Smrg- AOSP: Extract version from libdrm instead of hardcoding it. 37397ec681f3Smrg- AOSP: Upgrade libLLVM dependency to v12 37407ec681f3Smrg- AOSP: Update timestamps of target binaries 37417ec681f3Smrg- AOSP: Add panfrost vulkan library suffix 37427ec681f3Smrg- lima: Implement lima_resource_get_param() callback 37437ec681f3Smrg- meson_options: Bump max value of platform-sdk-version to 31 37447ec681f3Smrg 37457ec681f3SmrgRyan Neph (1): 37467ec681f3Smrg 37477ec681f3Smrg- virgl: disallow null-terminated debug messages 37487ec681f3Smrg 37497ec681f3SmrgSagar Ghuge (19): 37507ec681f3Smrg 37517ec681f3Smrg- nir: Add new opcode for ternary addition 37527ec681f3Smrg- intel/compiler: Add support for ternary add instruction on XeHP 37537ec681f3Smrg- intel/compiler: Make decision based on source type instead of opcode 37547ec681f3Smrg- intel/compiler: Allow ternary add to promote source to immediate 37557ec681f3Smrg- nir: Add optimizations for iadd3 37567ec681f3Smrg- intel/compiler: Enable has_iadd3 option on XeHP 37577ec681f3Smrg- intel/compiler: Fix missing break in switch 37587ec681f3Smrg- intel/compiler: Handle ternary add in lower_simd_width 37597ec681f3Smrg- genxml/gen12: Update debug register fields according to HW 37607ec681f3Smrg- genxml/gen125: Update debug register fields according to HW 37617ec681f3Smrg- anv: Fix VK_EXT_memory_budget to consider VRAM if available 37627ec681f3Smrg- intel/compiler: Add 64-bit A64 float logical opcode support 37637ec681f3Smrg- anv: Advertise support for shaderBufferFloat64AtomicMinMax 37647ec681f3Smrg- intel/compiler: Add support to handle 64-bit atomics with A32 messages 37657ec681f3Smrg- anv: No need to lower to A64 messages for 64-bit atomics 37667ec681f3Smrg- iris: Enable atomic operations on compressed surfaces 37677ec681f3Smrg- intel/genxml: Add new bit fields Render Compression Format 37687ec681f3Smrg- isl: Add helper to return render compression format encoding 37697ec681f3Smrg- isl: Use software programmable render compression format encoding 37707ec681f3Smrg 37717ec681f3SmrgSamuel Pitoiset (215): 37727ec681f3Smrg 37737ec681f3Smrg- radv: only init the TC-compat ZRANGE metadata for the depth aspect 37747ec681f3Smrg- radv: fix bounds checking for zero vertex stride on GFX6-7 37757ec681f3Smrg- radv: report APUs as discrete GPUs for Red Dead Redemption 2 37767ec681f3Smrg- radv: fix specifying the stencil layout for separate depth/stencil layouts 37777ec681f3Smrg- radv: allow unused VkSpecializationMapEntries 37787ec681f3Smrg- aco: implement VK_EXT_shader_atomic_float2 37797ec681f3Smrg- radv: implement VK_EXT_shader_atomic_float2 37807ec681f3Smrg- radv: reduce number of emitted DWORDS for contiguous context registers 37817ec681f3Smrg- radv: do not use radeon_set_context_reg_seq() for only one register 37827ec681f3Smrg- radv: init radv_image::l2_coherent when creating the layout 37837ec681f3Smrg- ac: introduce a structure to store DCC address equations for GFX9 37847ec681f3Smrg- amd/addrlib: expose CMASK address equations to drivers on GFX9 37857ec681f3Smrg- ac/surface: add tests for CmaskAddrFromCoord prototype outside of addrlib 37867ec681f3Smrg- ac/surface: store CMASK pitch and height to radeon_surf 37877ec681f3Smrg- ac/surface: copy the CMASK equation to radeon_surf 37887ec681f3Smrg- ac/surface: implement CmaskAddrFromCoord in NIR 37897ec681f3Smrg- radv: fix selecting the first active CU when profiling with SQTT 37907ec681f3Smrg- radv: fix missing cache flushes when clearing HTILE levels on GFX10+ 37917ec681f3Smrg- amd/addrlib: expose CMASK address equations to drivers on GFX10+ 37927ec681f3Smrg- ac/surface: add tests for CmaskAddrFromCoord on GFX10+ 37937ec681f3Smrg- ac/surface: implement CmaskAddrFromCoord in NIR on GFX10+ 37947ec681f3Smrg- radv: rework DCC, FMASK and FCE decompress path 37957ec681f3Smrg- radv: perform a FCE for MSAA images that might have been fast-cleared 37967ec681f3Smrg- radv: allow DCC MSAA fast clears if a FCE is needed 37977ec681f3Smrg- radv: fix initializing the DS clear metadata value for separate aspects 37987ec681f3Smrg- radv: remove unnecessary FIXME about custom sample locations 37997ec681f3Smrg- radv: flush caches before performing separate depth/stencil aspect init 38007ec681f3Smrg- radv: bump maxFragmentSizeAspectRatio to 2 38017ec681f3Smrg- radv: disable fragmentShadingRateWithCustomSampleLocations 38027ec681f3Smrg- radv: bump maxFragmentShadingRateCoverageSamples to 32 38037ec681f3Smrg- radv: fix reported sample counts for VRS 1x1 38047ec681f3Smrg- radv: use more explicit DCC clear codes 38057ec681f3Smrg- radv: pass an image view to vi_get_fast_clear_parameters() 38067ec681f3Smrg- radv: add RADV_DCC_CLEAR_SINGLE 38077ec681f3Smrg- radv: determine if an image support fast clears using comp-to-single 38087ec681f3Smrg- radv: implement DCC fast clears with comp-to-single 38097ec681f3Smrg- radv: skip FCE for images that are fast-cleared using comp-to-single 38107ec681f3Smrg- radv: enable DCC fast-clears with comp-to-single on GFX10+ 38117ec681f3Smrg- radv: allow fast clears for concurrent images if comp-to-single is supported 38127ec681f3Smrg- radv: fix pre-computing viewport xform when setting new viewports 38137ec681f3Smrg- radv: fix fast clearing depth images with mips on GFX10+ 38147ec681f3Smrg- radv: determine if an image support comp-to-single at creation time 38157ec681f3Smrg- radv: remove useless check about the FCE predicate offset 38167ec681f3Smrg- radv: do not allocate the FCE predicate for images that use comp-to-single 38177ec681f3Smrg- radv: remove unnecessary check in radv_layout_is_htile_compressed() 38187ec681f3Smrg- radv: remove incorrect comment about compressed writes to HTILE on GFX10+ 38197ec681f3Smrg- radv: fix copying depth+stencil images on compute 38207ec681f3Smrg- radv: remove unused fast depth-stencil gfx clear path with expclear 38217ec681f3Smrg- radv: remove useless DISABLE_{ZMASK,SMEM}_EXPCLEAR_OPTIMIZATION state 38227ec681f3Smrg- radv: don't use SQ_NON_EVENT before GE_PC_ALLOC for better perf on Navi1x 38237ec681f3Smrg- radv: allocate shaders to 32-bit address to skip PGM_HI 38247ec681f3Smrg- nir/opt_algebraic: optimize fmax(-fmin(b, a), b) -> fmax(b, -a) 38257ec681f3Smrg- Revert "nir/opt_algebraic: optimize fmax(-fmin(b, a), b) -> fmax(b, -a)" 38267ec681f3Smrg- nir/opt_algebraic: optimize fmax(-fmin(b, a), b) -> fmax(fabs(b), -a) 38277ec681f3Smrg- ci: update the list of expected failures/skips for RADV 38287ec681f3Smrg- radv: allow storage images with VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 on GFX10.3+ 38297ec681f3Smrg- ci: update the list of skipped tests for Fiji/RADV 38307ec681f3Smrg- radv: remove outdated radv_finishme() in the HW resolve path 38317ec681f3Smrg- radv: remove useless check about number of samples in the HW resolve path 38327ec681f3Smrg- radv: remove unecessary radv_finishme() for invalid color formats 38337ec681f3Smrg- radv: disable DCC image stores on Navi12-14 for displayable DCC corruption 38347ec681f3Smrg- radv: do not load/store the clear value for comp-to-single images 38357ec681f3Smrg- radv: do not allocate a clear value for images that support comp-to-single 38367ec681f3Smrg- radv: add support for clearing multi layers with normal gfx clear path 38377ec681f3Smrg- vulkan: Update the XML and headers to 1.2.190 38387ec681f3Smrg- radv: advertise VK_EXT_primitive_topology_list_restart 38397ec681f3Smrg- ac/llvm: adjust assertion for nir_intrinsic_terminate 38407ec681f3Smrg- ac/llvm: fix huge alignment when loading from shared memory 38417ec681f3Smrg- radv/llvm: fix invalid IR when converting triangle strips to indices 38427ec681f3Smrg- radv: use radeon_set_sh_reg_seq() more for initial gfx/compute state 38437ec681f3Smrg- radv: call nir_lower_int64() for LLVM 38447ec681f3Smrg- radv: track if shader image 32-bit float atomics are enabled 38457ec681f3Smrg- radv: do not disable DCC for storage images if atomics aren't enabled 38467ec681f3Smrg- vulkan: add common entrypoints for sparse image requirements/properties 38477ec681f3Smrg- radv: use common entrypoints for sparse image requirements/properties 38487ec681f3Smrg- radv: use common vkGetPhysicalDevice{Image}FormatProperties() 38497ec681f3Smrg- radv: use common vkGetDeviceQueue() 38507ec681f3Smrg- radv: use common vkBind{Buffer,Image}Memory() 38517ec681f3Smrg- radv: use common vkGet{Buffer,Image}MemoryRequirements() 38527ec681f3Smrg- radv: fix determining the maximum number of waves that can use scratch 38537ec681f3Smrg- radv: remove NGG streamout support in LLVM 38547ec681f3Smrg- radv: allow to conditionally read HTILE value when copying VRS rates 38557ec681f3Smrg- radv: optimize copying VRS rates to the global HTILE buffer 38567ec681f3Smrg- radv: pass the HTILE buffer to radv_copy_vrs_htile() 38577ec681f3Smrg- radv: optimize VRS when no depth stencil attachment is bound 38587ec681f3Smrg- radv/llvm: rework VS input loads and implement the callback 38597ec681f3Smrg- ac/llvm: fix build with LLVM 14 38607ec681f3Smrg- radv: add MSAA support to the comp-to-single fast clear path 38617ec681f3Smrg- radv: enable comp-to-single for MSAA images 38627ec681f3Smrg- radv: reduce SQTT traffic when instruction timing is disabled 38637ec681f3Smrg- radv/llvm: fix using Wave32 38647ec681f3Smrg- radv/llvm: fix vertex input fetches with 16-bit floats 38657ec681f3Smrg- ac/llvm: implement nir_intrinsic_image_deref_atomic_{fmin,fmax} 38667ec681f3Smrg- ac/llvm: implement nir_intrinsic_ssbo_atomic_{fmin,fmax} 38677ec681f3Smrg- ac/llvm: implement nir_intrinsic_shared_atomic_{fmin,fmax} 38687ec681f3Smrg- ac/llvm: implement nir_intrinsic_global_atomic_{fmin,fmax} 38697ec681f3Smrg- radv: advertise EXT_shader_atomic_float2 with LLVM 14+ 38707ec681f3Smrg- radv/ci: add a list of expected failures for VanGogh 38717ec681f3Smrg- ac/rgp, radv: report scratch memory size for shaders 38727ec681f3Smrg- ac/rgp, radv: report wave size for shaders 38737ec681f3Smrg- radv: rename radv_decompress_depth_stencil() 38747ec681f3Smrg- radv: implement depth/stencil expand on compute 38757ec681f3Smrg- radv: add support for copying compressed depth/stencil images on compute 38767ec681f3Smrg- radv: keep depth/stencil images compressed for TRANSFER_DST on compute 38777ec681f3Smrg- radv: replicate THREAD_TRACE_CTRL config when stopping SQTT 38787ec681f3Smrg- radv: make the SQTT BO a resident buffer 38797ec681f3Smrg- radv: remove useless assertions in the SQTT path 38807ec681f3Smrg- radv: do not use a different disk cache key for LLVM 38817ec681f3Smrg- radv: do not store meta shaders to the default shader disk cache 38827ec681f3Smrg- radv: remove useless shader variant key copies for VS+TCS 38837ec681f3Smrg- radv: stop loading invocation ID for NGG vertex shaders 38847ec681f3Smrg- radv: remove unused radv_tcs_variant_key:primitive_mode 38857ec681f3Smrg- radv: stop using the shader keys for as_ls/as_es/as_ngg when possible 38867ec681f3Smrg- radv: remove useless as_ngg_passthrough init when lowering NGG in NIR 38877ec681f3Smrg- radv/llvm: stop using vs_common_out.as_ngg_passthrough 38887ec681f3Smrg- radv: add export_clip_dists for VS and TES to radv_shader_info 38897ec681f3Smrg- radv,aco: stop using vs_common_out.export_clip_dists 38907ec681f3Smrg- radv/llvm: stop using vs_common_out.export_prim_id 38917ec681f3Smrg- radv: store the topology instead of the output primitive type in the key 38927ec681f3Smrg- radv: store the CS subgroup size to radv_shader_info 38937ec681f3Smrg- radv: rework layout of radv_pipeline_key 38947ec681f3Smrg- radv: pass the pipeline key to the backend compilers 38957ec681f3Smrg- radv: cleanup uses of VK_PIPELINE_CREATE_DISABLE_OPTIMIZATION_BIT 38967ec681f3Smrg- radv: remove unused radv_nir_compiler_options fields 38977ec681f3Smrg- radv: remove unnecessary vs_common_out.export_viewport_index 38987ec681f3Smrg- radv: remove unnecessary vs_common_out.export_layer_id 38997ec681f3Smrg- radv: remove unnecessary radv_shader_info:{vs,tes}.export_prim_id 39007ec681f3Smrg- radv: remove unnecessary init of outinfo.export_prim_id for GS 39017ec681f3Smrg- radv: remove vs_common_out:export_prim_id 39027ec681f3Smrg- radv: remove vs_common_out:export_clip_dists 39037ec681f3Smrg- radv: pass the pipeline key to the shader info pass 39047ec681f3Smrg- radv: use the pipeline key more when possible 39057ec681f3Smrg- radv: stop using vs_common_out.{as_es/as_ls/as_ngg*} shader keys 39067ec681f3Smrg- radv: remove radv_shader_variant_key completely 39077ec681f3Smrg- radv: fix missing features for BDA 39087ec681f3Smrg- radv: remove the LLVM stat about the number of private VGPRs 39097ec681f3Smrg- radv: fix adjusting the frag coord when RADV_FORCE_VRS is enabled 39107ec681f3Smrg- radv: fix selecting the hash when RADV_FORCE_VRS is enabled 39117ec681f3Smrg- radv: make sure to load the Primitive ID for VS+GS as NGG 39127ec681f3Smrg- radv: fix vk_object_base_init/finish for the internal pipeline cache 39137ec681f3Smrg- radv: fix vk_object_base_init/finish for internal buffer views 39147ec681f3Smrg- radv: fix vk_object_base_init/finish for the internal push descriptors 39157ec681f3Smrg- radv: fix vk_object_base_init/finish for internal image views 39167ec681f3Smrg- radv: fix vk_object_base_init/finish for internal buffers 39177ec681f3Smrg- radv: set export_clip_dists for the GS copy shader 39187ec681f3Smrg- radv: determine the VS output parameters in the shader info pass 39197ec681f3Smrg- radv: disable the DX10 diamond test for better line rasterization perf 39207ec681f3Smrg- radv: get the float controls execution mode from NIR for LLVM 39217ec681f3Smrg- radv: do not declare an extra user SGPR for sample positions and PS 39227ec681f3Smrg- radv: move ngg early prim export determination earlier 39237ec681f3Smrg- move: move ngg lds bytes determination earlier 39247ec681f3Smrg- radv: move ngg passthrough determination earlier 39257ec681f3Smrg- radv: remove unnecessary ac_nir_ngg_config output struct 39267ec681f3Smrg- radv: constify radv_shader_info for radv_lower_{io_to_mem,ngg}() 39277ec681f3Smrg- radv: move forcing discard to demote to the graphics pipeline key 39287ec681f3Smrg- radv: move forcing invariant geometry to the graphics pipeline key 39297ec681f3Smrg- radv: move forcing MRT output NaN fixup to the graphics pipeline key 39307ec681f3Smrg- radv: move forcing VRS rates to the graphics pipeline key 39317ec681f3Smrg- radv: move use of NGG to the graphics pipeline key 39327ec681f3Smrg- radv: remove redundant check of needs_multiview_view_index for PS 39337ec681f3Smrg- radv: remove useless loads_dynamic_offsets when emitting push constants 39347ec681f3Smrg- radv: determine the ES type (VS or TES) for GS earlier 39357ec681f3Smrg- ci: enable building RADV in debian-release 39367ec681f3Smrg- radv: fix vk_object_base_init/finish for push descriptors 39377ec681f3Smrg- radv: fix writing combined image/sampler descriptor 39387ec681f3Smrg- radv: fix vk_object_base_init/finish for internal device memory objects 39397ec681f3Smrg- radv/llvm: fix exporting VS parameters 39407ec681f3Smrg- radv: do not set TRAP_PRESENT(1) for fragment shaders 39417ec681f3Smrg- aco: fix load_barycentric_at_{offset,sample} 39427ec681f3Smrg- radv: declare the shader user locs from the shader arguments 39437ec681f3Smrg- radv: determine if a shader uses indirect descriptors from the SGPR loc 39447ec681f3Smrg- radv: determine if a shader loads push constants from the SGPR loc 39457ec681f3Smrg- radv: remove unnecessary radv_shader_info:base_inline_push_consts 39467ec681f3Smrg- radv: remove unnecessary radv_shader_info:num_inline_push_consts 39477ec681f3Smrg- radv: do not overwrite loads_push_constants when declaring shader args 39487ec681f3Smrg- radv: gather more information about PS in the shader info pass 39497ec681f3Smrg- radv,aco: compute and store the SPI PS input in radv_shader_info 39507ec681f3Smrg- aco: prevent using undeclared shader arguments for PS 39517ec681f3Smrg- radv,aco: remap PS inputs when declaring shader arguments 39527ec681f3Smrg- aco: constify radv_shader_{info,args} 39537ec681f3Smrg- radv: remove radv_pipeline::layout 39547ec681f3Smrg- radv: implement vkGetDeviceBufferMemoryRequirementsKHR() 39557ec681f3Smrg- radv: implement vkGetDeviceImageMemoryRequirementsKHR() 39567ec681f3Smrg- radv: implement vkGetDeviceImageSparseMemoryRequirementsKHR() 39577ec681f3Smrg- radv: advertise VK_KHR_maintenance4 39587ec681f3Smrg- radv: use nir_image_deref_{load,store} in the DCC retile compute path 39597ec681f3Smrg- radv: remove useless coordinate computation in the compute clear path 39607ec681f3Smrg- radv: remove few useless nir_channels() in meta shaders 39617ec681f3Smrg- radv: use get_global_ids() to compute coordinates in meta shaders 39627ec681f3Smrg- radv: use nir_ssa_undef() for unused image components in meta shaders 39637ec681f3Smrg- radv: move ac_shader_config to radv_shader_binary instead of legacy 39647ec681f3Smrg- radv: store the post-processed shader binary config to the cache 39657ec681f3Smrg- radv,aco: remove nir_intrinsic_load_layer_id 39667ec681f3Smrg- radv: remove no-op about the view index in the shader info pass 39677ec681f3Smrg- radv: rename needs_multiview_view_index to uses_view_index 39687ec681f3Smrg- radv: stop gathering output GS info for vertex shaders 39697ec681f3Smrg- aco: cleanup setup_vs_output_info() 39707ec681f3Smrg- radv: do not initialize is_ngg_passthrough for geometry shaders 39717ec681f3Smrg- radv: remove duplicated code about NGG passthrough determination 39727ec681f3Smrg- radv: switch to VK_FORMAT_FEATURE_2_XXX/VkFormatProperties3KHR 39737ec681f3Smrg- radv: implement VK_KHR_format_feature_flags2 39747ec681f3Smrg- aco: do not return an empty string when disassembly is not supported 39757ec681f3Smrg- radv: fix removing PSIZ when it's not emitted by the last VGT stage 39767ec681f3Smrg- radv: fix OpImageQuerySamples with non-zero descriptor set 39777ec681f3Smrg- radv: do not remove PSIZ for streamout shaders 39787ec681f3Smrg- aco: fix invalid IR generated for b2f64 when the dest is a VGPR 39797ec681f3Smrg- aco: fix emitting stream outputs when the first component isn't zero 39807ec681f3Smrg- aco: fix loading 64-bit inputs with fragment shaders 39817ec681f3Smrg- radv: re-emit prolog inputs when the nontrivial divisors state changed 39827ec681f3Smrg- radv: fix build errors with Android 39837ec681f3Smrg- aco: only load streamout buffers if streamout is enabled 39847ec681f3Smrg- radv: do not expose buffer features for depth/stencil formats 39857ec681f3Smrg- radv/sqtt: fix GPU hangs when capturing from the compute queue 39867ec681f3Smrg- radv: fix a sync issue on GFX9+ by clearing the upload BO fence 39877ec681f3Smrg- nir: fix constant expression of ibitfield_extract 39887ec681f3Smrg 39897ec681f3SmrgSergii Melikhov (2): 39907ec681f3Smrg 39917ec681f3Smrg- iris: Fix Null pointer dereferences 39927ec681f3Smrg- dri2: Fix Null pointer dereferences 39937ec681f3Smrg 39947ec681f3SmrgShmerl (1): 39957ec681f3Smrg 39967ec681f3Smrg- vulkan/overlay: don't display histogram and range for device and format 39977ec681f3Smrg 39987ec681f3SmrgSimon Ser (18): 39997ec681f3Smrg 40007ec681f3Smrg- EGL: sync headers with Khronos 40017ec681f3Smrg- egl: add support for EGL_EXT_device_drm_render_node 40027ec681f3Smrg- etnaviv: fix renderonly check in etna_resource_alloc 40037ec681f3Smrg- etnaviv: fail in get_handle(TYPE_KMS) without a scanout resource 40047ec681f3Smrg- freedreno: fail in get_handle(TYPE_KMS) without a scanout resource 40057ec681f3Smrg- panfrost: fail in get_handle(TYPE_KMS) without a scanout resource 40067ec681f3Smrg- lima: fail in get_handle(TYPE_KMS) without a scanout resource 40077ec681f3Smrg- vulkan/wsi/wayland: use drm_fourcc.h for formats 40087ec681f3Smrg- vulkan/wsi/wayland: drop support for wl_drm 40097ec681f3Smrg- vulkan/wsi/wayland: generalize modifier handling 40107ec681f3Smrg- etnaviv: add stride, offset and modifier to resource_get_param 40117ec681f3Smrg- panfrost: implement resource_get_param 40127ec681f3Smrg- vc4: implement resource_get_param 40137ec681f3Smrg- v3d: implement resource_get_param 40147ec681f3Smrg- vulkan/wsi/x11: add driconf option to not wait under Xwayland 40157ec681f3Smrg- gbm: consistently use the same name for BO flags 40167ec681f3Smrg- gbm: add gbm_{bo,surface}_create_with_modifiers2 40177ec681f3Smrg- gbm: assume USE_SCANOUT in create_with_modifiers 40187ec681f3Smrg 40197ec681f3SmrgSimon Zeni (5): 40207ec681f3Smrg 40217ec681f3Smrg- gbm: add GBM_FORMAT_R16 40227ec681f3Smrg- i915: remove use of backtrace and backtrace_symbols 40237ec681f3Smrg- glapi/gl_gentable.py: drop call to backtrace on no op 40247ec681f3Smrg- util/u_debug_symbol: remove debug_symbol_name_glibc and execinfo dependency 40257ec681f3Smrg- meson: stop searching for execinfo 40267ec681f3Smrg 40277ec681f3SmrgStéphane Marchesin (1): 40287ec681f3Smrg 40297ec681f3Smrg- virgl: Flush context before waiting on fences 40307ec681f3Smrg 40317ec681f3SmrgTapani Pälli (22): 40327ec681f3Smrg 40337ec681f3Smrg- crocus: take a reference to memobj bo in crocus_resource_from_memobj 40347ec681f3Smrg- crocus: disable depth and d+s formats with memory objects 40357ec681f3Smrg- iris: handle depth-stencil import with a wrapper function 40367ec681f3Smrg- anv: disable aux for exportable images without modifiers 40377ec681f3Smrg- anv: allow stencil memory export 40387ec681f3Smrg- anv/android: fix build error due refactoring 40397ec681f3Smrg- mesa: fix timestamp enum with EXT_disjoint_timer_query 40407ec681f3Smrg- mesa: GL_ARB_ES3_2_compatibility GL compat profile support 40417ec681f3Smrg- anv: remove a format assert when setting up attachments 40427ec681f3Smrg- vulkan: provide common functions to check device features 40437ec681f3Smrg- anv: remove feature checks from device creation 40447ec681f3Smrg- radv: remove feature checks from device creation 40457ec681f3Smrg- turnip: remove feature checks from device creation 40467ec681f3Smrg- v3dv: remove feature checks from device creation 40477ec681f3Smrg- lavapipe: remove feature checks from device creation 40487ec681f3Smrg- panvk: remove feature checks from device creation 40497ec681f3Smrg- intel/blorp: fix a compile warning about uninitialized use 40507ec681f3Smrg- intel/isl: FXT1 support was removed on Gfx12.5 40517ec681f3Smrg- swrast: Fix another warning from gcc 11 40527ec681f3Smrg- anv/android: fix parameters given for vk_common_QueueSubmit 40537ec681f3Smrg- anv: use vk_object_zalloc for wsi fences created 40547ec681f3Smrg- iris: clear bos_written when resetting a batch 40557ec681f3Smrg 40567ec681f3SmrgThomas H.P. Andersen (1): 40577ec681f3Smrg 40587ec681f3Smrg- nine: Fix assert in tx_src_param 40597ec681f3Smrg 40607ec681f3SmrgThomas Wagner (6): 40617ec681f3Smrg 40627ec681f3Smrg- gallium: add utility and interface for memory fd allocations 40637ec681f3Smrg- llvmpipe: add support for EXT_memory_object(_fd) 40647ec681f3Smrg- lavapipe: add support for KHR_external_memory_fd 40657ec681f3Smrg- llvmpipe: enable EXT_memory_object(_fd) 40667ec681f3Smrg- lavapipe: enable KHR_external_memory_fd 40677ec681f3Smrg- util: use anonymous file for memory fd creation 40687ec681f3Smrg 40697ec681f3SmrgThong Thai (15): 40707ec681f3Smrg 40717ec681f3Smrg- gallium: add temporal layers cap enum 40727ec681f3Smrg- frontends/va: check number of temporal layers supported by encoder 40737ec681f3Smrg- gallium: update h264 struct to track temporal layers 40747ec681f3Smrg- radeon/vcn/enc: H.264 SVC encode 40757ec681f3Smrg- radeonsi: enable H.264 temporal encoding support for VCN 40767ec681f3Smrg- frontends/va: handle h264 num_temporal_layers for SVC encoding 40777ec681f3Smrg- gallium: change rate ctrl struct to array 40787ec681f3Smrg- r600: change rate ctrl struct to array 40797ec681f3Smrg- radeon/vce: change rate ctrl struct to array 40807ec681f3Smrg- radeon/vcn/enc: change to per-temporal layer rate control 40817ec681f3Smrg- frontends/omx: change rate ctrl struct to array 40827ec681f3Smrg- frontends/va: change to per-layer rate control 40837ec681f3Smrg- gallium/auxiliary/vl: Add additional deinterlace enum and tracking 40847ec681f3Smrg- gallium/util: add half texel offset param to util_compute_blit 40857ec681f3Smrg- frontends/va/postproc: Keep track of deinterlacing method being used 40867ec681f3Smrg 40877ec681f3SmrgTimothy Arceri (20): 40887ec681f3Smrg 40897ec681f3Smrg- util: document that workaround also fixes Riptale 40907ec681f3Smrg- glsl: replace some C++ code with C 40917ec681f3Smrg- nir/gcm: be less destructive with instruction order 40927ec681f3Smrg- intel/compiler: call nir_opt_dead_cf() after we have finished all opts 40937ec681f3Smrg- intel/compiler: Use GCM in nir_optimize 40947ec681f3Smrg- util: add workaround for Full Bore 40957ec681f3Smrg- glsl: relax rule on varying matching for shaders older than 4.20 40967ec681f3Smrg- intel/compiler: make sure swizzle is applied to if condition 40977ec681f3Smrg- nir: add indirect loop unrolling to compiler options 40987ec681f3Smrg- nir: move nir_block_ends_in_break() to nir.h 40997ec681f3Smrg- nir: add heuristic for instructions in loops with GCM 41007ec681f3Smrg- nir: fix GCM when GVN enabled 41017ec681f3Smrg- glsl: fix variable scope for instructions inside case statements 41027ec681f3Smrg- mesa: fix mesa_problem() call in _mesa_program_state_flags() 41037ec681f3Smrg- glsl: fix variable scope for loop-expression 41047ec681f3Smrg- glsl: handle scope correctly when inlining loop expression 41057ec681f3Smrg- glsl: fix variable scope for do-while loops 41067ec681f3Smrg- util/cache: run basic cache tests on the single file cache 41077ec681f3Smrg- util/cache: test simple cache put and get between instances 41087ec681f3Smrg- mesa: fix buffer overrun in SavedObj texture obj array 41097ec681f3Smrg 41107ec681f3SmrgTimur Kristóf (71): 41117ec681f3Smrg 41127ec681f3Smrg- radv: Use 128-sized vertex grouping for NGG shaders. 41137ec681f3Smrg- radv: Don't compile NGG culling into shaders that write viewport index. 41147ec681f3Smrg- radv: Remove num_viewports from radv_skip_ngg_culling. 41157ec681f3Smrg- aco: Swap s_and operand order for ballot. 41167ec681f3Smrg- aco: Allow elect to take advantage of knowing when all lanes are active. 41177ec681f3Smrg- aco: Remove s_and with exec when all lanes are active. 41187ec681f3Smrg- radv: Use pre-computed viewport transform for NGG culling state. 41197ec681f3Smrg- aco: Fix how p_elect interacts with optimizations. 41207ec681f3Smrg- aco, nir, ac: Simplify sequence of getting initial NGG VS edge flags. 41217ec681f3Smrg- ac/nir: Use es_accepted variable after culling. 41227ec681f3Smrg- ac/nir: Use gs_accepted variable after culling. 41237ec681f3Smrg- ac/nir: Don't count vertices and primitives in wave after culling. 41247ec681f3Smrg- nir, aco: Remove vertex and primitive count overwrite intrinsic. 41257ec681f3Smrg- ac/nir: Remove unhelpful nir_opt_cse from ac_nir_lower_ngg_nogs. 41267ec681f3Smrg- aco: Use Navi 10 empty NGG output workaround on NGG culling shaders. 41277ec681f3Smrg- radv: Don't toggle PC oversubscription for NGG culling. 41287ec681f3Smrg- radv: Use ac_compute_late_alloc in radv_pipeline. 41297ec681f3Smrg- ac: Remove deprecated use_late_alloc field as nobody uses it anymore. 41307ec681f3Smrg- radv: Write RSRC2_GS for NGGC when pipeline is dirty but not emitted. 41317ec681f3Smrg- aco: Fix to_uniform_bool_instr when operands are not suitable. 41327ec681f3Smrg- radv, ac, aco: Use indices 0-2 of gs_vtx_offset argument array on GFX9+. 41337ec681f3Smrg- radeonsi: Change GS vertex offset arguments to use gs_vtx_offset array. 41347ec681f3Smrg- ac: Calculate workgroup sizes of HW stages that operate in workgroups. 41357ec681f3Smrg- radv: Calculate workgroup sizes in radv_pipeline. 41367ec681f3Smrg- radv: Remove superfluous workgroup size calculations. 41377ec681f3Smrg- aco: Use workgroup size from input shader info. 41387ec681f3Smrg- aco: Consider LDS usage by PS inputs in MaxWaves calculation. 41397ec681f3Smrg- aco: Consider maximum number of workgroups per CU/WGP on Navi. 41407ec681f3Smrg- aco: Emit zero for the derivatives of uniforms. 41417ec681f3Smrg- aco: Unset 16 and 24-bit flags from operands in apply_extract. 41427ec681f3Smrg- nir: Add unsigned upper bound for extract opcodes. 41437ec681f3Smrg- nir: Fix local_invocation_index upper bound for non-compute-like stages. 41447ec681f3Smrg- nir: Add comment to explain the sad_u8x4 opcode. 41457ec681f3Smrg- aco: Fix invalid usage of std::fill with std::array. 41467ec681f3Smrg- ac/nir/ngg: Delete unused struct. 41477ec681f3Smrg- ac/nir/nggc: Don't stop applying reusable variables at prim export. 41487ec681f3Smrg- ac/nir/nggc: Only repack arguments that are needed. 41497ec681f3Smrg- ac/nir/nggc: Move gs_alloc_req up in NGG culling shaders. 41507ec681f3Smrg- aco: Use Builder reference in emit_copies_block. 41517ec681f3Smrg- aco: Skip code paths to emit copies when there are no copies. 41527ec681f3Smrg- aco/optimize_postRA: Use iterators instead of operator[] of std::array. 41537ec681f3Smrg- aco: Add some useful info to the README for debugging. 41547ec681f3Smrg- radv: Remove PSIZ output when it isn't needed. 41557ec681f3Smrg- aco: Add ability to optimize v_lshl + v_sub into v_mad_i32_i24. 41567ec681f3Smrg- aco/isel: Fix emit_vop2_instruction to apply 16/24-bit flags properly. 41577ec681f3Smrg- ac/nir: Remove byte permute from prefix sum of the repack sequence. 41587ec681f3Smrg- ac/nir: Fix match_mask to work correctly for VS outputs. 41597ec681f3Smrg- nir: Exclude non-generic patch variables from get_variable_io_mask. 41607ec681f3Smrg- radv: Disable HW generated edge flags for NGG shaders. 41617ec681f3Smrg- ac/nir: Emit edge flag instructions conditionally. 41627ec681f3Smrg- radv/llvm: Don't read edge flags anymore. 41637ec681f3Smrg- radv: Fix gs_vgpr_comp_cnt for NGG culling in vertex shaders. 41647ec681f3Smrg- ac/nir/nggc: Refactor save_reusable_variables. 41657ec681f3Smrg- ac/nir/nggc: Don't reuse uniform values from divergent control flow. 41667ec681f3Smrg- radv: Select PC oversubscription rate based on number of PS params. 41677ec681f3Smrg- radv: Reduce NGG culling small draw threshold to 128. 41687ec681f3Smrg- aco: Allow p_extract to have different definition and operand sizes. 41697ec681f3Smrg- aco: Implement integer conversions using p_extract. 41707ec681f3Smrg- aco: Omit p_extract after ds_read with matching bit size. 41717ec681f3Smrg- aco: Don't write m0 register for LDS instructions on GFX9+. 41727ec681f3Smrg- aco: Fix small primitive precision. 41737ec681f3Smrg- aco: Fix determining whether any culling is enabled. 41747ec681f3Smrg- radv: Don't declare ngg_gs_state when there is no API GS. 41757ec681f3Smrg- radv: Enable NGG culling by default on GFX10.3, add nonggc debug flag. 41767ec681f3Smrg- ac/nir/cull: Accept NaN and +/- Inf in face culling. 41777ec681f3Smrg- ac/nir/nggc: Write undef to variables in non-repacked ES threads. 41787ec681f3Smrg- aco/optimizer: Skip SDWA on v_lshlrev when unnecessary in apply_extract. 41797ec681f3Smrg- drirc: Fix indentation. 41807ec681f3Smrg- drirc: Apply radv_invariant_geom workaround to Resident Evil Village. 41817ec681f3Smrg- drirc: Apply radv_invariant_geom workaround to World War Z games. 41827ec681f3Smrg- aco: Fix how p_is_helper interacts with optimizations. 41837ec681f3Smrg 41847ec681f3SmrgTomeu Vizoso (40): 41857ec681f3Smrg 41867ec681f3Smrg- panvk: Don't try to update samplers if they are immutable 41877ec681f3Smrg- panvk: Start a new batch when the job index gets above the limit 41887ec681f3Smrg- panvk: Close batch when ending a command buffer 41897ec681f3Smrg- panvk: Move check for fragment requirement up to the draw 41907ec681f3Smrg- panvk: A pipeline might not be bound when the render pass is ended 41917ec681f3Smrg- panvk: Expose panvk_cmd_alloc_fb_desc and panvk_cmd_alloc_tls_desc 41927ec681f3Smrg- panvk: Implement vkCmdClearAttachments 41937ec681f3Smrg- docs/ci: Update http cache config to let Authorization headers pass through 41947ec681f3Smrg- freedreno/ci: Move rules for restricted jobs to test-source-dep.yml 41957ec681f3Smrg- ci: Update canvas_text trace 41967ec681f3Smrg- virgl/ci: Have LLVMPipe use more threads for rendering 41977ec681f3Smrg- virgl/ci: Rebalance concurrency 41987ec681f3Smrg- virgl/ci: Wait a bit before shutting the VM down 41997ec681f3Smrg- virgl/ci: Set NIR_VALIDATE=0 on the host 42007ec681f3Smrg- panfrost: Add padding to pan_blit_blend_shader_key 42017ec681f3Smrg- iris/ci: Add manual jobs for tracking performance 42027ec681f3Smrg- panvk: Initialize timestamp for disk cache 42037ec681f3Smrg- freedreno/ci: Correctly set freq governors to max 42047ec681f3Smrg- iris/ci: Correctly set freq governors to max 42057ec681f3Smrg- panvk/ci: Build-test panvk 42067ec681f3Smrg- ci: Ensure the DRM device is open 42077ec681f3Smrg- lavapipe: add xfails for whole of CTS 42087ec681f3Smrg- vulkan: Read len attribute of parameters to functions 42097ec681f3Smrg- vulkan: Generate code to place commands in a queue 42107ec681f3Smrg- vulkan: Generate entrypoints that enqueue commands 42117ec681f3Smrg- lavapipe: Use generated command queue code 42127ec681f3Smrg- lavapipe: Use c_msvc_compat_args 42137ec681f3Smrg- vulkan: Remove dependency on Python 3.9+ 42147ec681f3Smrg- Revert "lavapipe: unbreak imageless framebuffer" 42157ec681f3Smrg- vulkan: Copy pNext structures when enqueuing commands 42167ec681f3Smrg- ci: Uprev piglit to 99be1b06ff36 42177ec681f3Smrg- ci: Stop adding link to tracie dashboard 42187ec681f3Smrg- panfrost/ci: Enable test runs on G72 42197ec681f3Smrg- panvk: Move CmdClear* impl to a separate file 42207ec681f3Smrg- panfrost/ci: Move CI files to src/panfrost 42217ec681f3Smrg- panfrost/ci: Test panvk on Mali G52 42227ec681f3Smrg- ci: Rebuild kernel with Amlogic KMS support 42237ec681f3Smrg- panfrost/ci: Run Piglit's quick_gl tests on G52 42247ec681f3Smrg- ci: Add support for lazor Chromebooks 42257ec681f3Smrg- ci: Let manual LAVA jobs have a longer timeout than others 42267ec681f3Smrg 42277ec681f3SmrgTony Wasserka (24): 42287ec681f3Smrg 42297ec681f3Smrg- radv: Rename radv_shader_helper.h to radv_llvm_helper.h 42307ec681f3Smrg- aco: Separate LLVM/CLRX asm printers more cleanly 42317ec681f3Smrg- aco: Extend set of supported GPUs that can be disassembled with CLRX 42327ec681f3Smrg- radv: Build code which depends on LLVM only when enabled 42337ec681f3Smrg- radv: Disable shader disassembly when no disassembler is available 42347ec681f3Smrg- aco/tests: Assert that the requested IR is actually provided 42357ec681f3Smrg- aco/spill: Avoid unneeded copies when iterating over maps 42367ec681f3Smrg- aco: Use std::vector for the underlying container of std::stack 42377ec681f3Smrg- aco/spill: Remove unused container 42387ec681f3Smrg- aco/spill: Replace map[] with map::insert 42397ec681f3Smrg- aco/spill: Avoid copying next_use maps more often than needed 42407ec681f3Smrg- aco/spill: Persist memory allocations of local next use maps 42417ec681f3Smrg- aco/spill: Avoid destroying local next use maps over-eagerly 42427ec681f3Smrg- aco/spill: Replace vector<map> with vector<vector> for local_next_use 42437ec681f3Smrg- aco/spill: Prefer unordered_map over map for next use distances 42447ec681f3Smrg- aco/spill: Avoid copying current_spills when not needed 42457ec681f3Smrg- aco/spill: Reduce redundant std::map lookups 42467ec681f3Smrg- aco/spill: Replace an std::map to booleans with std::set 42477ec681f3Smrg- aco/spill: Store remat list in an std::unordered_map instead of std::map 42487ec681f3Smrg- aco/spill: Change worklist to a single integer 42497ec681f3Smrg- aco/spill: Reduce allocations in next_uses_per_block 42507ec681f3Smrg- aco/spill: Clarify use of long-lived references by adding const 42517ec681f3Smrg- aco/spill: Use unordered_map for spills_exit 42527ec681f3Smrg- aco/spill: Use std::unordered_map for spills_entry 42537ec681f3Smrg 42547ec681f3SmrgVadym Shovkoplias (3): 42557ec681f3Smrg 42567ec681f3Smrg- driconf, glsl: Add a vs_position_always_precise option 42577ec681f3Smrg- drirc: Set vs_position_always_precise for Assault Android Cactus 42587ec681f3Smrg- intel/fs: Fix a cmod prop bug when cmod is set to inst that doesn't support it 42597ec681f3Smrg 42607ec681f3SmrgVasily Khoruzhick (2): 42617ec681f3Smrg 42627ec681f3Smrg- lima: handle fp16 vertex formats 42637ec681f3Smrg- lima: split_load_input: don't split unaligned vec2 42647ec681f3Smrg 42657ec681f3SmrgVeerabadhran Gopalakrishnan (2): 42667ec681f3Smrg 42677ec681f3Smrg- radeon/vcn: Add FW header flag to enable VP9 header parsing 42687ec681f3Smrg- gallium/va: Remove VP9 header parsing for secure playback 42697ec681f3Smrg 42707ec681f3SmrgVinson Lee (17): 42717ec681f3Smrg 42727ec681f3Smrg- nv50/ir: Initialize Value member id in constructor. 42737ec681f3Smrg- asahi: Move assignment after null check. 42747ec681f3Smrg- spirv_to_dxil: Fix missing-prototypes build error. 42757ec681f3Smrg- meson: Remove duplicate xvmc in build summary. 42767ec681f3Smrg- nir: Initialize evaluate_cube_face_index_amd dst.x. 42777ec681f3Smrg- zink: Remove unnecessary null checks. 42787ec681f3Smrg- nv50/ir: Add FlatteningPass constructor. 42797ec681f3Smrg- freedreno: Require C++17. 42807ec681f3Smrg- broadcom/compiler: Fix qpu.flags.muf typo. 42817ec681f3Smrg- glx: Fix unused-variable warning with macOS build. 42827ec681f3Smrg- draw/tess: Fix unused-function warning with draw-use-llvm=disabled. 42837ec681f3Smrg- nv50/ir: Add DeadCodeElim constructor. 42847ec681f3Smrg- pps: Avoid duplicate elements in with_datasources array. 42857ec681f3Smrg- freedreno: Add valgrind dependency. 42867ec681f3Smrg- anv: Fix assertion. 42877ec681f3Smrg- radv: Fix memory leak on error path. 42887ec681f3Smrg- virgl: Allocate qdws after virgl_init_context to avoid leak. 42897ec681f3Smrg 42907ec681f3SmrgWitold Baryluk (2): 42917ec681f3Smrg 42927ec681f3Smrg- zink: Do not access just freed zink_batch_state 42937ec681f3Smrg- zink: Fully initialize VkBufferViewCreateInfo for hashing 42947ec681f3Smrg 42957ec681f3SmrgYevhenii Kharchenko (1): 42967ec681f3Smrg 42977ec681f3Smrg- iris: fix layer calculation for TEXTURE_3D ReadPixels() on mip-level>0 42987ec681f3Smrg 42997ec681f3SmrgYevhenii Kolesnikov (19): 43007ec681f3Smrg 43017ec681f3Smrg- glsl: Add operator for .length() method on implicitly-sized arrays 43027ec681f3Smrg- glsl: Properly handle .length() of an unsized array 43037ec681f3Smrg- vulkan: Add a common vk_command_buffer structure 43047ec681f3Smrg- anv: Use a common vk_command_buffer structure 43057ec681f3Smrg- radv: Use a common vk_command_buffer structure 43067ec681f3Smrg- turnip: Use a common vk_command_buffer structure 43077ec681f3Smrg- v3dv: Use a common vk_command_buffer structure 43087ec681f3Smrg- lavapipe: Use a common vk_command_buffer structure 43097ec681f3Smrg- vulkan: Add a common vk_queue structure 43107ec681f3Smrg- anv: Use a common vk_queue structure 43117ec681f3Smrg- radv: Use a common vk_queue structure 43127ec681f3Smrg- turnip: Use a common vk_queue structure 43137ec681f3Smrg- v3dv: Use a common vk_queue structure 43147ec681f3Smrg- lavapipe: Use a common vk_queue structure 43157ec681f3Smrg- vulkan: Implement VK_EXT_debug_utils 43167ec681f3Smrg- vulkan/enum_to_str: Add generator for VkObjectType to Vulkan Handle 43177ec681f3Smrg- vulkan: Add vk_asprintf and vk_vasprintf helpers 43187ec681f3Smrg- vulkan: Add convenience debug message helpers 43197ec681f3Smrg- anv: Switch to new debug message helpers 43207ec681f3Smrg 43217ec681f3SmrgYipeng Chen (Jasber) (1): 43227ec681f3Smrg 43237ec681f3Smrg- radeonsi: do not use staging texture for APU 43247ec681f3Smrg 43257ec681f3SmrgYiwei Zhang (24): 43267ec681f3Smrg 43277ec681f3Smrg- venus: cache ahb backed buffer memory type bits requirement 43287ec681f3Smrg- venus: fix all missing vn_object_base_fini 43297ec681f3Smrg- venus: scrub ignored fields of pipeline info when rasterization is disable 43307ec681f3Smrg- venus: refactor failure path for sets allocation 43317ec681f3Smrg- venus: add vn_descriptor_set_layout_init 43327ec681f3Smrg- venus: descriptor layout to track more binding infos 43337ec681f3Smrg- venus: layout to track variable descriptor count binding info 43347ec681f3Smrg- venus: descriptor pool to track pool state 43357ec681f3Smrg- venus: descriptor set to track descriptor count of last binding 43367ec681f3Smrg- venus: check descriptor allocations against pool resource 43377ec681f3Smrg- venus: conditionally enable async descriptor set allocation 43387ec681f3Smrg- venus: set maxMipLevels to 1 for ahb images 43397ec681f3Smrg- venus: renderer to check map size only when mappable 43407ec681f3Smrg- venus: workaround a blob_mem mappable size check issue 43417ec681f3Smrg- venus: suggest the proper sampler ycbcr model conversion based on format 43427ec681f3Smrg- docs: update vn extension list 43437ec681f3Smrg- venus: amend supported extensions list 43447ec681f3Smrg- venus: properly check and fill ahb buffer properties 43457ec681f3Smrg- util: fix sign comparison 43467ec681f3Smrg- radv/anv android: rename buffer usage camera mask 43477ec681f3Smrg- android_stub: update platform headers to include atrace 43487ec681f3Smrg- venus: update to latest venus-protocol to include tracing 43497ec681f3Smrg- dri_interface: remove obsolete interfaces 43507ec681f3Smrg- dri_interface: remove gl header 43517ec681f3Smrg 43527ec681f3SmrgYogesh Mohan Marimuthu (2): 43537ec681f3Smrg 43547ec681f3Smrg- radeonsi: remove redundant setting scratch_state atom dirty 43557ec681f3Smrg- radeonsi: set scratch_state dirty only if ctx->scratch_buffer allocated 43567ec681f3Smrg 43577ec681f3SmrgYogesh Mohanmarimuthu (1): 43587ec681f3Smrg 43597ec681f3Smrg- vulkan/device-select: select correct default device for xcb apiVersion 1.0 43607ec681f3Smrg 43617ec681f3SmrgZachary Michaels (1): 43627ec681f3Smrg 43637ec681f3Smrg- X11: Ensure that VK_SUBOPTIMAL_KHR propagates to user code 43647ec681f3Smrg 43657ec681f3SmrgZhu Yuliang (1): 43667ec681f3Smrg 43677ec681f3Smrg- gallium/vl: don't leak fd in vl_dri3_screen_create 43687ec681f3Smrg 43697ec681f3Smrgbyte[] (1): 43707ec681f3Smrg 43717ec681f3Smrg- i965: Explicitly abort instead of exiting on batch failure 43727ec681f3Smrg 43737ec681f3Smrgliuyujun (1): 43747ec681f3Smrg 43757ec681f3Smrg- gallium: fix surface->destroy use-after-free 43767ec681f3Smrg 43777ec681f3Smrgmattvchandler (1): 43787ec681f3Smrg 43797ec681f3Smrg- gallium/osmesa: fix buffer resizing 43807ec681f3Smrg 43817ec681f3Smrgmwezdeck (1): 43827ec681f3Smrg 43837ec681f3Smrg- mesa: validate texture format against GL/ES ctx 43847ec681f3Smrg 43857ec681f3Smrgorbea (1): 43867ec681f3Smrg 43877ec681f3Smrg- build: add sha1_h for lp_texture.c 43887ec681f3Smrg 43897ec681f3Smrgsuijingfeng (4): 43907ec681f3Smrg 43917ec681f3Smrg- gallivm: add basic mips64 support and set mcpu to mips64r5 on ls3a4000 43927ec681f3Smrg- pass egl-symbols-check test on mips64el 43937ec681f3Smrg- gallivm: fix pass init order on mips64 with llvm 8 43947ec681f3Smrg- llvmpipe: correct the debug information printed with GALLIVM_PERF=nopt 43957ec681f3Smrg 43967ec681f3Smrgxantares (1): 43977ec681f3Smrg 43987ec681f3Smrg- lavapipe: Fix 32bits windows build 4399