Mesa 20.3.0 Release Notes / 2020-12-03
======================================

Mesa 20.3.0 is a new development release. People who are concerned
with stability and reliability should stick with a previous release or
wait for Mesa 20.3.1.

Mesa 20.3.0 implements the OpenGL 4.6 API, but the version reported by
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
Some drivers don't support all the features required in OpenGL 4.6. OpenGL
4.6 is **only** available if requested at context creation.
Compatibility contexts may report a lower version depending on each driver.

Mesa 20.3.0 implements the Vulkan 1.2 API, but the version reported by
the apiVersion property of the VkPhysicalDeviceProperties struct
depends on the particular driver being used.
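The apiVersion value mentioned above is a single packed 32-bit integer, so it has to be decoded before it can be compared against "1.2". The following is a minimal sketch of that decoding, assuming the version-number packing described in the Vulkan specification (major in bits 22-28, minor in bits 12-21, patch in bits 0-11) and a variant field of zero. The helper names ``make_api_version`` and ``decode_api_version`` are hypothetical and are not part of Mesa or of any Vulkan binding.

```python
# Sketch only: decodes a packed Vulkan version number such as the
# apiVersion field of VkPhysicalDeviceProperties.
# Assumption: the top three "variant" bits are zero.

def make_api_version(major, minor, patch):
    """Pack (major, minor, patch) the way the Vulkan spec lays it out."""
    return (major << 22) | (minor << 12) | patch


def decode_api_version(api_version):
    """Split a packed version into a (major, minor, patch) tuple."""
    major = (api_version >> 22) & 0x7F   # 7 bits
    minor = (api_version >> 12) & 0x3FF  # 10 bits
    patch = api_version & 0xFFF          # 12 bits
    return (major, minor, patch)


if __name__ == "__main__":
    # Illustrative value only; a real value would come from the driver.
    reported = make_api_version(1, 2, 0)
    major, minor, patch = decode_api_version(reported)
    print(f"driver reports Vulkan {major}.{minor}.{patch}")
```

An application would compare the decoded major and minor numbers against the version it needs, rather than assuming that every driver in this release reports 1.2.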

SHA256 checksum
---------------

::

   2999738e888731531cd62b27519fa37566cc0ea2cd7d4d97f46abaa3e949c630  mesa-20.3.0.tar.xz


New features
------------

- GL 4.5 on llvmpipe

- GL_INTEL_blackhole_render on radeonsi

- GL_NV_copy_depth_to_color for NIR

- GL_NV_half_float

- GL_NV_shader_atomic_int64 on radeonsi

- EGL_KHR_swap_buffers_with_damage on X11 (DRI3)

- VK_PRESENT_MODE_FIFO_RELAXED on X11

- GLX_EXT_swap_control for DRI2 and DRI3

- GLX_EXT_swap_control_tear for DRI3

- VK_KHR_copy_commands2 on RADV

- VK_KHR_shader_terminate_invocation on RADV

- NGG GS support in ACO

- VK_KHR_shader_terminate_invocation on ANV

- driconf: add glx_extension_override

- driconf: add indirect_gl_extension_override

- VK_AMD_mixed_attachment_samples on RADV (GFX6-GFX7).

- GL_MESA_pack_invert on r100 and vieux

- GL_ANGLE_pack_reverse_row_order

- VK_EXT_shader_image_atomic_int64 on RADV

- None


Bug fixes
---------

- \[icl,tgl][iris][i965][regression][bisected\] piglit failures
- shader-db valgrind error
- \[AMDGPU NAVI 5700xt\] Large parts of the Blender viewport does not render correctly if an object with hair is moved.
- \[aco\] problem compiling compute pipeline
- zink: regression after !7606
- glcpp test 084-unbalanced-parentheses fails with bison 3.6.y
- zink+radv: corruption on pre-game menu in quake3
- panfrost massive glitches apitrace opengl 2.1
- \[radeonsi\] After 549ae5f84375dfadb86cfd465f0103acfae3249f commit Firefox Nightly Asan begins crashes
- Amber test NIR validation failed after spirv_to_nir
- zink: add detection for wsi_memory_allocate_info usage
- Follow-up from "nir,spirv: Add generic pointers support"
- v3d GL_ARB_vertex_array_bgra support
- iris: glClear with FBO imported from DMA-BUF doesn't work
- Fast-clears of GL_ALPHA16 textures are broken on TGL
- NV50_PROG_USE_NIR=1 doesn't work for piglit/bin/pbo-teximage ?
- Follow-up from "st/mesa: Use nir-to-tgsi for builtins if the driver needs TGSI"
- \[spirv-fuzz\] Shader causes an assertion failure in nir_opt_large_constants
- Amber test validate_phi_src
- Regnum Online UBO break after game update
- Current mesa git fails to build in multilib environment?
- radv/aco: Vertex explosion on RPCS3
- llvmpipe-cl should not run for other drivers
- Factorio v1.0 - Linux native - 64 bit - OpenGL/radeonsi - completely broken rendering
- Gnome 3.38 with Xwayland has screen corruption for X11 apps.
- st/va fails to build on old libva in mesa git
- sp_state_shader.c:146: undefined reference to \`nir_to_tgsi'
- anv: dEQP-VK.api.copy_and_blit.core.blit_image.all_formats.color.3d\* failures
- anv: dEQP-VK.api.copy_and_blit.core.blit_image.all_formats.color.3d\* failures
- RADV: Death Stranding glitchy sky rendering
- Crash in glDrawArrays on Intel iris
- GLX_OML_swap_method not fully supported
- deinterlace_vaapi=rate=field does not double output's actual frame rate on AMD
- Steam game Haydee leans on implementation-dependent behavior
- ANV: Support 1 million update-after-bind descriptors
- zink: crash in Blender on start-up
- vc4 in 20.2-rc has regression causing app to crash
- \[RADV\] broken stencil behaviour when using extended dynamic stencil state
- \[RADV/ACO\] Star Citizen Lighting/Shadow Issue
- \[RADV\] Some bindings seem broken with VK_DYNAMIC_STATE_VERTEX_INPUT_BINDING_STRIDE_EXT
- \[RADV/ACO\] ACO build error about SMEM operands
- Graphics corruption in Super Mega Baseball 2 with RADV on Navi
- RADV ACO - ground line corruption in Path of Exile with Vulkan renderer
- omx/tizonia build broken with latest mesa git
- Request: VK_EXT_transform_feedback on Intel Gen 7
- iris: Regression in deqp const_write tests
- \[hsw][bisected][regression\] gpu hangs on dEQP-VK.subgroups.(shuffle|quad) tests
- \[RADV/LLVM/ACO\] Serious Sam 4 crashes after first cutscene with ACO backend + flickering black spots sprout up everywhere
- TGL B0 Stepping gpu hangs on many dEQP-VK.subgroups.quad nonconst tests
- \[machines without AVX2/F16C][bisected\] X server crash, wflinfo crash in mesa CI
- nir: Mesa regression on Compute shader
- radv, aco: dEQP-VK.glsl.atomic_operations.*_fragment_reference regressed
- Commit c6c1fa9a263880 causes corruption in Steam UI
- \[spirv-fuzz\] Shader generates a wrong image
- Running Amber test leads to VK_DEVICE_LOST
- \[Regression][Bisected][20.2][radeonsi\] American Truck Simulator continually allocates memory until OOM
- \[radeonsi\] bottom mips of height=1 2D texture is uninitialised after upload
- Missing terrain in Total War: Warhammer
- anv: dEQP-VK.robustness.robustness2.\* failures on gen12
- AMD VAAPI encoding - applying filters introduces garbled line at the bottom
- AMD VAAPI HEVC encoding not working correctly on Polaris
- \[RADV\] Problems reading primitive ID in fragment shader after tessellation
- Massive memory leak (at least AMD, others unknown)
- Substance Painter 6.1.3 black glitches on Radeon RX570
- \[ivb,hsw,byt,bsw][i965][bisected\] anv_reloc_list_add: Assertion failure
- vkCmdCopyImage broadcasts subsample 0 of MSAA src into all subsamples of dst on RADV
- assert(left \<= -1 && top \<= -1 && right \>= 1 && bottom \>= 1) fails in si_emit_guardband
- Crash in ruvd_end_frame when calling vaBeginPicture/vaEndPicture without rendering anything
- Release signing key is not readily available
- \[iris][bisected\] piglit.spec.nv_copy_depth_to_color.nv_copy_depth_to_color failures
- VAAPI vaDeriveImage returns VA_STATUS_ERROR_OPERATION_FAILED
- X-Plane 11 Installer crashes on startup since \`glsl: declare gl_Layer/gl_ViewportIndex/gl_ViewportMask as vs builtins\`
- piglit spec/glsl-1.50/execution/geometry/clip-distance-vs-gs-out.shader_test fails on Iris
- Amber test opt_peel_loop_initial_if: Assertion failed
- builder_misc.cpp:137:55: error: ‘get’ is not a member of ‘llvm::ElementCount’
- AVX instructions leak outside of CPU feature check and cause SIGILL
- Dirt Rally: Flickering glitches on certain foliage since Mesa 20.1.0 caused by MSAA
- Horizon Zero Dawn graphics corruption with with radv
- Crusader Kings 3 Crashes at start since commit with !6472
- pan_resource.c:733:38: error: use of GNU empty initializer extension \[-Werror,-Wgnu-empty-initializer\]
- \[BRW\] WRC 5 asserts with gallium nine and iris.
- ci/bare-metal: POWER_GOOD detection broken with reboot rework
- radv: Corruption in "The Surge 2"
- \[RADV\] Detroit: Become Human Demo game lock-ups with RADV
- Road Redemption certain graphic effects rendered white color
- gen_state_llvm.h:54:99: error: invalid conversion from ‘int’ to ‘const llvm::VectorType*’ \[-fpermissive\]
- Using a shared dEQP build script
- vulkan/wsi/x11: deadlock with Xwayland when compositor holds multiple buffers
- \[RADV/ACO\] Death Stranding cause a GPU hung (\*ERROR\* Waiting for fences timed out!)
- lp_bld_init.c:172:7: error: implicit declaration of function ‘LLVMAddConstantPropagationPass’; did you mean ‘LLVMAddCorrelatedValuePropagationPass’? \[-Werror=implicit-function-declaration\]
- ci: Use lld or gold instead of ld.bfd
- Intel Vulkan driver crash with alpha-to-coverage
- radv: blitting 3D images with linear filter
- \[ACO\] Compiling pipelines from RPCS3's shader interpreter spins forever in ACO code
- \[regression][bisected\] nir: nir_intrinsic_io_semantics assert failures in piglit
- error: ‘static_assert’ was not declared in this scope
- Intel Vulkan driver assertion with small xfb buffer
- \<<MESA crashed>\> Array Index Out of Range with Graphicsfuzz application
- EGL_KHR_swap_buffers_with_damage support on X11
- \[spirv-fuzz\] SPIR-V parsing failed "src->type->type == dest->type->type"
- radeonsi: radeonsi crashes in Chrome on chromeos
- \[RADV\] commit d19bc94e4eb94 broke gamescope with Navi
- 4e3a7dcf6ee4946c46ae8b35e7883a49859ef6fb breaks Gamescope showing windows properly.
- anv: crashes in CTS test dEQP-VK.subgroups.*.framebuffer.*_tess_eval
- Intel Vuikan (anv) crash in copy_non_dynamic_state() when using validation layer
- \[tgl][bisected][regression\] GPU hang in The Witcher 3
- Mafia 3: Trees get rendered incorrectly
- radv: dEQP-VK.synchronization.op.multi_queue.timeline_semaphore.write_clear_attachments_*_concurrent fail when forcing DCC.
- Crash on GTA 5 through proton 5.0.9 and GE versions
- Flickering textures in "Divinity Original Sin Enhanced Edition"
- Mesa 20.2.0-rc1 fails to build for AMD
- Assertion failure compiling shader from Zigguart


Changes
-------

Aaron Watry (1):

- clover: Fix incorrect error check in clGetSupportedImageFormats

Adam Jackson (22):

- drisw: Port the MIT-SHM check to XCB
- vulkan: Don't pointlessly depend on libxcb-dri2
- docs: Stop claiming to implement OpenVG
- mesa: Fix GL_CLAMP handling in glSamplerParameter
- mesa: Generate more errors from GetSamplerParameter
- wsi/x11: Hook up VK_PRESENT_MODE_FIFO_RELAXED_KHR
- glx: Use GLX_FUNCTION2 only for actually aliased function names
- glx: Collect all the non-applegl extensions in the GetProcAddress table
- glx: Reject glXSwapIntervalMESA greater than INT_MAX
- glx: Implement GLX_EXT_swap_control for DRI2 and DRI3
- glx/dri3: Implement GLX_EXT_swap_control_tear
- glx: Allow depth-30 pbuffers to work without a depth-30 pixmap format
- wsi/x11: Create a present queue for VK_PRESENT_MODE_FIFO_RELAXED_KHR
- glx: move \__glXGetUST into the DRI1 code
- glx: Delegate the core of glXGetScreenDriver to the GLX screen vtable
- glx: Move glXGet{ScreenDriver,DriverConfig} to common code
- docs/features: Update extensions for softpipe
- docs/features: Update extensions for swr
- loader: Print dlerror() output in the failure message
- mesa: Enable GL_MESA_pack_invert unconditionally
- mesa: Implement GL_ANGLE_pack_reverse_row_order
- docs: Add MESA_pack_invert and ANGLE_pack_reverse_row_order

Alejandro Piñeiro (147):

- v3d/compiler: add v3dv_prog_data_size helper
- v3d/packet: fix typo on Set InstanceID/PrimitiveID packet
- v3d: set instance id to 0 at start of tile
- broadcom/qpu_instr: wait is not a read or write vpm instruction
- nir/lower_io: don't reduce range if parent length is zero
- broadcom/simulator: update to a newer simulator
- broadcom/common: increase V3D_MAX_TEXTURE_SAMPLERS, add specific OpenGL limit
- broadcom/compiler: add V3D_DEBUG_RA option
- v3dv: add v3d vulkan driver skeleton
- gitlab-ci: add broadcom vulkan driver
- v3dv: add support for VK_EXT_debug_report
- v3dv: memory management stubs
- v3dv: add support to use v3d simulator
- v3dv/debug: plug v3d_debug
- v3dv/debug: add v3dv_debug
- v3dv: stubs for graphics pipeline methods
- v3dv: Create/DestroyShaderModule implementation
- v3d/compiler: num_tex_used on v3d_key
- v3dv/format: add v3dv_get_format_swizzle
- v3dv: initial CreateGraphicsPipeline/DestroyPipeline implementation
- v3dv: initial stub for CmdBindPipeline
- v3dv: CmdSetViewport and CmdSetScissor implementation
- v3dv/pipeline: start to track dynamic state
- v3dv/cmd_buffer: init command buffer dynamic state during pipeline bind
- v3dv/cmd_buffer: emit Scissor packets
- v3dv/cmd_buffer: emit Viewport packets
- v3dv/cmd_buffer: emit shader_state packets
- v3dv/cmd_buffer: start to emit draw packets
- v3dv/cmd_buffer: add shader source bos to cmd_buffer
- v3dv: clif format dumping support
- v3dv/cmd_buffer: cache viewport translate/scale
- v3dv: add v3dv_write_uniforms
- v3dv/cmd_buffer: start jobs with CmdBeginRenderPass
- v3d/compiler: update uses_vid/uses_iid check
- v3dv/cmd_buffer: emit CFG_BITS
- v3dv: partial prepack of the gl_shader_state_record
- v3dv: prepack VCM_CACHE_SIZE
- v3dv/pipeline: lower fs/vs inputs/outputs
- v3dv: vertex input support
- v3dv: provide default values for input attributes
- v3dv/format: add R32G32B32A32_SFLOAT format
- v3dv: stubs for Create/DestroyPipelineCache
- v3d/cmd_buffer: emit flat_shade/noperspective/centroid flags
- v3dv/pipeline: adding some nir-based linking
- v3dv/bo: add a bo name
- v3dv: debug nir shader also after spirv_to_nir
- v3dv: initial descriptor set support
- v3dv/descriptor_set: support for array of ubo/ssbo
- v3dv/pipeline: null check for pCreateInfo->pDepthStencilState
- v3dv: no need to manually add assembly bo to the job
- v3d/compiler: handle GL/Vulkan differences in uniform handling
- v3dv/cmd_buffer: support for push constants
- v3dv/descriptor: support for dynamic ubo/ssbo
- v3dv/pipeline: revamp nir lowering/optimizations passes
- v3dv/pipeline: clean up io lowering
- v3dv/descriptor: take into account pPushConstantRanges
- v3dv/device: tweak ssbo/ubo device limits
- v3dv/cmd_bufffer: rename and split emit_graphics_pipeline
- v3dv/cmd_buffer: push constants not using descriptor anymore
- v3dv/uniforms: cleaning up, moving udpate ubo/ssbo uniforms to a function
- v3dv/pipeline: unify local allocator name
- v3dv/pipeline: sampler lowering
- v3dv/descriptor_set: added support for samplers
- v3dv/uniforms: filling up QUNIFORM_TMU_CONFIG_P0/P1
- v3dv/pipeline: add support for shader variants
- v3dv/cmd_buffer: update shader variants at CmdBindDescriptorSets/CmdBindPipeline
- v3dv/cmd_buffer: allow return in the middle of variant update if needed
- v3dv/pipeline: fix adding texture/samplers array elements to texture/sampler map
- v3dv/descriptor_set: support for immutable samplers
- v3dv/descriptor: move descriptor_map_get_sampler, add and use get_image_view
- v3dv/descriptor_set: combine texture and sampler indices
- v3dv/descriptor: handle not having a sampler when combining texture and sampler id
- v3dv/uniforms: fill up texture size-related uniforms
- v3dv/format: expose correctly if a texture format is filterable
- v3dv: handle texture/sampler shader state bo failure with OOM error
- v3dv: properly return OOM error during pipeline creation
- v3dv/meta-copy: ensure valid height/width with compressed formats
- v3dv/cmd_buffer: move variant checking to CmdDraw
- v3dv/pipeline: support for specialization constants
- v3dv/descriptor: add general bo on descriptor pool
- v3dv/descriptor: use descriptor pool bo for image/samplers
- v3dv/meta-copy: add uintptr_t casting to avoid warning
- v3dv/bo: adding a BO cache
- v3dv/bo: add a maximum size for the bo_cache and a envvar to configure it
- v3dv/bo: add dump stats info
- v3d/tex: avoid to ask back for a sampler state if not needed
- v3dv/pipeline: iterate used textures using the combined index map
- v3dv/pipeline: set load_layer_id to zero
- v3dv: initial support for input attachments
- v3dv/descriptors: support for DESCRIPTOR_TYPE_STORAGE_IMAGE
- v3dv/pipeline: lower_image_deref
- v3dv/uniforms: support for some QUNIFORM_IMAGE_XXX
- nir: include texture query lod as one of the ops that requires a sampler
- v3dv/device: expose support for image cube array
- v3dv/image: fix TEXTURE_SHADER_STATE depth for cube arrays
- v3dv/device: add vendorID/deviceID get helpers
- v3dv/device: get proper device ID under simulator
- v3dv/device: proper pipeline cache uuid
- v3dv/pipeline_cache: bare basic support for pipeline cache
- v3dv/pipeline_cache: cache nir shaders
- v3dv/pipeline: add basic ref counting support for variants
- v3dv/pipeline_cache: cache v3dv_shader_variants
- v3dv/pipeline_cache: support to serialize/deserialize cached NIRs
- v3dv/pipeline_cache: MergePipelineCaches implementation
- v3dv/pipeline: provide a shader_sha1 to private ShaderModules
- v3dv/pipeline_cache: add default pipeline cache
- v3dv/pipeline: remove custom variant cache
- v3dv/pipeline: when looking for a variant, check first current variant
- v3dv/pipeline: pre-generate more that one shader variant
- v3dv/pipeline: handle properly OUT_OF_HOST_MEMORY error when allocating p_stage
- v3dv/descriptor: support for UNIFORM/STORAGE_TEXEL_BUFFER
- v3dv: add v3dv_limits file
- v3dv/device: fix minTexelBufferOffsetAlingment
- v3dv/formats: fix exposing FEATURE_UNIFORM/STORAGE_TEXEL_BUFFER_BIT
- v3dv/uniforms: handle texture size for texel buffers
- v3dv/descriptor: remove v3dv_descriptor_map_get_image_view
- v3dv/device: add assert for texture-related limits
- v3dv/device: warn when the pipeline cache is disabled
- v3dv/debug: add v3dv_print_v3d_key
- v3dv/pipeline: fix combined_index_map insertions
- v3dv/meta: fix hash table insertion
- broadcom/compiler: allow GLSL_SAMPLER_DIM_BUF on txs emission
- v3d/simulator: add v3d_simulator_get_mem_size
- v3dv/device: fix compute_heap_size for the simulator
- v3dv/pipeline: use derefs for ubo/ssbo
- v3dv: Call nir_lower_io for push constants
- v3dv/pipeline: track if texture is shadow
- v3dv/pipeline: set 16bit return_size for shadows always
- v3dv/cmd_buffer: set instance id to 0 at start of tile
- v3d/limits: add line width and point size limits
- v3dv/device: fix point-related VkPhysicalDeviceLimits
- v3dv/device: enable largePoints
- v3dv/meta_copy: handle mirroring z component bliting 3D images
- v3dv/formats: properly return unsupported for 1D compressed textures
- v3dv/meta_copy: fix TFU blitting when using 3D images
- v3dv/pipeline_cache: set a max size for the pipeline cache
- v3dv/pipeline_cache: extend pipeline cache envvar
- v3dv/device: Support loader interface version 3.
- nir/lower_io_to_scalar: update io semantics on per-component inst
- docs/features: add v3dv driver
- v3dv/format: use XYZ1 swizzle for three-component formats
- v3d/format: use XYZ1 swizzle for three-component formats
- broadcom/compiler: remove v3d_fs_key depth_enabled field.
- v3dv/util: remove several logging functions
- v3dv/util: log debug ignored stype only on debug builds
- v3dv/device: do nothing when asked physical device pci bus properties
- v3dv/cmd_buffer: missing (uint8_t \*) casting when calling memcmp

Alexandros Frantzis (5):

- tracie: Make tests independent of environment
- tracie: Produce JUnit XML results
- gitlab-ci: Enable unit test reports for normal runner traces jobs
- gitlab-ci: Enable unit test reports for lava traces jobs
- gitlab-ci: Enable unit test report for arm64_a630_traces

Alyssa Rosenzweig (388):

- panfrost: Remove blend prettyprinters
- panfrost: Move format stringify to decode.c
- pan/decode: Remove shader replacement artefact
- panfrost: Inline panfrost-misc.h into panfrost-job.h
- panfrost: Remove panfrost-misc.h
- panfrost: Don't export exception_status
- panfrost: Rename encoder/ to lib/
- panfrost: Move pandecode into lib/
- pan/mdg: Separate disassembler and compiler targets
- pan/bi: Separate disasm/compiler targets
- panfrost: Reduce bit dependency to disassembly only
- panfrost: Add panloader/ to .gitignore
- pan/bi: Drop use of MALI_POSITIVE
- panfrost: Inline max rt into compilers
- panfrost: Treat texture dimension as first-class
- panfrost: Drop compiler cmdstream deps
- nir/lower_ssbo: Don't set align\_\* for atomics
- gallium/dri2: Support Arm modifiers
- panfrost: Set \`initialized\` more conservatively
- panfrost: Remove hint-based AFBC heuristic
- panfrost: Introduce create_with_modifier helper
- panfrost: Use modifier instead of layout throughout
- panfrost: Account for modifiers when creating BO
- panfrost: Respect modifiers in resource management
- panfrost: Import staging routines from freedreno
- panfrost: Choose AFBC when available
- panfrost: Implement YTR availability check
- panfrost: Enable YTR where allowed
- panfrost: Allocate enough space for tiled formats
- panfrost: Ensure AFBC slices are aligned
- panfrost: Implement panfrost_query_dmabuf_modifiers
- panfrost: Add stub midgard.xml
- panfrost: Adopt gen_pack_header.py via v3d
- panfrost: Build midgard_pack.h via meson
- panfrost: Redirect cmdstream includes through GenXML
- pan/decode: Add helper to dump GPU structures
- panfrost: XMLify job_type
- panfrost: XMLify draw_mode
- panfrost: XMLify mali_func
- panfrost: XMLify stencil op
- panfrost: XMLify wrap modes
- panfrost: XMLify viewport
- panfrost: XMLify UBOs
- panfrost: XMLify stencil test
- panfrost: Simplify zsa == NULL case
- panfrost: Simplify depth/stencil/alpha
- panfrost: Don't mask coverage mask to 4-bits
- panfrost: XMLify Midgard samplers
- panfrost: XMLify Bifrost samplers
- panfrost: XMLify Midgard textures
- panfrost: XMLify Bifrost textures
- panfrost: Drop unused mali_channel_swizzle
- panfrost: XMLify Block Format
- panfrost: XMLify MSAA writeout mode
- panfrost: XMLify exception access
- panfrost: XMLify enum mali_format
- panfrost: Set STRIDE_4BYTE_ALIGNED_ONLY
- panfrost: Drop NXR format
- panfrost: Squash 22-bit format field in attr_meta
- panfrost: XMLify mali_channel
- panfrost: XMLify attributes
- panfrost: Merge attribute packing routines
- panfrost: Add XML for attribute buffers
- panfrost: Use better packs for blits
- panfrost: Simplify offset fixup proof
- panfrost: Make attribute-buffer map explicit
- panfrost: Move attr_meta emission to the draw routine
- panfrost: Use packs for attributes
- panfrost: Hoist instance_shift/instance_odd fetch
- panfrost: Inline panfrost_vertex_instanced
- panfrost: Use packs for vertex attribute buffers
- panfrost: Use packs for vertex built-ins
- panfrost: Reword comment
- panfrost: Pass varying descriptors by reference
- panfrost: Factor out general varying case
- panfrost: Use pack for XFB varying
- panfrost: Use pack for general varying
- panfrost: Use MALI_ATTRIBUTE_LENGTH
- pan/bit: Use packs for Bifrost unit tests
- panfrost: Remove mali_attr_meta
- panfrost: Use packs for varying buffers
- panfrost: Drop hand-rolled pandecode for attribute buffers
- panfrost: Drop union mali_attr
- panfrost: Update CI expectations
- panfrost: Decontextualize rasterizer
- panfrost: Drop rasterizer null checks in draw calls
- panfrost: Drop ZSA null checks in draws
- panfrost: Drop panfrost_invalidate_frame
- panfrost: Drop QUADS primitive convert
- panfrost: Hoist add_fbo_bo call
- panfrost: Remove useless comment
- panfrost: Hoist assert from bind to create
- panfrost: Fix WRITES_GLOBAL bit
- panfrost: Fix shared memory size computation
- pan/mdg: Ensure barrier op is set on texture
- pan/mdg: Handle 32-bit offsets from store_shared
- pan/mdg: Identify barrier out-of-order field
- pan/mdg: Fix printing of r26 ld/st sources post-RA
- pan/mdg: Fix auxiliary load/store swizzle packing
- panfrost: Pre-allocate memory for pool
- panfrost: Introduce invisible pool
- panfrost: Avoid minimum stack allocations
- pan/decode: Don't try to dereference heap mapping
- panfrost: Share tiler_heap across batches/contexts
- panfrost: Drop implicit blend pooling
- panfrost: Explicitly handle nr_cbufs=0 case
- panfrost: Drop depth-only case in blend finalize
- panfrost: Keep finalized blend state constant
- panfrost: Fix blend leak for render targets 5-8
- panfrost: Free cloned NIR shader
- panfrost: Free NIR of blit shaders
- panfrost: Free hash_to_temp map
- pan/mdg: Free previous liveness
- panfrost: Use memctx for sysvals
- panfrost: Free batch->dependencies
- panfrost: Pass alignments explicitly
- panfrost: Fix attribute buffer underallocation
- panfrost: Don't overallocate attributes
- panfrost: Don't reserve for NPOT w/o instancing
- panfrost: Reduce attribute buffer allocations
- panfrost: Fix alignment on Bifrost
- gallium: Add util_blend_factor_uses_dest helper
- gallium: Add util_blend_uses_dest helper
- si: Use util_blend_factor_uses_dest
- r300: Use util_blend_factor_uses_dest
- pan/decode: Drop legacy 32-bit job support
- panfrost: Decode nested structs correctly
- panfrost: Hoist blend finalize calls
- panfrost: Separate shader/blend descriptor emits
- panfrost: XMLify blend flags
- panfrost: Simplify make_fixed_blend_mode prototype
- panfrost: Honour load_dest/opaque flags
- panfrost: XMLify blend equation
- panfrost: Combine frag_shader_meta_init functions
- panfrost: Size UBO#0 accurately
- panfrost: Clamp shader->uniform_count
- panfrost: Bake the initial tag into the shader pointer
- panfrost: Specialize compute vs frag shader init
- panfrost: Rename shader emit functions
- panfrost: Clean up blend shader errata handling
- panfrost: Group SFBD state together
- panfrost: XMLify Midgard properties
- panfrost: Pack compute Midgard properties
- panfrost: Use packs for fragment properties
- panfrost: Use pack for shaderless
- panfrost: Fold work_count packing for blend shaders
- panfrost: Simplify bind_blend_state
- panfrost: Remove midgard1 bitfield
- panfrost: XMLify bifrost1
- panfrost: Drop redundant NULL check
- panfrost: Group SFBD code tighter
- panfrost: XMLify Bifrost preload
- panfrost: Identify additional SFBD flags
- panfrost: Support SHADERLESS mode everywhere
- panfrost: Quiet pandecode error
- panfrost: Derive texture/sampler_count from shader
- panfrost: XMLify beginning of shader descriptor
- panfrost: Derive UBO count from shader_info
- panfrost: Pack vertex properties when compiling
- panfrost: Prepack fragment properties/preload
- panfrost: Simplify shaderless packing
- panfrost: Ensure shader-db state is zero-initialized
- panfrost: Allocate a state uploader
- panfrost: Upload shader descriptors at CSO create
- panfrost: Use preuploaded shader descriptors
- panfrost: XMLify the rest of shader_meta
- panfrost: Inherit default values from structs
- panfrost: Use pack for blit shaders
- panfrost: Use pack for Bifrost test state
- panfrost: Add optional opaque packs to GenXML
- panfrost: Use opaque pack for vertex shaders
- panfrost: Use pack for fragment shaders
- pan/decode: Use unpacks for state descriptor
- panfrost: Drop mali_shader_meta
- panfrost: Add opaque midgard_blend XML
- panfrost: Emit explicit REPLACE for disabled colour writeout
- panfrost: Drop blend indirection
- panfrost: Add padded type for instance fields
- panfrost: Add XML for mali_vertex_tiler_postfix
- panfrost: Use draw pack for blit
- panfrost: Separate postfix from emits
- panfrost: Inline vt_update_{rasterizer, occlusion}
- panfrost: Remove postfix parameter from UBO upload
- panfrost: Avoid postfix dep for vertex_data
- panfrost: Don't call panfrost_vt_init for compute
- panfrost: Inline panfrost_vt_init
- panfrost: Inline panfrost_vt_set_draw_info
- panfrost: Detangle postfix from varying emits
- panfrost: Use draw pack for compute jobs
- panfrost: Use pack for draw descriptor
- panfrost: Simplify ZSA bind
- panfrost: Cleanup point sprite linking
- panfrost: Drop point sprite from shader key
- panfrost: XMLify primitive information
- panfrost: Add invocation XML
- panfrost: XMLify invocations
- panfrost: Drop bifrost_payload_fused
- panfrost: Inline bifrost_tiler_only
- panfrost: Use nir_builder_init_simple_shader for blits
- pan/decode: Drop scratchpad size dump
- pan/decode: Drop mali_vertex_tiler_postfix arg
- pan/decode: Print shader-db even for compute
- pan/decode: Fix awkward syntax
- pan/decode: Use generation for vertex_tiler_postfix
- pan/decode: Use unpack for vertex_tiler_postfix_pre
- panfrost: Remove mali_vertex_tiler_postfix
- pan/decode: Drop prefix braces
- panfrost: Emit texture/sampler points for compute
- pan/mdg: Implement i/umul_high
- pan/mdg: Scalarize 64-bit
- pan/mdg: Bounds check swizzle writing globals
- pan/mdg: Implement nir_intrinsic_load_sample_mask_in
- pan/mdg: Refactor texture op/mode handling
- pan/mdg: Add disassembly for shadow gathers
- pan/mdg: Implement texture gathers
- panfrost: Set PIPE_CAP_MAX_TEXTURE_GATHER_COMPONENTS
- docs/features: Add missing Panfrost extensions
- pan/mdg: Fix discard encoding
- pan/mdg: Fix perspective combination
- panfrost: Drop PIPE_CAP_MAX_COMBINED_HW_ATOMIC_COUNTER/BUFFERS
- mesa/st: Don't set alpha if ALPHA_TEST is lowered
- pan/mdg: Obey f2fmp size restriction in fuse_io_16
- panfrost: Fix nonzero stencil mask on vertex/compute
- pan/bit: Set d3d=true for CMP tests
- pan/bit: Fix unit tests
- pan/bi: Lower flrp16
- pan/bi: Add XML describing the instruction set
- pan/bi: Add ISA parser
- 
pan/bi: Add packing generator 6097ec681f3Smrg- pan/bi: Add disassembler generator 6107ec681f3Smrg- pan/bi: Add disassembly prototypes 6117ec681f3Smrg- pan/bi: Add bi_disasm_dest\_\* helpers 6127ec681f3Smrg- pan/bi: Export dump_src 6137ec681f3Smrg- pan/bi: Use new disassembler 6147ec681f3Smrg- pan/bi: Use canonical syntax for registers/uniforms/imms 6157ec681f3Smrg- pan/bi: Use canonical syntax for special constants 6167ec681f3Smrg- pan/bi: Add dummy carry/borrow argument for iadd/isub 6177ec681f3Smrg- pan/bi: Introduce segments into the IR 6187ec681f3Smrg- pan/bi: Add format field to IR 6197ec681f3Smrg- pan/bi: Track compute_lod in IR 6207ec681f3Smrg- pan/bi: Pass blend descriptor explicitly in IR 6217ec681f3Smrg- pan/bi: Use 8-bit shifts 6227ec681f3Smrg- pan/bi: Use src1/dest_invert instead of src_invert[\] 6237ec681f3Smrg- pan/bi: Move packing helpers to dedicated file 6247ec681f3Smrg- pan/bi: Use new packing 6257ec681f3Smrg- pan/bi: Remove unused prints 6267ec681f3Smrg- pan/bi: Remove unused packing data structures 6277ec681f3Smrg- pan/bi: Drop \*FMIN reference 6287ec681f3Smrg- pan/bi: Annotate stop bit (canonically "Z-bit") 6297ec681f3Smrg- pan/bi: Annotate disassemble with format names 6307ec681f3Smrg- pan/bi: Inline dump_instr 6317ec681f3Smrg- pan/bi: Track M values of disassembled constants 6327ec681f3Smrg- pan/bi: Decode M values in disasm 6337ec681f3Smrg- pan/bi: Disassemble PC-relative addresses 6347ec681f3Smrg- pan/bi: Add bifrost_reg_mode enum 6357ec681f3Smrg- pan/bi: Pass 'first' through disassembler 6367ec681f3Smrg- pan/bi: Decode all 32-bit register modes 6377ec681f3Smrg- pan/bi: Rename port -\> slot 6387ec681f3Smrg- pan/bi: Use canonical register packing 6397ec681f3Smrg- pan/bi: Remove old register mode definitions 6407ec681f3Smrg- pan/bi: Fix assert when writing vertex outputs 6417ec681f3Smrg- pan/bi: Add copy for register COMBINEs 6427ec681f3Smrg- pan/decode: Ensure mappings are zeroed 6437ec681f3Smrg- pan/bi: Fix memory corruption in scheduler 
- pan/bi: Drop if 0'd combine lowering
- pan/bi: Cull unnecessary edges on the CF graph
- pan/bi: Use canonical floating-point modes
- pan/bi: Canonicalize terminate_discarded_threads
- pan/bi: Use canonical next_clause_prefetch
- pan/bi: Use canonical name for staging registers
- pan/bi: Expand clause type to 5-bit
- pan/bi: Add missing message types
- pan/bi: Print message types as strings
- pan/bi: Use canonical term "message type"
- pan/bi: Use canonical term dependency
- pan/bi: Use canonical flow control enum
- pan/bi: Pass flow_control through directly
- pan/bi: Handle vector moves
- pan/bi: Expose GL 2.1 on Bifrost
- pan/bi: Fix simple txl test
- pan/bi: Use canonical texture op names in IR
- pan/bi: Streamline TEXC/TEXS naming/selection
- pan/bi: Encode skip bit into IR
- pan/bi: Pack skip bit for texture operations
- pan/bi: Add texture operator descriptor
- pan/bi: Stub out TEXC handling
- pan/bi: Add data register passing infrastructure
- pan/bi: Handle nir_tex_src_lod
- pan/bi: Pack TEXC
- pan/bi: Rewrite to fit dest = src constraint
- pan/bi: Prefer 'texture_index' to 'image_index'
- panfrost: Add missing XML for Bifrost samplers
- panfrost: Fix Bifrost filter selection
- panfrost: Fix Bifrost high LOD clamp
- panfrost: Add some missing Bifrost texture XML
- pan/bi: Implement txb
- panfrost: Set helper_invocation_enable for Bifrost
- pan/bi: Fix message type printing
- pan/bi: Don't terminate helper threads
- panfrost: Add panfrost_block_dim helper
- pan/bi: Use new block dimension helper
- panfrost: Fix faults on block-based formats on Bifrost
- pan/bi: Map NIR tex ops to Bifrost ops
- pan/bi: Add bi_emit_lod_cube helper
- pan/bi: Implement FETCH
- panfrost: Update XML for Bifrost early-z/FPK
- panfrost: Set "shader modifies coverage?" flag
- panfrost: Temporarily disable FP16 on Bifrost
- pan/bi: Disable mediump output lowering
- pan/bi: Range check newc/oldc when rewriting
- panfrost: Rename gtransfer to transfer
- panfrost: Use canonical characterization of tls_size
- panfrost: Drop panfrost_vt_emit_shared_memory
- pan/mdg: Cleanup mir_rewrite_index_src_single
- pan/bi: Drop 64-bit constant support
- pan/bi: Fix handling of small constants in bi_lookup_constant
- pan/bi: Stub spilling
- pan/bi: Add no_spill flag to IR
- pan/bi: Implement bi_choose_spill_node
- pan/bi: Add spills/fills parameters
- pan/bi: Add bi_spill helper
- pan/bi: Add bi_fill
- pan/bi: Add bi_rewrite_index_src_single helper
- pan/bi: Add helpers for working with singletons
- pan/bi: Implement bi_spill_register
- pan/bi: Factor out singleton construction from scheduler
- pan/bi: Add bi_foreach_clause_in_block_safe helper
- pan/bi: Pack LOAD/STORE
- pan/bi: Implement spilling
- pan/bi: Pipe through tls_size
- panfrost: Move nir_undef_to_zero to common util/
- pan/bi: Use nir_undef_to_zero
- panfrost: Record architecture major version
- panfrost: Don't export queries
- panfrost: Calculate thread count on Bifrost
- panfrost: Fix component order XML
- panfrost: Implement BGRA textures
- panfrost: Drop PIPE_CAP_GLSL_FEATURE_LEVEL for Bifrost
- panfrost: Don't advertise MSAA on Bifrost
- pan/bi: Account for bool32 ld_ubo reads
- panfrost: Don't double-compose swizzles
- panfrost: Add MALI_EXTRACT_INDEX helper
- panfrost: Use consistent swizzle names in XML
- panfrost: Add a blendable format table
- panfrost: Use panfrost_blendable_formats for MFBD
- panfrost: Use panfrost_blendable_formats for SFBD
- panfrost: Use panfrost_blendable_formats for blending
- panfrost: Complete format_to_bifrost_blend
- panfrost: Remove duplicated format arg for ASTC
- panfrost: Remove panfrost_is_z24s8_variant
- panfrost: Add v7 special colour formats
- panfrost: Add missing depth/stencil formats
- panfrost: Add miscellaneous missing Midgard formats
- panfrost: Add v7-specific depth formats
- panfrost: Split out v6/v7 format tables
- panfrost: Rename VARYING_DISCARD to CONSTANT
- panfrost: Rename VARYING_POS to SNAP4
- panfrost: Add missing 1/2/4/64-bit formats to XML
- panfrost: Use macro for panfrost_get_default_swizzle
- panfrost: Fix RGB5A1 formats
- panfrost: Fix BGR233 component order
- panfrost: Add missing alpha-first special formats
- pan/bi: Suppress disassembly for internal shaders
- pan/bi: Lower +CUBEFACE2
- panfrost: Disable point sprites on Bifrost
- panfrost: Advertise Bifrost support
- panfrost: Drop unused swizzles
- panfrost: Add bi_emit_array_index helper
- pan/bi: Track tex data register swizzles
- pan/bi: Handle 3D/array coordinates
- pan/bi: Don't emit TEXS for array textures
- panfrost: Set .array_size on Bifrost
- nir: Add SRC_TYPE to store_combined_output_pan
- pan/mdg: Deduplicate nir_find_variable_with_driver_location
- pan/mdg: Move writeout lowering to common panfrost
- panfrost: Pass through src_type
- panfrost: Deduplicate shader properties
- pan/bi: Add +ZS_EMIT instruction to IR
- pan/bi: Infer z/stencil flags from sources passed
- pan/bi: Factor out bi_emit_atest
- pan/bi: Factor out bi_emit_blend
- pan/bi: Stub handling for nir_intrinsic_store_combined_output_pan
- pan/bi: Emit +ZS_EMIT as needed
- pan/bi: Lower depth/stencil stores
- pan/bi: Correctly calculate render target index
- pan/mdg: Add missing Collabora copyright notices
- panfrost: Add missing Collabora copyright notices
- pan/bi: Model writemasks correctly

Andreas Baierl (4):

- lima/ppir: Skip instruction merge when having more than one successor
- lima: fix glCopyTexSubImage2D
- lima: set clear depth value to 0x00ffffff as default
- lima/parser: Fix varyings decoding in RSW

Andres Gomez (3):

- gitlab-ci: reuse container_post_build when building the test images
- gitlab-ci: reorder container_post_build call for arm64_test image
- Revert "gitlab-ci: reuse container_post_build when building the test images"

Andrew Randrianasulu (1):

- st/va: fix build with old libva

Andrey Vostrikov (1):

- egl/x11: Free memory allocated for reply structures on error

Andrii Simiklit (4):

- util/xmlconfig: eliminate memory leak
- nir: get rid of OOB dereferences in nir_lower_io_arrays_to_elements
- glx: get rid of memory leak
- glsl: avoid an out-of-bound access while setting up a location for variable

Anthoine Bourgeois (4):

- docs/features: Minor update extensions support
- docs/features: VK_KHR_mir_surface is disabled, remove it
- docs/features: add some extensions we missed
- docs/features.txt: VK_EXT_separate_stencil_usage not exposed on RADV

Antonio Caggiano (1):

- zink: pre-hash gfx-pipeline-state

Anuj Phogat (2):

- intel/gen9: Enable MSC RAW Hazard Avoidance
- intel: Pointer to SCISSOR_RECT array should be 64B aligned

Aníbal Limón (1):

- src/util/disk_cache_os.c: Add missing headers for open/fcntl

Arcady Goldmints-Orlov (7):

- broadcom/compiler: support nir_intrinsic_load_sample_id
- broadcom/compiler: Add a constant folding pass after nir_lower_io
- broadcom/compiler: Enable PER_QUAD for UBO and SSBO loads.
- broadcom/compiler: support varyings with struct types
- broadcom/compiler: use nir io semantics
- broadcom/compiler: Handle non-SSA destinations for tex instructions
- broadcom/compiler: Allow spills of temporaries from TMU reads

Bas Nieuwenhuizen (58):

- radv: Add ETC2 support on RAVEN2.
- radv: Fix assert that is too strict.
- radv: Add forcecompress debug flag.
- radv: Do not consider layouts fast-clearable on compute queue.
- radv: Update CI expectations for the recent descriptor indexing regressions.
- radv: When importing an image, redo the layout based on the metadata.
- radv: Clean up setting the surface flags.
- radv: Use getter instead of setter to extract value.
- driconf: Support selection by Vulkan applicationName.
- radv: Override the uniform buffer offset alignment for World War Z.
- radv: Fix handling of attribs 16-31.
- radv: Remove conformance warnings with ACO.
- radv: Update CTS version.
- radv: Fix 3d blits.
- radv: Centralize enabling thread trace.
- radv: Allow triggering thread traces by file.
- radv: Fix threading issue with submission refcounts.
- radv: Avoid deadlock on bo_list.
- spirv: Deal with glslang bug not setting the decoration for stores.
- spirv: Deal with glslang not setting NonUniform on constructors.
- radeonsi: Work around Wasteland 2 bug.
- radv,gallium: Add driconf option to reduce advertised VRAM size.
- amd/common: Store non-displayable DCC pitch.
- radeonsi: Put retile map in separate buffers.
- radeonsi: Move display dcc dirty tracking to framebuffer emission.
- ac/surface: Fix depth import on GFX6-GFX8.
- radv,radeonsi: Disable compression on interop depth images
- Revert "radv: set BIG_PAGE to improve performance on GFX10.3"
- Revert "radv: emit {CB,DB}_RMI_L2_CACHE_CONTROL at framebuffer time"
- st/mesa: Deal with empty textures/buffers in semaphore wait/signal.
- radv: Disable NGG on APUs.
- radv: Simplify radv_is_hw_resolve_pipeline.
- radv: Add VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 rendering support.
- radv: Fix emitting SQTT userdata.
- radv: Use correct alignment for SQTT buffer sizes.
- radv: Fix RGP Asic CU info for GFX10+.
- radv: Include flushes in the barrier.
- radv: Record cache flushes for RGP.
- radv: Write correct dispatch size for RGP.
- radeonsi: Fix imports with displayable DCC.
- radv: Use atomics to read query results.
- radv: Set fce metadata correctly on DCC initialization.
- radv: Fix event write cmdbuffer allocation when tracing.
- radv/winsys: Expand scope of allbos lock.
- radv: Fix mipmap extent adjustment on GFX9+.
- aco: Add VK_KHR_shader_terminate_invocation support.
- amd/llvm: Add VK_KHR_shader_terminate_invocation support.
- radv: Advertise VK_KHR_shader_terminate_invocation.
- frontends/va: Initialize drm modifier on import.
- radv: Fix 1D compressed mipmaps on GFX9.
- radv: Do not access set layout during vkCmdBindDescriptorSets.
- radv: Fix variable name collision.
- radv: Skip tiny non-visible VRAM heap.
- radv: Fix budget calculations with large BAR.
- radv: Fix exporting/importing multisample images.
- radv: Fix RB+ blending for VK_FORMAT_E5B9G9R9_UFLOAT_PACK32.
- radv: Fix a hang on CB change by adding flushes.
- radv: Deal with unused attachments in mip flush

BillKristiansen (1):

- compiler/glsl: Initialize local variable to zero to fix MSVC RTC error

Boris Brezillon (141):

- spirv: Move the emit a 'return value' store logic into own function
- compiler/nir: Add new flags to lower pack/unpack split instructions
- nir: Fix i64tof32 lowering
- spirv: Add support for the CL Round instruction
- panfrost: Rename panfrost_create_pool() into panfrost_pool_init()
- panfrost: Avoid accessing pan_pool fields directly
- panfrost: Store transient BOs in a dynamic array
- spirv: Add a vtn_get_mem_operands() helper
- spirv: Don't accept CPacked decoration on struct members
- spirv: Propagate packed information to glsl_type
- glsl: Propagate packed info in get_explicit_type_for_size_align()
- nir/glsl: Consider block interfaces as structs when it comes to size/align calculation
- nir: Expose the packed attribute attached to glsl_type objects
- panfrost: gen_pack: Minor formatting improvement
- panfrost: gen_pack: Fix \__gen_unpack_uint()
- panfrost: gen_pack: Add pan_{unpack,print}() helpers
- panfrost: gen_pack: Move the group get_length() logic to its own method
- panfrost: gen_pack: Add the aggregate concept
- panfrost: gen_pack: Allow empty structs
- panfrost: gen_pack: Add an align() modifier
- panfrost: gen_pack: Add a log2 modifier
- panfrost: gen_pack: Allow enum/define values expressed in hexadecimal
- panfrost: decode: Make the indentation consistent with auto-generated print helpers
- panfrost: decode: Rework the DUMP_{CL,ADDR}() macros
- panfrost: decode: Add a macro to dump unpacked descriptors
- panfrost: decode: Use pan_{unpack,print}() when applicable
- panfrost: XML-ify the local storage descriptor
- panfrost: Clarify what TILED mode is
- panfrost: Add Tiled linear mode to the Block Format enum
- panfrost: XML-ify the midgard tiler descriptor
- panfrost: XML-ify the single target framebuffer descriptor
- panfrost: XML-ify the bifrost tiler descriptors
- panfrost: XML-ify the multi-target framebuffer descriptors
- panfrost: XML-ify the job header descriptor
- panfrost: XML-ify the write value job descriptor
- panfrost: XML-ify the fragment job descriptor
- panfrost: Rename the Blend dither disable flag
- panfrost: XML-ify the compute job descriptor
- panfrost: Avoid copying job descriptors around when we can
- panfrost: decode: Misc formatting improvements
- panfrost: gen_pack: Fix gnu-empty-initializer errors
- ci: Extend meson-clang coverage by compiling all gallium drivers
- panfrost: Fix bifrost tiler descriptor definition
- panfrost: Fix bifrost tiler job emission
- panfrost: Adjust quirks for bifrost v6
- panfrost: Add preliminary support for Mali G72
- kmsro: Add mediatek entry point
- panfrost: Add support for rbg16 formats
- panfrost: decode: Fix decode_bifrost_constant() prototype
- panfrost: decode: Flag pandecode_log_typed() as PRINTFLIKE
- panfrost: bifrost: disassemble: Fix decoding of next_regs
- panfrost: Fix a warning
- panfrost: Adjust the draw descriptor definition
- panfrost: Adjust the primitive desc definition
- panfrost: Adjust the renderer state definition
- panfrost: Get rid of the with_opaque qualifier on the renderer state desc
- panfrost: Drop the with_opaque specifier on midgard blend desc
- panfrost: gen_pack: Drop support for opaque structs
- panfrost: gen_pack: Support overlapping structs
- panfrost: gen_pack: Add a no-direct-packing attribute
- panfrost: Rework fixed-function blending
- panfrost: Rework the render target layout to use overlapping structs
- panfrost: XML-ify the blend descriptors
- panfrost: Fix fixed-function blend on Mali v6
- panfrost: Constify the rt_fmts arg passed to pan_lower_framebuffer()
- panfrost: Move the blend constant mask extraction out of make_fixed_blend_mode()
- panfrost: Pass compile arguments through a struct
- panfrost: Allocate blit_blend with ralloc()
- panfrost: Don't leak NIR blend shaders
- panfrost: Let compile_blend_shader() allocate the blend shader object
- panfrost: Get rid of the constant patching done on blend shader binaries
- panfrost: Move the blend shader cache at the context level
- panfrost: Fix fixed-function blend on bifrost
- panfrost: Extend compile_inputs to pass a blend descriptor
- pan/bi: Copy blend shader info from compile_inputs
- pan/bi: Use canonical name for FAU RAM sources
- pan/bi: Get rid of the regs argument in bi_assign_fau_idx()
- pan/bi: Rework blend descriptor access handling
- pan/bi: Add support for load_blend_const_color_{r,g,b,a}_float
- pan/bi: Support indirect jumps
- panfrost: Add a "Bifrost Internal Blend" descriptor
- panfrost: Scalarize nir_load_blend_const_color_rgba
- panfrost: Flag blend shader function as an entry point
- pan/bi: Add load_output support
- pan/bi: Collect return addresses of blend calls
- pan/bi: Special-case BLEND instruction emission for blend shaders
- pan/bi: Reserve r0-r3 in blend shaders
- pan/bi: Special-case load_input for blend shaders
- panfrost: Add missing tile-buffer formats to the format enum
- panfrost: Add blend shader support to bifrost
- panfrost: Adjust the renderer state definition
- panfrost: Fix tiler job injection
- panfrost: Add the bifrost tiler internal state field
- panfrost: Add specialized preload descriptors
- panfrost: Replace unkown renderer state fields by their real names
- pan/bi: Make sure we don't print special index as a register
- pan/bi: Print blend descriptor source properly
- pan/bi: Add support for load_sample_id
- pan/bi: Support the case where TEXC needs 0 or 1 staging reg
- pan/bi: Add basic support for txf_ms
- panfrost: Make {midgard,bifrost}_compile_shader_nir() return a program object
- panfrost: Build blit shaders on Bifrost too
- panfrost: Use real name for attribute's unknown field
- panfrost: Rename panfrost_transfer to panfrost_ptr
- panfrost: Pass the texture payload through a panfrost_ptr
- panfrost: Split panfrost_load_midg()
- panfrost: Add support for native wallpapering on Bifrost
- panfrost: Use native wallpapering on Bifrost
- panfrost: Get rid of the non-native wallpering bits
- panfrost: Preload primitive flags when gl_FrontFacing is accessed
- pan/bi: Add support for load_front_face
- pan/bi: Add support for load_point_coord
- pan/bi: Lower {i,u}{min,max} instructions
- pan/bi: Add ult support
- pan/bi: Fix ms_idx type to catch missing ms_index source
- panfrost: Leave push_constants pointer to NULL if there's no uniform
- panfrost: Suppress Bifrost prefetching
- panfrost: Add array size to XML
- panfrost: Implement v7 texture payloads
- pan/bi: s/t0/t1/ in bi_disasm_dest_add()
- pan/bi: Move special instruction packing to a separate helper
- pan/bi: Split special class in two
- pan/bi: Hook up cube instructions packing
- pan/bi: Lower cube map coordinates
- panfrost: Force late pixel kill when depth/stencil is written from the FS
- panfrost: Expose GLES3 features on Bifrost when PAN_MESA_DEBUG=deqp
- pan/bi: Extract LD_VAR sample field from ins->load_vary.interp_mode
- pan/bi: Support centroid and sample interpolations
- pan/bi: Fix swizzle handling in bi_copy_src()
- pan/bi: Add support for load_ubo
- pan/bi: Lower uniforms to UBO
- pan/bi: Get rid of bi_emit_ld_uniform()
- pan/bi: Move bitwise op packing out of bi_pack_fma()
- pan/bi: Fix ARSHIFT definitions
- pan/bi: Add support for ishr
- pan/bi: Add support for ushr
- panfrost: Allow linear ZS resources on Bifrost
- pan/bi: Add support for load_vertex_id
- pan/bi: Add support for load_instance_id
- panfrost: Fix Bifrost blend descriptor emission
- panfrost: Fix ->reads_frag_coord assignment

Boyuan Zhang (5):

- vl: add flag and definition for protected playback
- frontends/va: handle protected slice data buffer
- radeon: add decryption params definition header
- radeon/vcn: add defines for drm message buffer
- radeon/vcn: program drm message buffer

Brendan Dougherty (1):

- mesa: Fix vertex_format_to_pipe_format index.

Caio Marcelo de Oliveira Filho (19):

- intel/compiler: Use C99 array initializers for prog_data/key sizes
- nir: Add nir_intrinsic_terminate and nir_intrinsic_terminate_if
- spirv: Update headers and metadata from latest Khronos commit
- spirv: Handle SpvOpTerminateInvocation
- intel/fs: Handle nir_intrinsic_terminate
- vulkan: Update XML and headers to 1.2.158
- anv: Advertise VK_KHR_shader_terminate_invocation
- nir: Use a switch in nir_lower_explicit_io_instr
- intel/fs: Don't emit_uniformize when getting a constant SSBO index
- spirv: Implement SpvCapabilitySubgroupShuffleINTEL from SPV_INTEL_subgroups
- nir: Add nir_intrinsic_{load,store}_deref_block_intel
- spirv: Implement SpvCapabilitySubgroupBufferBlockIOINTEL
- intel/fs: Add A64 OWORD BLOCK opcodes
- intel/fs: Implement nir_intrinsic_{load,store}_global_block_intel
- intel/fs: Add surface OWORD BLOCK opcodes
- intel/fs: Implement nir_intrinsic_{load,store}_ssbo_block_intel
- intel/fs: Implement nir_intrinsic_{load,store}_shared_block_intel
- compiler: Add new Vulkan shader stages
- spirv: Add Ray Tracing execution models

Caleb Callaway (1):

- iris: Add missing newline to debug log message

Chad Versace (2):

- anv/image: Check DISJOINT in vkGetPhysicalDeviceImageFormatProperties2 (v2)
- anv/image: Fix isl_surf_usage_flags for stencil images

Charmaine Lee (3):

- st/mesa: increase size of gl_register_file bitfields
- winsys/svga: fix display corruption after surface_init
- svga: fix draw elements with 8-bits indices

Chia-I Wu (2):

- virgl: move protocol headers to a common place
- virgl: update protocol headers

Christian Gmeiner (17):

- etnaviv: call nir_lower_bool_to_bitsize
- etnaviv: completely turn off MSAA
- ci: do not build libdrm for vc4, reedreno and etnaviv
- etnaviv: call nir_opt_shrink_vectors(..) in opt loop
- etnaviv: shuffle some variant fields
- etnaviv: add disk cache
- etnaviv: simplify linear stride implementation
- ci: piglit: conditionally build OpenCL tests
- ci/bare-metal: suppress 'No such file or directory'
- etnaviv: drop etna_pipe_wait(..)
- ci/x86: speed up piglit testing
- nir: make tgsi_varying_semantic_to_slot(..) public
- etnaviv: convert from tgsi semantic/index to varying-slot
- etnaviv: move etna_dump_shader(..) to generic location
- etnaviv: move etna_destroy_shader(..) to generic location
- etnaviv: nir: do not run opt loop after nir_lower_bool_xxx(..)
- etnaviv/drm: fix evil-twin etna_drm_table_lock

Connor Abbott (59):

- freedreno/afuc: Fix printing preemptleave on a5xx
- freedreno/afuc: Handle setsecure opcode
- freedreno/afuc: Add iret
- freedreno/afuc: Handle xmov modifiers
- freedreno/afuc: Make 0 a valid number
- freedreno/afuc: Install asm/disasm
- freedreno: Add afuc regression test
- nir/spirv: Add the option to keep ViewIndex as an input
- nir/lower_input_attachments: Refactor to use an options struct
- nir/lower_input_attachments: Support loading layer id as an input
- radv: Use an input for the layer when lowering input attachments
- tu: Use an input for the layer when lowering input attachments
- nir/lower_input_attachments: Support loading layer id via gl_ViewIndex
- freedreno/a6xx: Add multiview registers
- ir3: Add support for gl_ViewIndex in VS & FS
- tu: Translate VkRenderPassMultiviewCreateInfo to VkRenderPassCreateInfo2
- tu: Parse multiview render pass info
- tu: Implement multiview clear/resolve interactions
- tu: Improve timestamp queries
- tu: Implement multiview query interactions
- tu: Add multiview lowering pass
- tu: Implement multiview pipeline state
- tu: Enable VK_KHR_multiview
- freedreno/computerator: Use a render node
- tu: Expose shaderStorageImageExtendedFormats
- tu: Expose shaderImageGatherExtended
- ir3: Don't use the format to get the image type
- tu: Expose shaderStorageImage*WithoutFormat
- nir: Add nir_lower_multiview pass
- anv: Use nir_lower_multiview pass
- nir: Count i/o slots correctly for per-view variables
- nir/lower_io_arrays: Fix xfb_offset bug
- nir: Add per_view to IO semantics
- nir: Handle per-view io in nir_io_add_const_offset_to_base()
- tu: Write multiview control registers in binning pass
- tu: Refactor shader compilation flow
- ir3, tu: Run optimization loop twice
- ir3, tu: Link per-view position correctly
- tu: Enable multi-position output
- intel/nir: Use nir control flow helpers
- radv: Use nir control flow insertion helpers
- ttn: Use nir control flow insertion helpers
- nir/lower_returns: Use nir control flow insertion helpers
- nir/opt_if: Remove open-coded nir_ssa_def_rewrite_uses()
- nir/opt_if: Use early returns in opt_if_merge()
- ttn: Fix number of components for IF/UIF
- nir/lower_clip_cull: Store array size for FS inputs
- ir3: Switch tess lowering to use location
- ir3: Handle clip+cull distances
- tu: Implement clip/cull distances
- freedreno/a6xx: Implement user clip/cull distances
- freedreno: Introduce common device info struct
- tu: Use freedreno_dev_info
- freedreno: Use freedreno_dev_info
- freedreno/a6xx: Update SO registers for streams
- ir3: Support geometry streams
- util/bitset: Add a range iterator helper
- tu: Support geometryStreams
- tu: Support rasterizerDiscardEnable and RasterizationStreamSelect

Daniel Abrecht (1):

11747ec681f3Smrg- etnaviv: Make sure to track different pipe_screens for different DRM device descriptions 11757ec681f3Smrg 11767ec681f3SmrgDaniel Schürmann (26): 11777ec681f3Smrg 11787ec681f3Smrg- aco: execute branch instructions in WQM if necessary 11797ec681f3Smrg- nir,amd: remove trinary_minmax opcodes 11807ec681f3Smrg- aco/isel: refactor code and remove unnecessary v_mov 11817ec681f3Smrg- aco/isel: refactor emit_vop3a_instruction() to handle 2 operand instructions 11827ec681f3Smrg- ac/nir: implement nir_op_[un]pack_[64/32]_\* 11837ec681f3Smrg- aco: propagate SGPRs into VOP1 instructions early. 11847ec681f3Smrg- aco: expand create_vector more carefully w.r.t. subdword operands 11857ec681f3Smrg- aco: use p_create_vector for nir_op_pack_half_2x16 11867ec681f3Smrg- nir/opt_algebraic: optimize unpack_half_2x16_split_x(ushr, a, 16) 11877ec681f3Smrg- aco: use p_split_vector for nir_op_unpack_half\_\* 11887ec681f3Smrg- aco: add validation rules for p_split_vector 11897ec681f3Smrg- aco: use v_cvt_pkrtz_f16_f32 for pack_half_2x16 11907ec681f3Smrg- radv,aco: lower_pack_half_2x16 11917ec681f3Smrg- aco: use VOP2 version of v_cvt_pkrtz_f16_f32 on GFX_6_7_10 11927ec681f3Smrg- aco: use VOP2 for v_cvt_pkrtz_f16_f32 if possible 11937ec681f3Smrg- aco: refactor GFX6_7 subdword copy lowering 11947ec681f3Smrg- aco: improve code sequences for 16bit packing 11957ec681f3Smrg- aco: use do_pack() for self-intersecting operations. 
11967ec681f3Smrg- aco: fix GFX8 16-bit packing 11977ec681f3Smrg- aco: implement nir_op_unpack_[64/32]_\* 11987ec681f3Smrg- ac/nir: implement nir_op_[un]pack_64_4x16 11997ec681f3Smrg- nir: add options to lower nir_op_pack_[64/32]_\* via nir_lower_alu_to_scalar() 12007ec681f3Smrg- radv: lower pack_[64/32]_\* via nir_lower_alu_to_scalar() 12017ec681f3Smrg- radv: remove call to nir_lower_pack() 12027ec681f3Smrg- aco: refactor split_store_data() to always split into evenly sized elements 12037ec681f3Smrg- nir/lcssa: consider loops with no back-edge invariant 12047ec681f3Smrg 12057ec681f3SmrgDaniel Stone (17): 12067ec681f3Smrg 12077ec681f3Smrg- glsl/test: Don't run whitespace tests in parallel 12087ec681f3Smrg- CI: Disable Panfrost T860 and AMD Stoney tests 12097ec681f3Smrg- CI: Skip flaky CS test on VirGL 12107ec681f3Smrg- CI: Skip another flaky GS test on softpipe 12117ec681f3Smrg- CI: Disable Panfrost T720/T760 CI 12127ec681f3Smrg- meson: Add MSVC narrowing-int-to-char warnings 12137ec681f3Smrg- CI: Windows: Use separate config file for Docker 12147ec681f3Smrg- CI: Re-enable VS2019 build 12157ec681f3Smrg- CI: Disable Windows again 12167ec681f3Smrg- CI: Temporarily disable Panfrost T7xx 12177ec681f3Smrg- CI: Re-enable Panfrost T7xx 12187ec681f3Smrg- CI: Disable Panfrost T7xx CI 12197ec681f3Smrg- CI: Re-enable Panfrost T7xx CI 12207ec681f3Smrg- CI: Don't run pixmark-piano twice on radeonsi 12217ec681f3Smrg- CI: Only run OpenCL tests when we need to 12227ec681f3Smrg- CI: Disable Panfrost T760 12237ec681f3Smrg- freedreno: Add missing dependency to build 12247ec681f3Smrg 12257ec681f3SmrgDanylo Piliaiev (19): 12267ec681f3Smrg 12277ec681f3Smrg- st/mesa: Treat vertex outputs absent in outputMapping as zero in mesa_to_tgsi 12287ec681f3Smrg- anv/nir: Unify inputs_read/outputs_written between geometry stages 12297ec681f3Smrg- spirv: Only require bare types to match when copying variables 12307ec681f3Smrg- ir_constant: Return zero on out-of-bounds vector accesses 
12317ec681f3Smrg- glsl: Eliminate assigments to out-of-bounds elements of vector 12327ec681f3Smrg- glsl: Eliminate out-of-bounds triop_vector_insert 12337ec681f3Smrg- intel/disasm: Change visibility of has_uip and has_jip 12347ec681f3Smrg- intel/disasm: brw_label and support functions 12357ec681f3Smrg- intel/disasm: Label support in shader disassembly for UIP/JIP 12367ec681f3Smrg- intel/assembler: Add labels support 12377ec681f3Smrg- intel/compiler: Fix pointer arithmetic when reading shader assembly 12387ec681f3Smrg- st/nir: Call st_glsl_to_nir_post_opts before interface unification 12397ec681f3Smrg- nir/lower_io: Eliminate oob writes and return zero for oob reads 12407ec681f3Smrg- nir/large_constants: Eliminate out-of-bounds writes to large constants 12417ec681f3Smrg- nir/lower_samplers: Clamp out-of-bounds access to array of samplers 12427ec681f3Smrg- intel/fs: Disable sample mask predication for scratch stores 12437ec681f3Smrg- docs: add INTEL_SHADER_ASM_READ_PATH description 12447ec681f3Smrg- nir/lower_returns: Append missing phis' sources after "break" insertion 12457ec681f3Smrg- freedreno/a6xx: Fix typo in height alignment calculation in a6xx layout 12467ec681f3Smrg 12477ec681f3SmrgDave Airlie (115): 12487ec681f3Smrg 12497ec681f3Smrg- anv: add no reloc flags on empty and simple bo paths. 12507ec681f3Smrg- CI: temp disable t720/t760 jobs. 12517ec681f3Smrg- llvmpipe: only read 0 for channels being read 12527ec681f3Smrg- gallium: add an interface for memory allocations. 12537ec681f3Smrg- gallium: add a resource flag to say no over allocation. 12547ec681f3Smrg- llvmpipe: add support for memory allocation APIs 12557ec681f3Smrg- gallivm/nir: add load push constant support 12567ec681f3Smrg- util/format: add some ZS helpers for vallium 12577ec681f3Smrg- vulkan/wsi: add sw support. 
(v2) 12587ec681f3Smrg- vallium: initial import of the vulkan frontend 12597ec681f3Smrg- llvmpipe/blit: for 32-bit unorm depth blits just copy 32-bit 12607ec681f3Smrg- llvmpipe: enable GL 4.5 12617ec681f3Smrg- vallium: fix input attachment lowering variable shadowing 12627ec681f3Smrg- llvmpipe/cs: update compute counters not fragment shader. 12637ec681f3Smrg- gallium/nir/tgsi: reindent some code in the nir->tgsi info (v2) 12647ec681f3Smrg- gallivm/nir: add imod support 12657ec681f3Smrg- gallivm/sample: fix lod query on array textures. 12667ec681f3Smrg- llvmpipe: lower uadd_carry/usub_borrow. 12677ec681f3Smrg- gallium/nir/tgsi: add support for compact variables 12687ec681f3Smrg- gallivm/nir: fixup compact TCS variable storage. 12697ec681f3Smrg- gallivm/nir: split tex/sampler indirect offsets 12707ec681f3Smrg- llvmpipe: lower cs local index from id 12717ec681f3Smrg- llvmpipe: lower mul 2x32_64 12727ec681f3Smrg- llvmpipe/nir: lower some more packing at nir level. 12737ec681f3Smrg- llvmpipe: add reference counting to fragment shaders. 12747ec681f3Smrg- vallium: handle 3D image views properly. 12757ec681f3Smrg- vallium: limit buffer allocations to gallium max. 12767ec681f3Smrg- gallium/nir/tgsi: fix nir->tgsi info conversion for samplers/image 12777ec681f3Smrg- gallivm/nir: lower tg4 offsets. 12787ec681f3Smrg- gallivm/nir: add indirect swizzle output loading support 12797ec681f3Smrg- gallivm/nir: add quantize to f16 support 12807ec681f3Smrg- gallivm/nir: fix const compact 12817ec681f3Smrg- gallivm/nir: lower frexp/ldexp 12827ec681f3Smrg- gallivm/nir: add subpass sampler type support 12837ec681f3Smrg- gallivm: use common code to select texel type 12847ec681f3Smrg- llvmpipe: blend has effects even if no colorbuffers. 
12857ec681f3Smrg- llvmpipe: add array/3d clearing support 12867ec681f3Smrg- llvmpipe/fs: multisample depth/stencil bad ir generated 12877ec681f3Smrg- gallivm/nir: allow 64-bit arit ops 12887ec681f3Smrg- gallivm/nir: add some f16 support 12897ec681f3Smrg- vallium: disable VK_KHR_shader_float16_int8. 12907ec681f3Smrg- vulkan/device_select: don't pick a cpu driver as the default 12917ec681f3Smrg- llvmpipe: include gallivm perf flags in shader cache. 12927ec681f3Smrg- gallivm: disable brilinear for lod bias and explicit lod. 12937ec681f3Smrg- vtn: add an option to create a nir library from spirv 12947ec681f3Smrg- clover/nir: add libclc lowering pass 12957ec681f3Smrg- util: add missing extern C 12967ec681f3Smrg- clover: handle libclc shader (v3) 12977ec681f3Smrg- gallivm: fix pow(0, y) to be 0 12987ec681f3Smrg- gallivm: fix 64-bit CL intrinsics. 12997ec681f3Smrg- gallivm/nir: fix up non 32-bit load stores 13007ec681f3Smrg- gallivm/nir: handle non-32-bit mul high 13017ec681f3Smrg- llvmpipe: use an alternate env var to enable clover. 13027ec681f3Smrg- lavapipe: rename vallium to lavapipe 13037ec681f3Smrg- gallivm/nir: make sure to mask global reads. 13047ec681f3Smrg- llvmpipe/cs: add in shader shared size. 13057ec681f3Smrg- gallivm/nir: fix non-32 bit find lsb/msb 13067ec681f3Smrg- lavapipe: drop dri,dricommon deps. 13077ec681f3Smrg- ci: move to using clang 10 for meson + clover 13087ec681f3Smrg- clover: Use core libclc loader 13097ec681f3Smrg- ci: enable piglit testing of clover/llvmpipe. 13107ec681f3Smrg- clover: don't call validate spirv at all if not needed 13117ec681f3Smrg- ci: fix deqp clone + fetch 13127ec681f3Smrg- CI: build our own spirv tools 13137ec681f3Smrg- clover/nir: add a constant folding pass before lowering mem const 13147ec681f3Smrg- llvmpipe: fix sampler/image binding for clover. 13157ec681f3Smrg- gallivm: add load/store scratch support. 
13167ec681f3Smrg- llvmpipe: fix 8/16 bit global stores 13177ec681f3Smrg- gallivm: fix 64->16 f2f16 13187ec681f3Smrg- gallivm: add 16-bit split/merge support. 13197ec681f3Smrg- gallivm: add b2i8/b216 support 13207ec681f3Smrg- gallivm: handle sub-32 bit masked stores. 13217ec681f3Smrg- gallivm: add support for 8/16-bit mul_hi 13227ec681f3Smrg- gallivm: get correct min/max behaviour for kernels. 13237ec681f3Smrg- gallivm: lower flrp for all sizes. 13247ec681f3Smrg- CI: remove llvmpipe cl flake test 13257ec681f3Smrg- gallivm: zero init the temporary register storage. 13267ec681f3Smrg- gallium: add a level parameter to resource parameter get 13277ec681f3Smrg- gallium: add a layer stride pipe resource parameter. 13287ec681f3Smrg- llvmpipe: add resource get param support. 13297ec681f3Smrg- lavapipe: use resource get param. 13307ec681f3Smrg- gallivm: fix f16 quantize. 13317ec681f3Smrg- lavapipe: don't write to pending clear aspects in cmd buffer 13327ec681f3Smrg- lavapipe: constify state pointers into command buffers. 13337ec681f3Smrg- lavapipe: fix dEQP-VK.info.device_properties 13347ec681f3Smrg- gallivm/nir: handle dvec3/4 inputs properly. 13357ec681f3Smrg- gallivm/nir: fix vulkan vertex inputs 13367ec681f3Smrg- lavapipe: fix 3d compressed texture copies. 13377ec681f3Smrg- lavapipe: stop crashes with 3D z blits 13387ec681f3Smrg- llvmpipe: add clear_buffer callback. (v2) 13397ec681f3Smrg- lavapipe: use clear_buffer callback 13407ec681f3Smrg- lavapipe: don't advertise linear filtering on integer textures. 13417ec681f3Smrg- gallium: add a non-multisample sample mask out behaviour flag. 13427ec681f3Smrg- llvmpipe: respect the sample mask in non-multisample flag 13437ec681f3Smrg- lavapipe: request correct sample mask behaviour 13447ec681f3Smrg- CL: update CL headers to 3.0 13457ec681f3Smrg- vtn/opencl: add ctz support 13467ec681f3Smrg- clover: access 3.0 and deprecated 2.2 API 13477ec681f3Smrg- clover/llvm: add 3.0 versioning. 
13487ec681f3Smrg- clover/spirv: hook up spir-v environment for 3.0 13497ec681f3Smrg- clover: add empty cl 3.0 dispatch entries. 13507ec681f3Smrg- gallium: handle empty cbuf slots in framebuffer samples helper 13517ec681f3Smrg- u_blitter: port radv 3D blit coords logic. 13527ec681f3Smrg- lavapipe: enable alpha to one. 13537ec681f3Smrg- lavapipe: disable SNORM blending for now 13547ec681f3Smrg- llvmpipe: just use draw_regions in draw/line setup. 13557ec681f3Smrg- draw: fix tess eval pipeline statistics. 13567ec681f3Smrg- lavapipe: fixup device allocate + enable private data 13577ec681f3Smrg- lavapipe: fix wsi acquire fences 13587ec681f3Smrg- llvmpipe/setup: move point stats collection earlier. 13597ec681f3Smrg- llvmpipe: fix multisample point rendering. 13607ec681f3Smrg- llvmpipe: fix multisample lines. 13617ec681f3Smrg- lavapipe: fixup mipmap precsion bits 13627ec681f3Smrg- lavapipe: enable pipeline stats queries 13637ec681f3Smrg- gallium: fix missing bit field in p_state.h 13647ec681f3Smrg 13657ec681f3SmrgDenis Pauk (1): 13667ec681f3Smrg 13677ec681f3Smrg- mesa: bptc fixes for decompress rgba_unorm and rgb_float 13687ec681f3Smrg 13697ec681f3SmrgDuncan Hopkins (10): 13707ec681f3Smrg 13717ec681f3Smrg- meson: Add xcb-fixes to loader when using x11 and dri3. Fixes undefined symbol for xcb_xfixes_create_region in loader_dri3_helper.c 13727ec681f3Smrg- zink: clamped maxPerStageDescriptorUniformBuffers limits to INT_MAX when stored as uint32_t. 13737ec681f3Smrg- zink: Basic framework to check for optional instance layers and instance extensions. 13747ec681f3Smrg- zink: Added support for MacOS MoltenVK APIs. 13757ec681f3Smrg- zink: return fail if create_instance fails 13767ec681f3Smrg- zink: Added inbuilt debug logging from the VK_LAYER_LUNARG_standard_validation layer. 13777ec681f3Smrg- zink: add support to device info for macro guards and just VkPhysicalDevice*Features with out the have\_. 13787ec681f3Smrg- zink: have_triangle_fans support. 
13797ec681f3Smrg- zink: For MoltenVk added vkFlushMappedMemoryRanges() to vkMapMemory() to fix empty mapped memory. 13807ec681f3Smrg- zink: make physical device functions use a dynamic function pointers. 13817ec681f3Smrg 13827ec681f3SmrgDylan Baker (31): 13837ec681f3Smrg 13847ec681f3Smrg- Bump development version and clear new_features 13857ec681f3Smrg- meson/freedreno: Fix lua requirement 13867ec681f3Smrg- docs: update calendar for 20.2.0-rc1 13877ec681f3Smrg- docs: update calendar for 20.2.0-rc2 13887ec681f3Smrg- meson/anv: Use variable that checks for --build-id 13897ec681f3Smrg- glsl/xxd.py: fix imports 13907ec681f3Smrg- clover/meson: use dep.get_variable instead of deprecated get_pkgconfig_variable 13917ec681f3Smrg- meson: generalize libclc usage 13927ec681f3Smrg- docs: update calendar and link releases notes for 20.2 13937ec681f3Smrg- docs/release-calender: Add 20.2 stable releases 13947ec681f3Smrg- docs: add release notes for 20.2.0 13957ec681f3Smrg- docs: Add sh256 sums for 20.2.0 13967ec681f3Smrg- docs: add release notes for 20.2.1 13977ec681f3Smrg- docs: add SHA256 sums for 20.2.1 13987ec681f3Smrg- docs: update calendar and link releases notes for 20.2.1 13997ec681f3Smrg- docs: add release notes for 20.2.2 14007ec681f3Smrg- dcs: Add sha256 sums for 20.2.2 14017ec681f3Smrg- docs: update calendar and link releases notes for 20.2.2 14027ec681f3Smrg- bump version for 20.3-rc1 14037ec681f3Smrg- .pick_status.json: Update to bf5cea7232f9ee2934c212211ebefb6fe766526d 14047ec681f3Smrg- .pick_status.json: Update to 87dc3106b077199b829a082e32ec33d0c6d400ab 14057ec681f3Smrg- .pick_status.json: Update to bac6cc586fe4c1b24351e0574d3a961eb631f6ae 14067ec681f3Smrg- bump VERSION for 20.3.0-rc2 release 14077ec681f3Smrg- .pick_status.json: Update to a59b1b18a95af1f8edb0093baf508e974e3251a2 14087ec681f3Smrg- .pick_status.json: Update to a92f597b98bb032b904c7c8a8c3a9fe798b51915 14097ec681f3Smrg- .pick_status.json: Update to 9fa1cdfe7ffd9e7ebd83055e2008f3e4b8ada549 
14107ec681f3Smrg- meson: Don't add extra values to shader-cache 14117ec681f3Smrg- appveyor: disable for now 14127ec681f3Smrg- bump VERSION for 20.3-rc3 14137ec681f3Smrg- .pick_status.json: Update to 89f6b72f19dbc503386643c6283047bdb1013bef 14147ec681f3Smrg- .pick_status.json: Update to d3c67d7e7ec6b9cf10fbea0d08e92751b7b0fbae 14157ec681f3Smrg 14167ec681f3SmrgEduardo Lima Mitev (9): 14177ec681f3Smrg 14187ec681f3Smrg- st: Pass TextureTiling option from texture to memory obj 14197ec681f3Smrg- freedreno: Implement memory object create/destroy for GL_EXT_memory_object 14207ec681f3Smrg- freedreno: Refactor fd_resource_create_with_modifiers() into a helper 14217ec681f3Smrg- freedreno/layout: Move hard-coded minimum width for UBWC to a macro 14227ec681f3Smrg- freedreno: implement pipe screen's resource_from_memobj 14237ec681f3Smrg- freedreno: Implement pipe screen's get_device/driver_uuid() 14247ec681f3Smrg- freedreno: Enable GL_EXT_memory_object and GL_EXT_memory_object_fd 14257ec681f3Smrg- freedreno: Destroy syncobj too when destroying fence 14267ec681f3Smrg- turnip: Enable support for KHR_incremental_present 14277ec681f3Smrg 14287ec681f3SmrgEleni Maria Stea (3): 14297ec681f3Smrg 14307ec681f3Smrg- radeonsi: support for external buffers (ext_external_objects) 14317ec681f3Smrg- iris: handle PIPE_FD_TYPE_SYNCOBJ type 14327ec681f3Smrg- iris: add support for fence signal capability 14337ec681f3Smrg 14347ec681f3SmrgEmil Velikov (1): 14357ec681f3Smrg 14367ec681f3Smrg- radv: restrict exported symbols with static llvm 14377ec681f3Smrg 14387ec681f3SmrgEmmanuel Vadot (1): 14397ec681f3Smrg 14407ec681f3Smrg- util/os_misc: os_get_available_system_memory() for FreeBSD 14417ec681f3Smrg 14427ec681f3SmrgEric Anholt (221): 14437ec681f3Smrg 14447ec681f3Smrg- util: Split the pack/unpack functions out of the format desc. 14457ec681f3Smrg- util: Change a codegenned switch statement to a nice little table. 14467ec681f3Smrg- util: Fix up indentation in the generated format tables code. 
14477ec681f3Smrg- uitl: Add R1_UNORM to the list of noaccess (no pack/unpack) formats. 14487ec681f3Smrg- util: Make all 3 fetch_rgba functions occupy the same function slot. 14497ec681f3Smrg- util: Mark the format description getter functions as const. 14507ec681f3Smrg- util: Move fetch_rgba to a separate function table. 14517ec681f3Smrg- gallium: Use unpack_rgba() instead of fetch_rgba in translate_generic 14527ec681f3Smrg- freedreno/ir3: Fix compiler warning from the setjmp fails path. 14537ec681f3Smrg- freedreno/cffdec: When .mergedregs is set, don't count half regs. 14547ec681f3Smrg- freedreno/ir3: Fix assertion failures dumping CS high full regs. 14557ec681f3Smrg- util: Expose rgba unpack/fetch functions as external functions as well. 14567ec681f3Smrg- util: Explicitly call the unpack functions from inside bptc pack/unpack. 14577ec681f3Smrg- radv: Move nir_opt_shrink_vectors() into the opt loop. 14587ec681f3Smrg- nir/opt_undef: Handle a couple more normal store intrinsics. 14597ec681f3Smrg- nir: Expand opt_undef to handle undef channels in a store intrinsic. 14607ec681f3Smrg- nir: Shrink store intrinsic num_components to the size used by the writemask. 14617ec681f3Smrg- ci/deqp-runner: Drop stale comment from deqp-runner.sh. 14627ec681f3Smrg- ci/deqp-runner: Drop unused "count" variable 14637ec681f3Smrg- ci/deqp-runner: Add a post-deqp-run filter list for known flakes. 14647ec681f3Smrg- ci/freedreno: Move our skips lists over to being known-flakes lists. 14657ec681f3Smrg- ci/freedreno: List more common flakes reported recently. 14667ec681f3Smrg- ci/bare-metal: Use a new serial buffer tool. 14677ec681f3Smrg- ci/bare-metal: Convert the main cros-servo boot code to python 14687ec681f3Smrg- ci/bare-metal: Retry booting chezas instead of failing when !POWER_GOOD 14697ec681f3Smrg- ci/bare-metal: Try rebooting chezas again if they get stuck during tftp. 14707ec681f3Smrg- nir: Make the nir_builder \*_imm helpers consistently handle bit size. 
14717ec681f3Smrg- nir: Add nir_[iu]shr_imm and nir_udiv_imm helpers and use them. 14727ec681f3Smrg- nir: Add a lowering pass for backends wanting load_ubo with vec4 offsets. 14737ec681f3Smrg- freedreno/ir3: Replace our custom vec4 UBO intrinsic with the shared lowering. 14747ec681f3Smrg- nir/load_store_vectorizer: Clean up unit test swizzle assertions. 14757ec681f3Smrg- freedreno: Drop UNIFORM_BUFFER_OFFSET_ALIGNMENT to 32 14767ec681f3Smrg- ci: Mark the rest of compswap as flaky on freedreno. 14777ec681f3Smrg- freedreno/a5xx: Don't set the VARYING flag for fragcoord-only programs. 14787ec681f3Smrg- ci: Test the KHR-GL\* CTS cases with softpipe. 14797ec681f3Smrg- nir/opt_copy_prop_vars: Quiet valgrind warning about overlapping memcpy. 14807ec681f3Smrg- nir: Add a helper for general instruction-modifying passes. 14817ec681f3Smrg- nir/lower_vec_to_movs: Convert to use nir_shader_instructions_pass(). 14827ec681f3Smrg- nir/opt_undef: Convert to use nir_shader_instructions_pass(). 14837ec681f3Smrg- nir/lower_io_to_scalar: Convert to use nir_shader_instructions_pass(). 14847ec681f3Smrg- nir/nir_lower_wrmasks: Use the nir_lower_instructions_pass() helper. 14857ec681f3Smrg- nir/lower_discard_to_demote: Use nir_shader_instructions_pass(). 14867ec681f3Smrg- drm-shim: Fix unused variable warnings from asserts in release build. 14877ec681f3Smrg- panfrost: Fix OOB array access compiler warning. 14887ec681f3Smrg- panfrost: Fix remaining release-build warnings. 14897ec681f3Smrg- gallium/tests: Fix compiler warning about unused vars in trivial tests. 14907ec681f3Smrg- nvc0: Fix compiler warning about unused var that gets asserted. 14917ec681f3Smrg- vc4: Fix unused var warnings in release builds from assertions. 14927ec681f3Smrg- nv50: Fix uninitialized var warnings from using assert() as unreachable(). 14937ec681f3Smrg- zink: Fix unused var warnings in release build from assertions. 14947ec681f3Smrg- etnaviv: Fix unused var warning in release build from assertions. 
14957ec681f3Smrg- lima: Fix unused var/function warnings in release build from assertions. 14967ec681f3Smrg- lima: Fix uninitialized var warning from using assert() as unreachable(). 14977ec681f3Smrg- virgl: Fix unused var warnings in release build from assertions. 14987ec681f3Smrg- ci: Add a release build with -Werror enabled. 14997ec681f3Smrg- nir: Fix printing of individual instructions with io semantics. 15007ec681f3Smrg- nir: Look up the shader when printing a single instruction. 15017ec681f3Smrg- ci: Make a missing device name correctly bail out of deqp-runner.sh. 15027ec681f3Smrg- turnip: Make sure we include the build id. 15037ec681f3Smrg- pipe-loader: Use real galliumvl if radeonsi is being linked. 15047ec681f3Smrg- ci: Switch to using gold as the linker. 15057ec681f3Smrg- nir: Invalidate live SSA def information when making new SSA defs. 15067ec681f3Smrg- nir: Switch the indexing of block->live_in/out arrays. 15077ec681f3Smrg- ci: Bump vulkan CTS version to 1.2.3.2, and keep the GL CTS around. 15087ec681f3Smrg- ci: Use the same VK-GL-CTS tree for GL/GLES as VK. 15097ec681f3Smrg- ci: Enable KHR-GL30 CTS testing on freedreno a630. 15107ec681f3Smrg- freedreno/a6xx: Add ARB_depth_clamp and separate clamp support. 15117ec681f3Smrg- gallivm: Report the unsupported intrinsic instead of just assert(0); 15127ec681f3Smrg- gallium/tgsi: Add support for PRIMITIVEID as a system value. 15137ec681f3Smrg- gallium/tgsi: Add some missing opcodes to tgsi_ureg. 15147ec681f3Smrg- gallium/tgsi: Add a helper for initializing ureg from a shader_info. 15157ec681f3Smrg- gallium/ureg: Set the next shader stage from the shader info. 15167ec681f3Smrg- nir: Add simplistic lowering for bany_equal/ball_inequal. 15177ec681f3Smrg- nir/opt_vectorize: Add a callback for filtering of vectorizing. 15187ec681f3Smrg- gallium/tgsi_exec: Add missing DFLR opcode support. 
15197ec681f3Smrg- gallium/tgsi_exec: Fix up NumOutputs counting 15207ec681f3Smrg- ci/bare-metal: Use re.search() instead re.match() for our line matching. 15217ec681f3Smrg- ci/bare-metal: Fix detection of "POWER_GOOD not seen in time" fails 15227ec681f3Smrg- ci/bare-metal: Include a timestamp in our serial reads. 15237ec681f3Smrg- ci/bare-metal: Log why our run restarts when it does. 15247ec681f3Smrg- ci/bare-metal: Fix capturing of serial output as job artifacts. 15257ec681f3Smrg- ci/bare-metal: Use python for handling fastboot booting and parsing 15267ec681f3Smrg- nir/load_store_vectorizer: Use more imm helpers in the tests. 15277ec681f3Smrg- nir/load_store_vectorizer: Add unit tests for alignment handling. 15287ec681f3Smrg- nir: Update the comment about nir_lower_uniforms_to_ubo()'s multiplier. 15297ec681f3Smrg- nir: Add a range_base+range to nir_intrinsic_load_ubo(). 15307ec681f3Smrg- freedreno/ir3: Use the new NIR UBO ranges in UBO analysis. 15317ec681f3Smrg- freedreno/ir3: Apply the max upload limit to initial range setup 15327ec681f3Smrg- nir: Use explicit deref information to provide real UBO ranges. 15337ec681f3Smrg- iris: Add missing range_base/range to our nir_load_ubos. 15347ec681f3Smrg- turnip: Fix a compiler warning in release builds of the query code. 15357ec681f3Smrg- freedreno: Make the pack struct have a .qword for wide addresses. 15367ec681f3Smrg- turnip: Fix truncation of CS shader iovas to 32 bits. 15377ec681f3Smrg- turnip: Fix truncation of iovas to 32 bits in queries. 15387ec681f3Smrg- ci/bare-metal: Update the kernel to msm-next-pgtables 15397ec681f3Smrg- ci/bare-metal: Allow wget of the kernel/dtb for kernel development. 15407ec681f3Smrg- freedreno: Add another new sysmem flake. 15417ec681f3Smrg- freedreno/cffdec: Fix up texturator parsing scripts for XML changes. 15427ec681f3Smrg- freedreno/cffdec: Add support for texturator's 2DMS layout setup. 15437ec681f3Smrg- freedreno/fdl: Add layout test for the Android CTS's MSAA mustpass surface. 
15447ec681f3Smrg- turnip: Add support for a615. 15457ec681f3Smrg- turnip/kgsl: Associate fences with submits. 15467ec681f3Smrg- mesa: Make the android_stub be a set of non-installed shared libraries. 15477ec681f3Smrg- android: Disable trying to read/write to the disk cache. 15487ec681f3Smrg- gallium/drm: Deduplicate screen creation for the dynamic (clover) pipe loader. 15497ec681f3Smrg- gallium/drm: Refactor the stub screen create functions. 15507ec681f3Smrg- gallium/drm: Define the DRM entrypoints in drm_helper.h 15517ec681f3Smrg- gallium/drm: Make the pipe loader handle the driconf merging. 15527ec681f3Smrg- util/xmlconfig: Add a unit test of the code. 15537ec681f3Smrg- virgl: Clean up the driconf definition of GLES_SAMPLES_PASSED_VALUE. 15547ec681f3Smrg- driconf: Use nesting macros for defining options. 15557ec681f3Smrg- mesa: Promote Intel's simple logging façade for Android to util/ 15567ec681f3Smrg- turnip: Replace tu_log*() with mesa_log*() 15577ec681f3Smrg- ci/freedreno: Sort the traces in the .yml of expectations 15587ec681f3Smrg- ci/freedreno: Add trace tests for glxgears, 0 A.D., and xonotic. 15597ec681f3Smrg- nir/lower_clip: Add i/o semantics for load/store intrinsics. 15607ec681f3Smrg- intel: Add support for i945g to intel_stub_gpu. 15617ec681f3Smrg- freedreno/ir3: Make sure we run the opt loop after lowering UBOs to vec4. 15627ec681f3Smrg- nir: Document a bit about how align_mul/offset work. 15637ec681f3Smrg- nir: Print the alignment information on casts. 15647ec681f3Smrg- nir/nir_lower_uniforms_to_ubo: Set better alignments on our new instructions. 15657ec681f3Smrg- nir/gl_nir_lower_buffers: Set up align_mul/offset on UBOs. 15667ec681f3Smrg- nir: Make the load_store_vectorizer provide align_mul + align_offset. 15677ec681f3Smrg- nir: Drop the high_offset argument to the load_store_vectorizer filter. 15687ec681f3Smrg- nir: Make nir_lower_ubo_vec4() handle non-vec4-aligned loads. 15697ec681f3Smrg- freedreno/ir3: Enable the i/o vectorizer on UBOs. 
15707ec681f3Smrg- ci/bare-metal: Move the "POWER_GOOD not seen in time" check to the right time. 15717ec681f3Smrg- driconf: Eliminate the DRI_CONF_OPT_BEGIN_B macro. 15727ec681f3Smrg- driconf: Fix extra quoting on "Jimenez'". 15737ec681f3Smrg- r200: Reuse DRI_CONF_OPT_F for texture_blend_quality. 15747ec681f3Smrg- driconf: Make a DRI_CONF_OPT_S() for string options. 15757ec681f3Smrg- util/xmlconfig: Drop silly open-coded strdup. 15767ec681f3Smrg- util/xmlconfig: Indent to Mesa style. 15777ec681f3Smrg- driconf: Delete disjoint range support. 15787ec681f3Smrg- driconf: Use DRI_CONF_OPT_I for remaining int options 15797ec681f3Smrg- driconf: Make the driver's declarations be structs instead of XML. 15807ec681f3Smrg- driconf: Stop quoting true/false in boolean option definitions. 15817ec681f3Smrg- util/xmlconfig: Drop use of XML_Char in parsing. 15827ec681f3Smrg- android: Disable the user XML config parsing. 15837ec681f3Smrg- turnip: Don't expose VK_ANDROID_native_buffer on non-Android. 15847ec681f3Smrg- turnip: Use mesa's normal PRINTFLIKE macro instead of our own. 15857ec681f3Smrg- turnip: Mark the vk_errorf helper as bring printflike. 15867ec681f3Smrg- turnip: Extend the coverage of TU_DEBUG=startup. 15877ec681f3Smrg- turnip: Always enable TU_DEBUG=startup on debug drivers. 15887ec681f3Smrg- turnip: Report device loss through \_mesa_loge() instead of fprintf. 15897ec681f3Smrg- turnip/kgsl: Add strerror decode in BO init failure. 15907ec681f3Smrg- driconf: Make sure that the range check on the defaults actually works. 15917ec681f3Smrg- driconf: Restore the ability to override driconf with the environment. 15927ec681f3Smrg- ci/softpipe: Add another flaky GS test to the skips list. 15937ec681f3Smrg- freedreno/ir3: Clean up the UBO upload plan setup. 15947ec681f3Smrg- freedreno/ir3: Don't leave holes the UBO upload plan. 15957ec681f3Smrg- turnip/kgsl: Fix last minute breakage of the build. 15967ec681f3Smrg- turnip/kgsl: Add support for importing dma-bufs. 
- turnip: Detect Qualcomm gralloc and its UBWC flag on gralloc surfaces.
- turnip: Add support for GetSwapchainGrallocUsage2ANDROID().
- meson: Drop adding -Wl,--gc-sections to project c/cpp arguments.
- glsl/tests: Make the tests skip on Android binary execution failures.
- symbols-check: Add \__cxa_guard_\* to the list of approved symbols.
- ci/android: Switch to using the Android NDK.
- docs: Document how to replicate a CI build locally.
- android_stub: Update platform headers to include gralloc1.h.
- ci/android: Switch build to using platform SDK version 26.
- util: Import a copy of drm's libsync.h
- android: Add pre-4.7 Android kernel compatibility to our libsync header.
- turnip: Drop a dead error checking path in device init.
- turnip: Use Mesa's libsync.h instead of libdrm's libsync.h.
- turnip: Don't link the WSI code if we don't have a WSI extension.
- turnip: Only link libdrm in the DRM case, not KGSL.
- ci: Enable NIR_VALIDATE everywhere.
- nir: Introduce nir_metadata_instr_index for nir_index_instr() being current.
- nir: Replace nir_ssa_def->live_index with nir_instr->index.
- nir: Add a block start/end ip to live instr index metadata.
- nir: Add a call to get a struct describing SSA liveness per instruction.
- nir: Add an option to not lower source mods for f64/u64/i64.
- gallium: Add a nir-to-TGSI pass.
- softpipe: Fix buffer overflows in SSBO atomics.
- softpipe: Switch to using NIR as the shader format from mesa/st.
- meson: Only require libexpat when a part of the build needs it.
- freedreno: Use Android's libsync instead of libdrm's.
- meson: Don't try to build GLX by default on Android.
- meson: Don't enable libunwind by in 'auto' mode on Android.
- docs: Document how to build and install Android drivers.
- freedreno/cffdec: Fix format overflow warning.
- freedreno/tools: Fix compiler warnings about using sz in the error paths.
- freedreno/fdperf: Silence a compiler warning about current counter.
- turnip: Handle some error paths in allocating CS space from a command buffer.
- turnip: Handle the error path for tu/drm's vkResetFences().
- turnip: Add error path handling for descriptor pool init.
- ci: Enable Werror on meson-arm64-build-test.
- gallium/ntt: Add default compiler options for non-native-NIR drivers.
- st/mesa: Drop the TGSI paths for PBOs and use nir-to-tgsi if needed.
- st/mesa: Drop the TGSI paths for drawpixels and use nir-to-tgsi if needed.
- nir: Only validate in passes that might have changed things.
- docs: Move the gallium driver documentation to the top level.
- docs/vmware: Move the vmware driver docs into the drivers section.
- docs/vc4: Move my old vc4 wiki's documentation into docs.mesa3d.org.
- docs/vc4: Add information on the hw documentation available.
- docs/v3d: Add a little stub of v3d documentation.
- docs: Drop extra link to old DRI wiki in the "Help" section.
- docs: Add a link to the linux kernel DRM docs under "Developer Topics"
- docs: Fix "Hosted by" link and drop duplicate.
- ci: Add the new timeout-prone softpipe-gl test to the skips list.
- mesa/st: Fix a use-after-free of the NIR shader stage.
- st/nir: Fix the st->pbo.use_gs case.
- st/nir: Drop setting interp mode on system values in builtins.
- tu: Make sure spirv_to_nir knows we support imageStorageWithoutFormat.
- turnip: Fix image size for 3D vkGetImageSubresourceLayout.
- ci/bare-metal: Apply autopep8 to the bare-metal scripts.
- ci/bare-metal: Reset colors at the end of a line of serial output.
- ci/deqp: Switch to a new dEQP runner written in Rust.
- util/set: Fix the \_mesa_set_clear function to not leave tombstones.
- ci: Only install kernel modules for LAVA devices.
- gallium/draw: Fix rasterizer_discard for wide points/lines.
- freedreno: Fix leak of shader binary on disk cache hits.
- freedreno: Fix warning about uninit size for the size==0 special case.
- gallium: Fix leak of the merged driconf options.
- freedreno: Fix leak of u_transfer_helper.
- gallium: Fix leak of bound SSBOs at CSO context destruction.
- gallivm: Fix max const buffer count.
- gallium: Fix leak of currently bound UBOs at CSO context destruction.
- freedreno: Break out of "should we free the entry" loop once we've freed.

Eric Engestrom (94):

- pick-ui: specify git commands in "resolve cherry pick" message
- egl/entrypoint-check: split sort-check into a function
- egl/entrypoint-check: add check that GLVND and plain EGL have the same entrypoints
- driconf: fix force_gl_vendor description
- meson: bump required glvnd version
- egl: replace \_EGLDriver param with \_EGLDisplay->Driver in \_eglReleaseDisplayResources()
- egl: replace \_EGLDriver param with \_EGLDisplay->Driver in dri{2_x11,3}_create_window_surface()
- egl: replace \_EGLDriver with \_EGLDisplay->Driver in \_eglQuerySurface()
- egl: drop unused \_EGLDriver from Initialize()
- egl: drop unused \_EGLDriver from Terminate()
- egl: drop unused \_EGLDriver from {Create,Destroy}Context()
- egl: drop unused \_EGLDriver from Create{Window,Pixmap,Pbuffer}Surface() & DestroySurface()
- egl: drop unused \_EGLDriver from MakeCurrent()
- egl: drop unused \_EGLDriver from QuerySurface()
- egl: drop unused \_EGLDriver from {Bind,Release}TexImage()
- egl: drop unused \_EGLDriver from SwapInterval()
- egl: drop unused \_EGLDriver from SwapBuffers{,WithDamageEXT,RegionNOK}()
- egl: drop unused \_EGLDriver from CopyBuffers()
- egl: drop unused \_EGLDriver from SetDamageRegion()
- egl: drop unused \_EGLDriver from WaitClient()
- egl: drop unused \_EGLDriver & \_EGLDisplay from WaitNative()
- egl: drop unused \_EGLDriver from GetProcAddress()
- egl: drop unused \_EGLDriver from {Create,Destroy}ImageKHR()
- egl: drop unused \_EGLDriver from {Create,Destroy,ClientWait,Wait,Signal}SyncKHR()
- egl: drop unused \_EGLDriver from DupNativeFenceFDANDROID()
- egl: drop unused \_EGLDriver from {Create,Export}DRMImageMESA()
- egl: drop unused \_EGLDriver from {Bind,Unbind,Query}WaylandDisplayWL()
- egl: drop unused \_EGLDriver from CreateWaylandBufferFromImageWL()
- egl: drop unused \_EGLDriver from PostSubBufferNV()
- egl: drop unused \_EGLDriver from QueryBufferAge()
- egl: drop unused \_EGLDriver from ExportDMABUFImage{,Query}MESA()
- egl: drop unused \_EGLDriver from QueryDmaBuf{Formats,Modifiers}EXT()
- egl: drop unused \_EGLDriver from SetBlobCacheFuncsANDROID()
- egl: drop unused \_EGLDriver from \_eglGetConfigs()/_eglChooseConfig()/_eglGetConfigAttrib()
- egl: drop unused \_EGLDisplay from \_eglSetDamageRegionKHRClampRects()
- egl: drop unused \_EGLDriver & \_EGLDisplay from \_eglQueryContext()
- egl: drop unused \_EGLDriver from \_eglSurfaceAttrib()
- egl: replace \_EGLDriver with \_EGLDisplay->Driver in \_eglGetSyncAttrib()
- egl: replace replace \_EGLDriver with \_EGLDisplay->Driver in eglapi.c
- egl: drop unused \_EGLDriver from MesaGLInteropEGL{QueryDeviceInfo,ExportObject}()
- egl: replace \`&_eglDriver\`/\`NULL\` tested against \`NULL\` with simple \`true\`/\`false\`
- egl: drop unused ${drv}_driver()
- egl: inline \_eglGetDriverProc() into eglGetProcAddress()
- egl: inline \_eglInitializeDisplay() into eglInitialize()
- egl: drop now empty egldriver.c
- egl: drop unused egldriver.h header
- meson: fix trivial s/dir/dri/ typo
- egl/x11_dri3: enable & require xfixes 2.0
- egl/x11_dri3: implement EGL_KHR_swap_buffers_with_damage
- docs: add release notes for 20.1.6
- docs: update calendar and link releases notes for 20.1.6
- gitlab-ci: fix testing whether a variable with a given name is set or not
- gitlab-ci: fix quoting of variables passed down to bare-metal runners
- egl: drop an indentation level in \_eglFindDisplay() by replacing break/if with a goto
- egl: drop another indentation level in \_eglFindDisplay() by inverting an if
- egl: drop invalid shebang
- scons: bump c++ standard to 14 to match meson
- docs/egl: fix typo
- docs/egl: move section around
- docs/egl: complete list of dri2 platforms
- docs/egl: add haiku driver
- docs/egl: add some more documentation
- docs/egl: correct/update DRI2 mention with the shiny new DRI3
- egl: move extension driver functions after core functions
- egl: document which driver hooks are only required by extensions
- egl: inline eglSwapInterval() fallback
- egl: simplify eglSwapInterval() fallback logic
- meson: don't advertise TLS support if glx wasn't build with it
- egl/android: simplify dri2_initialize_android()
- egl/surfaceless: simplify dri2_initialize_surfaceless()
- egl/wayland: simplify dri2_initialize_wayland()
- egl/x11: simplify dri2_initialize_x11()
- docs: add release notes for 20.1.7
- docs: update calendar and link releases notes for 20.1.7
- docs: shift 20.2 rc dates by two weeks to match reality
- meson: drop leftover PTHREAD_SETAFFINITY_IN_NP_HEADER
- docs/download: mention tarball GPG signatures and link to the keys
- docs: add another 20.1.x release to allow for more overlap with 20.2
- docs/release-calendar: update 20.2
- docs: add release notes for 20.1.8
- docs: update calendar and link releases notes for 20.1.8
- bin/gen_release_notes.py: escape special rST characters
- docs: add release notes for 20.1.9
- docs: update calendar and link releases notes for 20.1.9
- add one last 20.1 release to coincide with expected 20.2.1
- radv: add missing u_atomic.h include
- docs: fix relnotes index
- docs: fix release calendar
- docs: fix 20.2.0 relnotes
- docs: add release notes for 20.1.10
- docs: update calendar and link releases notes for 20.1.10
- docs/release-calendar: plan 20.3 release
- gitlab-ci: drop deprecated platforms that snuck in when nobody was watching
- meson: drop deprecated EGL platform build options

Erico Nunes (4):

- lima: dont split vec3 unaligned load inputs
- lima: allocate new bo for stream draw
- lima: fix vertex shader uniform buffer size
- lima: add natively supported vertex buffer formats

Erik Faye-Lund (123):

- st/wgl: do not reject PFD_SUPPORT_GDI
- gallium/util: factor out primitive-restart rewriting logic
- gallium/indices: don't expand prim-type for 8-bit indices
- gallium/indices: generalize primitive-restart logic
- gallium/indices: implement prim-restart for line-loops
- gallium/indices: use prim_restart-helper for polygon
- gallium/indices: implement prim-restart for triangle fans
- gallium/indices: introduce u_primconvert_config
- gallium/indices: translate primitive-restart values
- compiler/nir: make lowering global-id to local-id optional
- nir: add iabs-lowering code
- gallium/util: use uint sampler for stencil-reads
- nir: fix const-cast warning on MSVC
- v3d: remove unused header
- vc4: remove unused header
- gallium/aux: remove unused u_blit.[ch\]
- gallium/util: add shader for stencil-blits
- gallium/util: add blitter-support for stencil-fallback
- mesa: handle GL_FRONT after translating to it
- zink: correct typo in stencil-setup
- zink: store base-object of DSA-state
- zink: only set stencil-ref for back if two-sided
- docs: escape backquote character
- docs: show 'Edit on GitLab'-link
- docs: store prefixes in redirects
- docs: remove webmaster article
- docs: everytime -\> every time
- docs: apis -\> APIs
- docs: scons -\> SCons
- docs: frambuffer -\> framebuffer
- docs: make two acronyms upper-case
- docs: unecessarily -\> unnecessarily
- docs: behaviour -\> behavior
- docs: timeplan -\> time plan
- docs: initialisation -\> initialization
- docs: gitlab -\> GitLab
- docs: url -\> URL
- docs: recognisable -\> recognizable
- docs: drop outdated gallium-docs comment
- docs: clippping -\> clipping
- docs: consistantly -\> consistently
- docs: stabilisation -\> stabilization
- docs: flavours -\> flavors
- docs: debian -\> Debian
- docs: docker -\> Docker
- docs: gallium -\> Gallium
- st/mesa: use roundf instead of floorf for lod-bias rounding
- gallium/util: set right dst-dimensions
- gallium/util: fix texture-coordinates for stencil-fallback
- gallium/util: allow scaling blits for stencil-fallback
- docs: softwara -\> software
- docs: existant -\> existent
- docs: webservice -\> web service
- docs: bpp -\> BPP
- docs: llvm -\> LLVM
- docs: correct reference to meson.build
- docs: meson -\> Meson
- docs: python3 -\> Python 3
- docs: flex -\> Flex
- docs: bison -\> Bison
- docs: mako -\> Mako
- docs: chocolatey -\> Chocolatey
- docs: ninja -\> Ninja
- docs: mingw -\> MinGW
- docs: microsoft -\> Microsoft
- docs: linux -\> Linux
- docs: windows -\> Windows
- docs: visual studio -\> Visual Studio
- docs: gpu -\> GPU
- docs: cmake -\> CMake
- docs: x11 -\> X11
- docs: wayland -\> Wayland
- docs: drm -\> DRM
- docs: android -\> Android
- docs: git -\> Git
- docs: quote "git log"
- docs: scons -\> SCons
- docs: ubuntu -\> Ubuntu
- docs: vmware -\> VMWare
- docs: Sandybridge -\> Sandy Bridge
- docs: cpu -\> CPU
- gallium/util: fix memory-leak
- gallium/util: allow scissored blits for stencil-fallback
- zink: use nir_lower_ubo_vec4 to simplify things a bit
- zink: support non-const offsets
- zink: support loading any UBO
- zink: do not report SSBOs as halfway supported
- zink: add feature-documentation
- zink: reject resource-imports with modifiers
- v3d: do not report alpha-test as supported
- vc4: do not report alpha-test as supported
- nir: drop support for using load_alpha_ref_float
- nir: drop unused alpha_ref_float
- docs: create leading directories for redirects
- docs: verify that targets for relative redirects exist
- docs: specify redirects relative to docs-root
- docs: specify redirects in conf.py instead
- zink: verify geometry shader feature
- docs: do not document required minimum
- docs: document zink's gl \\> 3.0 requirements
- mapi: remove unused function
- mapi: do not call thread-unsafe dispatch getter
- mapi: do not return thread-specific data for wrong thread
- docs: add link to extension spec
- docs: ie. -\> i.e.
- docs: eg. -\> e.g.
- docs: anistropy -\> anisotropy
- docs: api -\> API
- docs: hud -\> HUD
- docs: fbo -\> FBO
- docs: gcc -\> GCC
- docs: clang -\> Clang
- docs: s3tc -\> S3TC
- spirv: correct sematic-typo
- libgl-gdi: support building without softpipe
- gallium/util: do not pass undefined sample-count
- softpipe: correct signature of get_compiler_options
- mesa/main: add missing include in glformats.h
- zink: more accurately track supported blits
- zink: fix layered resolves
- zink: fall back to util_blitter for scaled resolves
- docs: document new zink-flag
- zink: do not require VK_KHR_external_memory

Felix Yan (1):

- Correct a typo in threads_win32.h

Gert Wollny (81):

- gallium + mesa/st: Add PIPE_CAP_NIR_ATOMICS_AS_DEREF and use it
- r600: Set PIPE_CAP_NIR_ATOMICS_AS_DEREF to true
- r600/sfn: Sort uniforms by binding and offset
- r600/sfn: add r600 specific lowering pass for atomics and use it
- r600/sfn: Add a mapping table for atomics
- r600/sfn: correct allocating and emitting of atomics
- r600/sfn: Correct ssbo instruction handling
- r600/sfn: handle querying SSBO size
- r600/sfn: Force a minimum of 4 GPRs, it seems to fix atomics
- r600: Enable compute shaders for NIR code path
- compiler/nir: rewrite lower_fragcoord_wtrans to use nir_lower_instructions
- compiler/nir: extend lower_fragcoord_wtrans to support VARYING_SLOT_POS
- gallium/aux: reorder vertex attributes in triangle fans according to PV
- meson: Make some warnings handled as errors with MSVC
- r600: revert disabling llvm draw
- r600/nir: fetch sources and split uniforms before emittting alu instructions
- r600/sfn: correct ring op patching
- r600/sfn: Fix loading vertex attributes
- r600/sfn: clone shader before lowering to registers and src/dest modifiers
- r600/sfn: Fix ordering of tex param moves
- r600/sfn: avoid some copies
- r600/sfn: Lower \*sign opcodes in nir
- r600/sfn: Fix split_alu_modifiers
- r600/sfn: Fix bitfield ops and 2x16 split_y
- r600/sfn: Fix source swizzle for gradient queries
- r600/sfn: more fixing of vec4 fetching
- r600/sfn: Fix comparison with different signedness
- nir: Add option lower_uniforms_to_ubo
- radeonsi: set compiler flag lower_uniforms_to_ubo
- freedreno/ir3: set lower_uniforms_to_ubo compiler flag
- intel/compiler: Set lower_uniform_to_ubo compiler flag
- llvmpipe: set lower_uniform_to_ubo compiler flag
- gallium+mesa/st: lower uniforms based on compiler flag instead of packed uniforms cap
- r600: enable lowering uniforms to UBO
- r600/sfn: Use load_ubo_vec4 lowering pass
- nir: remove ubo_r600 instrinsic since ubo_vec4 is used now
- r600/sfn: make number of source components a local variable
- r600/sfn: Fix component count for fdph
- r600/sfn: Fix typo in comment
- r600/sfn: use cnde instead of cnde_int
- r600/sfn: run late algebraic optimizations
- r600/sfn: remove a useless if-condition
- r600: Add flag for dual-source blending to shader key
- r600/sfn: Sort the outputs of the FS according to data index
- r600/sfn: Keep info about dual-source blend in FS
- r600/sfn: Handle number of color outputs taking dual source blending into account
- r600/sfn: Take dual source blending output indices into account
- r600/sfn: Acquire the number of FS outputs and the write_all info early
- r600/sfn: Be a bit more verbose when logging skipped FS outputs
- r600/sfn: Fix emitting shared atomics with constant sources
- r600/sfn: Handle nir_op_b2b32
- r600/sfn: lower to scalar for some optimizations and vectortize later
- r600/sfn: Support group memory barrier
- r600/sfn: save some instructions when doing multisample on sample 0
- r600/sfn: use fine gradient evaluation for interpolate_at_offset
- r600/sfn: Fix interpolate at sample
- r600/sfn: Fix indirect const buffer access
- r600/sfn; go back to not lowering uniforms to UBOs
- r600/sfn: replace hand-backed literal check by NIR function
- r600/sfn: remove old code to track uniforms as it is no longer needed
- r600/sfn: Add support for helper invocations
- r600/sfn: Fix enabling the right interpolator for inerpolate_at_sample
- r600/sfn: Fix IDX register ID
- r600/sfn: Add support for more barrier instructions
- r600/sfn: extend life range of all variables by one
- r600/sfn: Don't reuse registers for workgroup ID and local invocation ID
- r600/sfn: Fix ssbo resource offset for buffer loads
- r600/sfn: Fix keepalive patch
- r600/sfn: fix mega fetch count for SSBO/Image atomics result fetch
- r600/sfn: Rework get_temp_register to return a smart pointer to GPRValue
- r600/sfn: use shared pointer to GPR for FS sysvalues
- r600/sfn: Handle mem barrier and image barrier by using ACK
- r600/sfn: use cacheless op for coherent image write
- r600/sfn: use 32 bit bools
- r600/sfn: fix remapping of deleted attributes
- r600/sfn: Use register keep-alive also when scanning the shader
- r600/sfn: Fix the parameter component type
- r600/sfn: Update state docu
- compile/nir: Correct printing dest_type
- r600/sfn: lower bool to int32 only after common optimizations
- r600/sfn: fix component loading from fixed buffer ID

Greg V (1):

- radv,anv: use CLOCK_MONOTONIC_FAST when CLOCK_MONOTONIC_RAW is undefined

Guido Günther (1):

- kmsro: Extend to include imx-dcss

Gurchetan Singh (7):

- virgl: add flags to (\*resource_create) callback
- drm-uapi: virtgpu_drm.h: resource create blob + host visible memory region
- virgl/drm: query for resource blob and host visible memory region
- virgl/drm: add resource create blob function
- virgl: support PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT
- virgl: query blob mem
- virgl: fix stride + layer_stride inconsistency

Hoe Hao Cheng (7):

- zink: generate extension infrastructure using a python script
- zink: hook zink_device_info.py to build system
- zink: use the new extension infrastructure in device creation
- zink: use the new, generated extension infrastructure
- zink: remove old extension infrastructure
- zink: implement pipe_device_reset_callback
- zink: call the reset callback not only during a status check

Hyunjun Ko (4):

- freedreno: support GL_EXT_semaphore
- turnip: Refactor structs of tu_query
- turnip: Support pipeline statistics query
- turnip: Implement VK_EXT_host_query_reset

Iago Toral Quiroga (443):

- v3d/compiler: fix V3D double-rounding of .8 fixed-point XY coordinates
- v3dv: add support for valgrind macros
- v3dv: implement vkCreateInstance
- v3dv: implement vkDestroyInstance
- v3dv: implement vkEnumeratePhysicalDevices
- v3dv: pretend to initialize a physical device
- v3dv: Implement vkGetPhysicalDeviceProperties
- v3dv: retrieve device name from device info
- v3dv: add a comment to clarify how we should implement uuid / deviceID retrieval
- v3dv: implement vkGetPhysicalDeviceMemoryProperties
- v3dv: implement vkGetPhysicalDeviceFeatures
- v3dv: implement vkEnumerateDeviceExtensionProperties
- v3dv: amend vkEnumerateInstanceExtensionProperties to handle layers
- v3dv: implement vkGetPhysicalDeviceQueueFamilyProperties
- v3dv: implement vkCreateDevice
- v3dv: implement vkGetDeviceQueue
- v3dv: add dummy implementations for the packet definition generator
- v3dv: add stubs for the format table and vkGetPhysicalDeviceFormatProperties
- v3dv: add some basic support for format properties
- v3dv: implement vkEnumerate{Instance,Device}LayerProperties
- v3dv: add stub for vkDeviceWaitIdle
- v3dv: implement vkCreateImage
- v3dv: implement vkGetImageMemoryRequirements
- v3dv: initialize mememory heaps in the physical device
- v3dv: implement vkAllocateMemory
- v3dv: implement vkFreeMemory
- v3dv: implement vkMapMemory
- v3dv: implement vkUnmapMemory
- v3dv: implement vkBindImageMemory
- v3dv: implement vkCreateImageView
- v3dv: implement vk{Create,Destroy}Buffer
- v3dv: implement vkGetBufferMemoryRequirements
- v3dv: implement vkBindBufferMemory
- v3dv: implement vkCreateRenderPass
- v3dv: implement vk{Create,Destroy}RenderPass
- v3dv: implement vk{Create,Destroy}Framebuffer
- v3dv: implement vkCreateCommandPool
- v3dv: implement vk{Allocate,Free}CommandBuffers
- v3dv: create a v3dv_bo struct and reference it from v3dv_device_memory
- v3dv: add a concept of a command list
- v3dv: implement vkBeginCommandBuffer
- v3dv: start handling command buffer status
- v3dv: implement vkGetPhysicalDeviceImageFormatProperties
- v3dv: make v3dv_bo_alloc allocate memory for the bo struct
- v3dv: compute tile size for framebuffer
- v3dv: implement vkCmdBeginRenderPass
- v3dv: make the command buffer own the command list BOs
- v3dv: add a few more API stubs
- v3dv: store base mip level in the image view
- v3dv: add the tile state and alloc BOs to the command buffer BO list
- v3dv: revert the decision that the command buffer takes ownership of BOs
- v3dv: implement vkDestroyImage and vkDestroyImageView
- v3dv: make v3dv_layer_offset public
- v3dv: plug leak when destroying device
- v3dv: precompute more tiling info at framebuffer creation time
- v3dv: emit scissor to render area and precompute hw color clear values
- v3dv: emit the render command list
- v3dv: implement vkEndCommandBuffer
- v3dv: create the command buffer BO set before we init CLs
- v3dv: keep track of the numbre of BOs in a command buffer
- v3dv: clear set of BOs in the command buffer on reset
- v3dv: implement vkQueueSubmit
- v3dv: be more conservative resetting command buffer state
- v3dv: setup color clear values at subpass setup time
- v3dv: emit tile loads
- v3dv: flush at the end of each subpass
- v3dv: split framebuffer internal bpp calculations from tiling calculations
- v3dv: rename and make compute_tile_size_for_framebuffer() public
- v3dv: implement vkCmdCopyImageToBuffer
- v3dv: add the concept of a job
- v3dv: implement vkCmdNextSubpass
- v3dv: use the correct miplevel slice for the tile load operation
- v3dv: implement vkCmdPipelineBarrier
- v3dv: do not automatically emit a binner flush when finishing jobs
- v3dv: fix clipping against render area
- v3dv: add a note on interactions between clearing and scissor
- v3dv: rewrite attachment state tracking
- v3dv: only clear attachments on the first subpass that uses them
- v3dv: merge subpasses into the same job when possible
- v3dv: fix tile buffer loading
- v3dv: rewrite the attachment needs clearing condition
- v3dv: create a helper to start a new frame
- v3dv/cmd_buffer: rename render pass RCL emission helpers to be more explicit
- v3dv: handle VK_ATTACHMENT_UNUSED properly in more places
- v3dv: implement vkDeviceWaitIdle
- v3dv: implement vk{Create,Destroy}Semaphore
- v3dv: implement semaphore waits and signals on queue submissions
- v3dv: implement fences
- v3dv: support queue submissions with multiple command buffers
- v3dv: implement vkGetPhysicalDeviceSparseImageFormatProperties
- v3dv: include Vulkan version 1.1 as unsupported.
- v3dv: implement VK_KHR_get_physical_device_properties2
- v3dv: implement VK_KHR_external_memory_capabilities
- v3dv: implement VK_KHR_external_memory{_fd,_dma_buf}
- v3dv: fix copy image to buffer
- v3dv: implement vkGetImageSubresourceLayout
- v3dv: implement DRM modifier setup for WSI
- v3dv: hook up WSI support
- v3dv: implement device detection on actual hardware
- v3dv: allocate winsys BOs properly
- v3dv: rename drm device fields so they are more explict
- v3dv: don't swap RB channels when copying images to buffers
- v3dv: implement support for depth testing
- v3dv: don't always skip tile buffer stores
- v3dv: compute subpass ranges for attachments at render pass creation time
- v3dv: select the depth/stencil buffer from the attachment aspect mask
- v3dv: select correct internal type for depth/stencil formats
- v3dv: support depth testing on combined depth/stencil formats
- v3dv: implement stencil testing
- v3dv: fix indentation
- v3dv: support copying depth/stencil aspects to buffer
- v3dv: fix viewport state from pipeline
- v3dv: implement early Z optimization
- v3dv: clamp stencil masks and reference value to supported limits
- v3dv: implement dynamic stencil states
- v3dv: fix the mess with dynamic state handling
- v3dv: add a helper to compute the hardware clear color
- v3dv: add a helper to get the Z/S buffer from an aspect mask
- v3dv: implement vkCmdClearAttachments
- v3dv: implement indexed draws
- v3dv: fix clockwise primitive setting
- v3dv: ignore image view aspects for depth/stencil attachments
- v3dv: take the number of layers from the framebuffer
- v3dv: Add more supported formats to our format table
- v3dv: don't advertise texel buffer support yet.
- v3dv: implement vkCmdCopyBuffer
- v3dv: implement vkCmdUpdateBuffer
- v3dv: implement vkCmdFillBuffer
- v3dv: move the framebuffer setup code for buffer copy/fill to a helper
- v3dv: add a concept of a fake framebuffer for meta-copy operations
- v3dv: refactor common code in meta copy operations
- v3dv: fix copy size for image to buffer copies
- v3dv: implement vkCmdCopyImage
- v3dv: implement vkCmdClearColorImage
- v3dv: fix buffer automatic stride for image to buffer copies
- v3dv: implement vkCmdClearDepthStencilImage
- v3dv: implement vkCmdCopyBufferToImage for color formats
- v3dv: vkCmdCopyBufferToImage for depth/stencil formats
- v3dv: add an assert to catch applications trying to clear invalid aspects
- v3dv: implement indirect draws
- v3dv: add support for primitive restarts on indexed draw calls
- v3dv: initialize in_sync_bcl in our submits
- v3dv: implement vkResetCommandBuffer
- v3dv: add assertions for
unimplemented fallback paths 21657ec681f3Smrg- v3dv: honor swizzle for non-copy operations of color formats 21667ec681f3Smrg- v3dv: implement vkQueueWaitIdle 21677ec681f3Smrg- v3dv: destroy wsi device during physical device termination 21687ec681f3Smrg- v3dv: implement vk{Create,Destroy}BufferView 21697ec681f3Smrg- v3dv: implement host-side event handling functions 21707ec681f3Smrg- v3dv: adjust a few limits to comply with CTS minimum requirements 21717ec681f3Smrg- v3dv: declare that we support robust buffer access 21727ec681f3Smrg- v3dv: meet requirements for supported format features and properties 21737ec681f3Smrg- v3dv: implement vkResetCommandPool 21747ec681f3Smrg- v3dv: don't swap R/B channels for VK_FORMAT_R5B6G5_UNORM_PACK16 21757ec681f3Smrg- v3dv: don't use TLB path for formats that are not supported for rendering 21767ec681f3Smrg- v3dv: fix image clearing with VK_REMAINING\_\* 21777ec681f3Smrg- v3dv: don't support image formats that we can rendet to or texture from 21787ec681f3Smrg- v3dv: fix fill buffer with VK_WHOLE_SIZE 21797ec681f3Smrg- v3dv: implement vkGetRenderAreaGranularity 21807ec681f3Smrg- v3dv: fix supertile coverage when render are size is 0. 
- v3dv: take memory format from appropriate miplevel for image load/store
- v3dv: fix framebuffer format when computing fragment shader key
- v3dv: fix subpass tracking in the command buffer state
- v3dv: rewrite frame tiling setup
- v3dv: more frame tiling refactors
- v3dv: trivial refactors in a few meta copy helpers
- v3dv: assign driver locations on fragment shader output variables
- v3dv: don't reset loader data on command buffers
- v3dv: drop incorrect assertion
- v3dv: add a no-op fragment shader if we don't have one
- v3dv: implement interpolation qualifiers
- v3d/compiler: implement nir_op_fquantize2f16
- v3dv: call nir_lower_io_arrays_to_elements_no_indirects on vertex shaders
- v3dv: fix incorrect sizing of the vertex attribute state array
- v3dv: split fragment shader array outputs
- v3dv: lower usubborrow and uaddcarry
- v3dv: lower {i,u}mulExtended
- v3dv: don't assume that VkPipelineColorBlendStateCreateInfo is provided
- v3dv: drop incorrect assertion
- v3dv: drop assert for map of a mapped buffer
- v3dv: fix image tiling configuration
- v3dv: fix scissor outside viewport
- v3dv: fix viewport Z
- v3dv: work around viewport Z scale hardware bug
- v3dv: don't leak job allocations
- v3dv: handle the case where we fail to allocate a new job gracefully
- v3dv: only export the last job sync object once
- v3dv: support submits without a command buffer
- v3dv: return OOM error if we fail to import or export sync objects
- v3dv: use vk_error() for all queue/submit errors
- v3dv: fix copies and clears of 3D images
- v3dv: fix depth/stencil clear color
- v3dv: implement color blending
- v3dv: only expose blending on formats that support it
- v3dv: add an 'always flush' mode
- v3dv: always flush draw calls if we are doing sRGB blending
- v3dv: implement dynamic state for blend constants
- v3dv: only emit blend state if the pipeline is dirty
- v3dv: rewrite dirty state handling
- v3dv: drop redundant emission of stencil state
- v3dv: stencil state fixes
- v3dv: only emit config bits and varyings packets if needed
- v3dv: use perp end caps rasterization mode for lines
- v3dv: drop incorrect assertion on number of clear values at render pass begin
- v3dv: disable depth/stencil testing if we don't have a depth/stencil attachment
- v3dv: assert on vkCreateComputePipelines
- v3dv: improve assert handling for fallback paths on meta copy/clear operations
- v3dv: check support for transfer usage flags
- v3dv: make sure we only expose transfer features for formats we can use
- v3dv: use compatible TLB formats if possible during copies and clears
- v3dv: fix incorrect image slice selection
- v3dv: fix clearing of 3D images
- v3dv: fix job subpass index for vkCmdClearAttachments jobs
- v3dv: don't emit the subpass RCL for jobs that have emitted their own
- v3dv: fix a1r5g5b5 format
- v3dv: allow to create shader modules from NIR
- v3dv: improve asserts for VkPipelineColorBlendStateCreateInfo handling
- v3dv: implement partial color attachment clears
- v3dv: implement partial depth/stencil attachment clears
- v3dv: implement proper caching for partial clear pipelines
- v3dv: store the clip window in the command buffer state
- v3dv: check the render area against the clip window
- v3dv: fix v3dv_GetRenderAreaGranularity to account for attachment bpp
- v3dv: don't always assert that we have an active job
- v3dv: use the TLB to clear attachments even if we have an active scissor
- v3dv: restrict render pass clears to the render area
- v3dv: handle stencil load/store operations
- v3dv: assert on subpasses that use input or resolve attachments
- v3dv: push/pop more state during meta operations
- v3dv: create a v3dv_cmd_buffer_subpass_resume helper
- v3dv: set render area for partial clears to match clear rect
- v3dv: compute tile granularity for each subpass
- v3dv: fix incorrect attachment reference
- v3dv: fix incorrect attachment reference
- v3dv: simplify partial clearing code
- v3dv: handle partial clears of just one aspect of combined DS targets
- v3d/compiler: implement nir_intrinsic_load_base_instance
- v3dv: emit instanced draw calls when requested
- v3dv: fix subpass merge tests
- v3dv: reset all state to dirty when we start a new job for a command buffer
- v3dv: implement occlusion queries
- v3dv: submit a no-op job if a command buffer doesn't have any jobs.
- v3dv: simplify handling of no-op jobs
- v3dv: add a bunch of API stubs
- v3dv: implement TFU blits
- v3dv: reset subpass index at render pass end
- v3dv: meta operations can happen outside a render pass
- v3dv: save and restore descriptor state during meta operations if needed
- v3dv: save and restore push constant state during meta operations
- v3dv: implement shader draw fallback for vkCmdBlitImage
- v3dv: require optimal tiling for features that require sampling
- v3dv: move early-Z update to pre-draw
- v3dv: don't leak NIR code in pipelines
- v3dv: don't leak host memory allocated for shader variants
- v3dv: don't leak default pipeline attributes BO
- v3dv: don't leak prog_data from shader variants
- v3dv: don't leak the compiler from the physical device
- v3dv: don't leak the texture shader state BO from image views
- v3dv: don't leak state BO from samplers
- v3dv/blit: fix integer blits from larger to lower bit size
- v3dv: handle miplevel correctly for blits
- v3dv: support depth blits
- v3dv: don't support blitting of combined depth/stencil formats
- v3dv: don't support 1D depth/stencil for transfer sources or sampling
- v3dv: remove incorrect assert
- v3dv: support blits with 1D and 3D images
- v3dv: add framework for private driver objects
- v3dv: fix leaks during recording of meta blits
- v3dv: use the private object framework in the meta clear path
- v3dv: implement fallback for partial image copies
- v3dv: implement stencil aspect blits for combined depth/stencil format
- v3d: fix Tile Rendering Mode Cfg (Color) packet description
- v3dv: limit software integer RT clamp to rgb10a2
- v3dv: handle copies from/to compressed formats
- v3dv: implement partial buffer copies to color images
- v3dv: support blitting both depth and stencil aspects at the same time
- v3dv: implement partial buffer copies to depth/stencil images
- v3dv: always return true from a fallback path if it can handle the case
- v3dv: fix image addressing calculations to account for suballocation
- v3dv: only require 4-byte alignment for linear images
- v3dv: implement partial image to buffer copies
- v3dv: do not rewrite blit spec for combined depth/stencil in get_blit_pipeline
- v3dv: drop blit path for depth/stencil formats
- v3dv: implement depth bias
- v3dv: ignore dynamic updates of depth bounds state
- v3dv: implement wide lines
- v3dv: fix dynamic blend constants
- v3dv: fix the command buffer private object framework for 32-bit
- v3dv: fix depth/stencil clears on hardware
- v3dv: make the driver more robust against OOM
- v3dv: implement events
- v3dv: don't leak BOs from CLs when using BRANCH
- v3dv: fix vkResetCommandPool
- v3dv: make TLB clearing paths return true/false
- v3dv: drop the extra BO handling from the command buffer
- v3dv: remove some unnecessary / unused functions
- v3dv: assert command buffers are executable when submitting to a queue
- v3dv: check that GPU device matches requirements
- v3dv: ensure BCL space is available before emitting packets
- v3dv: handle OOM properly during command buffer recording in more places
- v3dv: fix bogus command buffer allocation scopes
- v3dv: add basic support for secondary command buffers
- v3dv: implement vkCmdWaitEvents for secondary command buffers
- v3dv: support vkCmdClearAttachments in secondary command buffers
- v3dv: don't leak attachment state
- v3dv: add stubs for missing API implementations
- v3dv: warn users that this is not a conformant driver
- v3dv: fix BCL start offset in presence of chained BOs
- v3dv: regen BO lists for CLs inside cloned jobs
- v3dv: fix a few cases where we were ignoring suballocated buffers
- v3dv: fix release build warnings
- v3dv: actually enable early Z
- v3dv: try harder to skip emission of redundant state
- v3dv: add a TFU path for buffer to image copies
- v3dv: add a CPU path for buffer to image copies
- v3dv: try to use TFU path when creating tiled images from linear buffers
- v3dv: always map full BOs
- v3dv: support compute pipelines
- v3dv: handle separate binding points for compute and graphics
- v3dv: implement compute dispatch
- v3dv: handle unsized arrays in SSBOs
- v3dv: always emit index buffer state for new jobs
- v3dv: implement indirect compute dispatch
- v3dv: return a proper error for too large buffer allocations
- v3dv: assert that our framebuffers are single sampled
- v3dv: don't free BOs from imported memory objects
- v3dv: pipeline initialization fixes for disabled rasterization
- v3dv: handle empty set layouts
- v3dv: don't reset descriptor state after a meta operation
- v3dv: lower unpack_{u,s}norm_2x16
- v3dv: lower frexp
- v3dv: implement support for shader spilling
- v3dv: fix GFXH-930 workaround
- v3dv: add workaround for GFXH-1602
- v3dv: improve handling of too large image sizes
- v3dv: handle draw after barrier
- v3dv: fix vkCmdCopyBuffer unaligned TLB access
- v3dv: fix textureSize() for cube arrays
- v3dv: fix srcSubresource description for image to buffer blits
- v3dv: fix blit_shader() to honor the region's aspect mask
- v3dv: handle unnormalized coordinates in samplers
- v3dv: use swizzle X001 with D/S formats
- v3dv: fix regressions for cubemap array load/store
- v3dv: fix color border clamping with specific formats
- v3dv: make sure we emit vertex attributes in location order
- v3d/compiler: support swapping R/B channels in vertex attributes.
- v3dv: handle VK_FORMAT_B8G8R8A8_UNORM vertex attributes
- v3dv: don't support sRGB buffer formats
- v3dv: improve pipeline barrier handling
- v3dv: use a binning sync for CL jobs waiting on a semaphore
- v3dv: ignore stencil load operation if attachment format doesn't have stencil
- v3dv: only use per-buffer clear bit for cases where we are already storing
- v3dv: avoid prime blit path when presenting WSI images
- v3dv: only care about barriers between GPU jobs
- v3dv: emit new shader state if viewport is dirty
- v3dv: only clear depth/stencil attachments if any aspect needs clearing
- v3dv: add a fast path for vkCmdClearAttachments
- v3dv: enable shaderClipDistance
- v3dv: enable fillModeNonSolid
- v3dv: fix dynamic state after meta operation
- v3dv: consider MSAA when computing frame tiling
- v3dv: process VkPipelineMultisampleStateCreateInfo properly
- v3dv: implement subpass multisample rendering and resolve
- v3dv: implement vkCmdResolveImage for whole images
- v3dv: handle multisampled image copies in the TLB path
- v3dv: setup texture shader state correctly for multisampled images
- v3dv: add a blit fallback path for vkCmdResolveImage
- v3dv: handle multisampled image copies with the blit path
- broadcom/compiler: handle gl_SampleMask writes in fragment shaders
- v3dv: amend tile size tables with smallest tile sizes available
- nir/glsl: add a glsl_ivec4_type() helper
- v3dv: fix blitting of signed integer formats
- v3dv: handle multisample resolve of integer formats
- v3dv: handle multisample resolves for formats that don't support TLB resolves
- v3dv: handle multisample image clears
- broadcom/compiler: implement nir_intrinsic_load_sample_pos
- broadcom/compiler: track if the fragment shader forces per-sample MSAA
- v3dv: enable sample rate shading if fragment shader reads gl_SampleID
- v3dv: implement nir_texop_texture_samples
- v3dv: handle multisample rasterization with empty framebuffers
- nir/lower_io: add an option to lower interpolateAt functions
- v3dv: lower interpolateAt functions in NIR and enable sample rate shading
- v3dv: only require texel-size alignment for linear images
- v3dv: fix 3D image blits
- v3dv: don't cache subpass color clear pipelines
- v3dv: move meta init/finish to meta implementation files
- nir: add a nir_get_ubo_size intrinsic
- v3d/compiler: implement nir_intrinsic_get_ubo_size
- v3dv: handle QUNIFORM_GET_UBO_SIZE
- broadcom/compiler: rename QUNIFORM_GET_BUFFER_SIZE to QUNIFORM_GET_SSBO_SIZE
- v3d/compiler: add a lowering pass for robust buffer access
- v3dv: hook up robust buffer access
- v3dv: fix color clear pipeline destruction for 32-bit architectures
- v3dv: handle VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_IMAGE_DRM_FORMAT_MODIFIER_INFO
- v3dv: expose DRM modifiers based on supported features
- v3dv: fix offset computed by vkGetImageSubresourceLayout for array images
- v3dv: fix size computed by vkGetImageSubresourceLayout for 3D images
- v3dv: do not expose VK_IMAGE_USAGE_SAMPLED_BIT for swapchains
- v3dv: signal semaphore/fence if needed after acquiring a swapchain image
- v3dv: fix sampling from stencil aspect of a combined depth/stencil image
- v3dv: honor VkPipelineDepthStencilStateCreateInfo::depthWriteEnable
- v3dv: don't leak dumb BO handles allocated for swapchain images
- v3dv: clean-up after obtaining an XCB connection
- v3dv: free noop job if needed when finishing the queue
- v3d/compiler: allow to batch spills
- v3dv: always program a reasonable internal depth type for copies/clears
- v3dv: only advertise one memory type
- v3dv: flag tmu_dirty_rcl in primaries when linking secondaries that have it set
- v3dv: implement workaround for GFXH-1461
- v3dv: implement workaround for GFXH-1918
- v3dv: fixes for barriers in secondary command buffers
- v3dv: fix blit path for copies from 3D compressed images
- v3dv: generate proper UUIDs for device and driver
- v3dv: limit blit framebuffer dimensions to max coordinates
- v3dv: drop a couple of obsolete comments
- v3dv: fix buffer copies to compressed images on the blit path
- broadcom/compiler: track partially interpolated fragment inputs
- v3d/compiler: implement load interpolated input intrinsics
- v3dv: skip unnecessary tile loads when blitting
- v3dv: fix multi-layered buffer to image copies on the blit path
- v3dv: do not attempt to blit from a linear image source
- v3dv: fix Z coordinate for 3D blits
- v3dv: handle compressed image to buffer copies on the blit path
- v3dv: handle buffer to linear depth/stencil image copies in blit path
- broadcom/cle: fix vec size dump when set to 0
- v3d/compiler: fix BGRA vertex attributes for vec2/float size.
- v3dv: compute swap_rb flag after applying all swizzles
- v3dv: properly describe swap_color_rb
- v3dv: enable the logicOp feature
- v3dv: grow meta descriptor pool dynamically
- v3dv: enable alphaToOne feature
- v3dv: add image view debug checks for VK_KHR_maintenance1
- v3dv: fix base slice selection for copies involving 3D images
- v3dv: update assertion to match VK_KHR_maintenance1 semantics
- v3dv: implement vkTrimCommandPool
- v3dv: expose VK_KHR_maintenance1
- v3dv: add support for timestamp queries
- v3dv: fix occlusion query inheritance in secondary command buffers
- zink: require Vulkan timestamp queries for time query caps
- zink: add VK_STRUCTURE_TYPE_WSI_MEMORY_ALLOCATE_INFO_MESA for WSI allocations
- v3dv: add a v3dv_bo_init helper
- v3dv: expose more features
- zink: fix pNext chain for resource memory allocation

Ian Romanick (34):

- intel/vec4: Silence unused parameter warnings in brw_vec4_generator.cpp
- intel/compiler: Silence unused parameter warning in brw_surface_payload_size
- intel/compiler: Don't fallback to vec4 when scalar GS compile fails \[v2\]
- intel/vec4: Remove inline lowering of LRP
- intel/compiler: Remove INTEL_SCALAR\_... env variables
- intel/vec4: Remove all support for Gen8+ \[v2\]
- intel/vec4: Remove everything related to VS_OPCODE_SET_SIMD4X2_HEADER_GEN9
- i965: Allow viewport array extensions with allow_higher_compat_version
- intel: Silence many unused parameter warnings in blorp_genX_exec.h
- i965: Silence many unused parameter warnings in genX_blorp_exec.c
- i965: Silence many unused parameter warnings in genX_state_upload.c
- i965: Make MOCS index tables static const
- i965: Rename gen10_emit_isp_disable to gen7_emit_isp_disable
- intel: Disable all support for Gen10
- intel/compiler: Remove Gen10-specific code
- i965: Remove Gen10-specific state setup and workarounds
- i965: Don't build Gen10-specific files and libraries
- intel: Remove Gen10-specific cache config code
- intel/isl: Don't generate Gen10-specific functions
- iris: Don't generate Gen10-specific functions
- anv: Don't generate Gen10-specific functions
- intel: Remove Gen10-specific perf support
- intel: Remove Gen10-specific device entries
- i965: Silence unused parameter warnings
- mesa/st: Silence unused parameter warnings in st_context.c
- mesa: Pass the correct caller string to \_mesa_lookup_or_create_texture
- glx: rework \__glXCalculateUsableExtensions to be more readable
- nir: Rename replicated-result dot-product instructions
- mesa: Open-code hash walk in \_mesa_HashPrint
- mesa: Store the atlas Id in the gl_bitmap_atlas structure
- i965: Get the gl_perf_query_object Id from the object
- mesa: Remove the key parameter from the \_mesa_HashWalk callback
- mesa: Remove the key parameter from the \_mesa_HashDeleteAll callback
- intel/compiler: Rotate instructions ROR and ROL cannot have source modifiers

Icecream95 (27):

- panfrost: Fix border colour
- docs/features: Add missing Panfrost extensions
- panfrost: Cleanup panfrost_get_param
- panfrost: Remove old comment on broken depth reload
- panfrost: Correctly set modifier_constant
- panfrost: Separate resource setup and bo creation
- panfrost: Move tiled-linear conversion checking to a new function
- panfrost: AFBC to linear layout conversion
- pan/mdg: Fix spilling of non-32-bit types
- panfrost: Set modifier_constant to true for exported resources
- pan/mdg: Return a bool from midgard_nir_lod_errata
- pan/mdg: Use nir_shader_instructions_pass for nir_lod_errata
- pan/mdg: Use nir_shader_instructions_pass for fdot2 lowering
- Revert "panfrost: Drop implicit blend pooling"
- panfrost: Clamp uniform buffer size
- panfrost: Handle non-positive viewport positions
- panfrost: Remove redundant casts of viewport position
- panfrost: Mark blit shaders as internal
- pan/mdg: Infer whether to disassemble shaders from info.internal
- panfrost: Add a debug flag to disable AFBC
- panfrost: Precise occlusion query support
- panfrost: Only enable occlusion queries when active
- panfrost: Move zs format handling code out of the !afbc case
- panfrost: Z16 depth buffer support
- panfrost: AFBC compress Z16 depth buffers
- panfrost: Fix AFBC blits of resources with faked RGTC
- panfrost: Fix stack shift calculation

Igor V. Kovalenko (1):

- r600: amend space check for chips older than EVERGREEN

Ilia Mirkin (1):

- panfrost: enable DrawTransformFeedback\*

Indrajit Kumar Das (5):

- mesa: add NV_copy_depth_to_color support for nir
- gallium: prepare framework for supporting GL_NV_shader_atomic_int64
- mesa,glsl: add support for GL_NV_shader_atomic_int64
- radeonsi: enable support for GL_NV_shader_atomic_int64
- radeonsi/gfx10: fix stream index for multi-stream overflow query

Italo Nicola (12):

- nir: add shared/global atomics to nir_get_io_offset_src()
- panfrost: fix undefined value access on mir_set_intr_mask()
- panfrost: add atomic_cmpxchg opcode
- panfrost: add LDST_ADDRESS property to atomic ops
- panfrost: introduce LDST_ATOMIC property
- panfrost: add support for src[3\] in LOAD_STORE ops
- panfrost: add atomic ops infrastructure
- panfrost: add support for atomics
- nir/algebraic: fold some nested comparisons with ball and bany
- pan/mdg: remove unused arg from ALU_CHECK_CMP and ALU_CASE_CMP
- pan/mdg: map uabs_i/usub to i/uabsdiff
- pan/mdg: fix LOCAL_STORAGE wls_instances packing

Iván Briano (1):

- anv: restrict number of subgroups per group

James Park (16):

- amd/addrlib: Fix warning list for msvc
- radv: Increased const usage
- util: Hide timespec_passed on Windows
- radv: Only close local_fd when valid
- ac,amd/llvm,radv: Initialize structs with {0}
- util,radv,radv/winsys: Cross-platform rwlock API
- util,ac,aco,radv: Cross-platform memstream API
- util: Fix rwlock Windows include for MinGW
- util/xmlconfig: Disable for Windows like Android
- aco: Clean up some C++ usages
- vulkan/util,vulkan/wsi,radv: Add typed outarray API
- aco: Fix accidental copies, attempt two
- nir: Stabilize compact_components sort
- amd/llvm,aco: Replace VLA with alloca
- radv,radv/winsys: Move RADV_MAX_IBS_PER_SUBMIT
- radv: Fix leak in radv_amdgpu_winsys_destroy()

Jan Beich (1):

- spirv: switch to util_bswap32 to improve portability

Jan Ziak (1):

- Add driver override to envvars.rst

Jason Ekstrand (296):

- iris: no-op implement set_compute_resources
- iris: Implement set_global_binding
- iris: Add support for serialized NIR
- intel/cs_intrinsics: Handle 64-bit intrinsics
- intel/compiler: Allow MESA_SHADER_KERNEL
- iris: Use blob_write_uint32 for num_system_values
- iris: Add a kernel_input_size field for compiled shaders
- iris/disk_cache: Stop assuming stage == cache_id
- iris: Copy dest size from the original intrinsic in setup_uniforms
- iris: Upload kernel inputs with system values
- iris: Add support for MESA_SHADER_KERNEL in the disk cache
- nir: Add and use nir_foreach_block_unstructured helpers
- nir/lower_goto_if: Document some data structures
- nir/lower_goto_if: Clean up ralloc usage
- nir/lower_goto_if: Use util/list instead of exec_list
- nir/lower_goto_if: Rework handling of skip targets
- nir/lower_goto_if: Rework some set union logic
- nir/lower_goto_if: Sort blocks in select_fork
- nir/lower_goto_if: Add a block_for_singular_set helper
- nir/lower_goto_if: Replace a triple loop with a double loop
- nir/lower_goto_if: Add a route::outside set
- nir/lower_goto_if: Add some debug prints
- spirv: Add a MESA_SPIRV_FORCE_UNSTRUCTURED environment variable
- nir/builder: Make nir_get_ptr_bitsize take a nir_shader
- spirv: Don't emit RMW for vector indexing in shared or global
- clover/nir: Stop setting ubo_addr_format
- clover/nir: Stop computing the global address format twice
- clover/nir: Use the correct address mode for shared
- nir: Initialize nir_ssa_def::live_index
- nir/builder: Add a nir_iand_imm helper
- nir/find_array_copies: Handle cast derefs
- nir/large_constants: Handle incomplete derefs
- compiler/types: Allow interfaces in get_explicit_type_for_size_align
- nir/opt_large_constants: Fix a type/deref_type typo
- nir: Add an LOD parameter to image_*_size
- iris: Stop advertising PIPE_SHADER_IR_NIR_SERIALIZED
- iris: Stop advertising clover-only caps
- iris: ref/unref the GLSL type singleton in screen_create/destroy
- iris: Normalize all compute shaders to MESA_SHADER_COMPUTE
- iris: Always re-upload sysvals when we have kernel inputs
- intel/fs: Fix an assert in load_scratch
- intel/nir: Allow splitting a single load into up to 32 loads
- clover/spirv: Don't call llvm::regularizeLlvmForSpirv
- clover: Call clang with -O0 for the SPIR-V path
- nir: Report progress properly in nir_lower_bool_to\_\*
- intel/nir: Pass the nir_builder by reference in lower_alpha_to_coverage
- intel/nir: Rewrite the guts of lower_alpha_to_coverage
- intel/nir: Clean up lower_alpha_to_coverage a bit
- nir: Use a switch in nir_inline_function_impl
- nir: Take a variable remap parameter in nir_inline_function_impl
- intel/fs: Add support for vec8 and vec16 ops
- intel/nir: Lower things with \\> 4 components in lower_mem_access_bit_sizes
- spirv: Support big-endian strings
- spirv: Delete some dead workgroup variable handling code
- nir: Rename num_shared to shared_size
- nir: Improve the comment on num_inputs and friends
- intel/fs: Fix MOV_INDIRECT and BROADCAST of Q types on Gen11+
- nir: Add a new nir_var_mem_constant variable mode
- nir: Add a load_global_constant intrinsic
- nir/lower_io: Use the variable mode for load_scratch_base_ptr checks
- nir/lower_io: Add a build_addr_for_var helper
- nir/lower_io: Add support for nir_var_mem_constant
- nir: Allow opt_large_constants to be run with constant_data_size \\> 0
- spirv: Use nir_var_mem_constant for UniformConstant data in CL
- intel/fs: Implement nir_intrinsic_load_global_constant
- nouveau/nir: Implement load_global_constant
- llvmpipe: Add support for load_global_constant
- clover/nir: Use nir_var_mem_constant for \__constant memory
- spirv: Drop the constant_as_global as option
- nir/lower_explicit_io: Assert that compute address sizes match derefs
- clover: Use 64-bit offsets for shader_in on 64-bit GPUs
- nir/clone: Add a helper for cloning most instruction types
- intel/compiler: Get rid of the global compaction table pointers
- intel/compiler: Get rid of struct gen_disasm
- iris: Use gen_disassemble
- intel/eu: Add some new helpers
- intel/fs,vec4: Stuff the constant data from NIR in the end of the program
- anv: Stop storing the shader constant data side-band
- intel/eu: Include brw_compiler.h in brw_eu.h
- intel/eu: Add a mechanism for emitting relocatable constant MOVs
- intel/fs: Add support for a new load_reloc_const intrinsic
- anv: Properly cache brw_stage_prog_data::relocs
- nir/builder: Add load/store_global helpers
- anv: Patch constant data pointers into shaders with using softpin
- iris: Patch constant data pointers into shaders
- intel/fs: Don't copy-propagate stride=0 sources into ddx/ddy
- intel/fs: Use a single untyped surface read for load_num_work_groups
- intel/nir: Lower load_num_work_groups to 32-bit if needed
- iris: Re-emit push constants if we have a varying workgroup size
- intel/compiler: Handle all indirect lowering choices in brw_nir.c
- nir/lower_indirect_derefs: Add a threshold
- intel/nir: Stop using nir_lower_vars_to_scratch
- nir: Don't bail too early in lower_mem_constant_vars
- clover: Call nir_lower_mem_constant_vars
- compiler/types: Make booleans 32-bit for cl_size/align
- nir/glsl: Add an explicit_alignment field to glsl_type
- nir: Add alignment information to cast derefs
- nir: Handle all array stride cases in nir_deref_instr_array_stride
- nir: Add a helper for getting the alignment of a deref
- nir/lower_io: Apply alignments from derefs when available
- nir/opt_deref: Don't remove casts with alignment information
- nir/opt_deref: Remove restrictive alignment information from casts
- spirv: Add pointer helper vars to OpCopyMemory
- spirv: Propagate alignments to deref chains via casts
- nir: Allow var_mem_global in nir_lower_vars_to_explicit_types
- nir: Allow uniform in nir_lower_vars_to_explicit_types
- clover: Use args.size() to compute new var locations
- spirv: Stop counting inputs in entry_point_wrapper
- clover/nir: Use lower_vars_to_explicit for uniform and global
- spirv: Drop the OpenCL type layout code
- anv: Set alignments on UBO/SSBO root derefs
- compiler/types: Fix deserializing structs with \>= 15 members
- spirv: Improve the "Entry point not found" error message
- spirv2nir: Rework argument handling
- nir/lower_io: Fix the unknown-array-index case in get_deref_align
- nir: Add a dominance validation pass
- spirv: Run repair_ssa if there are discard instructions
- intel/nir: Call validate_ssa_dominance at both ends of the NIR compile
- nir: More NIR_MAX_VEC_COMPONENTS fixes
- nir/idiv_const: Use the modern nir_src_as\_\* constant helpers
- anv: Fix the target_bo assertion in anv_reloc_list_add
- clover: Pull the stride from pipe_transfer for image maps
- spirv: Access qualifiers are not a bitfield
- spirv: Plumb access qualifiers through from image types
- nir: Add a pass for lowering CL-style image ops to texture ops
- intel/fs/swsb: SCHEDULING_FENCE only emits SYNC_NOP
- nir: Rename get_buffer_size to get_ssbo_size
- radeonsi: Only call nir_lower_var_copies at the end of the opt loop
- spirv: vtn_fail with a nice message on unsupported rounding modes
- nir/liveness: Consider if uses in nir_ssa_defs_interfere
- compiler/types: Add glsl_baseN_t_type(bit_size) helpers
- spirv: Use the new types helpers
- nir: Add a new memcpy intrinsic
- nir: Add a lowering pass to lower memcpy
- spirv: Add support for OpCopyMemorySized
- clover/nir: Call the memcpy lowering pass
- nir: Allow creating variables with nir_var_mem_push_const.
- nir/lower_io: Add support for push constants
- anv,radv,tu,val: Call nir_lower_io for push constants
- spirv: Use derefs for push constants
- vallium: Stop using lower_ubo_ssbo_access_to_offsets
- spirv: Delete the legacy offset/index UBO/SSBO lowering
- nir/copy_propagate: Copy-prop into jump conditions
- nir: Disallow goto and goto_if in clone and \[de]serialize
- nir/cf: Better handle intra-block splits
- nir/validate: Improve the validation of blocks
- nir/lower_goto_ifs: Don't destroy SSA form in the process
- nir/dominance: Use \_mesa_set_clear instead ofhand-rolling it
- spirv: Only run repair_ssa if structured
- nir/lower_goto_ifs: Use rzalloc
- nir/lower_goto_ifs: Add asserts for SSA forks
- nir/lower_goto_ifs: Always include level dom_frontiers in prev_frontier
- Revert "nir/lower_goto_if: Add a route::outside set"
- anv: Allow HiZ clears for multi-view
- anv: Use more temp vars in cmd_buffer_begin_subpass
- anv: Skip HiZ and CCS ambiguates which preceed fast-clears
- nir: Split NIR_INTRINSIC_TYPE into separate src/dest indices
- nir: Add a conversion and rounding intrinsic
- nir: Add builder helpers for OpenCL type conversions
- nir: Add a passes for nir_intrinsic_convert_alu_types
- spirv: Add some conversion handling helpers
- spirv: Handle all OpenCL conversion ops with full rounding
- spirv/opencl: Drop dest_type from handle_v_load_store
- clover/nir: Call nir_lower_convert_alu_types
- nir: Add lowering from regular ALU conversions to the intrinsic
- intel/fs: NoMask initialize the address register for shuffles
- nir: Fix a misspelling
- nir/find_array_copies: Properly discard copies for casts
- nir: Handle memcpy in copy_prop_vars and combine_stores
- nir: Add a memcpy optimization pass
- nir/opt_load_store_vectorize: Use bit sizes when checking mask compatibility
- nir: Add component mask re-interpret helpers
- nir/opt_deref: Add an instruction type switch
- nir/opt_deref: Add an optimization for bitcasts
- nir: Add a pass to lower vec3s to vec4s
- intel/fs: Don't use NoDDClk/NoDDClr for split SHUFFLEs
- iris: Fix the constant data address calculation
- anv: Implement VK_EXT_transform_feedback on Gen7
- spirv: Make the clc_shader const
- nir/constant_folding: Use the builder
- nir/constant_folding: Use nir_shader_instruction_pass
- nir: Validate constant initializers
- nir/constant_folding: Fold load_deref of nir_var_mem_constant
- iris: Add pipe-loader support
- iris: Handle runtime-specified local memory size
- iris: Add support for load_work_dim as a system value
- iris: Fill out compute caps and enable clover support
- gallium/pipe: Add a GALLIUM_PIPE_SEARCH_DIR override env var
- util/xxd.py: Add an option for binary files
- spirv: Add a shared libclc loader
- spirv: Move nir_lower_libclc to src/compiler/spirv
- intel/nir: Don't try to emit vector load_scratch instructions
- intel/nir: Lower load_global_constant in lower_mem_access_bit_sizes
- i965: Take an isl_format in emit_buffer_surface_state
- intel/fs: Add an alignment to VARYING_PULL_CONSTANT_LOAD_LOGICAL
- intel/fs: Add an option to use dataport messages for UBOs
- anv: Add a device parameter to format_for_descriptor_type
- anv: Use format_for_descriptor_type for descriptor buffers
- anv: Plumb the device into \*bits_for_access_flags
- anv: Use the data cache for indirect UBO pulls on Gen8+
- iris: Use the data cache for indirect UBO pulls
- clover: Stop leaking NIR shaders
- nir/opt_deref: Fix the vector bitcast optimization
- nir: Allow more deref modes in phis
- intel/batch_decoder: Don't clame vec4 vs/gs/tcs shaders on Gen11+
- intel/fs: Copy the PTSS from g0 for scratch reads/writes
- intel/fs: Add a SCRATCH_HEADER opcode
- intel/fs/ra: Increment spill_offset as part of the emit_spill loop
- intel/fs/ra: Refactor handling of Gen7 scratch reads
- intel/fs/ra: Store the last non-spill VGRF node
- intel/fs/ra: Sanity-check our IP counts
- intel/fs/ra: Use a set to track added spill/fill instructions
- intel/fs: Rework scratch handling on Gen9+
- intel/fs: Allow constant-propagation into SAMPLEINFO and IMAGE_SIZE
- anv: Go back to using the sampler for UBO pulls
- Revert "iris: Use the data cache for indirect UBO pulls"
- anv: Bump the number of update-after-bind descriptors to 1M
- anv: Add a descriptor_count to descriptor sets
- anv: Implement VariableDescriptorCount
- iris: Flush caches based on brw_compiler::indirect_ubos_use_sampler
- anv,iris: Use the data cache for UBO pulls on Gen12+
- spirv: Add 0.5 to integer coordinates for OpImageSampleExplicitLod
- nir/lower_io: Assert non-zero power-of-two alignments
- compiler/types: Assert non-zero alignments in get_explicit_type_for_size_align
- compiler/types: Allow images and samplers in get_explicit_type_for_size_align
- clover/nir: Calculate sizes of images and samplers properly
- clover/nir: Add an image lowering pass
- spirv: Fix OpCopyMemorySized
- nir/lower_memcpy: Don't mask the store
- docs: Specify when branch points happen
- nir/validate: Explain why we don't use nir_foreach_block
- mesa/spirv: Lower variable initializers for global variables
- nir/builder: Add a nir_ieq_imm helper
- nir/phis_to_scalar: Use a deny-list for load_deref modes
- nir: Handle incomplete derefs in split_struct_vars
- nir: Use var->data.mode instead of deref->mode in a few cases
- nir: Disallow writes to system values and mem_constant
- nir/opt_find_array_copies: Allow copies from mem_constant
- nir: Add and use some deref mode helpers
- nir/lower_array_deref_of_vec: Use nir_deref_mode_must_be
- nir/lower_io: Use nir_deref_mode\_\* helpers
- nir/phis_to_scalar,gcm: Use nir_deref_mode_may_be
- nir: Only force loop unrolling if we know it's a in/out/temp
- nir/vars_to_ssa: Use nir_deref_must_be
- nir/vec3_to_vec4: Use nir_deref_must_be
- nir: Use nir_deref_mode_may_be in deref optimizations
- nir/find_array_copies: Prepare for generic pointers
- nir/split_*_vars: Prepare for generic pointers
- nir: Make nir_deref_instr::mode a bitfield
- nir: Add support for generic pointers
- spirv: Add generic pointer support
- nir/opt_deref: Add a deref mode specialization optimization
- nir/opt_deref: Add an optimization for deref_mode_is
- nir/lower_io: Add a mode parameter to build_addr_iadd
- nir/lower_io: Add a mode parameter to addr_format_is\_\*
- nir/lower_io: Add support for 32/64bit_global for shared
- nir/lower_io: Add support for lowering deref_mode_is
- nir/lower_io: Support generic pointer access
- nir/lower_io: Add a new 62bit_generic address format
- nir/opt_intrinsics: Report progress for the gl_SampleMask optimization
- nir/constant_folding: Use a switch in try_fold_intrinsic
- nir/constant_folding: Use the standard variable naming convention
- nir: Move constant folding of vote to opt_constant_folding
- nir/constant_folding: Fold subgroup shuffle intrinsics
- nir/opt_intrinsics: Refactor a bit
- nir/opt_intrinsic: Optimize bcsel(b, shuffle(x, i), shuffle(x, j))
- nir/find_array_copies: Don't assume all children exist
- nir/deref: Fix a typo
- spirv: Add basic plumbing for ray-tracing capabilities
- spirv: Remove a redundant vtn_fail_if
- spirv: Add a guard for OpTypeForwardPointer storage classes
- spirv: Pass the deref type to storage_class_to_mode for non-forward pointers
- spirv: Add support for OpTypeAccelerationStructureKHR
- spirv,nir: Add support for ray-tracing built-ins
- nir/builder: Add a select_from_ssa_def_array helper
- nir: Add intrinsics for object to/from world RT sysvals
- nir: Add new variable modes for ray-tracing
- spirv: Implement the new ray-tracing storage classes
- nir,spirv: Add support for the ShaderCallKHR scope
- spirv,nir: Add ray-tracing intrinsics
- nir: Handle ray-tracing intrinsics and storage classes in copy-prop etc.
- spirv: Update headers and metadata from latest Khronos commit
- nir: Print formats on image intrinsics as text
- nir: Validate image atomic formats
- util,gallium: Add new 64-bit integer formats
- compiler/types: Add 64-bit image types
- nir: Allow 64-bit image atomics
- spirv: Add support for SPV_EXT_shader_image_atomic_int64
- nir/lower_bit_size: Don't cast comparison results
- nir/lower_bit_size: Pass a nir_instr to the callback
- nir/lower_bit_size: Add support for lowering subgroup ops
- intel/nir: Refactor lower_bit_size_callback
- intel/nir: Lower 8-bit scan/reduce ops to 16-bit
- intel/nir: Lower 8-bit ops to 16-bit in NIR on Gen11+
- intel/fs: Fix use of undefined value in fixup_nomask_control_flow
- spirv: Call repair SSA for OpTerminateInvocation

Jesse Natalie (61):

- nir: nir_range_analysis needs to be updated for vec16
- u_debug_stack_test: Fix MSVC compiling by using ATTRIBUTE_NOINLINE
- util/macros: Add ATTRIBUTE_NOINLINE definition for MSVC
- glsl: Add 'bare' shadow sampler type
- nir: Fix serialize/deserialize of void samplers/images
- nir: Optimize mask+downcast to just downcast
- nir: Add nir_address_format_32bit_offset_as_64bit
- nir: Add nir_address_format_32bit_index_offset_pack64
- nir/vtn: CL SPIR-V callers should specify address modes
- mesa: Move ATTRIBUTE_NOINLINE for glsl_to_tgsi_visitor::visit_expression for MSVC
- nir: Add fisnormal op
- nir/vtn: Support SpvOpIsNormal via fisnormal
- nir: Add fisfinite op
- nir/vtn: Support SpvOpIsFinite via fisfinite
- nir/vtn: Handle LessOrGreater deprecated opcode
- nir/vtn: Support OpOrdered and OpUnordered opcodes
- nir/glsl: Add glsl_get_cl_type_size_align helper
- nir: Use 'unsigned' instead of enum types in nir_variable::data
- wgl: Switch to Win10 version defines to enable usage of Win10 WGL callbacks
- nir: Populate some places where existing system values were missing
- nir: Add new system values and intrinsics for dealing with CL work offsets
- nir: Move compute system value lowering to a separate pass
- nir: Add options to nir_lower_compute_system_values to control compute ID base lowering
- spirv: Use new global invocation offset system value
- nir: Add a lowering pass to split 64bit phis
- nir: Relax opt_if logic to prevent re-merging 64bit phis for loop headers
- nir_lower_bit_size: Support lowering ops with differing source/dest sizes
- nir: Implement mul_high lowering for bit sizes other than 32
- nir: Remove 32bit restriction for uadd_carry optimization
- nir: Add bit_count to lower_int64 pass
- nir/vtn: SPIR-V bit count opcodes (core and extension) dest size mismatches nir
- clover/nir/spirv: Use uniform rather than shader_in for kernel inputs
- nir/vtn: Add type constant to image intrinsics
- nir/vtn: Add support for kernel images to SPIRV-to-NIR.
- nir/vtn: Use return type rather than image type for tex ops
- nir/vtn: Handle integer sampling coordinates
- nir/vtn: ImageSizeLod op can be applied to images
- nir/vtn: Add intrinsics for CL image format/order queries
- nir/vtn: Convert constant samplers to variables with data
- nir_dominance: Use uint32_t instead of int16_t for dominance counters
- nir: More NIR_MAX_VEC_COMPONENTS fixes
- spirv: Handle OpTypeOpaque
- glsl_type: Add packed to structure type comparison for hash map
- nir_lower_system_values: Fix load_global_invocation_id to use base_work_group_id even with no base_global id
- nir: Add an internal flag to shader_info
- nir: Add glsl_base_type unsigned -\> signed version helper
- nir/vtn: Add handling for SPIR-V event variables
- vtn/opencl: Rework type handling for CL extension opcodes
- vtn/opencl: Add infrastructure for calling out to libclc
- vtn/opencl: Implement a lot of opcodes via libclc
- vtn/opencl: Rework handle_instr to be able to handle core SPIR-V opcodes via libclc
- vtn/opencl: Hook up OpenCL async copy and group wait opcodes via libclc
- vtn/opencl: Switch non-native trig to use libclc
- vtn/opencl: Switch exp/pow/log to use libclc
- vtn/opencl: Switch division-related ops to use libclc
- vtn/opencl: Switch some nir-sequence ops to use libclc
- vtn/opencl: Only use libclc ldexp when lower_ldexp is set
- vtn/opencl: Switch fma to conditionally use libclc for 32bit floats
- spirv: Implement vload[a]_half[n\] and vstore[a]_half[n][_r\]
- util: Move xxd.py to util
- util: Make xxd.py output char array instead of string

John Bates (1):

- disk_cache: build option for disabled-by-default

Jonathan Gray (13):

- util: unbreak endian detection on OpenBSD
- util/anon_file: add OpenBSD shm_mkstemp() path
- meson: build with \_ISOC11_SOURCE on OpenBSD
- meson: don't build with USE_ELF_TLS on OpenBSD
- meson: conditionally include -ldl in gbm pkg-config file
- util: futex fixes for OpenBSD
- util/u_thread: include pthread_np.h if found
- anv: use os_get_total_physical_memory()
- util/os_misc: add os_get_available_system_memory()
- anv: use os_get_available_system_memory()
- util/os_misc: os_get_available_system_memory() for OpenBSD
- radv: remove seccomp includes
- vulkan: make VK_TIME_DOMAIN_CLOCK_MONOTONIC_RAW_EXT conditional

Jonathan Marek (57):

- panfrost: add missing dependency on midgard_pack.h
- util/format: expose generated format packing functions through a header
- turnip: implement VK_EXT_custom_border_color
- turnip: remove dead tu_minify/typed_memcpy functions
- turnip: delete a blit_image TODO that has already been resolved
- turnip: fix CmdBlitImage with D32_SFLOAT_S8_UINT
- turnip: rework format_to_ifmt
- turnip: call packing functions directly for pack_gmem_clear_value
- turnip: add missing tu_bo_list_add in CmdWriteTimestamp
- freedreno/ir3: remove indirect input load
- freedreno/ir3: improve handling of aliased inputs
- freedreno/ir3: rework setup_{input,output} to make struct varyings work
- freedreno/regs: add 7nm DSI PHY/PLL regs
- turnip: delete tu_physical_device path field
- turnip: delete unused tu_image fields
- turnip: fix the type of tu_shader_module code field, delete unused sha1
- turnip: delete unused "tu_cmd_buffer_upload"
- turnip: remove some unnecessary regs init
- turnip: rework vertex buffers draw state handling
- turnip: device global bo list
- turnip: avoid heap allocations in QueueSubmit when semaphores are used
- freedreno/ir3: allow layer/viewport output for VS/GS/DS
- freedreno/ir3: add view_zero to shader key
- turnip: multiViewport and VK_EXT_shader_viewport_index_layer
- vulkan/wsi/display: add option for display fence to signal syncobj
- turnip: delete unused tu_fence_signal function
- turnip: add a fd field to tu_device
- turnip: require syncobj support
- turnip: rework fences to use syncobjs
- radv: fix incorrect ResetFences path for WSI fence
- radv: use syncobj for wsi fence
- turnip: fix wrong indentation in tu6_draw_common
- turnip: move A6XX_RB_ALPHA_CONTROL write to init_hw
- turnip: implement VK_EXT_extended_dynamic_state
- turnip: remove unused cmd_buffer/device arguments in descriptor sets
- turnip: delete unused/broken pipeline layout hashing code
- turnip: initial implementation of VK_KHR_push_descriptor
- turnip: clean up tu_device_memory
- turnip: always create permanent syncobj for semaphore
- turnip: set MSM_SUBMIT_SYNCOBJ_RESET for submit pWaitSemaphores
- turnip: semaphores simplification (only syncobj semaphores supported)
- turnip: rework GetSemaphoreFdKHR
- turnip: rework ImportSemaphoreFdKHR
- turnip: remove remaining uses of drmSyncobj helpers
- turnip: share code between semaphores/fences + fence import/export
- turnip: signal fence and semaphore in AcquireNextImage2KHR
- turnip: implement legacy API functions separately
- freedreno/cffdec: fix decoding of bindless descriptors
- turnip: remove pre-emption marker
- turnip: implement timestamp fences/semaphores for kgsl backend
- turnip: rework android gralloc path so it doesn't call tu_image_create
- turnip: don't implement CreateImage as two separate functions
- turnip: LAYOUT_PREINITIALIZED is not different for optimal tiling
- turnip: remove useless tu_image asserts
- turnip: remove unnecessary/redundant tu_image fields
- turnip: don't always fallback to linear for mutable formats
- turnip: enable VK_EXT_image_drm_format_modifier

Jordan Justen (4):

- anv, iris: Set MediaSamplerDOPClockGateEnable for gen12+
- anv: Drop warning about gen12 not being supported
- intel/dev: Add device info for ADL-S
- intel/mi_builder: Support gen11 command-streamer based register offsets

Jose Maria Casanova Crespo (7):

- vc4: Avoid negative scissor caused by no intersection
- nir/algebraic: optimize iand/ior of (n)eq zero when umax/umin not available
- vc4: Enable lower_umax and lower_umin
- vc4: enable lower_isign for VC4
- vc4: Add missing load_ubo set_align in yuv_blit fs.
- vc4: Add missing range_base/range at nir_load_ubos in yuv_blit fs.
- vc4: Enable nir_lower_io for uniforms

Joshua Ashton (1):

- zink: Fix 32-bit compilation

Juan A. Suarez Romero (7):

- intel: split driver/device UUID generators
- iris: plumb device/driver UUID generators
- intel/uuid: use git-sha1/package for the driver UUID
- st/mesa: initialize lower alpha func to ALWAYS
- v3d/compiler: extend swapping R/B support to all vertex attributes
- v3dv: mark the right bit to swap R/B vertex attributes
- v3d: Add GL_ARB_vertex_array_bgra support

Julian Winkler (1):

- nir: Add a structurizer

Karol Herbst (48):

- util/set: add \_mesa_set_intersects
- spirv: rename vtn_emit_cf_list to vtn_emit_cf_list_structured
- nir: Add a structured flag to nir_shader
- nir: Add goto_if jump instruction
- spirv: extract switch parsing into its own function
- spirv: parse unstructured CFG
- clover/nir: fix mem_shared by using address_format_32bit_offset
- nv50/ir/nir: fix smem size
- nv50/ir/nir: rework indirect function_temp handling
- clover/nir: Call vars_to_explicit_types for shared memory
- nve4: fix uploading unaligned sized input buffers
- nv50/ir/nir: assert on unknown alu ops
- clover/nir: support int64 atomics if the device supports it
- nv50/ir/nir: fix global_atomic_comp_swap
- nvc0: handle nr being 0 in nvc0_set_global_bindings
- nv50/ir/nir: support load_work_dim
- clover/spirv: rework handling of spirv extensions
- clover/spirv: pass list of supported extensions to the translator
- nir: rename nir_op_fne to nir_op_fneu
- nir: fix nir_variable_create for kernels
- clover/nir: add support for global invocation id offsets
- nv50/ir: remove symbol table support for compute shaders
- nv50/ir: add nv50_ir_prog_info_out
- nir: use nir_var_all to get rid of casting
- util: add helpers to define bitwise operators on enums for C++
- nir: use enum operator helper for nir_variable_mode and nir_metadata
- clover/nir: Lower function_temp to scratch.
- nv50/ir: fix cas lowering for 64 bit
- clover/nir: use offset for temp memory
- clover/llvm: undefine \__IMAGE_SUPPORT_\_ for devices without image support
- nvc0/ir: fix load propagation for sub 4 byte addressing
- spirv: fix 64 bit atomic inc and dec
- nvc0/cl: hande 64 bit pointers in nvc0_set_global_handle
- clover/spirv: fix vec3 alignment
- nir/serialize: fix serialization of system values
- clover/util: add id_type_equals to support symbols with multiple sections
- clover: bind constant buffer if one is provided
- clover/nir: extract constant buffer into its own section
- clover/spirv: parse arg_info
- clover/spirv: support CL_KERNEL_COMPILE_WORK_GROUP_SIZE
- clover: use pipe_image_view for images instead of set_compute_resources
- clover: support custom driver strides
- clover/device: use PIPE_MAX_SHADER_SAMPLER_VIEWS for max_images_read
- clover/nir: set kernel_image cap
- nouveau: hide SVM support behing a variable for now as kernel space is broken
- nvc0/CL: enable images
- llvmpipe: enable CL images
- nv50/ir/nir: don't use designated initializers

Kenneth Graunke (15):

- iris: Fix headerless sampler messages in compute shaders with preemption
- nir: Copy semantics to nir_intrinsic_load_fs_input_interp_deltas
- nir: Move new edgeflag assert into the io_lowered case
- iris: Reorder the loops in iris_fence_await() for clarity.
- iris: Drop stale syncobj references in fence_server_sync
- Revert "nir: replace lower_ffma and fuse_ffma with has_ffma"
- intel/compiler, anv: Delete cs_prog_data->slm_size
- iris: Fix doubling of shared local memory (SLM) sizes.
- anv: Set only one ISL usage bit (RT/texture) for CopyBuffer sources
- isl, anv, iris: Add a centralized helper to select MOCS based on usage
- isl: Enable Tigerlake HDC:L1 caches via MOCS in various cases.
- iris: fix source/destination layers for 3D blits
- iris: Move blit scissoring earlier.
- intel/fs: Fix sampler message headers on Gen11+ when using scratch
- nir/algebraic: Avoid creating new fp64 ops when using softfp64

Khem Raj (1):

- vc4: use intmax_t for formatted output of timespec members

Kristian Høgsberg (12):

- egl/android: Call createImageFromDmaBufs directly
- egl/android: Look up prime fds in droid_create_image_from_prime_fds()
- egl/android: Drop unused ctx argument
- egl/android: Simplify droid_create_image_from_name() path
- egl/android: Move droid_create_image_from_prime_fds() function up
- egl/android: Use droid_create_image_from_prime_fds() in get_back_bo()
- egl/android: Add support for CrOS buffer info perform op
- turnip: Add kgsl backend
- util/formats: Add PIPE_FORMAT_R8_G8B8_420_UNORM
- st/mesa: Add NV12 lowering to PIPE_FORMAT_R8_G8B8_420_UNORM
- freedreno/a6xx: Generalize pointers in struct fd6_pipe_sampler_view
- freedreno/a6xx: Support PIPE_FORMAT_R8_G8B8_420_UNORM for texturing

Krunal Patel (2):

- gallium/auxiliary/vl: Odd Dimensions are failing
- radeon/vcn: Bitrate not updated when changing framerate

Leo Liu (2):

- frontends/omx/dec: Use the known codec profile when allocating buffers
- frontends/omx/h265: Check the pps set before the scaling data

Lepton Wu (1):

- util/ralloc: fix ralloc alignment.

Lionel Landwerlin (36):

- anv: fix incorrect realloc failure handling
- intel/dump_gpu: only write BOs mapped by the driver
- intel/dump_gpu: further track mapping of BOs
- intel/dump_gpu: set default device_override
- intel/dump_gpu: add an only-capture option
- intel/dump_gpu: only map in GTT buffers not previously mapped
- anv: track the current frame and write it into the driver identifier BO
- intel/dump_gpu: fix --platform option
- intel/dump_gpu: add an option to capture a single frame
- anv: centralize vk to gen arrays
- anv: fix up dynamic clip emission
- anv: don't fail userspace relocation with perf queries
- intel/perf: store query symbol name
- intel/perf: fix raw query kernel metric selection
- anv: fix transform feedback surface size
- anv: move push constant allocation tracking into gfx pipeline state
- anv: simplify push constant emissions
- anv: VK_INTEL_performance_query interaction with VK_EXT_private_data
- anv: fix robust buffer access
- include/drm-uapi: bump headers
- anv: add new gem/drm helpers
- anv: implement shareable timeline semaphores
- intel/genxml: make sure test assert are compiled in
- intel/compiler: fixup Gen12 workaround for array sizes
- vulkan: bump headers/registry to 1.2.154
- anv: implement VK_KHR_copy_commands2
- intel/perf: fix crash when no perf queries are supported
- intel/dev: add a small non installable tool to print device info
- intel/dev: fix 32bit build issue
- genxml: drop gen10
- blorp: identify copy kernels in NIR
- blorp: allow blits with floating point source layers
- anv: fix source/destination layers for 3D blits
- anv: report latest extension spec versions
- intel/dev: Bump Max EU per subslice/dualsubslice
- anv: fix descriptor pool leak in VMA object

Louis Li (1):

- radeon/radeon_vce: fix out of target bitrate in CBR mode (H.264)

Louis-Francis Ratté-Boulianne (6):

- st/mesa: factor ucp-lowering logic into helper
- st/mesa: Enable clip planes lowering for geometry shaders
- pipebuffer: Remove unused buffer event in slab bufmgr
- st/mesa: Replace UsesStreams by ActiveStreamMask for GS
- glsl/linker: Add support for XFB varying lowering in geometry shader
- gallium: Fix NIR validation when lowering polygon stipple

Lucas Stach (19):

- etnaviv: stop leaking the dummy texure descriptor BO
- gallium/dri: allow create image for formats that only support SV or RT binding
- etnaviv: drm: fix BO refcount race
- etnaviv: blt: properly program surface TS offset for clears
- etnaviv: update headers from rnndb
- etnaviv: tex_desc: fix TS compression enable
- etnaviv: cosmetic etna_resource_alloc fixes
- etnaviv: do proper cpu prep/fini when clearing allocated buffer
- etnaviv: simplify etna_screen_bo_from_handle
- etnaviv: pass correct layout to etna_resource_alloc for scanout resources
- etnaviv: don't import allocated scanout resources via from_handle
- Revert "gallium/dri: fix dri2_from_planar for multiplanar images"
- etnaviv: emit RA_EARLY_DEPTH on dirty ZSA
- etnaviv: flush depth cache when changing depth config
- etnaviv: update headers from rnndb
- etnaviv: expose shader discard usage in etna_shader_variant
- etnaviv: rework ZSA into a derived state
- gallium: document convention for get_handle calls on multi-planar resources
- etnaviv: fix disabling of INT filter for real

Lukas F. Hartmann (1):

- etnaviv: Fix disabling early-z rejection on GC7000L (HALTI5)

Marcin Ślusarz (50):

- intel/perf: fix calculation of used counter space
- intel/perf: fix how pipeline stats are stored
- intel/perf: streamline error handling in read_oa_samples_until
- intel/perf: fix performance counters availability after glFinish
- intel/perf: split load_oa_metrics
- intel/perf: export performance counters sorted by \[group|set\] and name
- glsl: fix crashes on out of bound matrix access using constant index
- gitlab: ask for more detailed info about GPU
- mesa: fix formatting of messages printed using \_mesa_log
- anv: refresh cached current batch bo after emitting some commands
- iris: handle os_dupfd_cloexec failure
- iris: verify color component width in convert_fast_clear_color
- i965: verify format width in blorp_get_client_bo
- intel/perf: don't generate logically dead code
- intel/compiler/test: use TEST_DEBUG env var consistently
- intel/compiler: mark debug constant as const
- intel/fs,vec4: remove unused assignments
- intel: add INTEL_DEBUG=shaders
- intel/fs: add hint how to get more info when shader validation fails
- intel/compiler: match brw_compile\_\* declarations with their definitions
- intel/compiler: use the same name for nir shaders in brw_compile\_\* functions
- intel/compiler: move extern C functions out of namespace brw
- intel/compiler: print dispatch width when shader fails to compile
- intel/compiler: fix typo in a comment
- anv: fix minor gen_ioctl(I915_PERF_IOCTL_CONFIG) error handling issue
- intel/compiler: remove unused fs_validator::param_size
- intel/compiler: initialize remaining fields of various classes
- intel/tools: fix possible memory leak in the error path
- intel/tools: handle ftell errors
- intel/compiler: quiet Coverity warnings
- intel/tools: fix possible randomly increased verbosity of error2aub
- intel: add INTEL_DEBUG expected value in declaration
- iris: drop likely/unlikely around INTEL_DEBUG
- i965: drop likely/unlikely around INTEL_DEBUG
- anv: drop likely/unlikely around INTEL_DEBUG
- intel: drop likely/unlikely around INTEL_DEBUG
- vulkan/wsi: fix possible random stalls in wsi_display_wait_for_event
- intel/tools: fix invalid type in argument to printf
- intel/genxml: don't generate identical code for different branches
- anv: always annotate memory returned from anv_gem_mmap
- intel: remove dead code
- i965: remove prototypes of not-existing functions
- intel/compiler: use C++ template instead of preprocessor
- intel/compiler: remove branch weight heuristic
- intel/tools: allow --color option to be used without arg
- anv: remove dead code from anv_create_cmd_buffer
- intel/tools: handle some failures
- intel/tools: refactor logging to be easier to follow by static analyzers
- intel/tools: add missing new lines to few remaining fail_if users
- nir: handle float atomics in copy propagation pass

Marek Olšák (278):

- radeonsi: enable ETC2 hw acceleration on Raven2
- ac/gpu_info: set num_tiles_pipes on gfx10+ too
- Revert "radeonsi: honor a user-specified pitch on gfx10.3"
- radeonsi: use correct wave size in gfx10_ngg_calculate_subgroup_info
- radeonsi: use the same units for esgs_ring_size and ngg_emit_size
- radeonsi: increase minimum NGG vertex count requirement per workgroup on gfx 10.3
- radeonsi: fix applying the NGG minimum vertex count requirement
- radeonsi: don't count unusable vertices to the NGG LDS size
- radeonsi: add a common function for getting the size of gs_ngg_scratch
- radeonsi: remove the NGG hack decreasing LDS usage to deal with overflows
- radeonsi: various fixes for gfx10.3
- radeonsi: disable NGG culling on gfx10.3 because of hangs
- radeonsi: fix compute-based culling with VERTEX_COUNTER_GDS_MODE == 1
- compiler: add glsl_print_type
- nir: remove nir_strip stub declaration
- nir: handle load_input_vertex in nir_get_io_offset_src
- nir: save IO semantics in lowered IO intrinsics
- nir: gather all IO info from IO intrinsics
- nir: update IO semantics in nir_io_add_const_offset_to_base
- nir: print IO semantics (v2)
- nir: properly identify texcoords for lowered IO in nir_lower_drawpixels
- nir: add shader_info::io_lowered
- nir: add interpolation qualifiers for color sysvals into shader_info
- nir: generate lowered IO in nir_lower_passthrough_edgeflags
- st/mesa: don't pass NIR to draw module if IO is lowered
- st/mesa: don't generate NIR for ARB_vp/fp if NIR is not preferred
- st/mesa: handle lowered IO in st_nir_assign_vs_in_locations
- gallium/tgsi: add helper tgsi_get_interp_mode
- radeonsi: fix tess levels coming as scalar arrays from SPIR-V
- st/mesa: remove useless code for lowered IO in st_nir_assign_vs_in_locations
- gallivm: fix build on LLVM 12 due to LLVMAddConstantPropagationPass removal
- amd/registers: expose the canonicalize.py program as a function
- amd/registers: sort registers by offset in json
- amd/registers: add a script that generates json from kernel headers
- amd/registers: add non-gfx10 register files generated from kernel headers
- amd/registers: switch to new generated register definitions
- nir: fix a bug in is_dual_slot in nir_io_add_const_offset_to_base
- st/mesa: fix lowered IO - don't call st_nir_assign_vs_in_locations twice
- radeonsi: don't crash if input_usage_mask is 0 for a VS input
- radeonsi: get color interpolation info from shader_info
- radeonsi: clean up code for loading VS inputs
- ac/nir: handle all lowered IO intrinsics
- radeonsi: lower IO intrinsics - complete rewrite of input/output scanning
- radeonsi: remove in/out/uniform variables from NIR after lowering IO
- radeonsi: don't lower indirect IO in GLSL
- radeonsi: don't execute LDS stores for TCS outputs that are never read
- radeonsi: simplify handling color interp modes in si_emit_spi_map
- radeonsi: change PIPE_SHADER to MESA_SHADER (si_shader_selector::type)
- radeonsi: change PIPE_SHADER to MESA_SHADER (si_shader_context::type)
- radeonsi: change PIPE_SHADER to MESA_SHADER (debug flags)
- radeonsi: change PIPE_SHADER to MESA_SHADER (si_compile_llvm)
- radeonsi: change PIPE_SHADER to MESA_SHADER (si_get_shader_part)
- radeonsi: remove unused si_shader_context::type
- radeonsi: change PIPE_SHADER to MESA_SHADER (si_shader_dump_disassembly)
- radeonsi: precompute si_*_descriptors_idx in si_shader_selector
- radeonsi: change PIPE_SHADER to MESA_SHADER (si_dump_descriptors)
- radeonsi: remove si_shader_selector::type
- compiler: add INTERP_MODE_COLOR for radeonsi
- radeonsi: replace TGSI_INTERPOLATE with INTERP_MODE
- radeonsi: replace TGSI_SEMANTIC with VARYING_SLOT and FRAG_RESULT
- radeonsi: optimize out the loop in si_get_ps_input_cntl
- ac/llvm: fix unaligned VS input loads on gfx10.3
- nir: get ffma support from NIR options for nir_lower_flrp
- nir/algebraic: trivially enable existing 32-bit patterns for all bit sizes
- nir/algebraic: add 16-bit versions of a few 32-bit patterns
- glsl_to_nir: fix crashes with int16 shifts
- radeonsi: remove redundant no-signed-zero-fp-math LLVM attribute
- radeonsi: move nir_shader_compiler_options into si_screen
- Revert "ac: generate FMA for inexact instructions for radeonsi"
- ac/llvm: remove stub prototype for fmed3
- ac/llvm: fix amdgcn.rcp for v2f16
- ac/llvm: fix amdgcn.fract for v2f16
- ac/llvm: fix amdgcn.rsq for v2f16
- ac/llvm: fix bcsel for v2*16
- ac/llvm: remove dead code handling for fmod
- ac/llvm: add better code for isign
- ac/llvm: add better code for fsign
- ac/llvm: fix b2f for v2f16
- radeonsi: stop using TGSI_PROPERTY_NEXT_SHADER
- radeonsi: stop using TGSI_PROPERTY_VS_WINDOW_SPACE_POSITION / VS_BLIT_SGPRS_AMD
- radeonsi: stop using TGSI_PROPERTY_TCS_VERTICES_OUT
- radeonsi: stop using TGSI_PROPERTY_TES_POINT_MODE / TES_PRIM_MODE
- radeonsi: stop using TGSI_PROPERTY_TES_SPACING
- radeonsi: stop using TGSI_PROPERTY_TES_VERTEX_ORDER_CW
- radeonsi: stop using TGSI_PROPERTY_GS\_\*
- radeonsi: stop using TGSI_PROPERTY_CS\_\*
- radeonsi: stop using TGSI_PROPERTY_FS_EARLY_DEPTH_STENCIL
- radeonsi: stop using TGSI_PROPERTY_FS_POST_DEPTH_COVERAGE
- radeonsi: stop using TGSI_PROPERTY_FS_COORD_PIXEL_CENTER
- radeonsi: stop using TGSI_PROPERTY_FS_DEPTH_LAYOUT
- radeonsi: stop using TGSI_PROPERTY_CS_LOCAL_SIZE
- radeonsi: stop using TGSI_PROPERTY_FS_COLOR0_WRITES_ALL_CBUFS
- radeonsi: remove info::samplers_declared, image_buffers, msaa_images_declared
- radeonsi: remove redundant si_shader_info::shader_buffers_declared
- radeonsi: remove redundant si_shader_info::images_declared
- radeonsi: remove redundant si_shader_info::const_buffers_declared
- radeonsi: remove redundant si_shader_info:\*(clip|cull)\* fields
- radeonsi: remove unused si_shader_info::uses_(vertexid|basevertex)
- radeonsi: merge uses_persp_opcode_interp_sample/uses_linear_opcode_interp_sample
- radeonsi: remove redundant si_shader_info::uses_kill
- radeonsi: reduce type sizes in si_shader_selector
- radeonsi: rename num_memory_instructions -\> num_memory_stores
- radeonsi: remove redundant si_shader_info::writes_memory
- radeonsi: remove redundant GS variables in si_shader_selector
- radeonsi: remove redundant si_shader_selector::max_gs_stream
- radeonsi: remove redundant si_shader_info::uses_derivatives
- radeonsi: use shader_info::cs::local_size_variable to clean up some code
- radeonsi: deduplicate setting key.mono.u.vs_export_prim_id
- radeonsi: kill point size VS output if it's not used by the rasterizer
- radeonsi: set outputs_written_before_ps for geometry shaders too
- radeonsi: eliminate unused shader outputs for separate NGG geometry shaders
- radeonsi: remove swizzle == ~0 dead code in si_llvm_load_input_gs
- ac,radeonsi: lower 64-bit IO to 32 bits and remove all dead code
- radeonsi: inline trivial PS functions
- nir: add mediump flag to IO semantics
- nir: fix lower_mediump_outputs to not require variables
- nir/algebraic: add flrp patterns for 16 and 64 bits
- nir/algebraic: expand existing 32-bit patterns to all bit sizes using loops
- nir: remove redundant opcode u2ump
- nir: enforce 32-bit src type requirement for f2fmp and i2imp
- nir: add new mediump opcodes f2[ui]mp, i2fmp, u2fmp
- nir/algebraic: collapse conversion opcodes (many patterns)
- nir/algebraic: add late optimizations that optimize out mediump conversions (v3)
- nir/opt_vectorize: don't lose exact and no_*_wrap flags
- st/mesa: don't enable NV_copy_depth_to_color if NIR doesn't support FP64
- nir,radeonsi: move ffma fusing to late optimizations for better codegen
- radeonsi: clean up ffma handling
- Revert "radeonsi: set BIG_PAGE fields on gfx10.3"
- Revert "radeonsi: move L2_CACHE_CONTROL registers into si_emit_framebuffer_state"
- radeonsi: don't lower pack for better 16-bit vectorization
- radeonsi: set flags for FP16 in shaders
- radeonsi: implement 16-bit FS color outputs
- radeonsi: vectorize IO for better ALU vectorization
- radeonsi: don't scalarize 16-bit vec2 ALU opcodes
- radeonsi: add 16-bit ALU vectorization
- gallium: rename PIPE_TRANSFER\_\* -\> PIPE_MAP\_\*
- gallium: rename pipe_transfer_usage -\> pipe_map_flags
- gallium: rename transfer flags -\> map flags in comments
- radeon: rename RADEON_TRANSFER\_\* -\> RADEON_MAP\_\*
- radeonsi: set TRUNC_COORD=0 for Total War: WARHAMMER to fix it
- radeonsi: move debug options from si_disk_cache_create to si_get_ir_cache_key
- radeonsi: remove KILL_PS_INF_INTERP/CLAMP_DIV_BY_ZERO, use screen::options
- amd: add Dimgrey Cavefish support
- amd: add VanGogh support
- radeonsi: set KEEP_TOGETHER_ENABLE if needed
- radeonsi: move binning parameters into si_screen
- radeonsi: break a binning batch on a new PS if bins can use multiple state sets
- radeonsi: add a tweak for PS wave CU utilization for gfx10.3
- nir: split fuse_ffma into fuse_ffma16/32/64
- nir: split lower_ffma into lower_ffma16/32/64
- radeonsi: fuse or lower ffma optimally on all chips
- nir: replace lower_ffma and fuse_ffma with has_ffma
- radeonsi: use optimal order of operations when setting up a compute dispatch
- radeonsi: call si_upload_graphics_shader_descriptors before the big conditional
- radeonsi: move a displaced comment in si_draw_vbo
- radeonsi: don't call emit_cache_flush after uploading bindless descriptors
- radeonsi: reorganize the code around the gfx9 scissor bug
- radeonsi: move si_upload_vertex_buffer_descriptors into si_state_draw.c
- radeonsi: add unlikely statements into si_draw_vbo
- radeonsi: lift the conditional for skipping si_upload_vertex_buffer_descriptors
- radeonsi: always inline draw-related functions that have only one use
- nir: gather indirect info from lowered IO intrinsics
- nir: gather tess.tcs_cross_invocation info from lowered IO intrinsics
- nir: set system_values_read for all intrinsics
- nir: gather fs.uses_sample_qualifier from lowered IO
- nir: fix input/output info gathering for lowered IO
- nir: gather information about fbfetch and dual source color
- radeonsi: fix indirect dispatches with variable block sizes
- radeonsi: call nir_shader_gather_info after lowering and optimizing NIR
- radeonsi: use info.system_values_read
- radeonsi: get information about FS color outputs from shader_info directly
- radeonsi: get input/output usage flags from shader_info directly
- radeonsi: run NIR optimizations that glsl_to_nir runs but other places might not
- radeonsi: assume that constant load_local_group_size has been optimized out
- radeonsi: remove redundant variables from struct si_compute
- radeonsi: remove redundant info.uses_fbfetch
- gallivm: add support for lowered IO in vertex shaders
- util: implement f16c - fast half<->float conversions
- util: move util_half_to_float code into \_mesa_half_to_float_slow
- util: remove util_float_to_half and util_half_to_float wrappers
- gallium/util: remove redundant util_float_to_half_rtz
- gallium/util: remove empty file u_half.h
- radeonsi: Fix dead lock with aux_context_lock in si_screen_clear_buffer.
- radeonsi: simplify NGG culling enablement and add radeonsi_shader_culling option
- radeonsi: kill disabled clip distances and planes at per-channel granularity
- radeonsi: move si_set_active_descriptors_for_shader into si_update_common_shader_state
- radeonsi: use staging buffer uploads for most VRAM buffers
- radeonsi: call nir_lower_bool_to_int32 last because it breaks nir_opt_if
- radeonsi: restructure si_pipe_set_constant_buffer
- mesa: factor out layout parsing for glInterleavedArrays
- gl_marshal.py: inline print_sync_dispatch
- driconf: force the vendor string to NVIDIA to fix viewperf energy tests
- driconf: enable force_glsl_extensions_warn for viewperf
- st/mesa: enable GL name reuse for queries based on the driconf option
- util/idalloc: resize if ID is too large for reservation
- gallium/util: add set_frontend_noop into driver_noop and u_threaded_context
- radeonsi: remove dead variable postponed_kill
- radeonsi: implement GL_INTEL_blackhole_render
- gallium/u_threaded_context: don't call memcpy in tc_set_constant_buffer
- gallium/u_threaded_context: always flush asynchronously if requested
- gallium/u_threaded_context: fix use-after-free in transfer_unmap
- util: implement F16C using inline assembly on x86_64
- util: move util_half_to_float code into \_mesa_half_to_float_slow
- util: remove util_float_to_half and util_half_to_float wrappers
- gallium/util: remove redundant util_float_to_half_rtz
- gallium/util: remove empty file u_half.h
- mesa: don't use GET_DISPATCH because it doesn't work with glthread
- mesa: remove api_loopback to remove call indirections
- glthread: handle glInterleavedArrays
- nir/algebraic: always lower idiv to shifts if bitops are allowed
- util: add \_mesa_set_create_u32_keys where keys are not pointers
- nir: add new helper passes that lower uniforms to literals
- gallium: add pipe_context::set_inlinable_constants
- st/mesa: pass inlinable uniforms to drivers if they requested it
- ac/surface: fix valgrind warnings in DCC retile tile lookups
- winsys/amdgpu: rework the VM alignment optimizations
- winsys/amdgpu: apply the VM alignment optimization to the physical alignment too
- radeonsi: update the DMA perf test
- radeonsi: disable SDMA on gfx6-7 and gfx10.3 to decrease CPU overhead
- Revert "radeonsi/gfx10: disable vertex grouping"
- radeonsi: don't disable NGG culling on gfx10.3
- radeonsi: enable NGG culling by default on gfx10.3 dGPUs
- radeonsi: optimize out LDS bank conflicts in the NGG culling shader
- radeonsi: remove indirection when loading position at the end for NGG culling
- radeonsi: write VS/TES system values into LDS after culling
- radeonsi: pack LDS better for NGG culling
- radeonsi: tweak LATE_ALLOC_GS numbers for faster NGG culling
- radeonsi: enable NGG on Navi14 PRO cards
- radeonsi: enable NGG culling by default on Navi1x PRO cards
- ac/llvm: don't lower bool to int32, switch to native i1 bool
- amd: update addrlib
- nir: consider load_color intrinsics as both inputs and sysval in gathering
- Revert "st/mesa: don't pass NIR to draw module if IO is lowered"
- st/mesa: make sure prog->info is up to date for NIR (v2)
- amd: regenerate gfx103.json from kernel headers
- amd: correct typos in gfx10-rsrc.json
- amd: update gfx10-rsrc.json for gfx10.3
- amd: replace 0x028848 with the register definition
- amd: print NUM_PKRS with AMD_DEBUG=info on gfx10.3
- Revert "radeonsi: use staging buffer uploads for most VRAM buffers"
- util: remove unused util_get_L3_for_pinned_thread
- util: consolidate thread_get_time functions
- st/mesa: remove random L3 pinning heuristic for glthread
- util: add util_set_thread_affinity helpers including Windows support
- util: add util_get_current_cpu using sched_getcpu and Windows equivalent
- util: completely rewrite and do AMD Zen L3 cache pinning correctly
- glthread: pin driver threads to the same L3 as the main thread regularly
- radeonsi: implement inlinable uniforms
- gallium: move pipe_draw_info::start/count to the beginning and pad empty space
- gallium: add pipe_context::multi_draw
- winsys/amdgpu: remove incorrect assertion check against max_check_space_size
- radeonsi: add num_draws parameter into si_need_gfx_cs_space
- radeonsi don't get count from pipe_draw_info in si_num_prims_for_vertices
- radeonsi: don't check info->count == 0
- radeonsi: implement multi_draw but supporting only 1 draw
- radeonsi: add support for multi draws
- radeonsi: set NOT_EOP for back-to-back draws on gfx10+
- radeonsi: implement multi_draw for compute-based primitive culling
- gallium/u_threaded: move a structure up to be used later
- gallium/u_threaded: merge consecutive draw calls within batches
- st/mesa: fix use-after-free when updating shader info in st_link_nir
- radeonsi: fix min_direct_count value
- radeonsi: do VGT_FLUSH when switching NGG -\> legacy on Sienna Cichlid
- radeonsi: only do VGT_FLUSH for fast launch if previous draw was normal launch
- radeonsi: determine correctly if switching from normal launch to fast launch
- radeonsi: add options.inline_uniforms to the shader cache key
- ac: fix detection of Pro graphics
- ac: fix min/max_good_num_cu_per_sa on gfx10.3 with disabled SEs
- radeonsi: fix NGG streamout regression
- radeonsi: fix scan_instruction for bindless inc_wrap/dec_wrap atomics
- nir: fix gathering TCS cross invocation access with lowered IO
- nir: fix gathering patch IO usage with lowered IO
- ac/nir: fix a typo in ac_are_tessfactors_def_in_all_invocs
- mesa: call FLUSH_VERTICES before changing sampler uniforms
- st/mesa: fix uninitialized/random clip plane state vars in lower_ucp
- radeonsi: fix a memory leak in si_create_dcc_retile_cs
- radeonsi: fix a nasty bug in si_pm4.c
- radeonsi: disable WGP mode on gfx10.3 to prevent hangs

Marek Vasut (2):

- etnaviv: Remove etna_resource_get_status()
- etnaviv: Add lock around pending_ctx

Marijn Suijten (5):

- util: Makefile.sources: Add disk_cache_os.{c,h}
- android: gallium/auxiliary: Deduplicate nir_to_tgsi.c inclusion
- scons: gallium/auxiliary: Unconditionally compile NIR regardless of LLVM
- android: panfrost: Move nir_undef_to_zero to util
- android: freedreno: Add freedreno_dev_info.[ch\] to Makefile.sources

Mark Janes (2):

- intel/fs: Assert if lower_source_modifiers converts 32x16 to 32x32 multiplication
- intel/fs: work around gen12 lower-precision source modifier limitation

Mark Menzynski (5):

- nv50/ir: Use a bit field in info_out structure
- nv50/ir: Add nv50_ir_prog_info_out serialize and deserialize
- nv50/ir: Add prog_info_out print
- nv50/ir: Add nv50_ir_prog_info serialize
- nvc0: Add shader disk caching

Martin Peres (11):

- driconf: bump the maximum string size from 25 to 1024
- driconf: initialize the option value before using it
- dri/DRI2ConfigQueryExtension: add support for string options
- glx/extensions: split set_glx_extension into find\_ and set\_
- glx: stop using hardcoded array sizes for bitfields
- glx: initial plumbing to let users force-enable/disable extensions
- glx: let users force-enable/disable indirect GL extensions
- driconf: add a way to override GLX extensions
- driconf: add a way to override indirect-GL extensions
- driconf: disable GLX_OML_swap_method by default on Brink
- driconf: allow higher compat version for Brink

Matt Turner (3):

- intel/tools: Disassemble WAIT's argument as a destination
- Revert F16C series (MR 6774)
- glcpp: Handle bison-3.6 error message changes

Mauro Rossi (28):

- android: panfrost: Rename encoder/ to lib/
- android: panfrost: Move pandecode into lib/
- android: pan/mdg: Separate disassembler and compiler targets
- android: pan/bi:
  Separate disasm/compiler targets
- android: panfrost: Redirect cmdstream includes through GenXML
- android: panfrost/bifrost: add libpanfrost_lib static dependency
- android: panfrost: Redirect cmdstream includes through GenXML (v2)
- android: util/format: fix generated sources rules
- android: amd/registers: switch to new generated register definitions
- android: util: fix missing include path
- android: nv50/ir: Add nv50_ir_prog_info_out serialize and deserialize
- android: freedreno: Implement pipe screen's get_device/driver_uuid()
- android: freedreno/common: add libmesa_git_sha1 static dependency
- egl/android: HAVE_DRM_GRALLOC path fixes (v2)
- android: aco/isel: Move context initialization code to a dedicated file
- android: pan/bi: Use new disassembler
- android: pan/bi: Use new packing
- android: pan/bi: fix typo in bifrost_gen_disasm.c gen rules
- android: gallium/iris: cleanup iris_driinfo.h gen rules
- android: gallium/radeonsi: cleanup si_driinfo.h gen rules
- android: gallium/virgl: cleanup virgl_driinfo.h gen rules
- android: util: add log.c to Makefile.sources
- android: pan/bi: Use new disassembler (v2)
- android: panfrost: use python3 for generated sources rules
- android: util: Move xxd.py to util
- android: util,ac,aco,radv: Cross-platform memstream API
- android: fix libsync dependencies (v2)
- android: aco: add aco_form_hard_clauses.cpp to Makefile.sources

Michael Olbrich (1):

- meson.build: xxf86vm is not needed for -Dglx-direct=false

Michael Tretter (2):

- etnaviv: fix comment for source of etna_mesa_debug
- etnaviv: free tgsi tokens when shader state is deleted

Michel Dänzer (31):

- ci: Fix up rules for post-merge / main project branch pipelines
- ci: Create test-docs job in mesa/mesa pipelines for MRs
- ci: Don't exclude "success" job from mesa/mesa pipelines for MRs
- ci: Restrict "success" job to pipelines for MRs
- ci: Do not create manual test-docs job in post-merge pipelines
- ci: Remove any existing results directory before running piglit
- ci: Add "is scheduled pipeline" YAML anchor
- ci: Add "is master branch of main project" YAML anchor
- ci: Add "is pre-merge pipeline for Marge Bot" YAML anchor
- ci: Add "is post-merge pipeline, not for Marge Bot" YAML anchor
- ci: Add "is forked branch or pre-merge pipeline" YAML anchor
- ci: Add "is forked branch" YAML anchor
- ci: Add "is post-merge pipeline" YAML anchor
- ci: Add "is pre-merge pipeline" YAML anchor
- ci: Add "is for Marge Bot" YAML anchor
- ci: Always use CI_PROJECT_NAMESPACE instead of CI_PROJECT_PATH
- ci: Prevent pages job from running in pre-merge pipelines
- ci: Don't create test-docs job if the pages one exists in the pipeline
- ci: Use ignore_scheduled_pipelines anchor in .radeonsi-rules
- gallium: Make pipe_viewport_state swizzle_x/y/z/w bit-fields 8 bits wide
- ci: Move test-docs job to deploy stage
- ci: Add empty needs: to pages job
- ci: Add jobs running ci-fairy checks
- loader/dri3: Only allocate additional buffers if needed
- loader/dri3: Keep
  current number of back buffers if frame was skipped
- loader/dri3: Allocate up to 4 back buffers for page flips
- ci: Add "check mr" job to needs: of build jobs
- ci: Run git_archive job if all_paths matches
- i965/bufmgr: Handle NULL bufmgr in brw_bufmgr_get_for_fd
- iris/bufmgr: Handle NULL bufmgr in iris_bufmgr_get_for_fd
- ac: Don't negate strstr return values in ac_query_gpu_info

Michel Zou (9):

- swr: fix build with mingw
- swr: missing \_BitScanForward64 on 32 bits win
- swr: fix \_BitScanForward64 on unix
- util: drop non-posix header fnmatch
- lavapipe: fix usleep usage in lvp_device
- wsi: move drm code to wsi_common_drm.c
- gallium: use libpipe_loader_links
- lavapipe: configure suffix in icd json
- util: use dllexport for mingw too

Mike Blumenkrantz (118):

- zink: basic primitive restart support for strip/fan topologies
- zink: move 8bit index handling out of u_primconvert path
- zink: use util_draw_vbo_without_prim_restart for unsupported prim modes
- zink: set primitive restart cap
- zink: move shader state methods for pipe_context into zink_program.c
- zink: adjust zink_shader struct to contain full streamout info
- zink: refcount zink_gfx_program objects
- zink: split up creating zink_shader objects and VkShaderModule objects
- zink: use ZINK_SHADER_COUNT instead of PIPE_SHADER_TYPES - 1 everywhere
- zink: start using per-stage flags for new shaders, refcount shader modules
- zink: always compile shaders in pipeline order
- zink: rename zink_gfx_program::stages to 'modules'
- gallium: add pipe_transfer_usage for z/s only mappings
- gallium/u_transfer_helper: add util functions for doing deinterleaving during map
- zink: print error when getprocaddr fails for extension functions
- zink: change pipeline hashes to index based on vk primitive type
- zink: handle more draw modes
- zink: invalidate pipeline hash on more changes
- zink: use u_transfer_helper to split/merge interleaved depth/stencil formats
- zink: add note about buffer<->image copy functions not handling multisample
- zink: generically handle matrix types
- anv: improve error message when failing to open device path
- anv: assert that the target bo is valid when adding a reloc list
- zink: use correct value for color buffer sample count when creating renderpass
- zink: use correct number of samples on framebuffer in set_framebuffer_state
- zink: use correct layer count when creating framebuffer
- zink: clamp min created fb size to 1x1
- zink: verify that src and dst aspects are the same in resource_copy_region hook
- zink: implement ARB_instanced_arrays
- zink: move viewport count to zink_gfx_pipeline_state
- zink: set multiviewport cap in ntv when gl_ViewportIndex is a written output
- zink: correctly set up fb-sized scissors for each viewport
- zink: apply viewport count when creating pipelines
- zink: reorder create_stream_output_target to fix failure case leak
- zink: combine all surface layout-setting for src/dst into util function
- zink: unify all occurrences of waiting on a fence
- zink: correctly handle ARB_arrays_of_arrays
  in ntv for samplers
- zink: run nir_lower_uniforms_to_ubo conditionally
- zink: fix shader buffer size caps to use 65536
- zink: always emit descriptor set 0 in ntv
- zink: emit ubo variables sized based on the overall ubo block size
- zink: don't emit ubos or bindings for ubo variables
- zink: correctly set up ubo bindings and buffer indices
- zink: use sizeof(vec4) multiplier for nir_lower_uniforms_to_ubo
- zink: hook up driconf
- xmlconfig: fix scandir_filter
- zink: handle timestamp queries
- zink: handle TIME_ELAPSED queries
- zink: add pipe_context::get_timestamp hook
- zink: enable pipe caps for ARB_timer_query
- anv: remove VkPipelineCacheCreateInfo::flags assert
- radv: remove VkPipelineCacheCreateInfo::flags assert
- util/hash_table: add function for reserving size in a hash table
- zink: enable VK_KHR_vulkan_memory_model extension
- zink: add VK_EXT_custom_border_color
- zink: support VK_EXT_blend_operation_advanced
- zink: support VK_EXT_extended_dynamic_state
- zink: add VK_EXT_pipeline_creation_cache_control
- zink: enable VK_EXT_shader_stencil_export
- zink: ARB_uniform_buffer_object is now implemented, so add cap and feature doc
- glsl: fix up location setting for variables pointing to a UBO's base
- nir: update ubo locations in nir_lower_uniforms_to_ubo
- zink: add a mechanism to track current resource usage in batches
- zink: optimize transfer_map for resources with pending reads/writes
- zink: add more explicit fencing for transfer maps
- zink: explicitly flag fb attachments as being written
  to in render passes
- zink: don't leak sampler view textures
- zink: redo slot mapping again for the last time really I mean it
- zink: export PIPE_CAP_MAX*_VARYINGS values
- zink: unify code for emitting named uint-based variable instructions
- glsl: more accurately handle swizzle in 64bit varying split with no left value
- zink: increase descriptor pool sizes for other descriptor types we'll be using
- zink: implement ARB_texture_buffer_object
- zink: ensure resource tracking for sampler buffers in render batches
- zink: assert valid format in zink_create_sampler_view()
- zink: handle null attachment for ARB_texture_buffer_object samplers
- zink: add VK_BUFFER_USAGE_INDEX_BUFFER_BIT to vertex buffer creation
- zink: add last few format maps for ARB_vertex_type_2_10_10_10_rev
- zink: fix stencil wrapping
- zink: add some spirv_builder functions we'll be using for geometry shaders
- zink: handle shader io vars more generically for use with gs
- zink: add ntv handling for geometry shader variables
- zink: re-transform gl_Position for gs input
- zink: add handling for gs in ntv
- zink: remove ADJACENCY prim types from primconvert path
- zink: round out handling for streamout buffer stride setting during draw
- zink: add gallium handling for geometry shaders
- zink: enable gs pipe caps
- zink: bump to glsl 1.40
- zink: mark off GL 3.1 as done in features.txt
- zink: GLSL 1.50
- zink: set 3.2 complete in features.txt
- zink: bump GLSL to 3.30
- zink: set 3.3 complete in features.txt
- zink: implement
  ARB_draw_indirect
- zink: add helper for vec-type input variables in ntv
- zink: add ntv handling for ARB_sample_shading
- zink: add a pipe_context::get_sample_position hook
- zink: mark ARB_sample_shading as supported
- doc/features: remove zink entries for GL 3.3 items
- zink: deduplicate some query result code
- zink: more correctly handle PIPE_QUERY_PRIMITIVES_GENERATED queries
- zink: also create an xfb query for every primitives generated query
- zink: store batch id onto query object at time of start
- zink: fixup gs/xfb tracking for primitives generated queries
- zink: rework query overflow handling
- zink: always use query->type for starting/stopping xfb queries
- zink: always reset query pools on next query begin
- zink: add pass for lowering dynamic ubo/ssbo block indexing to constants
- zink: break up dynamic access lowering
- util/threaded_context: use driver's buffer alignment for staging transfers
- nir/clip_disable: write 0s instead of undefs for disabled clip planes
- nir/clip_disable: try for better no-op
- nir/clip_disable: handle 2x vec4 case
- zink: implement ARB_texture_query_lod
- zink: use same function for all pipe_context::delete_*_state shader methods
- zink: add a quadop function in spirv_builder
- zink: add some spirv builder functions for barriers

Nanley Chery (46):

- dri_util: Update internal_format to GL_RGB8 for MESA_FORMAT_B8G8R8X8_UNORM
- iris: Don't call SET_TILING for dmabuf imports
- iris: Make iris_bo_import_dmabuf take a modifier
- iris: Drop
  iris_resource_alloc_separate_aux
- iris: Drop unused resource allocation optimization
- iris: Drop old comment on clear color BO allocation
- iris: Move size/offset calculations out of configure_aux
- iris: Add and use iris_resource_configure_main
- iris: Drop buffer support in resource_from_handle
- gallium/dri2: Report correct YUYV and UYVY plane count
- iris: Fix aux assertion in resource_get_handle
- iris: Fold a condition into no_gpu for consistency
- iris: Make iris_has_color_unresolved more generic
- iris: Avoid resolving Z/S reads in transfer_map
- iris: Drop a use of the need_resolve boolean
- iris: Better determine map_would_stall for Z/S
- gallium/dri2: Report I915_FORMAT_MOD_Y_TILED_GEN12_MC_CCS num_planes
- gallium/dri2: Support I915_FORMAT_MOD_Y_TILED_GEN12_MC_CCS import
- intel/isl: Describe I915_FORMAT_MOD_Y_TILED_GEN12_MC_CCS
- intel/isl: Support ISL_AUX_USAGE_MC in surface states
- intel/isl: Add YUV format info for the aux-map
- st/mesa: Don't map all P01X DRM formats to P016
- intel/common: Add get_aux_map_format_bits()
- iris: Support planar resource imports for MC
- intel/common: Drop unused gen_aux_map_add_image
- iris: Support MC modifier in plane count queries
- iris: Support I915_FORMAT_MOD_Y_TILED_GEN12_MC_CCS
- blorp: Fix alignment test for HIZ_CCS_WT fast-clears
- blorp: Drop trailing whitespace in blorp_clear.c
- anv/image: Disable multi-layer CCS_E on TGL+
- blorp: Ensure aligned HIZ_CCS_WT partial clears
- iris: Fix a fast-clear skipping optimization
- anv: Enable multi-layer
  aux-map init for HIZ+CCS
- Revert "anv: Add driconf option to disable compression for 16bpp format"
- iris: Add fast-clear restriction for 8bpp surfaces
- isl: Allow CCS for 8bpp surfaces with 3+ miplevels
- st/mesa: Add missing sentinels in format_map[\]
- intel/isl: Drop redundant unpack of unorm channels
- isl: Fix the aux-map encoding for D24_UNORM_X8
- iris: Fix fast-clears of swizzled LA formats
- iris: Fix SINT assert in convert_fast_clear_color
- iris: Fix fast-clears of swizzled alpha formats
- iris: Flush dmabufs during context flushes
- mesa: Add and use \_mesa_has_depth_float_channel
- mesa: Clamp some depth values in glClearBufferfv
- mesa: Clamp some depth values in glClearBufferfi

Neil Roberts (3):

- v3d: Make the function to set tex dirty state for a stage global
- v3d: Split the creating of TEXTURE_SHADER_STATE into a helper function
- v3d: Update the TEXTURE_SHADER_STATE when there’s a new buf for a tex

Philipp Zabel (3):

- meson: fix power8 option
- gallium/dri: fix dri2_query_image for multiplanar images
- gallium/dri: fix dri2_from_planar for multiplanar images

Pierre Moreau (5):

- clover/spirv: Remove unused tuple header
- clover/spirv: Print linked SPIR-V module if asked
- meson: Raise minimum version for SPIR-V OpenCL deps (v4)
- clover/llvm: Use the highest supported SPIR-V version (v4)
- clover/nir: Register callback for translation messages (v2)

Pierre-Eric Pelloux-Prayer (61):

- ac/llvm: handle static/shared llvm init separately
- mesa/st: introduce PIPE_CAP_NO_CLIP_ON_COPY_TEX
- radeonsi: enable PIPE_CAP_NO_CLIP_ON_COPY_TEX
- ac/llvm: add option to clamp division by zero
- radeonsi,driconf: add clamp_div_by_zero option
- radeonsi: use radeonsi_clamp_div_by_zero for SPECviewperf13, Road Redemption
- amd/llvm: switch to 3-spaces style
- amd/common: switch to 3-spaces style
- mesa: move u_idalloc from gallium/aux/util to util
- util/idalloc: add util_idalloc_reserve
- util/idalloc: add lowest_free_idx to avoid iterating from 0
- mesa: add a isGenName parameter to \_mesa_HashInsert
- mesa: add GL name reuse support
- mesa: add \_mesa_HashFindFreeKeys
- mesa: use \_mesa_HashFindFreeKeys for GL functions
- driconf: add option to reuse GL names
- glsl: fix per_vertex_accumulator::fields size
- r600/uvd: set dec->bs_ptr = NULL on unmap
- radeon/vcn: set dec->bs_ptr = NULL on unmap
- radeonsi: fix quant_mode selection for large negative values
- radeonsi: fix guardband handling for large values
- mesa: fix glUniform\* when a struct contains a bindless sampler
- gallium: add PIPE_CAP_MAX_TEXTURE_MB
- radeonsi: move GL vendor workaround to drirc
- radeonsi: reduce PIPE_SHADER_CAP_MAX_CONST_BUFFER_SIZE value
- radeonsi: change vendor name to AMD
- radeonsi: force linear for textures with height=1 (gfx6-8)
- radeonsi/tmz: use secure job if framebuffer has dcc
- radeonsi/tmz: use secure job if using an encrypted z/s buffer
- radeonsi/tmz: add safety assert when
  tmz is enabled
- radeonsi/tmz: allocate depth/stencil buffers as encrypted
- radeonsi: introduce SI_RESOURCE_FLAG_INTERNAL / RADEON_FLAG_DRIVER_INTERNAL
- amd: add AMDGPU_IDS_FLAGS_TMZ definition to amdgpu_drm.h
- ac/gpu_info: add detection of TMZ support
- radeonsi/tmz: allow secure job if the app made a tmz allocation
- amd/winsys: add RADEON_FLUSH_TOGGLE_SECURE_SUBMISSION
- radeonsi/tmz: fail si_texture_transfer_map if tex is encrypted
- radeonsi/tmz: add tmz variant of sctx::wait_mem_scratch
- radeonsi/tmz: add tmz variant for sctx::tess_rings
- radeonsi: disable primitive discard if tmz is in use
- radeonsi/tmz: add a tmz variant for sctx::eop_bug_scratch
- radeonsi/tmz: add workaround for mpv/vaapi subtitles
- amd/tmz: move uses_secure_bos to radeon_winsys
- gallium/vl: do not call transfer_unmap if transfer is NULL
- gallium/vl: add chroma_format arg to vl_video_buffer functions
- omx/tizonia: fix build
- gallium: add new cap PIPE_CAP_DEVICE_PROTECTED_CONTENT
- gallium: introduce PIPE_BIND_PROTECTED
- radeonsi: honor PIPE_BIND_PROTECTED
- egl: implement EGL_EXT_protected_surface support
- radeonsi: enable PIPE_CAP_DEVICE_PROTECTED_CONTENT
- egl: handle EGL_PROTECTED_CONTENT_EXT for eglImage
- dri: introduce createImageFromDmaBufs3
- egl/dri2: implement createImageFromDmaBufs3
- driconf: add disable_protected_content_check option
- radeonsi: fix RADEON_FLUSH flags conflicts
- radeon: add si_vid_create_tmz_buffer helper
- radeon/vcn: delay dec->ctx and dec->dpb allocation
- va/picture: make sure
  destination buffer is protected if needed
- va: support VA_RT_FORMAT_PROTECTED
- radeonsi/gfx10: flush gfx cs on ngg -\> legacy transition

Pierre-Loup A. Griffais (2):

- radv: fix null descriptor for dynamic buffers
- radv: fix vertex buffer null descriptors

Qiang Yu (4):

- radeonsi: fix syncobj wait timeout
- radeonsi: fix user fence space when MCBP is enabled
- radeonsi: fix max syncobj wait timeout
- radeonsi: fix user fence GPU address

Rhys Perry (160):

- aco: fix C++11/C++14 compilation
- aco: set constant_data_offset correctly in the case of merged shaders
- aco: don't move memory accesses to before control barriers
- nir/opt_remove_phis: optimize out phis with undef
- gitlab: ask inxi output to be in code blocks
- util: add a alignof() macro
- nir: fix potential left shift of a negative value
- nir: fix memory leak in nir_cf_list_clone
- radv: don't pass null to \_mesa_sha1_update
- radv: align pipeline cache entry and header sizes
- radv: fix null memcpy and zero-sized malloc
- aco: fix non-rtz pack_half_2x16
- nir: add and use nir_intrinsic_has\_ helpers
- aco: use nir_intrinsic_has_access
- bifrost: use nir_intrinsic_has_type
- aco: consider branch definitions in spiller
- aco: don't consider the first partial spill if it's the wrong type
- aco: don't fix break condition for break+discard to exec
- aco: fix regclass checks when fixing to vcc/exec with Builder
- aco:
  fix spills_entry heuristic for branch blocks in init_live_in_vars()
- aco: keep loop live-through variables spilled
- aco: reserve 2 sgprs for each branch
- aco: create long jumps
- aco/tests: add test for GFX10 0x3f bug
- aco: shorten disassembly for repeated instructions
- aco/tests: add tests for long jumps
- aco: remove 64-bit SGPR ubfe/ibfe
- aco: fix sgpr ubfe/ibfe if the offset is too large
- aco: sink get_alu_src() in bfe lowering
- spirv: fix Uniform and Output MemoryAccessMakePointer{Visible,Available}
- spirv: make OpLoad/OpStore visibility/availablity barriers acquire/release
- spirv: add vtn_emit_make_{visible,available}_barrier helpers
- spirv: implement MakePointerAvailable/MakePointerVisible for OpCopyMemory
- spirv: implement Volatile memory semantic
- spirv: implement Volatile image operand
- spirv: implement SpvMemoryAccessVolatileMask
- spirv: add some tests for volatile/available/visible
- radv: remove descriptor_indexing fails from expected fails
- aco: fix mad splitting after applying output modifiers
- aco: remove omod_success/clamp_success
- aco: fix byte_align_scalar for 3 dword vectors
- nir/load_store_vectorize: rework alignment calculation
- nir/opt_shrink_vectors: shrink image stores using the format
- aco: fix one-off error in Operand(uint16_t)
- aco: improve fsign selection
- nir/opt_if: fix opt_if_merge when destination branch has a jump
- nir/opt_loop_unroll: fix is_access_out_of_bounds with vectors
- aco: fix v_writelane_b32 with two sgprs
- aco: workaround disassembler bug
  of v_writelane_b32 with literal
- aco: don't apply constant to SDWA on GFX8
- aco: fix value numbering of reductions
- aco: fix validation of sub-dword parallel-copies
- aco: pass -fno-exceptions and -fno-rtti
- aco: fix incorrect assertion in emit_vop3a_instruction()
- radv: initialize with expanded cmask if the destination layout needs it
- radv,aco: fix reading primitive ID in FS after TES
- aco: keep track of temporaries' regclasses in the Program
- aco: use bit vectors for liveness sets
- aco: use io semantics to get an intrinsic's slot
- aco: use nir_get_io_offset_src() in visit_load_input()
- aco: use nir's constant source helpers more
- aco: remove dead indirect fs input loading
- aco: stop multiplying driver_location by 4
- st/nir: call nir_opt_access before gl_nir_lower_buffers
- radeonsi: don't use nir_opt_access
- nir/instr_set: hash intrinsic sources
- nir/load_store_vectorize: improve vectorization with identical operations
- aco: fix get_buffer_resource_flags()
- aco: remove trailing whitespace
- radv: remove trailing whitespace
- aco: Add loop creation helpers.
- nir: return progress from nir_lower_io_to_scalar_early
- radv: move optimizations in shader_compile_to_nir() to after io_to_scalar
- radv: use radv_optimize_nir() less in radv_link_shaders()
- spirv: add and use a generator id enum
- spirv: replace discard with demote for incorrect HLSL->SPIR-V translations
- radv: remove RDR2 discard workaround
- android: fix SPIR-V -\> NIR build
- aco: optimize more uniform reductions/scans
- aco: implement elect
- radv/aco,nir/lower_subgroups: don't lower elect
- nir: add last_invocation intrinsic
- aco: implement last_invocation
- nir: move divergence analysis options to nir_shader_compiler_options
- nir: allow divergence information to be updated when inserting instruction
- nir: add pass to optimize uniform atomics
- aco: use nir_opt_uniform_atomics
- nir/opt_uniform_atomics: optimize image atomics
- nir/opt_uniform_atomics: don't optimize atomics twice
- aco: fix get_ssbo_size with a vgpr resource
- scons: fix SPIR-V -\> NIR build
- nir/opt_uniform_atomics: remove useless returns
- aco: implement 16-bit literals
- aco: propagate literals into sub-dword pseudo instructions on GFX9+
- aco: don't use v_pack_b32_f16 if 16-bit input denormals are flushed
- nir/opt_load_store_vectorize: don't vectorize stores across demote
- nir/opt_load_store_vectorize: add some tests for discard/demote behaviour
- aco: add missing SCC clobber in get_buffer_size
- ci: disable check commits job for now
- nir/loop_analyze: adjust force unrolling to only include interesting modes
- ac/nir: remove bindless image atomic format check
- aco: remove isel_context::allocated
- aco: update phi_map in add_subdword_operand()
- aco: don't do divergent break+discard
- aco: skip value numbering of copies
- aco: copy-propgate through p_create_vector during value numbering
- aco: expand vectors passed as copy operands
- aco: don't use bld.copy() in handle_operands()
- aco: allow literals on sub-dword p_parallelcopy
- aco: always use p_parallelcopy for pre-RA copies
- aco: use Builder::copy more
- aco: remove some unused optimizations
- aco: use v_mov_b32_sdwa for some 16-bit constants
- aco: remove all-undef phi opt
- aco: ignore the ACO-inserted continue in create_continue_phis()
- aco: default to a definition size of 32
- aco: round bytes_written to dwords if larger than 4 bytes
- aco: use control flow creation helpers in select_gs_copy_shader
- aco: use mubuf helper in select_gs_copy_shader
- aco: move individual instruction disassembly to its own helper
- aco: refactor repeated instruction disassembly
- aco: switch aco_print_asm to a FILE \\*
- aco: create s_clause on GFX10+
- aco: assert a label only uses one of the members in ssa_info's union
- aco: fix printing of some sdwa sels
- aco: fix combine_inverse_comparison()
- aco: don't allow destination opsel for v_cvt_pknorm
- aco: handle SDWA in the optimizer
- docs/features: update unpromoted Vulkan extensions
- docs/features: add Vulkan 1.2
- radv: add some missing radv_{start,stop}_feedback
- radv: fix shader caching with discard->demote workaround
- radv: fix shader caching with NaN fixup workaround
- nir: scalarize fdot in reverse
- spirv: reverse order in matrix multiplication
- nir/algebraic: better propagate constants up fadd chains
- nir: add nir_alu_src_is_trivial_ssa()
- nir: skip bcsel with non-trivial swizzle in opt_simplify_bcsel_of_phi()
- nir: use nir_alu_src_is_trivial_ssa() in nir_ssa_for_alu_src()
- nir: add shader_info::bit_sizes_used
- nir/lower_bit_size: optimize upcast of b2i8/b2i16
- radv: move a few passes to after load/store vectorization
- radv: do nir_lower_bit_size after algebraic optimizations
- radv: rework nir_lower_bit_size callback and run DA on GFX8+
- aco: implement some 16-bit arithmetic instead of lowering
- aco: implement 8/16-bit instructions which can be trivially widened
- spirv: fix GLSLstd450Modf/GLSLstd450Frexp when the destination is vector
- util: add mapping from Vulkan to Gallium R64 integer formats
- amd/common: add PIPE_FORMAT_R64_{UINT,SINT} to GFX10 format table
- aco: implement 64-bit images
- ac/nir: implement 64-bit images
- radv: implement VK_EXT_shader_image_atomic_int64
- aco: don't combine precise max(min()) to med3
- aco: fix combine_constant_comparison_ordering() NaN check with 16/64-bit
- aco: disallow various v_add_u32 opts if modifiers are used
- aco: disable omod if the sign of zeros should be preserved
- aco: fix fp16 \*0.5 omod
- aco: fix v_mul_hi_u32_u24 format
- nir/unsigned_upper_bound: fix buffer overflow in search_phi_bcsel
- nir: fix sampler_lod_parameters_pan indices

Ricardo Garcia (1):

- anv: Ignore continue flag in primary cmd buffers

Ricardo Quesada (1):

- anv: support fd==-1 in ImportSemaphoreFdKHR

Rob Clark (46):

- freedreno/registers: add some missing regs to build
- freedreno/ir3: don't install ir3_compiler cmdline tool
- freedreno/ir3: add tracking for \\# of instructions per category
- freedreno/ir3: add more disasm stats
- freedreno/crashdec: handle section name typos
- freedreno/decode: try harder to not crash in disasm
- freedreno/registers: SC_WAIT_WC is not a6xx
- freedreno/a6xx: only generate streamout for draw pass shader
- freedreno/a6xx: fix occlusion query with more than one tile
- freedreno/cffdump: add arg to filter by process name
- freedreno/a6xx: disable LRZ when color channels are masked
- freedreno/a6xx: refactor debug logging
- freedreno: add debug helper to dump buffers
- freedreno: handle case of shadowing current render target
- freedreno/gmemtool: add tile_alignw/h and a650
- freedreno: add env var to override GMEM size
- freedreno: add env var to override tiles-per-pipe
- freedreno/a6xx: fix hang with large render target
- freedreno/batch: split out helper for rb alloc
- freedreno/batch: replace lrz_clear with prologue
- freedreno/a5xx+a6xx: use sysmem path for nondraw batches
- freedreno/a6xx: move ubwc clear to blitter
- freedreno: Fix missing rsc->seqno updates
- freedreno: fence_server_sync() fixes
- freedreno: Fix rast state for multisample clear
- freedreno: Don't bypass fd_draw_vbo() in clear fallback
- freedreno/a6xx: Skip empty tile_setup
- freedreno/a6xx: Fix fd6_draw_vbo() return
- freedreno: Clear gs/tcs/tes state for clear blits
- freedreno/a6xx: Fix MSAA clear
- freedreno: fix fence-fd leak
- ci/deqp-runner: Allow overriding width/height/config
- ci: cherry-pick deqp fix for config choosing
- ci: Enable remaining (non-rotate) mustpass CTS tests
- freedreno/drm: drop bo's dev reference
- freedreno: Don't leak border_color_buf reference
- freedreno/a6xx: Small cleanup
- freedreno/drm: Also clean ring_cache
- freedreno/registers: Add a couple things used on kernel side
- freedreno: Don't leak LRZ bo's
- freedreno: Update import/export traces
- freedreno: Disallow tiled if SHARED and not QCOM_COMPRESSED
- freedreno: Rework GMEM limit init
- freedreno/gmem: Respect max-height limits too
- freedreno: Protect gmem_cache ralloc allocations
- freedreno/ir3: Fix crash in shader compile fail path

Rohan Garg (3):

- anv: Mark anv_dump_{start,finish} as PUBLIC
- gitlab-ci: Test the traces from bgfx
- virgl: Always enable emulated BGRA and swizzling unless specifically told not to

Roland Scheidegger (1):

- gallivm: add InstSimplify pass

Roman Gilg (2):

- vulkan/wsi/x11: add sent image counter
- vulkan/wsi/x11: wait for acquirable images in FIFO mode

Roman Stratiienko (1):

- android: freedreno: Another build fix

Ruijing Dong (1):

- frontends/omx/enc: fix omx h264 encoding force-keyframe-period issue.

Ryan Neph (1):

- virgl: Fixes portal2 binary name in tweak config

Sagar Ghuge (12):

- intel/isl: Drop unnecessary check on 16bpp depth format
- intel/blorp: Conditionally clear full surface depth and stencil
- anv: Factor out dri option initialization code in separate function
- anv: Add driconf option to disable compression for 16bpp format
- anv: Return number of layers/levels attached to anv_image
- anv: Handle compressed stencil buffer transition on Gen12+
- anv: Set stencil_aux_usage flag
- anv: Get aux usage from plane while clearing stencil buffer
- anv: Don't track clear bo for stencil buffer compression
- anv: Return optimal aux state for stencil buffer compression
- anv: Pass correct stencil aux usage during MSAA resolve
- anv: Enable stencil buffer compression on Gen12+

Samuel Iglesias Gonsálvez (14):

- freedreno/layout: add tile_all flag to the layout
- turnip: add environment variable to disable LRZ
- turnip: create LRZ buffer
- turnip: disable LRZ on specific cases
- turnip: disable LRZ writes when blend is enabled
- turnip: disable LRZ depending on fragment changes
- turnip: add LRZ tracking to command buffer state
- turnip: add LRZ valid tracking for secondary command buffers
- turnip: add support to clear LRZ
- turnip: emit correct LRZ fast clear setup
- turnip: disable LRZ on vkCmdClearAttachments()
- turnip: disable LRZ on vkCmdClearattachments() 3D fallback path
- turnip: enable LRZ
- turnip: don't initialize GRAS_LRZ_CNTL/RB_LRZ_CNTL tu6_init_hw()

Samuel Pitoiset (157):

- radv: allow to force-enable LLVM internally for a specific shader stage
- radv: report the spirv-nir logs back to the application
- radv: rework the error function helpers a bit
- radv: report errors back to the application via VK_EXT_debug_report
- radv: report a better error message when QueueWaitIdle() failed
- radv/gfx10: add missing initialization of registers
- radv: limit LATE_ALLOC_GS to prevent a GPU hang on GFX10
- radv: fix emitting the border color pointer on the compute queue
- radv/winsys: add null winsys entries for Sienna Cichild/Navy Flounder
- gitlab-ci: test Fossilize with GFX1030
- aco: do not set valid_mask for POS0 exports on GFX 10.3
- radv: track and report if a logical device is lost
- aco: rename DEBUG_VALIDATE to DEBUG_VALIDATE_IR
- aco: rework the way various compilation/validation errors are reported
- radv,aco: report ACO errors/warnings back via VK_EXT_debug_report
- aco: fix file leak in ra_fail()
- radv: ignore BB labels when splitting the disassembly string
- aco: add ACO_DEBUG=force-waitcnt to emit wait-states
- amd/registers: add missing TBA registers on GFX6-GFX8
- amd/registers: add some SQ_WAVE\_\* register definitions
- aco: add TBA/TMA/TTMP0-11 physical registers definitions
- aco: validate that SMEM operands can use fixed registers
- aco: add a helper for building a trap handler shader
- aco: skip unnecessary compiler pass for the trap handler program
- radv: add a small interface for creating the trap handler shader
- radv: add initial trap handler support with RADV_TRAP_HANDLER=1
- radv: enable the trap handler and configure the shader exceptions
- radv: use the trap handler to detect faulty shaders/instructions
- radv: align the TMA BO size to 256
- radv: allocate the TMA BO into 32-bit addr space
- radv: fix setting EXCP_EN for different shader stages
- radv: print a warning when RADV_TRAP_HANDLER is used
- aco: add ACO_DEBUG=novn,noopt,nosched for debugging purposes
- radv: emit {CB,DB}_RMI_L2_CACHE_CONTROL at framebuffer time
- radv: set BIG_PAGE to improve performance on GFX10.3
- aco: fix wrong source position for constant with nir_op_cube_face_coord
- radv: dump shader stats with VK_KHR_pipeline_executable_properties
- radv: force RADV_DEBUG=syncshaders when RADV_TRACE_FILE is used
- radv: improve reporting faulty pipelines when a GPU hang is detected
- radv: dump GPU info into the hang report
- nir/algebraic: mark some optimizations with fsat(NaN) as inexact
- spirv: fix retrieving dest type for OpFragmentMaskFetchAMD
- radv,aco: disable opts if VK_PIPELINE_CREATE_DISABLE_OPTIMIZATION_BIT
- aco: handle unaligned loads on GFX10.3
- spirv: fix emitting switch cases that directly jump to the merge block
- radv: fix transform feedback crashes if pCounterBufferOffsets is NULL
- radv: add a helper for loading meta descriptors
- radv: do not lower UBO/SSBO access to offsets
- radv: remove useless assignment of MAX_API_VERSION
- radv: bump the advertised patch version to 145
- radv: add VK_KHR_copy_commands2 but leave it disabled
- radv: add support for CmdBlitImage2KHR()
- radv: add support for CmdCopyBuffer2KHR()
- radv: add support for CmdCopyBufferToImage2KHR()
- radv: add support for CmdCopyImage2KHR()
- radv: add support for CmdCopyImageToBuffer2KHR()
- radv: cleanup selecting the hardware resolve path
- radv: add support for CmdResolveImage2KHR()
- radv: advertise VK_KHR_copy_commands2
- radv: set KEEP_TOGETHER_ENABLE if necessary on GFX10+
- radv: add a tweak for PS wave CU utilization for gfx10.3
- ci: adjust RadeonSI rules
- ci: add dEQP-VK.info.device_extensions to the list of skipped tests
- nir/lower_memory_model: return progress when visiting instructions
- nir/lower_memory_model: do not break with global atomic operations
- ac/nir: implement nir_intrinsic_{load,store}_global
- ac/nir: implement nir_intrinsic_global_atomic\_\*
- radv: lower deref operations for global memory for both backends
- ac/llvm: fix invalid IR if image stores are shrinked using the format
- nir/lower_io: change nir_io_add_const_offset_to_base to use bitfield modes
- radeonsi: call nir_io_add_const_offset_to_base only once per shader
- radv/llvm: call nir_lower_io_to_vector with FS to fix array tests
- radv: call nir_io_add_const_offset_to_base for FS outputs
- radv: move lowering of FS outputs outside of ACO
- radv: fix gathering writes_memory for global store/atomic operations
- ac/llvm: fix invalid use of unreachable in ac_build_atomic_rmw()
- ac/nir: fix nir_intrinsic_shared_atomic_fadd
- radv: gather output usage mask from store_output for VS, TES and GS
- radv/aco: lower IO for all stages outside of ACO
- aco: apply the clamped integer addition disassembly workaround for v_add3
- aco/tests: add disassembler tests to reproduce the add3+clamp crash
- ac/llvm: adjust dmask when image stores are shrinked using the format
- ac/nir: remove dead load/store deref code for temporary variables
- radv/llvm: assign driver locations for VS, TCS, TES and GS correctly
- radv/llvm: lower GS IO
- radv/llvm: lower TES IO
- radv/llvm: gather TCS outputs from the output variables
- radv/llvm: lower TCS IO
- radv/llvm: gather VS input usage mask from load_input
- radv/llvm: lower VS IO
- ac/llvm: implement nir_op_unpack_half_2x16_split_{x,y}
- radv/llvm: enable lower_unpack_half_2x16
- ac/nir: remove dead global load/store/atomic derefs code
- ac/nir: remove dead shader IO code
- radeonsi: remove dead code in TCS/TES/GS since const_index is always 0
- ac,radv,radeonsi: remove unused parameters in the shader ABI IO
- radv: remove unused gs.writes_memory in the shader info pass
- radv: remove dead deref code in the shader info pass
- ac/nir,radv: fix invalid IR when loading inline uniform blocks
- nir/constant_folding: init nir_const_value to zero
- aco: bail out if the NIR IO base offset isn't zero
- aco: more uses of nir_get_io_offset_src()
- ac/nir: implement nir_op_fsat
- radv/llvm: do not lower nir_op_fsat
- radv/llvm: remove dead code for 64-bit GS inputs
- aco: dump the program if the disassembler failed
- radv/llvm: do not lower sub
- radv: use the same NIR compiler options for both compiler backends
- radv/llvm: stop assigning driver_location in NIR->LLVM
- ac,radv,radeonsi: stop multiplying driver_location by 4
- ac/nir: pass the variable location to store_tcs_outputs
- radv/llvm: switch to NIR IO assigned locations
- radv/llvm: reduce the ESGS itemsize by using NIR IO assigned locations
- radv/llvm: reduce LDS size for tess by using NIR IO assigned locations
- radv: remove one leftover TODO in the shader info pass
- ac/llvm: move AC_FETCH_FORMAT to non-LLVM code
- radv: replace RADV_ALPHA_ADJUST by AC_FETCH_FORMAT
- radv: move lower_io_arrays_to_elements before lower_io_to_scalar_early
- radv: fix adjusting vertex alpha
- aco: implement missing nir_op_unpack_half_2x16_split_{x,y}_flush_to_zero
- radv/aco: disable NGG GS support because it randomly hangs the GPU
- radv: fix ignoring the vertex attribute stride if set as dynamic
- aco: remove stub lower_wqm() prototype
- aco: remove useless occurences of radv_nir_compiler_options
- aco: remove unused radv_shader.h includes
- radv: move compiler statistics to ACO
- aco: compute the CS workgroup size from the shader NIR info
- aco: adjust an assertion about the wavesize in emit_gfx10_wave64_bpermute()
- radv: fix optimizing needed states if some are marked as dynamic
- ac/nir: implement missing nir_op_pack_half_2x16_split
- radv: report latest extension spec versions
- radv: add missing 'discardtodemote' option in the debug list
- Revert "radv/aco: disable NGG GS support because it randomly hangs the GPU"
- ac/nir: handle non-const offset with txf/txf_ms
- radv: move all NIR pass outside of ACO
- ac/nir: do not sign-extend the result of texop_samples_identical
- radv,aco: fix use of texop_samples_identical in the resolve meta path
- aco: fix determining if LOD is zero for nir_texop_txf/nir_texop_txs
- ac/nir: ignore set_vertex_and_primitive_count intrinsic
- ac/nir: abort when an unknown intrinsic is reached
- ac: add an option to dump GPU info to a file
- radv: add radv_dump_cmd() helper
- radv: dump UMR ring and waves into the hang report
- radv: dump GPU hang report logs into $HOME/radv_dumps_<pid\>
- radv: re-order GPU hang report dumps by usefulness
- radv: replace RADV_TRACE_FILE by RADV_DEBUG=hang
- radv: do not perform a FMASK expand for non-writeable MSAA images
- radv: flush CB before and after FMASK_DECOMPRESS or DCC_DECOMPRESS
- radv: enable VK_AMD_mixed_attachment_samples on GFX6-GFX7
- radv,aco: adjust the sample mask only if per-sample shading is enabled
- radv,aco: optimize computing the sample mask for per-sample shading
- aco: store NIR range analysis data to the isel context
- aco: select v_mul_{hi}_u32_u24 for 24-bit multiplications
- nir/algebraic: distribute imul(iadd(a, b), c) when b and c are constants
- aco: optimize v_and(a, v_subbrev_co(0, 0, vcc)) -\> v_cndmask(0, a, vcc)
- nir/algebraic: optimize bitfield_select(a, b, 0) to iand(a, b)
- aco: fix combining add/sub to b2i if a new dest needs to be allocated

Serge Martin (13):

- clover: set LLVM min version to 8.0.1
- clover: implements clEnqueueMigrateMemObjects
- clover: implements clEnqueueFillImage
- clover: implements clGetKernelArgInfo
- clover: bind sampler_t type to module::argument::sampler
- clover: add CL_KERNEL_ATTRIBUTES for clGetKernelInfo
- clover: implements clGetKernelWorkGroupInfo CL_KERNEL_COMPILE_WORK_GROUP_SIZE
- clover: implements notification callback on program builds
- clover: avoid adding an extra space to compiler options
- clover: move tokenize function to algorithm
- clover: validate image_row_pitch and image_slice_pitch in clEnqueueMapImage
- clover: clCreateImage: calculate image row_pitch and slice_pitch when not provided
- clover: implements clSetContextDestructorCallback

Suresh Guttula (2):

- gallium: update abs_delta segementation parameter
- radeon/vcn : Corrected dpb_size calculation for VP9_2

Tapani Pälli (16):

- anv: add a check for depthStencilState before using it
- anv: null check for buffer before reading size
- anv: take depth in to account in anv_GetImageSubresourceLayout
- mesa: refactor floating point texture fbo completeness check on gles
- mesa: add EXT_color_buffer_half_float plumbing
- mesa/st: enable EXT_color_buffer_half_float when formats supported
- glsl: mark some builtins with correct glsl(es) version check
- iris: remove additional pipe control done before hiz for older gens
- glsl: take EXT_gpu_shader4 in to account when adding round
- gallivm/nir: handle nir_op_flt in lp_build_nir_llvm
- iris: fix the order of src and dst for fence memcpy
- mesa/st: call memobj_destroy only if there is memory imported
- mesa: do not throw \_mesa_problem when invalid enum is used
- mesa/st: use a lock to protect access to variants when updating them
- egl/dri2: fix race between image create and egl_image_target_texture
- iris: initialize shared screen->vtbl only once

Thong Thai (10):

- radeon/vcn: fix jpeg decode for navi10
- frontends/va: Add support for NV12/P010/P016 to vaDeriveImage
- frontends/va: Derive image from interlaced buffers
- frontends/va: Derive image from interlaced buffers in some cases
- gallium: Parse packed HEVC SPS encode header for crop parameters
- radeon: Pass HEVC encode crop parameters to the encoder
- frontends/va: Enabled packed headers for H.264 encoder
- gallium/auxiliary/vl: Include src region in scale_y calculation
- frontends/va/postproc: Un-break field flag
- frontends/va: Return P010/P016 as possible surface formats when encoding

Timothy Arceri (15):

- i965: add support for force_gl_vendor
- disk_cache: move cache dir generation into OS specific helper file
- disk_cache: add disk_cache_enabled() helper
- disk_cache: move index mmap into OS specific helper
- disk_cache: move munmap into an OS specific helper
- disk_cache: move evict_lru_item() to an OS specific helper
- disk_cache: create new helper for writing cache items to disk
- disk_cache: move get_cache_file() to an OS specific helper
- disk_cache: add new OS specific helper disk_cache_evict_item()
- disk_cache: move cache item loading code into disk_cache_load_item() helper
- glsl: don't duplicate state vars as uniforms in the NIR linker
- util/disk_cache: remove unused function param
- glsl: relax rule on varying matching for shaders older than 4.00
- glsl: add extra pp tokens workaround and enable for CoR
- glsl: drop NMS OpenGL workarounds

Timur Kristóf (50):

- aco: Fix unused variable warning by adding ASSERTED.
- aco: Fix convert_to_SDWA when instruction has 3 operands.
- aco: Move README to README-ISA
- aco: Fixup markdown formatting of the README-ISA.
- aco: Add README which explains about what ACO is and how it works.
- aco: Fix emit_boolean_exclusive_scan in wave32 mode.
- aco: Clean up emit_mbcnt.
- aco: Add base argument to emit_mbcnt.
- aco: Use NIR IO semantics for tess factor IO locations.
- radv/aco: Set I/O variable locations outside ACO.
- nir: Add ability to count emitted GS primitives.
- nir: Add ability to count emitted GS vertices per primitive.
- nir: Add ability to overwrite incomplete GS primitives.
- nir: Count vertices per stream.
- nir: Add ability to count primitives per stream.
- radv/aco: Use new GS lowering options for ACO with NGG GS.
- aco: Clarify missing export error message in assembler.
- aco: Extract lanecount_to_mask to a separate function.
- aco: Extract thread_id_in_threadgroup to a separate function.
- aco: Use thread_id_in_threadgroup helper for ES outputs.
- aco: Optimize thread_id_in_threadgroup when there is just one wave.
- aco: Add wave-specific opcode for s_lshl and s_flbit.
- aco/ngg: Refactor gs_alloc_req in preparation for NGG GS.
- aco/ngg: Refactor ngg_emit_prim_export in preparation for NGG GS.
- aco/ngg: Make primitive export packing less prone to error.
- aco/ngg: Clean up and reorganize NGG VS/TES code.
- aco/ngg: Allow NGG GS to store ES outputs.
- aco/ngg: Allow NGG GS to load per-vertex GS inputs.
- aco/ngg: Allow NGG GS to create VS exports.
- aco/ngg: Setup NGG GS.
- aco/ngg: Create LDS layout for NGG GS.
- aco/ngg: Implement workgroup reduce / exclusive scan for NGG GS.
- aco/ngg: Implement NGG GS output.
- aco/ngg: Place workgroup barrier outside control flow for NGG GS.
- aco/ngg: Add shader query support to NGG GS.
- radv/aco: Enable NGG GS by default.
- aco/ngg: Use more efficient LDS layout to help reduce bank conflicts.
- aco/ngg: Allocate NGG GS space early for const vertex/primitive counts.
- aco/ngg: Calculate workgroup size of NGG shaders.
- nir: Emit set_vertex_and_primitive_count for inactive streams.
- aco/ngg: Add assertion to make sure we always know the vertex count.
- aco: Assert that workgroup barriers are not used inappropriately.
- aco/ngg: Put shader query reduction operand into a VGPR.
- aco: Add some validation for PSEUDO_REDUCTION instructions.
- aco: Make emitting reduction instructions a bit more convenient.
- aco: Add a few assertions about LDS usage.
- aco/ngg: Export a zero-area triangle when primitive count is 0.
- aco/ngg: Incorporate GS invocations into workgroup size calculation.
- aco/optimizer: Only set scc_needed when it is actually needed.
- aco: Fix NGG GS assert failure from the WG scan.

Tomeu Vizoso (16):

- Revert "CI: temp disable t720/t760 jobs."
- Revert "CI: Disable Panfrost T720/T760 CI"
- ci: Split traces.yml file per driver
- ci: Test Panfrost with more traces
- ci: Fix URL to imagediff page in traces dashboard
- ci: Update kernel used in LAVA to 5.8-based drm-misc
- ci: Run deqp-gles2 on RadeonSI
- ci: Run deqp-gles3 and deqp-gles31 on RadeonSI
- ci: Update kernel for LAVA
- ci: Test Panfrost on Khadas VIM3 boards
- ci: Disable pm_runtime and max clocks in LAVA jobs
- ci: Unskip fragment_ops tests on Bifrost
- virgl: Correctly align size of blobs
- ci: Update kernel for LAVA to 5.10-rc2 plus patches
- ci: Update dEQP skips and fails for Bifrost on G52
- ci: Distribute ADMGPU driver to LAVA as a module

Tony Wasserka (26):

- nir/lower_idiv: Port recent LLVM fixes to emit_udiv
- radv: Fix various non-critical integer overflows
- aco: Fix integer overflows when emitting parallel copies during RA
- amd/common: Fix various non-critical integer overflows
- aco/isel: Turn the function template emit_load into a proper function
- aco/isel: Simplify nested branching code
- aco/isel: Consistently use references for input parameters in emit_load
- aco/isel: Remove unused definitions
- aco/isel: Move context initialization code to a dedicated file
- aco/isel: Move add_startpgm to aco_instruction_selection.cpp
- aco/isel: Compile all helper functions with static linkage
- nir: Fix undefined behavior due to signed integer multiplication overflows
- nir: Fix unaligned pointer access
- radv: Avoid calling memcpy with null pointers
- radv: Fix unaligned memory access when writing specialization map entries
- radv: Clean up CreateDescriptorSetLayout
- radv: Respect alignment requirements in descriptor set layouts
- aco/isel: Fix out-of-bounds write in visit_load_input
- aco/isel: Always export position data from VS/NGG
- aco/isel: Remove some dead code
- aco/isel: Remove now unused VS-related code from create_null_export
- aco: Use strong typing to model SW<->HW stage mappings
- aco: Clean up symbol names and comments related to NGG
- aco/isel: Miscellaneous cleanups using the new Stage API
- aco/ra: Fix counting of subdword variables in get_reg_create_vector
- aco: Fix format string used when raising validation errors

Veerabadhran Gopalakrishnan (1):

- frontends/va: Added protected playback support for VP9

Vinson Lee (64):

- util: Fix memory leaks in unit test.
- meson: Fix lmsensors warning message.
- radv/winsys: Fix memory leak.
- vulkan: Fix memory leaks.
- panfrost: Fix gnu-empty-initializer errors.
- freedreno: Fix file descriptor leak.
- svga: Fix unused printf argument.
46087ec681f3Smrg- spirv: Initialize spirv_test member shader. 46097ec681f3Smrg- nv50/ir: Add fallthrough statement. 46107ec681f3Smrg- nv50/ir: Remove duplicate mask assignment. 46117ec681f3Smrg- ac/llvm: Fix nonportable sizeof. 46127ec681f3Smrg- freedreno: Check file descriptor before write. 46137ec681f3Smrg- nv50/ir: Initialize Converter members. 46147ec681f3Smrg- libgl-gdi: Fix unused-variable warnings. 46157ec681f3Smrg- disk_cache: Fix filename leak on error path. 46167ec681f3Smrg- radesonsi: Remove unsigned comparison to zero. 46177ec681f3Smrg- panfrost: Delete debug allocated syncobj. 46187ec681f3Smrg- turnip: Release bo_mutex lock before potential error path. 46197ec681f3Smrg- pan/bi: Fix typo. 46207ec681f3Smrg- glsl: Initialize ir_constant member const_elements in all constructors. 46217ec681f3Smrg- r600/sfn: Initialize GPRValue member m_pin_to_channel. 46227ec681f3Smrg- gallium/dri2: Move image->texture assignment after image NULL check. 46237ec681f3Smrg- panfrost: Remove extra printf arguments. 46247ec681f3Smrg- anv: Check file descriptor before closing. 46257ec681f3Smrg- aco: Initialize mad_info member literal_idx. 46267ec681f3Smrg- gallium/swr: Remove unreachable code. 46277ec681f3Smrg- pan/mdg: Fix memory leak on error path. 46287ec681f3Smrg- lima: Print usage if --help is any of the arguments. 46297ec681f3Smrg- radv: Fix asserts using assign instead of compare. 46307ec681f3Smrg- nv50/ir: Initialize Source members. 46317ec681f3Smrg- freedreno: Move rsc NULL check to before rsc dereferences. 46327ec681f3Smrg- intel/vec4: Remove leftover code from Gen8+ removal. 46337ec681f3Smrg- glsl: Initialize ast_node member field location.path in constructor. 46347ec681f3Smrg- meson: Use more portable compiler option -std. 46357ec681f3Smrg- swr/rasterizer: Remove BuilderGfxMem member mpTrackMemAccessFuncTy. 46367ec681f3Smrg- util/xmlconfig: Initialize xmlconfig member options in constructor. 46377ec681f3Smrg- svga: Remove unused printf argument. 
46387ec681f3Smrg- glsl: Initialize ir_to_mesa_visitor members in constructor. 46397ec681f3Smrg- v3dv: Fix assert using assign instead of compare. 46407ec681f3Smrg- glsl: Initialize lower_ubo_reference_visitor members in constructor. 46417ec681f3Smrg- glsl: Initialize add_uniform_to_shader member var in constructor. 46427ec681f3Smrg- v3dv: Remove unsigned comparison to zero. 46437ec681f3Smrg- v3dv: Initialize time before usage by free_stale_bos. 46447ec681f3Smrg- panfrost: Fix stride for AFBC_FORMAT_MOD_BLOCK_SIZE_32x8. 46457ec681f3Smrg- v3dv: Fix assert using assign instead of compare. 46467ec681f3Smrg- glsl: Initialize ir_if_to_cond_assign_visitor members in constructor. 46477ec681f3Smrg- glsl: Initialize lower_shared_reference_visitor members. 46487ec681f3Smrg- scons/windows: Support build with LLVM 11. 46497ec681f3Smrg- amd/addrlib: Initialize Gfx10Lib members in constructor. 46507ec681f3Smrg- Fix VMware capitalization. 46517ec681f3Smrg- glsl: Update loop_terminator constructor to accept parameters. 46527ec681f3Smrg- draw: Remove draw_install_aaline_stage dead code. 46537ec681f3Smrg- os: Fix open result check. 46547ec681f3Smrg- gallium: Remove duplicate resource variable. 46557ec681f3Smrg- tgsi: Initialize tgsi_declaration_dimension padding. 46567ec681f3Smrg- radesonsi: Remove unnecessary shader->selector NULL check. 46577ec681f3Smrg- amd/addrlib: Add missing va_end. 46587ec681f3Smrg- v3dv: Remove unsigned comparison to zero. 46597ec681f3Smrg- st/nine: Remove unnecessary NULL check. 46607ec681f3Smrg- turnip: Fix file descriptor return. 46617ec681f3Smrg- vdpau: Add missing printf format specifier. 46627ec681f3Smrg- frontends/va: Fix \*num_entrypoints check. 46637ec681f3Smrg- clover/spirv: Add missing break for SpvOpExecutionMode case. 46647ec681f3Smrg- turnip: Close sync_fd only if it is a valid file descriptor. 
46657ec681f3Smrg 46667ec681f3SmrgWoody Chow (1): 46677ec681f3Smrg 46687ec681f3Smrg- st/mesa: Fix EGLImageTargetTexture2D for GL_TEXTURE_2D 46697ec681f3Smrg 46707ec681f3SmrgYevhenii Kolesnikov (1): 46717ec681f3Smrg 46727ec681f3Smrg- nir/large_constants: only search for constant duplicates 46737ec681f3Smrg 46747ec681f3SmrgYogesh Mohan Marimuthu (1): 46757ec681f3Smrg 46767ec681f3Smrg- src/mesa: add GL_NV_half_float extension support (v2) 46777ec681f3Smrg 46787ec681f3Smrgjzielins (4): 46797ec681f3Smrg 46807ec681f3Smrg- gallium/swr: Fix compilation with LLVM 12 46817ec681f3Smrg- gallium/swr: Fix compilation TCS/TES compilation issues 46827ec681f3Smrg- swr: Fix crashes on non-AVX hardware 46837ec681f3Smrg- swr: Use ElemenCount constructor for LLVM 11 46847ec681f3Smrg 46857ec681f3Smrgn00b7 (1): 46867ec681f3Smrg 46877ec681f3Smrg- v3dv/device: handle primary nodes for newer kernels 46887ec681f3Smrg 46897ec681f3Smrgorbea (1): 46907ec681f3Smrg 46917ec681f3Smrg- spirv/vtn_cfg.c: Include util/debug.h for env_var_as_boolean. 46927ec681f3Smrg 46937ec681f3Smrgzhu yong (1): 46947ec681f3Smrg 46957ec681f3Smrg- meson: add support for loongson's mips/mips64 arch. 4696