17ec681f3SmrgMesa 20.3.0 Release Notes / 2020-12-03
27ec681f3Smrg======================================
37ec681f3Smrg
47ec681f3SmrgMesa 20.3.0 is a new development release. People who are concerned
57ec681f3Smrgwith stability and reliability should stick with a previous release or
67ec681f3Smrgwait for Mesa 20.3.1.
77ec681f3Smrg
87ec681f3SmrgMesa 20.3.0 implements the OpenGL 4.6 API, but the version reported by
97ec681f3SmrgglGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
107ec681f3SmrgglGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
117ec681f3SmrgSome drivers don't support all the features required in OpenGL 4.6. OpenGL
127ec681f3Smrg4.6 is **only** available if requested at context creation.
137ec681f3SmrgCompatibility contexts may report a lower version depending on each driver.
147ec681f3Smrg
157ec681f3SmrgMesa 20.3.0 implements the Vulkan 1.2 API, but the version reported by
167ec681f3Smrgthe apiVersion property of the VkPhysicalDeviceProperties struct
177ec681f3Smrgdepends on the particular driver being used.
187ec681f3Smrg
197ec681f3SmrgSHA256 checksum
207ec681f3Smrg---------------
217ec681f3Smrg
227ec681f3Smrg::
237ec681f3Smrg
247ec681f3Smrg    2999738e888731531cd62b27519fa37566cc0ea2cd7d4d97f46abaa3e949c630  mesa-20.3.0.tar.xz
257ec681f3Smrg
267ec681f3Smrg
277ec681f3SmrgNew features
287ec681f3Smrg------------
297ec681f3Smrg
307ec681f3Smrg- GL 4.5 on llvmpipe
317ec681f3Smrg
327ec681f3Smrg- GL_INTEL_blackhole_render on radeonsi
337ec681f3Smrg
347ec681f3Smrg- GL_NV_copy_depth_to_color for NIR
357ec681f3Smrg
367ec681f3Smrg- GL_NV_half_float
377ec681f3Smrg
387ec681f3Smrg- GL_NV_shader_atomic_int64 on radeonsi
397ec681f3Smrg
407ec681f3Smrg- EGL_KHR_swap_buffers_with_damage on X11 (DRI3)
417ec681f3Smrg
427ec681f3Smrg- VK_PRESENT_MODE_FIFO_RELAXED on X11
437ec681f3Smrg
447ec681f3Smrg- GLX_EXT_swap_control for DRI2 and DRI3
457ec681f3Smrg
467ec681f3Smrg- GLX_EXT_swap_control_tear for DRI3
477ec681f3Smrg
487ec681f3Smrg- VK_KHR_copy_commands2 on RADV
497ec681f3Smrg
507ec681f3Smrg- VK_KHR_shader_terminate_invocation on RADV
517ec681f3Smrg
527ec681f3Smrg- NGG GS support in ACO
537ec681f3Smrg
547ec681f3Smrg- VK_KHR_shader_terminate_invocation on ANV
557ec681f3Smrg
567ec681f3Smrg- driconf: add glx_extension_override
577ec681f3Smrg
587ec681f3Smrg- driconf: add indirect_gl_extension_override
597ec681f3Smrg
607ec681f3Smrg- VK_AMD_mixed_attachment_samples on RADV (GFX6-GFX7).
617ec681f3Smrg
627ec681f3Smrg- GL_MESA_pack_invert on r100 and vieux
637ec681f3Smrg
647ec681f3Smrg- GL_ANGLE_pack_reverse_row_order
657ec681f3Smrg
667ec681f3Smrg- VK_EXT_shader_image_atomic_int64 on RADV
677ec681f3Smrg
687ec681f3Smrg- None
697ec681f3Smrg
707ec681f3Smrg
717ec681f3SmrgBug fixes
727ec681f3Smrg---------
737ec681f3Smrg
747ec681f3Smrg- \[icl,tgl][iris][i965][regression][bisected\] piglit failures
757ec681f3Smrg- shader-db valgrind error
767ec681f3Smrg- \[AMDGPU NAVI 5700xt\] Large parts of the Blender viewport does not render correctly if an object with hair is moved.
777ec681f3Smrg- \[aco\] problem compiling compute pipeline
787ec681f3Smrg- zink: regression after !7606
797ec681f3Smrg- glcpp test 084-unbalanced-parentheses fails with bison 3.6.y
807ec681f3Smrg- zink+radv: corruption on pre-game menu in quake3
817ec681f3Smrg- panfrost massive glitches apitrace opengl 2.1
827ec681f3Smrg- \[radeonsi\] After 549ae5f84375dfadb86cfd465f0103acfae3249f commit Firefox Nightly Asan begins crashes
837ec681f3Smrg- Amber test NIR validation failed after spirv_to_nir
847ec681f3Smrg- zink: add detection for wsi_memory_allocate_info usage
857ec681f3Smrg- Follow-up from "nir,spirv: Add generic pointers support"
867ec681f3Smrg- v3d  GL_ARB_vertex_array_bgra support
877ec681f3Smrg- iris: glClear with FBO imported from DMA-BUF doesn't work
887ec681f3Smrg- Fast-clears of GL_ALPHA16 textures are broken on TGL
897ec681f3Smrg- NV50_PROG_USE_NIR=1 doesn't work for piglit/bin/pbo-teximage ?
907ec681f3Smrg- Follow-up from "st/mesa: Use nir-to-tgsi for builtins if the driver needs TGSI"
917ec681f3Smrg- \[spirv-fuzz\] Shader causes an assertion failure in nir_opt_large_constants
927ec681f3Smrg- Amber test validate_phi_src
937ec681f3Smrg- Regnum Online UBO break after game update
947ec681f3Smrg- Current mesa git fails to build in multilib environment?
957ec681f3Smrg- radv/aco: Vertex explosion on RPCS3
967ec681f3Smrg- llvmpipe-cl should not run for other drivers
977ec681f3Smrg- Factorio v1.0 - Linux native - 64 bit - OpenGL/radeonsi - completely broken rendering
987ec681f3Smrg- Gnome 3.38 with Xwayland has screen corruption for X11 apps.
997ec681f3Smrg- st/va fails to build on old libva in mesa git
1007ec681f3Smrg- sp_state_shader.c:146: undefined reference to \`nir_to_tgsi'
1017ec681f3Smrg- anv: dEQP-VK.api.copy_and_blit.core.blit_image.all_formats.color.3d\* failures
1027ec681f3Smrg- anv: dEQP-VK.api.copy_and_blit.core.blit_image.all_formats.color.3d\* failures
1037ec681f3Smrg- RADV: Death Stranding glitchy sky rendering
1047ec681f3Smrg- Crash in glDrawArrays on Intel iris
1057ec681f3Smrg- GLX_OML_swap_method not fully supported
1067ec681f3Smrg- deinterlace_vaapi=rate=field does not double output's actual frame rate on AMD
1077ec681f3Smrg- Steam game Haydee leans on implementation-dependent behavior
1087ec681f3Smrg- ANV: Support 1 million update-after-bind descriptors
1097ec681f3Smrg- zink: crash in Blender on start-up
1107ec681f3Smrg- vc4 in 20.2-rc has regression causing app to crash
1117ec681f3Smrg- \[RADV\] broken stencil behaviour when using extended dynamic stencil state
1127ec681f3Smrg- \[RADV/ACO\] Star Citizen Lighting/Shadow Issue
1137ec681f3Smrg- \[RADV\] Some bindings seem broken with VK_DYNAMIC_STATE_VERTEX_INPUT_BINDING_STRIDE_EXT
1147ec681f3Smrg- \[RADV/ACO\] ACO build error about SMEM operands
1157ec681f3Smrg- Graphics corruption in Super Mega Baseball 2 with RADV on Navi
1167ec681f3Smrg- RADV ACO - ground line corruption in Path of Exile with Vulkan renderer
1177ec681f3Smrg- omx/tizonia build broken with latest mesa git
1187ec681f3Smrg- Request: VK_EXT_transform_feedback on Intel Gen 7
1197ec681f3Smrg- iris: Regression in deqp const_write tests
1207ec681f3Smrg- \[hsw][bisected][regression\] gpu hangs on dEQP-VK.subgroups.(shuffle|quad) tests
1217ec681f3Smrg- \[RADV/LLVM/ACO\] Serious Sam 4 crashes after first cutscene with ACO backend + flickering black spots sprout up everywhere
1227ec681f3Smrg- TGL B0 Stepping gpu hangs on many dEQP-VK.subgroups.quad nonconst tests
1237ec681f3Smrg- \[machines without AVX2/F16C][bisected\] X server crash, wflinfo crash in mesa CI
1247ec681f3Smrg- nir: Mesa regression on Compute shader
1257ec681f3Smrg- radv, aco: dEQP-VK.glsl.atomic_operations.*_fragment_reference regressed
1267ec681f3Smrg- Commit c6c1fa9a263880 causes corruption in Steam UI
1277ec681f3Smrg- \[spirv-fuzz\] Shader generates a wrong image
1287ec681f3Smrg- Running Amber test leads to VK_DEVICE_LOST
1297ec681f3Smrg- \[Regression][Bisected][20.2][radeonsi\] American Truck Simulator continually allocates memory until OOM
1307ec681f3Smrg- \[radeonsi\] bottom mips of height=1 2D texture is uninitialised after upload
1317ec681f3Smrg- Missing terrain in Total War:  Warhammer
1327ec681f3Smrg- anv: dEQP-VK.robustness.robustness2.\* failures on gen12
1337ec681f3Smrg- AMD VAAPI encoding - applying filters introduces garbled line at the bottom
1347ec681f3Smrg- AMD VAAPI HEVC encoding not working correctly on Polaris
1357ec681f3Smrg- \[RADV\] Problems reading primitive ID in fragment shader after tessellation
1367ec681f3Smrg- Massive memory leak (at least AMD, others unknown)
1377ec681f3Smrg- Substance Painter 6.1.3 black glitches on Radeon RX570
1387ec681f3Smrg- \[ivb,hsw,byt,bsw][i965][bisected\] anv_reloc_list_add: Assertion failure
1397ec681f3Smrg- vkCmdCopyImage broadcasts subsample 0 of MSAA src into all subsamples of dst on RADV
1407ec681f3Smrg- assert(left \<= -1 && top \<= -1 && right \>= 1 && bottom \>= 1) fails in si_emit_guardband
1417ec681f3Smrg- Crash in ruvd_end_frame when calling vaBeginPicture/vaEndPicture without rendering anything
1427ec681f3Smrg- Release signing key is not readily available
1437ec681f3Smrg- \[iris][bisected\] piglit.spec.nv_copy_depth_to_color.nv_copy_depth_to_color failures
1447ec681f3Smrg- VAAPI vaDeriveImage returns VA_STATUS_ERROR_OPERATION_FAILED
1457ec681f3Smrg- X-Plane 11 Installer crashes on startup since \`glsl: declare gl_Layer/gl_ViewportIndex/gl_ViewportMask as vs builtins\`
1467ec681f3Smrg- piglit spec/glsl-1.50/execution/geometry/clip-distance-vs-gs-out.shader_test fails on Iris
1477ec681f3Smrg- Amber test opt_peel_loop_initial_if: Assertion failed
1487ec681f3Smrg- builder_misc.cpp:137:55: error: ‘get’ is not a member of ‘llvm::ElementCount’
1497ec681f3Smrg- AVX instructions leak outside of CPU feature check and cause SIGILL
1507ec681f3Smrg- Dirt Rally: Flickering glitches on certain foliage since Mesa 20.1.0 caused by MSAA
1517ec681f3Smrg- Horizon Zero Dawn graphics corruption with with radv
1527ec681f3Smrg- Crusader Kings 3 Crashes at start since commit with !6472
1537ec681f3Smrg- pan_resource.c:733:38: error: use of GNU empty initializer extension \[-Werror,-Wgnu-empty-initializer\]
1547ec681f3Smrg- \[BRW\] WRC 5 asserts with gallium nine and iris.
1557ec681f3Smrg- ci/bare-metal: POWER_GOOD detection broken with reboot rework
1567ec681f3Smrg- radv: Corruption in "The Surge 2"
1577ec681f3Smrg- \[RADV\] Detroit: Become Human Demo game lock-ups with RADV
1587ec681f3Smrg- Road Redemption certain  graphic effects rendered white color
1597ec681f3Smrg- gen_state_llvm.h:54:99: error: invalid conversion from ‘int’ to ‘const llvm::VectorType*’ \[-fpermissive\]
1607ec681f3Smrg- Using a shared dEQP build script
1617ec681f3Smrg- vulkan/wsi/x11: deadlock with Xwayland when compositor holds multiple buffers
1627ec681f3Smrg- \[RADV/ACO\] Death Stranding cause a GPU hung (\*ERROR\* Waiting for fences timed out!)
1637ec681f3Smrg- lp_bld_init.c:172:7: error: implicit declaration of function ‘LLVMAddConstantPropagationPass’; did you mean ‘LLVMAddCorrelatedValuePropagationPass’? \[-Werror=implicit-function-declaration\]
1647ec681f3Smrg- ci: Use lld or gold instead of ld.bfd
1657ec681f3Smrg- Intel Vulkan driver crash with alpha-to-coverage
1667ec681f3Smrg- radv: blitting 3D images with linear filter
1677ec681f3Smrg- \[ACO\] Compiling pipelines from RPCS3's shader interpreter spins forever in ACO code
1687ec681f3Smrg- \[regression][bisected\] nir: nir_intrinsic_io_semantics assert failures in piglit
1697ec681f3Smrg- error: ‘static_assert’ was not declared in this scope
1707ec681f3Smrg- Intel Vulkan driver assertion with small xfb buffer
1717ec681f3Smrg- \<<MESA crashed>\> Array Index Out of Range with Graphicsfuzz application
1727ec681f3Smrg- EGL_KHR_swap_buffers_with_damage support on X11
1737ec681f3Smrg- \[spirv-fuzz\] SPIR-V parsing failed "src->type->type == dest->type->type"
1747ec681f3Smrg- radeonsi: radeonsi crashes in Chrome on chromeos
1757ec681f3Smrg- \[RADV\] commit d19bc94e4eb94 broke gamescope with Navi
1767ec681f3Smrg- 4e3a7dcf6ee4946c46ae8b35e7883a49859ef6fb breaks Gamescope showing windows properly.
1777ec681f3Smrg- anv: crashes in CTS test dEQP-VK.subgroups.*.framebuffer.*_tess_eval
1787ec681f3Smrg- Intel Vuikan (anv) crash in copy_non_dynamic_state() when using validation layer
1797ec681f3Smrg- \[tgl][bisected][regression\] GPU hang in The Witcher 3
1807ec681f3Smrg- Mafia 3: Trees get rendered incorrectly
1817ec681f3Smrg- radv: dEQP-VK.synchronization.op.multi_queue.timeline_semaphore.write_clear_attachments_*_concurrent fail when forcing DCC.
1827ec681f3Smrg- Crash on GTA 5 through proton 5.0.9 and GE versions
1837ec681f3Smrg- Flickering textures in "Divinity Original Sin Enhanced Edition"
1847ec681f3Smrg- Mesa 20.2.0-rc1 fails to build for AMD
1857ec681f3Smrg- Assertion failure compiling shader from Zigguart
1867ec681f3Smrg
1877ec681f3Smrg
1887ec681f3SmrgChanges
1897ec681f3Smrg-------
1907ec681f3Smrg
1917ec681f3SmrgAaron Watry (1):
1927ec681f3Smrg
1937ec681f3Smrg- clover: Fix incorrect error check in clGetSupportedImageFormats
1947ec681f3Smrg
1957ec681f3SmrgAdam Jackson (22):
1967ec681f3Smrg
1977ec681f3Smrg- drisw: Port the MIT-SHM check to XCB
1987ec681f3Smrg- vulkan: Don't pointlessly depend on libxcb-dri2
1997ec681f3Smrg- docs: Stop claiming to implement OpenVG
2007ec681f3Smrg- mesa: Fix GL_CLAMP handling in glSamplerParameter
2017ec681f3Smrg- mesa: Generate more errors from GetSamplerParameter
2027ec681f3Smrg- wsi/x11: Hook up VK_PRESENT_MODE_FIFO_RELAXED_KHR
2037ec681f3Smrg- glx: Use GLX_FUNCTION2 only for actually aliased function names
2047ec681f3Smrg- glx: Collect all the non-applegl extensions in the GetProcAddress table
2057ec681f3Smrg- glx: Reject glXSwapIntervalMESA greater than INT_MAX
2067ec681f3Smrg- glx: Implement GLX_EXT_swap_control for DRI2 and DRI3
2077ec681f3Smrg- glx/dri3: Implement GLX_EXT_swap_control_tear
2087ec681f3Smrg- glx: Allow depth-30 pbuffers to work without a depth-30 pixmap format
2097ec681f3Smrg- wsi/x11: Create a present queue for VK_PRESENT_MODE_FIFO_RELAXED_KHR
2107ec681f3Smrg- glx: move \__glXGetUST into the DRI1 code
2117ec681f3Smrg- glx: Delegate the core of glXGetScreenDriver to the GLX screen vtable
2127ec681f3Smrg- glx: Move glXGet{ScreenDriver,DriverConfig} to common code
2137ec681f3Smrg- docs/features: Update extensions for softpipe
2147ec681f3Smrg- docs/features: Update extensions for swr
2157ec681f3Smrg- loader: Print dlerror() output in the failure message
2167ec681f3Smrg- mesa: Enable GL_MESA_pack_invert unconditionally
2177ec681f3Smrg- mesa: Implement GL_ANGLE_pack_reverse_row_order
2187ec681f3Smrg- docs: Add MESA_pack_invert and ANGLE_pack_reverse_row_order
2197ec681f3Smrg
2207ec681f3SmrgAlejandro Piñeiro (147):
2217ec681f3Smrg
2227ec681f3Smrg- v3d/compiler: add v3dv_prog_data_size helper
2237ec681f3Smrg- v3d/packet: fix typo on Set InstanceID/PrimitiveID packet
2247ec681f3Smrg- v3d: set instance id to 0 at start of tile
2257ec681f3Smrg- broadcom/qpu_instr: wait is not a read or write vpm instruction
2267ec681f3Smrg- nir/lower_io: don't reduce range if parent length is zero
2277ec681f3Smrg- broadcom/simulator: update to a newer simulator
2287ec681f3Smrg- broadcom/common: increase V3D_MAX_TEXTURE_SAMPLERS, add specific OpenGL limit
2297ec681f3Smrg- broadcom/compiler: add V3D_DEBUG_RA option
2307ec681f3Smrg- v3dv: add v3d vulkan driver skeleton
2317ec681f3Smrg- gitlab-ci: add broadcom vulkan driver
2327ec681f3Smrg- v3dv: add support for VK_EXT_debug_report
2337ec681f3Smrg- v3dv: memory management stubs
2347ec681f3Smrg- v3dv: add support to use v3d simulator
2357ec681f3Smrg- v3dv/debug: plug v3d_debug
2367ec681f3Smrg- v3dv/debug: add v3dv_debug
2377ec681f3Smrg- v3dv: stubs for graphics pipeline methods
2387ec681f3Smrg- v3dv: Create/DestroyShaderModule implementation
2397ec681f3Smrg- v3d/compiler: num_tex_used on v3d_key
2407ec681f3Smrg- v3dv/format: add v3dv_get_format_swizzle
2417ec681f3Smrg- v3dv: initial CreateGraphicsPipeline/DestroyPipeline implementation
2427ec681f3Smrg- v3dv: initial stub for CmdBindPipeline
2437ec681f3Smrg- v3dv: CmdSetViewport and CmdSetScissor implementation
2447ec681f3Smrg- v3dv/pipeline: start to track dynamic state
2457ec681f3Smrg- v3dv/cmd_buffer: init command buffer dynamic state during pipeline bind
2467ec681f3Smrg- v3dv/cmd_buffer: emit Scissor packets
2477ec681f3Smrg- v3dv/cmd_buffer: emit Viewport packets
2487ec681f3Smrg- v3dv/cmd_buffer: emit shader_state packets
2497ec681f3Smrg- v3dv/cmd_buffer: start to emit draw packets
2507ec681f3Smrg- v3dv/cmd_buffer: add shader source bos to cmd_buffer
2517ec681f3Smrg- v3dv: clif format dumping support
2527ec681f3Smrg- v3dv/cmd_buffer: cache viewport translate/scale
2537ec681f3Smrg- v3dv: add v3dv_write_uniforms
2547ec681f3Smrg- v3dv/cmd_buffer: start jobs with CmdBeginRenderPass
2557ec681f3Smrg- v3d/compiler: update uses_vid/uses_iid check
2567ec681f3Smrg- v3dv/cmd_buffer: emit CFG_BITS
2577ec681f3Smrg- v3dv: partial prepack of the gl_shader_state_record
2587ec681f3Smrg- v3dv: prepack VCM_CACHE_SIZE
2597ec681f3Smrg- v3dv/pipeline: lower fs/vs inputs/outputs
2607ec681f3Smrg- v3dv: vertex input support
2617ec681f3Smrg- v3dv: provide default values for input attributes
2627ec681f3Smrg- v3dv/format: add R32G32B32A32_SFLOAT format
2637ec681f3Smrg- v3dv: stubs for Create/DestroyPipelineCache
2647ec681f3Smrg- v3d/cmd_buffer: emit flat_shade/noperspective/centroid flags
2657ec681f3Smrg- v3dv/pipeline: adding some nir-based linking
2667ec681f3Smrg- v3dv/bo: add a bo name
2677ec681f3Smrg- v3dv: debug nir shader also after spirv_to_nir
2687ec681f3Smrg- v3dv: initial descriptor set support
2697ec681f3Smrg- v3dv/descriptor_set: support for array of ubo/ssbo
2707ec681f3Smrg- v3dv/pipeline: null check for pCreateInfo->pDepthStencilState
2717ec681f3Smrg- v3dv: no need to manually add assembly bo to the job
2727ec681f3Smrg- v3d/compiler: handle GL/Vulkan differences in uniform handling
2737ec681f3Smrg- v3dv/cmd_buffer: support for push constants
2747ec681f3Smrg- v3dv/descriptor: support for dynamic ubo/ssbo
2757ec681f3Smrg- v3dv/pipeline: revamp nir lowering/optimizations passes
2767ec681f3Smrg- v3dv/pipeline: clean up io lowering
2777ec681f3Smrg- v3dv/descriptor: take into account pPushConstantRanges
2787ec681f3Smrg- v3dv/device: tweak ssbo/ubo device limits
2797ec681f3Smrg- v3dv/cmd_bufffer: rename and split emit_graphics_pipeline
2807ec681f3Smrg- v3dv/cmd_buffer: push constants not using descriptor anymore
2817ec681f3Smrg- v3dv/uniforms: cleaning up, moving udpate ubo/ssbo uniforms to a function
2827ec681f3Smrg- v3dv/pipeline: unify local allocator name
2837ec681f3Smrg- v3dv/pipeline: sampler lowering
2847ec681f3Smrg- v3dv/descriptor_set: added support for samplers
2857ec681f3Smrg- v3dv/uniforms: filling up QUNIFORM_TMU_CONFIG_P0/P1
2867ec681f3Smrg- v3dv/pipeline: add support for shader variants
2877ec681f3Smrg- v3dv/cmd_buffer: update shader variants at CmdBindDescriptorSets/CmdBindPipeline
2887ec681f3Smrg- v3dv/cmd_buffer: allow return in the middle of variant update if needed
2897ec681f3Smrg- v3dv/pipeline: fix adding texture/samplers array elements to texture/sampler map
2907ec681f3Smrg- v3dv/descriptor_set: support for immutable samplers
2917ec681f3Smrg- v3dv/descriptor: move descriptor_map_get_sampler, add and use get_image_view
2927ec681f3Smrg- v3dv/descriptor_set: combine texture and sampler indices
2937ec681f3Smrg- v3dv/descriptor: handle not having a sampler when combining texture and sampler id
2947ec681f3Smrg- v3dv/uniforms: fill up texture size-related uniforms
2957ec681f3Smrg- v3dv/format: expose correctly if a texture format is filterable
2967ec681f3Smrg- v3dv: handle texture/sampler shader state bo failure with OOM error
2977ec681f3Smrg- v3dv: properly return OOM error during pipeline creation
2987ec681f3Smrg- v3dv/meta-copy: ensure valid height/width with compressed formats
2997ec681f3Smrg- v3dv/cmd_buffer: move variant checking to CmdDraw
3007ec681f3Smrg- v3dv/pipeline: support for specialization constants
3017ec681f3Smrg- v3dv/descriptor: add general bo on descriptor pool
3027ec681f3Smrg- v3dv/descriptor: use descriptor pool bo for image/samplers
3037ec681f3Smrg- v3dv/meta-copy: add uintptr_t casting to avoid warning
3047ec681f3Smrg- v3dv/bo: adding a BO cache
3057ec681f3Smrg- v3dv/bo: add a maximum size for the bo_cache and a envvar to configure it
3067ec681f3Smrg- v3dv/bo: add dump stats info
3077ec681f3Smrg- v3d/tex: avoid to ask back for a sampler state if not needed
3087ec681f3Smrg- v3dv/pipeline: iterate used textures using the combined index map
3097ec681f3Smrg- v3dv/pipeline: set load_layer_id to zero
3107ec681f3Smrg- v3dv: initial support for input attachments
3117ec681f3Smrg- v3dv/descriptors: support for DESCRIPTOR_TYPE_STORAGE_IMAGE
3127ec681f3Smrg- v3dv/pipeline: lower_image_deref
3137ec681f3Smrg- v3dv/uniforms: support for some QUNIFORM_IMAGE_XXX
3147ec681f3Smrg- nir: include texture query lod as one of the ops that requires a sampler
3157ec681f3Smrg- v3dv/device: expose support for image cube array
3167ec681f3Smrg- v3dv/image: fix TEXTURE_SHADER_STATE depth for cube arrays
3177ec681f3Smrg- v3dv/device: add vendorID/deviceID get helpers
3187ec681f3Smrg- v3dv/device: get proper device ID under simulator
3197ec681f3Smrg- v3dv/device: proper pipeline cache uuid
3207ec681f3Smrg- v3dv/pipeline_cache: bare basic support for pipeline cache
3217ec681f3Smrg- v3dv/pipeline_cache: cache nir shaders
3227ec681f3Smrg- v3dv/pipeline: add basic ref counting support for variants
3237ec681f3Smrg- v3dv/pipeline_cache: cache v3dv_shader_variants
3247ec681f3Smrg- v3dv/pipeline_cache: support to serialize/deserialize cached NIRs
3257ec681f3Smrg- v3dv/pipeline_cache: MergePipelineCaches implementation
3267ec681f3Smrg- v3dv/pipeline: provide a shader_sha1 to private ShaderModules
3277ec681f3Smrg- v3dv/pipeline_cache: add default pipeline cache
3287ec681f3Smrg- v3dv/pipeline: remove custom variant cache
3297ec681f3Smrg- v3dv/pipeline: when looking for a variant, check first current variant
3307ec681f3Smrg- v3dv/pipeline: pre-generate more that one shader variant
3317ec681f3Smrg- v3dv/pipeline: handle properly OUT_OF_HOST_MEMORY error when allocating p_stage
3327ec681f3Smrg- v3dv/descriptor: support for UNIFORM/STORAGE_TEXEL_BUFFER
3337ec681f3Smrg- v3dv: add v3dv_limits file
3347ec681f3Smrg- v3dv/device: fix minTexelBufferOffsetAlingment
3357ec681f3Smrg- v3dv/formats: fix exposing FEATURE_UNIFORM/STORAGE_TEXEL_BUFFER_BIT
3367ec681f3Smrg- v3dv/uniforms: handle texture size for texel buffers
3377ec681f3Smrg- v3dv/descriptor: remove v3dv_descriptor_map_get_image_view
3387ec681f3Smrg- v3dv/device: add assert for texture-related limits
3397ec681f3Smrg- v3dv/device: warn when the pipeline cache is disabled
3407ec681f3Smrg- v3dv/debug: add v3dv_print_v3d_key
3417ec681f3Smrg- v3dv/pipeline: fix combined_index_map insertions
3427ec681f3Smrg- v3dv/meta: fix hash table insertion
3437ec681f3Smrg- broadcom/compiler: allow GLSL_SAMPLER_DIM_BUF on txs emission
3447ec681f3Smrg- v3d/simulator: add v3d_simulator_get_mem_size
3457ec681f3Smrg- v3dv/device: fix compute_heap_size for the simulator
3467ec681f3Smrg- v3dv/pipeline: use derefs for ubo/ssbo
3477ec681f3Smrg- v3dv: Call nir_lower_io for push constants
3487ec681f3Smrg- v3dv/pipeline: track if texture is shadow
3497ec681f3Smrg- v3dv/pipeline: set 16bit return_size for shadows always
3507ec681f3Smrg- v3dv/cmd_buffer: set instance id to 0 at start of tile
3517ec681f3Smrg- v3d/limits: add line width and point size limits
3527ec681f3Smrg- v3dv/device: fix point-related VkPhysicalDeviceLimits
3537ec681f3Smrg- v3dv/device: enable largePoints
3547ec681f3Smrg- v3dv/meta_copy: handle mirroring z component bliting 3D images
3557ec681f3Smrg- v3dv/formats: properly return unsupported for 1D compressed textures
3567ec681f3Smrg- v3dv/meta_copy: fix TFU blitting when using 3D images
3577ec681f3Smrg- v3dv/pipeline_cache: set a max size for the pipeline cache
3587ec681f3Smrg- v3dv/pipeline_cache: extend pipeline cache envvar
3597ec681f3Smrg- v3dv/device: Support loader interface version 3.
3607ec681f3Smrg- nir/lower_io_to_scalar: update io semantics on per-component inst
3617ec681f3Smrg- docs/features: add v3dv driver
3627ec681f3Smrg- v3dv/format: use XYZ1 swizzle for three-component formats
3637ec681f3Smrg- v3d/format: use XYZ1 swizzle for three-component formats
3647ec681f3Smrg- broadcom/compiler: remove v3d_fs_key depth_enabled field.
3657ec681f3Smrg- v3dv/util: remove several logging functions
3667ec681f3Smrg- v3dv/util: log debug ignored stype only on debug builds
3677ec681f3Smrg- v3dv/device: do nothing when asked physical device pci bus properties
3687ec681f3Smrg- v3dv/cmd_buffer: missing (uint8_t \*) casting when calling memcmp
3697ec681f3Smrg
3707ec681f3SmrgAlexandros Frantzis (5):
3717ec681f3Smrg
3727ec681f3Smrg- tracie: Make tests independent of environment
3737ec681f3Smrg- tracie: Produce JUnit XML results
3747ec681f3Smrg- gitlab-ci: Enable unit test reports for normal runner traces jobs
3757ec681f3Smrg- gitlab-ci: Enable unit test reports for lava traces jobs
3767ec681f3Smrg- gitlab-ci: Enable unit test report for arm64_a630_traces
3777ec681f3Smrg
3787ec681f3SmrgAlyssa Rosenzweig (388):
3797ec681f3Smrg
3807ec681f3Smrg- panfrost: Remove blend prettyprinters
3817ec681f3Smrg- panfrost: Move format stringify to decode.c
3827ec681f3Smrg- pan/decode: Remove shader replacement artefact
3837ec681f3Smrg- panfrost: Inline panfrost-misc.h into panfrost-job.h
3847ec681f3Smrg- panfrost: Remove panfrost-misc.h
3857ec681f3Smrg- panfrost: Don't export exception_status
3867ec681f3Smrg- panfrost: Rename encoder/ to lib/
3877ec681f3Smrg- panfrost: Move pandecode into lib/
3887ec681f3Smrg- pan/mdg: Separate disassembler and compiler targets
3897ec681f3Smrg- pan/bi: Separate disasm/compiler targets
3907ec681f3Smrg- panfrost: Reduce bit dependency to disassembly only
3917ec681f3Smrg- panfrost: Add panloader/ to .gitignore
3927ec681f3Smrg- pan/bi: Drop use of MALI_POSITIVE
3937ec681f3Smrg- panfrost: Inline max rt into compilers
3947ec681f3Smrg- panfrost: Treat texture dimension as first-class
3957ec681f3Smrg- panfrost: Drop compiler cmdstream deps
3967ec681f3Smrg- nir/lower_ssbo: Don't set align\_\* for atomics
3977ec681f3Smrg- gallium/dri2: Support Arm modifiers
3987ec681f3Smrg- panfrost: Set \`initialized\` more conservatively
3997ec681f3Smrg- panfrost: Remove hint-based AFBC heuristic
4007ec681f3Smrg- panfrost: Introduce create_with_modifier helper
4017ec681f3Smrg- panfrost: Use modifier instead of layout throughout
4027ec681f3Smrg- panfrost: Account for modifiers when creating BO
4037ec681f3Smrg- panfrost: Respect modifiers in resource management
4047ec681f3Smrg- panfrost: Import staging routines from freedreno
4057ec681f3Smrg- panfrost: Choose AFBC when available
4067ec681f3Smrg- panfrost: Implement YTR availability check
4077ec681f3Smrg- panfrost: Enable YTR where allowed
4087ec681f3Smrg- panfrost: Allocate enough space for tiled formats
4097ec681f3Smrg- panfrost: Ensure AFBC slices are aligned
4107ec681f3Smrg- panfrost: Implement panfrost_query_dmabuf_modifiers
4117ec681f3Smrg- panfrost: Add stub midgard.xml
4127ec681f3Smrg- panfrost: Adopt gen_pack_header.py via v3d
4137ec681f3Smrg- panfrost: Build midgard_pack.h via meson
4147ec681f3Smrg- panfrost: Redirect cmdstream includes through GenXML
4157ec681f3Smrg- pan/decode: Add helper to dump GPU structures
4167ec681f3Smrg- panfrost: XMLify job_type
4177ec681f3Smrg- panfrost: XMLify draw_mode
4187ec681f3Smrg- panfrost: XMLify mali_func
4197ec681f3Smrg- panfrost: XMLify stencil op
4207ec681f3Smrg- panfrost: XMLify wrap modes
4217ec681f3Smrg- panfrost: XMLify viewport
4227ec681f3Smrg- panfrost: XMLify UBOs
4237ec681f3Smrg- panfrost: XMLify stencil test
4247ec681f3Smrg- panfrost: Simplify zsa == NULL case
4257ec681f3Smrg- panfrost: Simplify depth/stencil/alpha
4267ec681f3Smrg- panfrost: Don't mask coverage mask to 4-bits
4277ec681f3Smrg- panfrost: XMLify Midgard samplers
4287ec681f3Smrg- panfrost: XMLify Bifrost samplers
4297ec681f3Smrg- panfrost: XMLify Midgard textures
4307ec681f3Smrg- panfrost: XMLify Bifrost textures
4317ec681f3Smrg- panfrost: Drop unused mali_channel_swizzle
4327ec681f3Smrg- panfrost: XMLify Block Format
4337ec681f3Smrg- panfrost: XMLify MSAA writeout mode
4347ec681f3Smrg- panfrost: XMLify exception access
4357ec681f3Smrg- panfrost: XMLify enum mali_format
4367ec681f3Smrg- panfrost: Set STRIDE_4BYTE_ALIGNED_ONLY
4377ec681f3Smrg- panfrost: Drop NXR format
4387ec681f3Smrg- panfrost: Squash 22-bit format field in attr_meta
4397ec681f3Smrg- panfrost: XMLify mali_channel
4407ec681f3Smrg- panfrost: XMLify attributes
4417ec681f3Smrg- panfrost: Merge attribute packing routines
4427ec681f3Smrg- panfrost: Add XML for attribute buffers
4437ec681f3Smrg- panfrost: Use better packs for blits
4447ec681f3Smrg- panfrost: Simplify offset fixup proof
4457ec681f3Smrg- panfrost: Make attribute-buffer map explicit
4467ec681f3Smrg- panfrost: Move attr_meta emission to the draw routine
4477ec681f3Smrg- panfrost: Use packs for attributes
4487ec681f3Smrg- panfrost: Hoist instance_shift/instance_odd fetch
4497ec681f3Smrg- panfrost: Inline panfrost_vertex_instanced
4507ec681f3Smrg- panfrost: Use packs for vertex attribute buffers
4517ec681f3Smrg- panfrost: Use packs for vertex built-ins
4527ec681f3Smrg- panfrost: Reword comment
4537ec681f3Smrg- panfrost: Pass varying descriptors by reference
4547ec681f3Smrg- panfrost: Factor out general varying case
4557ec681f3Smrg- panfrost: Use pack for XFB varying
4567ec681f3Smrg- panfrost: Use pack for general varying
4577ec681f3Smrg- panfrost: Use MALI_ATTRIBUTE_LENGTH
4587ec681f3Smrg- pan/bit: Use packs for Bifrost unit tests
4597ec681f3Smrg- panfrost: Remove mali_attr_meta
4607ec681f3Smrg- panfrost: Use packs for varying buffers
4617ec681f3Smrg- panfrost: Drop hand-rolled pandecode for attribute buffers
4627ec681f3Smrg- panfrost: Drop union mali_attr
4637ec681f3Smrg- panfrost: Update CI expectations
4647ec681f3Smrg- panfrost: Decontextualize rasterizer
4657ec681f3Smrg- panfrost: Drop rasterizer null checks in draw calls
4667ec681f3Smrg- panfrost: Drop ZSA null checks in draws
4677ec681f3Smrg- panfrost: Drop panfrost_invalidate_frame
4687ec681f3Smrg- panfrost: Drop QUADS primitive convert
4697ec681f3Smrg- panfrost: Hoist add_fbo_bo call
4707ec681f3Smrg- panfrost: Remove useless comment
4717ec681f3Smrg- panfrost: Hoist assert from bind to create
4727ec681f3Smrg- panfrost: Fix WRITES_GLOBAL bit
4737ec681f3Smrg- panfrost: Fix shared memory size computation
4747ec681f3Smrg- pan/mdg: Ensure barrier op is set on texture
4757ec681f3Smrg- pan/mdg: Handle 32-bit offsets from store_shared
4767ec681f3Smrg- pan/mdg: Identify barrier out-of-order field
4777ec681f3Smrg- pan/mdg: Fix printing of r26 ld/st sources post-RA
4787ec681f3Smrg- pan/mdg: Fix auxiliary load/store swizzle packing
4797ec681f3Smrg- panfrost: Pre-allocate memory for pool
4807ec681f3Smrg- panfrost: Introduce invisible pool
4817ec681f3Smrg- panfrost: Avoid minimum stack allocations
4827ec681f3Smrg- pan/decode: Don't try to dereference heap mapping
4837ec681f3Smrg- panfrost: Share tiler_heap across batches/contexts
4847ec681f3Smrg- panfrost: Drop implicit blend pooling
4857ec681f3Smrg- panfrost: Explicitly handle nr_cbufs=0 case
4867ec681f3Smrg- panfrost: Drop depth-only case in blend finalize
4877ec681f3Smrg- panfrost: Keep finalized blend state constant
4887ec681f3Smrg- panfrost: Fix blend leak for render targets 5-8
4897ec681f3Smrg- panfrost: Free cloned NIR shader
4907ec681f3Smrg- panfrost: Free NIR of blit shaders
4917ec681f3Smrg- panfrost: Free hash_to_temp map
4927ec681f3Smrg- pan/mdg: Free previous liveness
4937ec681f3Smrg- panfrost: Use memctx for sysvals
4947ec681f3Smrg- panfrost: Free batch->dependencies
4957ec681f3Smrg- panfrost: Pass alignments explicitly
4967ec681f3Smrg- panfrost: Fix attribute buffer underallocation
4977ec681f3Smrg- panfrost: Don't overallocate attributes
4987ec681f3Smrg- panfrost: Don't reserve for NPOT w/o instancing
4997ec681f3Smrg- panfrost: Reduce attribute buffer allocations
5007ec681f3Smrg- panfrost: Fix alignment on Bifrost
5017ec681f3Smrg- gallium: Add util_blend_factor_uses_dest helper
5027ec681f3Smrg- gallium: Add util_blend_uses_dest helper
5037ec681f3Smrg- si: Use util_blend_factor_uses_dest
5047ec681f3Smrg- r300: Use util_blend_factor_uses_dest
5057ec681f3Smrg- pan/decode: Drop legacy 32-bit job support
5067ec681f3Smrg- panfrost: Decode nested structs correctly
5077ec681f3Smrg- panfrost: Hoist blend finalize calls
5087ec681f3Smrg- panfrost: Separate shader/blend descriptor emits
5097ec681f3Smrg- panfrost: XMLify blend flags
5107ec681f3Smrg- panfrost: Simplify make_fixed_blend_mode prototype
5117ec681f3Smrg- panfrost: Honour load_dest/opaque flags
5127ec681f3Smrg- panfrost: XMLify blend equation
5137ec681f3Smrg- panfrost: Combine frag_shader_meta_init functions
5147ec681f3Smrg- panfrost: Size UBO#0 accurately
5157ec681f3Smrg- panfrost: Clamp shader->uniform_count
5167ec681f3Smrg- panfrost: Bake the initial tag into the shader pointer
5177ec681f3Smrg- panfrost: Specialize compute vs frag shader init
5187ec681f3Smrg- panfrost: Rename shader emit functions
5197ec681f3Smrg- panfrost: Clean up blend shader errata handling
5207ec681f3Smrg- panfrost: Group SFBD state together
5217ec681f3Smrg- panfrost: XMLify Midgard properties
5227ec681f3Smrg- panfrost: Pack compute Midgard properties
5237ec681f3Smrg- panfrost: Use packs for fragment properties
5247ec681f3Smrg- panfrost: Use pack for shaderless
5257ec681f3Smrg- panfrost: Fold work_count packing for blend shaders
5267ec681f3Smrg- panfrost: Simplify bind_blend_state
5277ec681f3Smrg- panfrost: Remove midgard1 bitfield
5287ec681f3Smrg- panfrost: XMLify bifrost1
5297ec681f3Smrg- panfrost: Drop redundant NULL check
5307ec681f3Smrg- panfrost: Group SFBD code tighter
5317ec681f3Smrg- panfrost: XMLify Bifrost preload
5327ec681f3Smrg- panfrost: Identify additional SFBD flags
5337ec681f3Smrg- panfrost: Support SHADERLESS mode everywhere
5347ec681f3Smrg- panfrost: Quiet pandecode error
5357ec681f3Smrg- panfrost: Derive texture/sampler_count from shader
5367ec681f3Smrg- panfrost: XMLify beginning of shader descriptor
5377ec681f3Smrg- panfrost: Derive UBO count from shader_info
5387ec681f3Smrg- panfrost: Pack vertex properties when compiling
5397ec681f3Smrg- panfrost: Prepack fragment properties/preload
5407ec681f3Smrg- panfrost: Simplify shaderless packing
5417ec681f3Smrg- panfrost: Ensure shader-db state is zero-initialized
5427ec681f3Smrg- panfrost: Allocate a state uploader
5437ec681f3Smrg- panfrost: Upload shader descriptors at CSO create
5447ec681f3Smrg- panfrost: Use preuploaded shader descriptors
5457ec681f3Smrg- panfrost: XMLify the rest of shader_meta
5467ec681f3Smrg- panfrost: Inherit default values from structs
5477ec681f3Smrg- panfrost: Use pack for blit shaders
5487ec681f3Smrg- panfrost: Use pack for Bifrost test state
5497ec681f3Smrg- panfrost: Add optional opaque packs to GenXML
5507ec681f3Smrg- panfrost: Use opaque pack for vertex shaders
5517ec681f3Smrg- panfrost: Use pack for fragment shaders
5527ec681f3Smrg- pan/decode: Use unpacks for state descriptor
5537ec681f3Smrg- panfrost: Drop mali_shader_meta
5547ec681f3Smrg- panfrost: Add opaque midgard_blend XML
5557ec681f3Smrg- panfrost: Emit explicit REPLACE for disabled colour writeout
5567ec681f3Smrg- panfrost: Drop blend indirection
5577ec681f3Smrg- panfrost: Add padded type for instance fields
5587ec681f3Smrg- panfrost: Add XML for mali_vertex_tiler_postfix
5597ec681f3Smrg- panfrost: Use draw pack for blit
5607ec681f3Smrg- panfrost: Separate postfix from emits
5617ec681f3Smrg- panfrost: Inline vt_update_{rasterizer, occlusion}
5627ec681f3Smrg- panfrost: Remove postfix parameter from UBO upload
5637ec681f3Smrg- panfrost: Avoid postfix dep for vertex_data
5647ec681f3Smrg- panfrost: Don't call panfrost_vt_init for compute
5657ec681f3Smrg- panfrost: Inline panfrost_vt_init
5667ec681f3Smrg- panfrost: Inline panfrost_vt_set_draw_info
5677ec681f3Smrg- panfrost: Detangle postfix from varying emits
5687ec681f3Smrg- panfrost: Use draw pack for compute jobs
5697ec681f3Smrg- panfrost: Use pack for draw descriptor
5707ec681f3Smrg- panfrost: Simplify ZSA bind
5717ec681f3Smrg- panfrost: Cleanup point sprite linking
5727ec681f3Smrg- panfrost: Drop point sprite from shader key
5737ec681f3Smrg- panfrost: XMLify primitive information
5747ec681f3Smrg- panfrost: Add invocation XML
5757ec681f3Smrg- panfrost: XMLify invocations
5767ec681f3Smrg- panfrost: Drop bifrost_payload_fused
5777ec681f3Smrg- panfrost: Inline bifrost_tiler_only
5787ec681f3Smrg- panfrost: Use nir_builder_init_simple_shader for blits
5797ec681f3Smrg- pan/decode: Drop scratchpad size dump
5807ec681f3Smrg- pan/decode: Drop mali_vertex_tiler_postfix arg
5817ec681f3Smrg- pan/decode: Print shader-db even for compute
5827ec681f3Smrg- pan/decode: Fix awkward syntax
5837ec681f3Smrg- pan/decode: Use generation for vertex_tiler_postfix
5847ec681f3Smrg- pan/decode: Use unpack for vertex_tiler_postfix_pre
5857ec681f3Smrg- panfrost: Remove mali_vertex_tiler_postfix
5867ec681f3Smrg- pan/decode: Drop prefix braces
5877ec681f3Smrg- panfrost: Emit texture/sampler points for compute
5887ec681f3Smrg- pan/mdg: Implement i/umul_high
5897ec681f3Smrg- pan/mdg: Scalarize 64-bit
5907ec681f3Smrg- pan/mdg: Bounds check swizzle writing globals
5917ec681f3Smrg- pan/mdg: Implement nir_intrinsic_load_sample_mask_in
5927ec681f3Smrg- pan/mdg: Refactor texture op/mode handling
5937ec681f3Smrg- pan/mdg: Add disassembly for shadow gathers
5947ec681f3Smrg- pan/mdg: Implement texture gathers
5957ec681f3Smrg- panfrost: Set PIPE_CAP_MAX_TEXTURE_GATHER_COMPONENTS
5967ec681f3Smrg- docs/features: Add missing Panfrost extensions
5977ec681f3Smrg- pan/mdg: Fix discard encoding
5987ec681f3Smrg- pan/mdg: Fix perspective combination
5997ec681f3Smrg- panfrost: Drop PIPE_CAP_MAX_COMBINED_HW_ATOMIC_COUNTER/BUFFERS
6007ec681f3Smrg- mesa/st: Don't set alpha if ALPHA_TEST is lowered
6017ec681f3Smrg- pan/mdg: Obey f2fmp size restriction in fuse_io_16
6027ec681f3Smrg- panfrost: Fix nonzero stencil mask on vertex/compute
6037ec681f3Smrg- pan/bit: Set d3d=true for CMP tests
6047ec681f3Smrg- pan/bit: Fix unit tests
6057ec681f3Smrg- pan/bi: Lower flrp16
6067ec681f3Smrg- pan/bi: Add XML describing the instruction set
6077ec681f3Smrg- pan/bi: Add ISA parser
6087ec681f3Smrg- pan/bi: Add packing generator
6097ec681f3Smrg- pan/bi: Add disassembler generator
6107ec681f3Smrg- pan/bi: Add disassembly prototypes
6117ec681f3Smrg- pan/bi: Add bi_disasm_dest\_\* helpers
6127ec681f3Smrg- pan/bi: Export dump_src
6137ec681f3Smrg- pan/bi: Use new disassembler
6147ec681f3Smrg- pan/bi: Use canonical syntax for registers/uniforms/imms
6157ec681f3Smrg- pan/bi: Use canonical syntax for special constants
6167ec681f3Smrg- pan/bi: Add dummy carry/borrow argument for iadd/isub
6177ec681f3Smrg- pan/bi: Introduce segments into the IR
6187ec681f3Smrg- pan/bi: Add format field to IR
6197ec681f3Smrg- pan/bi: Track compute_lod in IR
6207ec681f3Smrg- pan/bi: Pass blend descriptor explicitly in IR
6217ec681f3Smrg- pan/bi: Use 8-bit shifts
6227ec681f3Smrg- pan/bi: Use src1/dest_invert instead of src_invert[\]
6237ec681f3Smrg- pan/bi: Move packing helpers to dedicated file
6247ec681f3Smrg- pan/bi: Use new packing
6257ec681f3Smrg- pan/bi: Remove unused prints
6267ec681f3Smrg- pan/bi: Remove unused packing data structures
6277ec681f3Smrg- pan/bi: Drop \*FMIN reference
6287ec681f3Smrg- pan/bi: Annotate stop bit (canonically "Z-bit")
6297ec681f3Smrg- pan/bi: Annotate disassemble with format names
6307ec681f3Smrg- pan/bi: Inline dump_instr
6317ec681f3Smrg- pan/bi: Track M values of disassembled constants
6327ec681f3Smrg- pan/bi: Decode M values in disasm
6337ec681f3Smrg- pan/bi: Disassemble PC-relative addresses
6347ec681f3Smrg- pan/bi: Add bifrost_reg_mode enum
6357ec681f3Smrg- pan/bi: Pass 'first' through disassembler
6367ec681f3Smrg- pan/bi: Decode all 32-bit register modes
6377ec681f3Smrg- pan/bi: Rename port -\> slot
6387ec681f3Smrg- pan/bi: Use canonical register packing
6397ec681f3Smrg- pan/bi: Remove old register mode definitions
6407ec681f3Smrg- pan/bi: Fix assert when writing vertex outputs
6417ec681f3Smrg- pan/bi: Add copy for register COMBINEs
6427ec681f3Smrg- pan/decode: Ensure mappings are zeroed
6437ec681f3Smrg- pan/bi: Fix memory corruption in scheduler
6447ec681f3Smrg- pan/bi: Drop if 0'd combine lowering
6457ec681f3Smrg- pan/bi: Cull unnecessary edges on the CF graph
6467ec681f3Smrg- pan/bi: Use canonical floating-point modes
6477ec681f3Smrg- pan/bi: Canonicalize terminate_discarded_threads
6487ec681f3Smrg- pan/bi: Use canonical next_clause_prefetch
6497ec681f3Smrg- pan/bi: Use canonical name for staging registers
6507ec681f3Smrg- pan/bi: Expand clause type to 5-bit
6517ec681f3Smrg- pan/bi: Add missing message types
6527ec681f3Smrg- pan/bi: Print message types as strings
6537ec681f3Smrg- pan/bi: Use canonical term "message type"
6547ec681f3Smrg- pan/bi: Use canonical term dependency
6557ec681f3Smrg- pan/bi: Use canonical flow control enum
6567ec681f3Smrg- pan/bi: Pass flow_control through directly
6577ec681f3Smrg- pan/bi: Handle vector moves
6587ec681f3Smrg- pan/bi: Expose GL 2.1 on Bifrost
6597ec681f3Smrg- pan/bi: Fix simple txl test
6607ec681f3Smrg- pan/bi: Use canonical texture op names in IR
6617ec681f3Smrg- pan/bi: Streamline TEXC/TEXS naming/selection
6627ec681f3Smrg- pan/bi: Encode skip bit into IR
6637ec681f3Smrg- pan/bi: Pack skip bit for texture operations
6647ec681f3Smrg- pan/bi: Add texture operator descriptor
6657ec681f3Smrg- pan/bi: Stub out TEXC handling
6667ec681f3Smrg- pan/bi: Add data register passing infrastructure
6677ec681f3Smrg- pan/bi: Handle nir_tex_src_lod
6687ec681f3Smrg- pan/bi: Pack TEXC
6697ec681f3Smrg- pan/bi: Rewrite to fit dest = src constraint
6707ec681f3Smrg- pan/bi: Prefer 'texture_index' to 'image_index'
6717ec681f3Smrg- panfrost: Add missing XML for Bifrost samplers
6727ec681f3Smrg- panfrost: Fix Bifrost filter selection
6737ec681f3Smrg- panfrost: Fix Bifrost high LOD clamp
6747ec681f3Smrg- panfrost: Add some missing Bifrost texture XML
6757ec681f3Smrg- pan/bi: Implement txb
6767ec681f3Smrg- panfrost: Set helper_invocation_enable for Bifrost
6777ec681f3Smrg- pan/bi: Fix message type printing
6787ec681f3Smrg- pan/bi: Don't terminate helper threads
6797ec681f3Smrg- panfrost: Add panfrost_block_dim helper
6807ec681f3Smrg- pan/bi: Use new block dimension helper
6817ec681f3Smrg- panfrost: Fix faults on block-based formats on Bifrost
6827ec681f3Smrg- pan/bi: Map NIR tex ops to Bifrost ops
6837ec681f3Smrg- pan/bi: Add bi_emit_lod_cube helper
6847ec681f3Smrg- pan/bi: Implement FETCH
6857ec681f3Smrg- panfrost: Update XML for Bifrost early-z/FPK
6867ec681f3Smrg- panfrost: Set "shader modifies coverage?" flag
6877ec681f3Smrg- panfrost: Temporarily disable FP16 on Bifrost
6887ec681f3Smrg- pan/bi: Disable mediump output lowering
6897ec681f3Smrg- pan/bi: Range check newc/oldc when rewriting
6907ec681f3Smrg- panfrost: Rename gtransfer to transfer
6917ec681f3Smrg- panfrost: Use canonical characterization of tls_size
6927ec681f3Smrg- panfrost: Drop panfrost_vt_emit_shared_memory
6937ec681f3Smrg- pan/mdg: Cleanup mir_rewrite_index_src_single
6947ec681f3Smrg- pan/bi: Drop 64-bit constant support
6957ec681f3Smrg- pan/bi: Fix handling of small constants in bi_lookup_constant
6967ec681f3Smrg- pan/bi: Stub spilling
6977ec681f3Smrg- pan/bi: Add no_spill flag to IR
6987ec681f3Smrg- pan/bi: Implement bi_choose_spill_node
6997ec681f3Smrg- pan/bi: Add spills/fills parameters
7007ec681f3Smrg- pan/bi: Add bi_spill helper
7017ec681f3Smrg- pan/bi: Add bi_fill
7027ec681f3Smrg- pan/bi: Add bi_rewrite_index_src_single helper
7037ec681f3Smrg- pan/bi: Add helpers for working with singletons
7047ec681f3Smrg- pan/bi: Implement bi_spill_register
7057ec681f3Smrg- pan/bi: Factor out singleton construction from scheduler
7067ec681f3Smrg- pan/bi: Add bi_foreach_clause_in_block_safe helper
7077ec681f3Smrg- pan/bi: Pack LOAD/STORE
7087ec681f3Smrg- pan/bi: Implement spilling
7097ec681f3Smrg- pan/bi: Pipe through tls_size
7107ec681f3Smrg- panfrost: Move nir_undef_to_zero to common util/
7117ec681f3Smrg- pan/bi: Use nir_undef_to_zero
7127ec681f3Smrg- panfrost: Record architecture major version
7137ec681f3Smrg- panfrost: Don't export queries
7147ec681f3Smrg- panfrost: Calculate thread count on Bifrost
7157ec681f3Smrg- panfrost: Fix component order XML
7167ec681f3Smrg- panfrost: Implement BGRA textures
7177ec681f3Smrg- panfrost: Drop PIPE_CAP_GLSL_FEATURE_LEVEL for Bifrost
7187ec681f3Smrg- panfrost: Don't advertise MSAA on Bifrost
7197ec681f3Smrg- pan/bi: Account for bool32 ld_ubo reads
7207ec681f3Smrg- panfrost: Don't double-compose swizzles
7217ec681f3Smrg- panfrost: Add MALI_EXTRACT_INDEX helper
7227ec681f3Smrg- panfrost: Use consistent swizzle names in XML
7237ec681f3Smrg- panfrost: Add a blendable format table
7247ec681f3Smrg- panfrost: Use panfrost_blendable_formats for MFBD
7257ec681f3Smrg- panfrost: Use panfrost_blendable_formats for SFBD
7267ec681f3Smrg- panfrost: Use panfrost_blendable_formats for blending
7277ec681f3Smrg- panfrost: Complete format_to_bifrost_blend
7287ec681f3Smrg- panfrost: Remove duplicated format arg for ASTC
7297ec681f3Smrg- panfrost: Remove panfrost_is_z24s8_variant
7307ec681f3Smrg- panfrost: Add v7 special colour formats
7317ec681f3Smrg- panfrost: Add missing depth/stencil formats
7327ec681f3Smrg- panfrost: Add miscellaneous missing Midgard formats
7337ec681f3Smrg- panfrost: Add v7-specific depth formats
7347ec681f3Smrg- panfrost: Split out v6/v7 format tables
7357ec681f3Smrg- panfrost: Rename VARYING_DISCARD to CONSTANT
7367ec681f3Smrg- panfrost: Rename VARYING_POS to SNAP4
7377ec681f3Smrg- panfrost: Add missing 1/2/4/64-bit formats to XML
7387ec681f3Smrg- panfrost: Use macro for panfrost_get_default_swizzle
7397ec681f3Smrg- panfrost: Fix RGB5A1 formats
7407ec681f3Smrg- panfrost: Fix BGR233 component order
7417ec681f3Smrg- panfrost: Add missing alpha-first special formats
7427ec681f3Smrg- pan/bi: Suppress disassembly for internal shaders
7437ec681f3Smrg- pan/bi: Lower +CUBEFACE2
7447ec681f3Smrg- panfrost: Disable point sprites on Bifrost
7457ec681f3Smrg- panfrost: Advertise Bifrost support
7467ec681f3Smrg- panfrost: Drop unused swizzles
7477ec681f3Smrg- panfrost: Add bi_emit_array_index helper
7487ec681f3Smrg- pan/bi: Track tex data register swizzles
7497ec681f3Smrg- pan/bi: Handle 3D/array coordinates
7507ec681f3Smrg- pan/bi: Don't emit TEXS for array textures
7517ec681f3Smrg- panfrost: Set .array_size on Bifrost
7527ec681f3Smrg- nir: Add SRC_TYPE to store_combined_output_pan
7537ec681f3Smrg- pan/mdg: Deduplicate nir_find_variable_with_driver_location
7547ec681f3Smrg- pan/mdg: Move writeout lowering to common panfrost
7557ec681f3Smrg- panfrost: Pass through src_type
7567ec681f3Smrg- panfrost: Deduplicate shader properties
7577ec681f3Smrg- pan/bi: Add +ZS_EMIT instruction to IR
7587ec681f3Smrg- pan/bi: Infer z/stencil flags from sources passed
7597ec681f3Smrg- pan/bi: Factor out bi_emit_atest
7607ec681f3Smrg- pan/bi: Factor out bi_emit_blend
7617ec681f3Smrg- pan/bi: Stub handling for nir_intrinsic_store_combined_output_pan
7627ec681f3Smrg- pan/bi: Emit +ZS_EMIT as needed
7637ec681f3Smrg- pan/bi: Lower depth/stencil stores
7647ec681f3Smrg- pan/bi: Correctly calculate render target index
7657ec681f3Smrg- pan/mdg: Add missing Collabora copyright notices
7667ec681f3Smrg- panfrost: Add missing Collabora copyright notices
7677ec681f3Smrg- pan/bi: Model writemasks correctly
7687ec681f3Smrg
7697ec681f3SmrgAndreas Baierl (4):
7707ec681f3Smrg
7717ec681f3Smrg- lima/ppir: Skip instruction merge when having more than one successor
7727ec681f3Smrg- lima: fix glCopyTexSubImage2D
7737ec681f3Smrg- lima: set clear depth value to 0x00ffffff as default
7747ec681f3Smrg- lima/parser: Fix varyings decoding in RSW
7757ec681f3Smrg
7767ec681f3SmrgAndres Gomez (3):
7777ec681f3Smrg
7787ec681f3Smrg- gitlab-ci: reuse container_post_build when building the test images
7797ec681f3Smrg- gitlab-ci: reorder container_post_build call for arm64_test image
7807ec681f3Smrg- Revert "gitlab-ci: reuse container_post_build when building the test images"
7817ec681f3Smrg
7827ec681f3SmrgAndrew Randrianasulu (1):
7837ec681f3Smrg
7847ec681f3Smrg- st/va: fix build with old libva
7857ec681f3Smrg
7867ec681f3SmrgAndrey Vostrikov (1):
7877ec681f3Smrg
7887ec681f3Smrg- egl/x11: Free memory allocated for reply structures on error
7897ec681f3Smrg
7907ec681f3SmrgAndrii Simiklit (4):
7917ec681f3Smrg
7927ec681f3Smrg- util/xmlconfig: eliminate memory leak
7937ec681f3Smrg- nir: get rid of OOB dereferences in nir_lower_io_arrays_to_elements
7947ec681f3Smrg- glx: get rid of memory leak
7957ec681f3Smrg- glsl: avoid an out-of-bound access while setting up a location for variable
7967ec681f3Smrg
7977ec681f3SmrgAnthoine Bourgeois (4):
7987ec681f3Smrg
7997ec681f3Smrg- docs/features: Minor update extensions support
8007ec681f3Smrg- docs/features: VK_KHR_mir_surface is disabled, remove it
8017ec681f3Smrg- docs/features: add some extensions we missed
8027ec681f3Smrg- docs/features.txt: VK_EXT_separate_stencil_usage not exposed on RADV
8037ec681f3Smrg
8047ec681f3SmrgAntonio Caggiano (1):
8057ec681f3Smrg
8067ec681f3Smrg- zink: pre-hash gfx-pipeline-state
8077ec681f3Smrg
8087ec681f3SmrgAnuj Phogat (2):
8097ec681f3Smrg
8107ec681f3Smrg- intel/gen9: Enable MSC RAW Hazard Avoidance
8117ec681f3Smrg- intel: Pointer to SCISSOR_RECT array should be 64B aligned
8127ec681f3Smrg
8137ec681f3SmrgAníbal Limón (1):
8147ec681f3Smrg
8157ec681f3Smrg- src/util/disk_cache_os.c: Add missing headers for open/fcntl
8167ec681f3Smrg
8177ec681f3SmrgArcady Goldmints-Orlov (7):
8187ec681f3Smrg
8197ec681f3Smrg- broadcom/compiler: support nir_intrinsic_load_sample_id
8207ec681f3Smrg- broadcom/compiler: Add a constant folding pass after nir_lower_io
8217ec681f3Smrg- broadcom/compiler: Enable PER_QUAD for UBO and SSBO loads.
8227ec681f3Smrg- broadcom/compiler: support varyings with struct types
8237ec681f3Smrg- broadcom/compiler: use nir io semantics
8247ec681f3Smrg- broadcom/compiler: Handle non-SSA destinations for tex instructions
8257ec681f3Smrg- broadcom/compiler: Allow spills of temporaries from TMU reads
8267ec681f3Smrg
8277ec681f3SmrgBas Nieuwenhuizen (58):
8287ec681f3Smrg
8297ec681f3Smrg- radv: Add ETC2 support on RAVEN2.
8307ec681f3Smrg- radv: Fix assert that is too strict.
8317ec681f3Smrg- radv: Add forcecompress debug flag.
8327ec681f3Smrg- radv: Do not consider layouts fast-clearable on compute queue.
8337ec681f3Smrg- radv: Update CI expectations for the recent descriptor indexing regressions.
8347ec681f3Smrg- radv: When importing an image, redo the layout based on the metadata.
8357ec681f3Smrg- radv:  Clean up setting the surface flags.
8367ec681f3Smrg- radv: Use getter instead of setter to extract value.
8377ec681f3Smrg- driconf: Support selection by Vulkan applicationName.
8387ec681f3Smrg- radv: Override the uniform buffer offset alignment for World War Z.
8397ec681f3Smrg- radv: Fix handling of attribs 16-31.
8407ec681f3Smrg- radv: Remove conformance warnings with ACO.
8417ec681f3Smrg- radv: Update CTS version.
8427ec681f3Smrg- radv: Fix 3d blits.
8437ec681f3Smrg- radv: Centralize enabling thread trace.
8447ec681f3Smrg- radv: Allow triggering thread traces by file.
8457ec681f3Smrg- radv: Fix threading issue with submission refcounts.
8467ec681f3Smrg- radv: Avoid deadlock on bo_list.
8477ec681f3Smrg- spirv: Deal with glslang bug not setting the decoration for stores.
8487ec681f3Smrg- spirv: Deal with glslang not setting NonUniform on constructors.
8497ec681f3Smrg- radeonsi: Work around Wasteland 2 bug.
8507ec681f3Smrg- radv,gallium: Add driconf option to reduce advertised VRAM size.
8517ec681f3Smrg- amd/common: Store non-displayable DCC pitch.
8527ec681f3Smrg- radeonsi: Put retile map in separate buffers.
8537ec681f3Smrg- radeonsi: Move display dcc dirty tracking to framebuffer emission.
8547ec681f3Smrg- ac/surface: Fix depth import on GFX6-GFX8.
8557ec681f3Smrg- radv,radeonsi: Disable compression on interop depth images
8567ec681f3Smrg- Revert "radv: set BIG_PAGE to improve performance on GFX10.3"
8577ec681f3Smrg- Revert "radv: emit {CB,DB}_RMI_L2_CACHE_CONTROL at framebuffer time"
8587ec681f3Smrg- st/mesa: Deal with empty textures/buffers in semaphore wait/signal.
8597ec681f3Smrg- radv: Disable NGG on APUs.
8607ec681f3Smrg- radv: Simplify radv_is_hw_resolve_pipeline.
8617ec681f3Smrg- radv: Add VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 rendering support.
8627ec681f3Smrg- radv: Fix emitting SQTT userdata.
8637ec681f3Smrg- radv: Use correct alignment for SQTT buffer sizes.
8647ec681f3Smrg- radv: Fix RGP Asic CU info for GFX10+.
8657ec681f3Smrg- radv: Include flushes in the barrier.
8667ec681f3Smrg- radv: Record cache flushes for RGP.
8677ec681f3Smrg- radv: Write correct dispatch size for RGP.
8687ec681f3Smrg- radeonsi: Fix imports with displayable DCC.
8697ec681f3Smrg- radv: Use atomics to read query results.
8707ec681f3Smrg- radv: Set fce metadata correctly on DCC initialization.
8717ec681f3Smrg- radv: Fix event write cmdbuffer allocation when tracing.
8727ec681f3Smrg- radv/winsys: Expand scope of allbos lock.
8737ec681f3Smrg- radv: Fix mipmap extent adjustment on GFX9+.
8747ec681f3Smrg- aco: Add VK_KHR_shader_terminate_invocation support.
8757ec681f3Smrg- amd/llvm: Add VK_KHR_shader_terminate_invocation support.
8767ec681f3Smrg- radv: Advertise VK_KHR_shader_terminate_invocation.
8777ec681f3Smrg- frontends/va: Initialize drm modifier on import.
8787ec681f3Smrg- radv: Fix 1D compressed mipmaps on GFX9.
8797ec681f3Smrg- radv: Do not access set layout during vkCmdBindDescriptorSets.
8807ec681f3Smrg- radv: Fix variable name collision.
8817ec681f3Smrg- radv: Skip tiny non-visible VRAM heap.
8827ec681f3Smrg- radv: Fix budget calculations with large BAR.
8837ec681f3Smrg- radv: Fix exporting/importing multisample images.
8847ec681f3Smrg- radv: Fix RB+ blending for VK_FORMAT_E5B9G9R9_UFLOAT_PACK32.
8857ec681f3Smrg- radv: Fix a hang on CB change by adding flushes.
8867ec681f3Smrg- radv: Deal with unused attachments in mip flush
8877ec681f3Smrg
8887ec681f3SmrgBillKristiansen (1):
8897ec681f3Smrg
8907ec681f3Smrg- compiler/glsl: Initialize local variable to zero to fix MSVC RTC error
8917ec681f3Smrg
8927ec681f3SmrgBoris Brezillon (141):
8937ec681f3Smrg
8947ec681f3Smrg- spirv: Move the emit a 'return value' store logic into own function
8957ec681f3Smrg- compiler/nir: Add new flags to lower pack/unpack split instructions
8967ec681f3Smrg- nir: Fix i64tof32 lowering
8977ec681f3Smrg- spirv: Add support for the CL Round instruction
8987ec681f3Smrg- panfrost: Rename panfrost_create_pool() into panfrost_pool_init()
8997ec681f3Smrg- panfrost: Avoid accessing pan_pool fields directly
9007ec681f3Smrg- panfrost: Store transient BOs in a dynamic array
9017ec681f3Smrg- spirv: Add a vtn_get_mem_operands() helper
9027ec681f3Smrg- spirv: Don't accept CPacked decoration on struct members
9037ec681f3Smrg- spirv: Propagate packed information to glsl_type
9047ec681f3Smrg- glsl: Propagate packed info in get_explicit_type_for_size_align()
9057ec681f3Smrg- nir/glsl: Consider block interfaces as structs when it comes to size/align calculation
9067ec681f3Smrg- nir: Expose the packed attribute attached to glsl_type objects
9077ec681f3Smrg- panfrost: gen_pack: Minor formatting improvement
9087ec681f3Smrg- panfrost: gen_pack: Fix \__gen_unpack_uint()
9097ec681f3Smrg- panfrost: gen_pack: Add pan_{unpack,print}() helpers
9107ec681f3Smrg- panfrost: gen_pack: Move the group get_length() logic to its own method
9117ec681f3Smrg- panfrost: gen_pack: Add the aggregate concept
9127ec681f3Smrg- panfrost: gen_pack: Allow empty structs
9137ec681f3Smrg- panfrost: gen_pack: Add an align() modifier
9147ec681f3Smrg- panfrost: gen_pack: Add a log2 modifier
9157ec681f3Smrg- panfrost: gen_pack: Allow enum/define values expressed in hexadecimal
9167ec681f3Smrg- panfrost: decode: Make the indentation consistent with auto-generated print helpers
9177ec681f3Smrg- panfrost: decode: Rework the DUMP_{CL,ADDR}() macros
9187ec681f3Smrg- panfrost: decode: Add a macro to dump unpacked descriptors
9197ec681f3Smrg- panfrost: decode: Use pan_{unpack,print}() when applicable
9207ec681f3Smrg- panfrost: XML-ify the local storage descriptor
9217ec681f3Smrg- panfrost: Clarify what TILED mode is
9227ec681f3Smrg- panfrost: Add Tiled linear mode to the Block Format enum
9237ec681f3Smrg- panfrost: XML-ify the midgard tiler descriptor
9247ec681f3Smrg- panfrost: XML-ify the single target framebuffer descriptor
9257ec681f3Smrg- panfrost: XML-ify the bifrost tiler descriptors
9267ec681f3Smrg- panfrost: XML-ify the multi-target framebuffer descriptors
9277ec681f3Smrg- panfrost: XML-ify the job header descriptor
9287ec681f3Smrg- panfrost: XML-ify the write value job descriptor
9297ec681f3Smrg- panfrost: XML-ify the fragment job descriptor
9307ec681f3Smrg- panfrost: Rename the Blend dither disable flag
9317ec681f3Smrg- panfrost: XML-ify the compute job descriptor
9327ec681f3Smrg- panfrost: Avoid copying job descriptors around when we can
9337ec681f3Smrg- panfrost: decode: Misc formatting improvements
9347ec681f3Smrg- panfrost: gen_pack: Fix gnu-empty-initializer errors
9357ec681f3Smrg- ci: Extend meson-clang coverage by compiling all gallium drivers
9367ec681f3Smrg- panfrost: Fix bifrost tiler descriptor definition
9377ec681f3Smrg- panfrost: Fix bifrost tiler job emission
9387ec681f3Smrg- panfrost: Adjust quirks for bifrost v6
9397ec681f3Smrg- panfrost: Add preliminary support for Mali G72
9407ec681f3Smrg- kmsro: Add mediatek entry point
9417ec681f3Smrg- panfrost: Add support for rbg16 formats
9427ec681f3Smrg- panfrost: decode: Fix decode_bifrost_constant() prototype
9437ec681f3Smrg- panfrost: decode: Flag pandecode_log_typed() as PRINTFLIKE
9447ec681f3Smrg- panfrost: bifrost: disassemble: Fix decoding of next_regs
9457ec681f3Smrg- panfrost: Fix a warning
9467ec681f3Smrg- panfrost: Adjust the draw descriptor definition
9477ec681f3Smrg- panfrost: Adjust the primitive desc definition
9487ec681f3Smrg- panfrost: Adjust the renderer state definition
9497ec681f3Smrg- panfrost: Get rid of the with_opaque qualifier on the renderer state desc
9507ec681f3Smrg- panfrost: Drop the with_opaque specifier on midgard blend desc
9517ec681f3Smrg- panfrost: gen_pack: Drop support for opaque structs
9527ec681f3Smrg- panfrost: gen_pack: Support overlapping structs
9537ec681f3Smrg- panfrost: gen_pack: Add a no-direct-packing attribute
9547ec681f3Smrg- panfrost: Rework fixed-function blending
9557ec681f3Smrg- panfrost: Rework the render target layout to use overlapping structs
9567ec681f3Smrg- panfrost: XML-ify the blend descriptors
9577ec681f3Smrg- panfrost: Fix fixed-function blend on Mali v6
9587ec681f3Smrg- panfrost: Constify the rt_fmts arg passed to pan_lower_framebuffer()
9597ec681f3Smrg- panfrost: Move the blend constant mask extraction out of make_fixed_blend_mode()
9607ec681f3Smrg- panfrost: Pass compile arguments through a struct
9617ec681f3Smrg- panfrost: Allocate blit_blend with ralloc()
9627ec681f3Smrg- panfrost: Don't leak NIR blend shaders
9637ec681f3Smrg- panfrost: Let compile_blend_shader() allocate the blend shader object
9647ec681f3Smrg- panfrost: Get rid of the constant patching done on blend shader binaries
9657ec681f3Smrg- panfrost: Move the blend shader cache at the context level
9667ec681f3Smrg- panfrost: Fix fixed-function blend on bifrost
9677ec681f3Smrg- panfrost: Extend compile_inputs to pass a blend descriptor
9687ec681f3Smrg- pan/bi: Copy blend shader info from compile_inputs
9697ec681f3Smrg- pan/bi: Use canonical name for FAU RAM sources
9707ec681f3Smrg- pan/bi: Get rid of the regs argument in bi_assign_fau_idx()
9717ec681f3Smrg- pan/bi: Rework blend descriptor access handling
9727ec681f3Smrg- pan/bi: Add support for load_blend_const_color_{r,g,b,a}_float
9737ec681f3Smrg- pan/bi: Support indirect jumps
9747ec681f3Smrg- panfrost: Add a "Bifrost Internal Blend" descriptor
9757ec681f3Smrg- panfrost: Scalarize nir_load_blend_const_color_rgba
9767ec681f3Smrg- panfrost: Flag blend shader function as an entry point
9777ec681f3Smrg- pan/bi: Add load_output support
9787ec681f3Smrg- pan/bi: Collect return addresses of blend calls
9797ec681f3Smrg- pan/bi: Special-case BLEND instruction emission for blend shaders
9807ec681f3Smrg- pan/bi: Reserve r0-r3 in blend shaders
9817ec681f3Smrg- pan/bi: Special-case load_input for blend shaders
9827ec681f3Smrg- panfrost: Add missing tile-buffer formats to the format enum
9837ec681f3Smrg- panfrost: Add blend shader support to bifrost
9847ec681f3Smrg- panfrost: Adjust the renderer state definition
9857ec681f3Smrg- panfrost: Fix tiler job injection
9867ec681f3Smrg- panfrost: Add the bifrost tiler internal state field
9877ec681f3Smrg- panfrost: Add specialized preload descriptors
9887ec681f3Smrg- panfrost: Replace unkown renderer state fields by their real names
9897ec681f3Smrg- pan/bi: Make sure we don't print special index as a register
9907ec681f3Smrg- pan/bi: Print blend descriptor source properly
9917ec681f3Smrg- pan/bi: Add support for load_sample_id
9927ec681f3Smrg- pan/bi: Support the case where TEXC needs 0 or 1 staging reg
9937ec681f3Smrg- pan/bi: Add basic support for txf_ms
9947ec681f3Smrg- panfrost: Make {midgard,bifrost}_compile_shader_nir() return a program object
9957ec681f3Smrg- panfrost: Build blit shaders on Bifrost too
9967ec681f3Smrg- panfrost: Use real name for attribute's unknown field
9977ec681f3Smrg- panfrost: Rename panfrost_transfer to panfrost_ptr
9987ec681f3Smrg- panfrost: Pass the texture payload through a panfrost_ptr
9997ec681f3Smrg- panfrost: Split panfrost_load_midg()
10007ec681f3Smrg- panfrost: Add support for native wallpapering on Bifrost
10017ec681f3Smrg- panfrost: Use native wallpapering on Bifrost
10027ec681f3Smrg- panfrost: Get rid of the non-native wallpering bits
10037ec681f3Smrg- panfrost: Preload primitive flags when gl_FrontFacing is accessed
10047ec681f3Smrg- pan/bi: Add support for load_front_face
10057ec681f3Smrg- pan/bi: Add support for load_point_coord
10067ec681f3Smrg- pan/bi: Lower {i,u}{min,max} instructions
10077ec681f3Smrg- pan/bi: Add ult support
10087ec681f3Smrg- pan/bi: Fix ms_idx type to catch missing ms_index source
10097ec681f3Smrg- panfrost: Leave push_constants pointer to NULL if there's no uniform
10107ec681f3Smrg- panfrost: Suppress Bifrost prefetching
10117ec681f3Smrg- panfrost: Add array size to XML
10127ec681f3Smrg- panfrost: Implement v7 texture payloads
10137ec681f3Smrg- pan/bi: s/t0/t1/ in bi_disasm_dest_add()
10147ec681f3Smrg- pan/bi: Move special instruction packing to a separate helper
10157ec681f3Smrg- pan/bi: Split special class in two
10167ec681f3Smrg- pan/bi: Hook up cube instructions packing
10177ec681f3Smrg- pan/bi: Lower cube map coordinates
10187ec681f3Smrg- panfrost: Force late pixel kill when depth/stencil is written from the FS
10197ec681f3Smrg- panfrost: Expose GLES3 features on Bifrost when PAN_MESA_DEBUG=deqp
10207ec681f3Smrg- pan/bi: Extract LD_VAR sample field from ins->load_vary.interp_mode
10217ec681f3Smrg- pan/bi: Support centroid and sample interpolations
10227ec681f3Smrg- pan/bi: Fix swizzle handling in bi_copy_src()
10237ec681f3Smrg- pan/bi: Add support for load_ubo
10247ec681f3Smrg- pan/bi: Lower uniforms to UBO
10257ec681f3Smrg- pan/bi: Get rid of bi_emit_ld_uniform()
10267ec681f3Smrg- pan/bi: Move bitwise op packing out of bi_pack_fma()
10277ec681f3Smrg- pan/bi: Fix ARSHIFT definitions
10287ec681f3Smrg- pan/bi: Add support for ishr
10297ec681f3Smrg- pan/bi: Add support for ushr
10307ec681f3Smrg- panfrost: Allow linear ZS resources on Bifrost
10317ec681f3Smrg- pan/bi: Add support for load_vertex_id
10327ec681f3Smrg- pan/bi: Add support for load_instance_id
10337ec681f3Smrg- panfrost: Fix Bifrost blend descriptor emission
10347ec681f3Smrg- panfrost: Fix ->reads_frag_coord assignment
10357ec681f3Smrg
10367ec681f3SmrgBoyuan Zhang (5):
10377ec681f3Smrg
10387ec681f3Smrg- vl: add flag and definition for protected playback
10397ec681f3Smrg- frontends/va: handle protected slice data buffer
10407ec681f3Smrg- radeon: add decryption params definition header
10417ec681f3Smrg- radeon/vcn: add defines for drm message buffer
10427ec681f3Smrg- radeon/vcn: program drm message buffer
10437ec681f3Smrg
10447ec681f3SmrgBrendan Dougherty (1):
10457ec681f3Smrg
10467ec681f3Smrg- mesa: Fix vertex_format_to_pipe_format index.
10477ec681f3Smrg
10487ec681f3SmrgCaio Marcelo de Oliveira Filho (19):
10497ec681f3Smrg
10507ec681f3Smrg- intel/compiler: Use C99 array initializers for prog_data/key sizes
10517ec681f3Smrg- nir: Add nir_intrinsic_terminate and nir_intrinsic_terminate_if
10527ec681f3Smrg- spirv: Update headers and metadata from latest Khronos commit
10537ec681f3Smrg- spirv: Handle SpvOpTerminateInvocation
10547ec681f3Smrg- intel/fs: Handle nir_intrinsic_terminate
10557ec681f3Smrg- vulkan: Update XML and headers to 1.2.158
10567ec681f3Smrg- anv: Advertise VK_KHR_shader_terminate_invocation
10577ec681f3Smrg- nir: Use a switch in nir_lower_explicit_io_instr
10587ec681f3Smrg- intel/fs: Don't emit_uniformize when getting a constant SSBO index
10597ec681f3Smrg- spirv: Implement SpvCapabilitySubgroupShuffleINTEL from SPV_INTEL_subgroups
10607ec681f3Smrg- nir: Add nir_intrinsic_{load,store}_deref_block_intel
10617ec681f3Smrg- spirv: Implement SpvCapabilitySubgroupBufferBlockIOINTEL
10627ec681f3Smrg- intel/fs: Add A64 OWORD BLOCK opcodes
10637ec681f3Smrg- intel/fs: Implement nir_intrinsic_{load,store}_global_block_intel
10647ec681f3Smrg- intel/fs: Add surface OWORD BLOCK opcodes
10657ec681f3Smrg- intel/fs: Implement nir_intrinsic_{load,store}_ssbo_block_intel
10667ec681f3Smrg- intel/fs: Implement nir_intrinsic_{load,store}_shared_block_intel
10677ec681f3Smrg- compiler: Add new Vulkan shader stages
10687ec681f3Smrg- spirv: Add Ray Tracing execution models
10697ec681f3Smrg
10707ec681f3SmrgCaleb Callaway (1):
10717ec681f3Smrg
10727ec681f3Smrg- iris: Add missing newline to debug log message
10737ec681f3Smrg
10747ec681f3SmrgChad Versace (2):
10757ec681f3Smrg
10767ec681f3Smrg- anv/image: Check DISJOINT in vkGetPhysicalDeviceImageFormatProperties2 (v2)
10777ec681f3Smrg- anv/image: Fix isl_surf_usage_flags for stencil images
10787ec681f3Smrg
10797ec681f3SmrgCharmaine Lee (3):
10807ec681f3Smrg
10817ec681f3Smrg- st/mesa: increase size of gl_register_file bitfields
10827ec681f3Smrg- winsys/svga: fix display corruption after surface_init
10837ec681f3Smrg- svga: fix draw elements with 8-bits indices
10847ec681f3Smrg
10857ec681f3SmrgChia-I Wu (2):
10867ec681f3Smrg
10877ec681f3Smrg- virgl: move protocol headers to a common place
10887ec681f3Smrg- virgl: update protocol headers
10897ec681f3Smrg
10907ec681f3SmrgChristian Gmeiner (17):
10917ec681f3Smrg
10927ec681f3Smrg- etnaviv: call nir_lower_bool_to_bitsize
10937ec681f3Smrg- etnaviv: completely turn off MSAA
10947ec681f3Smrg- ci: do not build libdrm for vc4, reedreno and etnaviv
10957ec681f3Smrg- etnaviv: call nir_opt_shrink_vectors(..) in opt loop
10967ec681f3Smrg- etnaviv: shuffle some variant fields
10977ec681f3Smrg- etnaviv: add disk cache
10987ec681f3Smrg- etnaviv: simplify linear stride implementation
10997ec681f3Smrg- ci: piglit: conditionally build OpenCL tests
11007ec681f3Smrg- ci/bare-metal: suppress 'No such file or directory'
11017ec681f3Smrg- etnaviv: drop etna_pipe_wait(..)
11027ec681f3Smrg- ci/x86: speed up piglit testing
11037ec681f3Smrg- nir: make tgsi_varying_semantic_to_slot(..) public
11047ec681f3Smrg- etnaviv: convert from tgsi semantic/index to varying-slot
11057ec681f3Smrg- etnaviv: move etna_dump_shader(..) to generic location
11067ec681f3Smrg- etnaviv: move etna_destroy_shader(..) to generic location
11077ec681f3Smrg- etnaviv: nir: do not run opt loop after nir_lower_bool_xxx(..)
11087ec681f3Smrg- etnaviv/drm: fix evil-twin etna_drm_table_lock
11097ec681f3Smrg
11107ec681f3SmrgConnor Abbott (59):
11117ec681f3Smrg
11127ec681f3Smrg- freedreno/afuc: Fix printing preemptleave on a5xx
11137ec681f3Smrg- freedreno/afuc: Handle setsecure opcode
11147ec681f3Smrg- freedreno/afuc: Add iret
11157ec681f3Smrg- freedreno/afuc: Handle xmov modifiers
11167ec681f3Smrg- freedreno/afuc: Make 0 a valid number
11177ec681f3Smrg- freedreno/afuc: Install asm/disasm
11187ec681f3Smrg- freedreno: Add afuc regression test
11197ec681f3Smrg- nir/spirv: Add the option to keep ViewIndex as an input
11207ec681f3Smrg- nir/lower_input_attachments: Refactor to use an options struct
11217ec681f3Smrg- nir/lower_input_attachments: Support loading layer id as an input
11227ec681f3Smrg- radv: Use an input for the layer when lowering input attachments
11237ec681f3Smrg- tu: Use an input for the layer when lowering input attachments
11247ec681f3Smrg- nir/lower_input_attachments: Support loading layer id via gl_ViewIndex
11257ec681f3Smrg- freedreno/a6xx: Add multiview registers
11267ec681f3Smrg- ir3: Add support for gl_ViewIndex in VS & FS
11277ec681f3Smrg- tu: Translate VkRenderPassMultiviewCreateInfo to VkRenderPassCreateInfo2
11287ec681f3Smrg- tu: Parse multiview render pass info
11297ec681f3Smrg- tu: Implement multiview clear/resolve interactions
11307ec681f3Smrg- tu: Improve timestamp queries
11317ec681f3Smrg- tu: Implement multiview query interactions
11327ec681f3Smrg- tu: Add multiview lowering pass
11337ec681f3Smrg- tu: Implement multiview pipeline state
11347ec681f3Smrg- tu: Enable VK_KHR_multiview
11357ec681f3Smrg- freedreno/computerator: Use a render node
11367ec681f3Smrg- tu: Expose shaderStorageImageExtendedFormats
11377ec681f3Smrg- tu: Expose shaderImageGatherExtended
11387ec681f3Smrg- ir3: Don't use the format to get the image type
11397ec681f3Smrg- tu: Expose shaderStorageImage*WithoutFormat
11407ec681f3Smrg- nir: Add nir_lower_multiview pass
11417ec681f3Smrg- anv: Use nir_lower_multiview pass
11427ec681f3Smrg- nir: Count i/o slots correctly for per-view variables
11437ec681f3Smrg- nir/lower_io_arrays: Fix xfb_offset bug
11447ec681f3Smrg- nir: Add per_view to IO semantics
11457ec681f3Smrg- nir: Handle per-view io in nir_io_add_const_offset_to_base()
11467ec681f3Smrg- tu: Write multiview control registers in binning pass
11477ec681f3Smrg- tu: Refactor shader compilation flow
11487ec681f3Smrg- ir3, tu: Run optimization loop twice
11497ec681f3Smrg- ir3, tu: Link per-view position correctly
11507ec681f3Smrg- tu: Enable multi-position output
11517ec681f3Smrg- intel/nir: Use nir control flow helpers
11527ec681f3Smrg- radv: Use nir control flow insertion helpers
11537ec681f3Smrg- ttn: Use nir control flow insertion helpers
11547ec681f3Smrg- nir/lower_returns: Use nir control flow insertion helpers
11557ec681f3Smrg- nir/opt_if: Remove open-coded nir_ssa_def_rewrite_uses()
11567ec681f3Smrg- nir/opt_if: Use early returns in opt_if_merge()
11577ec681f3Smrg- ttn: Fix number of components for IF/UIF
11587ec681f3Smrg- nir/lower_clip_cull: Store array size for FS inputs
11597ec681f3Smrg- ir3: Switch tess lowering to use location
11607ec681f3Smrg- ir3: Handle clip+cull distances
11617ec681f3Smrg- tu: Implement clip/cull distances
11627ec681f3Smrg- freedreno/a6xx: Implement user clip/cull distances
11637ec681f3Smrg- freedreno: Introduce common device info struct
11647ec681f3Smrg- tu: Use freedreno_dev_info
11657ec681f3Smrg- freedreno: Use freedreno_dev_info
11667ec681f3Smrg- freedreno/a6xx: Update SO registers for streams
11677ec681f3Smrg- ir3: Support geometry streams
11687ec681f3Smrg- util/bitset: Add a range iterator helper
11697ec681f3Smrg- tu: Support geometryStreams
11707ec681f3Smrg- tu: Support rasterizerDiscardEnable and RasterizationStreamSelect
11717ec681f3Smrg
11727ec681f3SmrgDaniel Abrecht (1):
11737ec681f3Smrg
11747ec681f3Smrg- etnaviv: Make sure to track different pipe_screens for different DRM device descriptions
11757ec681f3Smrg
11767ec681f3SmrgDaniel Schürmann (26):
11777ec681f3Smrg
11787ec681f3Smrg- aco: execute branch instructions in WQM if necessary
11797ec681f3Smrg- nir,amd: remove trinary_minmax opcodes
11807ec681f3Smrg- aco/isel: refactor code and remove unnecessary v_mov
11817ec681f3Smrg- aco/isel: refactor emit_vop3a_instruction() to handle 2 operand instructions
11827ec681f3Smrg- ac/nir: implement nir_op_[un]pack_[64/32]_\*
11837ec681f3Smrg- aco: propagate SGPRs into VOP1 instructions early.
11847ec681f3Smrg- aco: expand create_vector more carefully w.r.t. subdword operands
11857ec681f3Smrg- aco: use p_create_vector for nir_op_pack_half_2x16
11867ec681f3Smrg- nir/opt_algebraic: optimize unpack_half_2x16_split_x(ushr, a, 16)
11877ec681f3Smrg- aco: use p_split_vector for nir_op_unpack_half\_\*
11887ec681f3Smrg- aco: add validation rules for p_split_vector
11897ec681f3Smrg- aco: use v_cvt_pkrtz_f16_f32 for pack_half_2x16
11907ec681f3Smrg- radv,aco: lower_pack_half_2x16
11917ec681f3Smrg- aco: use VOP2 version of v_cvt_pkrtz_f16_f32 on GFX_6_7_10
11927ec681f3Smrg- aco: use VOP2 for v_cvt_pkrtz_f16_f32 if possible
11937ec681f3Smrg- aco: refactor GFX6_7 subdword copy lowering
11947ec681f3Smrg- aco: improve code sequences for 16bit packing
11957ec681f3Smrg- aco: use do_pack() for self-intersecting operations.
11967ec681f3Smrg- aco: fix GFX8 16-bit packing
11977ec681f3Smrg- aco: implement nir_op_unpack_[64/32]_\*
11987ec681f3Smrg- ac/nir: implement nir_op_[un]pack_64_4x16
11997ec681f3Smrg- nir: add options to lower nir_op_pack_[64/32]_\* via nir_lower_alu_to_scalar()
12007ec681f3Smrg- radv: lower pack_[64/32]_\* via nir_lower_alu_to_scalar()
12017ec681f3Smrg- radv: remove call to nir_lower_pack()
12027ec681f3Smrg- aco: refactor split_store_data() to always split into evenly sized elements
12037ec681f3Smrg- nir/lcssa: consider loops with no back-edge invariant
12047ec681f3Smrg
12057ec681f3SmrgDaniel Stone (17):
12067ec681f3Smrg
12077ec681f3Smrg- glsl/test: Don't run whitespace tests in parallel
12087ec681f3Smrg- CI: Disable Panfrost T860 and AMD Stoney tests
12097ec681f3Smrg- CI: Skip flaky CS test on VirGL
12107ec681f3Smrg- CI: Skip another flaky GS test on softpipe
12117ec681f3Smrg- CI: Disable Panfrost T720/T760 CI
12127ec681f3Smrg- meson: Add MSVC narrowing-int-to-char warnings
12137ec681f3Smrg- CI: Windows: Use separate config file for Docker
12147ec681f3Smrg- CI: Re-enable VS2019 build
12157ec681f3Smrg- CI: Disable Windows again
12167ec681f3Smrg- CI: Temporarily disable Panfrost T7xx
12177ec681f3Smrg- CI: Re-enable Panfrost T7xx
12187ec681f3Smrg- CI: Disable Panfrost T7xx CI
12197ec681f3Smrg- CI: Re-enable Panfrost T7xx CI
12207ec681f3Smrg- CI: Don't run pixmark-piano twice on radeonsi
12217ec681f3Smrg- CI: Only run OpenCL tests when we need to
12227ec681f3Smrg- CI: Disable Panfrost T760
12237ec681f3Smrg- freedreno: Add missing dependency to build
12247ec681f3Smrg
12257ec681f3SmrgDanylo Piliaiev (19):
12267ec681f3Smrg
12277ec681f3Smrg- st/mesa: Treat vertex outputs absent in outputMapping as zero in mesa_to_tgsi
12287ec681f3Smrg- anv/nir: Unify inputs_read/outputs_written between geometry stages
12297ec681f3Smrg- spirv: Only require bare types to match when copying variables
12307ec681f3Smrg- ir_constant: Return zero on out-of-bounds vector accesses
12317ec681f3Smrg- glsl: Eliminate assigments to out-of-bounds elements of vector
12327ec681f3Smrg- glsl: Eliminate out-of-bounds triop_vector_insert
12337ec681f3Smrg- intel/disasm: Change visibility of has_uip and has_jip
12347ec681f3Smrg- intel/disasm: brw_label and support functions
12357ec681f3Smrg- intel/disasm: Label support in shader disassembly for UIP/JIP
12367ec681f3Smrg- intel/assembler: Add labels support
12377ec681f3Smrg- intel/compiler: Fix pointer arithmetic when reading shader assembly
12387ec681f3Smrg- st/nir: Call st_glsl_to_nir_post_opts before interface unification
12397ec681f3Smrg- nir/lower_io: Eliminate oob writes and return zero for oob reads
12407ec681f3Smrg- nir/large_constants: Eliminate out-of-bounds writes to large constants
12417ec681f3Smrg- nir/lower_samplers: Clamp out-of-bounds access to array of samplers
12427ec681f3Smrg- intel/fs: Disable sample mask predication for scratch stores
12437ec681f3Smrg- docs: add INTEL_SHADER_ASM_READ_PATH description
12447ec681f3Smrg- nir/lower_returns: Append missing phis' sources after "break" insertion
12457ec681f3Smrg- freedreno/a6xx: Fix typo in height alignment calculation in a6xx layout
12467ec681f3Smrg
12477ec681f3SmrgDave Airlie (115):
12487ec681f3Smrg
12497ec681f3Smrg- anv: add no reloc flags on empty and simple bo paths.
12507ec681f3Smrg- CI: temp disable t720/t760 jobs.
12517ec681f3Smrg- llvmpipe: only read 0 for channels being read
12527ec681f3Smrg- gallium: add an interface for memory allocations.
12537ec681f3Smrg- gallium: add a resource flag to say no over allocation.
12547ec681f3Smrg- llvmpipe: add support for memory allocation APIs
12557ec681f3Smrg- gallivm/nir: add load push constant support
12567ec681f3Smrg- util/format: add some ZS helpers for vallium
12577ec681f3Smrg- vulkan/wsi: add sw support. (v2)
12587ec681f3Smrg- vallium: initial import of the vulkan frontend
12597ec681f3Smrg- llvmpipe/blit: for 32-bit unorm depth blits just copy 32-bit
12607ec681f3Smrg- llvmpipe: enable GL 4.5
12617ec681f3Smrg- vallium: fix input attachment lowering variable shadowing
12627ec681f3Smrg- llvmpipe/cs: update compute counters not fragment shader.
12637ec681f3Smrg- gallium/nir/tgsi: reindent some code in the nir->tgsi info (v2)
12647ec681f3Smrg- gallivm/nir: add imod support
12657ec681f3Smrg- gallivm/sample: fix lod query on array textures.
12667ec681f3Smrg- llvmpipe: lower uadd_carry/usub_borrow.
12677ec681f3Smrg- gallium/nir/tgsi: add support for compact variables
12687ec681f3Smrg- gallivm/nir: fixup compact TCS variable storage.
12697ec681f3Smrg- gallivm/nir: split tex/sampler indirect offsets
12707ec681f3Smrg- llvmpipe: lower cs local index from id
12717ec681f3Smrg- llvmpipe: lower mul 2x32_64
12727ec681f3Smrg- llvmpipe/nir: lower some more packing at nir level.
12737ec681f3Smrg- llvmpipe: add reference counting to fragment shaders.
12747ec681f3Smrg- vallium: handle 3D image views properly.
12757ec681f3Smrg- vallium: limit buffer allocations to gallium max.
12767ec681f3Smrg- gallium/nir/tgsi: fix nir->tgsi info conversion for samplers/image
12777ec681f3Smrg- gallivm/nir: lower tg4 offsets.
12787ec681f3Smrg- gallivm/nir: add indirect swizzle output loading support
12797ec681f3Smrg- gallivm/nir: add quantize to f16 support
12807ec681f3Smrg- gallivm/nir: fix const compact
12817ec681f3Smrg- gallivm/nir: lower frexp/ldexp
12827ec681f3Smrg- gallivm/nir: add subpass sampler type support
12837ec681f3Smrg- gallivm: use common code to select texel type
12847ec681f3Smrg- llvmpipe: blend has effects even if no colorbuffers.
12857ec681f3Smrg- llvmpipe: add array/3d clearing support
12867ec681f3Smrg- llvmpipe/fs: multisample depth/stencil bad ir generated
12877ec681f3Smrg- gallivm/nir: allow 64-bit arit ops
12887ec681f3Smrg- gallivm/nir: add some f16 support
12897ec681f3Smrg- vallium: disable VK_KHR_shader_float16_int8.
12907ec681f3Smrg- vulkan/device_select: don't pick a cpu driver as the default
12917ec681f3Smrg- llvmpipe: include gallivm perf flags in shader cache.
12927ec681f3Smrg- gallivm: disable brilinear for lod bias and explicit lod.
12937ec681f3Smrg- vtn: add an option to create a nir library from spirv
12947ec681f3Smrg- clover/nir: add libclc lowering pass
12957ec681f3Smrg- util: add missing extern C
12967ec681f3Smrg- clover: handle libclc shader (v3)
12977ec681f3Smrg- gallivm: fix pow(0, y) to be 0
12987ec681f3Smrg- gallivm: fix 64-bit CL intrinsics.
12997ec681f3Smrg- gallivm/nir: fix up non 32-bit load stores
13007ec681f3Smrg- gallivm/nir: handle non-32-bit mul high
13017ec681f3Smrg- llvmpipe: use an alternate env var to enable clover.
13027ec681f3Smrg- lavapipe: rename vallium to lavapipe
13037ec681f3Smrg- gallivm/nir: make sure to mask global reads.
13047ec681f3Smrg- llvmpipe/cs: add in shader shared size.
13057ec681f3Smrg- gallivm/nir: fix non-32 bit find lsb/msb
13067ec681f3Smrg- lavapipe: drop dri,dricommon deps.
13077ec681f3Smrg- ci: move to using clang 10 for meson + clover
13087ec681f3Smrg- clover: Use core libclc loader
13097ec681f3Smrg- ci: enable piglit testing of clover/llvmpipe.
13107ec681f3Smrg- clover: don't call validate spirv at all if not needed
13117ec681f3Smrg- ci: fix deqp clone + fetch
13127ec681f3Smrg- CI: build our own spirv tools
13137ec681f3Smrg- clover/nir: add a constant folding pass before lowering mem const
13147ec681f3Smrg- llvmpipe: fix sampler/image binding for clover.
13157ec681f3Smrg- gallivm: add load/store scratch support.
13167ec681f3Smrg- llvmpipe: fix 8/16 bit global stores
13177ec681f3Smrg- gallivm: fix 64->16 f2f16
13187ec681f3Smrg- gallivm: add 16-bit split/merge support.
13197ec681f3Smrg- gallivm: add b2i8/b216 support
13207ec681f3Smrg- gallivm: handle sub-32 bit masked stores.
13217ec681f3Smrg- gallivm: add support for 8/16-bit mul_hi
13227ec681f3Smrg- gallivm: get correct min/max behaviour for kernels.
13237ec681f3Smrg- gallivm: lower flrp for all sizes.
13247ec681f3Smrg- CI: remove llvmpipe cl flake test
13257ec681f3Smrg- gallivm: zero init the temporary register storage.
13267ec681f3Smrg- gallium: add a level parameter to resource parameter get
13277ec681f3Smrg- gallium: add a layer stride pipe resource parameter.
13287ec681f3Smrg- llvmpipe: add resource get param support.
13297ec681f3Smrg- lavapipe: use resource get param.
13307ec681f3Smrg- gallivm: fix f16 quantize.
13317ec681f3Smrg- lavapipe: don't write to pending clear aspects in cmd buffer
13327ec681f3Smrg- lavapipe: constify state pointers into command buffers.
13337ec681f3Smrg- lavapipe: fix dEQP-VK.info.device_properties
13347ec681f3Smrg- gallivm/nir: handle dvec3/4 inputs properly.
13357ec681f3Smrg- gallivm/nir: fix vulkan vertex inputs
13367ec681f3Smrg- lavapipe: fix 3d compressed texture copies.
13377ec681f3Smrg- lavapipe: stop crashes with 3D z blits
13387ec681f3Smrg- llvmpipe: add clear_buffer callback. (v2)
13397ec681f3Smrg- lavapipe: use clear_buffer callback
13407ec681f3Smrg- lavapipe: don't advertise linear filtering on integer textures.
13417ec681f3Smrg- gallium: add a non-multisample sample mask out behaviour flag.
13427ec681f3Smrg- llvmpipe: respect the sample mask in non-multisample flag
13437ec681f3Smrg- lavapipe: request correct sample mask behaviour
13447ec681f3Smrg- CL: update CL headers to 3.0
13457ec681f3Smrg- vtn/opencl: add ctz support
13467ec681f3Smrg- clover: access 3.0 and deprecated 2.2 API
13477ec681f3Smrg- clover/llvm: add 3.0 versioning.
13487ec681f3Smrg- clover/spirv: hook up spir-v environment for 3.0
13497ec681f3Smrg- clover: add empty cl 3.0 dispatch entries.
13507ec681f3Smrg- gallium: handle empty cbuf slots in framebuffer samples helper
13517ec681f3Smrg- u_blitter: port radv 3D blit coords logic.
13527ec681f3Smrg- lavapipe: enable alpha to one.
13537ec681f3Smrg- lavapipe: disable SNORM blending for now
13547ec681f3Smrg- llvmpipe: just use draw_regions in draw/line setup.
13557ec681f3Smrg- draw: fix tess eval pipeline statistics.
13567ec681f3Smrg- lavapipe: fixup device allocate + enable private data
13577ec681f3Smrg- lavapipe: fix wsi acquire fences
13587ec681f3Smrg- llvmpipe/setup: move point stats collection earlier.
13597ec681f3Smrg- llvmpipe: fix multisample point rendering.
13607ec681f3Smrg- llvmpipe: fix multisample lines.
13617ec681f3Smrg- lavapipe: fixup mipmap precsion bits
13627ec681f3Smrg- lavapipe: enable pipeline stats queries
13637ec681f3Smrg- gallium: fix missing bit field in p_state.h
13647ec681f3Smrg
13657ec681f3SmrgDenis Pauk (1):
13667ec681f3Smrg
13677ec681f3Smrg- mesa: bptc fixes for decompress rgba_unorm and rgb_float
13687ec681f3Smrg
13697ec681f3SmrgDuncan Hopkins (10):
13707ec681f3Smrg
13717ec681f3Smrg- meson: Add xcb-fixes to loader when using x11 and dri3. Fixes undefined symbol for xcb_xfixes_create_region in loader_dri3_helper.c
13727ec681f3Smrg- zink: clamped maxPerStageDescriptorUniformBuffers limits to INT_MAX when stored as uint32_t.
13737ec681f3Smrg- zink: Basic framework to check for optional instance layers and instance extensions.
13747ec681f3Smrg- zink: Added support for MacOS MoltenVK APIs.
13757ec681f3Smrg- zink: return fail if create_instance fails
13767ec681f3Smrg- zink: Added inbuilt debug logging from the VK_LAYER_LUNARG_standard_validation layer.
13777ec681f3Smrg- zink: add support to device info for macro guards and just VkPhysicalDevice*Features with out the have\_.
13787ec681f3Smrg- zink: have_triangle_fans support.
13797ec681f3Smrg- zink: For MoltenVk added vkFlushMappedMemoryRanges() to vkMapMemory() to fix empty mapped memory.
13807ec681f3Smrg- zink: make physical device functions use a dynamic function pointers.
13817ec681f3Smrg
13827ec681f3SmrgDylan Baker (31):
13837ec681f3Smrg
13847ec681f3Smrg- Bump development version and clear new_features
13857ec681f3Smrg- meson/freedreno: Fix lua requirement
13867ec681f3Smrg- docs: update calendar for 20.2.0-rc1
13877ec681f3Smrg- docs: update calendar for 20.2.0-rc2
13887ec681f3Smrg- meson/anv: Use variable that checks for --build-id
13897ec681f3Smrg- glsl/xxd.py: fix imports
13907ec681f3Smrg- clover/meson: use dep.get_variable instead of deprecated get_pkgconfig_variable
13917ec681f3Smrg- meson: generalize libclc usage
13927ec681f3Smrg- docs: update calendar and link releases notes for 20.2
13937ec681f3Smrg- docs/release-calender: Add 20.2 stable releases
13947ec681f3Smrg- docs: add release notes for 20.2.0
13957ec681f3Smrg- docs: Add sh256 sums for 20.2.0
13967ec681f3Smrg- docs: add release notes for 20.2.1
13977ec681f3Smrg- docs: add SHA256 sums for 20.2.1
13987ec681f3Smrg- docs: update calendar and link releases notes for 20.2.1
13997ec681f3Smrg- docs: add release notes for 20.2.2
14007ec681f3Smrg- dcs: Add sha256 sums for 20.2.2
14017ec681f3Smrg- docs: update calendar and link releases notes for 20.2.2
14027ec681f3Smrg- bump version for 20.3-rc1
14037ec681f3Smrg- .pick_status.json: Update to bf5cea7232f9ee2934c212211ebefb6fe766526d
14047ec681f3Smrg- .pick_status.json: Update to 87dc3106b077199b829a082e32ec33d0c6d400ab
14057ec681f3Smrg- .pick_status.json: Update to bac6cc586fe4c1b24351e0574d3a961eb631f6ae
14067ec681f3Smrg- bump VERSION for 20.3.0-rc2 release
14077ec681f3Smrg- .pick_status.json: Update to a59b1b18a95af1f8edb0093baf508e974e3251a2
14087ec681f3Smrg- .pick_status.json: Update to a92f597b98bb032b904c7c8a8c3a9fe798b51915
14097ec681f3Smrg- .pick_status.json: Update to 9fa1cdfe7ffd9e7ebd83055e2008f3e4b8ada549
14107ec681f3Smrg- meson: Don't add extra values to shader-cache
14117ec681f3Smrg- appveyor: disable for now
14127ec681f3Smrg- bump VERSION for 20.3-rc3
14137ec681f3Smrg- .pick_status.json: Update to 89f6b72f19dbc503386643c6283047bdb1013bef
14147ec681f3Smrg- .pick_status.json: Update to d3c67d7e7ec6b9cf10fbea0d08e92751b7b0fbae
14157ec681f3Smrg
14167ec681f3SmrgEduardo Lima Mitev (9):
14177ec681f3Smrg
14187ec681f3Smrg- st: Pass TextureTiling option from texture to memory obj
14197ec681f3Smrg- freedreno: Implement memory object create/destroy for GL_EXT_memory_object
14207ec681f3Smrg- freedreno: Refactor fd_resource_create_with_modifiers() into a helper
14217ec681f3Smrg- freedreno/layout: Move hard-coded minimum width for UBWC to a macro
14227ec681f3Smrg- freedreno: implement pipe screen's resource_from_memobj
14237ec681f3Smrg- freedreno: Implement pipe screen's get_device/driver_uuid()
14247ec681f3Smrg- freedreno: Enable GL_EXT_memory_object and GL_EXT_memory_object_fd
14257ec681f3Smrg- freedreno: Destroy syncobj too when destroying fence
14267ec681f3Smrg- turnip: Enable support for KHR_incremental_present
14277ec681f3Smrg
14287ec681f3SmrgEleni Maria Stea (3):
14297ec681f3Smrg
14307ec681f3Smrg- radeonsi: support for external buffers (ext_external_objects)
14317ec681f3Smrg- iris: handle PIPE_FD_TYPE_SYNCOBJ type
14327ec681f3Smrg- iris: add support for fence signal capability
14337ec681f3Smrg
14347ec681f3SmrgEmil Velikov (1):
14357ec681f3Smrg
14367ec681f3Smrg- radv: restrict exported symbols with static llvm
14377ec681f3Smrg
14387ec681f3SmrgEmmanuel Vadot (1):
14397ec681f3Smrg
14407ec681f3Smrg- util/os_misc: os_get_available_system_memory() for FreeBSD
14417ec681f3Smrg
14427ec681f3SmrgEric Anholt (221):
14437ec681f3Smrg
14447ec681f3Smrg- util: Split the pack/unpack functions out of the format desc.
14457ec681f3Smrg- util: Change a codegenned switch statement to a nice little table.
14467ec681f3Smrg- util: Fix up indentation in the generated format tables code.
14477ec681f3Smrg- uitl: Add R1_UNORM to the list of noaccess (no pack/unpack) formats.
14487ec681f3Smrg- util: Make all 3 fetch_rgba functions occupy the same function slot.
14497ec681f3Smrg- util: Mark the format description getter functions as const.
14507ec681f3Smrg- util: Move fetch_rgba to a separate function table.
14517ec681f3Smrg- gallium: Use unpack_rgba() instead of fetch_rgba in translate_generic
14527ec681f3Smrg- freedreno/ir3: Fix compiler warning from the setjmp fails path.
14537ec681f3Smrg- freedreno/cffdec: When .mergedregs is set, don't count half regs.
14547ec681f3Smrg- freedreno/ir3: Fix assertion failures dumping CS high full regs.
14557ec681f3Smrg- util: Expose rgba unpack/fetch functions as external functions as well.
14567ec681f3Smrg- util: Explicitly call the unpack functions from inside bptc pack/unpack.
14577ec681f3Smrg- radv: Move nir_opt_shrink_vectors() into the opt loop.
14587ec681f3Smrg- nir/opt_undef: Handle a couple more normal store intrinsics.
14597ec681f3Smrg- nir: Expand opt_undef to handle undef channels in a store intrinsic.
14607ec681f3Smrg- nir: Shrink store intrinsic num_components to the size used by the writemask.
14617ec681f3Smrg- ci/deqp-runner: Drop stale comment from deqp-runner.sh.
14627ec681f3Smrg- ci/deqp-runner: Drop unused "count" variable
14637ec681f3Smrg- ci/deqp-runner: Add a post-deqp-run filter list for known flakes.
14647ec681f3Smrg- ci/freedreno: Move our skips lists over to being known-flakes lists.
14657ec681f3Smrg- ci/freedreno: List more common flakes reported recently.
14667ec681f3Smrg- ci/bare-metal: Use a new serial buffer tool.
14677ec681f3Smrg- ci/bare-metal: Convert the main cros-servo boot code to python
14687ec681f3Smrg- ci/bare-metal: Retry booting chezas instead of failing when !POWER_GOOD
14697ec681f3Smrg- ci/bare-metal: Try rebooting chezas again if they get stuck during tftp.
14707ec681f3Smrg- nir: Make the nir_builder \*_imm helpers consistently handle bit size.
14717ec681f3Smrg- nir: Add nir_[iu]shr_imm and nir_udiv_imm helpers and use them.
14727ec681f3Smrg- nir: Add a lowering pass for backends wanting load_ubo with vec4 offsets.
14737ec681f3Smrg- freedreno/ir3: Replace our custom vec4 UBO intrinsic with the shared lowering.
14747ec681f3Smrg- nir/load_store_vectorizer: Clean up unit test swizzle assertions.
14757ec681f3Smrg- freedreno: Drop UNIFORM_BUFFER_OFFSET_ALIGNMENT to 32
14767ec681f3Smrg- ci: Mark the rest of compswap as flaky on freedreno.
14777ec681f3Smrg- freedreno/a5xx: Don't set the VARYING flag for fragcoord-only programs.
14787ec681f3Smrg- ci: Test the KHR-GL\* CTS cases with softpipe.
14797ec681f3Smrg- nir/opt_copy_prop_vars: Quiet valgrind warning about overlapping memcpy.
14807ec681f3Smrg- nir: Add a helper for general instruction-modifying passes.
14817ec681f3Smrg- nir/lower_vec_to_movs: Convert to use nir_shader_instructions_pass().
14827ec681f3Smrg- nir/opt_undef: Convert to use nir_shader_instructions_pass().
14837ec681f3Smrg- nir/lower_io_to_scalar: Convert to use nir_shader_instructions_pass().
14847ec681f3Smrg- nir/nir_lower_wrmasks: Use the nir_lower_instructions_pass() helper.
14857ec681f3Smrg- nir/lower_discard_to_demote: Use nir_shader_instructions_pass().
14867ec681f3Smrg- drm-shim: Fix unused variable warnings from asserts in release build.
14877ec681f3Smrg- panfrost: Fix OOB array access compiler warning.
14887ec681f3Smrg- panfrost: Fix remaining release-build warnings.
14897ec681f3Smrg- gallium/tests: Fix compiler warning about unused vars in trivial tests.
14907ec681f3Smrg- nvc0: Fix compiler warning about unused var that gets asserted.
14917ec681f3Smrg- vc4: Fix unused var warnings in release builds from assertions.
14927ec681f3Smrg- nv50: Fix uninitialized var warnings from using assert() as unreachable().
14937ec681f3Smrg- zink: Fix unused var warnings in release build from assertions.
14947ec681f3Smrg- etnaviv: Fix unused var warning in release build from assertions.
14957ec681f3Smrg- lima: Fix unused var/function warnings in release build from assertions.
14967ec681f3Smrg- lima: Fix uninitialized var warning from using assert() as unreachable().
14977ec681f3Smrg- virgl: Fix unused var warnings in release build from assertions.
14987ec681f3Smrg- ci: Add a release build with -Werror enabled.
14997ec681f3Smrg- nir: Fix printing of individual instructions with io semantics.
15007ec681f3Smrg- nir: Look up the shader when printing a single instruction.
15017ec681f3Smrg- ci: Make a missing device name correctly bail out of deqp-runner.sh.
15027ec681f3Smrg- turnip: Make sure we include the build id.
15037ec681f3Smrg- pipe-loader: Use real galliumvl if radeonsi is being linked.
15047ec681f3Smrg- ci: Switch to using gold as the linker.
15057ec681f3Smrg- nir: Invalidate live SSA def information when making new SSA defs.
15067ec681f3Smrg- nir: Switch the indexing of block->live_in/out arrays.
15077ec681f3Smrg- ci: Bump vulkan CTS version to 1.2.3.2, and keep the GL CTS around.
15087ec681f3Smrg- ci: Use the same VK-GL-CTS tree for GL/GLES as VK.
15097ec681f3Smrg- ci: Enable KHR-GL30 CTS testing on freedreno a630.
15107ec681f3Smrg- freedreno/a6xx: Add ARB_depth_clamp and separate clamp support.
15117ec681f3Smrg- gallivm: Report the unsupported intrinsic instead of just assert(0);
15127ec681f3Smrg- gallium/tgsi: Add support for PRIMITIVEID as a system value.
15137ec681f3Smrg- gallium/tgsi: Add some missing opcodes to tgsi_ureg.
15147ec681f3Smrg- gallium/tgsi: Add a helper for initializing ureg from a shader_info.
15157ec681f3Smrg- gallium/ureg: Set the next shader stage from the shader info.
15167ec681f3Smrg- nir: Add simplistic lowering for bany_equal/ball_inequal.
15177ec681f3Smrg- nir/opt_vectorize: Add a callback for filtering of vectorizing.
15187ec681f3Smrg- gallium/tgsi_exec: Add missing DFLR opcode support.
15197ec681f3Smrg- gallium/tgsi_exec: Fix up NumOutputs counting
15207ec681f3Smrg- ci/bare-metal: Use re.search() instead re.match() for our line matching.
15217ec681f3Smrg- ci/bare-metal: Fix detection of "POWER_GOOD not seen in time" fails
15227ec681f3Smrg- ci/bare-metal: Include a timestamp in our serial reads.
15237ec681f3Smrg- ci/bare-metal: Log why our run restarts when it does.
15247ec681f3Smrg- ci/bare-metal: Fix capturing of serial output as job artifacts.
15257ec681f3Smrg- ci/bare-metal: Use python for handling fastboot booting and parsing
15267ec681f3Smrg- nir/load_store_vectorizer: Use more imm helpers in the tests.
15277ec681f3Smrg- nir/load_store_vectorizer: Add unit tests for alignment handling.
15287ec681f3Smrg- nir: Update the comment about nir_lower_uniforms_to_ubo()'s multiplier.
15297ec681f3Smrg- nir: Add a range_base+range to nir_intrinsic_load_ubo().
15307ec681f3Smrg- freedreno/ir3: Use the new NIR UBO ranges in UBO analysis.
15317ec681f3Smrg- freedreno/ir3: Apply the max upload limit to initial range setup
15327ec681f3Smrg- nir: Use explicit deref information to provide real UBO ranges.
15337ec681f3Smrg- iris: Add missing range_base/range to our nir_load_ubos.
15347ec681f3Smrg- turnip: Fix a compiler warning in release builds of the query code.
15357ec681f3Smrg- freedreno: Make the pack struct have a .qword for wide addresses.
15367ec681f3Smrg- turnip: Fix truncation of CS shader iovas to 32 bits.
15377ec681f3Smrg- turnip: Fix truncation of iovas to 32 bits in queries.
15387ec681f3Smrg- ci/bare-metal: Update the kernel to msm-next-pgtables
15397ec681f3Smrg- ci/bare-metal: Allow wget of the kernel/dtb for kernel development.
15407ec681f3Smrg- freedreno: Add another new sysmem flake.
15417ec681f3Smrg- freedreno/cffdec: Fix up texturator parsing scripts for XML changes.
15427ec681f3Smrg- freedreno/cffdec: Add support for texturator's 2DMS layout setup.
15437ec681f3Smrg- freedreno/fdl: Add layout test for the Android CTS's MSAA mustpass surface.
15447ec681f3Smrg- turnip: Add support for a615.
15457ec681f3Smrg- turnip/kgsl: Associate fences with submits.
15467ec681f3Smrg- mesa: Make the android_stub be a set of non-installed shared libraries.
15477ec681f3Smrg- android: Disable trying to read/write to the disk cache.
15487ec681f3Smrg- gallium/drm: Deduplicate screen creation for the dynamic (clover) pipe loader.
15497ec681f3Smrg- gallium/drm: Refactor the stub screen create functions.
15507ec681f3Smrg- gallium/drm: Define the DRM entrypoints in drm_helper.h
15517ec681f3Smrg- gallium/drm: Make the pipe loader handle the driconf merging.
15527ec681f3Smrg- util/xmlconfig: Add a unit test of the code.
15537ec681f3Smrg- virgl: Clean up the driconf definition of GLES_SAMPLES_PASSED_VALUE.
15547ec681f3Smrg- driconf: Use nesting macros for defining options.
15557ec681f3Smrg- mesa: Promote Intel's simple logging façade for Android to util/
15567ec681f3Smrg- turnip: Replace tu_log*() with mesa_log*()
15577ec681f3Smrg- ci/freedreno: Sort the traces in the .yml of expectations
15587ec681f3Smrg- ci/freedreno: Add trace tests for glxgears, 0 A.D., and xonotic.
15597ec681f3Smrg- nir/lower_clip: Add i/o semantics for load/store intrinsics.
15607ec681f3Smrg- intel: Add support for i945g to intel_stub_gpu.
15617ec681f3Smrg- freedreno/ir3: Make sure we run the opt loop after lowering UBOs to vec4.
15627ec681f3Smrg- nir: Document a bit about how align_mul/offset work.
15637ec681f3Smrg- nir: Print the alignment information on casts.
15647ec681f3Smrg- nir/nir_lower_uniforms_to_ubo: Set better alignments on our new instructions.
15657ec681f3Smrg- nir/gl_nir_lower_buffers: Set up align_mul/offset on UBOs.
15667ec681f3Smrg- nir: Make the load_store_vectorizer provide align_mul + align_offset.
15677ec681f3Smrg- nir: Drop the high_offset argument to the load_store_vectorizer filter.
15687ec681f3Smrg- nir: Make nir_lower_ubo_vec4() handle non-vec4-aligned loads.
15697ec681f3Smrg- freedreno/ir3: Enable the i/o vectorizer on UBOs.
15707ec681f3Smrg- ci/bare-metal: Move the "POWER_GOOD not seen in time" check to the right time.
15717ec681f3Smrg- driconf: Eliminate the DRI_CONF_OPT_BEGIN_B macro.
15727ec681f3Smrg- driconf: Fix extra quoting on "Jimenez'".
15737ec681f3Smrg- r200: Reuse DRI_CONF_OPT_F for texture_blend_quality.
15747ec681f3Smrg- driconf: Make a DRI_CONF_OPT_S() for string options.
15757ec681f3Smrg- util/xmlconfig: Drop silly open-coded strdup.
15767ec681f3Smrg- util/xmlconfig: Indent to Mesa style.
15777ec681f3Smrg- driconf: Delete disjoint range support.
15787ec681f3Smrg- driconf: Use DRI_CONF_OPT_I for remaining int options
15797ec681f3Smrg- driconf: Make the driver's declarations be structs instead of XML.
15807ec681f3Smrg- driconf: Stop quoting true/false in boolean option definitions.
15817ec681f3Smrg- util/xmlconfig: Drop use of XML_Char in parsing.
15827ec681f3Smrg- android: Disable the user XML config parsing.
15837ec681f3Smrg- turnip: Don't expose VK_ANDROID_native_buffer on non-Android.
15847ec681f3Smrg- turnip: Use mesa's normal PRINTFLIKE macro instead of our own.
15857ec681f3Smrg- turnip: Mark the vk_errorf helper as bring printflike.
15867ec681f3Smrg- turnip: Extend the coverage of TU_DEBUG=startup.
15877ec681f3Smrg- turnip: Always enable TU_DEBUG=startup on debug drivers.
15887ec681f3Smrg- turnip: Report device loss through \_mesa_loge() instead of fprintf.
15897ec681f3Smrg- turnip/kgsl: Add strerror decode in BO init failure.
15907ec681f3Smrg- driconf: Make sure that the range check on the defaults actually works.
15917ec681f3Smrg- driconf: Restore the ability to override driconf with the environment.
15927ec681f3Smrg- ci/softpipe: Add another flaky GS test to the skips list.
15937ec681f3Smrg- freedreno/ir3: Clean up the UBO upload plan setup.
15947ec681f3Smrg- freedreno/ir3: Don't leave holes the UBO upload plan.
15957ec681f3Smrg- turnip/kgsl: Fix last minute breakage of the build.
15967ec681f3Smrg- turnip/kgsl: Add support for importing dma-bufs.
15977ec681f3Smrg- turnip: Detect Qualcomm gralloc and its UBWC flag on gralloc surfaces.
15987ec681f3Smrg- turnip: Add support for GetSwapchainGrallocUsage2ANDROID().
15997ec681f3Smrg- meson: Drop adding -Wl,--gc-sections to project c/cpp arguments.
16007ec681f3Smrg- glsl/tests: Make the tests skip on Android binary execution failures.
16017ec681f3Smrg- symbols-check: Add \__cxa_guard_\* to the list of approved symbols.
16027ec681f3Smrg- ci/android: Switch to using the Android NDK.
16037ec681f3Smrg- docs: Document how to replicate a CI build locally.
16047ec681f3Smrg- android_stub: Update platform headers to include gralloc1.h.
16057ec681f3Smrg- ci/android: Switch build to using platform SDK version 26.
16067ec681f3Smrg- util: Import a copy of drm's libsync.h
16077ec681f3Smrg- android: Add pre-4.7 Android kernel compatibility to our libsync header.
16087ec681f3Smrg- turnip: Drop a dead error checking path in device init.
16097ec681f3Smrg- turnip: Use Mesa's libsync.h instead of libdrm's libsync.h.
16107ec681f3Smrg- turnip: Don't link the WSI code if we don't have a WSI extension.
16117ec681f3Smrg- turnip: Only link libdrm in the DRM case, not KGSL.
16127ec681f3Smrg- ci: Enable NIR_VALIDATE everywhere.
16137ec681f3Smrg- nir: Introduce nir_metadata_instr_index for nir_index_instr() being current.
16147ec681f3Smrg- nir: Replace nir_ssa_def->live_index with nir_instr->index.
16157ec681f3Smrg- nir: Add a block start/end ip to live instr index metadata.
16167ec681f3Smrg- nir: Add a call to get a struct describing SSA liveness per instruction.
16177ec681f3Smrg- nir: Add an option to not lower source mods for f64/u64/i64.
16187ec681f3Smrg- gallium: Add a nir-to-TGSI pass.
16197ec681f3Smrg- softpipe: Fix buffer overflows in SSBO atomics.
16207ec681f3Smrg- softpipe: Switch to using NIR as the shader format from mesa/st.
16217ec681f3Smrg- meson: Only require libexpat when a part of the build needs it.
16227ec681f3Smrg- freedreno: Use Android's libsync instead of libdrm's.
16237ec681f3Smrg- meson: Don't try to build GLX by default on Android.
16247ec681f3Smrg- meson: Don't enable libunwind by in 'auto' mode on Android.
16257ec681f3Smrg- docs: Document how to build and install Android drivers.
16267ec681f3Smrg- freedreno/cffdec: Fix format overflow warning.
16277ec681f3Smrg- freedreno/tools: Fix compiler warnings about using sz in the error paths.
16287ec681f3Smrg- freedreno/fdperf: Silence a compiler warning about current counter.
16297ec681f3Smrg- turnip: Handle some error paths in allocating CS space from a command buffer.
16307ec681f3Smrg- turnip: Handle the error path for tu/drm's vkResetFences().
16317ec681f3Smrg- turnip: Add error path handling for descriptor pool init.
16327ec681f3Smrg- ci: Enable Werror on meson-arm64-build-test.
16337ec681f3Smrg- gallium/ntt: Add default compiler options for non-native-NIR drivers.
16347ec681f3Smrg- st/mesa: Drop the TGSI paths for PBOs and use nir-to-tgsi if needed.
16357ec681f3Smrg- st/mesa: Drop the TGSI paths for drawpixels and use nir-to-tgsi if needed.
16367ec681f3Smrg- nir: Only validate in passes that might have changed things.
16377ec681f3Smrg- docs: Move the gallium driver documentation to the top level.
16387ec681f3Smrg- docs/vmware: Move the vmware driver docs into the drivers section.
16397ec681f3Smrg- docs/vc4: Move my old vc4 wiki's documentation into docs.mesa3d.org.
16407ec681f3Smrg- docs/vc4: Add information on the hw documentation available.
16417ec681f3Smrg- docs/v3d: Add a little stub of v3d documentation.
16427ec681f3Smrg- docs: Drop extra link to old DRI wiki in the "Help" section.
16437ec681f3Smrg- docs: Add a link to the linux kernel DRM docs under "Developer Topics"
16447ec681f3Smrg- docs: Fix "Hosted by" link and drop duplicate.
16457ec681f3Smrg- ci: Add the new timeout-prone softpipe-gl test to the skips list.
16467ec681f3Smrg- mesa/st: Fix a use-after-free of the NIR shader stage.
16477ec681f3Smrg- st/nir: Fix the st->pbo.use_gs case.
16487ec681f3Smrg- st/nir: Drop setting interp mode on system values in builtins.
16497ec681f3Smrg- tu: Make sure spirv_to_nir knows we support imageStorageWithoutFormat.
16507ec681f3Smrg- turnip: Fix image size for 3D vkGetImageSubresourceLayout.
16517ec681f3Smrg- ci/bare-metal: Apply autopep8 to the bare-metal scripts.
16527ec681f3Smrg- ci/bare-metal: Reset colors at the end of a line of serial output.
16537ec681f3Smrg- ci/deqp: Switch to a new dEQP runner written in Rust.
16547ec681f3Smrg- util/set: Fix the \_mesa_set_clear function to not leave tombstones.
16557ec681f3Smrg- ci: Only install kernel modules for LAVA devices.
16567ec681f3Smrg- gallium/draw: Fix rasterizer_discard for wide points/lines.
16577ec681f3Smrg- freedreno: Fix leak of shader binary on disk cache hits.
16587ec681f3Smrg- freedreno: Fix warning about uninit size for the size==0 special case.
16597ec681f3Smrg- gallium: Fix leak of the merged driconf options.
16607ec681f3Smrg- freedreno: Fix leak of u_transfer_helper.
16617ec681f3Smrg- gallium: Fix leak of bound SSBOs at CSO context destruction.
16627ec681f3Smrg- gallivm: Fix max const buffer count.
16637ec681f3Smrg- gallium: Fix leak of currently bound UBOs at CSO context destruction.
16647ec681f3Smrg- freedreno: Break out of "should we free the entry" loop once we've freed.
16657ec681f3Smrg
16667ec681f3SmrgEric Engestrom (94):
16677ec681f3Smrg
16687ec681f3Smrg- pick-ui: specify git commands in "resolve cherry pick" message
16697ec681f3Smrg- egl/entrypoint-check: split sort-check into a function
16707ec681f3Smrg- egl/entrypoint-check: add check that GLVND and plain EGL have the same entrypoints
16717ec681f3Smrg- driconf: fix force_gl_vendor description
16727ec681f3Smrg- meson: bump required glvnd version
16737ec681f3Smrg- egl: replace \_EGLDriver param with \_EGLDisplay->Driver in \_eglReleaseDisplayResources()
16747ec681f3Smrg- egl: replace \_EGLDriver param with \_EGLDisplay->Driver in dri{2_x11,3}_create_window_surface()
16757ec681f3Smrg- egl: replace \_EGLDriver with \_EGLDisplay->Driver in \_eglQuerySurface()
16767ec681f3Smrg- egl: drop unused \_EGLDriver from Initialize()
16777ec681f3Smrg- egl: drop unused \_EGLDriver from Terminate()
16787ec681f3Smrg- egl: drop unused \_EGLDriver from {Create,Destroy}Context()
16797ec681f3Smrg- egl: drop unused \_EGLDriver from Create{Window,Pixmap,Pbuffer}Surface() & DestroySurface()
16807ec681f3Smrg- egl: drop unused \_EGLDriver from MakeCurrent()
16817ec681f3Smrg- egl: drop unused \_EGLDriver from QuerySurface()
16827ec681f3Smrg- egl: drop unused \_EGLDriver from {Bind,Release}TexImage()
16837ec681f3Smrg- egl: drop unused \_EGLDriver from SwapInterval()
16847ec681f3Smrg- egl: drop unused \_EGLDriver from SwapBuffers{,WithDamageEXT,RegionNOK}()
16857ec681f3Smrg- egl: drop unused \_EGLDriver from CopyBuffers()
16867ec681f3Smrg- egl: drop unused \_EGLDriver from SetDamageRegion()
16877ec681f3Smrg- egl: drop unused \_EGLDriver from WaitClient()
16887ec681f3Smrg- egl: drop unused \_EGLDriver & \_EGLDisplay from WaitNative()
16897ec681f3Smrg- egl: drop unused \_EGLDriver from GetProcAddress()
16907ec681f3Smrg- egl: drop unused \_EGLDriver from {Create,Destroy}ImageKHR()
16917ec681f3Smrg- egl: drop unused \_EGLDriver from {Create,Destroy,ClientWait,Wait,Signal}SyncKHR()
16927ec681f3Smrg- egl: drop unused \_EGLDriver from DupNativeFenceFDANDROID()
16937ec681f3Smrg- egl: drop unused \_EGLDriver from {Create,Export}DRMImageMESA()
16947ec681f3Smrg- egl: drop unused \_EGLDriver from {Bind,Unbind,Query}WaylandDisplayWL()
16957ec681f3Smrg- egl: drop unused \_EGLDriver from CreateWaylandBufferFromImageWL()
16967ec681f3Smrg- egl: drop unused \_EGLDriver from PostSubBufferNV()
16977ec681f3Smrg- egl: drop unused \_EGLDriver from QueryBufferAge()
16987ec681f3Smrg- egl: drop unused \_EGLDriver from ExportDMABUFImage{,Query}MESA()
16997ec681f3Smrg- egl: drop unused \_EGLDriver from QueryDmaBuf{Formats,Modifiers}EXT()
17007ec681f3Smrg- egl: drop unused \_EGLDriver from SetBlobCacheFuncsANDROID()
17017ec681f3Smrg- egl: drop unused \_EGLDriver from \_eglGetConfigs()/_eglChooseConfig()/_eglGetConfigAttrib()
17027ec681f3Smrg- egl: drop unused \_EGLDisplay from \_eglSetDamageRegionKHRClampRects()
17037ec681f3Smrg- egl: drop unused \_EGLDriver & \_EGLDisplay from \_eglQueryContext()
17047ec681f3Smrg- egl: drop unused \_EGLDriver from \_eglSurfaceAttrib()
17057ec681f3Smrg- egl: replace \_EGLDriver with \_EGLDisplay->Driver in \_eglGetSyncAttrib()
17067ec681f3Smrg- egl: replace replace \_EGLDriver with \_EGLDisplay->Driver in eglapi.c
17077ec681f3Smrg- egl: drop unused \_EGLDriver from MesaGLInteropEGL{QueryDeviceInfo,ExportObject}()
17087ec681f3Smrg- egl: replace \`&_eglDriver\`/\`NULL\` tested against \`NULL\` with simple \`true\`/\`false\`
17097ec681f3Smrg- egl: drop unused ${drv}_driver()
17107ec681f3Smrg- egl: inline \_eglGetDriverProc() into eglGetProcAddress()
17117ec681f3Smrg- egl: inline \_eglInitializeDisplay() into eglInitialize()
17127ec681f3Smrg- egl: drop now empty egldriver.c
17137ec681f3Smrg- egl: drop unused egldriver.h header
17147ec681f3Smrg- meson: fix trivial s/dir/dri/ typo
17157ec681f3Smrg- egl/x11_dri3: enable & require xfixes 2.0
17167ec681f3Smrg- egl/x11_dri3: implement EGL_KHR_swap_buffers_with_damage
17177ec681f3Smrg- docs: add release notes for 20.1.6
17187ec681f3Smrg- docs: update calendar and link releases notes for 20.1.6
17197ec681f3Smrg- gitlab-ci: fix testing whether a variable with a given name is set or not
17207ec681f3Smrg- gitlab-ci: fix quoting of variables passed down to bare-metal runners
17217ec681f3Smrg- egl: drop an indentation level in \_eglFindDisplay() by replacing break/if with a goto
17227ec681f3Smrg- egl: drop another indentation level in \_eglFindDisplay() by inverting an if
17237ec681f3Smrg- egl: drop invalid shebang
17247ec681f3Smrg- scons: bump c++ standard to 14 to match meson
17257ec681f3Smrg- docs/egl: fix typo
17267ec681f3Smrg- docs/egl: move section around
17277ec681f3Smrg- docs/egl: complete list of dri2 platforms
17287ec681f3Smrg- docs/egl: add haiku driver
17297ec681f3Smrg- docs/egl: add some more documentation
17307ec681f3Smrg- docs/egl: correct/update DRI2 mention with the shiny new DRI3
17317ec681f3Smrg- egl: move extension driver functions after core functions
17327ec681f3Smrg- egl: document which driver hooks are only required by extensions
17337ec681f3Smrg- egl: inline eglSwapInterval() fallback
17347ec681f3Smrg- egl: simplify eglSwapInterval() fallback logic
17357ec681f3Smrg- meson: don't advertise TLS support if glx wasn't build with it
17367ec681f3Smrg- egl/android: simplify dri2_initialize_android()
17377ec681f3Smrg- egl/surfaceless: simplify dri2_initialize_surfaceless()
17387ec681f3Smrg- egl/wayland: simplify dri2_initialize_wayland()
17397ec681f3Smrg- egl/x11: simplify dri2_initialize_x11()
17407ec681f3Smrg- docs: add release notes for 20.1.7
17417ec681f3Smrg- docs: update calendar and link releases notes for 20.1.7
17427ec681f3Smrg- docs: shift 20.2 rc dates by two weeks to match reality
17437ec681f3Smrg- meson: drop leftover PTHREAD_SETAFFINITY_IN_NP_HEADER
17447ec681f3Smrg- docs/download: mention tarball GPG signatures and link to the keys
17457ec681f3Smrg- docs: add another 20.1.x release to allow for more overlap with 20.2
17467ec681f3Smrg- docs/release-calendar: update 20.2
17477ec681f3Smrg- docs: add release notes for 20.1.8
17487ec681f3Smrg- docs: update calendar and link releases notes for 20.1.8
17497ec681f3Smrg- bin/gen_release_notes.py: escape special rST characters
17507ec681f3Smrg- docs: add release notes for 20.1.9
17517ec681f3Smrg- docs: update calendar and link releases notes for 20.1.9
17527ec681f3Smrg- add one last 20.1 release to coincide with expected 20.2.1
17537ec681f3Smrg- radv: add missing u_atomic.h include
17547ec681f3Smrg- docs: fix relnotes index
17557ec681f3Smrg- docs: fix release calendar
17567ec681f3Smrg- docs: fix 20.2.0 relnotes
17577ec681f3Smrg- docs: add release notes for 20.1.10
17587ec681f3Smrg- docs: update calendar and link releases notes for 20.1.10
17597ec681f3Smrg- docs/release-calendar: plan 20.3 release
17607ec681f3Smrg- gitlab-ci: drop deprecated platforms that snuck in when nobody was watching
17617ec681f3Smrg- meson: drop deprecated EGL platform build options
17627ec681f3Smrg
17637ec681f3SmrgErico Nunes (4):
17647ec681f3Smrg
17657ec681f3Smrg- lima: dont split vec3 unaligned load inputs
17667ec681f3Smrg- lima: allocate new bo for stream draw
17677ec681f3Smrg- lima: fix vertex shader uniform buffer size
17687ec681f3Smrg- lima: add natively supported vertex buffer formats
17697ec681f3Smrg
17707ec681f3SmrgErik Faye-Lund (123):
17717ec681f3Smrg
17727ec681f3Smrg- st/wgl: do not reject PFD_SUPPORT_GDI
17737ec681f3Smrg- gallium/util: factor out primitive-restart rewriting logic
17747ec681f3Smrg- gallium/indices: don't expand prim-type for 8-bit indices
17757ec681f3Smrg- gallium/indices: generalize primitive-restart logic
17767ec681f3Smrg- gallium/indices: implement prim-restart for line-loops
17777ec681f3Smrg- gallium/indices: use prim_restart-helper for polygon
17787ec681f3Smrg- gallium/indices: implement prim-restart for triangle fans
17797ec681f3Smrg- gallium/indices: introduce u_primconvert_config
17807ec681f3Smrg- gallium/indices: translate primitive-restart values
17817ec681f3Smrg- compiler/nir: make lowering global-id to local-id optional
17827ec681f3Smrg- nir: add iabs-lowering code
17837ec681f3Smrg- gallium/util: use uint sampler for stencil-reads
17847ec681f3Smrg- nir: fix const-cast warning on MSVC
17857ec681f3Smrg- v3d: remove unused header
17867ec681f3Smrg- vc4: remove unused header
17877ec681f3Smrg- gallium/aux: remove unused u_blit.[ch\]
17887ec681f3Smrg- gallium/util: add shader for stencil-blits
17897ec681f3Smrg- gallium/util: add blitter-support for stencil-fallback
17907ec681f3Smrg- mesa: handle GL_FRONT after translating to it
17917ec681f3Smrg- zink: correct typo in stencil-setup
17927ec681f3Smrg- zink: store base-object of DSA-state
17937ec681f3Smrg- zink: only set stencil-ref for back if two-sided
17947ec681f3Smrg- docs: escape backquote character
17957ec681f3Smrg- docs: show 'Edit on GitLab'-link
17967ec681f3Smrg- docs: store prefixes in redirects
17977ec681f3Smrg- docs: remove webmaster article
17987ec681f3Smrg- docs: everytime -\> every time
17997ec681f3Smrg- docs: apis -\> APIs
18007ec681f3Smrg- docs: scons -\> SCons
18017ec681f3Smrg- docs: frambuffer -\> framebuffer
18027ec681f3Smrg- docs: make two acronyms upper-case
18037ec681f3Smrg- docs: unecessarily -\> unnecessarily
18047ec681f3Smrg- docs: behaviour -\> behavior
18057ec681f3Smrg- docs: timeplan -\> time plan
18067ec681f3Smrg- docs: initialisation -\> initialization
18077ec681f3Smrg- docs: gitlab -\> GitLab
18087ec681f3Smrg- docs: url -\> URL
18097ec681f3Smrg- docs: recognisable -\> recognizable
18107ec681f3Smrg- docs: drop outdated gallium-docs comment
18117ec681f3Smrg- docs: clippping -\> clipping
18127ec681f3Smrg- docs: consistantly -\> consistently
18137ec681f3Smrg- docs: stabilisation -\> stabilization
18147ec681f3Smrg- docs: flavours -\> flavors
18157ec681f3Smrg- docs: debian -\> Debian
18167ec681f3Smrg- docs: docker -\> Docker
18177ec681f3Smrg- docs: gallium -\> Gallium
18187ec681f3Smrg- st/mesa: use roundf instead of floorf for lod-bias rounding
18197ec681f3Smrg- gallium/util: set right dst-dimensions
18207ec681f3Smrg- gallium/util: fix texture-coordinates for stencil-fallback
18217ec681f3Smrg- gallium/util: allow scaling blits for stencil-fallback
18227ec681f3Smrg- docs: softwara -\> software
18237ec681f3Smrg- docs: existant -\> existent
18247ec681f3Smrg- docs: webservice -\> web service
18257ec681f3Smrg- docs: bpp -\> BPP
18267ec681f3Smrg- docs: llvm -\> LLVM
18277ec681f3Smrg- docs: correct reference to meson.build
18287ec681f3Smrg- docs: meson -\> Meson
18297ec681f3Smrg- docs: python3 -\> Python 3
18307ec681f3Smrg- docs: flex -\> Flex
18317ec681f3Smrg- docs: bison -\> Bison
18327ec681f3Smrg- docs: mako -\> Mako
18337ec681f3Smrg- docs: chocolatey -\> Chocolatey
18347ec681f3Smrg- docs: ninja -\> Ninja
18357ec681f3Smrg- docs: mingw -\> MinGW
18367ec681f3Smrg- docs: microsoft -\> Microsoft
18377ec681f3Smrg- docs: linux -\> Linux
18387ec681f3Smrg- docs: windows -\> Windows
18397ec681f3Smrg- docs: visual studio -\> Visual Studio
18407ec681f3Smrg- docs: gpu -\> GPU
18417ec681f3Smrg- docs: cmake -\> CMake
18427ec681f3Smrg- docs: x11 -\> X11
18437ec681f3Smrg- docs: wayland -\> Wayland
18447ec681f3Smrg- docs: drm -\> DRM
18457ec681f3Smrg- docs: android -\> Android
18467ec681f3Smrg- docs: git -\> Git
18477ec681f3Smrg- docs: quote "git log"
18487ec681f3Smrg- docs: scons -\> SCons
18497ec681f3Smrg- docs: ubuntu -\> Ubuntu
18507ec681f3Smrg- docs: vmware -\> VMWare
18517ec681f3Smrg- docs: Sandybridge -\> Sandy Bridge
18527ec681f3Smrg- docs: cpu -\> CPU
18537ec681f3Smrg- gallium/util: fix memory-leak
18547ec681f3Smrg- gallium/util: allow scissored blits for stencil-fallback
18557ec681f3Smrg- zink: use nir_lower_ubo_vec4 to simplify things a bit
18567ec681f3Smrg- zink: support non-const offsets
18577ec681f3Smrg- zink: support loading any UBO
18587ec681f3Smrg- zink: do not report SSBOs as halfway supported
18597ec681f3Smrg- zink: add feature-documentation
18607ec681f3Smrg- zink: reject resource-imports with modifiers
18617ec681f3Smrg- v3d: do not report alpha-test as supported
18627ec681f3Smrg- vc4: do not report alpha-test as supported
18637ec681f3Smrg- nir: drop support for using load_alpha_ref_float
18647ec681f3Smrg- nir: drop unused alpha_ref_float
18657ec681f3Smrg- docs: create leading directories for redirects
18667ec681f3Smrg- docs: verify that targets for relative redirects exist
18677ec681f3Smrg- docs: specify redirects relative to docs-root
18687ec681f3Smrg- docs: specify redirects in conf.py instead
18697ec681f3Smrg- zink: verify geometry shader feature
18707ec681f3Smrg- docs: do not document required minimum
18717ec681f3Smrg- docs: document zink's gl \\> 3.0 requirements
18727ec681f3Smrg- mapi: remove unused function
18737ec681f3Smrg- mapi: do not call thread-unsafe dispatch getter
18747ec681f3Smrg- mapi: do not return thread-specific data for wrong thread
18757ec681f3Smrg- docs: add link to extension spec
18767ec681f3Smrg- docs: ie. -\> i.e.
18777ec681f3Smrg- docs: eg. -\> e.g.
18787ec681f3Smrg- docs: anistropy -\> anisotropy
18797ec681f3Smrg- docs: api -\> API
18807ec681f3Smrg- docs: hud -\> HUD
18817ec681f3Smrg- docs: fbo -\> FBO
18827ec681f3Smrg- docs: gcc -\> GCC
18837ec681f3Smrg- docs: clang -\> Clang
18847ec681f3Smrg- docs: s3tc -\> S3TC
18857ec681f3Smrg- spirv: correct sematic-typo
18867ec681f3Smrg- libgl-gdi: support building without softpipe
18877ec681f3Smrg- gallium/util: do not pass undefined sample-count
18887ec681f3Smrg- softpipe: correct signature of get_compiler_options
18897ec681f3Smrg- mesa/main: add missing include in glformats.h
18907ec681f3Smrg- zink: more accurately track supported blits
18917ec681f3Smrg- zink: fix layered resolves
18927ec681f3Smrg- zink: fall back to util_blitter for scaled resolves
18937ec681f3Smrg- docs: document new zink-flag
18947ec681f3Smrg- zink: do not require VK_KHR_external_memory
18957ec681f3Smrg
18967ec681f3SmrgFelix Yan (1):
18977ec681f3Smrg
18987ec681f3Smrg- Correct a typo in threads_win32.h
18997ec681f3Smrg
19007ec681f3SmrgGert Wollny (81):
19017ec681f3Smrg
19027ec681f3Smrg- gallium + mesa/st: Add PIPE_CAP_NIR_ATOMICS_AS_DEREF and use it
19037ec681f3Smrg- r600: Set PIPE_CAP_NIR_ATOMICS_AS_DEREF to true
19047ec681f3Smrg- r600/sfn: Sort uniforms by binding and offset
19057ec681f3Smrg- r600/sfn: add r600 specific lowering pass for atomics and use it
19067ec681f3Smrg- r600/sfn: Add a mapping table for atomics
19077ec681f3Smrg- r600/sfn: correct allocating and emitting of atomics
19087ec681f3Smrg- r600/sfn: Correct ssbo instruction handling
19097ec681f3Smrg- r600/sfn: handle querying SSBO size
19107ec681f3Smrg- r600/sfn: Force a minimum of 4 GPRs, it seems to fix atomics
19117ec681f3Smrg- r600: Enable compute shaders for NIR code path
19127ec681f3Smrg- compiler/nir: rewrite lower_fragcoord_wtrans to use nir_lower_instructions
19137ec681f3Smrg- compiler/nir: extend lower_fragcoord_wtrans to support VARYING_SLOT_POS
19147ec681f3Smrg- gallium/aux: reorder vertex attributes in triangle fans according to PV
19157ec681f3Smrg- meson: Make some warnings handled as errors with MSVC
19167ec681f3Smrg- r600: revert disabling llvm draw
19177ec681f3Smrg- r600/nir: fetch sources and split uniforms before emittting alu instructions
19187ec681f3Smrg- r600/sfn: correct ring op patching
19197ec681f3Smrg- r600/sfn: Fix loading vertex attributes
19207ec681f3Smrg- r600/sfn: clone shader before lowering to registers and src/dest modifiers
19217ec681f3Smrg- r600/sfn: Fix ordering of tex param moves
19227ec681f3Smrg- r600/sfn: avoid some copies
19237ec681f3Smrg- r600/sfn: Lower \*sign opcodes in nir
19247ec681f3Smrg- r600/sfn: Fix split_alu_modifiers
19257ec681f3Smrg- r600/sfn: Fix bitfield ops and 2x16 split_y
19267ec681f3Smrg- r600/sfn: Fix source swizzle for gradient queries
19277ec681f3Smrg- r600/sfn: more fixing of vec4 fetching
19287ec681f3Smrg- r600/sfn: Fix comparison with different signedness
19297ec681f3Smrg- nir: Add option lower_uniforms_to_ubo
19307ec681f3Smrg- radeonsi: set compiler flag lower_uniforms_to_ubo
19317ec681f3Smrg- freedreno/ir3: set lower_uniforms_to_ubo compiler flag
19327ec681f3Smrg- intel/compiler: Set lower_uniform_to_ubo compiler flag
19337ec681f3Smrg- llvmpipe: set lower_uniform_to_ubo compiler flag
19347ec681f3Smrg- gallium+mesa/st: lower uniforms based on compiler flag instead of packed uniforms cap
19357ec681f3Smrg- r600: enable lowering uniforms to UBO
19367ec681f3Smrg- r600/sfn: Use load_ubo_vec4 lowering pass
19377ec681f3Smrg- nir: remove ubo_r600 instrinsic since ubo_vec4 is used now
19387ec681f3Smrg- r600/sfn: make number of source components a local variable
19397ec681f3Smrg- r600/sfn: Fix component count for fdph
19407ec681f3Smrg- r600/sfn: Fix typo in comment
19417ec681f3Smrg- r600/sfn: use cnde instead of cnde_int
19427ec681f3Smrg- r600/sfn: run late algebraic optimizations
19437ec681f3Smrg- r600/sfn: remove a useless if-condition
19447ec681f3Smrg- r600: Add flag for dual-source blending to shader key
19457ec681f3Smrg- r600/sfn: Sort the outputs of the FS according to data index
19467ec681f3Smrg- r600/sfn: Keep info about dual-source blend in FS
19477ec681f3Smrg- r600/sfn: Handle number of color outputs taking dual source blending into account
19487ec681f3Smrg- r600/sfn: Take dual source blending output indices into account
19497ec681f3Smrg- r600/sfn: Acquire the number of FS outputs and the write_all info early
19507ec681f3Smrg- r600/sfn: Be a bit more verbose when logging skipped FS outputs
19517ec681f3Smrg- r600/sfn: Fix emitting shared atomics with constant sources
19527ec681f3Smrg- r600/sfn: Handle nir_op_b2b32
19537ec681f3Smrg- r600/sfn: lower to scalar for some optimizations and vectortize later
19547ec681f3Smrg- r600/sfn: Support group memory barrier
19557ec681f3Smrg- r600/sfn: save some instructions when doing multisample on sample 0
19567ec681f3Smrg- r600/sfn: use fine gradient evaluation for interpolate_at_offset
19577ec681f3Smrg- r600/sfn: Fix interpolate at sample
19587ec681f3Smrg- r600/sfn: Fix indirect const buffer access
19597ec681f3Smrg- r600/sfn; go back to not lowering uniforms to UBOs
19607ec681f3Smrg- r600/sfn: replace hand-backed literal check by NIR function
19617ec681f3Smrg- r600/sfn: remove old code to track uniforms as it is no longer needed
19627ec681f3Smrg- r600/sfn: Add support for helper invocations
19637ec681f3Smrg- r600/sfn: Fix enabling the right interpolator for inerpolate_at_sample
19647ec681f3Smrg- r600/sfn: Fix IDX register ID
19657ec681f3Smrg- r600/sfn: Add support for more barrier instructions
19667ec681f3Smrg- r600/sfn: extend life range of all variables by one
19677ec681f3Smrg- r600/sfn: Don't reuse registers for workgroup ID and local invocation ID
19687ec681f3Smrg- r600/sfn: Fix ssbo resource offset for buffer loads
19697ec681f3Smrg- r600/sfn: Fix keepalive patch
19707ec681f3Smrg- r600/sfn: fix mega fetch count for SSBO/Image atomics result fetch
19717ec681f3Smrg- r600/sfn: Rework get_temp_register to return a smart pointer to GPRValue
19727ec681f3Smrg- r600/sfn: use shared pointer to GPR for FS sysvalues
19737ec681f3Smrg- r600/sfn: Handle mem barrier and image barrier by using ACK
19747ec681f3Smrg- r600/sfn: use cacheless op for coherent image write
19757ec681f3Smrg- r600/sfn: use 32 bit bools
19767ec681f3Smrg- r600/sfn: fix remapping of deleted attributes
19777ec681f3Smrg- r600/sfn: Use register keep-alive also when scanning the shader
19787ec681f3Smrg- r600/sfn: Fix the parameter component type
19797ec681f3Smrg- r600/sfn: Update state docu
19807ec681f3Smrg- compile/nir: Correct printing dest_type
19817ec681f3Smrg- r600/sfn: lower bool to int32 only after common optimizations
19827ec681f3Smrg- r600/sfn: fix component loading from fixed buffer ID
19837ec681f3Smrg
19847ec681f3SmrgGreg V (1):
19857ec681f3Smrg
19867ec681f3Smrg- radv,anv: use CLOCK_MONOTONIC_FAST when CLOCK_MONOTONIC_RAW is undefined
19877ec681f3Smrg
19887ec681f3SmrgGuido Günther (1):
19897ec681f3Smrg
19907ec681f3Smrg- kmsro: Extend to include imx-dcss
19917ec681f3Smrg
19927ec681f3SmrgGurchetan Singh (7):
19937ec681f3Smrg
19947ec681f3Smrg- virgl: add flags to (\*resource_create) callback
19957ec681f3Smrg- drm-uapi: virtgpu_drm.h: resource create blob + host visible memory region
19967ec681f3Smrg- virgl/drm: query for resource blob and host visible memory region
19977ec681f3Smrg- virgl/drm: add resource create blob function
19987ec681f3Smrg- virgl: support PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT
19997ec681f3Smrg- virgl: query blob mem
20007ec681f3Smrg- virgl: fix stride + layer_stride inconsistency
20017ec681f3Smrg
20027ec681f3SmrgHoe Hao Cheng (7):
20037ec681f3Smrg
20047ec681f3Smrg- zink: generate extension infrastructure using a python script
20057ec681f3Smrg- zink: hook zink_device_info.py to build system
20067ec681f3Smrg- zink: use the new extension infrastructure in device creation
20077ec681f3Smrg- zink: use the new, generated extension infrastructure
20087ec681f3Smrg- zink: remove old extension infrastructure
20097ec681f3Smrg- zink: implement pipe_device_reset_callback
20107ec681f3Smrg- zink: call the reset callback not only during a status check
20117ec681f3Smrg
20127ec681f3SmrgHyunjun Ko (4):
20137ec681f3Smrg
20147ec681f3Smrg- freedreno: support GL_EXT_semaphore
20157ec681f3Smrg- turnip: Refactor structs of tu_query
20167ec681f3Smrg- turnip: Support pipeline statistics query
20177ec681f3Smrg- turnip: Implement VK_EXT_host_query_reset
20187ec681f3Smrg
20197ec681f3SmrgIago Toral Quiroga (443):
20207ec681f3Smrg
20217ec681f3Smrg- v3d/compiler: fix V3D double-rounding of .8 fixed-point XY coordinates
20227ec681f3Smrg- v3dv: add support for valgrind macros
20237ec681f3Smrg- v3dv: implement vkCreateInstance
20247ec681f3Smrg- v3dv: implement vkDestroyInstance
20257ec681f3Smrg- v3dv: implement vkEnumeratePhysicalDevices
20267ec681f3Smrg- v3dv: pretend to initialize a physical device
20277ec681f3Smrg- v3dv: Implement vkGetPhysicalDeviceProperties
20287ec681f3Smrg- v3dv: retrieve device name from device info
20297ec681f3Smrg- v3dv: add a comment to clarify how we should implement uuid / deviceID retrieval
20307ec681f3Smrg- v3dv: implement vkGetPhysicalDeviceMemoryProperties
20317ec681f3Smrg- v3dv: implement vkGetPhysicalDeviceFeatures
20327ec681f3Smrg- v3dv: implement vkEnumerateDeviceExtensionProperties
20337ec681f3Smrg- v3dv: amend vkEnumerateInstanceExtensionProperties to handle layers
20347ec681f3Smrg- v3dv: implement vkGetPhysicalDeviceQueueFamilyProperties
20357ec681f3Smrg- v3dv: implement vkCreateDevice
20367ec681f3Smrg- v3dv: implement vkGetDeviceQueue
20377ec681f3Smrg- v3dv: add dummy implementations for the packet definition generator
20387ec681f3Smrg- v3dv: add stubs for the format table and vkGetPhysicalDeviceFormatProperties
20397ec681f3Smrg- v3dv: add some basic support for format properties
20407ec681f3Smrg- v3dv: implement vkEnumerate{Instance,Device}LayerProperties
20417ec681f3Smrg- v3dv: add stub for vkDeviceWaitIdle
20427ec681f3Smrg- v3dv: implement vkCreateImage
20437ec681f3Smrg- v3dv: implement vkGetImageMemoryRequirements
20447ec681f3Smrg- v3dv: initialize mememory heaps in the physical device
20457ec681f3Smrg- v3dv: implement vkAllocateMemory
20467ec681f3Smrg- v3dv: implement vkFreeMemory
20477ec681f3Smrg- v3dv: implement vkMapMemory
20487ec681f3Smrg- v3dv: implement vkUnmapMemory
20497ec681f3Smrg- v3dv: implement vkBindImageMemory
20507ec681f3Smrg- v3dv: implement vkCreateImageView
20517ec681f3Smrg- v3dv: implement vk{Create,Destroy}Buffer
20527ec681f3Smrg- v3dv: implement vkGetBufferMemoryRequirements
20537ec681f3Smrg- v3dv: implement vkBindBufferMemory
20547ec681f3Smrg- v3dv: implement vkCreateRenderPass
20557ec681f3Smrg- v3dv: implement vk{Create,Destroy}RenderPass
20567ec681f3Smrg- v3dv: implement vk{Create,Destroy}Framebuffer
20577ec681f3Smrg- v3dv: implement vkCreateCommandPool
20587ec681f3Smrg- v3dv: implement vk{Allocate,Free}CommandBuffers
20597ec681f3Smrg- v3dv: create a v3dv_bo struct and reference it from v3dv_device_memory
20607ec681f3Smrg- v3dv: add a concept of a command list
20617ec681f3Smrg- v3dv: implement vkBeginCommandBuffer
20627ec681f3Smrg- v3dv: start handling command buffer status
20637ec681f3Smrg- v3dv: implement vkGetPhysicalDeviceImageFormatProperties
20647ec681f3Smrg- v3dv: make v3dv_bo_alloc allocate memory for the bo struct
20657ec681f3Smrg- v3dv: compute tile size for framebuffer
20667ec681f3Smrg- v3dv: implement vkCmdBeginRenderPass
20677ec681f3Smrg- v3dv: make the command buffer own the command list BOs
20687ec681f3Smrg- v3dv: add a few more API stubs
20697ec681f3Smrg- v3dv: store base mip level in the image view
20707ec681f3Smrg- v3dv: add the tile state and alloc BOs to the command buffer BO list
20717ec681f3Smrg- v3dv: revert the decision that the command buffer takes ownership of BOs
20727ec681f3Smrg- v3dv: implement vkDestroyImage and vkDestroyImageView
20737ec681f3Smrg- v3dv: make v3dv_layer_offset public
20747ec681f3Smrg- v3dv: plug leak when destroying device
20757ec681f3Smrg- v3dv: precompute more tiling info at framebuffer creation time
20767ec681f3Smrg- v3dv: emit scissor to render area and precompute hw color clear values
20777ec681f3Smrg- v3dv: emit the render command list
20787ec681f3Smrg- v3dv: implement vkEndCommandBuffer
20797ec681f3Smrg- v3dv: create the command buffer BO set before we init CLs
20807ec681f3Smrg- v3dv: keep track of the numbre of BOs in a command buffer
20817ec681f3Smrg- v3dv: clear set of BOs in the command buffer on reset
20827ec681f3Smrg- v3dv: implement vkQueueSubmit
20837ec681f3Smrg- v3dv: be more conservative resetting command buffer state
20847ec681f3Smrg- v3dv: setup color clear values at subpass setup time
20857ec681f3Smrg- v3dv: emit tile loads
20867ec681f3Smrg- v3dv: flush at the end of each subpass
20877ec681f3Smrg- v3dv: split framebuffer internal bpp calculations from tiling calculations
20887ec681f3Smrg- v3dv: rename and make compute_tile_size_for_framebuffer() public
20897ec681f3Smrg- v3dv: implement vkCmdCopyImageToBuffer
20907ec681f3Smrg- v3dv: add the concept of a job
20917ec681f3Smrg- v3dv: implement vkCmdNextSubpass
20927ec681f3Smrg- v3dv: use the correct miplevel slice for the tile load operation
20937ec681f3Smrg- v3dv: implement vkCmdPipelineBarrier
20947ec681f3Smrg- v3dv: do not automatically emit a binner flush when finishing jobs
20957ec681f3Smrg- v3dv: fix clipping against render area
20967ec681f3Smrg- v3dv: add a note on interactions between clearing and scissor
20977ec681f3Smrg- v3dv: rewrite attachment state tracking
20987ec681f3Smrg- v3dv: only clear attachments on the first subpass that uses them
20997ec681f3Smrg- v3dv: merge subpasses into the same job when possible
21007ec681f3Smrg- v3dv: fix tile buffer loading
21017ec681f3Smrg- v3dv: rewrite the attachment needs clearing condition
21027ec681f3Smrg- v3dv: create a helper to start a new frame
21037ec681f3Smrg- v3dv/cmd_buffer: rename render pass RCL emission helpers to be more explicit
21047ec681f3Smrg- v3dv: handle VK_ATTACHMENT_UNUSED properly in more places
21057ec681f3Smrg- v3dv: implement vkDeviceWaitIdle
21067ec681f3Smrg- v3dv: implement vk{Create,Destroy}Semaphore
21077ec681f3Smrg- v3dv: implement semaphore waits and signals on queue submissions
21087ec681f3Smrg- v3dv: implement fences
21097ec681f3Smrg- v3dv: support queue submissions with multiple command buffers
21107ec681f3Smrg- v3dv: implement vkGetPhysicalDeviceSparseImageFormatProperties
21117ec681f3Smrg- v3dv: include Vulkan version 1.1 as unsupported.
21127ec681f3Smrg- v3dv: implement VK_KHR_get_physical_device_properties2
21137ec681f3Smrg- v3dv: implement VK_KHR_external_memory_capabilities
21147ec681f3Smrg- v3dv: implement VK_KHR_external_memory{_fd,_dma_buf}
21157ec681f3Smrg- v3dv: fix copy image to buffer
21167ec681f3Smrg- v3dv: implement vkGetImageSubresourceLayout
21177ec681f3Smrg- v3dv: implement DRM modifier setup for WSI
21187ec681f3Smrg- v3dv: hook up WSI support
21197ec681f3Smrg- v3dv: implement device detection on actual hardware
21207ec681f3Smrg- v3dv: allocate winsys BOs properly
21217ec681f3Smrg- v3dv: rename drm device fields so they are more explict
21227ec681f3Smrg- v3dv: don't swap RB channels when copying images to buffers
21237ec681f3Smrg- v3dv: implement support for depth testing
21247ec681f3Smrg- v3dv: don't always skip tile buffer stores
21257ec681f3Smrg- v3dv: compute subpass ranges for attachments at render pass creation time
21267ec681f3Smrg- v3dv: select the depth/stencil buffer from the attachment aspect mask
21277ec681f3Smrg- v3dv: select correct internal type for depth/stencil formats
21287ec681f3Smrg- v3dv: support depth testing on combined depth/stencil formats
21297ec681f3Smrg- v3dv: implement stencil testing
21307ec681f3Smrg- v3dv: fix indentation
21317ec681f3Smrg- v3dv: support copying depth/stencil aspects to buffer
21327ec681f3Smrg- v3dv: fix viewport state from pipeline
21337ec681f3Smrg- v3dv: implement early Z optimization
21347ec681f3Smrg- v3dv: clamp stencil masks and reference value to supported limits
21357ec681f3Smrg- v3dv: implement dynamic stencil states
21367ec681f3Smrg- v3dv: fix the mess with dynamic state handling
21377ec681f3Smrg- v3dv: add a helper to compute the hardware clear color
21387ec681f3Smrg- v3dv: add a helper to get the Z/S buffer from an aspect mask
21397ec681f3Smrg- v3dv: implement vkCmdClearAttachments
21407ec681f3Smrg- v3dv: implement indexed draws
21417ec681f3Smrg- v3dv: fix clockwise primitive setting
21427ec681f3Smrg- v3dv: ignore image view aspects for depth/stencil attachments
21437ec681f3Smrg- v3dv: take the number of layers from the framebuffer
21447ec681f3Smrg- v3dv: Add more supported formats to our format table
21457ec681f3Smrg- v3dv: don't advertise texel buffer support yet.
21467ec681f3Smrg- v3dv: implement vkCmdCopyBuffer
21477ec681f3Smrg- v3dv: implement vkCmdUpdateBuffer
21487ec681f3Smrg- v3dv: implement vkCmdFillBuffer
21497ec681f3Smrg- v3dv: move the framebuffer setup code for buffer copy/fill to a helper
21507ec681f3Smrg- v3dv: add a concept of a fake framebuffer for meta-copy operations
21517ec681f3Smrg- v3dv: refactor common code in meta copy operations
21527ec681f3Smrg- v3dv: fix copy size for image to buffer copies
21537ec681f3Smrg- v3dv: implement vkCmdCopyImage
21547ec681f3Smrg- v3dv: implement vkCmdClearColorImage
21557ec681f3Smrg- v3dv: fix buffer automatic stride for image to buffer copies
21567ec681f3Smrg- v3dv: implement vkCmdClearDepthStencilImage
21577ec681f3Smrg- v3dv: implement vkCmdCopyBufferToImage for color formats
21587ec681f3Smrg- v3dv: vkCmdCopyBufferToImage for depth/stencil formats
21597ec681f3Smrg- v3dv: add an assert to catch applications trying to clear invalid aspects
21607ec681f3Smrg- v3dv: implement indirect draws
21617ec681f3Smrg- v3dv: add support for primitive restarts on indexed draw calls
21627ec681f3Smrg- v3dv: initialize in_sync_bcl in our submits
21637ec681f3Smrg- v3dv: implement vkResetCommandBuffer
21647ec681f3Smrg- v3dv: add assertions for unimplemented fallback paths
21657ec681f3Smrg- v3dv: honor swizzle for non-copy operations of color formats
21667ec681f3Smrg- v3dv: implement vkQueueWaitIdle
21677ec681f3Smrg- v3dv: destroy wsi device during physical device termination
21687ec681f3Smrg- v3dv: implement vk{Create,Destroy}BufferView
21697ec681f3Smrg- v3dv: implement host-side event handling functions
21707ec681f3Smrg- v3dv: adjust a few limits to comply with CTS minimum requirements
21717ec681f3Smrg- v3dv: declare that we support robust buffer access
21727ec681f3Smrg- v3dv: meet requirements for supported format features and properties
21737ec681f3Smrg- v3dv: implement vkResetCommandPool
21747ec681f3Smrg- v3dv: don't swap R/B channels for VK_FORMAT_R5B6G5_UNORM_PACK16
21757ec681f3Smrg- v3dv: don't use TLB path for formats that are not supported for rendering
21767ec681f3Smrg- v3dv: fix image clearing with VK_REMAINING\_\*
21777ec681f3Smrg- v3dv: don't support image formats that we can rendet to or texture from
21787ec681f3Smrg- v3dv: fix fill buffer with VK_WHOLE_SIZE
21797ec681f3Smrg- v3dv: implement vkGetRenderAreaGranularity
21807ec681f3Smrg- v3dv: fix supertile coverage when render are size is 0.
21817ec681f3Smrg- v3dv: take memory format from appropriate miplevel for image load/store
21827ec681f3Smrg- v3dv: fix framebuffer format when computing fragment shader key
21837ec681f3Smrg- v3dv: fix subpass tracking in the command buffer state
21847ec681f3Smrg- v3dv: rewrite frame tiling setup
21857ec681f3Smrg- v3dv: more frame tiling refactors
21867ec681f3Smrg- v3dv: trivial refactors in a few meta copy helpers
21877ec681f3Smrg- v3dv: assign driver locations on fragment shader output variables
21887ec681f3Smrg- v3dv: don't reset loader data on command buffers
21897ec681f3Smrg- v3dv: drop incorrect assertion
21907ec681f3Smrg- v3dv: add a no-op fragment shader if we don't have one
21917ec681f3Smrg- v3dv: implement interpolation qualifiers
21927ec681f3Smrg- v3d/compiler: implement nir_op_fquantize2f16
21937ec681f3Smrg- v3dv: call nir_lower_io_arrays_to_elements_no_indirects on vertex shaders
21947ec681f3Smrg- v3dv: fix incorrect sizing of the vertex attribute state array
21957ec681f3Smrg- v3dv: split fragment shader array outputs
21967ec681f3Smrg- v3dv: lower usubborrow and uaddcarry
21977ec681f3Smrg- v3dv: lower {i,u}mulExtended
21987ec681f3Smrg- v3dv: don't assume that VkPipelineColorBlendStateCreateInfo is provided
21997ec681f3Smrg- v3dv: drop incorrect assertion
22007ec681f3Smrg- v3dv: drop assert for map of a mapped buffer
22017ec681f3Smrg- v3dv: fix image tiling configuration
22027ec681f3Smrg- v3dv: fix scissor outside viewport
22037ec681f3Smrg- v3dv: fix viewport Z
22047ec681f3Smrg- v3dv: work around viewport Z scale hardware bug
22057ec681f3Smrg- v3dv: don't leak job allocations
22067ec681f3Smrg- v3dv: handle the case where we fail to allocate a new job gracefully
22077ec681f3Smrg- v3dv: only export the last job sync object once
22087ec681f3Smrg- v3dv: support submits without a command buffer
22097ec681f3Smrg- v3dv: return OOM error if we fail to import or export sync objects
22107ec681f3Smrg- v3dv: use vk_error() for all queue/submit errors
22117ec681f3Smrg- v3dv: fix copies and clears of 3D images
22127ec681f3Smrg- v3dv: fix depth/stencil clear color
22137ec681f3Smrg- v3dv: implement color blending
22147ec681f3Smrg- v3dv: only expose blending on formats that support it
22157ec681f3Smrg- v3dv: add an 'always flush' mode
22167ec681f3Smrg- v3dv: always flush draw calls if we are doing sRGB blending
22177ec681f3Smrg- v3dv: implement dynamic state for blend constants
22187ec681f3Smrg- v3dv: only emit blend state if the pipeline is dirty
22197ec681f3Smrg- v3dv: rewrite dirty state handling
22207ec681f3Smrg- v3dv: drop redundant emission of stencil state
22217ec681f3Smrg- v3dv: stencil state fixes
22227ec681f3Smrg- v3dv: only emit config bits and varyings packets if needed
22237ec681f3Smrg- v3dv: use perp end caps rasterization mode for lines
22247ec681f3Smrg- v3dv: drop incorrect assertion on number of clear values at render pass begin
22257ec681f3Smrg- v3dv: disable depth/stencil testing if we don't have a depth/stencil attachment
22267ec681f3Smrg- v3dv: assert on vkCreateComputePipelines
22277ec681f3Smrg- v3dv: improve assert handling for fallback paths on meta copy/clear operations
22287ec681f3Smrg- v3dv: check support for transfer usage flags
22297ec681f3Smrg- v3dv: make sure we only expose transfer features for formats we can use
22307ec681f3Smrg- v3dv: use compatible TLB formats if possible during copies and clears
22317ec681f3Smrg- v3dv: fix incorrect image slice selection
22327ec681f3Smrg- v3dv: fix clearing of 3D images
22337ec681f3Smrg- v3dv: fix job subpass index for vkCmdClearAttachments jobs
22347ec681f3Smrg- v3dv: don't emit the subpass RCL for jobs that have emitted their own
22357ec681f3Smrg- v3dv: fix a1r5g5b5 format
22367ec681f3Smrg- v3dv: allow to create shader modules from NIR
22377ec681f3Smrg- v3dv: improve asserts for VkPipelineColorBlendStateCreateInfo handling
22387ec681f3Smrg- v3dv: implement partial color attachment clears
22397ec681f3Smrg- v3dv: implement partial depth/stencil attachment clears
22407ec681f3Smrg- v3dv: implement proper caching for partial clear pipelines
22417ec681f3Smrg- v3dv: store the clip window in the command buffer state
22427ec681f3Smrg- v3dv: check the render area against the clip window
22437ec681f3Smrg- v3dv: fix v3dv_GetRenderAreaGranularity to account for attachment bpp
22447ec681f3Smrg- v3dv: don't always assert that we have an active job
22457ec681f3Smrg- v3dv: use the TLB to clear attachments even if we have an active scissor
22467ec681f3Smrg- v3dv: restrict render pass clears to the render area
22477ec681f3Smrg- v3dv: handle stencil load/store operations
22487ec681f3Smrg- v3dv: assert on subpasses that use input or resolve attachments
22497ec681f3Smrg- v3dv: push/pop more state during meta operations
22507ec681f3Smrg- v3dv: create a v3dv_cmd_buffer_subpass_resume helper
22517ec681f3Smrg- v3dv: set render area for partial clears to match clear rect
22527ec681f3Smrg- v3dv: compute tile granularity for each subpass
22537ec681f3Smrg- v3dv: fix incorrect attachment reference
22547ec681f3Smrg- v3dv: fix incorrect attachment reference
22557ec681f3Smrg- v3dv: simplify partial clearing code
22567ec681f3Smrg- v3dv: handle partial clears of just one aspect of combined DS targets
22577ec681f3Smrg- v3d/compiler: implement nir_intrinsic_load_base_instance
22587ec681f3Smrg- v3dv: emit instanced draw calls when requested
22597ec681f3Smrg- v3dv: fix subpass merge tests
22607ec681f3Smrg- v3dv: reset all state to dirty when we start a new job for a command buffer
22617ec681f3Smrg- v3dv: implement occlusion queries
22627ec681f3Smrg- v3dv: submit a no-op job if a command buffer doesn't have any jobs.
22637ec681f3Smrg- v3dv: simplify handling of no-op jobs
22647ec681f3Smrg- v3dv: add a bunch of API stubs
22657ec681f3Smrg- v3dv: implement TFU blits
22667ec681f3Smrg- v3dv: reset subpass index at render pass end
22677ec681f3Smrg- v3dv: meta operations can happen outside a render pass
22687ec681f3Smrg- v3dv: save and restore descriptor state during meta operations if needed
22697ec681f3Smrg- v3dv: save and restore push constant state during meta operations
22707ec681f3Smrg- v3dv: implement shader draw fallback for vkCmdBlitImage
22717ec681f3Smrg- v3dv: require optimal tiling for features that reqiure sampling
22727ec681f3Smrg- v3dv: move early-Z update to pre-draw
22737ec681f3Smrg- v3dv: don't leak NIR code in pipelines
22747ec681f3Smrg- v3dv: don't leak host memory allocated for shader variants
22757ec681f3Smrg- v3dv: don't leak default pipeline attributes BO
22767ec681f3Smrg- v3dv: don't leak prog_data from shader variants
22777ec681f3Smrg- v3dv: don't leak the compiler from the physical device
22787ec681f3Smrg- v3dv: don't leak the texture shader state BO from image views
22797ec681f3Smrg- v3dv: don't leak state BO from samplers
22807ec681f3Smrg- v3dv/blit: fix integer blits from larger to lower bit size
22817ec681f3Smrg- v3dv: handle miplevel correctly for blits
22827ec681f3Smrg- v3dv: support depth blits
22837ec681f3Smrg- v3dv: don't support blitting of combined depth/stencil formats
22847ec681f3Smrg- v3dv: don't support 1D depth/stencil for transfer sources or sampling
22857ec681f3Smrg- v3dv: remove incorrect assert
22867ec681f3Smrg- v3dv: support blits with 1D and 3D images
22877ec681f3Smrg- v3dv: add framework for private driver objects
22887ec681f3Smrg- v3dv: fix leaks during recording of meta blits
22897ec681f3Smrg- v3dv: use the private object framework in the meta clear path
22907ec681f3Smrg- v3dv: implement fallback for partial image copies
22917ec681f3Smrg- v3dv: implement stencil aspect blits for combined depth/stencil format
22927ec681f3Smrg- v3d: fix Tile Rendering Mode Cfg (Color) packet description
22937ec681f3Smrg- v3dv: limit software integer RT clamp to rgb10a2
22947ec681f3Smrg- v3dv: handle copies from/to compressed formats
22957ec681f3Smrg- v3dv: implement partial buffer copies to color images
22967ec681f3Smrg- v3dv: support blitting both depth and stencil aspects at the same time
22977ec681f3Smrg- v3dv: implement partial buffer copies to depth/stencil images
22987ec681f3Smrg- v3dv: always return true from a fallback path if it can handle the case
22997ec681f3Smrg- v3dv: fix image addressing calculations to account for suballocation
23007ec681f3Smrg- v3dv: only require 4-byte alignment for linear images
23017ec681f3Smrg- v3dv: implement partial image to buffer copies
23027ec681f3Smrg- v3dv: do not rewrite blit spec for combined depth/stencil in get_blit_pipeline
23037ec681f3Smrg- v3dv: drop blit path for depth/stencil formats
23047ec681f3Smrg- v3dv: implement depth bias
23057ec681f3Smrg- v3dv: ignore dynamic updates of depth bounds state
23067ec681f3Smrg- v3dv: implement wide lines
23077ec681f3Smrg- v3dv: fix dynamic blend constants
23087ec681f3Smrg- v3dv: fix the command buffer private object framework for 32-bit
23097ec681f3Smrg- v3dv: fix depth/stencil clears on hardware
23107ec681f3Smrg- v3dv: make the driver more robust against OOM
23117ec681f3Smrg- v3dv: implement events
23127ec681f3Smrg- v3dv: don't leak BOs from CLs when using BRANCH
23137ec681f3Smrg- v3dv: fix vkResetCommandPool
23147ec681f3Smrg- v3dv: make TLB clearing paths return true/false
23157ec681f3Smrg- v3dv: drop the extra BO handling from the command buffer
23167ec681f3Smrg- v3dv: remove some unnecessary / unused functions
23177ec681f3Smrg- v3dv: assert command buffers are executable when submitting to a queue
23187ec681f3Smrg- v3dv: check that GPU device matches requirements
23197ec681f3Smrg- v3dv: ensure BCL space is available before emitting packets
23207ec681f3Smrg- v3dv: handle OOM properly during command buffer recording in more places
23217ec681f3Smrg- v3dv: fix bogus command buffer allocation scopes
23227ec681f3Smrg- v3dv: add basic support for secondary command buffers
23237ec681f3Smrg- v3dv: implement vkCmdWaitEvents for secondary command buffers
23247ec681f3Smrg- v3dv: support vkCmdClearAttachments in secondary command buffers
23257ec681f3Smrg- v3dv: don't leak attachment state
23267ec681f3Smrg- v3dv: add stubs for missing API implementations
23277ec681f3Smrg- v3dv: warn users that this is not a conformant driver
23287ec681f3Smrg- v3dv: fix BCL start offset in presence of chained BOs
23297ec681f3Smrg- v3dv: regen BO lists for CLs inside cloned jobs
23307ec681f3Smrg- v3dv: fix a few cases where we were ignoring suballocated buffers
23317ec681f3Smrg- v3dv: fix release build warnings
23327ec681f3Smrg- v3dv: actually enable early Z
23337ec681f3Smrg- v3dv: try harder to skip emission of redundant state
23347ec681f3Smrg- v3dv: add a TFU path for buffer to image copies
23357ec681f3Smrg- v3dv: add a CPU path for buffer to image copies
23367ec681f3Smrg- v3dv: try to use TFU path when creating tiled images from linear buffers
23377ec681f3Smrg- v3dv: always map full BOs
23387ec681f3Smrg- v3dv: support compute pipelines
23397ec681f3Smrg- v3dv: handle separate binding points for compute and graphics
23407ec681f3Smrg- v3dv: implement compute dispatch
23417ec681f3Smrg- v3dv: handle unsized arrays in SSBOs
23427ec681f3Smrg- v3dv: always emit index buffer state for new jobs
23437ec681f3Smrg- v3dv: implement indirect compute dispatch
23447ec681f3Smrg- v3dv: return a proper error for too large buffer allocations
23457ec681f3Smrg- v3dv: assert that our framebuffers are single sampled
23467ec681f3Smrg- v3dv: don't free BOs from imported memory objects
23477ec681f3Smrg- v3dv: pipeline initialization fixes for disabled rasterization
23487ec681f3Smrg- v3dv: handle empty set layouts
23497ec681f3Smrg- v3dv: don't reset descriptor state after a meta operation
23507ec681f3Smrg- v3dv: lower unpack_{u,s}norm_2x16
23517ec681f3Smrg- v3dv: lower frexp
23527ec681f3Smrg- v3dv: implement support for shader spilling
23537ec681f3Smrg- v3dv: fix GFXH-930 workaround
23547ec681f3Smrg- v3dv: add workaround for GFXH-1602
23557ec681f3Smrg- v3dv: improve handling of too large image sizes
23567ec681f3Smrg- v3dv: handle draw after barrier
23577ec681f3Smrg- v3dv: fix vkCmdCopyBuffer unaligned TLB access
23587ec681f3Smrg- v3dv: fix textureSize() for cube arrays
23597ec681f3Smrg- v3dv: fix srcSubresource description for image to buffer blits
23607ec681f3Smrg- v3dv: fix blit_shader() to honor the region's aspect mask
23617ec681f3Smrg- v3dv: handle unnormalized coordinates in samplers
23627ec681f3Smrg- v3dv: use swizzle X001 with D/S formats
23637ec681f3Smrg- v3dv: fix regressions for cubemap array load/store
23647ec681f3Smrg- v3dv: fix color border clamping with specific formats
23657ec681f3Smrg- v3dv: make sure we emit vertex attributes in location order
23667ec681f3Smrg- v3d/compiler: support swapping R/B channels in vertex attributes.
23677ec681f3Smrg- v3dv: handle VK_FORMAT_B8G8R8A8_UNORM vertex attributes
23687ec681f3Smrg- v3dv: don't support sRGB buffer formats
23697ec681f3Smrg- v3dv: improve pipeline barrier handling
23707ec681f3Smrg- v3dv: use a binning sync for CL jobs waiting on a semaphore
23717ec681f3Smrg- v3dv: ignore stencil load operation if attachment format doesn't have stencil
23727ec681f3Smrg- v3dv: only use per-buffer clear bit for cases were we are already storing
23737ec681f3Smrg- v3dv: avoid prime blit path when presenting WSI images
23747ec681f3Smrg- v3dv: only care about barriers between GPU jobs
23757ec681f3Smrg- v3dv: emit new shader state if viewport is dirty
23767ec681f3Smrg- v3dv: only clear depth/stencil attachments if any aspect needs clearing
23777ec681f3Smrg- v3dv: add a fast path for vkCmdClearAttachments
23787ec681f3Smrg- v3dv: enable shaderClipDistance
23797ec681f3Smrg- v3dv: enable fillModeNonSolid
23807ec681f3Smrg- v3dv: fix dynamic state after meta operation
23817ec681f3Smrg- v3dv: consider MSAA when computing frame tiling
23827ec681f3Smrg- v3dv: process VkPipelineMultisampleStateCreateInfo properly
23837ec681f3Smrg- v3dv: implement subpass multisample rendering and resolve
23847ec681f3Smrg- v3dv: implement vkCmdResolveImage for whole images
23857ec681f3Smrg- v3dv: handle multisampled image copies in the TLB path
23867ec681f3Smrg- v3dv: setup texture shader state correctly for multisampled images
23877ec681f3Smrg- v3dv: add a blit fallback path for vkCmdResolveImage
23887ec681f3Smrg- v3dv: handle multisampled image copies with the blit path
23897ec681f3Smrg- broadcom/compiler: handle gl_SampleMask writes in fragment shaders
23907ec681f3Smrg- v3dv: amend tile size tables with smallest tile sizes available
23917ec681f3Smrg- nir/glsl: add a glsl_ivec4_type() helper
23927ec681f3Smrg- v3dv: fix blitting of signed integer formats
23937ec681f3Smrg- v3dv: handle multisample resolve of integer formats
23947ec681f3Smrg- v3dv: handle multisample resolves for formats that don't support TLB resolves
23957ec681f3Smrg- v3dv: handle multisample image clears
23967ec681f3Smrg- broadcom/compiler: implement nir_intrinsic_load_sample_pos
23977ec681f3Smrg- broadcom/compiler: track if the fragment shader forces per-sample MSAA
23987ec681f3Smrg- v3dv: enable sample rate shading if fragment shader reads gl_SampleID
23997ec681f3Smrg- v3dv: implement nir_texop_texture_samples
24007ec681f3Smrg- v3dv: handle multisample rasterization with empty framebuffers
24017ec681f3Smrg- nir/lower_io: add an option to lower interpolateAt functions
24027ec681f3Smrg- v3dv: lower interpolateAt functions in NIR and enable sample rate shading
24037ec681f3Smrg- v3dv: only require texel-size alignment for linear images
24047ec681f3Smrg- v3dv: fix 3D image blits
24057ec681f3Smrg- v3dv: don't cache subpass color clear pipelines
24067ec681f3Smrg- v3dV: move meta init/finish to meta implementation files
24077ec681f3Smrg- nir: add a nir_get_ubo_size intrinsic
24087ec681f3Smrg- v3d/compiler: implement nir_intrinsic_get_ubo_size
24097ec681f3Smrg- v3dv: handle QUNIFORM_GET_UBO_SIZE
24107ec681f3Smrg- broadcom/compiler: rename QUNIFORM_GET_BUFFER_SIZE to QUNIFORM_GET_SSBO_SIZE
24117ec681f3Smrg- v3d/compiler: add a lowering pass for robust buffer access
24127ec681f3Smrg- v3dv: hook up robust buffer access
24137ec681f3Smrg- v3dv: fix color clear pipeline destruction for 32-bit architectures
24147ec681f3Smrg- v3dv: handle VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_IMAGE_DRM_FORMAT_MODIFIER_INFO
24157ec681f3Smrg- v3dv: expose DRM modifiers based on supported features
24167ec681f3Smrg- v3dv: fix offset computed by vkGetImageSubresourceLayout for array images
24177ec681f3Smrg- v3dv: fix size computed by vkGetImageSubresourceLayout for 3D images
24187ec681f3Smrg- v3dv: do not expose VK_IMAGE_USAGE_SAMPLED_BIT for swapchains
24197ec681f3Smrg- v3dv: signal semaphore/fence if needed after acquiring a swapchain image
24207ec681f3Smrg- v3dv: fix sampling from stencil aspect of a combined depth/stencil image
24217ec681f3Smrg- v3dv: honor VkPipelineDepthStencilStateCreateInfo::depthWriteEnable
24227ec681f3Smrg- v3dv: don't leak dumb BO handles allocated for swapchain images
24237ec681f3Smrg- v3dv: clean-up after obtaining an XCB connection
24247ec681f3Smrg- v3dv: free noop job if needed when finishing the queue
24257ec681f3Smrg- v3d/compiler: allow to batch spills
24267ec681f3Smrg- v3dv: always program a reasonable internal depth type for copies/clears
24277ec681f3Smrg- v3dv: only advertise one memory type
24287ec681f3Smrg- v3dv: flag tmu_dirty_rcl in primaries when linking secondaries that have it set
24297ec681f3Smrg- v3dv: implement workaround for GFXH-1461
24307ec681f3Smrg- v3dv: implement workaround for GFXH-1918
24317ec681f3Smrg- v3dv: fixes for barriers in secondary command buffers
24327ec681f3Smrg- v3dv: fix blit path for copies from 3D compressed images
24337ec681f3Smrg- v3dv: generate proper UUIDs for device and driver
24347ec681f3Smrg- v3dv: limit blit framebuffer dimensions to max coordinates
24357ec681f3Smrg- v3dv: drop a couple of obsolete comments
24367ec681f3Smrg- v3dv: fix buffer copies to compressed images on the blit path
24377ec681f3Smrg- broadcom/compiler: track partially interpolated fragment inputs
24387ec681f3Smrg- v3d/compiler: implement load interpolated input intrinsics
24397ec681f3Smrg- v3dv: skip unnecessary tile loads when blitting
24407ec681f3Smrg- v3dv: fix multi-layered buffer to image copies on the blit path
24417ec681f3Smrg- v3dv: do not attempt to blit from a linear image source
24427ec681f3Smrg- v3dv: fix Z coordinate for 3D blits
24437ec681f3Smrg- v3dv: handle compressed image to buffer copies on the blit path
24447ec681f3Smrg- v3dv: handle buffer to linear depth/stencil image copies in blit path
24457ec681f3Smrg- broadcom/cle: fix vec size dump when set to 0
24467ec681f3Smrg- v3d/compiler: fix BGRA vertex attributes for vec2/float size.
24477ec681f3Smrg- v3dv: compute swap_rb flag after applying all swizzles
24487ec681f3Smrg- v3dv: properly describe swap_color_rb
24497ec681f3Smrg- v3dv: enable the logicOp feature
24507ec681f3Smrg- v3dv: grow meta descriptor pool dynamically
24517ec681f3Smrg- v3dv: enable alphaToOne feature
24527ec681f3Smrg- v3dv: add image view debug checks for VK_KHR_maintenance1
24537ec681f3Smrg- v3dv: fix base slice selection for copies involving 3D images
24547ec681f3Smrg- v3dv: update assertion to match VK_KHR_maintenance1 semantics
24557ec681f3Smrg- v3dv: implement vkTrimCommandPool
24567ec681f3Smrg- v3dv: expose VK_KHR_maintenance1
24577ec681f3Smrg- v3dv: add support for timestamp queries
24587ec681f3Smrg- v3dv: fix occlusion query inheritance in secondary command buffers
24597ec681f3Smrg- zink: require Vulkan timestamp queries for time query caps
24607ec681f3Smrg- zink: add VK_STRUCTURE_TYPE_WSI_MEMORY_ALLOCATE_INFO_MESA for WSI allocations
24617ec681f3Smrg- v3dv: add a v3dv_bo_init helper
24627ec681f3Smrg- v3dv: expose more features
24637ec681f3Smrg- zink: fix pNext chain for resource memory allocation
24647ec681f3Smrg
24657ec681f3SmrgIan Romanick (34):
24667ec681f3Smrg
24677ec681f3Smrg- intel/vec4: Silence unused paramter warnings in brw_vec4_generator.cpp
24687ec681f3Smrg- intel/compiler: Silence unused parameter warning in brw_surface_payload_size
24697ec681f3Smrg- intel/compiler: Don't fallback to vec4 when scalar GS compile fails \[v2\]
24707ec681f3Smrg- intel/vec4: Remove inline lowering of LRP
24717ec681f3Smrg- intel/compiler: Remove INTEL_SCALAR\_... env variables
24727ec681f3Smrg- intel/vec4: Remove all support for Gen8+ \[v2\]
24737ec681f3Smrg- intel/vec4: Remove everything related to VS_OPCODE_SET_SIMD4X2_HEADER_GEN9
24747ec681f3Smrg- i965: Allow viewport array extensions with allow_higher_compat_version
24757ec681f3Smrg- intel: Silence many unused parameter warnings in blorp_genX_exec.h
24767ec681f3Smrg- i965: Silence many unused parameter warnings in genX_blorp_exec.c
24777ec681f3Smrg- i965: Silence many unused parameter warnings in genX_state_upload.c
24787ec681f3Smrg- i965: Make MOCS index tables static const
24797ec681f3Smrg- i965: Rename gen10_emit_isp_disable to gen7_emit_isp_disable
24807ec681f3Smrg- intel: Disable all support for Gen10
24817ec681f3Smrg- intel/compiler: Remove Gen10-specific code
24827ec681f3Smrg- i965: Remove Gen10-specific state setup and workarounds
24837ec681f3Smrg- i965: Don't build Gen10-specific files and libraries
24847ec681f3Smrg- intel: Remove Gen10-specific cache config code
24857ec681f3Smrg- intel/isl: Don't generate Gen10-specific functions
24867ec681f3Smrg- iris: Don't generate Gen10-specific functions
24877ec681f3Smrg- anv: Don't generate Gen10-specific functions
24887ec681f3Smrg- intel: Remove Gen10-speicific perf support
24897ec681f3Smrg- intel: Remove Gen10-specific device entries
24907ec681f3Smrg- i965: Silence unused parameter warnings
24917ec681f3Smrg- mesa/st: Silence unused parameter warnings in st_context.c
24927ec681f3Smrg- mesa: Pass the correct caller string to \_mesa_lookup_or_create_texture
24937ec681f3Smrg- glx: rework \__glXCalculateUsableExtensions to be more readable
24947ec681f3Smrg- nir: Rename replicated-result dot-product instructions
24957ec681f3Smrg- mesa: Open-code hash walk in \_mesa_HashPrint
24967ec681f3Smrg- mesa: Store the atlas Id in the gl_bitmap_atlas structure
24977ec681f3Smrg- i965: Get the gl_perf_query_object Id from the object
24987ec681f3Smrg- mesa: Remove the key parameter from the \_mesa_HashWalk callback
24997ec681f3Smrg- mesa: Remove the key parameter from the \_mesa_HashDeleteAll callback
25007ec681f3Smrg- intel/compiler: Rotate instructions ROR and ROL cannot have source modifiers
25017ec681f3Smrg
25027ec681f3SmrgIcecream95 (27):
25037ec681f3Smrg
25047ec681f3Smrg- panfrost: Fix border colour
25057ec681f3Smrg- docs/features: Add missing Panfrost extensions
25067ec681f3Smrg- panfrost: Cleanup panfrost_get_param
25077ec681f3Smrg- panfrost: Remove old comment on broken depth reload
25087ec681f3Smrg- panfrost: Correctly set modifier_constant
25097ec681f3Smrg- panfrost: Seperate resource setup and bo creation
25107ec681f3Smrg- panfrost: Move tiled-linear conversion checking to a new function
25117ec681f3Smrg- panfrost: AFBC to linear layout conversion
25127ec681f3Smrg- pan/mdg: Fix spilling of non-32-bit types
25137ec681f3Smrg- panfrost: Set modifier_constant to true for exported resources
25147ec681f3Smrg- pan/mdg: Return a bool from midgard_nir_lod_errata
25157ec681f3Smrg- pan/mdg: Use nir_shader_instructions_pass for nir_lod_errata
25167ec681f3Smrg- pan/mdg: Use nir_shader_instructions_pass for fdot2 lowering
25177ec681f3Smrg- Revert "panfrost: Drop implicit blend pooling"
25187ec681f3Smrg- panfrost: Clamp uniform buffer size
25197ec681f3Smrg- panfrost: Handle non-positive viewport positions
25207ec681f3Smrg- panfrost: Remove redundant casts of viewport position
25217ec681f3Smrg- panfrost: Mark blit shaders as internal
25227ec681f3Smrg- pan/mdg: Infer whether to disassemble shaders from info.internal
25237ec681f3Smrg- panfrost: Add a debug flag to disable AFBC
25247ec681f3Smrg- panfrost: Precise occlusion query support
25257ec681f3Smrg- panfrost: Only enable occlusion queries when active
25267ec681f3Smrg- panfrost: Move zs format handling code out of the !afbc case
25277ec681f3Smrg- panfrost: Z16 depth buffer support
25287ec681f3Smrg- panfrost: AFBC compress Z16 depth buffers
25297ec681f3Smrg- panfrost: Fix AFBC blits of resources with faked RGTC
25307ec681f3Smrg- panfrost: Fix stack shift calculation
25317ec681f3Smrg
25327ec681f3SmrgIgor V. Kovalenko (1):
25337ec681f3Smrg
25347ec681f3Smrg- r600: amend space check for chips older than EVERGREEN
25357ec681f3Smrg
25367ec681f3SmrgIlia Mirkin (1):
25377ec681f3Smrg
25387ec681f3Smrg- panfrost: enable DrawTransformFeedback\*
25397ec681f3Smrg
25407ec681f3SmrgIndrajit Kumar Das (5):
25417ec681f3Smrg
25427ec681f3Smrg- mesa: add NV_copy_depth_to_color support for nir
25437ec681f3Smrg- gallium: prepare framework for supporting GL_NV_shader_atomic_int64
25447ec681f3Smrg- mesa,glsl: add support for GL_NV_shader_atomic_int64
25457ec681f3Smrg- radeonsi: enable support for GL_NV_shader_atomic_int64
25467ec681f3Smrg- radeonsi/gfx10: fix stream index for multi-stream overflow query
25477ec681f3Smrg
25487ec681f3SmrgItalo Nicola (12):
25497ec681f3Smrg
25507ec681f3Smrg- nir: add shared/global atomics to nir_get_io_offset_src()
25517ec681f3Smrg- panfrost: fix undefined value access on mir_set_intr_mask()
25527ec681f3Smrg- panfrost: add atomic_cmpxchg opcode
25537ec681f3Smrg- panfrost: add LDST_ADDRESS property to atomic ops
25547ec681f3Smrg- panfrost: introduce LDST_ATOMIC property
25557ec681f3Smrg- panfrost: add support for src[3\] in LOAD_STORE ops
25567ec681f3Smrg- panfrost: add atomic ops infrastructure
25577ec681f3Smrg- panfrost: add support for atomics
25587ec681f3Smrg- nir/algebraic: fold some nested comparisons with ball and bany
25597ec681f3Smrg- pan/mdg: remove unused arg from ALU_CHECK_CMP and ALU_CASE_CMP
25607ec681f3Smrg- pan/mdg: map uabs_i/usub to i/uabsdiff
25617ec681f3Smrg- pan/mdg: fix LOCAL_STORAGE wls_instances packing
25627ec681f3Smrg
25637ec681f3SmrgIván Briano (1):
25647ec681f3Smrg
25657ec681f3Smrg- anv: restrict number of subgroups per group
25667ec681f3Smrg
25677ec681f3SmrgJames Park (16):
25687ec681f3Smrg
25697ec681f3Smrg- amd/addrlib: Fix warning list for msvc
25707ec681f3Smrg- radv: Increased const usage
25717ec681f3Smrg- util: Hide timespec_passed on Windows
25727ec681f3Smrg- radv: Only close local_fd when valid
25737ec681f3Smrg- ac,amd/llvm,radv: Initialize structs with {0}
25747ec681f3Smrg- util,radv,radv/winsys: Cross-platform rwlock API
25757ec681f3Smrg- util,ac,aco,radv: Cross-platform memstream API
25767ec681f3Smrg- util: Fix rwlock Windows include for MinGW
25777ec681f3Smrg- util/xmlconfig: Disable for Windows like Android
25787ec681f3Smrg- aco: Clean up some C++ usages
25797ec681f3Smrg- vulkan/util,vulkan/wsi,radv: Add typed outarray API
25807ec681f3Smrg- aco: Fix accidental copies, attempt two
25817ec681f3Smrg- nir: Stabilize compact_components sort
25827ec681f3Smrg- amd/llvm,aco: Replace VLA with alloca
25837ec681f3Smrg- radv,radv/winsys: Move RADV_MAX_IBS_PER_SUBMIT
25847ec681f3Smrg- radv: Fix leak in radv_amdgpu_winsys_destroy()
25857ec681f3Smrg
25867ec681f3SmrgJan Beich (1):
25877ec681f3Smrg
25887ec681f3Smrg- spirv: switch to util_bswap32 to improve portability
25897ec681f3Smrg
25907ec681f3SmrgJan Ziak (1):
25917ec681f3Smrg
25927ec681f3Smrg- Add driver override to envvars.rst
25937ec681f3Smrg
25947ec681f3SmrgJason Ekstrand (296):
25957ec681f3Smrg
25967ec681f3Smrg- iris: no-op implement set_compute_resources
25977ec681f3Smrg- iris: Implement set_global_binding
25987ec681f3Smrg- iris: Add support for serialized NIR
25997ec681f3Smrg- intel/cs_intrinsics: Handle 64-bit intrinsics
26007ec681f3Smrg- intel/compiler: Allow MESA_SHADER_KERNEL
26017ec681f3Smrg- iris: Use blob_write_uint32 for num_system_values
26027ec681f3Smrg- iris: Add a kernel_input_size field for compiled shaders
26037ec681f3Smrg- iris/disk_cache: Stop assuming stage == cache_id
26047ec681f3Smrg- iris: Copy dest size from the original intrinsic in setup_uniforms
26057ec681f3Smrg- iris: Upload kernel inputs with system values
26067ec681f3Smrg- iris: Add support for MESA_SHADER_KERNEL in the disk cache
26077ec681f3Smrg- nir: Add and use nir_foreach_block_unstructured helpers
26087ec681f3Smrg- nir/lower_goto_if: Document some data structures
26097ec681f3Smrg- nir/lower_goto_if: Clean up ralloc usage
26107ec681f3Smrg- nir/lower_goto_if: Use util/list instead of exec_list
26117ec681f3Smrg- nir/lower_goto_if: Rework handling of skip targets
26127ec681f3Smrg- nir/lower_goto_if: Rework some set union logic
26137ec681f3Smrg- nir/lower_goto_if: Sort blocks in select_fork
26147ec681f3Smrg- nir/lower_goto_if: Add a block_for_singular_set helper
26157ec681f3Smrg- nir/lower_goto_if: Replace a tripple loop with a double loop
26167ec681f3Smrg- nir/lower_goto_if: Add a route::outside set
26177ec681f3Smrg- nir/lower_goto_if: Add some debug prints
26187ec681f3Smrg- spirv: Add a MESA_SPIRV_FORCE_UNSTRUCTURED environment variable
26197ec681f3Smrg- nir/builder: Make nir_get_ptr_bitsize take a nir_shader
26207ec681f3Smrg- spirv: Don't emit RMW for vector indexing in shared or global
26217ec681f3Smrg- clover/nir: Stop setting ubo_addr_format
26227ec681f3Smrg- clover/nir: Stop computing the global address format twice
26237ec681f3Smrg- clover/nir: Use the correct address mode for shared
26247ec681f3Smrg- nir: Initialize nir_ssa_def::live_index
26257ec681f3Smrg- nir/builder: Add a nir_iand_imm helper
26267ec681f3Smrg- nir/find_array_copies: Handle cast derefs
26277ec681f3Smrg- nir/large_constants: Handle incomplete derefs
26287ec681f3Smrg- compiler/types: Allow interfaces in get_explicit_type_for_size_align
26297ec681f3Smrg- nir/opt_large_constants: Fix a type/deref_type typo
26307ec681f3Smrg- nir: Add an LOD parameter to image_*_size
26317ec681f3Smrg- iris: Stop advertising PIPE_SHADER_IR_NIR_SERIALIZED
26327ec681f3Smrg- iris: Stop advertising clover-only caps
26337ec681f3Smrg- iris: ref/unref the GLSL type singleton in screen_create/destroy
26347ec681f3Smrg- iris: Normalize all compute shaders to MESA_SHADER_COMPUTE
26357ec681f3Smrg- iris: Always re-upload sysvals when we have kernel inputs
26367ec681f3Smrg- intel/fs: Fix an assert in load_scratch
26377ec681f3Smrg- intel/nir: Allow splitting a single load into up to 32 loads
26387ec681f3Smrg- clover/spirv: Don't call llvm::regularizeLlvmForSpirv
26397ec681f3Smrg- clover: Call clang with -O0 for the SPIR-V path
26407ec681f3Smrg- nir: Report progress properly in nir_lower_bool_to\_\*
26417ec681f3Smrg- intel/nir: Pass the nir_builder by reference in lower_alpha_to_coverage
26427ec681f3Smrg- intel/nir: Rewrite the guts of lower_alpha_to_coverage
26437ec681f3Smrg- intel/nir: Clean up lower_alpha_to_coverage a bit
26447ec681f3Smrg- nir: Use a switch in nir_inline_function_impl
26457ec681f3Smrg- nir: Take a variable remap parameter in nir_inline_function_impl
26467ec681f3Smrg- intel/fs: Add support for vec8 and vec16 ops
26477ec681f3Smrg- intel/nir: Lower things with \\> 4 components in lower_mem_access_bit_sizes
26487ec681f3Smrg- spirv: Support big-endian strings
26497ec681f3Smrg- spirv: Delete some dead workgroup variable handling code
26507ec681f3Smrg- nir: Rename num_shared to shared_size
26517ec681f3Smrg- nir: Improve the comment on num_inputs and friends
26527ec681f3Smrg- intel/fs: Fix MOV_INDIRECT and BROADCAST of Q types on Gen11+
26537ec681f3Smrg- nir: Add a new nir_var_mem_constant variable mode
26547ec681f3Smrg- nir: Add a load_global_constant intrinsic
26557ec681f3Smrg- nir/lower_io: Use the variable mode for load_scratch_base_ptr checks
26567ec681f3Smrg- nir/lower_io: Add a build_addr_for_var helper
26577ec681f3Smrg- nir/lower_io: Add support for nir_var_mem_constant
26587ec681f3Smrg- nir: Allow opt_large_constants to be run with constant_data_size \\> 0
26597ec681f3Smrg- spirv: Use nir_var_mem_constant for UniformConstant data in CL
26607ec681f3Smrg- intel/fs: Implement nir_intrinsic_load_global_constant
26617ec681f3Smrg- nouveau/nir: Implement load_global_constant
26627ec681f3Smrg- llvmpipe: Add support for load_global_constant
26637ec681f3Smrg- clover/nir: Use nir_var_mem_constant for \__constant memory
26647ec681f3Smrg- spirv: Drop the constant_as_global as option
26657ec681f3Smrg- nir/lower_explicit_io: Assert that compute address sizes match derefs
26667ec681f3Smrg- clover: Use 64-bit offsets for shader_in on 64-bit GPUs
26677ec681f3Smrg- nir/clone: Add a helper for cloning most instruction types
26687ec681f3Smrg- intel/compiler: Get rid of the global compaction table pointers
26697ec681f3Smrg- intel/compiler: Get rid of struct gen_disasm
26707ec681f3Smrg- iris: Use gen_disassemble
26717ec681f3Smrg- intel/eu: Add some new helpers
26727ec681f3Smrg- intel/fs,vec4: Stuff the constant data from NIR in the end of the program
26737ec681f3Smrg- anv: Stop storing the shader constant data side-band
26747ec681f3Smrg- intel/eu: Include brw_compiler.h in brw_eu.h
26757ec681f3Smrg- intel/eu: Add a mechanism for emitting relocatable constant MOVs
26767ec681f3Smrg- intel/fs: Add support for a new load_reloc_const intrinsic
26777ec681f3Smrg- anv: Properly cache brw_stage_prog_data::relocs
26787ec681f3Smrg- nir/builder: Add load/store_global helpers
26797ec681f3Smrg- anv: Patch constant data pointers into shaders with using softpin
26807ec681f3Smrg- iris: Patch constant data pointers into shaders
26817ec681f3Smrg- intel/fs: Don't copy-propagate stride=0 sources into ddx/ddy
26827ec681f3Smrg- intel/fs: Use a single untyped surface read for load_num_work_groups
26837ec681f3Smrg- intel/nir: Lower load_num_work_groups to 32-bit if needed
26847ec681f3Smrg- iris: Re-emit push constants if we have a varying workgroup size
26857ec681f3Smrg- intel/compiler: Handle all indirect lowering choices in brw_nir.c
26867ec681f3Smrg- nir/lower_indirect_derefs: Add a threshold
26877ec681f3Smrg- intel/nir: Stop using nir_lower_vars_to_scratch
26887ec681f3Smrg- nir: Don't bail too early in lower_mem_constant_vars
26897ec681f3Smrg- clover: Call nir_lower_mem_constant_vars
26907ec681f3Smrg- compiler/types: Make booleans 32-bit for cl_size/align
26917ec681f3Smrg- nir/glsl: Add an explicit_alignment field to glsl_type
26927ec681f3Smrg- nir: Add alignment information to cast derefs
26937ec681f3Smrg- nir: Handle all array stride cases in nir_deref_instr_array_stride
26947ec681f3Smrg- nir: Add a helper for getting the alignment of a deref
26957ec681f3Smrg- nir/lower_io: Apply alignments from derefs when available
26967ec681f3Smrg- nir/opt_deref: Don't remove casts with alignment information
26977ec681f3Smrg- nir/opt_deref: Remove restrictive alignment information from casts
26987ec681f3Smrg- spirv: Add pointer helper vars to OpCopyMemory
26997ec681f3Smrg- spirv: Propagate alignments to deref chains via casts
27007ec681f3Smrg- nir: Allow var_mem_global in nir_lower_vars_to_explicit_types
27017ec681f3Smrg- nir: Allow uniform in nir_lower_vars_to_explicit_types
27027ec681f3Smrg- clover: Use args.size() to compute new var locations
27037ec681f3Smrg- spirv: Stop counting inputs in entry_point_wrapper
27047ec681f3Smrg- clover/nir: Use lower_vars_to_explicit for uniform and global
27057ec681f3Smrg- spirv: Drop the OpenCL type layout code
27067ec681f3Smrg- anv: Set alignments on UBO/SSBO root derefs
27077ec681f3Smrg- compiler/types: Fix deserializing structs with \>= 15 members
27087ec681f3Smrg- spirv: Improve the "Entry point not found" error message
27097ec681f3Smrg- spirv2nir: Rework argument handling
27107ec681f3Smrg- nir/lower_io: Fix the unknown-array-index case in get_deref_align
27117ec681f3Smrg- nir: Add a dominance validation pass
27127ec681f3Smrg- spirv: Run repair_ssa if there are discard instructions
27137ec681f3Smrg- intel/nir: Call validate_ssa_dominance at both ends of the NIR compile
27147ec681f3Smrg- nir: More NIR_MAX_VEC_COMPONENTS fixes
27157ec681f3Smrg- nir/idiv_const: Use the modern nir_src_as\_\* constant helpers
27167ec681f3Smrg- anv: Fix the target_bo assertion in anv_reloc_list_add
27177ec681f3Smrg- clover: Pull the stride from pipe_transfer for image maps
27187ec681f3Smrg- spirv: Access qualifiers are not a bitfield
27197ec681f3Smrg- spirv: Plumb access qualifiers through from image types
27207ec681f3Smrg- nir: Add a pass for lowering CL-style image ops to texture ops
27217ec681f3Smrg- intel/fs/swsb: SCHEDULING_FENCE only emits SYNC_NOP
27227ec681f3Smrg- nir: Rename get_buffer_size to get_ssbo_size
27237ec681f3Smrg- radeonsi: Only call nir_lower_var_copies at the end of the opt loop
27247ec681f3Smrg- spirv: vtn_fail with a nice message on unsupported rounding modes
27257ec681f3Smrg- nir/liveness: Consider if uses in nir_ssa_defs_interfere
27267ec681f3Smrg- compiler/types: Add glsl_baseN_t_type(bit_size) helpers
27277ec681f3Smrg- spirv: Use the new types helpers
27287ec681f3Smrg- nir: Add a new memcpy intrinsic
27297ec681f3Smrg- nir: Add a lowering pass to lower memcpy
27307ec681f3Smrg- spirv: Add support for OpCopyMemorySized
27317ec681f3Smrg- clover/nir: Call the memcpy lowering pass
27327ec681f3Smrg- nir: Allow creating variables with nir_var_mem_push_const.
27337ec681f3Smrg- nir/lower_io: Add support for push constants
27347ec681f3Smrg- anv,radv,tu,val: Call nir_lower_io for push constants
27357ec681f3Smrg- spirv: Use derefs for push constants
27367ec681f3Smrg- vallium: Stop using lower_ubo_ssbo_access_to_offsets
27377ec681f3Smrg- spirv: Delete the legacy offset/index UBO/SSBO lowering
27387ec681f3Smrg- nir/copy_propagate: Copy-prop into jump conditions
27397ec681f3Smrg- nir: Disallow goto and goto_if in clone and \[de]serialize
27407ec681f3Smrg- nir/cf: Better handle intra-block splits
27417ec681f3Smrg- nir/validate: Improve the validation of blocks
27427ec681f3Smrg- nir/lower_goto_ifs: Don't destroy SSA form in the process
27437ec681f3Smrg- nir/dominance: Use \_mesa_set_clear instead ofhand-rolling it
27447ec681f3Smrg- spirv: Only run repair_ssa if structured
27457ec681f3Smrg- nir/lower_goto_ifs: Use rzalloc
27467ec681f3Smrg- nir/lower_goto_ifs: Add asserts for SSA forks
27477ec681f3Smrg- nir/lower_goto_ifs: Always include level dom_frontiers in prev_frontier
27487ec681f3Smrg- Revert "nir/lower_goto_if: Add a route::outside set"
27497ec681f3Smrg- anv: Allow HiZ clears for multi-view
27507ec681f3Smrg- anv: Use more temp vars in cmd_buffer_begin_subpass
27517ec681f3Smrg- anv: Skip HiZ and CCS ambiguates which preceed fast-clears
27527ec681f3Smrg- nir: Split NIR_INTRINSIC_TYPE into separate src/dest indices
27537ec681f3Smrg- nir: Add a conversion and rounding intrinsic
27547ec681f3Smrg- nir: Add builder helpers for OpenCL type conversions
27557ec681f3Smrg- nir: Add a passes for nir_intrinsic_convert_alu_types
27567ec681f3Smrg- spirv: Add some conversion handling helpers
27577ec681f3Smrg- spirv: Handle all OpenCL conversion ops with full rounding
27587ec681f3Smrg- spirv/opencl: Drop dest_type from handle_v_load_store
27597ec681f3Smrg- clover/nir: Call nir_lower_convert_alu_types
27607ec681f3Smrg- nir: Add lowering from regular ALU conversions to the intrinsic
27617ec681f3Smrg- intel/fs: NoMask initialize the address register for shuffles
27627ec681f3Smrg- nir: Fix a misspelling
27637ec681f3Smrg- nir/find_array_copies: Properly discard copies for casts
27647ec681f3Smrg- nir: Handle memcpy in copy_prop_vars and combine_stores
27657ec681f3Smrg- nir: Add a memcpy optimization pass
27667ec681f3Smrg- nir/opt_load_store_vectorize: Use bit sizes when checking mask compatibility
27677ec681f3Smrg- nir: Add component mask re-interpret helpers
27687ec681f3Smrg- nir/opt_deref: Add an instruction type switch
27697ec681f3Smrg- nir/opt_deref: Add an optimization for bitcasts
27707ec681f3Smrg- nir: Add a pass to lower vec3s to vec4s
27717ec681f3Smrg- intel/fs: Don't use NoDDClk/NoDDClr for split SHUFFLEs
27727ec681f3Smrg- iris: Fix the constant data address calculation
27737ec681f3Smrg- anv: Implement VK_EXT_transform_feedback on Gen7
27747ec681f3Smrg- spirv: Make the clc_shader const
27757ec681f3Smrg- nir/constant_folding: Use the builder
27767ec681f3Smrg- nir/constant_folding: Use nir_shader_instruction_pass
27777ec681f3Smrg- nir: Validate constant initializers
27787ec681f3Smrg- nir/constant_folding: Fold load_deref of nir_var_mem_constant
27797ec681f3Smrg- iris: Add pipe-loader support
27807ec681f3Smrg- iris: Handle runtime-specified local memory size
27817ec681f3Smrg- iris: Add support for load_work_dim as a system value
27827ec681f3Smrg- iris: Fill out compute caps and enable clover support
27837ec681f3Smrg- gallium/pipe: Add a GALLIUM_PIPE_SEARCH_DIR override env var
27847ec681f3Smrg- util/xxd.py: Add an option for binary files
27857ec681f3Smrg- spirv: Add a shared libclc loader
27867ec681f3Smrg- spirv: Move nir_lower_libclc to src/compiler/spirv
27877ec681f3Smrg- intel/nir: Don't try to emit vector load_scratch instructions
27887ec681f3Smrg- intel/nir: Lower load_global_constant in lower_mem_access_bit_sizes
27897ec681f3Smrg- i965: Take an isl_format in emit_buffer_surface_state
27907ec681f3Smrg- intel/fs: Add an alignment to VARYING_PULL_CONSTANT_LOAD_LOGICAL
27917ec681f3Smrg- intel/fs: Add an option to use dataport messages for UBOs
27927ec681f3Smrg- anv: Add a device parameter to format_for_descriptor_type
27937ec681f3Smrg- anv: Use format_for_descriptor_type for descriptor buffers
27947ec681f3Smrg- anv: Plumb the device into \*bits_for_access_flags
27957ec681f3Smrg- anv: Use the data cache for indirect UBO pulls on Gen8+
27967ec681f3Smrg- iris: Use the data cache for indirect UBO pulls
27977ec681f3Smrg- clover: Stop leaking NIR shaders
27987ec681f3Smrg- nir/opt_deref: Fix the vector bitcast optimization
27997ec681f3Smrg- nir: Allow more deref modes in phis
28007ec681f3Smrg- intel/batch_decoder: Don't clame vec4 vs/gs/tcs shaders on Gen11+
28017ec681f3Smrg- intel/fs: Copy the PTSS from g0 for scratch reads/writes
28027ec681f3Smrg- intel/fs: Add a SCRATCH_HEADER opcode
28037ec681f3Smrg- intel/fs/ra: Increment spill_offset as part of the emit_spill loop
28047ec681f3Smrg- intel/fs/ra: Refactor handling of Gen7 scratch reads
28057ec681f3Smrg- intel/fs/ra: Store the last non-spill VGRF node
28067ec681f3Smrg- intel/fs/ra: Sanity-check our IP counts
28077ec681f3Smrg- intel/fs/ra: Use a set to track added spill/fill instructions
28087ec681f3Smrg- intel/fs: Rework scratch handling on Gen9+
28097ec681f3Smrg- intel/fs: Allow constant-propagation into SAMPLEINFO and IMAGE_SIZE
28107ec681f3Smrg- anv: Go back to using the sampler for UBO pulls
28117ec681f3Smrg- Revert "iris: Use the data cache for indirect UBO pulls"
28127ec681f3Smrg- anv: Bump the number of update-after-bind descriptors to 1M
28137ec681f3Smrg- anv: Add a descriptor_count to descriptor sets
28147ec681f3Smrg- anv: Implement VariableDescriptorCount
28157ec681f3Smrg- iris: Flush caches based on brw_compiler::indirect_ubos_use_sampler
28167ec681f3Smrg- anv,iris: Use the data cache for UBO pulls on Gen12+
28177ec681f3Smrg- spirv: Add 0.5 to integer coordinates for OpImageSampleExplicitLod
28187ec681f3Smrg- nir/lower_io: Assert non-zero power-of-two alignments
28197ec681f3Smrg- compiler/types: Assert non-zero alignments in get_explicit_type_for_size_align
28207ec681f3Smrg- compiler/types: Allow images and samplers in get_explicit_type_for_size_align
28217ec681f3Smrg- clover/nir: Calculate sizes of images and samplers properly
28227ec681f3Smrg- clover/nir: Add an image lowering pass
28237ec681f3Smrg- spirv: Fix OpCopyMemorySized
28247ec681f3Smrg- nir/lower_memcpy: Don't mask the store
28257ec681f3Smrg- docs: Specify when branch points happen
28267ec681f3Smrg- nir/validate: Explain why we don't use nir_foreach_block
28277ec681f3Smrg- mesa/spirv: Lower variable initializers for global variables
28287ec681f3Smrg- nir/builder: Add a nir_ieq_imm helper
28297ec681f3Smrg- nir/phis_to_scalar: Use a deny-list for load_deref modes
28307ec681f3Smrg- nir: Handle incomplete derefs in split_struct_vars
28317ec681f3Smrg- nir: Use var->data.mode instead of deref->mode in a few cases
28327ec681f3Smrg- nir: Disallow writes to system values and mem_constant
28337ec681f3Smrg- nir/opt_find_array_copies: Allow copies from mem_constant
28347ec681f3Smrg- nir: Add and use some deref mode helpers
28357ec681f3Smrg- nir/lower_array_deref_of_vec: Use nir_deref_mode_must_be
28367ec681f3Smrg- nir/lower_io: Use nir_deref_mode\_\* helpers
28377ec681f3Smrg- nir/phis_to_scalar,gcm: Use nir_deref_mode_may_be
28387ec681f3Smrg- nir: Only force loop unrolling if we know it's a in/out/temp
28397ec681f3Smrg- nir/vars_to_ssa: Use nir_deref_must_be
28407ec681f3Smrg- nir/vec3_to_vec4: Use nir_deref_must_be
28417ec681f3Smrg- nir: Use nir_deref_mode_may_be in deref optimizations
28427ec681f3Smrg- nir/find_array_copies: Prepare for generic pointers
28437ec681f3Smrg- nir/split_*_vars: Prepare for generic pointers
28447ec681f3Smrg- nir: Make nir_deref_instr::mode a bitfield
28457ec681f3Smrg- nir: Add support for generic pointers
28467ec681f3Smrg- spirv: Add generic pointer support
28477ec681f3Smrg- nir/opt_deref: Add a deref mode specialization optimization
28487ec681f3Smrg- nir/opt_deref: Add an optimization for deref_mode_is
28497ec681f3Smrg- nir/lower_io: Add a mode parameter to build_addr_iadd
28507ec681f3Smrg- nir/lower_io: Add a mode parameter to addr_format_is\_\*
28517ec681f3Smrg- nir/lower_io: Add support for 32/64bit_global for shared
28527ec681f3Smrg- nir/lower_io: Add support for lowering deref_mode_is
28537ec681f3Smrg- nir/lower_io: Support generic pointer access
28547ec681f3Smrg- nir/lower_io: Add a new 62bit_generic address format
28557ec681f3Smrg- nir/opt_intrinsics: Report progress for the gl_SampleMask optimization
28567ec681f3Smrg- nir/constant_folding: Use a switch in try_fold_intrinsic
28577ec681f3Smrg- nir/constant_folding: Use the standard variable naming convention
28587ec681f3Smrg- nir: Move constant folding of vote to opt_constant_folding
28597ec681f3Smrg- nir/constant_folding: Fold subgroup shuffle intrinsics
28607ec681f3Smrg- nir/opt_intrinsics: Refactor a bit
28617ec681f3Smrg- nir/opt_intrinsic: Optimize bcsel(b, shuffle(x, i), shuffle(x, j))
28627ec681f3Smrg- nir/find_array_copies: Don't assume all children exist
28637ec681f3Smrg- nir/deref: Fix a typo
28647ec681f3Smrg- spirv: Add basic plumbing for ray-tracing capabilities
28657ec681f3Smrg- spirv: Remove a redundant vtn_fail_if
28667ec681f3Smrg- spirv: Add a guard for OpTypeForwardPointer storage classes
28677ec681f3Smrg- spirv: Pass the deref type to storage_class_to_mode for non-forward pointers
28687ec681f3Smrg- spirv: Add support for OpTypeAccelerationStructureKHR
28697ec681f3Smrg- spirv,nir: Add support for ray-tracing built-ins
28707ec681f3Smrg- nir/builder: Add a select_from_ssa_def_array helper
28717ec681f3Smrg- nir: Add intrinsics for object to/from world RT sysvals
28727ec681f3Smrg- nir: Add new variable modes for ray-tracing
28737ec681f3Smrg- spirv: Implement the new ray-tracing storage classes
28747ec681f3Smrg- nir,spirv: Add support for the ShaderCallKHR scope
28757ec681f3Smrg- spirv,nir: Add ray-tracing intrinsics
28767ec681f3Smrg- nir: Handle ray-tracing intrinsics and storage classes in copy-prop etc.
28777ec681f3Smrg- spirv: Update headers and metadata from latest Khronos commit
28787ec681f3Smrg- nir: Print formats on image intrinsics as text
28797ec681f3Smrg- nir: Validate image atomic formats
28807ec681f3Smrg- util,gallium: Add new 64-bit integer formats
28817ec681f3Smrg- compiler/types: Add 64-bit image types
28827ec681f3Smrg- nir: Allow 64-bit image atomics
28837ec681f3Smrg- spirv: Add support for SPV_EXT_shader_image_atomic_int64
28847ec681f3Smrg- nir/lower_bit_size: Don't cast comparison results
28857ec681f3Smrg- nir/lower_bit_size: Pass a nir_instr to the callback
28867ec681f3Smrg- nir/lower_bit_size: Add support for lowering subgroup ops
28877ec681f3Smrg- intel/nir: Refactor lower_bit_size_callback
28887ec681f3Smrg- intel/nir: Lower 8-bit scan/reduce ops to 16-bit
28897ec681f3Smrg- intel/nir: Lower 8-bit ops to 16-bit in NIR on Gen11+
28907ec681f3Smrg- intel/fs: Fix use of undefined value in fixup_nomask_control_flow
28917ec681f3Smrg- spirv: Call repair SSA for OpTerminateInvocation
28927ec681f3Smrg
28937ec681f3SmrgJesse Natalie (61):
28947ec681f3Smrg
28957ec681f3Smrg- nir: nir_range_analysis needs to be updated for vec16
28967ec681f3Smrg- u_debug_stack_test: Fix MSVC compiling by using ATTRIBUTE_NOINLINE
28977ec681f3Smrg- util/macros: Add ATTRIBUTE_NOINLINE definition for MSVC
28987ec681f3Smrg- glsl: Add 'bare' shadow sampler type
28997ec681f3Smrg- nir: Fix serialize/deserialize of void samplers/images
29007ec681f3Smrg- nir: Optimize mask+downcast to just downcast
29017ec681f3Smrg- nir: Add nir_address_format_32bit_offset_as_64bit
29027ec681f3Smrg- nir: Add nir_address_format_32bit_index_offset_pack64
29037ec681f3Smrg- nir/vtn: CL SPIR-V callers should specify address modes
29047ec681f3Smrg- mesa: Move ATTRIBUTE_NOINLINE for glsl_to_tgsi_visitor::visit_expression for MSVC
29057ec681f3Smrg- nir: Add fisnormal op
29067ec681f3Smrg- nir/vtn: Support SpvOpIsNormal via fisnormal
29077ec681f3Smrg- nir: Add fisfinite op
29087ec681f3Smrg- nir/vtn: Support SpvOpIsFinite via fisfinite
29097ec681f3Smrg- nir/vtn: Handle LessOrGreater deprecated opcode
29107ec681f3Smrg- nir/vtn: Support OpOrdered and OpUnordered opcodes
29117ec681f3Smrg- nir/glsl: Add glsl_get_cl_type_size_align helper
29127ec681f3Smrg- nir: Use 'unsigned' instead of enum types in nir_variable::data
29137ec681f3Smrg- wgl: Switch to Win10 version defines to enable usage of Win10 WGL callbacks
29147ec681f3Smrg- nir: Populate some places where existing system values were missing
29157ec681f3Smrg- nir: Add new system values and intrinsics for dealing with CL work offsets
29167ec681f3Smrg- nir: Move compute system value lowering to a separate pass
29177ec681f3Smrg- nir: Add options to nir_lower_compute_system_values to control compute ID base lowering
29187ec681f3Smrg- spirv: Use new global invocation offset system value
29197ec681f3Smrg- nir: Add a lowering pass to split 64bit phis
29207ec681f3Smrg- nir: Relax opt_if logic to prevent re-merging 64bit phis for loop headers
29217ec681f3Smrg- nir_lower_bit_size: Support lowering ops with differing source/dest sizes
29227ec681f3Smrg- nir: Implement mul_high lowering for bit sizes other than 32
29237ec681f3Smrg- nir: Remove 32bit restriction for uadd_carry optimization
29247ec681f3Smrg- nir: Add bit_count to lower_int64 pass
29257ec681f3Smrg- nir/vtn: SPIR-V bit count opcodes (core and extension) dest size mismatches nir
29267ec681f3Smrg- clover/nir/spirv: Use uniform rather than shader_in for kernel inputs
29277ec681f3Smrg- nir/vtn: Add type constant to image intrinsics
29287ec681f3Smrg- nir/vtn: Add support for kernel images to SPIRV-to-NIR.
29297ec681f3Smrg- nir/vtn: Use return type rather than image type for tex ops
29307ec681f3Smrg- nir/vtn: Handle integer sampling coordinates
29317ec681f3Smrg- nir/vtn: ImageSizeLod op can be applied to images
29327ec681f3Smrg- nir/vtn: Add intrinsics for CL image format/order queries
29337ec681f3Smrg- nir/vtn: Convert constant samplers to variables with data
29347ec681f3Smrg- nir_dominance: Use uint32_t instead of int16_t for dominance counters
29357ec681f3Smrg- nir: More NIR_MAX_VEC_COMPONENTS fixes
29367ec681f3Smrg- spirv: Handle OpTypeOpaque
29377ec681f3Smrg- glsl_type: Add packed to structure type comparison for hash map
29387ec681f3Smrg- nir_lower_system_values: Fix load_global_invocation_id to use base_work_group_id even with no base_global id
29397ec681f3Smrg- nir: Add an internal flag to shader_info
29407ec681f3Smrg- nir: Add glsl_base_type unsigned -\> signed version helper
29417ec681f3Smrg- nir/vtn: Add handling for SPIR-V event variables
29427ec681f3Smrg- vtn/opencl: Rework type handling for CL extension opcodes
29437ec681f3Smrg- vtn/opencl: Add infrastructure for calling out to libclc
29447ec681f3Smrg- vtn/opencl: Implement a lot of opcodes via libclc
29457ec681f3Smrg- vtn/opencl: Rework handle_instr to be able to handle core SPIR-V opcodes via libclc
29467ec681f3Smrg- vtn/opencl: Hook up OpenCL async copy and group wait opcodes via libclc
29477ec681f3Smrg- vtn/opencl: Switch non-native trig to use libclc
29487ec681f3Smrg- vtn/opencl: Switch exp/pow/log to use libclc
29497ec681f3Smrg- vtn/opencl: Switch division-related ops to use libclc
29507ec681f3Smrg- vtn/opencl: Switch some nir-sequence ops to use libclc
29517ec681f3Smrg- vtn/opencl: Only use libclc ldexp when lower_ldexp is set
29527ec681f3Smrg- vtn/opencl: Switch fma to conditionally use libclc for 32bit floats
29537ec681f3Smrg- spirv: Implement vload[a]_half[n\] and vstore[a]_half[n][_r\]
29547ec681f3Smrg- util: Move xxd.py to util
29557ec681f3Smrg- util: Make xxd.py output char array instead of string
29567ec681f3Smrg
29577ec681f3SmrgJohn Bates (1):
29587ec681f3Smrg
29597ec681f3Smrg- disk_cache: build option for disabled-by-default
29607ec681f3Smrg
29617ec681f3SmrgJonathan Gray (13):
29627ec681f3Smrg
29637ec681f3Smrg- util: unbreak endian detection on OpenBSD
29647ec681f3Smrg- util/anon_file: add OpenBSD shm_mkstemp() path
29657ec681f3Smrg- meson: build with \_ISOC11_SOURCE on OpenBSD
29667ec681f3Smrg- meson: don't build with USE_ELF_TLS on OpenBSD
29677ec681f3Smrg- meson: conditionally include -ldl in gbm pkg-config file
29687ec681f3Smrg- util: futex fixes for OpenBSD
29697ec681f3Smrg- util/u_thread: include pthread_np.h if found
29707ec681f3Smrg- anv: use os_get_total_physical_memory()
29717ec681f3Smrg- util/os_misc: add os_get_available_system_memory()
29727ec681f3Smrg- anv: use os_get_available_system_memory()
29737ec681f3Smrg- util/os_misc: os_get_available_system_memory() for OpenBSD
29747ec681f3Smrg- radv: remove seccomp includes
29757ec681f3Smrg- vulkan: make VK_TIME_DOMAIN_CLOCK_MONOTONIC_RAW_EXT conditional
29767ec681f3Smrg
29777ec681f3SmrgJonathan Marek (57):
29787ec681f3Smrg
29797ec681f3Smrg- panfrost: add missing dependency on midgard_pack.h
29807ec681f3Smrg- util/format: expose generated format packing functions through a header
29817ec681f3Smrg- turnip: implement VK_EXT_custom_border_color
29827ec681f3Smrg- turnip: remove dead tu_minify/typed_memcpy functions
29837ec681f3Smrg- turnip: delete a blit_image TODO that has already been resolved
29847ec681f3Smrg- turnip: fix CmdBlitImage with D32_SFLOAT_S8_UINT
29857ec681f3Smrg- turnip: rework format_to_ifmt
29867ec681f3Smrg- turnip: call packing functions directly for pack_gmem_clear_value
29877ec681f3Smrg- turnip: add missing tu_bo_list_add in CmdWriteTimestamp
29887ec681f3Smrg- freedreno/ir3: remove indirect input load
29897ec681f3Smrg- freedreno/ir3: improve handling of aliased inputs
29907ec681f3Smrg- freedreno/ir3: rework setup_{input,output} to make struct varyings work
29917ec681f3Smrg- freedreno/regs: add 7nm DSI PHY/PLL regs
29927ec681f3Smrg- turnip: delete tu_physical_device path field
29937ec681f3Smrg- turnip: delete unused tu_image fields
29947ec681f3Smrg- turnip: fix the type of tu_shader_module code field, delete unused sha1
29957ec681f3Smrg- turnip: delete unused "tu_cmd_buffer_upload"
29967ec681f3Smrg- turnip: remove some unnecessary regs init
29977ec681f3Smrg- turnip: rework vertex buffers draw state handling
29987ec681f3Smrg- turnip: device global bo list
29997ec681f3Smrg- turnip: avoid heap allocations in QueueSubmit when semaphores are used
30007ec681f3Smrg- freedreno/ir3: allow layer/viewport output for VS/GS/DS
30017ec681f3Smrg- freedreno/ir3: add view_zero to shader key
30027ec681f3Smrg- turnip: multiViewport and VK_EXT_shader_viewport_index_layer
30037ec681f3Smrg- vulkan/wsi/display: add option for display fence to signal syncobj
30047ec681f3Smrg- turnip: delete unused tu_fence_signal function
30057ec681f3Smrg- turnip: add a fd field to tu_device
30067ec681f3Smrg- turnip: require syncobj support
30077ec681f3Smrg- turnip: rework fences to use syncobjs
30087ec681f3Smrg- radv: fix incorrect ResetFences path for WSI fence
30097ec681f3Smrg- radv: use syncobj for wsi fence
30107ec681f3Smrg- turnip: fix wrong indentation in tu6_draw_common
30117ec681f3Smrg- turnip: move A6XX_RB_ALPHA_CONTROL write to init_hw
30127ec681f3Smrg- turnip: implement VK_EXT_extended_dynamic_state
30137ec681f3Smrg- turnip: remove unused cmd_buffer/device arguments in descriptor sets
30147ec681f3Smrg- turnip: delete unused/broken pipeline layout hashing code
30157ec681f3Smrg- turnip: initial implementation of VK_KHR_push_descriptor
30167ec681f3Smrg- turnip: clean up tu_device_memory
30177ec681f3Smrg- turnip: always create permanent syncobj for semaphore
30187ec681f3Smrg- turnip: set MSM_SUBMIT_SYNCOBJ_RESET for submit pWaitSemaphores
30197ec681f3Smrg- turnip: semaphores simplification (only syncobj semaphores supported)
30207ec681f3Smrg- turnip: rework GetSemaphoreFdKHR
30217ec681f3Smrg- turnip: rework ImportSemaphoreFdKHR
30227ec681f3Smrg- turnip: remove remaining uses of drmSyncobj helpers
30237ec681f3Smrg- turnip: share code between semaphores/fences + fence import/export
30247ec681f3Smrg- turnip: signal fence and semaphore in AcquireNextImage2KHR
30257ec681f3Smrg- turnip: implement legacy API functions separately
30267ec681f3Smrg- freedreno/cffdec: fix decoding of bindless descriptors
30277ec681f3Smrg- turnip: remove pre-emption marker
30287ec681f3Smrg- turnip: implement timestamp fences/semaphores for kgsl backend
30297ec681f3Smrg- turnip: rework android gralloc path so it doesn't call tu_image_create
30307ec681f3Smrg- turnip: don't implement CreateImage as two separate functions
30317ec681f3Smrg- turnip: LAYOUT_PREINITIALIZED is not different for optimal tiling
30327ec681f3Smrg- turnip: remove useless tu_image asserts
30337ec681f3Smrg- turnip: remove unnecessary/redundant tu_image fields
30347ec681f3Smrg- turnip: don't always fallback to linear for mutable formats
30357ec681f3Smrg- turnip: enable VK_EXT_image_drm_format_modifier
30367ec681f3Smrg
30377ec681f3SmrgJordan Justen (4):
30387ec681f3Smrg
30397ec681f3Smrg- anv, iris: Set MediaSamplerDOPClockGateEnable for gen12+
30407ec681f3Smrg- anv: Drop warning about gen12 not being supported
30417ec681f3Smrg- intel/dev: Add device info for ADL-S
30427ec681f3Smrg- intel/mi_builder: Support gen11 command-streamer based register offsets
30437ec681f3Smrg
30447ec681f3SmrgJose Maria Casanova Crespo (7):
30457ec681f3Smrg
30467ec681f3Smrg- vc4: Avoid negative scissor caused by no intersection
30477ec681f3Smrg- nir/algebraic: optimize iand/ior of (n)eq zero when umax/umin not available
30487ec681f3Smrg- vc4: Enable lower_umax and lower_umin
30497ec681f3Smrg- vc4: enable lower_isign for VC4
30507ec681f3Smrg- vc4: Add missing load_ubo set_align in yuv_blit fs.
30517ec681f3Smrg- vc4: Add missing range_base/range at nir_load_ubos in yuv_blit fs.
30527ec681f3Smrg- vc4: Enable nir_lower_io for uniforms
30537ec681f3Smrg
30547ec681f3SmrgJoshua Ashton (1):
30557ec681f3Smrg
30567ec681f3Smrg- zink: Fix 32-bit compilation
30577ec681f3Smrg
30587ec681f3SmrgJuan A. Suarez Romero (7):
30597ec681f3Smrg
30607ec681f3Smrg- intel: split driver/device UUID generators
30617ec681f3Smrg- iris: plumb device/driver UUID generators
30627ec681f3Smrg- intel/uuid: use git-sha1/package for the driver UUID
30637ec681f3Smrg- st/mesa: initialize lower alpha func to ALWAYS
30647ec681f3Smrg- v3d/compiler: extend swapping R/B support to all vertex attributes
30657ec681f3Smrg- v3dv: mark the right bit to swap R/B vertex attributes
30667ec681f3Smrg- v3d: Add GL_ARB_vertex_array_bgra support
30677ec681f3Smrg
30687ec681f3SmrgJulian Winkler (1):
30697ec681f3Smrg
30707ec681f3Smrg- nir: Add a structurizer
30717ec681f3Smrg
30727ec681f3SmrgKarol Herbst (48):
30737ec681f3Smrg
30747ec681f3Smrg- util/set: add \_mesa_set_intersects
30757ec681f3Smrg- spirv: rename vtn_emit_cf_list to vtn_emit_cf_list_structured
30767ec681f3Smrg- nir: Add a structured flag to nir_shader
30777ec681f3Smrg- nir: Add goto_if jump instruction
30787ec681f3Smrg- spirv: extract switch parsing into its own function
30797ec681f3Smrg- spirv: parse unstructured CFG
30807ec681f3Smrg- clover/nir: fix mem_shared by using address_format_32bit_offset
30817ec681f3Smrg- nv50/ir/nir: fix smem size
30827ec681f3Smrg- nv50/ir/nir: rework indirect function_temp handling
30837ec681f3Smrg- clover/nir: Call vars_to_explicit_types for shared memory
30847ec681f3Smrg- nve4: fix uploading unaligned sized input buffers
30857ec681f3Smrg- nv50/ir/nir: assert on unknown alu ops
30867ec681f3Smrg- clover/nir: support int64 atomics if the device supports it
30877ec681f3Smrg- nv50/ir/nir: fix global_atomic_comp_swap
30887ec681f3Smrg- nvc0: handle nr being 0 in nvc0_set_global_bindings
30897ec681f3Smrg- nv50/ir/nir: support load_work_dim
30907ec681f3Smrg- clover/spirv: rework handling of spirv extensions
30917ec681f3Smrg- clover/spirv: pass list of supported extensions to the translator
30927ec681f3Smrg- nir: rename nir_op_fne to nir_op_fneu
30937ec681f3Smrg- nir: fix nir_variable_create for kernels
30947ec681f3Smrg- clover/nir: add support for global invocation id offsets
30957ec681f3Smrg- nv50/ir: remove symbol table support for compute shaders
30967ec681f3Smrg- nv50/ir: add nv50_ir_prog_info_out
30977ec681f3Smrg- nir: use nir_var_all to get rid of casting
30987ec681f3Smrg- util: add helpers to define bitwise operators on enums for C++
30997ec681f3Smrg- nir: use enum operator helper for nir_variable_mode and nir_metadata
31007ec681f3Smrg- clover/nir: Lower function_temp to scratch.
31017ec681f3Smrg- nv50/ir: fix cas lowering for 64 bit
31027ec681f3Smrg- clover/nir: use offset for temp memory
31037ec681f3Smrg- clover/llvm: undefine \__IMAGE_SUPPORT_\_ for devices without image support
31047ec681f3Smrg- nvc0/ir: fix load propagation for sub 4 byte addressing
31057ec681f3Smrg- spirv: fix 64 bit atomic inc and dec
31067ec681f3Smrg- nvc0/cl: hande 64 bit pointers in nvc0_set_global_handle
31077ec681f3Smrg- clover/spirv: fix vec3 alignment
31087ec681f3Smrg- nir/serialize: fix serialization of system values
31097ec681f3Smrg- clover/util: add id_type_equals to support symbols with multiple sections
31107ec681f3Smrg- clover: bind constant buffer if one is provided
31117ec681f3Smrg- clover/nir: extract constant buffer into its own section
31127ec681f3Smrg- clover/spirv: parse arg_info
31137ec681f3Smrg- clover/spirv: support CL_KERNEL_COMPILE_WORK_GROUP_SIZE
31147ec681f3Smrg- clover: use pipe_image_view for images instead of set_compute_resources
31157ec681f3Smrg- clover: support custom driver strides
31167ec681f3Smrg- clover/device: use PIPE_MAX_SHADER_SAMPLER_VIEWS for max_images_read
31177ec681f3Smrg- clover/nir: set kernel_image cap
31187ec681f3Smrg- nouveau: hide SVM support behing a variable for now as kernel space is broken
31197ec681f3Smrg- nvc0/CL: enable images
31207ec681f3Smrg- llvmpipe: enable CL images
31217ec681f3Smrg- nv50/ir/nir: don't use designated initializers
31227ec681f3Smrg
31237ec681f3SmrgKenneth Graunke (15):
31247ec681f3Smrg
31257ec681f3Smrg- iris: Fix headerless sampler messages in compute shaders with preemption
31267ec681f3Smrg- nir: Copy semantics to nir_intrinsic_load_fs_input_interp_deltas
31277ec681f3Smrg- nir: Move new edgeflag assert into the io_lowered case
31287ec681f3Smrg- iris: Reorder the loops in iris_fence_await() for clarity.
31297ec681f3Smrg- iris: Drop stale syncobj references in fence_server_sync
31307ec681f3Smrg- Revert "nir: replace lower_ffma and fuse_ffma with has_ffma"
31317ec681f3Smrg- intel/compiler, anv: Delete cs_prog_data->slm_size
31327ec681f3Smrg- iris: Fix doubling of shared local memory (SLM) sizes.
31337ec681f3Smrg- anv: Set only one ISL usage bit (RT/texture) for CopyBuffer sources
31347ec681f3Smrg- isl, anv, iris: Add a centralized helper to select MOCS based on usage
31357ec681f3Smrg- isl: Enable Tigerlake HDC:L1 caches via MOCS in various cases.
31367ec681f3Smrg- iris: fix source/destination layers for 3D blits
31377ec681f3Smrg- iris: Move blit scissoring earlier.
31387ec681f3Smrg- intel/fs: Fix sampler message headers on Gen11+ when using scratch
31397ec681f3Smrg- nir/algebraic: Avoid creating new fp64 ops when using softfp64
31407ec681f3Smrg
31417ec681f3SmrgKhem Raj (1):
31427ec681f3Smrg
31437ec681f3Smrg- vc4: use intmax_t for formatted output of timespec members
31447ec681f3Smrg
31457ec681f3SmrgKristian Høgsberg (12):
31467ec681f3Smrg
31477ec681f3Smrg- egl/android: Call createImageFromDmaBufs directly
31487ec681f3Smrg- egl/android: Look up prime fds in droid_create_image_from_prime_fds()
31497ec681f3Smrg- egl/android: Drop unused ctx argument
31507ec681f3Smrg- egl/android: Simplify droid_create_image_from_name() path
31517ec681f3Smrg- egl/android: Move droid_create_image_from_prime_fds() function up
31527ec681f3Smrg- egl/android: Use droid_create_image_from_prime_fds() in get_back_bo()
31537ec681f3Smrg- egl/android: Add support for CrOS buffer info perform op
31547ec681f3Smrg- turnip: Add kgsl backend
31557ec681f3Smrg- util/formats: Add PIPE_FORMAT_R8_G8B8_420_UNORM
31567ec681f3Smrg- st/mesa: Add NV12 lowering to PIPE_FORMAT_R8_G8B8_420_UNORM
31577ec681f3Smrg- freedreno/a6xx: Generalize pointers in struct fd6_pipe_sampler_view
31587ec681f3Smrg- freedreno/a6xx: Support PIPE_FORMAT_R8_G8B8_420_UNORM for texturing
31597ec681f3Smrg
31607ec681f3SmrgKrunal Patel (2):
31617ec681f3Smrg
31627ec681f3Smrg- gallium/auxiliary/vl: Odd Dimensions are failing
31637ec681f3Smrg- radeon/vcn: Bitrate not updated when changing framerate
31647ec681f3Smrg
31657ec681f3SmrgLeo Liu (2):
31667ec681f3Smrg
31677ec681f3Smrg- frontends/omx/dec: Use the known codec profile when allocating buffers
31687ec681f3Smrg- frontends/omx/h265: Check the pps set before the scaling data
31697ec681f3Smrg
31707ec681f3SmrgLepton Wu (1):
31717ec681f3Smrg
31727ec681f3Smrg- util/ralloc: fix ralloc alignment.
31737ec681f3Smrg
31747ec681f3SmrgLionel Landwerlin (36):
31757ec681f3Smrg
31767ec681f3Smrg- anv: fix incorrect realloc failure handling
31777ec681f3Smrg- intel/dump_gpu: only write BOs mapped by the driver
31787ec681f3Smrg- intel/dump_gpu: further track mapping of BOs
31797ec681f3Smrg- intel/dump_gpu: set default device_override
31807ec681f3Smrg- intel/dump_gpu: add an only-capture option
31817ec681f3Smrg- intel/dump_gpu: only map in GTT buffers not previously mapped
31827ec681f3Smrg- anv: track the current frame and write it into the driver identifier BO
31837ec681f3Smrg- intel/dump_gpu: fix --platform option
31847ec681f3Smrg- intel/dump_gpu: add an option to capture a single frame
31857ec681f3Smrg- anv: centralize vk to gen arrays
31867ec681f3Smrg- anv: fix up dynamic clip emission
31877ec681f3Smrg- anv: don't fail userspace relocation with perf queries
31887ec681f3Smrg- intel/perf: store query symbol name
31897ec681f3Smrg- intel/perf: fix raw query kernel metric selection
31907ec681f3Smrg- anv: fix transform feedback surface size
31917ec681f3Smrg- anv: move push constant allocation tracking into gfx pipeline state
31927ec681f3Smrg- anv: simplify push constant emissions
31937ec681f3Smrg- anv: VK_INTEL_performance_query interaction with VK_EXT_private_data
31947ec681f3Smrg- anv: fix robust buffer access
31957ec681f3Smrg- include/drm-uapi: bump headers
31967ec681f3Smrg- anv: add new gem/drm helpers
31977ec681f3Smrg- anv: implement shareable timeline semaphores
31987ec681f3Smrg- intel/genxml: make sure test assert are compiled in
31997ec681f3Smrg- intel/compiler: fixup Gen12 workaround for array sizes
32007ec681f3Smrg- vulkan: bump headers/registry to 1.2.154
32017ec681f3Smrg- anv: implement VK_KHR_copy_commands2
32027ec681f3Smrg- intel/perf: fix crash when no perf queries are supported
32037ec681f3Smrg- intel/dev: add a small non installable tool to print device info
32047ec681f3Smrg- intel/dev: fix 32bit build issue
32057ec681f3Smrg- genxml: drop gen10
32067ec681f3Smrg- blorp: identify copy kernels in NIR
32077ec681f3Smrg- blorp: allow blits with floating point source layers
32087ec681f3Smrg- anv: fix source/destination layers for 3D blits
32097ec681f3Smrg- anv: report latest extension spec versions
32107ec681f3Smrg- intel/dev: Bump Max EU per subslice/dualsubslice
32117ec681f3Smrg- anv: fix descriptor pool leak in VMA object
32127ec681f3Smrg
32137ec681f3SmrgLouis Li (1):
32147ec681f3Smrg
32157ec681f3Smrg- radeon/radeon_vce: fix out of target bitrate in CBR mode (H.264)
32167ec681f3Smrg
32177ec681f3SmrgLouis-Francis Ratté-Boulianne (6):
32187ec681f3Smrg
32197ec681f3Smrg- st/mesa: factor ucp-lowering logic into helper
32207ec681f3Smrg- st/mesa: Enable clip planes lowering for geometry shaders
32217ec681f3Smrg- pipebuffer: Remove unused buffer event in slab bufmgr
32227ec681f3Smrg- st/mesa: Replace UsesStreams by ActiveStreamMask for GS
32237ec681f3Smrg- glsl/linker: Add support for XFB varying lowering in geometry shader
32247ec681f3Smrg- gallium: Fix NIR validation when lowering polygon stipple
32257ec681f3Smrg
32267ec681f3SmrgLucas Stach (19):
32277ec681f3Smrg
32287ec681f3Smrg- etnaviv: stop leaking the dummy texure descriptor BO
32297ec681f3Smrg- gallium/dri: allow create image for formats that only support SV or RT binding
32307ec681f3Smrg- etnaviv: drm: fix BO refcount race
32317ec681f3Smrg- etnaviv: blt: properly program surface TS offset for clears
32327ec681f3Smrg- etnaviv: update headers from rnndb
32337ec681f3Smrg- etnaviv: tex_desc: fix TS compression enable
32347ec681f3Smrg- etnaviv: cosmetic etna_resource_alloc fixes
32357ec681f3Smrg- etnaviv: do proper cpu prep/fini when clearing allocated buffer
32367ec681f3Smrg- etnaviv: simplify etna_screen_bo_from_handle
32377ec681f3Smrg- etnaviv: pass correct layout to etna_resource_alloc for scanout resources
32387ec681f3Smrg- etnaviv: don't import allocated scanout resources via from_handle
32397ec681f3Smrg- Revert "gallium/dri: fix dri2_from_planar for multiplanar images"
32407ec681f3Smrg- etnaviv: emit RA_EARLY_DEPTH on dirty ZSA
32417ec681f3Smrg- etnaviv: flush depth cache when changing depth config
32427ec681f3Smrg- etnaviv: update headers from rnndb
32437ec681f3Smrg- etnaviv: expose shader discard usage in etna_shader_variant
32447ec681f3Smrg- etnaviv: rework ZSA into a derived state
32457ec681f3Smrg- gallium: document convention for get_handle calls on multi-planar resources
32467ec681f3Smrg- etnaviv: fix disabling of INT filter for real
32477ec681f3Smrg
32487ec681f3SmrgLukas F. Hartmann (1):
32497ec681f3Smrg
32507ec681f3Smrg- etnaviv: Fix disabling early-z rejection on GC7000L (HALTI5)
32517ec681f3Smrg
32527ec681f3SmrgMarcin Ślusarz (50):
32537ec681f3Smrg
32547ec681f3Smrg- intel/perf: fix calculation of used counter space
32557ec681f3Smrg- intel/perf: fix how pipeline stats are stored
32567ec681f3Smrg- intel/perf: streamline error handling in read_oa_samples_until
32577ec681f3Smrg- intel/perf: fix performance counters availability after glFinish
32587ec681f3Smrg- intel/perf: split load_oa_metrics
32597ec681f3Smrg- intel/perf: export performance counters sorted by \[group|set\] and name
32607ec681f3Smrg- glsl: fix crashes on out of bound matrix access using constant index
32617ec681f3Smrg- gitlab: ask for more detailed info about GPU
32627ec681f3Smrg- mesa: fix formatting of messages printed using \_mesa_log
32637ec681f3Smrg- anv: refresh cached current batch bo after emitting some commands
32647ec681f3Smrg- iris: handle os_dupfd_cloexec failure
32657ec681f3Smrg- iris: verify color component width in convert_fast_clear_color
32667ec681f3Smrg- i965: verify format width in blorp_get_client_bo
32677ec681f3Smrg- intel/perf: don't generate logically dead code
32687ec681f3Smrg- intel/compiler/test: use TEST_DEBUG env var consistently
32697ec681f3Smrg- intel/compiler: mark debug constant as const
32707ec681f3Smrg- intel/fs,vec4: remove unused assignments
32717ec681f3Smrg- intel: add INTEL_DEBUG=shaders
32727ec681f3Smrg- intel/fs: add hint how to get more info when shader validation fails
32737ec681f3Smrg- intel/compiler: match brw_compile\_\* declarations with their definitions
32747ec681f3Smrg- intel/compiler: use the same name for nir shaders in brw_compile\_\* functions
32757ec681f3Smrg- intel/compiler: move extern C functions out of namespace brw
32767ec681f3Smrg- intel/compiler: print dispatch width when shader fails to compile
32777ec681f3Smrg- intel/compiler: fix typo in a comment
32787ec681f3Smrg- anv: fix minor gen_ioctl(I915_PERF_IOCTL_CONFIG) error handling issue
32797ec681f3Smrg- intel/compiler: remove unused fs_validator::param_size
32807ec681f3Smrg- intel/compiler: initialize remaining fields of various classes
32817ec681f3Smrg- intel/tools: fix possible memory leak in the error path
32827ec681f3Smrg- intel/tools: handle ftell errors
32837ec681f3Smrg- intel/compiler: quiet Coverity warnings
32847ec681f3Smrg- intel/tools: fix possible randomly increased verbosity of error2aub
32857ec681f3Smrg- intel: add INTEL_DEBUG expected value in declaration
32867ec681f3Smrg- iris: drop likely/unlikely around INTEL_DEBUG
32877ec681f3Smrg- i965: drop likely/unlikely around INTEL_DEBUG
32887ec681f3Smrg- anv: drop likely/unlikely around INTEL_DEBUG
32897ec681f3Smrg- intel: drop likely/unlikely around INTEL_DEBUG
32907ec681f3Smrg- vulkan/wsi: fix possible random stalls in wsi_display_wait_for_event
32917ec681f3Smrg- intel/tools: fix invalid type in argument to printf
32927ec681f3Smrg- intel/genxml: don't generate identical code for different branches
32937ec681f3Smrg- anv: always annotate memory returned from anv_gem_mmap
32947ec681f3Smrg- intel: remove dead code
32957ec681f3Smrg- i965: remove prototypes of not-existing functions
32967ec681f3Smrg- intel/compiler: use C++ template instead of preprocessor
32977ec681f3Smrg- intel/compiler: remove branch weight heuristic
32987ec681f3Smrg- intel/tools: allow --color option to be used without arg
32997ec681f3Smrg- anv: remove dead code from anv_create_cmd_buffer
33007ec681f3Smrg- intel/tools: handle some failures
33017ec681f3Smrg- intel/tools: refactor logging to be easier to follow by static analyzers
33027ec681f3Smrg- intel/tools: add missing new lines to few remaining fail_if users
33037ec681f3Smrg- nir: handle float atomics in copy propagation pass
33047ec681f3Smrg
33057ec681f3SmrgMarek Olšák (278):
33067ec681f3Smrg
33077ec681f3Smrg- radeonsi: enable ETC2 hw acceleration on Raven2
33087ec681f3Smrg- ac/gpu_info: set num_tiles_pipes on gfx10+ too
33097ec681f3Smrg- Revert "radeonsi: honor a user-specified pitch on gfx10.3"
33107ec681f3Smrg- radeonsi: use correct wave size in gfx10_ngg_calculate_subgroup_info
33117ec681f3Smrg- radeonsi: use the same units for esgs_ring_size and ngg_emit_size
33127ec681f3Smrg- radeonsi: increase minimum NGG vertex count requirement per workgroup on gfx 10.3
33137ec681f3Smrg- radeonsi: fix applying the NGG minimum vertex count requirement
33147ec681f3Smrg- radeonsi: don't count unusable vertices to the NGG LDS size
33157ec681f3Smrg- radeonsi: add a common function for getting the size of gs_ngg_scratch
33167ec681f3Smrg- radeonsi: remove the NGG hack decreasing LDS usage to deal with overflows
33177ec681f3Smrg- radeonsi: various fixes for gfx10.3
33187ec681f3Smrg- radeonsi: disable NGG culling on gfx10.3 because of hangs
33197ec681f3Smrg- radeonsi: fix compute-based culling with VERTEX_COUNTER_GDS_MODE == 1
33207ec681f3Smrg- compiler: add glsl_print_type
33217ec681f3Smrg- nir: remove nir_strip stub declaration
33227ec681f3Smrg- nir: handle load_input_vertex in nir_get_io_offset_src
33237ec681f3Smrg- nir: save IO semantics in lowered IO intrinsics
33247ec681f3Smrg- nir: gather all IO info from IO intrinsics
33257ec681f3Smrg- nir: update IO semantics in nir_io_add_const_offset_to_base
33267ec681f3Smrg- nir: print IO semantics (v2)
33277ec681f3Smrg- nir: properly identify texcoords for lowered IO in nir_lower_drawpixels
33287ec681f3Smrg- nir: add shader_info::io_lowered
33297ec681f3Smrg- nir: add interpolation qualifiers for color sysvals into shader_info
33307ec681f3Smrg- nir: generate lowered IO in nir_lower_passthrough_edgeflags
33317ec681f3Smrg- st/mesa: don't pass NIR to draw module if IO is lowered
33327ec681f3Smrg- st/mesa: don't generate NIR for ARB_vp/fp if NIR is not preferred
33337ec681f3Smrg- st/mesa: handle lowered IO in st_nir_assign_vs_in_locations
33347ec681f3Smrg- gallium/tgsi: add helper tgsi_get_interp_mode
33357ec681f3Smrg- radeonsi: fix tess levels coming as scalar arrays from SPIR-V
33367ec681f3Smrg- st/mesa: remove useless code for lowered IO in st_nir_assign_vs_in_locations
33377ec681f3Smrg- gallivm: fix build on LLVM 12 due to LLVMAddConstantPropagationPass removal
33387ec681f3Smrg- amd/registers: expose the canonicalize.py program as a function
33397ec681f3Smrg- amd/registers: sort registers by offset in json
33407ec681f3Smrg- amd/registers: add a script that generates json from kernel headers
33417ec681f3Smrg- amd/registers: add non-gfx10 register files generated from kernel headers
33427ec681f3Smrg- amd/registers: switch to new generated register definitions
33437ec681f3Smrg- nir: fix a bug in is_dual_slot in nir_io_add_const_offset_to_base
33447ec681f3Smrg- st/mesa: fix lowered IO - don't call st_nir_assign_vs_in_locations twice
33457ec681f3Smrg- radeonsi: don't crash if input_usage_mask is 0 for a VS input
33467ec681f3Smrg- radeonsi: get color interpolation info from shader_info
33477ec681f3Smrg- radeonsi: clean up code for loading VS inputs
33487ec681f3Smrg- ac/nir: handle all lowered IO intrinsics
33497ec681f3Smrg- radeonsi: lower IO intrinsics - complete rewrite of input/output scanning
33507ec681f3Smrg- radeonsi: remove in/out/uniform variables from NIR after lowering IO
33517ec681f3Smrg- radeonsi: don't lower indirect IO in GLSL
33527ec681f3Smrg- radeonsi: don't execute LDS stores for TCS outputs that are never read
33537ec681f3Smrg- radeonsi: simplify handling color interp modes in si_emit_spi_map
33547ec681f3Smrg- radeonsi: change PIPE_SHADER to MESA_SHADER (si_shader_selector::type)
33557ec681f3Smrg- radeonsi: change PIPE_SHADER to MESA_SHADER (si_shader_context::type)
33567ec681f3Smrg- radeonsi: change PIPE_SHADER to MESA_SHADER (debug flags)
33577ec681f3Smrg- radeonsi: change PIPE_SHADER to MESA_SHADER (si_compile_llvm)
33587ec681f3Smrg- radeonsi: change PIPE_SHADER to MESA_SHADER (si_get_shader_part)
33597ec681f3Smrg- radeonsi: remove unused si_shader_context::type
33607ec681f3Smrg- radeonsi: change PIPE_SHADER to MESA_SHADER (si_shader_dump_disassembly)
33617ec681f3Smrg- radeonsi: precompute si_*_descriptors_idx in si_shader_selector
33627ec681f3Smrg- radeonsi: change PIPE_SHADER to MESA_SHADER (si_dump_descriptors)
33637ec681f3Smrg- radeonsi: remove si_shader_selector::type
33647ec681f3Smrg- compiler: add INTERP_MODE_COLOR for radeonsi
33657ec681f3Smrg- radeonsi: replace TGSI_INTERPOLATE with INTERP_MODE
33667ec681f3Smrg- radeonsi: replace TGSI_SEMANTIC with VARYING_SLOT and FRAG_RESULT
33677ec681f3Smrg- radeonsi: optimize out the loop in si_get_ps_input_cntl
33687ec681f3Smrg- ac/llvm: fix unaligned VS input loads on gfx10.3
33697ec681f3Smrg- nir: get ffma support from NIR options for nir_lower_flrp
33707ec681f3Smrg- nir/algebraic: trivially enable existing 32-bit patterns for all bit sizes
33717ec681f3Smrg- nir/algebraic: add 16-bit versions of a few 32-bit patterns
33727ec681f3Smrg- glsl_to_nir: fix crashes with int16 shifts
33737ec681f3Smrg- radeonsi: remove redundant no-signed-zero-fp-math LLVM attribute
33747ec681f3Smrg- radeonsi: move nir_shader_compiler_options into si_screen
33757ec681f3Smrg- Revert "ac: generate FMA for inexact instructions for radeonsi"
33767ec681f3Smrg- ac/llvm: remove stub prototype for fmed3
33777ec681f3Smrg- ac/llvm: fix amdgcn.rcp for v2f16
33787ec681f3Smrg- ac/llvm: fix amdgcn.fract for v2f16
33797ec681f3Smrg- ac/llvm: fix amdgcn.rsq for v2f16
33807ec681f3Smrg- ac/llvm: fix bcsel for v2*16
33817ec681f3Smrg- ac/llvm: remove dead code handling for fmod
33827ec681f3Smrg- ac/llvm: add better code for isign
33837ec681f3Smrg- ac/llvm: add better code for fsign
33847ec681f3Smrg- ac/llvm: fix b2f for v2f16
33857ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_NEXT_SHADER
33867ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_VS_WINDOW_SPACE_POSITION / VS_BLIT_SGPRS_AMD
33877ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_TCS_VERTICES_OUT
33887ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_TES_POINT_MODE / TES_PRIM_MODE
33897ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_TES_SPACING
33907ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_TES_VERTEX_ORDER_CW
33917ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_GS\_\*
33927ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_CS\_\*
33937ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_FS_EARLY_DEPTH_STENCIL
33947ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_FS_POST_DEPTH_COVERAGE
33957ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_FS_COORD_PIXEL_CENTER
33967ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_FS_DEPTH_LAYOUT
33977ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_CS_LOCAL_SIZE
33987ec681f3Smrg- radeonsi: stop using TGSI_PROPERTY_FS_COLOR0_WRITES_ALL_CBUFS
33997ec681f3Smrg- radeonsi: remove info::samplers_declared, image_buffers, msaa_images_declared
34007ec681f3Smrg- radeonsi: remove redundant si_shader_info::shader_buffers_declared
34017ec681f3Smrg- radeonsi: remove redundant si_shader_info::images_declared
34027ec681f3Smrg- radeonsi: remove redundant si_shader_info::const_buffers_declared
34037ec681f3Smrg- radeonsi: remove redundant si_shader_info:\*(clip|cull)\* fields
34047ec681f3Smrg- radeonsi: remove unused si_shader_info::uses_(vertexid|basevertex)
34057ec681f3Smrg- radeonsi: merge uses_persp_opcode_interp_sample/uses_linear_opcode_interp_sample
34067ec681f3Smrg- radeonsi: remove redundant si_shader_info::uses_kill
34077ec681f3Smrg- radeonsi: reduce type sizes in si_shader_selector
34087ec681f3Smrg- radeonsi: rename num_memory_instructions -\> num_memory_stores
34097ec681f3Smrg- radeonsi: remove redundant si_shader_info::writes_memory
34107ec681f3Smrg- radeonsi: remove redundant GS variables in si_shader_selector
34117ec681f3Smrg- radeonsi: remove redundant si_shader_selector::max_gs_stream
34127ec681f3Smrg- radeonsi: remove redundant si_shader_info::uses_derivatives
34137ec681f3Smrg- radeonsi: use shader_info::cs::local_size_variable to clean up some code
34147ec681f3Smrg- radeonsi: deduplicate setting key.mono.u.vs_export_prim_id
34157ec681f3Smrg- radeonsi: kill point size VS output if it's not used by the rasterizer
34167ec681f3Smrg- radeonsi: set outputs_written_before_ps for geometry shaders too
34177ec681f3Smrg- radeonsi: eliminate unused shader outputs for separate NGG geometry shaders
34187ec681f3Smrg- radeonsi: remove swizzle == ~0 dead code in si_llvm_load_input_gs
34197ec681f3Smrg- ac,radeonsi: lower 64-bit IO to 32 bits and remove all dead code
34207ec681f3Smrg- radeonsi: inline trivial PS functions
34217ec681f3Smrg- nir: add mediump flag to IO semantics
34227ec681f3Smrg- nir: fix lower_mediump_outputs to not require variables
34237ec681f3Smrg- nir/algebraic: add flrp patterns for 16 and 64 bits
34247ec681f3Smrg- nir/algebraic: expand existing 32-bit patterns to all bit sizes using loops
34257ec681f3Smrg- nir: remove redundant opcode u2ump
34267ec681f3Smrg- nir: enforce 32-bit src type requirement for f2fmp and i2imp
34277ec681f3Smrg- nir: add new mediump opcodes f2[ui]mp, i2fmp, u2fmp
34287ec681f3Smrg- nir/algebraic: collapse conversion opcodes (many patterns)
34297ec681f3Smrg- nir/algebraic: add late optimizations that optimize out mediump conversions (v3)
34307ec681f3Smrg- nir/opt_vectorize: don't lose exact and no_*_wrap flags
34317ec681f3Smrg- st/mesa: don't enable NV_copy_depth_to_color if NIR doesn't support FP64
34327ec681f3Smrg- nir,radeonsi: move ffma fusing to late optimizations for better codegen
34337ec681f3Smrg- radeonsi: clean up ffma handling
34347ec681f3Smrg- Revert "radeonsi: set BIG_PAGE fields on gfx10.3"
34357ec681f3Smrg- Revert "radeonsi: move L2_CACHE_CONTROL registers into si_emit_framebuffer_state"
34367ec681f3Smrg- radeonsi: don't lower pack for better 16-bit vectorization
34377ec681f3Smrg- radeonsi: set flags for FP16 in shaders
34387ec681f3Smrg- radeonsi: implement 16-bit FS color outputs
34397ec681f3Smrg- radeonsi: vectorize IO for better ALU vectorization
34407ec681f3Smrg- radeonsi: don't scalarize 16-bit vec2 ALU opcodes
34417ec681f3Smrg- radeonsi: add 16-bit ALU vectorization
34427ec681f3Smrg- gallium: rename PIPE_TRANSFER\_\* -\> PIPE_MAP\_\*
34437ec681f3Smrg- gallium: rename pipe_transfer_usage -\> pipe_map_flags
34447ec681f3Smrg- gallium: rename transfer flags -\> map flags in comments
34457ec681f3Smrg- radeon: rename RADEON_TRANSFER\_\* -\> RADEON_MAP\_\*
34467ec681f3Smrg- radeonsi: set TRUNC_COORD=0 for Total War: WARHAMMER to fix it
34477ec681f3Smrg- radeonsi: move debug options from si_disk_cache_create to si_get_ir_cache_key
34487ec681f3Smrg- radeonsi: remove KILL_PS_INF_INTERP/CLAMP_DIV_BY_ZERO, use screen::options
34497ec681f3Smrg- amd: add Dimgrey Cavefish support
34507ec681f3Smrg- amd: add VanGogh support
34517ec681f3Smrg- radeonsi: set KEEP_TOGETHER_ENABLE if needed
34527ec681f3Smrg- radeonsi: move binning parameters into si_screen
34537ec681f3Smrg- radeonsi: break a binning batch on a new PS if bins can use multiple state sets
34547ec681f3Smrg- radeonsi: add a tweak for PS wave CU utilization for gfx10.3
34557ec681f3Smrg- nir: split fuse_ffma into fuse_ffma16/32/64
34567ec681f3Smrg- nir: split lower_ffma into lower_ffma16/32/64
34577ec681f3Smrg- radeonsi: fuse or lower ffma optimally on all chips
34587ec681f3Smrg- nir: replace lower_ffma and fuse_ffma with has_ffma
34597ec681f3Smrg- radeonsi: use optimal order of operations when setting up a compute dispatch
34607ec681f3Smrg- radeonsi: call si_upload_graphics_shader_descriptors before the big conditional
34617ec681f3Smrg- radeonsi: move a displaced comment in si_draw_vbo
34627ec681f3Smrg- radeonsi: don't call emit_cache_flush after uploading bindless descriptors
34637ec681f3Smrg- radeonsi: reorganize the code around the gfx9 scissor bug
34647ec681f3Smrg- radeonsi: move si_upload_vertex_buffer_descriptors into si_state_draw.c
34657ec681f3Smrg- radeonsi: add unlikely statements into si_draw_vbo
34667ec681f3Smrg- radeonsi: lift the conditional for skipping si_upload_vertex_buffer_descriptors
34677ec681f3Smrg- radeonsi: always inline draw-related functions that have only one use
34687ec681f3Smrg- nir: gather indirect info from lowered IO intrinsics
34697ec681f3Smrg- nir: gather tess.tcs_cross_invocation info from lowered IO intrinsics
34707ec681f3Smrg- nir: set system_values_read for all intrinsics
34717ec681f3Smrg- nir: gather fs.uses_sample_qualifier from lowered IO
34727ec681f3Smrg- nir: fix input/output info gathering for lowered IO
34737ec681f3Smrg- nir: gather information about fbfetch and dual source color
34747ec681f3Smrg- radeonsi: fix indirect dispatches with variable block sizes
34757ec681f3Smrg- radeonsi: call nir_shader_gather_info after lowering and optimizing NIR
34767ec681f3Smrg- radeonsi: use info.system_values_read
34777ec681f3Smrg- radeonsi: get information about FS color outputs from shader_info directly
34787ec681f3Smrg- radeonsi: get input/output usage flags from shader_info directly
34797ec681f3Smrg- radeonsi: run NIR optimizations that glsl_to_nir runs but other places might not
34807ec681f3Smrg- radeonsi: assume that constant load_local_group_size has been optimized out
34817ec681f3Smrg- radeonsi: remove redundant variables from struct si_compute
34827ec681f3Smrg- radeonsi: remove redundant info.uses_fbfetch
34837ec681f3Smrg- gallivm: add support for lowered IO in vertex shaders
34847ec681f3Smrg- util: implement f16c - fast half<->float conversions
34857ec681f3Smrg- util: move util_half_to_float code into \_mesa_half_to_float_slow
34867ec681f3Smrg- util: remove util_float_to_half and util_half_to_float wrappers
34877ec681f3Smrg- gallium/util: remove redundant util_float_to_half_rtz
34887ec681f3Smrg- gallium/util: remove empty file u_half.h
34897ec681f3Smrg- radeonsi: Fix dead lock with aux_context_lock in si_screen_clear_buffer.
34907ec681f3Smrg- radeonsi: simplify NGG culling enablement and add radeonsi_shader_culling option
34917ec681f3Smrg- radeonsi: kill disabled clip distances and planes at per-channel granularity
34927ec681f3Smrg- radeonsi: move si_set_active_descriptors_for_shader into si_update_common_shader_state
34937ec681f3Smrg- radeonsi: use staging buffer uploads for most VRAM buffers
34947ec681f3Smrg- radeonsi: call nir_lower_bool_to_int32 last because it breaks nir_opt_if
34957ec681f3Smrg- radeonsi: restructure si_pipe_set_constant_buffer
34967ec681f3Smrg- mesa: factor out layout parsing for glInterleavedArrays
34977ec681f3Smrg- gl_marshal.py: inline print_sync_dispatch
34987ec681f3Smrg- driconf: force the vendor string to NVIDIA to fix viewperf energy tests
34997ec681f3Smrg- driconf: enable force_glsl_extensions_warn for viewperf
35007ec681f3Smrg- st/mesa: enable GL name reuse for queries based on the driconf option
35017ec681f3Smrg- util/idalloc: resize if ID is too large for reservation
35027ec681f3Smrg- gallium/util: add set_frontend_noop into driver_noop and u_threaded_context
35037ec681f3Smrg- radeonsi: remove dead variable postponed_kill
35047ec681f3Smrg- radeonsi: implement GL_INTEL_blackhole_render
35057ec681f3Smrg- gallium/u_threaded_context: don't call memcpy in tc_set_constant_buffer
35067ec681f3Smrg- gallium/u_threaded_context: always flush asynchronously if requested
35077ec681f3Smrg- gallium/u_threaded_context: fix use-after-free in transfer_unmap
35087ec681f3Smrg- util: implement F16C using inline assembly on x86_64
35097ec681f3Smrg- util: move util_half_to_float code into \_mesa_half_to_float_slow
35107ec681f3Smrg- util: remove util_float_to_half and util_half_to_float wrappers
35117ec681f3Smrg- gallium/util: remove redundant util_float_to_half_rtz
35127ec681f3Smrg- gallium/util: remove empty file u_half.h
35137ec681f3Smrg- mesa: don't use GET_DISPATCH because it doesn't work with glthread
35147ec681f3Smrg- mesa: remove api_loopback to remove call indirections
35157ec681f3Smrg- glthread: handle glInterleavedArrays
35167ec681f3Smrg- nir/algebraic: always lower idiv to shifts if bitops are allowed
35177ec681f3Smrg- util: add \_mesa_set_create_u32_keys where keys are not pointers
35187ec681f3Smrg- nir: add new helper passes that lower uniforms to literals
35197ec681f3Smrg- gallium: add pipe_context::set_inlinable_constants
35207ec681f3Smrg- st/mesa: pass inlinable uniforms to drivers if they requested it
35217ec681f3Smrg- ac/surface: fix valgrind warnings in DCC retile tile lookups
35227ec681f3Smrg- winsys/amdgpu: rework the VM alignment optimizations
35237ec681f3Smrg- winsys/amdgpu: apply the VM alignment optimization to the physical alignment too
35247ec681f3Smrg- radeonsi: update the DMA perf test
35257ec681f3Smrg- radeonsi: disable SDMA on gfx6-7 and gfx10.3 to decrease CPU overhead
35267ec681f3Smrg- Revert "radeonsi/gfx10: disable vertex grouping"
35277ec681f3Smrg- radeonsi: don't disable NGG culling on gfx10.3
35287ec681f3Smrg- radeonsi: enable NGG culling by default on gfx10.3 dGPUs
35297ec681f3Smrg- radeonsi: optimize out LDS bank conflicts in the NGG culling shader
35307ec681f3Smrg- radeonsi: remove indirection when loading position at the end for NGG culling
35317ec681f3Smrg- radeonsi: write VS/TES system values into LDS after culling
35327ec681f3Smrg- radeonsi: pack LDS better for NGG culling
35337ec681f3Smrg- radeonsi: tweak LATE_ALLOC_GS numbers for faster NGG culling
35347ec681f3Smrg- radeonsi: enable NGG on Navi14 PRO cards
35357ec681f3Smrg- radeonsi: enable NGG culling by default on Navi1x PRO cards
35367ec681f3Smrg- ac/llvm: don't lower bool to int32, switch to native i1 bool
35377ec681f3Smrg- amd: update addrlib
35387ec681f3Smrg- nir: consider load_color intrinsics as both inputs and sysval in gathering
35397ec681f3Smrg- Revert "st/mesa: don't pass NIR to draw module if IO is lowered"
35407ec681f3Smrg- st/mesa: make sure prog->info is up to date for NIR (v2)
35417ec681f3Smrg- amd: regenerate gfx103.json from kernel headers
35427ec681f3Smrg- amd: correct typos in gfx10-rsrc.json
35437ec681f3Smrg- amd: update gfx10-rsrc.json for gfx10.3
35447ec681f3Smrg- amd: replace 0x028848 with the register definition
35457ec681f3Smrg- amd: print NUM_PKRS with AMD_DEBUG=info on gfx10.3
35467ec681f3Smrg- Revert "radeonsi: use staging buffer uploads for most VRAM buffers"
35477ec681f3Smrg- util: remove unused util_get_L3_for_pinned_thread
35487ec681f3Smrg- util: consolidate thread_get_time functions
35497ec681f3Smrg- st/mesa: remove random L3 pinning heuristic for glthread
35507ec681f3Smrg- util: add util_set_thread_affinity helpers including Windows support
35517ec681f3Smrg- util: add util_get_current_cpu using sched_getcpu and Windows equivalent
35527ec681f3Smrg- util: completely rewrite and do AMD Zen L3 cache pinning correctly
35537ec681f3Smrg- glthread: pin driver threads to the same L3 as the main thread regularly
35547ec681f3Smrg- radeonsi: implement inlinable uniforms
35557ec681f3Smrg- gallium: move pipe_draw_info::start/count to the beginning and pad empty space
35567ec681f3Smrg- gallium: add pipe_context::multi_draw
35577ec681f3Smrg- winsys/amdgpu: remove incorrect assertion check against max_check_space_size
35587ec681f3Smrg- radeonsi: add num_draws parameter into si_need_gfx_cs_space
35597ec681f3Smrg- radeonsi don't get count from pipe_draw_info in si_num_prims_for_vertices
35607ec681f3Smrg- radeonsi: don't check info->count == 0
35617ec681f3Smrg- radeonsi: implement multi_draw but supporting only 1 draw
35627ec681f3Smrg- radeonsi: add support for multi draws
35637ec681f3Smrg- radeonsi: set NOT_EOP for back-to-back draws on gfx10+
35647ec681f3Smrg- radeonsi: implement multi_draw for compute-based primitive culling
35657ec681f3Smrg- gallium/u_threaded: move a structure up to be used later
35667ec681f3Smrg- gallium/u_threaded: merge consecutive draw calls within batches
35677ec681f3Smrg- st/mesa: fix use-after-free when updating shader info in st_link_nir
35687ec681f3Smrg- radeonsi: fix min_direct_count value
35697ec681f3Smrg- radeonsi: do VGT_FLUSH when switching NGG -\> legacy on Sienna Cichlid
35707ec681f3Smrg- radeonsi: only do VGT_FLUSH for fast launch if previous draw was normal launch
35717ec681f3Smrg- radeonsi: determine correctly if switching from normal launch to fast launch
35727ec681f3Smrg- radeonsi: add options.inline_uniforms to the shader cache key
35737ec681f3Smrg- ac: fix detection of Pro graphics
35747ec681f3Smrg- ac: fix min/max_good_num_cu_per_sa on gfx10.3 with disabled SEs
35757ec681f3Smrg- radeonsi: fix NGG streamout regression
35767ec681f3Smrg- radeonsi: fix scan_instruction for bindless inc_wrap/dec_wrap atomics
35777ec681f3Smrg- nir: fix gathering TCS cross invocation access with lowered IO
35787ec681f3Smrg- nir: fix gathering patch IO usage with lowered IO
35797ec681f3Smrg- ac/nir: fix a typo in ac_are_tessfactors_def_in_all_invocs
35807ec681f3Smrg- mesa: call FLUSH_VERTICES before changing sampler uniforms
35817ec681f3Smrg- st/mesa: fix uninitialized/random clip plane state vars in lower_ucp
35827ec681f3Smrg- radeonsi: fix a memory leak in si_create_dcc_retile_cs
35837ec681f3Smrg- radeonsi: fix a nasty bug in si_pm4.c
35847ec681f3Smrg- radeonsi: disable WGP mode on gfx10.3 to prevent hangs
35857ec681f3Smrg
35867ec681f3SmrgMarek Vasut (2):
35877ec681f3Smrg
35887ec681f3Smrg- etnaviv: Remove etna_resource_get_status()
35897ec681f3Smrg- etnaviv: Add lock around pending_ctx
35907ec681f3Smrg
35917ec681f3SmrgMarijn Suijten (5):
35927ec681f3Smrg
35937ec681f3Smrg- util: Makefile.sources: Add disk_cache_os.{c,h}
35947ec681f3Smrg- android: gallium/auxiliary: Deduplicate nir_to_tgsi.c inclusion
35957ec681f3Smrg- scons: gallium/auxiliary: Unconditionally compile NIR regardless of LLVM
35967ec681f3Smrg- android: panfrost: Move nir_undef_to_zero to util
35977ec681f3Smrg- android: freedreno: Add freedreno_dev_info.[ch\] to Makefile.sources
35987ec681f3Smrg
35997ec681f3SmrgMark Janes (2):
36007ec681f3Smrg
36017ec681f3Smrg- intel/fs: Assert if lower_source_modifiers converts 32x16 to 32x32 multiplication
36027ec681f3Smrg- intel/fs: work around gen12 lower-precision source modifier limitation
36037ec681f3Smrg
36047ec681f3SmrgMark Menzynski (5):
36057ec681f3Smrg
36067ec681f3Smrg- nv50/ir: Use a bit field in info_out structure
36077ec681f3Smrg- nv50/ir: Add nv50_ir_prog_info_out serialize and deserialize
36087ec681f3Smrg- nv50/ir: Add prog_info_out print
36097ec681f3Smrg- nv50/ir: Add nv50_ir_prog_info serialize
36107ec681f3Smrg- nvc0: Add shader disk caching
36117ec681f3Smrg
36127ec681f3SmrgMartin Peres (11):
36137ec681f3Smrg
36147ec681f3Smrg- driconf: bump the maximum string size from 25 to 1024
36157ec681f3Smrg- driconf: initialize the option value before using it
36167ec681f3Smrg- dri/DRI2ConfigQueryExtension: add support for string options
36177ec681f3Smrg- glx/extensions: split set_glx_extension into find\_ and set\_
36187ec681f3Smrg- glx: stop using hardcoded array sizes for bitfields
36197ec681f3Smrg- glx: initial plumbing to let users force-enable/disable extensions
36207ec681f3Smrg- glx: let users force-enable/disable indirect GL extensions
36217ec681f3Smrg- driconf: add a way to override GLX extensions
36227ec681f3Smrg- driconf: add a way to override indirect-GL extensions
36237ec681f3Smrg- driconf: disable GLX_OML_swap_method by default on Brink
36247ec681f3Smrg- driconf: allow higher compat version for Brink
36257ec681f3Smrg
36267ec681f3SmrgMatt Turner (3):
36277ec681f3Smrg
36287ec681f3Smrg- intel/tools: Disassemble WAIT's argument as a destination
36297ec681f3Smrg- Revert F16C series (MR 6774)
36307ec681f3Smrg- glcpp: Handle bison-3.6 error message changes
36317ec681f3Smrg
36327ec681f3SmrgMauro Rossi (28):
36337ec681f3Smrg
36347ec681f3Smrg- android: panfrost: Rename encoder/ to lib/
36357ec681f3Smrg- android: panfrost: Move pandecode into lib/
36367ec681f3Smrg- android: pan/mdg: Separate disassembler and compiler targets
36377ec681f3Smrg- android: pan/bi: Separate disasm/compiler targets
36387ec681f3Smrg- android: panfrost: Redirect cmdstream includes through GenXML
36397ec681f3Smrg- android: panfrost/bifrost: add libpanfrost_lib static dependency
36407ec681f3Smrg- android: panfrost: Redirect cmdstream includes through GenXML (v2)
36417ec681f3Smrg- android: util/format: fix generated sources rules
36427ec681f3Smrg- android: amd/registers: switch to new generated register definitions
36437ec681f3Smrg- android: util: fix missing include path
36447ec681f3Smrg- android: nv50/ir: Add nv50_ir_prog_info_out serialize and deserialize
36457ec681f3Smrg- android: freedreno: Implement pipe screen's get_device/driver_uuid()
36467ec681f3Smrg- android: freedreno/common: add libmesa_git_sha1 static dependency
36477ec681f3Smrg- egl/android: HAVE_DRM_GRALLOC path fixes (v2)
36487ec681f3Smrg- android: aco/isel: Move context initialization code to a dedicated file
36497ec681f3Smrg- android: pan/bi: Use new disassembler
36507ec681f3Smrg- android: pan/bi: Use new packing
36517ec681f3Smrg- android: pan/bi: fix typo in bifrost_gen_disasm.c gen rules
36527ec681f3Smrg- android: gallium/iris: cleanup iris_driinfo.h gen rules
36537ec681f3Smrg- android: gallium/radeonsi: cleanup si_driinfo.h gen rules
36547ec681f3Smrg- android: gallium/virgl: cleanup virgl_driinfo.h gen rules
36557ec681f3Smrg- android: util: add log.c to Makefile.sources
36567ec681f3Smrg- android: pan/bi: Use new disassembler (v2)
36577ec681f3Smrg- android: panfrost: use python3 for generated sources rules
36587ec681f3Smrg- android: util: Move xxd.py to util
36597ec681f3Smrg- android: util,ac,aco,radv: Cross-platform memstream API
36607ec681f3Smrg- android: fix libsync dependencies (v2)
36617ec681f3Smrg- android: aco: add aco_form_hard_clauses.cpp to Makefile.sources
36627ec681f3Smrg
36637ec681f3SmrgMichael Olbrich (1):
36647ec681f3Smrg
36657ec681f3Smrg- meson.build: xxf86vm is not needed for -Dglx-direct=false
36667ec681f3Smrg
36677ec681f3SmrgMichael Tretter (2):
36687ec681f3Smrg
36697ec681f3Smrg- etnaviv: fix comment for source of etna_mesa_debug
36707ec681f3Smrg- etnaviv: free tgsi tokens when shader state is deleted
36717ec681f3Smrg
36727ec681f3SmrgMichel Dänzer (31):
36737ec681f3Smrg
36747ec681f3Smrg- ci: Fix up rules for post-merge / main project branch pipelines
36757ec681f3Smrg- ci: Create test-docs job in mesa/mesa pipelines for MRs
36767ec681f3Smrg- ci: Don't exclude "success" job from mesa/mesa pipelines for MRs
36777ec681f3Smrg- ci: Restrict "success" job to pipelines for MRs
36787ec681f3Smrg- ci: Do not create manual test-docs job in post-merge pipelines
36797ec681f3Smrg- ci: Remove any existing results directory before running piglit
36807ec681f3Smrg- ci: Add "is scheduled pipeline" YAML anchor
36817ec681f3Smrg- ci: Add "is master branch of main project" YAML anchor
36827ec681f3Smrg- ci: Add "is pre-merge pipeline for Marge Bot" YAML anchor
36837ec681f3Smrg- ci: Add "is post-merge pipeline, not for Marge Bot" YAML anchor
36847ec681f3Smrg- ci: Add "is forked branch or pre-merge pipeline" YAML anchor
36857ec681f3Smrg- ci: Add "is forked branch" YAML anchor
36867ec681f3Smrg- ci: Add "is post-merge pipeline" YAML anchor
36877ec681f3Smrg- ci: Add "is pre-merge pipeline" YAML anchor
36887ec681f3Smrg- ci: Add "is for Marge Bot" YAML anchor
36897ec681f3Smrg- ci: Always use CI_PROJECT_NAMESPACE instead of CI_PROJECT_PATH
36907ec681f3Smrg- ci: Prevent pages job from running in pre-merge pipelines
36917ec681f3Smrg- ci: Don't create test-docs job if the pages one exists in the pipeline
36927ec681f3Smrg- ci: Use ignore_scheduled_pipelines anchor in .radeonsi-rules
36937ec681f3Smrg- gallium: Make pipe_viewport_state swizzle_x/y/z/w bit-fields 8 bits wide
36947ec681f3Smrg- ci: Move test-docs job to deploy stage
36957ec681f3Smrg- ci: Add empty needs: to pages job
36967ec681f3Smrg- ci: Add jobs running ci-fairy checks
36977ec681f3Smrg- loader/dri3: Only allocate additional buffers if needed
36987ec681f3Smrg- loader/dri3: Keep current number of back buffers if frame was skipped
36997ec681f3Smrg- loader/dri3: Allocate up to 4 back buffers for page flips
37007ec681f3Smrg- ci: Add "check mr" job to needs: of build jobs
37017ec681f3Smrg- ci: Run git_archive job if all_paths matches
37027ec681f3Smrg- i965/bufmgr: Handle NULL bufmgr in brw_bufmgr_get_for_fd
37037ec681f3Smrg- iris/bufmgr: Handle NULL bufmgr in iris_bufmgr_get_for_fd
37047ec681f3Smrg- ac: Don't negate strstr return values in ac_query_gpu_info
37057ec681f3Smrg
37067ec681f3SmrgMichel Zou (9):
37077ec681f3Smrg
37087ec681f3Smrg- swr: fix build with mingw
37097ec681f3Smrg- swr: missing \_BitScanForward64 on 32 bits win
37107ec681f3Smrg- swr: fix \_BitScanForward64 on unix
37117ec681f3Smrg- util: drop non-posix header fnmatch
37127ec681f3Smrg- lavapipe: fix usleep usage in lvp_device
37137ec681f3Smrg- wsi: move drm code to wsi_common_drm.c
37147ec681f3Smrg- gallium: use libpipe_loader_links
37157ec681f3Smrg- lavapipe: configure suffix in icd json
37167ec681f3Smrg- util: use dllexport for mingw too
37177ec681f3Smrg
37187ec681f3SmrgMike Blumenkrantz (118):
37197ec681f3Smrg
37207ec681f3Smrg- zink: basic primitive restart support for strip/fan topologies
37217ec681f3Smrg- zink: move 8bit index handling out of u_primconvert path
37227ec681f3Smrg- zink: use util_draw_vbo_without_prim_restart for unsupported prim modes
37237ec681f3Smrg- zink: set primitive restart cap
37247ec681f3Smrg- zink: move shader state methods for pipe_context into zink_program.c
37257ec681f3Smrg- zink: adjust zink_shader struct to contain full streamout info
37267ec681f3Smrg- zink: refcount zink_gfx_program objects
37277ec681f3Smrg- zink: split up creating zink_shader objects and VkShaderModule objects
37287ec681f3Smrg- zink: use ZINK_SHADER_COUNT instead of PIPE_SHADER_TYPES - 1 everywhere
37297ec681f3Smrg- zink: start using per-stage flags for new shaders, refcount shader modules
37307ec681f3Smrg- zink: always compile shaders in pipeline order
37317ec681f3Smrg- zink: rename zink_gfx_program::stages to 'modules'
37327ec681f3Smrg- gallium: add pipe_transfer_usage for z/s only mappings
37337ec681f3Smrg- gallium/u_transfer_helper: add util functions for doing deinterleaving during map
37347ec681f3Smrg- zink: print error when getprocaddr fails for extension functions
37357ec681f3Smrg- zink: change pipeline hashes to index based on vk primitive type
37367ec681f3Smrg- zink: handle more draw modes
37377ec681f3Smrg- zink: invalidate pipeline hash on more changes
37387ec681f3Smrg- zink: use u_transfer_helper to split/merge interleaved depth/stencil formats
37397ec681f3Smrg- zink: add note about buffer<->image copy functions not handling multisample
37407ec681f3Smrg- zink: generically handle matrix types
37417ec681f3Smrg- anv: improve error message when failing to open device path
37427ec681f3Smrg- anv: assert that the target bo is valid when adding a reloc list
37437ec681f3Smrg- zink: use correct value for color buffer sample count when creating renderpass
37447ec681f3Smrg- zink: use correct number of samples on framebuffer in set_framebuffer_state
37457ec681f3Smrg- zink: use correct layer count when creating framebuffer
37467ec681f3Smrg- zink: clamp min created fb size to 1x1
37477ec681f3Smrg- zink: verify that src and dst aspects are the same in resource_copy_region hook
37487ec681f3Smrg- zink: implement ARB_instanced_arrays
37497ec681f3Smrg- zink: move viewport count to zink_gfx_pipeline_state
37507ec681f3Smrg- zink: set multiviewport cap in ntv when gl_ViewportIndex is a written output
37517ec681f3Smrg- zink: correctly set up fb-sized scissors for each viewport
37527ec681f3Smrg- zink: apply viewport count when creating pipelines
37537ec681f3Smrg- zink: reorder create_stream_output_target to fix failure case leak
37547ec681f3Smrg- zink: combine all surface layout-setting for src/dst into util function
37557ec681f3Smrg- zink: unify all occurrences of waiting on a fence
37567ec681f3Smrg- zink: correctly handle ARB_arrays_of_arrays in ntv for samplers
37577ec681f3Smrg- zink: run nir_lower_uniforms_to_ubo conditionally
37587ec681f3Smrg- zink: fix shader buffer size caps to use 65536
37597ec681f3Smrg- zink: always emit descriptor set 0 in ntv
37607ec681f3Smrg- zink: emit ubo variables sized based on the overall ubo block size
37617ec681f3Smrg- zink: don't emit ubos or bindings for ubo variables
37627ec681f3Smrg- zink: correctly set up ubo bindings and buffer indices
37637ec681f3Smrg- zink: use sizeof(vec4) multiplier for nir_lower_uniforms_to_ubo
37647ec681f3Smrg- zink: hook up driconf
37657ec681f3Smrg- xmlconfig: fix scandir_filter
37667ec681f3Smrg- zink: handle timestamp queries
37677ec681f3Smrg- zink: handle TIME_ELAPSED queries
37687ec681f3Smrg- zink: add pipe_context::get_timestamp hook
37697ec681f3Smrg- zink: enable pipe caps for ARB_timer_query
37707ec681f3Smrg- anv: remove VkPipelineCacheCreateInfo::flags assert
37717ec681f3Smrg- radv: remove VkPipelineCacheCreateInfo::flags assert
37727ec681f3Smrg- util/hash_table: add function for reserving size in a hash table
37737ec681f3Smrg- zink: enable VK_KHR_vulkan_memory_model extension
37747ec681f3Smrg- zink: add VK_EXT_custom_border_color
37757ec681f3Smrg- zink: support VK_EXT_blend_operation_advanced
37767ec681f3Smrg- zink: support VK_EXT_extended_dynamic_state
37777ec681f3Smrg- zink: add VK_EXT_pipeline_creation_cache_control
37787ec681f3Smrg- zink: enable VK_EXT_shader_stencil_export
37797ec681f3Smrg- zink: ARB_uniform_buffer_object is now implemented, so add cap and feature doc
37807ec681f3Smrg- glsl: fix up location setting for variables pointing to a UBO's base
37817ec681f3Smrg- nir: update ubo locations in nir_lower_uniforms_to_ubo
37827ec681f3Smrg- zink: add a mechanism to track current resource usage in batches
37837ec681f3Smrg- zink: optimize transfer_map for resources with pending reads/writes
37847ec681f3Smrg- zink: add more explicit fencing for transfer maps
37857ec681f3Smrg- zink: explicitly flag fb attachments as being written to in render passes
37867ec681f3Smrg- zink: don't leak sampler view textures
37877ec681f3Smrg- zink: redo slot mapping again for the last time really I mean it
37887ec681f3Smrg- zink: export PIPE_CAP_MAX*_VARYINGS values
37897ec681f3Smrg- zink: unify code for emitting named uint-based variable instructions
37907ec681f3Smrg- glsl: more accurately handle swizzle in 64bit varying split with no left value
37917ec681f3Smrg- zink: increase descriptor pool sizes for other descriptor types we'll be using
37927ec681f3Smrg- zink: implement ARB_texture_buffer_object
37937ec681f3Smrg- zink: ensure resource tracking for sampler buffers in render batches
37947ec681f3Smrg- zink: assert valid format in zink_create_sampler_view()
37957ec681f3Smrg- zink: handle null attachment for ARB_texture_buffer_object samplers
37967ec681f3Smrg- zink: add VK_BUFFER_USAGE_INDEX_BUFFER_BIT to vertex buffer creation
37977ec681f3Smrg- zink: add last few format maps for ARB_vertex_type_2_10_10_10_rev
37987ec681f3Smrg- zink: fix stencil wrapping
37997ec681f3Smrg- zink: add some spirv_builder functions we'll be using for geometry shaders
38007ec681f3Smrg- zink: handle shader io vars more generically for use with gs
38017ec681f3Smrg- zink: add ntv handling for geometry shader variables
38027ec681f3Smrg- zink: re-transform gl_Position for gs input
38037ec681f3Smrg- zink: add handling for gs in ntv
38047ec681f3Smrg- zink: remove ADJACENCY prim types from primconvert path
38057ec681f3Smrg- zink: round out handling for streamout buffer stride setting during draw
38067ec681f3Smrg- zink: add gallium handling for geometry shaders
38077ec681f3Smrg- zink: enable gs pipe caps
38087ec681f3Smrg- zink: bump to glsl 1.40
38097ec681f3Smrg- zink: mark off GL 3.1 as done in features.txt
38107ec681f3Smrg- zink: GLSL 1.50
38117ec681f3Smrg- zink: set 3.2 complete in features.txt
38127ec681f3Smrg- zink: bump GLSL to 3.30
38137ec681f3Smrg- zink: set 3.3 complete in features.txt
38147ec681f3Smrg- zink: implement ARB_draw_indirect
38157ec681f3Smrg- zink: add helper for vec-type input variables in ntv
38167ec681f3Smrg- zink: add ntv handling for ARB_sample_shading
38177ec681f3Smrg- zink: add a pipe_context::get_sample_position hook
38187ec681f3Smrg- zink: mark ARB_sample_shading as supported
38197ec681f3Smrg- doc/features: remove zink entries for GL 3.3 items
38207ec681f3Smrg- zink: deduplicate some query result code
38217ec681f3Smrg- zink: more correctly handle PIPE_QUERY_PRIMITIVES_GENERATED queries
38227ec681f3Smrg- zink: also create an xfb query for every primitives generated query
38237ec681f3Smrg- zink: store batch id onto query object at time of start
38247ec681f3Smrg- zink: fixup gs/xfb tracking for primitives generated queries
38257ec681f3Smrg- zink: rework query overflow handling
38267ec681f3Smrg- zink: always use query->type for starting/stopping xfb queries
38277ec681f3Smrg- zink: always reset query pools on next query begin
38287ec681f3Smrg- zink: add pass for lowering dynamic ubo/ssbo block indexing to constants
38297ec681f3Smrg- zink: break up dynamic access lowering
38307ec681f3Smrg- util/threaded_context: use driver's buffer alignment for staging transfers
38317ec681f3Smrg- nir/clip_disable: write 0s instead of undefs for disabled clip planes
38327ec681f3Smrg- nir/clip_disable: try for better no-op
38337ec681f3Smrg- nir/clip_disable: handle 2x vec4 case
38347ec681f3Smrg- zink: implement ARB_texture_query_lod
38357ec681f3Smrg- zink: use same function for all pipe_context::delete_*_state shader methods
38367ec681f3Smrg- zink: add a quadop function in spirv_builder
38377ec681f3Smrg- zink: add some spirv builder functions for barriers
38387ec681f3Smrg
38397ec681f3SmrgNanley Chery (46):
38407ec681f3Smrg
38417ec681f3Smrg- dri_util: Update internal_format to GL_RGB8 for MESA_FORMAT_B8G8R8X8_UNORM
38427ec681f3Smrg- iris: Don't call SET_TILING for dmabuf imports
38437ec681f3Smrg- iris: Make iris_bo_import_dmabuf take a modifier
38447ec681f3Smrg- iris: Drop iris_resource_alloc_separate_aux
38457ec681f3Smrg- iris: Drop unused resource allocation optimization
38467ec681f3Smrg- iris: Drop old comment on clear color BO allocation
38477ec681f3Smrg- iris: Move size/offset calculations out of configure_aux
38487ec681f3Smrg- iris: Add and use iris_resource_configure_main
38497ec681f3Smrg- iris: Drop buffer support in resource_from_handle
38507ec681f3Smrg- gallium/dri2: Report correct YUYV and UYVY plane count
38517ec681f3Smrg- iris: Fix aux assertion in resource_get_handle
38527ec681f3Smrg- iris: Fold a condition into no_gpu for consistency
38537ec681f3Smrg- iris: Make iris_has_color_unresolved more generic
38547ec681f3Smrg- iris: Avoid resolving Z/S reads in transfer_map
38557ec681f3Smrg- iris: Drop a use of the need_resolve boolean
38567ec681f3Smrg- iris: Better determine map_would_stall for Z/S
38577ec681f3Smrg- gallium/dri2: Report I915_FORMAT_MOD_Y_TILED_GEN12_MC_CCS num_planes
38587ec681f3Smrg- gallium/dri2: Support I915_FORMAT_MOD_Y_TILED_GEN12_MC_CCS import
38597ec681f3Smrg- intel/isl: Describe I915_FORMAT_MOD_Y_TILED_GEN12_MC_CCS
38607ec681f3Smrg- intel/isl: Support ISL_AUX_USAGE_MC in surface states
38617ec681f3Smrg- intel/isl: Add YUV format info for the aux-map
38627ec681f3Smrg- st/mesa: Don't map all P01X DRM formats to P016
38637ec681f3Smrg- intel/common: Add get_aux_map_format_bits()
38647ec681f3Smrg- iris: Support planar resource imports for MC
38657ec681f3Smrg- intel/common: Drop unused gen_aux_map_add_image
38667ec681f3Smrg- iris: Support MC modifier in plane count queries
38677ec681f3Smrg- iris: Support I915_FORMAT_MOD_Y_TILED_GEN12_MC_CCS
38687ec681f3Smrg- blorp: Fix alignment test for HIZ_CCS_WT fast-clears
38697ec681f3Smrg- blorp: Drop trailing whitespace in blorp_clear.c
38707ec681f3Smrg- anv/image: Disable multi-layer CCS_E on TGL+
38717ec681f3Smrg- blorp: Ensure aligned HIZ_CCS_WT partial clears
38727ec681f3Smrg- iris: Fix a fast-clear skipping optimization
38737ec681f3Smrg- anv: Enable multi-layer aux-map init for HIZ+CCS
38747ec681f3Smrg- Revert "anv: Add driconf option to disable compression for 16bpp format"
38757ec681f3Smrg- iris: Add fast-clear restriction for 8bpp surfaces
38767ec681f3Smrg- isl: Allow CCS for 8bpp surfaces with 3+ miplevels
38777ec681f3Smrg- st/mesa: Add missing sentinels in format_map[\]
38787ec681f3Smrg- intel/isl: Drop redundant unpack of unorm channels
38797ec681f3Smrg- isl: Fix the aux-map encoding for D24_UNORM_X8
38807ec681f3Smrg- iris: Fix fast-clears of swizzled LA formats
38817ec681f3Smrg- iris: Fix SINT assert in convert_fast_clear_color
38827ec681f3Smrg- iris: Fix fast-clears of swizzled alpha formats
38837ec681f3Smrg- iris: Flush dmabufs during context flushes
38847ec681f3Smrg- mesa: Add and use \_mesa_has_depth_float_channel
38857ec681f3Smrg- mesa: Clamp some depth values in glClearBufferfv
38867ec681f3Smrg- mesa: Clamp some depth values in glClearBufferfi
38877ec681f3Smrg
38887ec681f3SmrgNeil Roberts (3):
38897ec681f3Smrg
38907ec681f3Smrg- v3d: Make the function to set tex dirty state for a stage global
38917ec681f3Smrg- v3d: Split the creating of TEXTURE_SHADER_STATE into a helper function
38927ec681f3Smrg- v3d: Update the TEXTURE_SHADER_STATE when there’s a new buf for a tex
38937ec681f3Smrg
38947ec681f3SmrgPhilipp Zabel (3):
38957ec681f3Smrg
38967ec681f3Smrg- meson: fix power8 option
38977ec681f3Smrg- gallium/dri: fix dri2_query_image for multiplanar images
38987ec681f3Smrg- gallium/dri: fix dri2_from_planar for multiplanar images
38997ec681f3Smrg
39007ec681f3SmrgPierre Moreau (5):
39017ec681f3Smrg
39027ec681f3Smrg- clover/spirv: Remove unused tuple header
39037ec681f3Smrg- clover/spirv: Print linked SPIR-V module if asked
39047ec681f3Smrg- meson: Raise minimum version for SPIR-V OpenCL deps (v4)
39057ec681f3Smrg- clover/llvm: Use the highest supported SPIR-V version (v4)
39067ec681f3Smrg- clover/nir: Register callback for translation messages (v2)
39077ec681f3Smrg
39087ec681f3SmrgPierre-Eric Pelloux-Prayer (61):
39097ec681f3Smrg
39107ec681f3Smrg- ac/llvm: handle static/shared llvm init separately
39117ec681f3Smrg- mesa/st: introduce PIPE_CAP_NO_CLIP_ON_COPY_TEX
39127ec681f3Smrg- radeonsi: enable PIPE_CAP_NO_CLIP_ON_COPY_TEX
39137ec681f3Smrg- ac/llvm: add option to clamp division by zero
39147ec681f3Smrg- radeonsi,driconf: add clamp_div_by_zero option
39157ec681f3Smrg- radeonsi: use radeonsi_clamp_div_by_zero for SPECviewperf13, Road Redemption
39167ec681f3Smrg- amd/llvm: switch to 3-spaces style
39177ec681f3Smrg- amd/common: switch to 3-spaces style
39187ec681f3Smrg- mesa: move u_idalloc from gallium/aux/util to util
39197ec681f3Smrg- util/idalloc: add util_idalloc_reserve
39207ec681f3Smrg- util/idalloc: add lowest_free_idx to avoid iterating from 0
39217ec681f3Smrg- mesa: add a isGenName parameter to \_mesa_HashInsert
39227ec681f3Smrg- mesa: add GL name reuse support
39237ec681f3Smrg- mesa: add \_mesa_HashFindFreeKeys
39247ec681f3Smrg- mesa: use \_mesa_HashFindFreeKeys for GL functions
39257ec681f3Smrg- driconf: add option to reuse GL names
39267ec681f3Smrg- glsl: fix per_vertex_accumulator::fields size
39277ec681f3Smrg- r600/uvd: set dec->bs_ptr = NULL on unmap
39287ec681f3Smrg- radeon/vcn: set dec->bs_ptr = NULL on unmap
39297ec681f3Smrg- radeonsi: fix quant_mode selection for large negative values
39307ec681f3Smrg- radeonsi: fix guardband handling for large values
39317ec681f3Smrg- mesa: fix glUniform\* when a struct contains a bindless sampler
39327ec681f3Smrg- gallium: add PIPE_CAP_MAX_TEXTURE_MB
39337ec681f3Smrg- radeonsi: move GL vendor workaround to drirc
39347ec681f3Smrg- radeonsi: reduce PIPE_SHADER_CAP_MAX_CONST_BUFFER_SIZE value
39357ec681f3Smrg- radeonsi: change vendor name to AMD
39367ec681f3Smrg- radeonsi: force linear for textures with height=1 (gfx6-8)
39377ec681f3Smrg- radeonsi/tmz: use secure job if framebuffer has dcc
39387ec681f3Smrg- radeonsi/tmz: use secure job if using an encrypted z/s buffer
39397ec681f3Smrg- radeonsi/tmz: add safety assert when tmz is enabled
39407ec681f3Smrg- radeonsi/tmz: allocate depth/stencil buffers as encrypted
39417ec681f3Smrg- radeonsi: introduce SI_RESOURCE_FLAG_INTERNAL / RADEON_FLAG_DRIVER_INTERNAL
39427ec681f3Smrg- amd: add AMDGPU_IDS_FLAGS_TMZ definition to amdgpu_drm.h
39437ec681f3Smrg- ac/gpu_info: add detection of TMZ support
39447ec681f3Smrg- radeonsi/tmz: allow secure job if the app made a tmz allocation
39457ec681f3Smrg- amd/winsys: add RADEON_FLUSH_TOGGLE_SECURE_SUBMISSION
39467ec681f3Smrg- radeonsi/tmz: fail si_texture_transfer_map if tex is encrypted
39477ec681f3Smrg- radeonsi/tmz: add tmz variant of sctx::wait_mem_scratch
39487ec681f3Smrg- radeonsi/tmz: add tmz variant for sctx::tess_rings
39497ec681f3Smrg- radeonsi: disable primitive discard if tmz is in use
39507ec681f3Smrg- radeonsi/tmz: add a tmz variant for sctx::eop_bug_scratch
39517ec681f3Smrg- radeonsi/tmz: add workaround for mpv/vaapi subtitles
39527ec681f3Smrg- amd/tmz: move uses_secure_bos to radeon_winsys
39537ec681f3Smrg- gallium/vl: do not call transfer_unmap if transfer is NULL
39547ec681f3Smrg- gallium/vl: add chroma_format arg to vl_video_buffer functions
39557ec681f3Smrg- omx/tizonia: fix build
39567ec681f3Smrg- gallium: add new cap PIPE_CAP_DEVICE_PROTECTED_CONTENT
39577ec681f3Smrg- gallium: introduce PIPE_BIND_PROTECTED
39587ec681f3Smrg- radeonsi: honor PIPE_BIND_PROTECTED
39597ec681f3Smrg- egl: implement EGL_EXT_protected_surface support
39607ec681f3Smrg- radeonsi: enable PIPE_CAP_DEVICE_PROTECTED_CONTENT
39617ec681f3Smrg- egl: handle EGL_PROTECTED_CONTENT_EXT for eglImage
39627ec681f3Smrg- dri: introduce createImageFromDmaBufs3
39637ec681f3Smrg- egl/dri2: implement createImageFromDmaBufs3
39647ec681f3Smrg- driconf: add disable_protected_content_check option
39657ec681f3Smrg- radeonsi: fix RADEON_FLUSH flags conflicts
39667ec681f3Smrg- radeon: add si_vid_create_tmz_buffer helper
39677ec681f3Smrg- radeon/vcn: delay dec->ctx and dec->dpb allocation
39687ec681f3Smrg- va/picture: make sure destination buffer is protected if needed
39697ec681f3Smrg- va: support VA_RT_FORMAT_PROTECTED
39707ec681f3Smrg- radeonsi/gfx10: flush gfx cs on ngg -\> legacy transition
39717ec681f3Smrg
39727ec681f3SmrgPierre-Loup A. Griffais (2):
39737ec681f3Smrg
39747ec681f3Smrg- radv: fix null descriptor for dynamic buffers
39757ec681f3Smrg- radv: fix vertex buffer null descriptors
39767ec681f3Smrg
39777ec681f3SmrgQiang Yu (4):
39787ec681f3Smrg
39797ec681f3Smrg- radeonsi: fix syncobj wait timeout
39807ec681f3Smrg- radeonsi: fix user fence space when MCBP is enabled
39817ec681f3Smrg- radeonsi: fix max syncobj wait timeout
39827ec681f3Smrg- radeonsi: fix user fence GPU address
39837ec681f3Smrg
39847ec681f3SmrgRhys Perry (160):
39857ec681f3Smrg
39867ec681f3Smrg- aco: fix C++11/C++14 compilation
39877ec681f3Smrg- aco: set constant_data_offset correctly in the case of merged shaders
39887ec681f3Smrg- aco: don't move memory accesses to before control barriers
39897ec681f3Smrg- nir/opt_remove_phis: optimize out phis with undef
39907ec681f3Smrg- gitlab: ask inxi output to be in code blocks
39917ec681f3Smrg- util: add a alignof() macro
39927ec681f3Smrg- nir: fix potential left shift of a negative value
39937ec681f3Smrg- nir: fix memory leak in nir_cf_list_clone
39947ec681f3Smrg- radv: don't pass null to \_mesa_sha1_update
39957ec681f3Smrg- radv: align pipeline cache entry and header sizes
39967ec681f3Smrg- radv: fix null memcpy and zero-sized malloc
39977ec681f3Smrg- aco: fix non-rtz pack_half_2x16
39987ec681f3Smrg- nir: add and use nir_intrinsic_has\_ helpers
39997ec681f3Smrg- aco: use nir_intrinsic_has_access
40007ec681f3Smrg- bifrost: use nir_intrinsic_has_type
40017ec681f3Smrg- aco: consider branch definitions in spiller
40027ec681f3Smrg- aco: don't consider the first partial spill if it's the wrong type
40037ec681f3Smrg- aco: don't fix break condition for break+discard to exec
40047ec681f3Smrg- aco: fix regclass checks when fixing to vcc/exec with Builder
40057ec681f3Smrg- aco: fix spills_entry heuristic for branch blocks in init_live_in_vars()
40067ec681f3Smrg- aco: keep loop live-through variables spilled
40077ec681f3Smrg- aco: reserve 2 sgprs for each branch
40087ec681f3Smrg- aco: create long jumps
40097ec681f3Smrg- aco/tests: add test for GFX10 0x3f bug
40107ec681f3Smrg- aco: shorten disassembly for repeated instructions
40117ec681f3Smrg- aco/tests: add tests for long jumps
40127ec681f3Smrg- aco: remove 64-bit SGPR ubfe/ibfe
40137ec681f3Smrg- aco: fix sgpr ubfe/ibfe if the offset is too large
40147ec681f3Smrg- aco: sink get_alu_src() in bfe lowering
40157ec681f3Smrg- spirv: fix Uniform and Output MemoryAccessMakePointer{Visible,Available}
40167ec681f3Smrg- spirv: make OpLoad/OpStore visibility/availablity barriers acquire/release
40177ec681f3Smrg- spirv: add vtn_emit_make_{visible,available}_barrier helpers
40187ec681f3Smrg- spirv: implement MakePointerAvailable/MakePointerVisible for OpCopyMemory
40197ec681f3Smrg- spirv: implement Volatile memory semantic
40207ec681f3Smrg- spirv: implement Volatile image operand
40217ec681f3Smrg- spirv: implement SpvMemoryAccessVolatileMask
40227ec681f3Smrg- spirv: add some tests for volatile/available/visible
40237ec681f3Smrg- radv: remove descriptor_indexing fails from expected fails
40247ec681f3Smrg- aco: fix mad splitting after applying output modifiers
40257ec681f3Smrg- aco: remove omod_success/clamp_success
40267ec681f3Smrg- aco: fix byte_align_scalar for 3 dword vectors
40277ec681f3Smrg- nir/load_store_vectorize: rework alignment calculation
40287ec681f3Smrg- nir/opt_shrink_vectors: shrink image stores using the format
40297ec681f3Smrg- aco: fix one-off error in Operand(uint16_t)
40307ec681f3Smrg- aco: improve fsign selection
40317ec681f3Smrg- nir/opt_if: fix opt_if_merge when destination branch has a jump
40327ec681f3Smrg- nir/opt_loop_unroll: fix is_access_out_of_bounds with vectors
40337ec681f3Smrg- aco: fix v_writelane_b32 with two sgprs
40347ec681f3Smrg- aco: workaround disassembler bug of v_writelane_b32 with literal
40357ec681f3Smrg- aco: don't apply constant to SDWA on GFX8
40367ec681f3Smrg- aco: fix value numbering of reductions
40377ec681f3Smrg- aco: fix validation of sub-dword parallel-copies
40387ec681f3Smrg- aco: pass -fno-exceptions and -fno-rtti
40397ec681f3Smrg- aco: fix incorrect assertion in emit_vop3a_instruction()
40407ec681f3Smrg- radv: initialize with expanded cmask if the destination layout needs it
40417ec681f3Smrg- radv,aco: fix reading primitive ID in FS after TES
40427ec681f3Smrg- aco: keep track of temporaries' regclasses in the Program
40437ec681f3Smrg- aco: use bit vectors for liveness sets
40447ec681f3Smrg- aco: use io semantics to get an intrinsic's slot
40457ec681f3Smrg- aco: use nir_get_io_offset_src() in visit_load_input()
40467ec681f3Smrg- aco: use nir's constant source helpers more
40477ec681f3Smrg- aco: remove dead indirect fs input loading
40487ec681f3Smrg- aco: stop multiplying driver_location by 4
40497ec681f3Smrg- st/nir: call nir_opt_access before gl_nir_lower_buffers
40507ec681f3Smrg- radeonsi: don't use nir_opt_access
40517ec681f3Smrg- nir/instr_set: hash intrinsic sources
40527ec681f3Smrg- nir/load_store_vectorize: improve vectorization with identical operations
40537ec681f3Smrg- aco: fix get_buffer_resource_flags()
40547ec681f3Smrg- aco: remove trailing whitespace
40557ec681f3Smrg- radv: remove trailing whitespace
40567ec681f3Smrg- aco: Add loop creation helpers.
40577ec681f3Smrg- nir: return progress from nir_lower_io_to_scalar_early
40587ec681f3Smrg- radv: move optimizations in shader_compile_to_nir() to after io_to_scalar
40597ec681f3Smrg- radv: use radv_optimize_nir() less in radv_link_shaders()
40607ec681f3Smrg- spirv: add and use a generator id enum
40617ec681f3Smrg- spirv: replace discard with demote for incorrect HLSL->SPIR-V translations
40627ec681f3Smrg- radv: remove RDR2 discard workaround
40637ec681f3Smrg- android: fix SPIR-V -\> NIR build
40647ec681f3Smrg- aco: optimize more uniform reductions/scans
40657ec681f3Smrg- aco: implement elect
40667ec681f3Smrg- radv/aco,nir/lower_subgroups: don't lower elect
40677ec681f3Smrg- nir: add last_invocation intrinsic
40687ec681f3Smrg- aco: implement last_invocation
40697ec681f3Smrg- nir: move divergence analysis options to nir_shader_compiler_options
40707ec681f3Smrg- nir: allow divergence information to be updated when inserting instruction
40717ec681f3Smrg- nir: add pass to optimize uniform atomics
40727ec681f3Smrg- aco: use nir_opt_uniform_atomics
40737ec681f3Smrg- nir/opt_uniform_atomics: optimize image atomics
40747ec681f3Smrg- nir/opt_uniform_atomics: don't optimize atomics twice
40757ec681f3Smrg- aco: fix get_ssbo_size with a vgpr resource
40767ec681f3Smrg- scons: fix SPIR-V -\> NIR build
40777ec681f3Smrg- nir/opt_uniform_atomics: remove useless returns
40787ec681f3Smrg- aco: implement 16-bit literals
40797ec681f3Smrg- aco: propagate literals into sub-dword pseudo instructions on GFX9+
40807ec681f3Smrg- aco: don't use v_pack_b32_f16 if 16-bit input denormals are flushed
40817ec681f3Smrg- nir/opt_load_store_vectorize: don't vectorize stores across demote
40827ec681f3Smrg- nir/opt_load_store_vectorize: add some tests for discard/demote behaviour
40837ec681f3Smrg- aco: add missing SCC clobber in get_buffer_size
40847ec681f3Smrg- ci: disable check commits job for now
40857ec681f3Smrg- nir/loop_analyze: adjust force unrolling to only include interesting modes
40867ec681f3Smrg- ac/nir: remove bindless image atomic format check
40877ec681f3Smrg- aco: remove isel_context::allocated
40887ec681f3Smrg- aco: update phi_map in add_subdword_operand()
40897ec681f3Smrg- aco: don't do divergent break+discard
40907ec681f3Smrg- aco: skip value numbering of copies
40917ec681f3Smrg- aco: copy-propgate through p_create_vector during value numbering
40927ec681f3Smrg- aco: expand vectors passed as copy operands
40937ec681f3Smrg- aco: don't use bld.copy() in handle_operands()
40947ec681f3Smrg- aco: allow literals on sub-dword p_parallelcopy
40957ec681f3Smrg- aco: always use p_parallelcopy for pre-RA copies
40967ec681f3Smrg- aco: use Builder::copy more
40977ec681f3Smrg- aco: remove some unused optimizations
40987ec681f3Smrg- aco: use v_mov_b32_sdwa for some 16-bit constants
40997ec681f3Smrg- aco: remove all-undef phi opt
41007ec681f3Smrg- aco: ignore the ACO-inserted continue in create_continue_phis()
41017ec681f3Smrg- aco: default to a definition size of 32
41027ec681f3Smrg- aco: round bytes_written to dwords if larger than 4 bytes
41037ec681f3Smrg- aco: use control flow creation helpers in select_gs_copy_shader
41047ec681f3Smrg- aco: use mubuf helper in select_gs_copy_shader
41057ec681f3Smrg- aco: move individual instruction disassembly to its own helper
41067ec681f3Smrg- aco: refactor repeated instruction disassembly
41077ec681f3Smrg- aco: switch aco_print_asm to a FILE \\*
41087ec681f3Smrg- aco: create s_clause on GFX10+
41097ec681f3Smrg- aco: assert a label only uses one of the members in ssa_info's union
41107ec681f3Smrg- aco: fix printing of some sdwa sels
41117ec681f3Smrg- aco: fix combine_inverse_comparison()
41127ec681f3Smrg- aco: don't allow destination opsel for v_cvt_pknorm
41137ec681f3Smrg- aco: handle SDWA in the optimizer
41147ec681f3Smrg- docs/features: update unpromoted Vulkan extensions
41157ec681f3Smrg- docs/features: add Vulkan 1.2
41167ec681f3Smrg- radv: add some missing radv_{start,stop}_feedback
41177ec681f3Smrg- radv: fix shader caching with discard->demote workaround
41187ec681f3Smrg- radv: fix shader caching with NaN fixup workaround
41197ec681f3Smrg- nir: scalarize fdot in reverse
41207ec681f3Smrg- spirv: reverse order in matrix multiplication
41217ec681f3Smrg- nir/algebraic: better propagate constants up fadd chains
41227ec681f3Smrg- nir: add nir_alu_src_is_trivial_ssa()
41237ec681f3Smrg- nir: skip bcsel with non-trivial swizzle in opt_simplify_bcsel_of_phi()
41247ec681f3Smrg- nir: use nir_alu_src_is_trivial_ssa() in nir_ssa_for_alu_src()
41257ec681f3Smrg- nir: add shader_info::bit_sizes_used
41267ec681f3Smrg- nir/lower_bit_size: optimize upcast of b2i8/b2i16
41277ec681f3Smrg- radv: move a few passes to after load/store vectorization
41287ec681f3Smrg- radv: do nir_lower_bit_size after algebraic optimizations
41297ec681f3Smrg- radv: rework nir_lower_bit_size callback and run DA on GFX8+
41307ec681f3Smrg- aco: implement some 16-bit arithmetic instead of lowering
41317ec681f3Smrg- aco: implement 8/16-bit instructions which can be trivially widened
41327ec681f3Smrg- spirv: fix GLSLstd450Modf/GLSLstd450Frexp when the destination is vector
41337ec681f3Smrg- util: add mapping from Vulkan to Gallium R64 integer formats
41347ec681f3Smrg- amd/common: add PIPE_FORMAT_R64_{UINT,SINT} to GFX10 format table
41357ec681f3Smrg- aco: implement 64-bit images
41367ec681f3Smrg- ac/nir: implement 64-bit images
41377ec681f3Smrg- radv: implement VK_EXT_shader_image_atomic_int64
41387ec681f3Smrg- aco: don't combine precise max(min()) to med3
41397ec681f3Smrg- aco: fix combine_constant_comparison_ordering() NaN check with 16/64-bit
41407ec681f3Smrg- aco: disallow various v_add_u32 opts if modifiers are used
41417ec681f3Smrg- aco: disable omod if the sign of zeros should be preserved
41427ec681f3Smrg- aco: fix fp16 \*0.5 omod
41437ec681f3Smrg- aco: fix v_mul_hi_u32_u24 format
41447ec681f3Smrg- nir/unsigned_upper_bound: fix buffer overflow in search_phi_bcsel
41457ec681f3Smrg- nir: fix sampler_lod_parameters_pan indices
41467ec681f3Smrg
41477ec681f3SmrgRicardo Garcia (1):
41487ec681f3Smrg
41497ec681f3Smrg- anv: Ignore continue flag in primary cmd buffers
41507ec681f3Smrg
41517ec681f3SmrgRicardo Quesada (1):
41527ec681f3Smrg
41537ec681f3Smrg- anv: support fd==-1 in ImportSemaphoreFdKHR
41547ec681f3Smrg
41557ec681f3SmrgRob Clark (46):
41567ec681f3Smrg
41577ec681f3Smrg- freedreno/registers: add some missing regs to build
41587ec681f3Smrg- freedreno/ir3: don't install ir3_compiler cmdline tool
41597ec681f3Smrg- freedreno/ir3: add tracking for \\# of instructions per category
41607ec681f3Smrg- freedreno/ir3: add more disasm stats
41617ec681f3Smrg- freedreno/crashdec: handle section name typos
41627ec681f3Smrg- freedreno/decode: try harder to not crash in disasm
41637ec681f3Smrg- freedreno/registers: SC_WAIT_WC is not a6xx
41647ec681f3Smrg- freedreno/a6xx: only generate streamout for draw pass shader
41657ec681f3Smrg- freedreno/a6xx: fix occlusion query with more than one tile
41667ec681f3Smrg- freedreno/cffdump: add arg to filter by process name
41677ec681f3Smrg- freedreno/a6xx: disable LRZ when color channels are masked
41687ec681f3Smrg- freedreno/a6xx: refactor debug logging
41697ec681f3Smrg- freedreno: add debug helper to dump buffers
41707ec681f3Smrg- freedreno: handle case of shadowing current render target
41717ec681f3Smrg- freedreno/gmemtool: add tile_alignw/h and a650
41727ec681f3Smrg- freedreno: add env var to override GMEM size
41737ec681f3Smrg- freedreno: add env var to override tiles-per-pipe
41747ec681f3Smrg- freedreno/a6xx: fix hang with large render target
41757ec681f3Smrg- freedreno/batch: split out helper for rb alloc
41767ec681f3Smrg- freedreno/batch: replace lrz_clear with prologue
41777ec681f3Smrg- freedreno/a5xx+a6xx: use sysmem path for nondraw batches
41787ec681f3Smrg- freedreno/a6xx: move ubwc clear to blitter
41797ec681f3Smrg- freedreno: Fix missing rsc->seqno updates
41807ec681f3Smrg- freedreno: fence_server_sync() fixes
41817ec681f3Smrg- freedreno: Fix rast state for multisample clear
41827ec681f3Smrg- freedreno: Don't bypass fd_draw_vbo() in clear fallback
41837ec681f3Smrg- freedreno/a6xx: Skip empty tile_setup
41847ec681f3Smrg- freedreno/a6xx: Fix fd6_draw_vbo() return
41857ec681f3Smrg- freedreno: Clear gs/tcs/tes state for clear blits
41867ec681f3Smrg- freedreno/a6xx: Fix MSAA clear
41877ec681f3Smrg- freedreno: fix fence-fd leak
41887ec681f3Smrg- ci/deqp-runner: Allow overriding width/height/config
41897ec681f3Smrg- ci: cherry-pick deqp fix for config choosing
41907ec681f3Smrg- ci: Enable remaining (non-rotate) mustpass CTS tests
41917ec681f3Smrg- freedreno/drm: drop bo's dev reference
41927ec681f3Smrg- freedreno: Don't leak border_color_buf reference
41937ec681f3Smrg- freedreno/a6xx: Small cleanup
41947ec681f3Smrg- freedreno/drm: Also clean ring_cache
41957ec681f3Smrg- freedreno/registers: Add a couple things used on kernel side
41967ec681f3Smrg- freedreno: Don't leak LRZ bo's
41977ec681f3Smrg- freedreno: Update import/export traces
41987ec681f3Smrg- freedreno: Disallow tiled if SHARED and not QCOM_COMPRESSED
41997ec681f3Smrg- freedreno: Rework GMEM limit init
42007ec681f3Smrg- freedreno/gmem: Respect max-height limits too
42017ec681f3Smrg- freedreno: Protect gmem_cache ralloc allocations
42027ec681f3Smrg- freedreno/ir3: Fix crash in shader compile fail path
42037ec681f3Smrg
42047ec681f3SmrgRohan Garg (3):
42057ec681f3Smrg
42067ec681f3Smrg- anv: Mark anv_dump_{start,finish} as PUBLIC
42077ec681f3Smrg- gitlab-ci: Test the traces from bgfx
42087ec681f3Smrg- virgl: Always enable emulated BGRA and swizzling unless specifically told not to
42097ec681f3Smrg
42107ec681f3SmrgRoland Scheidegger (1):
42117ec681f3Smrg
42127ec681f3Smrg- gallivm: add InstSimplify pass
42137ec681f3Smrg
42147ec681f3SmrgRoman Gilg (2):
42157ec681f3Smrg
42167ec681f3Smrg- vulkan/wsi/x11: add sent image counter
42177ec681f3Smrg- vulkan/wsi/x11: wait for acquirable images in FIFO mode
42187ec681f3Smrg
42197ec681f3SmrgRoman Stratiienko (1):
42207ec681f3Smrg
42217ec681f3Smrg- android: freedreno: Another build fix
42227ec681f3Smrg
42237ec681f3SmrgRuijing Dong (1):
42247ec681f3Smrg
42257ec681f3Smrg- frontends/omx/enc: fix omx h264 encoding force-keyframe-period issue.
42267ec681f3Smrg
42277ec681f3SmrgRyan Neph (1):
42287ec681f3Smrg
42297ec681f3Smrg- virgl: Fixes portal2 binary name in tweak config
42307ec681f3Smrg
42317ec681f3SmrgSagar Ghuge (12):
42327ec681f3Smrg
42337ec681f3Smrg- intel/isl: Drop unnecessary check on 16bpp depth format
42347ec681f3Smrg- intel/blorp: Conditionally clear full surface depth and stencil
42357ec681f3Smrg- anv: Factor out dri option initialization code in separate function
42367ec681f3Smrg- anv: Add driconf option to disable compression for 16bpp format
42377ec681f3Smrg- anv: Return number of layers/levels attached to anv_image
42387ec681f3Smrg- anv: Handle compressed stencil buffer transition on Gen12+
42397ec681f3Smrg- anv: Set stencil_aux_usage flag
42407ec681f3Smrg- anv: Get aux usage from plane while clearing stencil buffer
42417ec681f3Smrg- anv: Don't track clear bo for stencil buffer compression
42427ec681f3Smrg- anv: Return optimal aux state for stencil buffer compression
42437ec681f3Smrg- anv: Pass correct stencil aux usage during MSAA resolve
42447ec681f3Smrg- anv: Enable stencil buffer compression on Gen12+
42457ec681f3Smrg
42467ec681f3SmrgSamuel Iglesias Gonsálvez (14):
42477ec681f3Smrg
42487ec681f3Smrg- freedreno/layout: add tile_all flag to the layout
42497ec681f3Smrg- turnip: add environment variable to disable LRZ
42507ec681f3Smrg- turnip: create LRZ buffer
42517ec681f3Smrg- turnip: disable LRZ on specific cases
42527ec681f3Smrg- turnip: disable LRZ writes when blend is enabled
42537ec681f3Smrg- turnip: disable LRZ depending on fragment changes
42547ec681f3Smrg- turnip: add LRZ tracking to command buffer state
42557ec681f3Smrg- turnip: add LRZ valid tracking for secondary command buffers
42567ec681f3Smrg- turnip: add support to clear LRZ
42577ec681f3Smrg- turnip: emit correct LRZ fast clear setup
42587ec681f3Smrg- turnip: disable LRZ on vkCmdClearAttachments()
42597ec681f3Smrg- turnip: disable LRZ on vkCmdClearattachments() 3D fallback path
42607ec681f3Smrg- turnip: enable LRZ
42617ec681f3Smrg- turnip: don't initialize GRAS_LRZ_CNTL/RB_LRZ_CNTL tu6_init_hw()
42627ec681f3Smrg
42637ec681f3SmrgSamuel Pitoiset (157):
42647ec681f3Smrg
42657ec681f3Smrg- radv: allow to force-enable LLVM internally for a specific shader stage
42667ec681f3Smrg- radv: report the spirv-nir logs back to the application
42677ec681f3Smrg- radv: rework the error function helpers a bit
42687ec681f3Smrg- radv: report errors back to the application via VK_EXT_debug_report
42697ec681f3Smrg- radv: report a better error message when QueueWaitIdle() failed
42707ec681f3Smrg- radv/gfx10: add missing initialization of registers
42717ec681f3Smrg- radv: limit LATE_ALLOC_GS to prevent a GPU hang on GFX10
42727ec681f3Smrg- radv: fix emitting the border color pointer on the compute queue
42737ec681f3Smrg- radv/winsys: add null winsys entries for Sienna Cichild/Navy Flounder
42747ec681f3Smrg- gitlab-ci: test Fossilize with GFX1030
42757ec681f3Smrg- aco: do not set valid_mask for POS0 exports on GFX 10.3
42767ec681f3Smrg- radv: track and report if a logical device is lost
42777ec681f3Smrg- aco: rename DEBUG_VALIDATE to DEBUG_VALIDATE_IR
42787ec681f3Smrg- aco: rework the way various compilation/validation errors are reported
42797ec681f3Smrg- radv,aco: report ACO errors/warnings back via VK_EXT_debug_report
42807ec681f3Smrg- aco: fix file leak in ra_fail()
42817ec681f3Smrg- radv: ignore BB labels when splitting the disassembly string
42827ec681f3Smrg- aco: add ACO_DEBUG=force-waitcnt to emit wait-states
42837ec681f3Smrg- amd/registers: add missing TBA registers on GFX6-GFX8
42847ec681f3Smrg- amd/registers: add some SQ_WAVE\_\* register definitions
42857ec681f3Smrg- aco: add TBA/TMA/TTMP0-11 physical registers definitions
42867ec681f3Smrg- aco: validate that SMEM operands can use fixed registers
42877ec681f3Smrg- aco: add a helper for building a trap handler shader
42887ec681f3Smrg- aco: skip unnecessary compiler pass for the trap handler program
42897ec681f3Smrg- radv: add a small interface for creating the trap handler shader
42907ec681f3Smrg- radv: add initial trap handler support with RADV_TRAP_HANDLER=1
42917ec681f3Smrg- radv: enable the trap handler and configure the shader exceptions
42927ec681f3Smrg- radv: use the trap handler to detect faulty shaders/instructions
42937ec681f3Smrg- radv: align the TMA BO size to 256
42947ec681f3Smrg- radv: allocate the TMA BO into 32-bit addr space
42957ec681f3Smrg- radv: fix setting EXCP_EN for different shader stages
42967ec681f3Smrg- radv: print a warning when RADV_TRAP_HANDLER is used
42977ec681f3Smrg- aco: add ACO_DEBUG=novn,noopt,nosched for debugging purposes
42987ec681f3Smrg- radv: emit {CB,DB}_RMI_L2_CACHE_CONTROL at framebuffer time
42997ec681f3Smrg- radv: set BIG_PAGE to improve performance on GFX10.3
43007ec681f3Smrg- aco: fix wrong source position for constant with nir_op_cube_face_coord
43017ec681f3Smrg- radv: dump shader stats with VK_KHR_pipeline_executable_properties
43027ec681f3Smrg- radv: force RADV_DEBUG=syncshaders when RADV_TRACE_FILE is used
43037ec681f3Smrg- radv: improve reporting faulty pipelines when a GPU hang is detected
43047ec681f3Smrg- radv: dump GPU info into the hang report
43057ec681f3Smrg- nir/algebraic: mark some optimizations with fsat(NaN) as inexact
43067ec681f3Smrg- spirv: fix retrieving dest type for OpFragmentMaskFetchAMD
43077ec681f3Smrg- radv,aco: disable opts if VK_PIPELINE_CREATE_DISABLE_OPTIMIZATION_BIT
43087ec681f3Smrg- aco: handle unaligned loads on GFX10.3
43097ec681f3Smrg- spirv: fix emitting switch cases that directly jump to the merge block
43107ec681f3Smrg- radv: fix transform feedback crashes if pCounterBufferOffsets is NULL
43117ec681f3Smrg- radv: add a helper for loading meta descriptors
43127ec681f3Smrg- radv: do not lower UBO/SSBO access to offsets
43137ec681f3Smrg- radv: remove useless assignment of MAX_API_VERSION
43147ec681f3Smrg- radv: bump the advertised patch version to 145
43157ec681f3Smrg- radv: add VK_KHR_copy_commands2 but leave it disabled
43167ec681f3Smrg- radv: add support for CmdBlitImage2KHR()
43177ec681f3Smrg- radv: add support for CmdCopyBuffer2KHR()
43187ec681f3Smrg- radv: add support for CmdCopyBufferToImage2KHR()
43197ec681f3Smrg- radv: add support for CmdCopyImage2KHR()
43207ec681f3Smrg- radv: add support for CmdCopyImageToBuffer2KHR()
43217ec681f3Smrg- radv: cleanup selecting the hardware resolve path
43227ec681f3Smrg- radv: add support for CmdResolveImage2KHR()
43237ec681f3Smrg- radv: advertise VK_KHR_copy_commands2
43247ec681f3Smrg- radv: set KEEP_TOGETHER_ENABLE if necessary on GFX10+
43257ec681f3Smrg- radv: add a tweak for PS wave CU utilization for gfx10.3
43267ec681f3Smrg- ci: adjust RadeonSI rules
43277ec681f3Smrg- ci: add dEQP-VK.info.device_extensions to the list of skipped tests
43287ec681f3Smrg- nir/lower_memory_model: return progress when visiting instructions
43297ec681f3Smrg- nir/lower_memory_model: do not break with global atomic operations
43307ec681f3Smrg- ac/nir: implement nir_intrinsic_{load,store}_global
43317ec681f3Smrg- ac/nir: implement nir_intrinsic_global_atomic\_\*
43327ec681f3Smrg- radv: lower deref operations for global memory for both backends
43337ec681f3Smrg- ac/llvm: fix invalid IR if image stores are shrinked using the format
43347ec681f3Smrg- nir/lower_io: change nir_io_add_const_offset_to_base to use bitfield modes
43357ec681f3Smrg- radeonsi: call nir_io_add_const_offset_to_base only once per shader
43367ec681f3Smrg- radv/llvm: call nir_lower_io_to_vector with FS to fix array tests
43377ec681f3Smrg- radv: call nir_io_add_const_offset_to_base for FS outputs
43387ec681f3Smrg- radv: move lowering of FS outputs outside of ACO
43397ec681f3Smrg- radv: fix gathering writes_memory for global store/atomic operations
43407ec681f3Smrg- ac/llvm: fix invalid use of unreachable in ac_build_atomic_rmw()
43417ec681f3Smrg- ac/nir: fix nir_intrinsic_shared_atomic_fadd
43427ec681f3Smrg- radv: gather output usage mask from store_output for VS, TES and GS
43437ec681f3Smrg- radv/aco: lower IO for all stages outside of ACO
43447ec681f3Smrg- aco: apply the clamped integer addition disassembly workaround for v_add3
43457ec681f3Smrg- aco/tests: add disassembler tests to reproduce the add3+clamp crash
43467ec681f3Smrg- ac/llvm: adjust dmask when image stores are shrinked using the format
43477ec681f3Smrg- ac/nir: remove dead load/store deref code for temporary variables
43487ec681f3Smrg- radv/llvm: assign driver locations for VS, TCS, TES and GS correctly
43497ec681f3Smrg- radv/llvm: lower GS IO
43507ec681f3Smrg- radv/llvm: lower TES IO
43517ec681f3Smrg- radv/llvm: gather TCS outputs from the output variables
43527ec681f3Smrg- radv/llvm: lower TCS IO
43537ec681f3Smrg- radv/llvm: gather VS input usage mask from load_input
43547ec681f3Smrg- radv/llvm: lower VS IO
43557ec681f3Smrg- ac/llvm: implement nir_op_unpack_half_2x16_split_{x,y}
43567ec681f3Smrg- radv/llvm: enable lower_unpack_half_2x16
43577ec681f3Smrg- ac/nir: remove dead global load/store/atomic derefs code
43587ec681f3Smrg- ac/nir: remove dead shader IO code
43597ec681f3Smrg- radeonsi: remove dead code in TCS/TES/GS since const_index is always 0
43607ec681f3Smrg- ac,radv,radeonsi: remove unused parameters in the shader ABI IO
43617ec681f3Smrg- radv: remove unused gs.writes_memory in the shader info pass
43627ec681f3Smrg- radv: remove dead deref code in the shader info pass
43637ec681f3Smrg- ac/nir,radv: fix invalid IR when loading inline uniform blocks
43647ec681f3Smrg- nir/constant_folding: init nir_const_value to zero
43657ec681f3Smrg- aco: bail out if the NIR IO base offset isn't zero
43667ec681f3Smrg- aco: more uses of nir_get_io_offset_src()
43677ec681f3Smrg- ac/nir: implement nir_op_fsat
43687ec681f3Smrg- radv/llvm: do not lower nir_op_fsat
43697ec681f3Smrg- radv/llvm: remove dead code for 64-bit GS inputs
43707ec681f3Smrg- aco: dump the program if the disassembler failed
43717ec681f3Smrg- radv/llvm: do not lower sub
43727ec681f3Smrg- radv: use the same NIR compiler options for both compiler backends
43737ec681f3Smrg- radv/llvm: stop assigning driver_location in NIR->LLVM
43747ec681f3Smrg- ac,radv,radeonsi: stop multiplying driver_location by 4
43757ec681f3Smrg- ac/nir: pass the variable location to store_tcs_outputs
43767ec681f3Smrg- radv/llvm: switch to NIR IO assigned locations
43777ec681f3Smrg- radv/llvm: reduce the ESGS itemsize by using NIR IO assigned locations
43787ec681f3Smrg- radv/llvm: reduce LDS size for tess by using NIR IO assigned locations
43797ec681f3Smrg- radv: remove one leftover TODO in the shader info pass
43807ec681f3Smrg- ac/llvm: move AC_FETCH_FORMAT to non-LLVM code
43817ec681f3Smrg- radv: replace RADV_ALPHA_ADJUST by AC_FETCH_FORMAT
43827ec681f3Smrg- radv: move lower_io_arrays_to_elements before lower_io_to_scalar_early
43837ec681f3Smrg- radv: fix adjusting vertex alpha
43847ec681f3Smrg- aco: implement missing nir_op_unpack_half_2x16_split_{x,y}_flush_to_zero
43857ec681f3Smrg- radv/aco: disable NGG GS support because it randomly hangs the GPU
43867ec681f3Smrg- radv: fix ignoring the vertex attribute stride if set as dynamic
43877ec681f3Smrg- aco: remove stub lower_wqm() prototype
43887ec681f3Smrg- aco: remove useless occurences of radv_nir_compiler_options
43897ec681f3Smrg- aco: remove unused radv_shader.h includes
43907ec681f3Smrg- radv: move compiler statistics to ACO
43917ec681f3Smrg- aco: compute the CS workgroup size from the shader NIR info
43927ec681f3Smrg- aco: adjust an assertion about the wavesize in emit_gfx10_wave64_bpermute()
43937ec681f3Smrg- radv: fix optimizing needed states if some are marked as dynamic
43947ec681f3Smrg- ac/nir: implement missing nir_op_pack_half_2x16_split
43957ec681f3Smrg- radv: report latest extension spec versions
43967ec681f3Smrg- radv: add missing 'discardtodemote' option in the debug list
43977ec681f3Smrg- Revert "radv/aco: disable NGG GS support because it randomly hangs the GPU"
43987ec681f3Smrg- ac/nir: handle non-const offset with txf/txf_ms
43997ec681f3Smrg- radv: move all NIR pass outside of ACO
44007ec681f3Smrg- ac/nir: do not sign-extend the result of texop_samples_identical
44017ec681f3Smrg- radv,aco: fix use of texop_samples_identical in the resolve meta path
44027ec681f3Smrg- aco: fix determining if LOD is zero for nir_texop_txf/nir_texop_txs
44037ec681f3Smrg- ac/nir: ignore set_vertex_and_primitive_count intrinsic
44047ec681f3Smrg- ac/nir: abort when an unknown intrinsic is reached
44057ec681f3Smrg- ac: add an option to dump GPU info to a file
44067ec681f3Smrg- radv: add radv_dump_cmd() helper
44077ec681f3Smrg- radv: dump UMR ring and waves into the hang report
44087ec681f3Smrg- radv: dump GPU hang report logs into $HOME/radv_dumps_<pid\>
44097ec681f3Smrg- radv: re-order GPU hang report dumps by usefulness
44107ec681f3Smrg- radv: replace RADV_TRACE_FILE by RADV_DEBUG=hang
44117ec681f3Smrg- radv: do not perform a FMASK expand for non-writeable MSAA images
44127ec681f3Smrg- radv: flush CB before and after FMASK_DECOMPRESS or DCC_DECOMPRESS
44137ec681f3Smrg- radv: enable VK_AMD_mixed_attachment_samples on GFX6-GFX7
44147ec681f3Smrg- radv,aco: adjust the sample mask only if per-sample shading is enabled
44157ec681f3Smrg- radv,aco: optimize computing the sample mask for per-sample shading
44167ec681f3Smrg- aco: store NIR range analysis data to the isel context
44177ec681f3Smrg- aco: select v_mul_{hi}_u32_u24 for 24-bit multiplications
44187ec681f3Smrg- nir/algebraic: distribute imul(iadd(a, b), c) when b and c are constants
44197ec681f3Smrg- aco: optimize v_and(a, v_subbrev_co(0, 0, vcc)) -\> v_cndmask(0, a, vcc)
44207ec681f3Smrg- nir/algebraic: optimize bitfield_select(a, b, 0) to iand(a, b)
44217ec681f3Smrg- aco: fix combining add/sub to b2i if a new dest needs to be allocated
44227ec681f3Smrg
44237ec681f3SmrgSerge Martin (13):
44247ec681f3Smrg
44257ec681f3Smrg- clover: set LLVM min version to 8.0.1
44267ec681f3Smrg- clover: implements clEnqueueMigrateMemObjects
44277ec681f3Smrg- clover: implements clEnqueueFillImage
44287ec681f3Smrg- clover: implements clGetKernelArgInfo
44297ec681f3Smrg- clover: bind sampler_t type to module::argument::sampler
44307ec681f3Smrg- clover: add CL_KERNEL_ATTRIBUTES for clGetKernelInfo
44317ec681f3Smrg- clover: implements clGetKernelWorkGroupInfo CL_KERNEL_COMPILE_WORK_GROUP_SIZE
44327ec681f3Smrg- clover: implements notification callback on program builds
44337ec681f3Smrg- clover: avoid adding an extra space to compiler options
44347ec681f3Smrg- clover: move tokenize function to algorithm
44357ec681f3Smrg- clover: validate image_row_pitch and image_slice_pitch in clEnqueueMapImage
44367ec681f3Smrg- clover: clCreateImage: calculate image row_pitch and slice_pitch when not provided
44377ec681f3Smrg- clover: implements clSetContextDestructorCallback
44387ec681f3Smrg
44397ec681f3SmrgSuresh Guttula (2):
44407ec681f3Smrg
44417ec681f3Smrg- gallium: update abs_delta segementation parameter
44427ec681f3Smrg- radeon/vcn : Corrected dpb_size calculation for VP9_2
44437ec681f3Smrg
44447ec681f3SmrgTapani Pälli (16):
44457ec681f3Smrg
44467ec681f3Smrg- anv: add a check for depthStencilState before using it
44477ec681f3Smrg- anv: null check for buffer before reading size
44487ec681f3Smrg- anv: take depth in to account in anv_GetImageSubresourceLayout
44497ec681f3Smrg- mesa: refactor floating point texture fbo completeness check on gles
44507ec681f3Smrg- mesa: add EXT_color_buffer_half_float plumbing
44517ec681f3Smrg- mesa/st: enable EXT_color_buffer_half_float when formats supported
44527ec681f3Smrg- glsl: mark some builtins with correct glsl(es) version check
44537ec681f3Smrg- iris: remove additional pipe control done before hiz for older gens
44547ec681f3Smrg- glsl: take EXT_gpu_shader4 in to account when adding round
44557ec681f3Smrg- gallivm/nir: handle nir_op_flt in lp_build_nir_llvm
44567ec681f3Smrg- iris: fix the order of src and dst for fence memcpy
44577ec681f3Smrg- mesa/st: call memobj_destroy only if there is memory imported
44587ec681f3Smrg- mesa: do not throw \_mesa_problem when invalid enum is used
44597ec681f3Smrg- mesa/st: use a lock to protect access to variants when updating them
44607ec681f3Smrg- egl/dri2: fix race between image create and egl_image_target_texture
44617ec681f3Smrg- iris: initialize shared screen->vtbl only once
44627ec681f3Smrg
44637ec681f3SmrgThong Thai (10):
44647ec681f3Smrg
44657ec681f3Smrg- radeon/vcn: fix jpeg decode for navi10
44667ec681f3Smrg- frontends/va: Add support for NV12/P010/P016 to vaDeriveImage
44677ec681f3Smrg- frontends/va: Derive image from interlaced buffers
44687ec681f3Smrg- frontends/va: Derive image from interlaced buffers in some cases
44697ec681f3Smrg- gallium: Parse packed HEVC SPS encode header for crop parameters
44707ec681f3Smrg- radeon: Pass HEVC encode crop parameters to the encoder
44717ec681f3Smrg- frontends/va: Enabled packed headers for H.264 encoder
44727ec681f3Smrg- gallium/auxiliary/vl: Include src region in scale_y calculation
44737ec681f3Smrg- frontends/va/postproc: Un-break field flag
44747ec681f3Smrg- frontends/va: Return P010/P016 as possible surface formats when encoding
44757ec681f3Smrg
44767ec681f3SmrgTimothy Arceri (15):
44777ec681f3Smrg
44787ec681f3Smrg- i965: add support for force_gl_vendor
44797ec681f3Smrg- disk_cache: move cache dir generation into OS specific helper file
44807ec681f3Smrg- disk_cache: add disk_cache_enabled() helper
44817ec681f3Smrg- disk_cache: move index mmap into OS specific helper
44827ec681f3Smrg- disk_cache: move munmap into an OS specific helper
44837ec681f3Smrg- disk_cache: move evict_lru_item() to an OS specific helper
44847ec681f3Smrg- disk_cache: create new helper for writing cache items to disk
44857ec681f3Smrg- disk_cache: move get_cache_file() to an OS specific helper
44867ec681f3Smrg- disk_cache: add new OS specific helper disk_cache_evict_item()
44877ec681f3Smrg- disk_cache: move cache item loading code into disk_cache_load_item() helper
44887ec681f3Smrg- glsl: don't duplicate state vars as uniforms in the NIR linker
44897ec681f3Smrg- util/disk_cache: remove unused function param
44907ec681f3Smrg- glsl: relax rule on varying matching for shaders older than 4.00
44917ec681f3Smrg- glsl: add extra pp tokens workaround and enable for CoR
44927ec681f3Smrg- glsl: drop NMS OpenGL workarounds
44937ec681f3Smrg
44947ec681f3SmrgTimur Kristóf (50):
44957ec681f3Smrg
44967ec681f3Smrg- aco: Fix unused variable warning by adding ASSERTED.
44977ec681f3Smrg- aco: Fix convert_to_SDWA when instruction has 3 operands.
44987ec681f3Smrg- aco: Move README to README-ISA
44997ec681f3Smrg- aco: Fixup markdown formatting of the README-ISA.
45007ec681f3Smrg- aco: Add README which explains about what ACO is and how it works.
45017ec681f3Smrg- aco: Fix emit_boolean_exclusive_scan in wave32 mode.
45027ec681f3Smrg- aco: Clean up emit_mbcnt.
45037ec681f3Smrg- aco: Add base argument to emit_mbcnt.
45047ec681f3Smrg- aco: Use NIR IO semantics for tess factor IO locations.
45057ec681f3Smrg- radv/aco: Set I/O variable locations outside ACO.
45067ec681f3Smrg- nir: Add ability to count emitted GS primitives.
45077ec681f3Smrg- nir: Add ability to count emitted GS vertices per primitive.
45087ec681f3Smrg- nir: Add ability to overwrite incomplete GS primitives.
45097ec681f3Smrg- nir: Count vertices per stream.
45107ec681f3Smrg- nir: Add ability to count primitives per stream.
45117ec681f3Smrg- radv/aco: Use new GS lowering options for ACO with NGG GS.
45127ec681f3Smrg- aco: Clarify missing export error message in assembler.
45137ec681f3Smrg- aco: Extract lanecount_to_mask to a separate function.
45147ec681f3Smrg- aco: Extract thread_id_in_threadgroup to a separate function.
45157ec681f3Smrg- aco: Use thread_id_in_threadgroup helper for ES outputs.
45167ec681f3Smrg- aco: Optimize thread_id_in_threadgroup when there is just one wave.
45177ec681f3Smrg- aco: Add wave-specific opcode for s_lshl and s_flbit.
45187ec681f3Smrg- aco/ngg: Refactor gs_alloc_req in preparation for NGG GS.
45197ec681f3Smrg- aco/ngg: Refactor ngg_emit_prim_export in preparation for NGG GS.
45207ec681f3Smrg- aco/ngg: Make primitive export packing less prone to error.
45217ec681f3Smrg- aco/ngg: Clean up and reorganize NGG VS/TES code.
45227ec681f3Smrg- aco/ngg: Allow NGG GS to store ES outputs.
45237ec681f3Smrg- aco/ngg: Allow NGG GS to load per-vertex GS inputs.
45247ec681f3Smrg- aco/ngg: Allow NGG GS to create VS exports.
45257ec681f3Smrg- aco/ngg: Setup NGG GS.
45267ec681f3Smrg- aco/ngg: Create LDS layout for NGG GS.
45277ec681f3Smrg- aco/ngg: Implement workgroup reduce / exclusive scan for NGG GS.
45287ec681f3Smrg- aco/ngg: Implement NGG GS output.
45297ec681f3Smrg- aco/ngg: Place workgroup barrier outside control flow for NGG GS.
45307ec681f3Smrg- aco/ngg: Add shader query support to NGG GS.
45317ec681f3Smrg- radv/aco: Enable NGG GS by default.
45327ec681f3Smrg- aco/ngg: Use more efficient LDS layout to help reduce bank conflicts.
45337ec681f3Smrg- aco/ngg: Allocate NGG GS space early for const vertex/primitive counts.
45347ec681f3Smrg- aco/ngg: Calculate workgroup size of NGG shaders.
45357ec681f3Smrg- nir: Emit set_vertex_and_primitive_count for inactive streams.
45367ec681f3Smrg- aco/ngg: Add assertion to make sure we always know the vertex count.
45377ec681f3Smrg- aco: Assert that workgroup barriers are not used inappropriately.
45387ec681f3Smrg- aco/ngg: Put shader query reduction operand into a VGPR.
45397ec681f3Smrg- aco: Add some validation for PSEUDO_REDUCTION instructions.
45407ec681f3Smrg- aco: Make emitting reduction instructions a bit more convenient.
45417ec681f3Smrg- aco: Add a few assertions about LDS usage.
45427ec681f3Smrg- aco/ngg: Export a zero-area triangle when primitive count is 0.
45437ec681f3Smrg- aco/ngg: Incorporate GS invocations into workgroup size calculation.
45447ec681f3Smrg- aco/optimizer: Only set scc_needed when it is actually needed.
45457ec681f3Smrg- aco: Fix NGG GS assert failure from the WG scan.
45467ec681f3Smrg
45477ec681f3SmrgTomeu Vizoso (16):
45487ec681f3Smrg
45497ec681f3Smrg- Revert "CI: temp disable t720/t760 jobs."
45507ec681f3Smrg- Revert "CI: Disable Panfrost T720/T760 CI"
45517ec681f3Smrg- ci: Split traces.yml file per driver
45527ec681f3Smrg- ci: Test Panfrost with more traces
45537ec681f3Smrg- ci: Fix URL to imagediff page in traces dashboard
45547ec681f3Smrg- ci: Update kernel used in LAVA to 5.8-based drm-misc
45557ec681f3Smrg- ci: Run deqp-gles2 on RadeonSI
45567ec681f3Smrg- ci: Run deqp-gles3 and deqp-gles31 on RadeonSI
45577ec681f3Smrg- ci: Update kernel for LAVA
45587ec681f3Smrg- ci: Test Panfrost on Khadas VIM3 boards
45597ec681f3Smrg- ci: Disable pm_runtime and max clocks in LAVA jobs
45607ec681f3Smrg- ci: Unskip fragment_ops tests on Bifrost
45617ec681f3Smrg- virgl: Correctly align size of blobs
45627ec681f3Smrg- ci: Update kernel for LAVA to 5.10-rc2 plus patches
45637ec681f3Smrg- ci: Update dEQP skips and fails for Bifrost on G52
45647ec681f3Smrg- ci: Distribute ADMGPU driver to LAVA as a module
45657ec681f3Smrg
45667ec681f3SmrgTony Wasserka (26):
45677ec681f3Smrg
45687ec681f3Smrg- nir/lower_idiv: Port recent LLVM fixes to emit_udiv
45697ec681f3Smrg- radv: Fix various non-critical integer overflows
45707ec681f3Smrg- aco: Fix integer overflows when emitting parallel copies during RA
45717ec681f3Smrg- amd/common: Fix various non-critical integer overflows
45727ec681f3Smrg- aco/isel: Turn the function template emit_load into a proper function
45737ec681f3Smrg- aco/isel: Simplify nested branching code
45747ec681f3Smrg- aco/isel: Consistently use references for input parameters in emit_load
45757ec681f3Smrg- aco/isel: Remove unused definitions
45767ec681f3Smrg- aco/isel: Move context initialization code to a dedicated file
45777ec681f3Smrg- aco/isel: Move add_startpgm to aco_instruction_selection.cpp
45787ec681f3Smrg- aco/isel: Compile all helper functions with static linkage
45797ec681f3Smrg- nir: Fix undefined behavior due to signed integer multiplication overflows
45807ec681f3Smrg- nir: Fix unaligned pointer access
45817ec681f3Smrg- radv: Avoid calling memcpy with null pointers
45827ec681f3Smrg- radv: Fix unaligned memory access when writing specialization map entries
45837ec681f3Smrg- radv: Clean up CreateDescriptorSetLayout
45847ec681f3Smrg- radv: Respect alignment requirements in descriptor set layouts
45857ec681f3Smrg- aco/isel: Fix out-of-bounds write in visit_load_input
45867ec681f3Smrg- aco/isel: Always export position data from VS/NGG
45877ec681f3Smrg- aco/isel: Remove some dead code
45887ec681f3Smrg- aco/isel: Remove now unused VS-related code from create_null_export
45897ec681f3Smrg- aco: Use strong typing to model SW<->HW stage mappings
45907ec681f3Smrg- aco: Clean up symbol names and comments related to NGG
45917ec681f3Smrg- aco/isel: Miscellaneous cleanups using the new Stage API
45927ec681f3Smrg- aco/ra: Fix counting of subdword variables in get_reg_create_vector
45937ec681f3Smrg- aco: Fix format string used when raising validation errors
45947ec681f3Smrg
45957ec681f3SmrgVeerabadhran Gopalakrishnan (1):
45967ec681f3Smrg
45977ec681f3Smrg- frontends/va: Added protected playback support for VP9
45987ec681f3Smrg
45997ec681f3SmrgVinson Lee (64):
46007ec681f3Smrg
46017ec681f3Smrg- util: Fix memory leaks in unit test.
46027ec681f3Smrg- meson: Fix lmsensors warning message.
46037ec681f3Smrg- radv/winsys: Fix memory leak.
46047ec681f3Smrg- vulkan: Fix memory leaks.
46057ec681f3Smrg- panfrost: Fix gnu-empty-initializer errors.
46067ec681f3Smrg- freedreno: Fix file descriptor leak.
46077ec681f3Smrg- svga: Fix unused printf argument.
46087ec681f3Smrg- spirv: Initialize spirv_test member shader.
46097ec681f3Smrg- nv50/ir: Add fallthrough statement.
46107ec681f3Smrg- nv50/ir: Remove duplicate mask assignment.
46117ec681f3Smrg- ac/llvm: Fix nonportable sizeof.
46127ec681f3Smrg- freedreno: Check file descriptor before write.
46137ec681f3Smrg- nv50/ir: Initialize Converter members.
46147ec681f3Smrg- libgl-gdi: Fix unused-variable warnings.
46157ec681f3Smrg- disk_cache: Fix filename leak on error path.
46167ec681f3Smrg- radesonsi: Remove unsigned comparison to zero.
46177ec681f3Smrg- panfrost: Delete debug allocated syncobj.
46187ec681f3Smrg- turnip: Release bo_mutex lock before potential error path.
46197ec681f3Smrg- pan/bi: Fix typo.
46207ec681f3Smrg- glsl: Initialize ir_constant member const_elements in all constructors.
46217ec681f3Smrg- r600/sfn: Initialize GPRValue member m_pin_to_channel.
46227ec681f3Smrg- gallium/dri2: Move image->texture assignment after image NULL check.
46237ec681f3Smrg- panfrost: Remove extra printf arguments.
46247ec681f3Smrg- anv: Check file descriptor before closing.
46257ec681f3Smrg- aco: Initialize mad_info member literal_idx.
46267ec681f3Smrg- gallium/swr: Remove unreachable code.
46277ec681f3Smrg- pan/mdg: Fix memory leak on error path.
46287ec681f3Smrg- lima: Print usage if --help is any of the arguments.
46297ec681f3Smrg- radv: Fix asserts using assign instead of compare.
46307ec681f3Smrg- nv50/ir: Initialize Source members.
46317ec681f3Smrg- freedreno: Move rsc NULL check to before rsc dereferences.
46327ec681f3Smrg- intel/vec4: Remove leftover code from Gen8+ removal.
46337ec681f3Smrg- glsl: Initialize ast_node member field location.path in constructor.
46347ec681f3Smrg- meson: Use more portable compiler option -std.
46357ec681f3Smrg- swr/rasterizer: Remove BuilderGfxMem member mpTrackMemAccessFuncTy.
46367ec681f3Smrg- util/xmlconfig: Initialize xmlconfig member options in constructor.
46377ec681f3Smrg- svga: Remove unused printf argument.
46387ec681f3Smrg- glsl: Initialize ir_to_mesa_visitor members in constructor.
46397ec681f3Smrg- v3dv: Fix assert using assign instead of compare.
46407ec681f3Smrg- glsl: Initialize lower_ubo_reference_visitor members in constructor.
46417ec681f3Smrg- glsl: Initialize add_uniform_to_shader member var in constructor.
46427ec681f3Smrg- v3dv: Remove unsigned comparison to zero.
46437ec681f3Smrg- v3dv: Initialize time before usage by free_stale_bos.
46447ec681f3Smrg- panfrost: Fix stride for AFBC_FORMAT_MOD_BLOCK_SIZE_32x8.
46457ec681f3Smrg- v3dv: Fix assert using assign instead of compare.
46467ec681f3Smrg- glsl: Initialize ir_if_to_cond_assign_visitor members in constructor.
46477ec681f3Smrg- glsl: Initialize lower_shared_reference_visitor members.
46487ec681f3Smrg- scons/windows: Support build with LLVM 11.
46497ec681f3Smrg- amd/addrlib: Initialize Gfx10Lib members in constructor.
46507ec681f3Smrg- Fix VMware capitalization.
46517ec681f3Smrg- glsl: Update loop_terminator constructor to accept parameters.
46527ec681f3Smrg- draw: Remove draw_install_aaline_stage dead code.
46537ec681f3Smrg- os: Fix open result check.
46547ec681f3Smrg- gallium: Remove duplicate resource variable.
46557ec681f3Smrg- tgsi: Initialize tgsi_declaration_dimension padding.
46567ec681f3Smrg- radesonsi: Remove unnecessary shader->selector NULL check.
46577ec681f3Smrg- amd/addrlib: Add missing va_end.
46587ec681f3Smrg- v3dv: Remove unsigned comparison to zero.
46597ec681f3Smrg- st/nine: Remove unnecessary NULL check.
46607ec681f3Smrg- turnip: Fix file descriptor return.
46617ec681f3Smrg- vdpau: Add missing printf format specifier.
46627ec681f3Smrg- frontends/va: Fix \*num_entrypoints check.
46637ec681f3Smrg- clover/spirv: Add missing break for SpvOpExecutionMode case.
46647ec681f3Smrg- turnip: Close sync_fd only if it is a valid file descriptor.
46657ec681f3Smrg
46667ec681f3SmrgWoody Chow (1):
46677ec681f3Smrg
46687ec681f3Smrg- st/mesa: Fix EGLImageTargetTexture2D for GL_TEXTURE_2D
46697ec681f3Smrg
46707ec681f3SmrgYevhenii Kolesnikov (1):
46717ec681f3Smrg
46727ec681f3Smrg- nir/large_constants: only search for constant duplicates
46737ec681f3Smrg
46747ec681f3SmrgYogesh Mohan Marimuthu (1):
46757ec681f3Smrg
46767ec681f3Smrg- src/mesa: add GL_NV_half_float extension support (v2)
46777ec681f3Smrg
46787ec681f3Smrgjzielins (4):
46797ec681f3Smrg
46807ec681f3Smrg- gallium/swr: Fix compilation with LLVM 12
46817ec681f3Smrg- gallium/swr: Fix compilation TCS/TES compilation issues
46827ec681f3Smrg- swr: Fix crashes on non-AVX hardware
46837ec681f3Smrg- swr: Use ElemenCount constructor for LLVM 11
46847ec681f3Smrg
46857ec681f3Smrgn00b7 (1):
46867ec681f3Smrg
46877ec681f3Smrg- v3dv/device: handle primary nodes for newer kernels
46887ec681f3Smrg
46897ec681f3Smrgorbea (1):
46907ec681f3Smrg
46917ec681f3Smrg- spirv/vtn_cfg.c: Include util/debug.h for env_var_as_boolean.
46927ec681f3Smrg
46937ec681f3Smrgzhu yong (1):
46947ec681f3Smrg
46957ec681f3Smrg- meson: add support for loongson's mips/mips64 arch.
4696