17ec681f3SmrgMesa 21.1.0 Release Notes / 2021-05-05 27ec681f3Smrg====================================== 37ec681f3Smrg 47ec681f3SmrgMesa 21.1.0 is a new development release. People who are concerned 57ec681f3Smrgwith stability and reliability should stick with a previous release or 67ec681f3Smrgwait for Mesa 21.1.1. 77ec681f3Smrg 87ec681f3SmrgMesa 21.1.0 implements the OpenGL 4.6 API, but the version reported by 97ec681f3SmrgglGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / 107ec681f3SmrgglGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. 117ec681f3SmrgSome drivers don't support all the features required in OpenGL 4.6. OpenGL 127ec681f3Smrg4.6 is **only** available if requested at context creation. 137ec681f3SmrgCompatibility contexts may report a lower version depending on each driver. 147ec681f3Smrg 157ec681f3SmrgMesa 21.1.0 implements the Vulkan 1.2 API, but the version reported by 167ec681f3Smrgthe apiVersion property of the VkPhysicalDeviceProperties struct 177ec681f3Smrgdepends on the particular driver being used. 187ec681f3Smrg 197ec681f3SmrgSHA256 checksum 207ec681f3Smrg--------------- 217ec681f3Smrg 227ec681f3Smrg:: 237ec681f3Smrg 247ec681f3Smrg 0128f10e22970d3aed3d1034003731f94623015cd9797c07151417649c1b1ff8 mesa-21.1.0.tar.xz 257ec681f3Smrg 267ec681f3Smrg 277ec681f3SmrgNew features 287ec681f3Smrg------------ 297ec681f3Smrg 307ec681f3Smrg- VK_KHR_workgroup_memory_explicit_layout on Intel, RADV 317ec681f3Smrg 327ec681f3Smrg- DRM format modifiers for AMD. 337ec681f3Smrg 347ec681f3Smrg- VK_KHR_zero_initialize_workgroup_memory on Intel, RADV 357ec681f3Smrg 367ec681f3Smrg- Zink exposes GL 4.6 and ES 3.1 377ec681f3Smrg 387ec681f3Smrg- GL_EXT_depth_bounds_test on softpipe, zink 397ec681f3Smrg 407ec681f3Smrg- GL_EXT_texture_filter_minmax on nvc0 (gm200+) 417ec681f3Smrg 427ec681f3Smrg- GL_ARB_texture_filter_minmax on nvc0 (gm200+) 437ec681f3Smrg 447ec681f3Smrg- GL_ARB_post_depth_coverage on zink 457ec681f3Smrg 467ec681f3Smrg- VK_KHR_copy_commands2 on lavapipe 477ec681f3Smrg 487ec681f3Smrg- lavapipe exposes Vulkan 1.1 497ec681f3Smrg 507ec681f3Smrg- VRS attachment on RADV 517ec681f3Smrg 527ec681f3Smrg- None 537ec681f3Smrg 547ec681f3Smrg 557ec681f3SmrgBug fixes 567ec681f3Smrg--------- 577ec681f3Smrg 587ec681f3Smrg- No sRGB capable visuals/fbconfigs reported in glx 597ec681f3Smrg- Graphics corruption and GPU hang with RADV/LLVM 607ec681f3Smrg- old kernels (4.19) support in radv 617ec681f3Smrg- Elite Dangerous: Odyssey alpha crashes GPU on launch 627ec681f3Smrg- CSGO: Some default variables can cause problems with trust mode 637ec681f3Smrg- mesa git started to break wine + UnrealTournament.exe (old dx6 game) 647ec681f3Smrg- SuperTuxKart artifacting on RK3399 657ec681f3Smrg- [amdgpu]: Golf With Your Friends (431240): ERROR Waiting for fences timed out 667ec681f3Smrg- Strange results when trying to read from VK_FORMAT_R64_SFLOAT in compute shader 677ec681f3Smrg- anv: dEQP-VK.binding_model.buffer_device_address.set3.depth3.basessbo.convertcheck* slow 687ec681f3Smrg- Iris doesn't support INTEL_performance_query anymore 697ec681f3Smrg- RADV: TRUNC_COORD breaks gather operations 707ec681f3Smrg- [RADV] corruption in avatar after dying in Heroes of the Storm 717ec681f3Smrg- Metro Exodus crashing due to memory overflow 727ec681f3Smrg- Sauerbraten shader rendering broken on RV530 (r300g) 737ec681f3Smrg- texture glitches on CS:GO on Tiger Lake 747ec681f3Smrg- Build fail due to "parameter name omitted" on Gallium Nine 757ec681f3Smrg- Non-DRI builds broken by recent cleanups in Mesa core 767ec681f3Smrg- Cinnamon core dump after installing latest oibaf mesa build (165a69d2) 777ec681f3Smrg- yuv sampler lowering regression 787ec681f3Smrg- anv: anv_descriptor_set_binding_layout::array_size overflows u16 797ec681f3Smrg- RADV - Vertex explosion in DIRT 5 on RDNA2 807ec681f3Smrg- ci: Use renderdoc from debian 817ec681f3Smrg- ci: Use debian apitrace in x86 images 827ec681f3Smrg- SIGSEV in v3d_emit_gl_shader_state 837ec681f3Smrg- Xorg crash due to assertion failure after GPU soft reset 847ec681f3Smrg- AMD hevc_vaapi ffmpeg encoding = wrong image width (48px black bar on the right) 857ec681f3Smrg- panfrost: Page fault in glamor when running GIMP with X11 on Mali T860 867ec681f3Smrg- gallium: python trace scripts need updating 877ec681f3Smrg- EGL context creation fails when EGL_KHR_create_context_no_error is mentioned for OpenGL ES 1.1. 887ec681f3Smrg- [spirv-fuzz] NIR validation failed after spirv_to_nir: error: nir_block_dominates 897ec681f3Smrg- [bisected][regression][i965,iris] dEQP-VK.clipping.user_defined.clip_cull_distance.* failing on multiple platforms 907ec681f3Smrg- No Mans Sky GPU hang on Radeon ACO 917ec681f3Smrg- radeonsi: prusa-slicer crashes on mesa 21 927ec681f3Smrg- anv: dEQP-VK.subgroups.ballot_broadcast.compute.subgroupbroadcast_i8vec3_requiredsubgroupsize16 fails on ICL, TGL 937ec681f3Smrg- [radeonsi] Rendering of Firefox UI and website content corrupts randomly and after window geometry changes 947ec681f3Smrg- max_binding inconsistency in vulkan descriptor set drivers 957ec681f3Smrg- anv: conservative rasterization ext question mark 967ec681f3Smrg- Unigine Tropics MSAA failure 977ec681f3Smrg- warning: xnack 'Off' was requested for a processor that does not support it! [AMD VEGAM with LLVM 12.0.0] 987ec681f3Smrg- Compiling some ARB assembly shaders leads to memory corruption 997ec681f3Smrg- Specifying an explicit location for an array output messes up transform feedback 1007ec681f3Smrg- Request for VK_EXT_conservative_rasterization support on Anvil Intel driver for newest DXVK.. 1017ec681f3Smrg- radv: dEQP regressions after addrlib update 1027ec681f3Smrg- Up to 30% performance drop (GLBenchmark, GfxBench) 1037ec681f3Smrg- DOTA 2 don't no longer starts since commit ad241b15a9e517dd4c4e8d7b1d5dab7c3a74b37c 1047ec681f3Smrg- Clover doesn't work for kmsro drivers 1057ec681f3Smrg- aco_tests isel.sparse.clause fails with llvm-12 1067ec681f3Smrg- util cpu detection breaks on 128-core AMD machines 1077ec681f3Smrg- util cpu detection breaks on 128-core AMD machines 1087ec681f3Smrg- Default GL_MAX_TEXTURE_BUFFER_SIZE very small 1097ec681f3Smrg- intel_nullhw.c:41:38: error: field ‘vtable’ has incomplete type 1107ec681f3Smrg- ACO error with GCN 1 GPU 1117ec681f3Smrg- kmsro advertises EGL_MESA_device_software 1127ec681f3Smrg- d3d12: Use ID3D12Device9::CreateCommandQueue1 when available 1137ec681f3Smrg- [RADV] Halo: The Master Chief Collection: Crash in Halo Reach Firefight 1147ec681f3Smrg- freedreno: use SAMPLE_COUNT to autotune sysmem vs gmem 1157ec681f3Smrg- freedreno: draw_vbo optimizations 1167ec681f3Smrg- [Bisected][RadeonSI] Mesa crashes when rendering with Eevee in Blender 1177ec681f3Smrg- subgroupBallotFindMSB() broken in RADV/ACO 20.3.4 1187ec681f3Smrg- nir_print: util_cpu_detect() is not called prior to _mesa_half_to_float() 1197ec681f3Smrg- turnip: buffer overflow read on dEQP-VK.ycbcr.query.levels.tess_eval.r8g8b8a8_unorm 1207ec681f3Smrg- RuneScape crashes GLOn12 1217ec681f3Smrg- d3d12: Surfaces need to use shareable descriptors 1227ec681f3Smrg- [RADV][RDNA2] Red Dead Redemption 2 image glitches during menu/overlay menu transitions 1237ec681f3Smrg- "unknown intrinsic" assertion triggered by multiview shader in non-multiview renderpass in Vulkan on intel 1247ec681f3Smrg- [i965][g965,ilk,g33,g45][bisected] dEQP-GLES2.functional.fbo.completeness.attachment_combinations.* failures 1257ec681f3Smrg- radv: VK_FORMAT_FEATURE_SAMPLED_IMAGE_BIT exposed for VK_FORMAT_R64_SFLOAT 1267ec681f3Smrg- anv: android building error after commit 4fb6c05 1277ec681f3Smrg- Compiling mesa with -Dtools=all throws deprecation warnings for intel tools 1287ec681f3Smrg- DXVK is broken in latest master 1297ec681f3Smrg- OpenCLOn12: Affinity Photo CL kernels produce invalid DXIL phis 1307ec681f3Smrg- nir -> tgsi conversion problem 1317ec681f3Smrg- [i965][g965,ilk,g33][bisected] fp16 enablement causes deqp test failures 1327ec681f3Smrg- mesa/st: Uniforms are not updated after lowering alpha test 1337ec681f3Smrg- [i965][bisected][regression] piglit failing primitive-restart-vbo_combined_vertex_and_index on multiple platforms 1347ec681f3Smrg- [RADV] Nioh 2 - The Complete Edition: "Bloom" on lights 1357ec681f3Smrg- [RADV] Oblivion: Poor Performance while MSAA Is Enabled 1367ec681f3Smrg- lima context state bugs with shader compile 1377ec681f3Smrg- [RADV][BISECTED] The Surge 2 (644830) - In-game assets do not render correctly since 20.3.4. 1387ec681f3Smrg- amd clang cannot convert ‘llvm::AtomicOrdering’ to ‘llvm::MaybeAlign’ build failure 1397ec681f3Smrg- [iris][icl,tgl][bisected][regression] failure on piglit.spec.arb_separate_shader_objects.programuniform coverage 1407ec681f3Smrg- opencl build fail 1417ec681f3Smrg- anv: dEQP-VK.glsl.builtin_var.fragcoord_msaa.* fails 1427ec681f3Smrg- Request - depth format feature SAMPLED_IMAGE_FILTER_LINEAR 1437ec681f3Smrg- "radeonsi: Check pitch and offset for validity." is a bad commit 1447ec681f3Smrg- Add OpenCL information to docs/features.txt 1457ec681f3Smrg- [regression] [bisected] piglit.spec.arb_framebuffer_object.fbo-drawbuffers-none gldrawpixels fails 1467ec681f3Smrg- RADV: robustBufferAccessUpdateAfterBind is not exposed 1477ec681f3Smrg- debug build compilation failed: inlining failed in call to ‘always_inline’ ‘_nir_visit_dest_indirect’: indirect function call with a yet undetermined callee 1487ec681f3Smrg- [RADV/DXVK] Shadow artifacts with different games 1497ec681f3Smrg- glxgears segfaults with classic i915 1507ec681f3Smrg- mesa_glthread=true Black Mesa 1517ec681f3Smrg- freedreno: rendering corruption in dead cells 1527ec681f3Smrg- ac/rgp: Android building error after commit 12515d6 1537ec681f3Smrg- d3d12: Assert failures & crashes on latest master 1547ec681f3Smrg- RADV/ACO - DCC causing garbled output on RX570 1557ec681f3Smrg- draw.c:121: _mesa_set_draw_vao: Assertion \`vao->_EnabledWithMapMode == _mesa_vao_enable_to_vp_inputs(vao->_AttributeMapMode, vao->Enabled)' failed. 1567ec681f3Smrg- ANV: Weird jitter in Witcher 1 1577ec681f3Smrg- RADV - Path of Exile: Shimmering outlines where water and other objects meet 1587ec681f3Smrg- ANV: Weird jitter in Witcher 1 1597ec681f3Smrg- ANV: Weird jitter in Witcher 1 1607ec681f3Smrg- meson: meson-built libraries have inconsistent compatability / current versions compared to older autotools-built libraries 1617ec681f3Smrg- device select layer breaks other layers 1627ec681f3Smrg- RADV: Extreme overhead in vkQueueSubmit 1637ec681f3Smrg- Graphical glitch of popupping missing texture on Mesa version >18.0.5 (Padoka Stable + Unstable/Oibaf/ubuntu-x-swat PPAs) 1647ec681f3Smrg- [regression] [bisected] dEQP-GLES2.functional.fbo.render.stencil_clear.rbo_rgb5_a1_stencil_index8 fails 1657ec681f3Smrg- occasional corruption issue with RADV in multiple games, disappears after using amdvlk 1667ec681f3Smrg- panfrost T860 regression 1677ec681f3Smrg- OpenGL on GMA4500MHD 1687ec681f3Smrg- piglit-replay: JUnit file contains wrong links to the tracie dashboard 1697ec681f3Smrg- R8 texture upload / corruption bug on Radeon RX 5700 XT 1707ec681f3Smrg- Ambient Occlusion in Two Point Hospital shows black spot artifacts 1717ec681f3Smrg- freedreno: async background shader compile 1727ec681f3Smrg- AMD VAAPI encoding has ceased to work 1737ec681f3Smrg- Rage 2: Visual corruption on in-game menu with ACO. 1747ec681f3Smrg- ACO doesn't correctly render map in Borderlands 3 vs. LLVM on 5700 XT 1757ec681f3Smrg- Invalid shader under panfrost/wayland 1767ec681f3Smrg- Strange Brigade refuses to load correctly since some recent commits 1777ec681f3Smrg- GLonD3D12: Crashes and suboptimal fallback 1787ec681f3Smrg- GLonD3D12: Crashes and suboptimal fallback 1797ec681f3Smrg- GLonD3D12: Crashes and suboptimal fallback 1807ec681f3Smrg- [RADV][REGRESSION][BISECTED] radv_GetMemoryFdPropertiesKHR returns no valid memory types for vaapi drmbuf 1817ec681f3Smrg- anv: vkQueueSubmit with waitSemaphore value of 0 hangs CPU 1827ec681f3Smrg- ttn: invalid base/range triggering nir_validate assertion 1837ec681f3Smrg- Sampling with mipmapped HiZ behaves unexpectedly on Gen9 1847ec681f3Smrg- zink: ARB_map_buffers issues on CI 1857ec681f3Smrg- u_upload_mgr: assert failure for large uploads 1867ec681f3Smrg- [RADV][ACO] Overwatch game crash: amd/compiler/aco_insert_exec_mask.cpp: Failed Assertion 1877ec681f3Smrg- PRIME render offloading broken 1887ec681f3Smrg- Use out encoding for float immediates 1897ec681f3Smrg- [RADV] Severe performance drop when exceeding VRAM compared to AMDVLK 1907ec681f3Smrg- LIBGL_ALWAYS_SOFTWARE=1 picks zink over actual software rasterizers 1917ec681f3Smrg- crash/assert in fd_set_viewport_states 1927ec681f3Smrg- RADV: Occlusion query hangs Big Navi GPU 1937ec681f3Smrg- "mesa: don't allocate matrices with malloc" cause eglCreateContext problem on android 7. 1947ec681f3Smrg- Metal Gear Solid V: The Phantom Pain: texture issues and vertex stretches 1957ec681f3Smrg- [iris and Navi 10] piglit.spec.arb_multi_draw_indirect.arb_draw_elements_base_vertex-multidrawelements -indirect regression 1967ec681f3Smrg- miscompiled compute shader loop on llvmpipe (and Iris) 1977ec681f3Smrg- ci: minio caching of arm64 artifacts for bare-metal 1987ec681f3Smrg- Graphics glitches after upgrade to mesa 20.3 on Khadas VIM3 Pro (Mali G52 GPU) 1997ec681f3Smrg- glthread crash in _mesa_glthread_upload 2007ec681f3Smrg- freedreno piglit flakes 2017ec681f3Smrg- RADV: NonUniform OpArrayLength on SSBO ignores NonUniform. 2027ec681f3Smrg- Iris driver causing graphics glitch in QEMU spice egl DMA-BUF 2037ec681f3Smrg- [RADV/ACO] Death Stranding cause a GPU hung (\*ERROR* Waiting for fences timed out!) 2047ec681f3Smrg- [TGL] Elder Scrolls Online misrenders 2057ec681f3Smrg- [ANV] System hang with GRVK demos 2067ec681f3Smrg- ci: Fractional deqp runs with valgrind enabled. 2077ec681f3Smrg- Regression: Segfault in cso_destroy_context() regression in 20.2 2087ec681f3Smrg- Rendering artifacts in Barn Finders specifically on Radeon Vega 2097ec681f3Smrg- Graphics regression in Assassins Creed Odyssey 2107ec681f3Smrg- [ANV] Compilation warnings 2117ec681f3Smrg- regression in !8152 2127ec681f3Smrg- [bdw][icl][iris] fails new test \`clearbuffer-depth-cs-probe` 2137ec681f3Smrg- ci: new traces runner needs dashboard links in the job log and junit 2147ec681f3Smrg- zink: car model corruption with game TORCS 2157ec681f3Smrg- glGetInternalformati64v(GL_TEXTURE_2D, GL_SR8_EXT, GL_COLOR_ENCODING) returns GL_NONE 2167ec681f3Smrg- Windows: 32-bit build is broken hard 2177ec681f3Smrg- ANV: Not handling separate stencil layouts properly 2187ec681f3Smrg- [Regression][Intel][OpenGL][Bisected] Copying whole 2D array texture failed on latest driver 2197ec681f3Smrg- turnip: dEQP-VK.tessellation.invariance.outer_triangle_set.quads_fractional_odd_spacing failure 2207ec681f3Smrg- i915 regressions bisected to "vbo/dlist: use a shared index buffer" 2217ec681f3Smrg- intel: Chrome OS "hatch" (cometlake) fails on dEQP-GLES31.functional.geometry_shading.layered.render_with_default_layer_cubemap 2227ec681f3Smrg- radv: dEQP-VK.sparse_resources.* failures on GFX9 2237ec681f3Smrg- freedreno: rendering corruption in War Robots 2247ec681f3Smrg- radv: dEQP-VK.sparse_resources.* failures on GFX9 2257ec681f3Smrg- Mesa 20.3.x crashes pidgin on AMD RX480 2267ec681f3Smrg- timespec_get used unconditionally / build fails when targeting macOS 10.14 or earlier 2277ec681f3Smrg- libunwind not located / used on macOS 2287ec681f3Smrg- libunwind not located / used on macOS 2297ec681f3Smrg- meson fails to locate libexpat on macOS 2307ec681f3Smrg- CarX Drift Racing 2 fails to start 2317ec681f3Smrg- Some games using FNA framework show blank screen 2327ec681f3Smrg- Intel Vulkan regression of angle_end2end_tests 2337ec681f3Smrg- Defer lavapipe warning to queue / command / swapchain buffer creation 2347ec681f3Smrg 2357ec681f3Smrg 2367ec681f3SmrgChanges 2377ec681f3Smrg------- 2387ec681f3Smrg 2397ec681f3SmrgAaron Watry (2): 2407ec681f3Smrg 2417ec681f3Smrg- clover: implement CL_IMAGE_ELEMENT_SIZE 2427ec681f3Smrg- clover: implement CL_IMAGE_NUM_MIP_LEVELS and CL_IMAGE_NUM_SAMPLES 2437ec681f3Smrg 2447ec681f3SmrgAbhishek Kumar (2): 2457ec681f3Smrg 2467ec681f3Smrg- intel: change urb max shader geometry for CML GT1 2477ec681f3Smrg- intel: change urb max shader geometry for KBL GT1 2487ec681f3Smrg 2497ec681f3SmrgAdam Jackson (66): 2507ec681f3Smrg 2517ec681f3Smrg- docs: Mark some non-core zink extensions complete 2527ec681f3Smrg- docs: Mark some ES3 zink features complete 2537ec681f3Smrg- egl: Fix error string returned by eglQueryDeviceAttribEXT 2547ec681f3Smrg- zink: Factor out instance setup a bit more 2557ec681f3Smrg- osmesa: Pacify MSVC in the test code 2567ec681f3Smrg- glx: Fix GLX_SGI_video_sync for the no-current-drawable case 2577ec681f3Smrg- nir: Silence a warning at -Og 2587ec681f3Smrg- softfloat: Silence a warning at -Og 2597ec681f3Smrg- glsl: Silence a warning at -Og 2607ec681f3Smrg- glsl: Silence some warnings at -Og 2617ec681f3Smrg- glsl: Silence a warning at -Og 2627ec681f3Smrg- loader: Silence a warning at -Og 2637ec681f3Smrg- gallivm: Silence a warning at -Og 2647ec681f3Smrg- nir/ttn: Silence some warnings at -Og 2657ec681f3Smrg- vl: Silence a warning at -Og 2667ec681f3Smrg- gallivm: Silence a warning at -Og 2677ec681f3Smrg- nouveau: Silence some warnings at -Og 2687ec681f3Smrg- nouveau: Silence a warning at -Og 2697ec681f3Smrg- xlib: Fix build regression since 99e25d183d9 2707ec681f3Smrg- gallium/xlib: Partial fix for glXCopySubBufferMESA 2717ec681f3Smrg- mesa: Store depth bounds test bounds as GLclampd 2727ec681f3Smrg- softpipe: Fix depth comparison with float Z formats 2737ec681f3Smrg- softpipe: Implement GL_EXT_depth_bounds_test 2747ec681f3Smrg- docs: Document GL_EXT_depth_bounds_test 2757ec681f3Smrg- zink: Enable GL_EXT_depth_bounds_test 2767ec681f3Smrg- zink: more and better debug printfs 2777ec681f3Smrg- zink: Fix a thinko in instance setup 2787ec681f3Smrg- zink: Wire up ARB_post_depth_coverage 2797ec681f3Smrg- glx: Pull use_x_font out of the context vtable 2807ec681f3Smrg- glx: Pull get_proc_address out of the context vtable 2817ec681f3Smrg- glx: Remove windows' stub {bind,release}_text_image context hooks 2827ec681f3Smrg- glx/drisw: Implement WaitX and WaitGL 2837ec681f3Smrg- dri: Explicitly handle all the config attributes 2847ec681f3Smrg- dri: Fold attribMap into the code 2857ec681f3Smrg- mesa: Remove misc pbuffer attributes from struct gl_config 2867ec681f3Smrg- mesa: Remove the texture-from-pixmap state from struct gl_config 2877ec681f3Smrg- mesa: Remove transparency state from struct gl_config 2887ec681f3Smrg- mesa: Remove unused gl_config::level 2897ec681f3Smrg- mesa: Remove the pretense of aux buffer support 2907ec681f3Smrg- mesa: Stop tracking visual rating in gl_config 2917ec681f3Smrg- mesa: Remove redundant gl_config::sampleBuffers 2927ec681f3Smrg- ci: Bump the llvmpipe test timeout to 240 seconds 2937ec681f3Smrg- mesa/st: Remove unused ST_ATTACHMENT_SAMPLE 2947ec681f3Smrg- mesa/st: Check for successful framebuffer allocation in st_api_make_current 2957ec681f3Smrg- gallium: Remove curious st_visual::no_config 2967ec681f3Smrg- radeon: Exchange one curious idiom for another in radeonMakeCurrent 2977ec681f3Smrg- mesa: Remove unused _mesa_create_framebuffer 2987ec681f3Smrg- mesa: Make _mesa_initialize_visual return void 2997ec681f3Smrg- mesa: Remove unused gl_config::mutableRenderBuffer 3007ec681f3Smrg- mesa: gl_config::rgbBits should count alphaBits too 3017ec681f3Smrg- dri: Don't tie the accum buffer's alpha-ness to the color buffer's 3027ec681f3Smrg- glx: Stop pretending to validate the pbuffer fbconfig attributes 3037ec681f3Smrg- glx: Don't downgrade the visual caveat from the server 3047ec681f3Smrg- glx: Downgrade aux-buffer-ful fbconfigs 3057ec681f3Smrg- glx: Downgrade tfp mipmap-capable fbconfigs 3067ec681f3Smrg- glx: Downgrade sRGB-ful fbconfigs 3077ec681f3Smrg- dri: Use __DRI_BUFFER_COUNT consistently internally 3087ec681f3Smrg- glx: Default sRGBCapable in the same place as the other config attribs 3097ec681f3Smrg- glx: Clean up fbconfig attribute handling 3107ec681f3Smrg- glx: Remove some #if 0'd DRI config attribute fetch 3117ec681f3Smrg- glx: Don't pointlesly add -D_REENTRANT to libGL's cflags 3127ec681f3Smrg- glx: Move {Bind,Release}TexImage from context to screen vtable 3137ec681f3Smrg- glx: Be more robust against null fbconfigs 3147ec681f3Smrg- glx: Lift sending the MakeCurrent request to top-level code 3157ec681f3Smrg- Revert "glx: Lift sending the MakeCurrent request to top-level code" 3167ec681f3Smrg- gallium/xlib: Fix for recent gl_config changes 3177ec681f3Smrg 3187ec681f3SmrgAdrian Ratiu (1): 3197ec681f3Smrg 3207ec681f3Smrg- docs: docker: minor stale documentation fix 3217ec681f3Smrg 3227ec681f3SmrgAlejandro Piñeiro (36): 3237ec681f3Smrg 3247ec681f3Smrg- v3dv/pipeline: enable lower_add_sat NIR option 3257ec681f3Smrg- v3d/compiler: enable lower_add_sat NIR option 3267ec681f3Smrg- v3dv/descriptor: assert CrateDescriptorPool receives valid count values 3277ec681f3Smrg- v3dv: drop v3dv_resource definition 3287ec681f3Smrg- v3dv: properly handle two different binding points for cmd_buffers 3297ec681f3Smrg- v3dv: move to subclassing instance/physical device 3307ec681f3Smrg- v3dv: remove reference to v3dv_instance on v3dv_physical_device 3317ec681f3Smrg- v3dv: port to using common dispatch code. 3327ec681f3Smrg- v3dv: support for depthBiasClamp 3337ec681f3Smrg- v3dv/device: clarify that we can't expose textureCompressionBC 3347ec681f3Smrg- v3dv/formats: expose support for BC1-3 compressed formats 3357ec681f3Smrg- v3dv/meta_copy: get tlb compatible BC compressed formats for copies 3367ec681f3Smrg- v3dv/descriptor_set: don't free individual set if not allowed 3377ec681f3Smrg- v3dv: avoid some maybe-uninitialized warnings 3387ec681f3Smrg- v3dv/pipeline_cache: add more details when dumping debug info 3397ec681f3Smrg- v3dv/pipeline: remove pregenerate_variant 3407ec681f3Smrg- v3dv/pipeline: remove pipeline->use_push_constants 3417ec681f3Smrg- broadcom/compiler: add local_size in v3d_compute_prog_data 3427ec681f3Smrg- broadcom/compiler: add driver_location_map at vs prog data 3437ec681f3Smrg- v3dv/pipeline: use driver_location_map instead of nir utilities 3447ec681f3Smrg- v3dv/pipeline: move topology to pipeline 3457ec681f3Smrg- v3dv/pipeline: remove compiled_variant_count field 3467ec681f3Smrg- v3dv/pipeline: remove v3d_key from shader_variant and pipeline stage 3477ec681f3Smrg- v3dv: define broadcom shader stages 3487ec681f3Smrg- v3dv/pipeline: use broadcom_shader_stage as pipeline/variant stage type 3497ec681f3Smrg- v3dv/pipeline: try to get the shader variant directly from the cache 3507ec681f3Smrg- v3dv/pipeline: don't create a variant if compilation failed 3517ec681f3Smrg- v3dv/pipeline: compute sha1 for no-op fragment shaders correctly 3527ec681f3Smrg- v3dv/device: avoid unused-result warning with asprintf 3537ec681f3Smrg- v3dv: Add support for the on-disk shader cache 3547ec681f3Smrg- v3dv/cmd_buffer: return early for draw commands if there is nothing to draw 3557ec681f3Smrg- v3dv: define a default attribute values with float type 3567ec681f3Smrg- vulkan: track number of bindings instead of max binding for CreateDescriptorSetLayout 3577ec681f3Smrg- v3dv/device: do not compute per-pipeline limits multiplying per-stage 3587ec681f3Smrg- v3dv/device: fix and cleanup v3dv limits 3597ec681f3Smrg- v3dv/pipeline: reduce descriptor_map size 3607ec681f3Smrg 3617ec681f3SmrgAlexander Kapshuk (1): 3627ec681f3Smrg 3637ec681f3Smrg- frontends/va/image: Eliminate repetitive code on error paths 3647ec681f3Smrg 3657ec681f3SmrgAlexander Shi (1): 3667ec681f3Smrg 3677ec681f3Smrg- mesa: texparam: Add a clamping macro to handle out-of-range floats returned as integers. 3687ec681f3Smrg 3697ec681f3SmrgAlexander von Gluck IV (1): 3707ec681f3Smrg 3717ec681f3Smrg- egl/haiku: Fix ConfigID naming inline with mesa 3727ec681f3Smrg 3737ec681f3SmrgAlyssa Rosenzweig (345): 3747ec681f3Smrg 3757ec681f3Smrg- pan/bi: Fix assertion 3767ec681f3Smrg- pan/bi: Pipe scratch_size in from NIR 3777ec681f3Smrg- pan/bi: Fix 64-bit SSBO addresses 3787ec681f3Smrg- pan/bi: Fix RA of node 0 3797ec681f3Smrg- pan/bi: Fix printing of node 0 3807ec681f3Smrg- panfrost: Fix TLS sizing if cores are missing 3817ec681f3Smrg- panfrost: Allow waiting on slots 6/7 during preload 3827ec681f3Smrg- pan/bi: Add internal debug flag 3837ec681f3Smrg- pan/bi: Validate format 12 tuple count in disasm 3847ec681f3Smrg- pan/bi: Print FAU index in verbose mode 3857ec681f3Smrg- pan/bi: Refactor PC-relative printing 3867ec681f3Smrg- pan/bi: Lint for infinite loops 3877ec681f3Smrg- pan/bi: Print disasm/stats with DEBUG=internal 3887ec681f3Smrg- pan/bi: Fix IDLE register mode packing 3897ec681f3Smrg- pan/bi: Fix staging register packing 3907ec681f3Smrg- pan/bi: Fix dependency wait calculation 3917ec681f3Smrg- pan/bi: Fix M1/M2 decoding in disassembler 3927ec681f3Smrg- pan/bi: Pull out bi_count_read_registers helper 3937ec681f3Smrg- pan/bi: Move bi_next_clause to bir.c 3947ec681f3Smrg- pan/bi: Pass through wait_{6, 7} flags 3957ec681f3Smrg- pan/bi: Add dead branch elimination pass 3967ec681f3Smrg- pan/bi: Add "soft" mode to DCE 3977ec681f3Smrg- pan/bi: Add bi_{before,after}_clause cursors 3987ec681f3Smrg- pan/bi: Add bi_foreach_clause_in_block_rev 3997ec681f3Smrg- pan/bi: Add bi_foreach_instr_in_tuple helper 4007ec681f3Smrg- pan/bi: Add bi_foreach_instr_in_clause iterators 4017ec681f3Smrg- pan/bi: Add destination iterator macro 4027ec681f3Smrg- pan/bi: Don't open code bi_foreach_dest 4037ec681f3Smrg- pan/bi: Permit multiple destinations in RA 4047ec681f3Smrg- pan/bi: Add interference per clause 4057ec681f3Smrg- pan/bi: Implement spilling at the clause-level 4067ec681f3Smrg- pan/bi: Don't fill garbage 4077ec681f3Smrg- pan/bi: Add CUBEFACE pseudoinstruction 4087ec681f3Smrg- pan/bi: Print multiple destinations if needed 4097ec681f3Smrg- pan/bi: Move init_builder to common code 4107ec681f3Smrg- pan/bi: Add "word equivalence" relation for index 4117ec681f3Smrg- pan/bi: Stub out scheduler unit test 4127ec681f3Smrg- pan/bi: Factor nir_function_impl out of the context 4137ec681f3Smrg- pan/bi: Add bi_can_{fma, add} predicates 4147ec681f3Smrg- pan/bi: Annotate ISA.xml with 'last' parameter 4157ec681f3Smrg- pan/bi: Pipe last flag into opcode tables 4167ec681f3Smrg- pan/bi: Add bi_must_last predicate 4177ec681f3Smrg- pan/bi: Add bi_must_message predicate 4187ec681f3Smrg- pan/bi: Label table instructions 4197ec681f3Smrg- pan/bi: Emit branch and table bits in opcode table 4207ec681f3Smrg- pan/bi: Add various read predicates 4217ec681f3Smrg- pan/bi: Unit test bi_can_{fma, add} 4227ec681f3Smrg- pan/bi: Test bi_must_last 4237ec681f3Smrg- pan/bi: Test bi_must_message 4247ec681f3Smrg- pan/bi: Test read predicates 4257ec681f3Smrg- pan/bi: Move bi_constants to bifrost.h 4267ec681f3Smrg- pan/bi: Use canonical terminology for tuple 4277ec681f3Smrg- pan/bi: Use enum bifrost_message_type 4287ec681f3Smrg- pan/bi: Clarify tuple comment 4297ec681f3Smrg- pan/bi: Amend misleading comment 4307ec681f3Smrg- pan/bi: Pack multiple tuples in-memory 4317ec681f3Smrg- pan/bi: Add clause encodings as a table 4327ec681f3Smrg- pan/bi: Move bi_packed_tuple to compiler.h 4337ec681f3Smrg- pan/bi: Add bi_pack_literal 4347ec681f3Smrg- pan/bi: Add bi_pack_upper 4357ec681f3Smrg- pan/bi: Add bi_pack_tuple_bits 4367ec681f3Smrg- pan/bi: Add bi_pack_sync 4377ec681f3Smrg- pan/bi: Add tuple/embedded constant pack 4387ec681f3Smrg- pan/bi: Add subword 5/6 pack 4397ec681f3Smrg- pan/bi: Add subword 4 or 7 pack 4407ec681f3Smrg- pan/bi: Add pack_format helper 4417ec681f3Smrg- pan/bi: Calculate pos for constant packing 4427ec681f3Smrg- pan/bi: Pack multiple tuples per clause 4437ec681f3Smrg- pan/bi: Add packing unit test group 4447ec681f3Smrg- pan/bi: Test pack_literal 4457ec681f3Smrg- pan/bi: Test pack_upper 4467ec681f3Smrg- pan/bi: Test pack_tuple_bits 4477ec681f3Smrg- pan/bi: Test pack_sync 4487ec681f3Smrg- pan/bi: Add packing format tests 4497ec681f3Smrg- pan/decode: Be explicit when printing invocations 4507ec681f3Smrg- pan/decode: Remove tiler size checks 4517ec681f3Smrg- pan/decode: Remove dependency of decoder on the encoder 4527ec681f3Smrg- pan/decode: Deduplicate SFBD blend printing 4537ec681f3Smrg- pan/decode: Deduplicate shader property printing 4547ec681f3Smrg- pan/decode: Remove unused MEMORY_PROP macro 4557ec681f3Smrg- pan/decode: Simplify tiler printing 4567ec681f3Smrg- pan/decode: Remove pandecode_prop 4577ec681f3Smrg- pan/decode: Remove unused disasm stats 4587ec681f3Smrg- pan/decode: Remove mesa header dependencies 4597ec681f3Smrg- pan/mdg: Drop unused stage parameter to disassembler 4607ec681f3Smrg- pan/decode: Remove tile range validation 4617ec681f3Smrg- pan/decode: Prefer sizeof to ARRAY_SIZE for char 4627ec681f3Smrg- nir/lower_io: Fix grammar errors 4637ec681f3Smrg- pan/bi: Fix NULL deref with empty shader 4647ec681f3Smrg- pan/bi: Add side_effects helper 4657ec681f3Smrg- pan/bi: Respect side effects in DCE 4667ec681f3Smrg- pan/bi: Implement AXCHG 4677ec681f3Smrg- pan/bi: Implement ACMPXCHG 4687ec681f3Smrg- pan/bi: Add bi_fmul_f32 convenience method 4697ec681f3Smrg- pan/bi: Fix FLOG_TABLE modifier handling 4707ec681f3Smrg- pan/bi: Lower frcp to Newton-Raphson 4717ec681f3Smrg- pan/bi: Lower frsq to Newton-Raphson 4727ec681f3Smrg- pan/bi: Lower FEXP2 with a table 4737ec681f3Smrg- pan/bi: Lower flog2 to a table and polynomial 4747ec681f3Smrg- pan/bi: Rename NO_FP32_TRANSCENDENTALS quirk 4757ec681f3Smrg- pan/bi: Fix bi quirks detection 4767ec681f3Smrg- pan/bi: Lower FP32 transcendentals where required 4777ec681f3Smrg- pan/bi: Lower transcendentals on G71 4787ec681f3Smrg- pan/bi: Print program size in shader-db 4797ec681f3Smrg- pan/bi: Cleanup terminal block check 4807ec681f3Smrg- pan/bi: Dead code eliminate per-channel 4817ec681f3Smrg- pan/bi: Include ATEST datum in the instruction 4827ec681f3Smrg- pan/bi: Add scheduler data structures 4837ec681f3Smrg- pan/bi: Add cubeface lowering 4847ec681f3Smrg- pan/bi: Flatten block lists 4857ec681f3Smrg- pan/bi: Stub worklist routines 4867ec681f3Smrg- pan/bi: Add constant count estimates to scheduler 4877ec681f3Smrg- pan/bi: Add FAU update helper 4887ec681f3Smrg- pan/bi: Add bi_tuple_is_new_src 4897ec681f3Smrg- pan/bi: Add bi_count_succ_reads helper 4907ec681f3Smrg- pan/bi: Validate reads_t 4917ec681f3Smrg- pan/bi: Add T0/T1 constraint check 4927ec681f3Smrg- pan/bi: Add writes_reg predicate 4937ec681f3Smrg- pan/bi: Add bi_instr_schedulable predicate 4947ec681f3Smrg- pan/bi: Choose instructions to schedule 4957ec681f3Smrg- pan/bi: Destructively schedule a single instruction 4967ec681f3Smrg- pan/bi: Add passthrough register rewriting helper 4977ec681f3Smrg- pan/bi: Extract bi_ec0_packed helper 4987ec681f3Smrg- pan/bi: Add bi_foreach_instr_and_src_in_tuple 4997ec681f3Smrg- pan/bi: Move bi_constant_field to bifrost.h 5007ec681f3Smrg- pan/bi: Add pcrel_idx to bi_clause 5017ec681f3Smrg- pan/bi: Derive M0 from pcrel_idx while packing 5027ec681f3Smrg- pan/bi: Add trivial rewrite helpers 5037ec681f3Smrg- pan/bi: Add constant to passthrough rewrite 5047ec681f3Smrg- pan/bi: Add constant state constructor 5057ec681f3Smrg- pan/bi: Add constant merging routines 5067ec681f3Smrg- pan/bi: Add constant modifier handling 5077ec681f3Smrg- pan/bi: Schedule blocks 5087ec681f3Smrg- pan/bi: Switch to new scheduler 5097ec681f3Smrg- pan/bi: Remove old FAU assignment code 5107ec681f3Smrg- pan/bi: Remove older cube map lowering 5117ec681f3Smrg- pan/bi: Add nosched debug option 5127ec681f3Smrg- pan/bi: Fix 'last tuple' for terminal-NOP clauses 5137ec681f3Smrg- pan/bi: Fix 2-write pseudo op scheduling 5147ec681f3Smrg- pan/mdg: Fix multithreaded shader-db 5157ec681f3Smrg- pan/mdg: Add MIDGARD_MESA_DEBUG=inorder option 5167ec681f3Smrg- pan/mdg: Optimize UBO offset calculations 5177ec681f3Smrg- pan/mdg: Set lower_uniforms_to_ubo 5187ec681f3Smrg- panfrost: Fix race condition in UBO mapping to CPU 5197ec681f3Smrg- panfrost: Respect buffer_offset when mapping to CPU 5207ec681f3Smrg- panfrost: Move sysvals to dedicated UBO 5217ec681f3Smrg- panfrost: Don't truncate uniform_count 5227ec681f3Smrg- panfrost: Add UBO push data structure 5237ec681f3Smrg- panfrost: Push uniforms required by the program 5247ec681f3Smrg- panfrost: Set FAU count based on program->push 5257ec681f3Smrg- panfrost: Don't store uniform_count on Midgard 5267ec681f3Smrg- pan/mdg: Update UBO promotion comment 5277ec681f3Smrg- pan/mdg: Push uniforms based on UBO analysis 5287ec681f3Smrg- pan/bi: Fix multithreaded shader-db 5297ec681f3Smrg- pan/bi: Add bi_replace_index helper 5307ec681f3Smrg- pan/bi: Add bi_is_ssa helper 5317ec681f3Smrg- pan/bi: Print FAU uniforms in IR 5327ec681f3Smrg- pan/bi: Generalize bi_update_fau with fast zero 5337ec681f3Smrg- pan/bi: Handle modifiers in rewrite_fau_to_pass 5347ec681f3Smrg- pan/bi: Rework FAU lowering 5357ec681f3Smrg- pan/bi: Simplify derivative lowering 5367ec681f3Smrg- pan/bi: Add SSA-based scalar copy propagation 5377ec681f3Smrg- pan/bi: Push UBOs on Bifrost 5387ec681f3Smrg- panfrost: Enable ES3 conformant floating-point 5397ec681f3Smrg- compiler, nir: Add and set barrier metadata 5407ec681f3Smrg- panfrost: Set barriers flag for compute shaders 5417ec681f3Smrg- panfrost: Pass is_blit flag around 5427ec681f3Smrg- pan/bi: Skip ATEST for colour blit shaders 5437ec681f3Smrg- panfrost: Fake shader images for bifrost+deqp 5447ec681f3Smrg- pan/bi: Fix jumps to terminal block again 5457ec681f3Smrg- pan/bi: Fix empty shader handling 5467ec681f3Smrg- nir: Add sample_positions_pan intrinsic 5477ec681f3Smrg- pan/decode: Cleanup sample locations decode 5487ec681f3Smrg- pan/decode: Only print local storage for vertex jobs 5497ec681f3Smrg- panfrost: Preload sample mask if needed 5507ec681f3Smrg- panfrost: Add sample positions sysval 5517ec681f3Smrg- panfrost: Push sample positions sysval for Midgard 5527ec681f3Smrg- panfrost: Refactor sample shading state 5537ec681f3Smrg- panfrost: Respect info.fs.uses_sample_shading 5547ec681f3Smrg- panfrost: Add panfrost_sample_pattern helper 5557ec681f3Smrg- panfrost: Set tiler descriptor sampler pattern 5567ec681f3Smrg- panfrost: Generalize MSAA handling 5577ec681f3Smrg- panfrost: Don't set REQ_MSAA in pan_mfbd 5587ec681f3Smrg- panfrost: Don't use PAN_REQ_MSAA in SFBD 5597ec681f3Smrg- panfrost: Remove PAN_REQ_MSAA 5607ec681f3Smrg- panfrost: Remove PAN_REQ_DEPTH_WRITE 5617ec681f3Smrg- panfrost: Remove batch_is_scanout 5627ec681f3Smrg- panfrost: Set sample count/pattern for tiler FBD 5637ec681f3Smrg- panfrost: Upload sample positions on device init 5647ec681f3Smrg- panfrost: Use sample location LUT 5657ec681f3Smrg- panfrost: Ensure open_device has pandecode initialized 5667ec681f3Smrg- panfrost: Advertise MSAA 8x and 16x 5677ec681f3Smrg- panfrost: Implement get_sample_position 5687ec681f3Smrg- panfrost: Overhaul sysval handling 5697ec681f3Smrg- panfrost: Add MULTISAMPLED sysval 5707ec681f3Smrg- pan/mdg: Assert on bad 64-bit swizzle in disassembly 5717ec681f3Smrg- pan/mdg: Remove unused pack_unorm_4x8 lowering 5727ec681f3Smrg- pan/mdg: Lower bitfield instructions 5737ec681f3Smrg- pan/mdg: Rename bitcount8 to popcnt, fixing the unit 5747ec681f3Smrg- pan/mdg: Implement uclz 5757ec681f3Smrg- pan/mdg: Lower ufind_msb, poorly 5767ec681f3Smrg- pan/mdg: Stub load_barycentric_sample 5777ec681f3Smrg- pan/mdg: Lower stores from helpers 5787ec681f3Smrg- pan/bi: Remove redundant TEXC opcode check 5797ec681f3Smrg- pan/bi: Implement texture gathers 5807ec681f3Smrg- pan/bi: Lower bitfield inserts/extracts 5817ec681f3Smrg- pan/bi: Implement bitfield_reverse 5827ec681f3Smrg- pan/bi: Implement ufind_msb 5837ec681f3Smrg- pan/bi: Lower ifind_msb 5847ec681f3Smrg- pan/bi: Fix gl_SampleID read 5857ec681f3Smrg- pan/bi: Implement load_sample_mask_in 5867ec681f3Smrg- pan/bi: Implement nir_intrinsic_load_sample_positions_pan 5877ec681f3Smrg- pan/bi: Decouple sysval loading from NIR 5887ec681f3Smrg- pan/bi: Implement coverage mask updates 5897ec681f3Smrg- pan/{mdg, bi}: Lower load_helper_invocation 5907ec681f3Smrg- pan/{mdg, bi}: Lower load_sample_pos 5917ec681f3Smrg- panfrost: Simplify bind_compute_state 5927ec681f3Smrg- panfrost: Remove stale TODOs 5937ec681f3Smrg- panfrost: Assert on indirect compute shaders 5947ec681f3Smrg- panfrost: Advertise SAMPLE_SHADING 5957ec681f3Smrg- panfrost: Bump max SSBO count 5967ec681f3Smrg- panfrost: Bump advertised ESSL feature level 5977ec681f3Smrg- panfrost: Advertise OES_standard_derivatives 5987ec681f3Smrg- docs/features: Mark more TBO exts done on panfrost 5997ec681f3Smrg- docs/features: Mark some ES3.1 done on Panfrost 6007ec681f3Smrg- docs/features: Mark sample shading done on Panfrost 6017ec681f3Smrg- docs/features: gl_HelperInvocation on Panfrost 6027ec681f3Smrg- nir: Fix grammar error 6037ec681f3Smrg- panfrost: Fix uniform_count on Midgard 6047ec681f3Smrg- pan/bi: Stub scoreboarding 6057ec681f3Smrg- pan/bi: Implement barriers 6067ec681f3Smrg- pan/decode: Fix tiler printing on Bifrost 6077ec681f3Smrg- pan/decode: Pretty print 22-bit pixel formats 6087ec681f3Smrg- pan/decode: Disambiguate border colours 6097ec681f3Smrg- panfrost: Label groups in GenXML 6107ec681f3Smrg- panfrost: Track coverage, early fragment tests 6117ec681f3Smrg- panfrost: Flesh out pixel kill / zs update 6127ec681f3Smrg- panfrost: Handle PIPE_FORMAT_NONE as 'constant' 6137ec681f3Smrg- panfrost: Flesh out allow_forward_pixel_to_kill check 6147ec681f3Smrg- panfrost: Export bifrost_get_blend_desc with type size 6157ec681f3Smrg- panfrost: Add RT conversion sysval 6167ec681f3Smrg- panfrost: Fix NULL deref for an empty frag shader 6177ec681f3Smrg- panfrost: Spell fix 6187ec681f3Smrg- panfrost: Take panfrost_dev for AFBC selection 6197ec681f3Smrg- panfrost: Set border colour on Bifrost sampler 6207ec681f3Smrg- panfrost: Remove useless check 6217ec681f3Smrg- pan/bi: Fix RA of node 0 again 6227ec681f3Smrg- pan/bi: Don't inline 64-bit constants 6237ec681f3Smrg- pan/bi: Fix LD_GCLK staging count 6247ec681f3Smrg- pan/bi: Don't read alpha out of bounds 6257ec681f3Smrg- pan/bi: Allow @rNULL with tied operands 6267ec681f3Smrg- pan/bi: Add SEG_ADD.i64 pseudoinstruction 6277ec681f3Smrg- pan/bi: Add 32-bit atomic pseudoinstruction 6287ec681f3Smrg- pan/bi: Lower atomic pseudo-instructions 6297ec681f3Smrg- pan/bi: Extract bi_atom_opc from NIR intrinsic 6307ec681f3Smrg- pan/bi: Add ATOM_C1 promotion check 6317ec681f3Smrg- pan/bi: Handle computational atomics 6327ec681f3Smrg- pan/bi: Materialize \*DTSEL_IMM in the scheduler 6337ec681f3Smrg- pan/bi: Implement image_atomic_exchange 6347ec681f3Smrg- pan/bi: Implement image_atomic_comp_swap 6357ec681f3Smrg- pan/bi: Implement shader_clock intrinsic 6367ec681f3Smrg- pan/bi: Fix blend shaders using LD_TILE with MRT 6377ec681f3Smrg- pan/bi: Flesh out LD_TILE emit 6387ec681f3Smrg- pan/bi: Elucidate slot 6/7 operation 6397ec681f3Smrg- pan/bi: Preload sample ID for sample shading 6407ec681f3Smrg- pan/bi: Implement interpolateAtSample 6417ec681f3Smrg- pan/bi: Add imm_f16 helper 6427ec681f3Smrg- pan/bi: Implement interpolateAtOffset 6437ec681f3Smrg- pan/bi: Allow dynamically uniform tex indices 6447ec681f3Smrg- pan/bi: Use explicit move even for RT#0 of MRT 6457ec681f3Smrg- panfrost: Comment on state of ARB_shader_clock 6467ec681f3Smrg- panfrost: Advertise FRAMEBUFFER_NO_ATTACHMENTS 6477ec681f3Smrg- docs/features: Mark atomics/images done on Bifrost 6487ec681f3Smrg- panfrost/ci: Mark flaky test 6497ec681f3Smrg- gallium/tessellator: Remove XBOX 360 code 6507ec681f3Smrg- gallium/tessellator: Remove unused includes 6517ec681f3Smrg- gallium/tessellator: Rename D3D11 defines 6527ec681f3Smrg- pan/bi: Fix more jumps to terminal blocks 6537ec681f3Smrg- pan/bi: Optimize out redundant jumps to #0x0 6547ec681f3Smrg- pan/bi: Fix elimination of repeated branches 6557ec681f3Smrg- panfrost: Fix infinite loop spilling 6567ec681f3Smrg- panfrost: Fix NULL dereference adding cbuf to batch 6577ec681f3Smrg- panfrost: Remove redundant NULL check 6587ec681f3Smrg- panfrost: Fix NULL deref in pan_mfbd.c 6597ec681f3Smrg- panfrost: Fix NULL derefs in pan_cmdstream.c 6607ec681f3Smrg- panfrost: Fix NULL deref in pan_sfbd 6617ec681f3Smrg- panfrost: Raise TEXTURE_BUFFER_OFFSET_ALIGNMENT 6627ec681f3Smrg- panfrost: Hide MSAA 8x/16x support 6637ec681f3Smrg- panfrost: Fix UNORM 16 rendering 6647ec681f3Smrg- panfrost: Reinterpret format for reload blits 6657ec681f3Smrg- panfrost: Fix typo in midgard.xml 6667ec681f3Smrg- panfrost: Don't advertise OES_copy_image 6677ec681f3Smrg- pan/bi: Use nir_opt_sink/move for constants 6687ec681f3Smrg- pan/bi: Reduce liveness calculations in DCE 6697ec681f3Smrg- pan/bi: Inline \`bytemask of read components` 6707ec681f3Smrg- pan/bi: Mark branches as having side effects 6717ec681f3Smrg- pan/bi: Mark DISCARD as having side effects 6727ec681f3Smrg- pan/bi: Make bi_writemask take a destination 6737ec681f3Smrg- pan/bi: Allow spilling with multiple destinations 6747ec681f3Smrg- pan/bi: Annotate instructions by destination count 6757ec681f3Smrg- pan/bi: Adapt builder to dest count 6767ec681f3Smrg- pan/bi: Remove unused definitions 6777ec681f3Smrg- pan/bi: Do copyprop in linear-time 6787ec681f3Smrg- panfrost/lcra: Fix constraint counting 6797ec681f3Smrg- pan/bi: Use replace_index in more places 6807ec681f3Smrg- pan/bi: Allow negating constants 6817ec681f3Smrg- pan/bi: Implement fsin/fcos 6827ec681f3Smrg- pan/bi: Mark message-passing sources/dests live 6837ec681f3Smrg- pan/bi: Set clause_state.message conservatively 6847ec681f3Smrg- pan/bi: Treat +DISCARD.f32 as message-passing 6857ec681f3Smrg- nir/lower_viewport_transform: Allow geom/tess 6867ec681f3Smrg- pan/bi: Implement u{add, sub}_sat 6877ec681f3Smrg- nir: Unify memory atomics 6887ec681f3Smrg- meson: Remove kmsro from gallium-drivers 6897ec681f3Smrg- pan/bi: Document register conventions 6907ec681f3Smrg- pan/bi: Add bi_foreach_block_rev 6917ec681f3Smrg- pan/bi: Handle 16-bit blend sr_count 6927ec681f3Smrg- pan/bi: Only run copyprop once 6937ec681f3Smrg- pan/bi: Only run DCE once 6947ec681f3Smrg- pan/bi: Remove TODO: RA warnings 6957ec681f3Smrg- pan/bi: Remove stale todo/assert 6967ec681f3Smrg- panfrost: Deduplicate UBO count assignment 6977ec681f3Smrg- panfrost: Deduplicate Bifrost fau_count 6987ec681f3Smrg- panfrost: Only check blend work count on midgard 6997ec681f3Smrg- vulkan: Deduplicate mesa stage conversion 7007ec681f3Smrg- pan/bi: Enable all nir_opt_move/sink optimizations 7017ec681f3Smrg- pan/bi: Split writemasks for memory stores 7027ec681f3Smrg- pan/bi: Lower large arrays to scratch 7037ec681f3Smrg- pan/bi: Add bi_swz_16 helper 7047ec681f3Smrg- pan/bi: Optimize MKVEC.v2i16 generation 7057ec681f3Smrg- pan/bi: Lower swizzles 7067ec681f3Smrg- pan/bi: Fill in some more conversions 7077ec681f3Smrg- pan/bi: Generalize f2i16, f2u16 7087ec681f3Smrg- pan/bi: Remove conversion lowering 7097ec681f3Smrg- nir: Add nir_type_convert 7107ec681f3Smrg- nir: Add {i2f, u2f, f2i, f2u} helpers 7117ec681f3Smrg- nir/lower_idiv: Convert to lower_instructions 7127ec681f3Smrg- nir/lower_idiv: Factor out numer/denom load 7137ec681f3Smrg- nir/lower_idiv: Add 8-bit and 16-bit lowering path 7147ec681f3Smrg- pan/bi: Determine block successors correctly 7157ec681f3Smrg- panfrost: Fix AFBC body_size for shared resources 7167ec681f3Smrg- panfrost: Enable AFBC buffer sharing 7177ec681f3Smrg- nir: Add varying precision linking helper (v2) 7187ec681f3Smrg- docs: Add some notes on building for macOS 7197ec681f3Smrg- panfrost: Fix formats converting uninit from AFBC 7207ec681f3Smrg 7217ec681f3SmrgAndreas Bergmeier (1): 7227ec681f3Smrg 7237ec681f3Smrg- v3dv: Output a message if file open fails in physical_device_init 7247ec681f3Smrg 7257ec681f3SmrgAndres Gomez (29): 7267ec681f3Smrg 7277ec681f3Smrg- ci: recover tracie dashboard URLs for failing traces 7287ec681f3Smrg- ci: correct the trace image URLs in the piglit summary 7297ec681f3Smrg- ci: make piglit runner less noisy and show a better failure message 7307ec681f3Smrg- ci: clean paths used in the piglit runner 7317ec681f3Smrg- ci: correct piglit's HTML summary location for artifacts upload 7327ec681f3Smrg- ci: make sure piglit's artifacts are not overwritten 7337ec681f3Smrg- ci: correct artifacts location for piglit's runner messages 7347ec681f3Smrg- ci: tracie dashboard URLs only in the failure after the testcase 7357ec681f3Smrg- ci: piglit runner colors diff output on failures 7367ec681f3Smrg- ci: remove pytest since we don't need it any more 7377ec681f3Smrg- ci: only install piglit dependencies when installing piglit 7387ec681f3Smrg- ci: build gfxreconstruct v0.9.5 7397ec681f3Smrg- ci: add libdrm to the x86_test-vk container 7407ec681f3Smrg- .mailmap: colapse duplicates for Timothy Arceri 7417ec681f3Smrg- .mailmap: resolve duplicates for Icecream95 7427ec681f3Smrg- .mailmap: resolve duplicates for Christopher Li 7437ec681f3Smrg- .mailmap: resolve duplicates for Emmanuel Vadot 7447ec681f3Smrg- .mailmap: resolve duplicates for Indrajit Das 7457ec681f3Smrg- .mailmap: resolve duplicates for James Xiong 7467ec681f3Smrg- .mailmap: resolve duplicates for Jan Zielinski 7477ec681f3Smrg- .mailmap: resolve duplicates for Lin Johnson 7487ec681f3Smrg- .mailmap: resolve duplicates for Mark Menzynski 7497ec681f3Smrg- .mailmap: resolve duplicates for Matthias Hopf 7507ec681f3Smrg- .mailmap: resolve duplicates for Matthias Lorenz 7517ec681f3Smrg- .mailmap: resolve duplicates for Maya Rashish 7527ec681f3Smrg- .mailmap: resolve duplicates for Mun Gwan-gyeong 7537ec681f3Smrg- .mailmap: resolve duplicates for Satyeshwar Singh 7547ec681f3Smrg- .mailmap: resolve duplicates for Yogesh Mohan Marimuthu 7557ec681f3Smrg- .mailmap: add an alias for Eleni Maria Stea 7567ec681f3Smrg 7577ec681f3SmrgAndrew McMahon (1): 7587ec681f3Smrg 7597ec681f3Smrg- util: add mesa_glthread for Half Life 2 and Black Mesa. 7607ec681f3Smrg 7617ec681f3SmrgAndrii Simiklit (7): 7627ec681f3Smrg 7637ec681f3Smrg- st/mesa: fix pbo upload/download for arrays of textures with only 1 layer 7647ec681f3Smrg- iris: don't emit IRIS_DIRTY_VF depending on trash in restart_index 7657ec681f3Smrg- mesa: ensure parameter list capacity before associating uniform storage 7667ec681f3Smrg- glsl/linker: Fix xfb stride alignment for buffers containing 64bit types 7677ec681f3Smrg- gitlab-ci: remove fixed tests 7687ec681f3Smrg- spirv: repair ssa defs for switchs with only default case 7697ec681f3Smrg- nir/spirv: remove unused fields from \`vtn_builder` 7707ec681f3Smrg 7717ec681f3SmrgAntonio Caggiano (4): 7727ec681f3Smrg 7737ec681f3Smrg- zink: check shader stencil output 7747ec681f3Smrg- zink: support stencil-export 7757ec681f3Smrg- zink: fix destroy batch 7767ec681f3Smrg- ci: Use lock file to build deqp-runner 7777ec681f3Smrg 7787ec681f3SmrgAnuj Phogat (32): 7797ec681f3Smrg 7807ec681f3Smrg- intel/anv: Fix condition to set MipModeFilter for YUV surface 7817ec681f3Smrg- intel/anv: Fix condition for planar yuv surface 7827ec681f3Smrg- intel: Rename files with gen\_ prefix in common code to intel\_ 7837ec681f3Smrg- intel: Rename "gen\_" prefix used in common code to "intel\_" 7847ec681f3Smrg- intel: Fix broken alignment due to gen\_ prefix renaming 7857ec681f3Smrg- intel: Rename "GEN\_" prefix used in common code to "INTEL\_" 7867ec681f3Smrg- i965: Remove blank line at EOF 7877ec681f3Smrg- i965: Rename files with "intel\_" prefix to "brw\_" 7887ec681f3Smrg- intel/isl: Drop intel\_ prefix in function names 7897ec681f3Smrg- anv: Remove redundant #if checks 7907ec681f3Smrg- intel: Remove GEN_IS_HASWELL macro 7917ec681f3Smrg- intel: Simplify version checks involving haswell 7927ec681f3Smrg- intel: Remove GEN_IS_G4X macro 7937ec681f3Smrg- intel: Simplify few version checks involving G4X 7947ec681f3Smrg- intel: Rename GEN_VERSIONx10 macro to GFX_VERx10 7957ec681f3Smrg- intel: Rename GEN_GEN macro to GFX_VER 7967ec681f3Smrg- intel: Rename ISL_DEV_GEN to ISL_GFX_VER 7977ec681f3Smrg- intel: Rename genx10 field in gen_device_info struct to verx10 7987ec681f3Smrg- intel: Rename gen field in gen_device_info struct to ver 7997ec681f3Smrg- intel: Rename genx keyword in filenames to gfxx 8007ec681f3Smrg- intel: Rename GENx prefix in macros to GFXx in build files 8017ec681f3Smrg- intel: Rename GENx prefix in macros to GFXx in source files 8027ec681f3Smrg- intel: Rename genx keyword to gfxx in build files 8037ec681f3Smrg- intel: Rename genx keyword to gfxx in source files 8047ec681f3Smrg- intel: Rename Genx keyword to Gfxx 8057ec681f3Smrg- intel: Rename GENx keyword to GFXx 8067ec681f3Smrg- intel: Rename IS_GEN* macros to IS_GFX_VER* 8077ec681f3Smrg- intel: Make line wrapping changes due to IS_GFX_VER_BETWEEN 8087ec681f3Smrg- intel: Remove unused MAKE_GEN macro 8097ec681f3Smrg- intel: Rename GEN_{ALL, LT, ..} macros to GFX_{ALL, LT, ..} 8107ec681f3Smrg- intel: Rename GEN:BUG:### to Wa_### 8117ec681f3Smrg- intel: Rename WA_### to Wa_### 8127ec681f3Smrg 8137ec681f3SmrgArcady Goldmints-Orlov (14): 8147ec681f3Smrg 8157ec681f3Smrg- v3dv: Fix uninitialized variable warnings 8167ec681f3Smrg- nir: add more intrinsics to divergence analysis 8177ec681f3Smrg- nir: handle v3d intrinsics in divergence analysis 8187ec681f3Smrg- nir: store the results of divergence analysis on loops 8197ec681f3Smrg- broadcom/compiler: Use ANYA for branches in uniform ifs 8207ec681f3Smrg- broadcom/compiler: Emit uniform loops using uniform control flow 8217ec681f3Smrg- broadcom/compiler: Enable PER_QUAD TMU access only in uniform control flow 8227ec681f3Smrg- v3dv: Only lower local arrays of size up to 2 to if-chains 8237ec681f3Smrg- broadcom/compiler: improve generation of if conditions 8247ec681f3Smrg- Revert "broadcom/compiler: improve generation of if conditions" 8257ec681f3Smrg- v3dv: initialize render_fd at the top of physical_device_init 8267ec681f3Smrg- broadcom/compiler: Add a v3d_compile argument to vir_set_[pu]f 8277ec681f3Smrg- broadcom/compiler: Skip bool_to_cond where possible 8287ec681f3Smrg- broadcom/compiler: Merge instructions more efficiently 8297ec681f3Smrg 8307ec681f3SmrgArno Messiaen (1): 8317ec681f3Smrg 8327ec681f3Smrg- lima/ppir: increase usage of pipeline regs 8337ec681f3Smrg 8347ec681f3SmrgAxel Davy (76): 8357ec681f3Smrg 8367ec681f3Smrg- st/nine: Reduce system memory allocated by D3DUSAGE_AUTOGENMIPMAP 8377ec681f3Smrg- st/nine: Do not allow depth buffer render targets 8387ec681f3Smrg- st/nine: Clamp GetAvailableTextureMem 8397ec681f3Smrg- st/nine: Unmap buffers after full unlock 8407ec681f3Smrg- st/nine: Track formats compatible with FETCH4 8417ec681f3Smrg- st/nine: Implement experimental FETCH4 8427ec681f3Smrg- st/nine: Enable DF24 support 8437ec681f3Smrg- st/nine: Add new debug and error checks 8447ec681f3Smrg- st/nine: Refactor ht_guid_delete 8457ec681f3Smrg- st/nine: Protect \*PrivateData also for Volumes 8467ec681f3Smrg- st/nine: Fix leak at device destruction 8477ec681f3Smrg- driconf: Rename csmt_int back to csmt_force 8487ec681f3Smrg- st/nine: Simplify checks for driconf options 8497ec681f3Smrg- st/nine: Add new function to know if we are the worker 8507ec681f3Smrg- st/nine: Add RAM memory manager for textures 8517ec681f3Smrg- st/nine: Use the texture memory helper 8527ec681f3Smrg- st/nine: Control the memfd virtual limit 8537ec681f3Smrg- st/nine: Add driconf option to limit texture memory 8547ec681f3Smrg- st/nine: Set default dynamic_texture_workaround to true 8557ec681f3Smrg- st/nine: Check memfd_create support 8567ec681f3Smrg- st/nine: Fix compilation issue in nine_debug 8577ec681f3Smrg- st/nine: Optimize EndScene 8587ec681f3Smrg- st/nine: Implement SYSTEMMEM buffers same as MANAGED 8597ec681f3Smrg- st/nine: Refactor DrawPrimitiveUp 8607ec681f3Smrg- st/nine: Optimize DrawPrimitiveUp 8617ec681f3Smrg- st/nine: Use correct bind flag at buffer creation 8627ec681f3Smrg- gallium/util: Add new u_box helpers 8637ec681f3Smrg- st/nine: Track pending MANAGED buffer uploads 8647ec681f3Smrg- st/nine: Optimize dynamic systemmem buffers 8657ec681f3Smrg- st/nine: Force DYNAMIC SYSTEMMEM for sw vertex processing 8667ec681f3Smrg- st/nine: Always use DYNAMIC with SYSTEMMEM 8677ec681f3Smrg- st/nine: Use stream_uploader for bad cases of systemmem 8687ec681f3Smrg- st/nine: detect worker threads syncs for systemmem 8697ec681f3Smrg- radeonsi: Limit the size of the in-memory shader cache 8707ec681f3Smrg- radeonsi: fix leak when the in-memory cache is full 8717ec681f3Smrg- st/nine: Disable fpu exceptions during init 8727ec681f3Smrg- st/nine: Fix crash on texture creation failure 8737ec681f3Smrg- st/nine: Fix cubetexture early destruction 8747ec681f3Smrg- st/nine: Add missing breaks 8757ec681f3Smrg- st/nine: Fix invalid NULL check 8767ec681f3Smrg- st/nine: Prevent use after free on dtor 8777ec681f3Smrg- st/nine: Fix reading invalid pointer 8787ec681f3Smrg- st/nine: Fix compilation warnings 8797ec681f3Smrg- st/nine: Fix read outside bounds for some textures 8807ec681f3Smrg- st/nine: Fix value of pipe_draw_info's max_index vertex 8817ec681f3Smrg- st/nine: Prevent negative reference count 8827ec681f3Smrg- st/nine: Improve Surface GetContainer 8837ec681f3Smrg- st/nine: Fix alpha to coverage states 8847ec681f3Smrg- st/nine: Enable multisampling also without depth buffer 8857ec681f3Smrg- st/nine: Handle D3DFMT_NULL multisampling 8867ec681f3Smrg- st/nine: Remove errors on unsupported lock flags 8877ec681f3Smrg- st/nine: Increase number of constants of vs1_sw 8887ec681f3Smrg- st/nine: Ignore swizzle on samplers 8897ec681f3Smrg- st/nine: Clamp max_anisotropy 8907ec681f3Smrg- st/nine: Refuse depth buffers as rendertargets 8917ec681f3Smrg- st/nine: Fix ps ff BLENDTEXTUREALPHA 8927ec681f3Smrg- st/nine: Fix ff has_aNrm computation 8937ec681f3Smrg- st/nine: Catch redundant scissor and viewport settings 8947ec681f3Smrg- st/nine: Pseudo implement set/getClipstatus 8957ec681f3Smrg- st/nine: Improve Reset on Ex devices 8967ec681f3Smrg- st/nine: Pseudo implement Create*Ex functions 8977ec681f3Smrg- st/nine: Complete \*Ex stubs 8987ec681f3Smrg- st/nine: Add logging to Ex function 8997ec681f3Smrg- st/nine: Have NOOVERWRITE win over DISCARD 9007ec681f3Smrg- st/nine: Do not memset buffers twice 9017ec681f3Smrg- st/nine: Add fallback for YUV formats 9027ec681f3Smrg- st/nine: Use PIPE_MAP_ONCE for persistent buffers 9037ec681f3Smrg- st/nine: Disable buffer_upload when csmt is off 9047ec681f3Smrg- st/nine: Allow to override the vram size 9057ec681f3Smrg- st/nine: Make it optional to use a sw renderer 9067ec681f3Smrg- st/nine: Lower texture_memory_limit default 9077ec681f3Smrg- st/nine: Bump num of backbuffers for tearfree thread_submit 9087ec681f3Smrg- st/nine: Improve performance with thread_submit 9097ec681f3Smrg- st/nine: Default thread_submit to true 9107ec681f3Smrg- st/nine: Default tearfree_discard to true 9117ec681f3Smrg- st/nine: Fix compilation error on non-x86 platforms 9127ec681f3Smrg 9137ec681f3SmrgBas Nieuwenhuizen (87): 9147ec681f3Smrg 9157ec681f3Smrg- ac/surface: Fix GFX9 sparse mip info. 9167ec681f3Smrg- radv: Do not use a pipe offset for aliased sparse images. 9177ec681f3Smrg- radv: Add a trivial implementation of VK_KHR_deferred_host_operation 9187ec681f3Smrg- radv: Use stricter HW resolve swizzle compat check. 9197ec681f3Smrg- radv: Expose VK_KHR_workgroup_memory_explicit_layout. 9207ec681f3Smrg- radv: Do not hash vk_object_base in descriptor set layout. 9217ec681f3Smrg- amd/common: Add modifier size helper. 9227ec681f3Smrg- radv: Extract DCC format support handling. 9237ec681f3Smrg- radv: Use the surface offset from ac_surface instead of a plane offset. 9247ec681f3Smrg- radv: Don't relayout images with modifiers. 9257ec681f3Smrg- radv: Add format modifier format queries. 9267ec681f3Smrg- radv: Add drm format modifier queries. 9277ec681f3Smrg- radv: Add image layout with drm format modifiers. 9287ec681f3Smrg- radv: Enable DRM format modifiers on GFX9+. 9297ec681f3Smrg- radv: Enable modifiers with the WSI. 9307ec681f3Smrg- radv: Add modifier fails for CTS bug. 9317ec681f3Smrg- radv: Fix assert. 9327ec681f3Smrg- radv: Implement VK_KHR_zero_initialize_workgroup_memory. 9337ec681f3Smrg- radv: Improve spilling on discrete GPUs. 9347ec681f3Smrg- radv: Fix vram override with fully visible VRAM. 9357ec681f3Smrg- radv: Remove custom icd json generation. 9367ec681f3Smrg- radv: Define supported extensions in C. 9377ec681f3Smrg- radv: Ignore WC flags for VRAM. 9387ec681f3Smrg- radv: Determine swizzles correctly. 9397ec681f3Smrg- radv: Add plane width/height helpers. 9407ec681f3Smrg- radv: Use u_format helpers when possible. 9417ec681f3Smrg- radv: Remove VK_SWIZZLE_*. 9427ec681f3Smrg- radv: Do no use vk_format for getting divisors. 9437ec681f3Smrg- radv: Do not use generated table for plane formats. 9447ec681f3Smrg- radv: Stop checking for MULTIPLANE layout. 9457ec681f3Smrg- radv: Stop using plane_count. 9467ec681f3Smrg- radv: Only support format with a PIPE_FORMAT. 9477ec681f3Smrg- radv: Start using util_format_description for everything. 9487ec681f3Smrg- radv: Remove the format table. 9497ec681f3Smrg- radv: Remove vk_format_has_stencil/depth helpers. 9507ec681f3Smrg- radv: Properly handle modifier import failure. 9517ec681f3Smrg- radv: Do pipe misalignment check per plane. 9527ec681f3Smrg- radv: Don't use dedicated memory info to indicate sharing. 9537ec681f3Smrg- vulkan/device_select: Stop using device properties 2. 9547ec681f3Smrg- amd/common: constify ac_surface_set_umd_metata. 9557ec681f3Smrg- radv: Handle UMD metadata on import. 9567ec681f3Smrg- radv: Use shared code for setting opaque metadata. 9577ec681f3Smrg- amd/common: Add retile map size helper. 9587ec681f3Smrg- radv: Implement initialization of displayable DCC. 9597ec681f3Smrg- radv: Implement displayable DCC retiling. 9607ec681f3Smrg- radv: Add DCC info to the metadata. 9617ec681f3Smrg- radv: Use ac_surface DCC settings for shareable images. 9627ec681f3Smrg- radv: Enable displayable DCC. 9637ec681f3Smrg- radv: Disable displayable DCC for GFX8 properly. 9647ec681f3Smrg- ac/rgp: Only report double the prims per clock on GFX10. 9657ec681f3Smrg- radv: Expose robustBufferAccessUpdateAfterBind correctly. 9667ec681f3Smrg- frontends/va: Use correct size for secondary planes. 9677ec681f3Smrg- radv: Enable linear sampling for depth textures. 9687ec681f3Smrg- radv: Add sam option. 9697ec681f3Smrg- radv: Add nodisplaydcc option. 9707ec681f3Smrg- radv: Use correct DCC compressed block size for sampling. 9717ec681f3Smrg- radv: Dedupe winsyses per device. 9727ec681f3Smrg- radv: Allow extra planes for DCC. 9737ec681f3Smrg- radv: Enable sharing with DCC with modifiers. 9747ec681f3Smrg- radv: Ensure we never decompress or FCE read-only textures. 9757ec681f3Smrg- radv: Allow DCC for images with modifiers that are read-only. 9767ec681f3Smrg- radv: Use 8x8 meta compute workgroups. 9777ec681f3Smrg- radv: Enable DCC for image stores on GFX10. 9787ec681f3Smrg- radv: Only set WRITE_COMPRESS_ENABLE on supported HW. 9797ec681f3Smrg- vulkan: Fix descriptor set creation with zero bindings. 9807ec681f3Smrg- lavapipe: Free sorted descriptor array. 9817ec681f3Smrg- zink: Remove initialization of some arrays 9827ec681f3Smrg- zink: Only set the needed number of scissors. 9837ec681f3Smrg- radv: Flush caches for shader read operations. 9847ec681f3Smrg- nir: Fix shader calls with nir_opt_dead_write_vars. 9857ec681f3Smrg- nir: Extract shader_info->cs.shared_size out of union. 9867ec681f3Smrg- nir: Remove nir_shader->shared_size. 9877ec681f3Smrg- nir: Do not reset shared_size in nir_lower_io. 9887ec681f3Smrg- radv: Support DCC without a fast clear value. 9897ec681f3Smrg- radv: Support DCC without DCC/FCE predicates. 9907ec681f3Smrg- radv: Add retiling for foreign queues. 9917ec681f3Smrg- radv: Support DCC modifiers fully. 9927ec681f3Smrg- radv: Add clang-format for AMD code. 9937ec681f3Smrg- radv: Format. 9947ec681f3Smrg- radv: Update editorconfig. 9957ec681f3Smrg- radv: Re-enable retiling. 9967ec681f3Smrg- radv: Refactor cs_domain to be a winsys function. 9977ec681f3Smrg- radv: Use VRAM cmdbuffers in more situations. 9987ec681f3Smrg- radv/winsys: Remove use_local_bos 9997ec681f3Smrg- radv: Fix memory leak on descriptor pool reset with layout_size=0. 10007ec681f3Smrg- amd/common: Use cap to test kernel modifier support. 10017ec681f3Smrg- radv: Only require DRM 3.23. 10027ec681f3Smrg 10037ec681f3SmrgBastian Beranek (1): 10047ec681f3Smrg 10057ec681f3Smrg- glx: Assign unique serial number to GLXBadFBConfig error 10067ec681f3Smrg 10077ec681f3SmrgBen Niu (1): 10087ec681f3Smrg 10097ec681f3Smrg- util: When building 'ARM64EC', don't use x64 intrinsics which need to be emulated 10107ec681f3Smrg 10117ec681f3SmrgBenjamin Tissoires (3): 10127ec681f3Smrg 10137ec681f3Smrg- CI: windows: augment the timeout of building the windows container 10147ec681f3Smrg- CI: windows: split the layers to meet new registry requirements 10157ec681f3Smrg- CI: windows: Force using LLVM 12 10167ec681f3Smrg 10177ec681f3SmrgBillKristiansen (1): 10187ec681f3Smrg 10197ec681f3Smrg- d3d12: fix for upside-down multisample stencil blit 10207ec681f3Smrg 10217ec681f3SmrgBoris Brezillon (91): 10227ec681f3Smrg 10237ec681f3Smrg- panfrost: Don't skip the test with a 4k shader 10247ec681f3Smrg- panfrost: Fix tiler job injection (again) 10257ec681f3Smrg- panfrost: Get rid of IS_BIFROST 10267ec681f3Smrg- panfrost: Don't memset the last attribute buffer entry twice 10277ec681f3Smrg- panfrost: Only allocate the extra attribute buffer entry on Bifrost 10287ec681f3Smrg- panfrost: Set attribs and attrib_bufs to NULL when attrib_count = 0 10297ec681f3Smrg- panfrost: Rename and move pan_render_condition_check() 10307ec681f3Smrg- panfrost: Use dev->arch where appropriate 10317ec681f3Smrg- panfrost: Add a panfrost_compile_shader() helper 10327ec681f3Smrg- panfrost: Update ctx->batch when a fresh batch is requested 10337ec681f3Smrg- panfrost: Fix a polygon list corruption in the multi-context case 10347ec681f3Smrg- panfrost: Don't add the tiler BO when it's not accessed 10357ec681f3Smrg- pan/bi: Add an is_terminal_block() helper 10367ec681f3Smrg- pan/bi: Make sure we never branch to an non-existing clause 10377ec681f3Smrg- pan/bi: Add uclz() support 10387ec681f3Smrg- pan/bi: Support bit_count() 10397ec681f3Smrg- panfrost: Use panfrost_get_shader_options() in panfrost_build_blit_shader() 10407ec681f3Smrg- panfrost: Hide backend compiler internals 10417ec681f3Smrg- panfrost: Prefix shader related helpers with pan_shader\_ 10427ec681f3Smrg- panfrost: Move sysval_to_id out of panfrost_sysvals 10437ec681f3Smrg- panfrost: Keep the compiler inputs in the context 10447ec681f3Smrg- panfrost: Move the shader compilation logic out of the gallium driver 10457ec681f3Smrg- panfrost: Provide a helper to prepare the shader related parts of an RSD 10467ec681f3Smrg- panfrost: Use the pan_shader_prepare_rsd() helper 10477ec681f3Smrg- panfrost: Rename pan_blend.h into pan_blend_cso.h 10487ec681f3Smrg- panfrost: Move the blend lowering code out of the gallium driver 10497ec681f3Smrg- panfrost: Move the blend logic out of the gallium driver 10507ec681f3Smrg- Revert "pan/bi: Optimize out redundant jumps to #0x0" 10517ec681f3Smrg- pan/bi: Move int64 lowering before idiv lowering 10527ec681f3Smrg- panfrost: Split the direct and indirect draw logic 10537ec681f3Smrg- panfrost: Add a parameter to suppress next job prefetching 10547ec681f3Smrg- panfrost: Allow passing an explicit global dependency when queuing a job 10557ec681f3Smrg- panfrost: Add a pan_section_offset() helper 10567ec681f3Smrg- panfrost: Move pan_special_varying definition to pan_encoder.h 10577ec681f3Smrg- pan/bi: Extend the bi_builder to support type variants correctly 10587ec681f3Smrg- panfrost: Add a knob to disable the UBO -> push constants optimization 10597ec681f3Smrg- panfrost: Allow passing an explicit UBO index for the sysval UBO 10607ec681f3Smrg- panfrost: Print the correct UBO size when dumping UBO information 10617ec681f3Smrg- panfrost: Don't count the special vertex/instance ID attributes on Bifrost 10627ec681f3Smrg- panfrost: Split the sampler and texture count 10637ec681f3Smrg- panfrost: Expose panfrost_modifier_to_layout() 10647ec681f3Smrg- pan/gen_pack: Parse alignment requirements 10657ec681f3Smrg- panfrost: Specify descriptor alignment requirements 10667ec681f3Smrg- panfrost: Provide various helpers to simplify descriptor allocation 10677ec681f3Smrg- panfrost: Define the Surface and Surface-with-stride descriptors 10687ec681f3Smrg- panfrost: Emit surface descriptors with pan_pack() 10697ec681f3Smrg- panfrost: Use the descriptor allocators where appropriate 10707ec681f3Smrg- panfrost: Get rid of panfrost_pool_alloc() 10717ec681f3Smrg- panfrost: Move the blend shader cache at the device level 10727ec681f3Smrg- panfrost: Use the blend shader cache attached to the device 10737ec681f3Smrg- panfrost: Don't reserve space in the color buffer for disabled RTs 10747ec681f3Smrg- panfrost: Skip disabled RTs when selecting a RT for transaction elimination 10757ec681f3Smrg- panfrost: Stop including pan_device.h from pan_bo.h 10767ec681f3Smrg- panfrost: Add helpers to support indirect draws 10777ec681f3Smrg- panfrost: Prepare things for indirect draws 10787ec681f3Smrg- panfrost: Hook up indirect draw support 10797ec681f3Smrg- panfrost: s/panfrost_slice/pan_image_slice_layout/ 10807ec681f3Smrg- panfrost: Move image states out of pan_image_layout 10817ec681f3Smrg- panfrost: Add a format field to pan_image_layout 10827ec681f3Smrg- panfrost: Stop passing a depth > 1 when creating 2D textures 10837ec681f3Smrg- panfrost: Add extra info to the pan_image_layout struct 10847ec681f3Smrg- panfrost: Split pan_image in two 10857ec681f3Smrg- panfrost: Add an offset field so we can attach a sub-buffer to an image 10867ec681f3Smrg- panfrost: Move out-of-band CRC info to pan_image 10877ec681f3Smrg- panfrost: Move special Z32_S8X24 case out of panfrost_setup_layout() 10887ec681f3Smrg- panfrost: Add a pan_image_layout_init() helper 10897ec681f3Smrg- panfrost: Patch the gallium driver to use pan_image_layout_init() 10907ec681f3Smrg- panfrost: Pass an image view to panfrost_new_texture() 10917ec681f3Smrg- panfrost: Provide a helper to calculate the polygon list size 10927ec681f3Smrg- panfrost: Provide a helper to retrieve image surface pointers 10937ec681f3Smrg- panfrost: Pass a const device to panfrost_sample_positions() 10947ec681f3Smrg- pan/midg: Use the sampler index passed to the texture instruction 10957ec681f3Smrg- panfrost: Add various helpers to simplify FB desc emission 10967ec681f3Smrg- panfrost: Add an helper to emit fragment jobs 10977ec681f3Smrg- panfrost: Add align info to the draw and draw padding definitions 10987ec681f3Smrg- panfrost: Add the early ZS pre frame mode 10997ec681f3Smrg- panfrost: s/pandecode_vertex_tiler_postfix_pre/pandecode_dcd/ 11007ec681f3Smrg- panfrost: Decode pre/post frame DCDs 11017ec681f3Smrg- panfrost: Extend pan_fb_info to allow passing a tile enable map 11027ec681f3Smrg- panfrost: Extend pan_fb_info to allow passing pre/post frame DCDs 11037ec681f3Smrg- panfrost: Always pass a non-NULL screen to set_damage_region() 11047ec681f3Smrg- panfrost: Create a blitter library to replace the existing preload helpers 11057ec681f3Smrg- panfrost: Fix partial update 11067ec681f3Smrg- panfrost: Use the generic preload and FB helpers in the gallium driver 11077ec681f3Smrg- panfrost: Kill the old tile-buffer preload logic 11087ec681f3Smrg- panfrost: Pass a tile enable map to avoid reloading untouched tiles 11097ec681f3Smrg- panfrost: Fix pan_blitter_get_blit_shader() 11107ec681f3Smrg- panfrost: Don't advertise AFBC mods when the format is not supported 11117ec681f3Smrg- panfrost: Reserve thread storage descriptor in panfrost_launch_grid() 11127ec681f3Smrg- panfrost: Fix indirect draws 11137ec681f3Smrg- panfrost: Fix ZS reloading on Bifrost v6 11147ec681f3Smrg 11157ec681f3SmrgBoyuan Zhang (2): 11167ec681f3Smrg 11177ec681f3Smrg- frontend/va/image: add pipe flush for vlVaPutImage 11187ec681f3Smrg- frontends/omx/h265: search entire dpb list 11197ec681f3Smrg 11207ec681f3SmrgCaio Marcelo de Oliveira Filho (43): 11217ec681f3Smrg 11227ec681f3Smrg- intel/fs: Separate SLM size calculation from encoding 11237ec681f3Smrg- nir: Add a data pointer to the callback in nir_remove_dead_variables 11247ec681f3Smrg- spirv: Don't remove variables used by resource indexing intrinsics 11257ec681f3Smrg- nir/linking: Remove system_value handling from helper 11267ec681f3Smrg- compiler: Use util/bitset.h for system_values_read 11277ec681f3Smrg- ci: Add nouveau chipset 162 to shader-db runs 11287ec681f3Smrg- vulkan: Update XML and headers to 1.2.168 11297ec681f3Smrg- spirv: Update headers and metadata from latest Khronos commit 11307ec681f3Smrg- nir: Two shared memory \*blocks* may alias each other 11317ec681f3Smrg- spirv: Implement SPV_KHR_workgroup_memory_explicit_layout 11327ec681f3Smrg- anv: Implement VK_KHR_workgroup_memory_explicit_layout 11337ec681f3Smrg- spirv: Don't bother counting num_images/num_textures 11347ec681f3Smrg- spirv: Don't remove dead variables in \`create_library` mode 11357ec681f3Smrg- spirv: Store SPIR-V version of the module 11367ec681f3Smrg- spirv: Refactor variable initializer code 11377ec681f3Smrg- spirv: Recognize zero initializers in Workgroup variables 11387ec681f3Smrg- nir: Add nir_zero_initialize_shared_memory 11397ec681f3Smrg- anv: Implement VK_KHR_zero_initialize_workgroup_memory 11407ec681f3Smrg- spirv: Fail when parsing invalid Initializers 11417ec681f3Smrg- spirv: Use OpEntryPoint to identify valid I/O variables 11427ec681f3Smrg- spirv: Count variables \*after* unused ones are removed 11437ec681f3Smrg- spirv: Skip creating unused variables in SPIR-V >= 1.4 11447ec681f3Smrg- spirv: Allow variable pointers pointing to an array of blocks 11457ec681f3Smrg- intel/compiler: Use gl_varying_slot_name_for_stage() 11467ec681f3Smrg- freedreno/ir3: Use gl_varying_slot_name_for_stage() 11477ec681f3Smrg- etnaviv: Use gl_varying_slot_name_for_stage() 11487ec681f3Smrg- st/atifs: Use gl_varying_slot_name_for_stage() 11497ec681f3Smrg- compiler: Drop now unused gl_varying_slot_name() 11507ec681f3Smrg- spirv: Reuse nir_is_per_vertex_io() 11517ec681f3Smrg- spirv: Explicitly break when finished handling SpvDecorationBuiltIn 11527ec681f3Smrg- spirv: Update a couple of comments in variable handling 11537ec681f3Smrg- anv: Lower ViewIndex to zero when multiview is disabled 11547ec681f3Smrg- spirv: Update headers and metadata from latest Khronos commit 11557ec681f3Smrg- nir: Handle deref_atomic_fadd in a couple of passes 11567ec681f3Smrg- intel/compiler: Make vue_map parameter const for brw_compile_fs 11577ec681f3Smrg- intel/compiler: Use a struct for brw_compile_fs parameters 11587ec681f3Smrg- intel/compiler: Use a struct for brw_compile_vs parameters 11597ec681f3Smrg- intel/compiler: Refactor the shader INTEL_DEBUG checks 11607ec681f3Smrg- intel/compiler: Make brw_postprocess_nir take debug_enabled as a parameter 11617ec681f3Smrg- intel/compiler: Make vec4 generator take debug_enabled as a parameter 11627ec681f3Smrg- intel/compiler: Make visitors take debug_enabled as a parameter 11637ec681f3Smrg- intel/compiler: Use INTEL_DEBUG=blorp to dump blorp shaders 11647ec681f3Smrg- intel/compiler: Use a struct for brw_compile_cs parameters 11657ec681f3Smrg 11667ec681f3SmrgChad Versace (30): 11677ec681f3Smrg 11687ec681f3Smrg- anv/image: Replace bo_is_owned with from_gralloc (v2) 11697ec681f3Smrg- anv/image: Rename anv_image_plane::surface -> primary_surface 11707ec681f3Smrg- anv/image: Move vkGetImageMemoryRequirements 11717ec681f3Smrg- anv/image: Drop duplicate 'format' in anv_image_create() 11727ec681f3Smrg- anv/image: Fix interpretation of 'disjoint' 11737ec681f3Smrg- anv/android: Fix size check for imported gralloc bo 11747ec681f3Smrg- anv: Add anv_surface_is_valid() 11757ec681f3Smrg- anv/image: Clean up anv_GetImageMemoryRequirements2 11767ec681f3Smrg- anv: Refactor anv_image_get_compression_state_addr 11777ec681f3Smrg- anv/image: Add anv_image_address() 11787ec681f3Smrg- blorp/gen12: Don't use aux address if implicit CCS 11797ec681f3Smrg- anv/image: Make memory layout more explicit 11807ec681f3Smrg- vulkan: Track dependencies of Python imports 11817ec681f3Smrg- anv/image: Simplify assertions in anv_image_from_swapchain() 11827ec681f3Smrg- anv/image: Fix tiling if VkImageSwapchainCreateInfoKHR 11837ec681f3Smrg- anv/image: In vkCreateDmaBufImageINTEL use modifiers 11847ec681f3Smrg- anv/image: Check that anv_image is compatible with its modifier 11857ec681f3Smrg- anv/image: Refactor check_memory_bindings() 11867ec681f3Smrg- anv/image: Fix cleanup of failed image creation 11877ec681f3Smrg- anv/image: Add ANV_IMAGE_MEMORY_BINDING_PRIVATE 11887ec681f3Smrg- anv/image: Fix Vk*ImagePlaneMemory*Info for modifier images 11897ec681f3Smrg- anv: Move assert in vkGetImageSubresourceLayout 11907ec681f3Smrg- anv/image: Fix vkGetImageSubresourceLayout for modifier images 11917ec681f3Smrg- anv: Implement image acquire/release of modifier images 11927ec681f3Smrg- anv: Declare anv_layout_to_* as pure functions 11937ec681f3Smrg- anv/image: Add 'offset' param to add_surface() 11947ec681f3Smrg- anv/image: Support VkImageDrmFormatModifierExplicitCreateInfoEXT 11957ec681f3Smrg- anv: Enable VK_EXT_image_drm_format_modifier 11967ec681f3Smrg- anv: Remove vkCreateDmaBufINTEL (v4) 11977ec681f3Smrg- anv: Drop unused anv_image_create_info::stride 11987ec681f3Smrg 11997ec681f3SmrgCharmaine Lee (1): 12007ec681f3Smrg 12017ec681f3Smrg- gallivm: increase size of texture target enum bitfield 12027ec681f3Smrg 12037ec681f3SmrgChia-I Wu (38): 12047ec681f3Smrg 12057ec681f3Smrg- virgl: update headers 12067ec681f3Smrg- virgl: add support for VIRGL_CAP_V2_UNTYPED_RESOURCE 12077ec681f3Smrg- targets/libgl-xlib: add support for virgl 12087ec681f3Smrg- virgl: update headers from virglrenderer 12097ec681f3Smrg- venus: add driver skeleton 12107ec681f3Smrg- venus: add generated venus-protocol headers 12117ec681f3Smrg- venus: add experimental renderers 12127ec681f3Smrg- venus: add a CS encoder/decoder 12137ec681f3Smrg- venus: add a ring buffer 12147ec681f3Smrg- venus: initial support for vkCreateInstance 12157ec681f3Smrg- venus: initial support for VkPhysicalDevice commands 12167ec681f3Smrg- venus: initial support for VkDevice commands 12177ec681f3Smrg- venus: initial support for queue/fence/semaphore 12187ec681f3Smrg- venus: initial support for VkDeviceMemory commands 12197ec681f3Smrg- venus: initial support for buffers/images/samplers 12207ec681f3Smrg- venus: initial support for descriptor sets 12217ec681f3Smrg- venus: initial support for render pass and fb 12227ec681f3Smrg- venus: initial support for events and queries 12237ec681f3Smrg- venus: initial support for module and pipelines 12247ec681f3Smrg- venus: initial support for command buffers 12257ec681f3Smrg- venus: advertise extensions promoted to 1.1 12267ec681f3Smrg- venus: advertise extensions promoted to 1.2 12277ec681f3Smrg- venus: initial support for transform feedback 12287ec681f3Smrg- venus: initial support for WSI 12297ec681f3Smrg- venus: update venus-protocol headers 12307ec681f3Smrg- venus: prepare for splitting vn_device.[ch] 12317ec681f3Smrg- venus: split out vn_command_buffer.[ch] 12327ec681f3Smrg- venus: split out vn_pipeline.[ch] 12337ec681f3Smrg- venus: split out vn_query_pool.[ch] 12347ec681f3Smrg- venus: split out vn_render_pass.[ch] 12357ec681f3Smrg- venus: split out vn_descriptor_set.[ch] 12367ec681f3Smrg- venus: split out vn_buffer.[ch] 12377ec681f3Smrg- venus: split out vn_image.[ch] 12387ec681f3Smrg- venus: split out vn_device_memory.[ch] 12397ec681f3Smrg- venus: split out vn_queue.[ch] 12407ec681f3Smrg- venus: include individual venus-protcol headers 12417ec681f3Smrg- ci: enable venus in some meson build jobs 12427ec681f3Smrg- venus: check vn_renderer_info::vk_xml_version 12437ec681f3Smrg 12447ec681f3SmrgChristian Gmeiner (26): 12457ec681f3Smrg 12467ec681f3Smrg- etnaviv: handle NULL views in set_sampler_views 12477ec681f3Smrg- vc4: add drm-shim 12487ec681f3Smrg- ci: Update baremetal kernel to 5.11 plus patches 12497ec681f3Smrg- nir: add load_texture_rect_scaling 12507ec681f3Smrg- nir: add has_txs flag 12517ec681f3Smrg- nir/lower_tex: 'txs free' tex_rect lowering 12527ec681f3Smrg- nir/lower_tex: wider usage of nir_tex_instr_src_index(..) 12537ec681f3Smrg- gallium: add PIPE_CAP_TEXRECT 12547ec681f3Smrg- gallium/st: lower rectangle textures if not supported 12557ec681f3Smrg- ttn: lower rectangle textures if not supported 12567ec681f3Smrg- etnaviv: nir: support nir_intrinsic_load_texture_rect_scaling 12577ec681f3Smrg- etnaviv: let st lower rect tex 12587ec681f3Smrg- vc4: let st lower rect tex 12597ec681f3Smrg- etnaviv: nir: add ubo lowering pass 12607ec681f3Smrg- etnaviv: use nir_lower_uniforms_to_ubo(..) 12617ec681f3Smrg- etnaviv: fix etna_nir_lower_ubo_to_uniform pass 12627ec681f3Smrg- etnaviv: extend lower ubo tests 12637ec681f3Smrg- gallium: call util_cpu_detect() 12647ec681f3Smrg- etnaviv: use nir_lower_idiv(..) before opt loop 12657ec681f3Smrg- ci/bare-metal: fix fastboot 12667ec681f3Smrg- etnaviv: etnaviv: put sampler limit determination into own function 12677ec681f3Smrg- etnaviv: factor out TS state emitting 12687ec681f3Smrg- etnaviv: add support for NTE 12697ec681f3Smrg- etnaviv: rename struct members 12707ec681f3Smrg- ci/bare-metal: no need to use tee 12717ec681f3Smrg- etnaviv: tell the truth if alpha-test is supported 12727ec681f3Smrg 12737ec681f3SmrgConnor Abbott (61): 12747ec681f3Smrg 12757ec681f3Smrg- nir/lower_tex: Handle sized tex destination types 12767ec681f3Smrg- freedreno/ir3: Handle sized tex destination types 12777ec681f3Smrg- ntt: Handle sized tex destination types 12787ec681f3Smrg- nir/lower_bool: Rewrite dest_type for boolean destinations 12797ec681f3Smrg- brw/vec4: Don't convert tex dest type to glsl_type 12807ec681f3Smrg- radv/meta: Use sized types for nir_tex_instr::dest_type 12817ec681f3Smrg- v3dv/meta: Use sized types for nir_tex_instr::dest_type 12827ec681f3Smrg- intel/blorp: Use sized types for nir_tex_instr::dest_type 12837ec681f3Smrg- anv: Use sized types for nir_tex_instr::dest_type 12847ec681f3Smrg- dxil: Use sized types for nir_tex_instr::dest_type 12857ec681f3Smrg- panfrost/blit: Use sized types for nir_tex_instr::dest_type 12867ec681f3Smrg- d3d12/blit: Use sized types for nir_tex_instr::dest_type 12877ec681f3Smrg- nir: Use sized types for nir_tex_instr::dest_type 12887ec681f3Smrg- st/mesa: Use sized types for nir_tex_instr::dest_type 12897ec681f3Smrg- gallium/nir: Use sized types for nir_tex_instr::dest_type 12907ec681f3Smrg- ttn: Use sized types for nir_tex_instr::dest_type 12917ec681f3Smrg- st/atifs: Use sized types for nir_tex_instr::dest_type 12927ec681f3Smrg- glsl/nir: Use sized types for nir_tex_instr::dest_type 12937ec681f3Smrg- vtn: Use sized types for nir_tex_instr::dest_type 12947ec681f3Smrg- ptn: Use sized types for nir_tex_instr::dest_type 12957ec681f3Smrg- nir: Validate nir_tex_instr::dest_type bitsize 12967ec681f3Smrg- nir/lower_tex: Assume that nir_tex_instr::dest_type is sized 12977ec681f3Smrg- panfrost: Assume that nir_tex_instr::dest_type is sized 12987ec681f3Smrg- ir3: Assume that nir_tex_instr::dest_type is sized 12997ec681f3Smrg- ntt: Assume that nir_tex_instr::dest_type is sized 13007ec681f3Smrg- freedreno/a6xx: Document threadsize-related fields 13017ec681f3Smrg- freedreno/cffdec: Use rb trees for tracking buffers 13027ec681f3Smrg- ir3/parser: Fix parsing of "0.0" in @const line 13037ec681f3Smrg- freedreno/computerator: Fix example assembly 13047ec681f3Smrg- ir3/parser: Support labels 13057ec681f3Smrg- ir3/parser: Add ability to specify branchstack 13067ec681f3Smrg- freedreno/computerator: Add branching example 13077ec681f3Smrg- freedreno/computerator: Fix thrsz type 13087ec681f3Smrg- freedreno/a6xx: Fix compute threadsize type 13097ec681f3Smrg- freedreno/registers: Handle typed registers with fields 13107ec681f3Smrg- freedreno/a6xx: Cleanup SP_XS_CTRL_REG0 definitions 13117ec681f3Smrg- freedreno: Add local_size to ir3_shader_variant 13127ec681f3Smrg- ir3: Calcuate max_waves and threadsize 13137ec681f3Smrg- turnip: Use threadsize calculated by ir3 13147ec681f3Smrg- freedreno: Use threadsize calculated by ir3 13157ec681f3Smrg- freedreno/computerator: Use threadsize calculated by ir3 13167ec681f3Smrg- freedreno: Report max_waves in shaderdb output 13177ec681f3Smrg- freedreno/computerator: Add script for finding reg file size 13187ec681f3Smrg- util/bitset: Avoid out-of-bounds reads 13197ec681f3Smrg- freedreno/a3xx: Fix SP_FS_CTRL_REG1_INITIALOUTSTANDING 13207ec681f3Smrg- ir3/legalize: Fix last input (ss) insertion 13217ec681f3Smrg- ir3: Fix valid flags for STIB 13227ec681f3Smrg- ir3/cp_postsched: Set address of uses for relative mov's 13237ec681f3Smrg- ir3: Don't copy propagate arrays in ir3_cp 13247ec681f3Smrg- ir3/postsched: Make sure to schedule inputs before kill 13257ec681f3Smrg- vtn: Handle ZeroExtend/SignExtend image operands 13267ec681f3Smrg- tu: Expose VK_KHR_spirv_1_4 and VK_EXT_scalar_block_layout 13277ec681f3Smrg- tu: Expose VK_KHR_relaxed_block_layout 13287ec681f3Smrg- ir3/sched: Don't penalize uses of already-waited tex/SFU 13297ec681f3Smrg- ir3/sched: Don't schedule too many tex/SFU instructions 13307ec681f3Smrg- ir3: Fix list corruption in legalize_block() 13317ec681f3Smrg- tu: Correctly preserve old push descriptor contents 13327ec681f3Smrg- ir3: Prevent oob writes to inputs/outputs array 13337ec681f3Smrg- nir/lower_clip_disable: Fix store writemask 13347ec681f3Smrg- tu: Fix SP_GS_PRIM_SIZE for large sizes 13357ec681f3Smrg- ir3/postsched: Fix dependencies for a0.x/p0.x 13367ec681f3Smrg 13377ec681f3SmrgCorentin Noël (1): 13387ec681f3Smrg 13397ec681f3Smrg- ci: Use lavacli from master 13407ec681f3Smrg 13417ec681f3SmrgDaniel Schürmann (67): 13427ec681f3Smrg 13437ec681f3Smrg- aco: fix VOP3P assembly, VN and validation 13447ec681f3Smrg- aco/RA: fix subdword operands on VOP3P instructions 13457ec681f3Smrg- aco: allow constants/literals on every src position for VOP3P 13467ec681f3Smrg- aco: allow SGPRs on every src position for VOP3P 13477ec681f3Smrg- aco: change usesModifiers() considering opsel_hi on packed instructions 13487ec681f3Smrg- aco: create helpers to emit vop3p instructions 13497ec681f3Smrg- aco: emit packed 16bit instructions 13507ec681f3Smrg- radv: vectorize 16bit instructions 13517ec681f3Smrg- aco: simplify multiply-add combining 13527ec681f3Smrg- aco: optimize packed mul+add to v_pk_fma_f16 13537ec681f3Smrg- aco: optimize packed clamp 13547ec681f3Smrg- aco: optimize packed fneg 13557ec681f3Smrg- aco: optimize v_pk_fma_f16 -> v_pk_fmac_f16 on GFX10 13567ec681f3Smrg- aco: propagate swizzles when optimizing packed clamp & fma 13577ec681f3Smrg- aco: remove divergent branches which only jump over very few instructions 13587ec681f3Smrg- aco/optimizer: don't copy-prop logical phis 13597ec681f3Smrg- aco/optimizer: don't propagate subdword temps of different size 13607ec681f3Smrg- aco: generalize subdword constant copy lowering 13617ec681f3Smrg- aco/validate: validate that p_create_vector operands are aligned unless they are subdword operands 13627ec681f3Smrg- aco/validate: ensure that Operand and Definition size matches for parallelcopies 13637ec681f3Smrg- aco/validate: relax subdword restrictions 13647ec681f3Smrg- aco: propagate temporaries into PSEUDO instructions if it can take it 13657ec681f3Smrg- aco/optimizer: expand subdword vectors with SGPRs on all generations 13667ec681f3Smrg- aco/optimizer: convert extract_vector with index 0 into parallelcopies if possible 13677ec681f3Smrg- radv: don't vectorize shift operations 13687ec681f3Smrg- aco: fix VCC hint on boolean subgroup operations 13697ec681f3Smrg- aco: fix nir_intrinsic_ballot with wave32 13707ec681f3Smrg- aco: fix shared VGPR allocation on RDNA2 13717ec681f3Smrg- aco: change gpr_alloc_granule to full alignment 13727ec681f3Smrg- aco: refactor GPR limit calculation 13737ec681f3Smrg- aco: don't decrease the vgpr_limit when encountering bpermute 13747ec681f3Smrg- aco: also consider VCC in get_reg_specified() 13757ec681f3Smrg- aco: check get_reg_specified() on register hints 13767ec681f3Smrg- aco: don't abort() if disassembly fails 13777ec681f3Smrg- aco: use VCC as regular SGPR pair on GFX10 13787ec681f3Smrg- aco: don't create unnecessary exec phi on merge blocks 13797ec681f3Smrg- aco: handle non-temp phi definitions and operands 13807ec681f3Smrg- aco: make all exec accesses non-temporaries 13817ec681f3Smrg- aco: remove dead code for the handling of exec temporaries 13827ec681f3Smrg- aco: fix assertion in insert_exec_mask pass 13837ec681f3Smrg- nir: lower load_helper to is_helper if the shader uses demote() 13847ec681f3Smrg- nir: lower is/load_helper to zero if no helper lanes are needed 13857ec681f3Smrg- aco: remove special handling of load_helper_invocation 13867ec681f3Smrg- aco: don't rematerialize exec 13877ec681f3Smrg- aco: value number VOPC instructions with different exec masks 13887ec681f3Smrg- aco/value_numbering: use can_eliminate() function to avoid unnecessary hashmap lookups 13897ec681f3Smrg- aco/optimizer: set VCC hint on new v_cmp_* definitions 13907ec681f3Smrg- aco/ra: allow VCC on SMEM sbase operand on GFX10+ 13917ec681f3Smrg- .mailmap: fix email for Daniel Schürmann 13927ec681f3Smrg- aco/ra: split affinity creation into separate function 13937ec681f3Smrg- aco/ra: split register_file initialization into separate function 13947ec681f3Smrg- aco/ra: refactor SSA repairing during register allocation 13957ec681f3Smrg- aco/ra: iterate backwards when coalescing phis 13967ec681f3Smrg- aco/ra: allow m0 in get_reg_specified() 13977ec681f3Smrg- aco/ra: remove exec handling for phis 13987ec681f3Smrg- aco/spill: refactor spill decision taking 13997ec681f3Smrg- aco/spill: reload spilled exec masks directly to exec 14007ec681f3Smrg- aco/spill: spill phi constants and exec directly to VGPR 14017ec681f3Smrg- aco/spill: don't count phis as variable access 14027ec681f3Smrg- aco/spill: refactor some more spill decision taking 14037ec681f3Smrg- aco/spill: refactor live-in registerDemand calculation 14047ec681f3Smrg- aco/spill: use correct next_use_distances at loop header 14057ec681f3Smrg- aco: lower p_spill with constants correctly 14067ec681f3Smrg- aco: fix kill flags on phi operands 14077ec681f3Smrg- aco: add new reindex_ssa() pass 14087ec681f3Smrg- aco/cssa: rewrite lower_to_cssa pass 14097ec681f3Smrg- aco/cssa: don't create parallelcopies for constants and exec 14107ec681f3Smrg 14117ec681f3SmrgDaniel Stone (3): 14127ec681f3Smrg 14137ec681f3Smrg- CI: Try really hard to get updated Windows TLS certs 14147ec681f3Smrg- CI: Trigger Windows builds for llvmpipe & Vulkan changes 14157ec681f3Smrg- CI: Change LAVA job visibility 14167ec681f3Smrg 14177ec681f3SmrgDanylo Piliaiev (50): 14187ec681f3Smrg 14197ec681f3Smrg- turnip/ir3: handle image load/stores produced by AtomicLoad/Store 14207ec681f3Smrg- turnip: make GS use correct varyings size from previous stage 14217ec681f3Smrg- ir3: add debug option to override shader assembly 14227ec681f3Smrg- freedreno/ir3/parser: add cat7 support 14237ec681f3Smrg- turnip: don't emit tess consts if they are not used 14247ec681f3Smrg- freedreno: clamp scissor bounds 14257ec681f3Smrg- freedreno/a2xx: fix scissors clamp bounds 14267ec681f3Smrg- turnip: enable inheritedQueries 14277ec681f3Smrg- turnip: consider HW limit on number of views when apply multipos opt 14287ec681f3Smrg- turnip: consider tile_max_h when calculating tiling config 14297ec681f3Smrg- turnip,freedreno/a6xx: tell hw the size of shared mem used by CS 14307ec681f3Smrg- turnip/ir3: check for bindless IBOs in atomic dests fixup 14317ec681f3Smrg- turnip: fix leak of tu_shader object during compute pipeline creation 14327ec681f3Smrg- ir3: prevent duplication of instruction's dependencies 14337ec681f3Smrg- ir3: make mark_kill_path exit early if instr is already seen 14347ec681f3Smrg- ir3: disallow moving memory writes over discard 14357ec681f3Smrg- freedreno/hw: fix populating branch targets in isa_decode pre-pass 14367ec681f3Smrg- turnip: fix SP_HS_WAVE_INPUT_SIZE value 14377ec681f3Smrg- freedreno/a5xx: port handling of PIPE_BUFFER textures from a6xx 14387ec681f3Smrg- ir3: use OPC_GETBUF to get size of sampler buffers 14397ec681f3Smrg- turnip: lower device index to zero 14407ec681f3Smrg- turnip: fill VkMemoryDedicatedRequirements 14417ec681f3Smrg- turnip: set zmode to A6XX_EARLY_Z if FS forces early fragment test 14427ec681f3Smrg- turnip: implement intrinsic_vulkan_resource_reindex 14437ec681f3Smrg- ci/freedreno: run freedreno jobs on any change in src/freedreno/ 14447ec681f3Smrg- ir3: fix oob access to regs array for getbuf,getinfo,rgetinfo 14457ec681f3Smrg- ir3/isa,parser: fix encoding and parsing of bindless s2en SAM 14467ec681f3Smrg- ir3: match mova1 mnemonic when writing to A1 14477ec681f3Smrg- freedreno/isa: assert if field's range is out of bitset's range 14487ec681f3Smrg- ir3: disallow .sat on SEL instructions 14497ec681f3Smrg- ir3: update info about applicability of saturation modifier 14507ec681f3Smrg- turnip: expose several already implemented extensions 14517ec681f3Smrg- nir: add nir_shader_as_str function 14527ec681f3Smrg- turnip: implement VK_KHR_pipeline_executable_properties 14537ec681f3Smrg- turnip: clamp to zero negative upper left corner of viewport 14547ec681f3Smrg- turnip,ir3: account for dispatch group offsets 14557ec681f3Smrg- freedreno/a6xx: copy full 64bit of primitive counter 14567ec681f3Smrg- freedreno/a6xx: fix primitive counters debug output 14577ec681f3Smrg- ir3/isa: account for randomly set by blob lowest bit of ibo atomics 14587ec681f3Smrg- glsl/linker: Fix attempts to split up 64bit varyings between slots 14597ec681f3Smrg- glsl/linker: Fix xfb with explicit locations and 64bit types 14607ec681f3Smrg- ir3: nir_op_f2f16 should round to even 14617ec681f3Smrg- ir3: convert shift amount to 16b for 16b shifts 14627ec681f3Smrg- turnip: enable infinities for f16 math and document the register 14637ec681f3Smrg- turnip: enable VK_KHR_16bit_storage on A650 14647ec681f3Smrg- turnip: handle format list for compressed formats 14657ec681f3Smrg- docs: mark float_controls,float16_int8,16bit_storage as done on Turnip 14667ec681f3Smrg- turnip: fix alignment of non-32b types in workgroup memory 14677ec681f3Smrg- turnip: implement variableMultisampleRate 14687ec681f3Smrg- turnip: support copying both aspects of D32_SFLOAT_S8_UINT 14697ec681f3Smrg 14707ec681f3SmrgDave Airlie (163): 14717ec681f3Smrg 14727ec681f3Smrg- device-select-layer: update for vulkan 1.2 14737ec681f3Smrg- lavapipe: fix missing piece of VK_KHR_get_physical_device_properties2 14747ec681f3Smrg- vk-device-select: add device group support 14757ec681f3Smrg- lavapipe: refactor image surface creation 14767ec681f3Smrg- lavapipe: rewrite attachment clearing for conditional rendering. 14777ec681f3Smrg- gallium: add a cond rendering hook for vulkan. 14787ec681f3Smrg- llvmpipe: handle vulkan conditional rendering 14797ec681f3Smrg- lavapipe: add VK_EXT_conditional_rendering support. 14807ec681f3Smrg- CI: add lavapipe to llvmpipe rules. 14817ec681f3Smrg- lavapipe: add support for external memory/fd/sempahore extensions 14827ec681f3Smrg- llvmpipe: handle firstvertex for vulkan draw parameters 14837ec681f3Smrg- lavapipe: handle shader draw parameters 14847ec681f3Smrg- lavapipe: add missing loader interface negoitation 14857ec681f3Smrg- lavapipe: move to subclassing instance/physical device. 14867ec681f3Smrg- lavapipe: add missing wsi entrypoint. 14877ec681f3Smrg- lavapipe: sort extensions in proper order. 14887ec681f3Smrg- lavapipe: use common dispatch layer. 14897ec681f3Smrg- radv: move queue object to a common base object 14907ec681f3Smrg- radv: remove all entrypoint enabled debug option 14917ec681f3Smrg- radv: move to subclassed instance/physical_device structs 14927ec681f3Smrg- radv: port to using common dispatch code. 14937ec681f3Smrg- zink: don't pick a cpu device ever. 14947ec681f3Smrg- llvmpipe: add a mutex around debug resource tracking 14957ec681f3Smrg- llvmpipe: fix use after free with fs variant cleanup 14967ec681f3Smrg- lavapipe: reset shader constant buffers after execution 14977ec681f3Smrg- glsl: fix leak in gl_nir_link_uniform_blocks 14987ec681f3Smrg- llvmpipe: enable GL spir-v support 14997ec681f3Smrg- util/format: add helper to check if a format is scaled. 15007ec681f3Smrg- llvmpipe: don't support scaled formats outside vertex buffers 15017ec681f3Smrg- lavapipe: add support for 2/10/10/10 scaled formats. 15027ec681f3Smrg- lavapipe: add support for missing 10/10/10/2 formats. 15037ec681f3Smrg- lavapipe: add reference counting to descriptor set layout 15047ec681f3Smrg- lavapipe: avoid pointer to pipeline layout in execution 15057ec681f3Smrg- lavapipe: set viewport state dirty on first execute 15067ec681f3Smrg- lavapipe: implement physical device group enumeration 15077ec681f3Smrg- lavapipe/meson: drop megadrivers build req 15087ec681f3Smrg- lavapipe: fix some void ptr arithmetic 15097ec681f3Smrg- lavapipe: use msvc compatible 0 init 15107ec681f3Smrg- lavepipe: some misc msvc fixes 15117ec681f3Smrg- lavapipe: make OPT macro MSVC compatible 15127ec681f3Smrg- lavapipe: use os_time for timing related things 15137ec681f3Smrg- vulkan/util: add api to reset object magic + private data. 15147ec681f3Smrg- radv: reset object base on recycled command buffers 15157ec681f3Smrg- tu: reset object base on recycled command buffers 15167ec681f3Smrg- lavapipe: reset object base on recycled command buffers 15177ec681f3Smrg- util: add optimised memset64 15187ec681f3Smrg- u_surface: use optimised memset64 15197ec681f3Smrg- llvmpipe: zs clear use 64-bit memset 15207ec681f3Smrg- lavapipe: use clear interface for renderpass clears 15217ec681f3Smrg- glx: proposed fix for setSwapInterval 15227ec681f3Smrg- zink: use extensioned draw indirect functions. 15237ec681f3Smrg- zink/ci: update results now that we are testing zink/lavapipe 15247ec681f3Smrg- lavapipe: add calibrated timestamp support 15257ec681f3Smrg- zink/ci: update results for GL 3.3 testing enables 15267ec681f3Smrg- zink/ci: disable arb_timer_query tests 15277ec681f3Smrg- lavapipe: use the common icd generator 15287ec681f3Smrg- lavapipe: fix msvc initialiser 15297ec681f3Smrg- lavapipe: add dll definition file instead of using PUBLIC 15307ec681f3Smrg- lavapipe: fix icd generation for windows 15317ec681f3Smrg- meson/llvm: add native for gallium swrast 15327ec681f3Smrg- lavapipe: handle tessellation domain winding 15337ec681f3Smrg- lavapipe: enable KHR_maintenance2 15347ec681f3Smrg- lavapipe: enable KHR_maintenace3 15357ec681f3Smrg- lavapipe: fix descriptor set layout freeing. 15367ec681f3Smrg- lavapipe: fix depth texturing swizzle 15377ec681f3Smrg- lavapipe: use null probe path on win32 15387ec681f3Smrg- ci: try building lavapipe on windows 15397ec681f3Smrg- zink/instance: work with vulkan 1.0 and later loader. 15407ec681f3Smrg- lavapipe: expose a 1.0 vulkan API for now. 15417ec681f3Smrg- lavapipe: Define supported extensions in C 15427ec681f3Smrg- lavapipe: VK_EXT_extended_dynamic_state support 15437ec681f3Smrg- lavapipe: reorder descriptor set stages to get correct binding 15447ec681f3Smrg- lavapipe: sort bindings before creating descriptor set 15457ec681f3Smrg- clover: fix array images view creation 15467ec681f3Smrg- lavapipe: fix pipeline vp/scissor mixup. 15477ec681f3Smrg- lavapipe: fix dynamic viewport/scissor pipeline emission 15487ec681f3Smrg- draw: fix uses viewport index for tess eval shader 15497ec681f3Smrg- draw/prim_assembler: write correct decomposed primitive lengths 15507ec681f3Smrg- llvmpipe: add support for shader viewport layer 15517ec681f3Smrg- lavapipe: enable EXT_shader_viewport_index_layer 15527ec681f3Smrg- zink/ci: update results after layer extensions enabled in lavapipe 15537ec681f3Smrg- util/panfrost/glsl: rename BITSET_LAST_BIT to BITSET_LAST_BIT_SIZED 15547ec681f3Smrg- util/bitset: add a new last bit api 15557ec681f3Smrg- shader_info: convert textures_used to a bitset. 15567ec681f3Smrg- gallium: add a sampler reduction cap + settings 15577ec681f3Smrg- gallium: add a view mask to the draw command 15587ec681f3Smrg- gallivm: mark subpass input attachments as 2d arrays 15597ec681f3Smrg- gallivm: add support for load_view_index intrinsic 15607ec681f3Smrg- draw: add interface to notify renderer of the current view index 15617ec681f3Smrg- draw: refactor out the instances drawing code 15627ec681f3Smrg- draw: add view_mask rendering support 15637ec681f3Smrg- draw: pass the view index to the render driver 15647ec681f3Smrg- draw/vs: pass the view index to the vertex shader 15657ec681f3Smrg- draw: add tess/gs support for multiview index 15667ec681f3Smrg- llvmpipe: add the view index callback from draw 15677ec681f3Smrg- llvmpipe: add view index support to rasterizer 15687ec681f3Smrg- lavapipe: add clear support for multiview 15697ec681f3Smrg- lavapipe: add draw support for multiview 15707ec681f3Smrg- lavapipe: add input attachment support for multiview 15717ec681f3Smrg- lavapipe: add render pass support for multiview 15727ec681f3Smrg- lavapipe: enable KHR_multiview 15737ec681f3Smrg- llvmpipe: add reduction mode support 15747ec681f3Smrg- lavapipe: add EXT_sampler_filter_minmax support 15757ec681f3Smrg- lavapipe: add support for VK_KHR_create_renderpass2 15767ec681f3Smrg- lavapipe: move queue to base object 15777ec681f3Smrg- lavapipe: move to the common casting interfaces 15787ec681f3Smrg- lavapipe: move to common create render pass code 15797ec681f3Smrg- lavapipe: add single ssbo variable pointer support. 15807ec681f3Smrg- docs: update lavapipe features.txt 15817ec681f3Smrg- lavapipe: enable KHR_uniform_buffer_standard_layout 15827ec681f3Smrg- lavapipe: enable EXT_scalar_block_layout 15837ec681f3Smrg- lavapipe: add missing break 15847ec681f3Smrg- lavapipe: fix writing availability for queries. 15857ec681f3Smrg- lavapipe: add host query reset 15867ec681f3Smrg- gallivm: convert packing to uint64 not double 15877ec681f3Smrg- lavapipe: only init immutable samplers for correct types. 15887ec681f3Smrg- lavapipe: add support for KHR_buffer_device_address. 15897ec681f3Smrg- lavapipe: bump maxMemoryAllocationCount 15907ec681f3Smrg- lavapipe: fix image format properties 15917ec681f3Smrg- lavapipe: add missing sampler minmax properties 15927ec681f3Smrg- lavapipe: add missing device group api 15937ec681f3Smrg- lavapipe: drop unused vk_format in image struct 15947ec681f3Smrg- lavapipe: fix templated descriptor updates 15957ec681f3Smrg- gallivm: fix non-32bit ubo loads 15967ec681f3Smrg- gallivm/nir: handle bool registers. 15977ec681f3Smrg- nir: port fp16 casting code from dxil 15987ec681f3Smrg- nir: lower 64-bit floats to 32-bit first. 15997ec681f3Smrg- gallivm: use fp16 casts lowering 16007ec681f3Smrg- lavapipe: enable 8/16-bit storage extensions 16017ec681f3Smrg- llvmpipe: fix cube image size query 16027ec681f3Smrg- st/glthread: allow for invalid L3 cache id. 16037ec681f3Smrg- util: rework AMD cpu L3 cache affinity code. 16047ec681f3Smrg- gallivm: add 64-bit atomic support for ssbo/shared. 16057ec681f3Smrg- gallivm: add 64-bit atomic global support 16067ec681f3Smrg- lavapipe: enable KHR_shader_atomic_int64 16077ec681f3Smrg- lavapipe: only reference pCounterBuffers if non-NULL 16087ec681f3Smrg- lavapipe: fail out if spirv->nir fails 16097ec681f3Smrg- lavapipe: fix only clearing depth or stencil paths. 16107ec681f3Smrg- zink/ci: update results after lavapipe clear fixes 16117ec681f3Smrg- lavapipe: add support for KHR_imageless_framebuffer 16127ec681f3Smrg- drisw: move zink down the list below the sw drivers. 16137ec681f3Smrg- zink/ci: handle getting correct drisw driver. 16147ec681f3Smrg- llvmpipe: when depth clamp is disable clamp to 0.0/1.0 16157ec681f3Smrg- llvmpipe: always take depth clamping from state tracker 16167ec681f3Smrg- ci: update zink/virgl results for depth clamping fixes 16177ec681f3Smrg- lavapipe: add vulkan 1.1 properties/features apis 16187ec681f3Smrg- lavapipe: fix missing protected memory properties 16197ec681f3Smrg- gallivm: add subgroup vote 64-bit and feq support. 16207ec681f3Smrg- gallivm: move get_flt_bld to header. 16217ec681f3Smrg- gallivm: add subgroup system values support 16227ec681f3Smrg- gallivm: add subgroup elect intrinsic support. 16237ec681f3Smrg- gallivm: add subgroup reduction + in/ex scan support 16247ec681f3Smrg- gallivm: add subgroup ballot support 16257ec681f3Smrg- gallivm: add subgroup read invocation support 16267ec681f3Smrg- gallivm: add subgroup lowering support 16277ec681f3Smrg- gallivm: add compute shader subgroup system values support 16287ec681f3Smrg- lavapipe: enable subgroups features 16297ec681f3Smrg- lavapipe: enable correct workgroup sizing 16307ec681f3Smrg- lavapipe: enable Vulkan 1.1 support 16317ec681f3Smrg- docs: update lavapipe bits for 1.1 16327ec681f3Smrg- lavapipe: add vk1.1 image swapchain support 16337ec681f3Smrg- lavapipe: add dummy sampler ycbcr conversion 16347ec681f3Smrg- lavapipe: fix mipmapped resolves. 16357ec681f3Smrg 16367ec681f3SmrgDavid McFarland (1): 16377ec681f3Smrg 16387ec681f3Smrg- radv: fix divide by zero with no tesselation params 16397ec681f3Smrg 16407ec681f3SmrgDouglas Anderson (1): 16417ec681f3Smrg 16427ec681f3Smrg- gallium/indices: Use "__restrict" to help the compiler 16437ec681f3Smrg 16447ec681f3SmrgDrew Davenport (1): 16457ec681f3Smrg 16467ec681f3Smrg- radeonsi: Report multi-plane formats as unsupported 16477ec681f3Smrg 16487ec681f3SmrgDylan Baker (33): 16497ec681f3Smrg 16507ec681f3Smrg- VERSION: bump for 21.1.0 cycle 16517ec681f3Smrg- docs: add release notes for 20.3.3 16527ec681f3Smrg- docs: Add sha256sum for 20.3.3 16537ec681f3Smrg- docs: update calendar and link releases notes for 20.3.3 16547ec681f3Smrg- docs: update calendar for 21.0.0-rc1 16557ec681f3Smrg- bin/post_version: convert the csv.reader into a concrete list 16567ec681f3Smrg- docs: add release notes for 20.3.4 16577ec681f3Smrg- docs: Add sha256sum for 20.3.4 16587ec681f3Smrg- docs: update calendar and link releases notes for 20.3.4 16597ec681f3Smrg- docs: update calendar for 21.0.0-rc2 16607ec681f3Smrg- docs: update calendar for 21.0.0-rc3 16617ec681f3Smrg- Scons: check for timespec_get on windows as well as unices 16627ec681f3Smrg- docs: Remove 21.0 features from features_new.txt 16637ec681f3Smrg- docs: add release notes for 21.0.0 16647ec681f3Smrg- docs: update calendar and link releases notes for 21.0.0 16657ec681f3Smrg- docs: Add calendar entries for 21.0 release. 16667ec681f3Smrg- docs: Extend calendar entries for 21.0 by 1 releases. 16677ec681f3Smrg- docs: Add calendar entries for 21.1 release candidates. 16687ec681f3Smrg- docs: add release notes for 20.3.5 16697ec681f3Smrg- docs: Add hashes for 20.3.5 16707ec681f3Smrg- docs: update calendar and link releases notes for 20.3.5 16717ec681f3Smrg- docs: add release notes for 21.0.1 16727ec681f3Smrg- docs: Add 21.0.1 hashes 16737ec681f3Smrg- docs: update calendar and link releases notes for 21.0.1 16747ec681f3Smrg- docs: add release notes for 21.0.2 16757ec681f3Smrg- relnotes: Add sha256sum for 21.0.2 16767ec681f3Smrg- docs: update calendar and link releases notes for 21.0.2 16777ec681f3Smrg- meson: OpenMP is supposed to be optional 16787ec681f3Smrg- .pick_status.json: Update to ee9b744cb5d1466960e78b1de44ad345590e348c 16797ec681f3Smrg- VERSION: bump for 21.1.0-rc3 16807ec681f3Smrg- .pick_status.json: Update to cbd6e5f2e592a9834a03004a473537f25aea4336 16817ec681f3Smrg- .pick_status.json: Update to ede0b3c643279f4126fb10552a2f1d00be27f16d 16827ec681f3Smrg- .pick_status.json: Update to b80720acb13e1014aea89e6bd25f22d43df85356 16837ec681f3Smrg 16847ec681f3SmrgEdward O'Callaghan (1): 16857ec681f3Smrg 16867ec681f3Smrg- clover: Implement CL_MEM_OBJECT_IMAGE1D 16877ec681f3Smrg 16887ec681f3SmrgEleni Maria Stea (7): 16897ec681f3Smrg 16907ec681f3Smrg- anv: Added the VK_EXT_sample_locations extension to the anv_extensions list 16917ec681f3Smrg- anv: Implement physical device properties for VK_EXT_sample_locations 16927ec681f3Smrg- anv/state: Take explicit sample locations in emit helpers 16937ec681f3Smrg- anv: Add support for sample locations 16947ec681f3Smrg- anv: Removed unused header file 16957ec681f3Smrg- anv: Enabled the VK_EXT_sample_locations extension 16967ec681f3Smrg- iris: fix in fences backend for ext_external_objects edge case 16977ec681f3Smrg 16987ec681f3SmrgElla-0 (1): 16997ec681f3Smrg 17007ec681f3Smrg- glsl: build without bison 17017ec681f3Smrg 17027ec681f3SmrgEnrico Galli (2): 17037ec681f3Smrg 17047ec681f3Smrg- microsoft/spirv_to_dxil: Add support for load_vulkan_descriptor 17057ec681f3Smrg- microsoft/spirv_to_dxil: Use non-zero exit code on failed compilations 17067ec681f3Smrg 17077ec681f3SmrgEric Anholt (207): 17087ec681f3Smrg 17097ec681f3Smrg- gallium/ttn: Add support for TGSI_OPCODE_I64NEG/ABS. 17107ec681f3Smrg- gallium/ntt: Stop lowering integer source mods. 17117ec681f3Smrg- gallium/tgsi: Assert that we don't see integer abs modifiers. 17127ec681f3Smrg- gallium/tgsi: Remove support for f64 src modifiers. 17137ec681f3Smrg- gallium/tgsi: Rewrite the docs on source modifiers. 17147ec681f3Smrg- gallium/tgsi: garbage collect unused TGSI_UTIL_SIGN_MODE. 17157ec681f3Smrg- mesa/st: Make a single helper for the NIR-to-TGSI transfer. 17167ec681f3Smrg- mesa/st: Lower shader images before handing off to NIR-to-TGSI. 17177ec681f3Smrg- mesa/st: Dump nir-to-tgsi output when ST_DEBUG=tgsi or nir is set. 17187ec681f3Smrg- gallium/ntt: Don't vectorize IBFE/UBFE/BFI. 17197ec681f3Smrg- gallium/ntt: Add support for store_per_vertex_output. 17207ec681f3Smrg- gallium/ntt: Avoid referencing undefined channels of system values. 17217ec681f3Smrg- ci/freedreno: Mark some a5xx separate_shader tests as xfails. 17227ec681f3Smrg- ci/freedreno: Fix up the xfail/flake handling of a3xx texture functions. 17237ec681f3Smrg- ci/freedreno: Remove a bunch of stale flakes from a3xx. 17247ec681f3Smrg- ci/freedreno: Drop some long-unseen a5xx flakes. 17257ec681f3Smrg- ci/freedreno: Drop skip list stuff from a5xx flakes. 17267ec681f3Smrg- ci/freedreno: Remove some long-unseen a6xx known flakes. 17277ec681f3Smrg- util/format: Fix pack/unpack of A1R5G5B5_UINT. 17287ec681f3Smrg- mesa: Add some little unit tests showing format unpack behavior. 17297ec681f3Smrg- mesa: Drop incorrect statement about Z unpack behavior. 17307ec681f3Smrg- mesa: Replace the float[4] unpack code with util/format's. 17317ec681f3Smrg- mesa: Make _mesa_unpack_rgba_block() use the u_format pack/unpack. 17327ec681f3Smrg- util: Move most of src/mesa/main/format_utils.h to util/format/ 17337ec681f3Smrg- util: Fix UBSan failure on _mesa_unorm_to_unorm. 17347ec681f3Smrg- util: Fix rounding of unpack_unorm8 from small unorm formats. 17357ec681f3Smrg- mesa: Reuse util_format's unpack_8unorm. 17367ec681f3Smrg- mesa: Reuse util_format's unpack functions for pure integer formats. 17377ec681f3Smrg- util: Give a reasonable answer when unpacking z32unorm from floats. 17387ec681f3Smrg- mesa: Use a bunch of util functions for Z/S unpacking. 17397ec681f3Smrg- mesa: Move the rest of format_unpack.py out of code generation. 17407ec681f3Smrg- util/format: Simplify the generated unpack code. 17417ec681f3Smrg- swrast: Use util_format_write_4/4ub for the scattered pixel writes. 17427ec681f3Smrg- mesa/main: Replace float pack function with util_format_pack_rgba(). 17437ec681f3Smrg- mesa/main: Replace the uint format packing code with util/format's. 17447ec681f3Smrg- ci/piglit: Upgrade to a newer piglit in our containers. 17457ec681f3Smrg- ci/freedreno: Fix xfail setup for sampler3d_float_vertex. 17467ec681f3Smrg- nir_to_tgsi: Store directly to TGSI outputs when possible. 17477ec681f3Smrg- r300,i915g: Report no shader buffers or images on non-TCL HW. 17487ec681f3Smrg- nir_to_tgsi: Fix buffer overflow in atomic image compswap. 17497ec681f3Smrg- swr: Don't report support for shader images. 17507ec681f3Smrg- panfrost: Stub out set_shader_images(). 17517ec681f3Smrg- gallium: Fix leak of shader images on context destruction. 17527ec681f3Smrg- util/bitset: Avoid dereferencing the bitset for size == 0. 17537ec681f3Smrg- ci: Add a fractional deqp run of softpipe with asan enabled. 17547ec681f3Smrg- freedreno/a6xx: Skip the body of emit_state if we're clean. 17557ec681f3Smrg- freedreno: Move blend gmem checks to a blend dirty state check. 17567ec681f3Smrg- freedreno: Move framebuffer state checks under a ctx->dirty flag. 17577ec681f3Smrg- freedreno: Skip some batch dependency tracking if !ctx->dirty. 17587ec681f3Smrg- ci/freedreno: Detect cheza HFI errors and restart the run. 17597ec681f3Smrg- ci/freedreno: Ban vs-clip-vertex-enables which flakes in CI. 17607ec681f3Smrg- ci/freedreno: Ban more flaky clip-enables tests. 17617ec681f3Smrg- ci/freedreno: Make a630 piglit_shader run a manual run, too. 17627ec681f3Smrg- freedreno: Use a real type instead of void * for the fd_batch->key. 17637ec681f3Smrg- freedreno: Early-out from the resource write path when we're the writer. 17647ec681f3Smrg- freedreno: Remove duplicate bc invalidate on flush_write_batch(). 17657ec681f3Smrg- ci: Update baremetal kernel to 5.11-rc5 plus patches. 17667ec681f3Smrg- mesa/st: Allocate the gl_context with 16-byte alignment. 17677ec681f3Smrg- ci/freedreno: Drop pointless GIT_STRATEGY setting for a630. 17687ec681f3Smrg- ci/freedreno: Use the new nginx cache for trace downloads. 17697ec681f3Smrg- ci/freedreno: Use the http cache for artifacts downloads, too. 17707ec681f3Smrg- ci/docs: Update CI farm requirements suggestions. 17717ec681f3Smrg- docs/ci: Document setting up the http cache for traces. 17727ec681f3Smrg- ci/lava+baremetal: Add an xserver to the root fs. 17737ec681f3Smrg- ci/freedreno: Do our piglit runs against Xorg. 17747ec681f3Smrg- ci/freedreno: Add Valve games and other traces now that we have GLX. 17757ec681f3Smrg- freedreno: Make sure that queries are disabled during shadow blits. 17767ec681f3Smrg- freedreno: rename batch->active_providers to query_providers_used. 17777ec681f3Smrg- freedreno: Backport a5xx/a6xx fix for active query handling. 17787ec681f3Smrg- freedreno: Drop pointless clear of used providers. 17797ec681f3Smrg- freedreno/a6xx: Skip guessing VSC size with indirect TF draw counts. 17807ec681f3Smrg- docs: Document PIPE_CAP_STREAM_OUTPUT_PAUSE_RESUME. 17817ec681f3Smrg- freedreno/a6xx: Add support for glDrawTransformFeedback(). 17827ec681f3Smrg- ci/deqp: Bump runner to 0.5.1 for recent runtime perf improvements. 17837ec681f3Smrg- ci/freedreno: bump VK coverage to 1/4 of the CTS. 17847ec681f3Smrg- ci/freedreno: Run a3xx gles3 in parallel and increase coverage. 17857ec681f3Smrg- ci/virgl: Fix GLES31 testing on desktop GL. 17867ec681f3Smrg- freedreno: Force updating active queries on batch reordering. 17877ec681f3Smrg- freedreno: Remove FD_STAGE_* in favor of a "disable_all" flag. 17887ec681f3Smrg- freedreno/a5xx: Don't forget to count our custom blits against queries. 17897ec681f3Smrg- mesa/st: Always precompile the first shader variant. 17907ec681f3Smrg- mesa/st: Assume that the default variant is always first in the list. 17917ec681f3Smrg- vc4: Remove vestiges of alpha test lowering. 17927ec681f3Smrg- vc4: Stop advertising support for PIPE_CAP_TWO_SIDED_COLOR. 17937ec681f3Smrg- vc4: Stop advertising support for VS color clamping. 17947ec681f3Smrg- v3d: Clean up vestiges of alpha test lowering. 17957ec681f3Smrg- v3d: Stop advertising support for PIPE_CAP_TWO_SIDED_COLOR. 17967ec681f3Smrg- v3d: Stop advertising support for PIPE_CAP_*_COLOR_CLAMPED. 17977ec681f3Smrg- v3d: Stop advertising support for flat shading. 17987ec681f3Smrg- gallium: Document behavior of more lowering pipe caps. 17997ec681f3Smrg- freedreno: Add missing dep on freedreno tracepoints. 18007ec681f3Smrg- gallium: Flip the default value of PIPE_CAP_SHAREABLE_SHADERS. 18017ec681f3Smrg- virgl: Drop a context dependency from part of the shader compile path. 18027ec681f3Smrg- v3d/qpu: Avoid leaking memory in the QPU disasm test. 18037ec681f3Smrg- mesa/st: Make sure to unbind cb0 on transition away from gs/tess shaders. 18047ec681f3Smrg- ci: Allow better customization of the name of the artifacts for minio. 18057ec681f3Smrg- ci/freedreno: Add a fractional gles31 run with asan enabled. 18067ec681f3Smrg- ci/freedreno: Drop the "arm64" in front of job names. 18077ec681f3Smrg- ci: Move specific driver testing to separate files in separate dirs. 18087ec681f3Smrg- ci/freedreno: Fix a5xx piglit runs. 18097ec681f3Smrg- ci/freedreno: Remove stray BM_DTB definition. 18107ec681f3Smrg- ci/bare-metal: Use an upstream kernel for db820c. 18117ec681f3Smrg- ci/a5xx: Update the piglit expectations. 18127ec681f3Smrg- ci/a5xx: Increase our dEQP GLES3 fraction by 4x. 18137ec681f3Smrg- ci: Move the dEQP and traces expectations to the per-driver CI dirs. 18147ec681f3Smrg- ci: Move the piglit expectations lists to the per-driver CI dirs. 18157ec681f3Smrg- ci/zink: Add tests of gles2, gles3, and gl33 on lavapipe. 18167ec681f3Smrg- zink: Use mesa_loge() for should-never-be-reached initialization errors. 18177ec681f3Smrg- zink: Remove NULL checks after GET_PROC_ADDR_INSTANCE(). 18187ec681f3Smrg- softpipe: Fix the const buffer overflow check. 18197ec681f3Smrg- mesa: Get the FXT1 compressor/decompressor off of GL types. 18207ec681f3Smrg- mesa: Move the FXT1 compressor/decompressor to util/ 18217ec681f3Smrg- llvmpipe: Enable FXT1 texture decompression. 18227ec681f3Smrg- v3d: Replace driver lowering of GL_CLAMP with mesa/st's. 18237ec681f3Smrg- ci/piglit: Stop including the test counts at the end of expectations. 18247ec681f3Smrg- ci/iris: Move the traces yml file to the driver-specific dir. 18257ec681f3Smrg- mesa: Always make sure uniform storage doesn't get reallocated. 18267ec681f3Smrg- freedreno: Remove uniform variables after finalizing NIR. 18277ec681f3Smrg- freedreno: Drop custom driver lowering of two-sided color. 18287ec681f3Smrg- freedreno: Drop custom driver lowering of GL's color clamping. 18297ec681f3Smrg- freedreno: Use the mesa/st frontend lowering of GL_CLAMP. 18307ec681f3Smrg- freedreno/a5xx+: Stop recompiling on texture samples changes. 18317ec681f3Smrg- freedreno/a5xx+: Drop the unused no_decode_srgb flag. 18327ec681f3Smrg- freedreno/a5xx: Fix cube image load/stores. 18337ec681f3Smrg- nir: Add a nir_src_is_undef() helper, like nir_src_is_const(). 18347ec681f3Smrg- nir/vec_to_movs: Don't generate MOVs for undef channels. 18357ec681f3Smrg- ci: Move deqp-default-skips.txt back to .gitlab-ci/ 18367ec681f3Smrg- ci/lava: Move the per-driver gitlab-ci.yml to each driver. 18377ec681f3Smrg- ci/lava: Move the driver expectation files to the per-driver CI dir. 18387ec681f3Smrg- tgsi_exec: Roll the loops for condmask handling. 18397ec681f3Smrg- tgsi_exec: Jump over entirely non-taken THEN or ELSE branches. 18407ec681f3Smrg- ci/freedreno: Also retest when only CI configuration changes. 18417ec681f3Smrg- ci/freedreno: Switch the fastboot boards to using nfsroot. 18427ec681f3Smrg- ci/a5xx: Run all of gles2 in one job. 18437ec681f3Smrg- ci/a3xx: Run all of GLES3 dEQP. 18447ec681f3Smrg- ci/a5xx: Increase the gles3/31 coverage. 18457ec681f3Smrg- ci/a5xx: Update piglit expectations. 18467ec681f3Smrg- ci/zink: Add another primitive restart flake. 18477ec681f3Smrg- ci/turnip: Mark a flaky WSI test. 18487ec681f3Smrg- lima: stop encoding the texture format in the shader key 18497ec681f3Smrg- lima: don't look at dirty bits for setup of FS key 18507ec681f3Smrg- lima: upload the shader to a BO at shader creation 18517ec681f3Smrg- lima: avoid stomping over bound shader state when creating new shaders 18527ec681f3Smrg- nir-to-tgsi: Fix handling of partial writemasks on SSA/REG decls. 18537ec681f3Smrg- docs: Add some documentation of game GL buffer object mapping behavior. 18547ec681f3Smrg- freedreno/a5xx: Introduce an event write helper like a6xx has. 18557ec681f3Smrg- freedreno/a5xx: Flush depth at the end of sysmem, like a6xx does. 18567ec681f3Smrg- ci/freedreno: Mark another a5xx TF flake. 18577ec681f3Smrg- u_format: Mark the generated pack/unpack src/dst args as restrict. 18587ec681f3Smrg- mesa/st: Unify st_get_vp_variant() and st_get_common_variant(). 18597ec681f3Smrg- mesa/st: Add perf debug for draw-time variant compiles. 18607ec681f3Smrg- mesa/st: Fix precompile misses on compat GL VSes writing to color outputs. 18617ec681f3Smrg- virgl: Update GLES expectations. 18627ec681f3Smrg- ci/freedreno: Add three more a5xx flakes from the last day. 18637ec681f3Smrg- freedreno/a5xx: Fix the texel buffer alignment requirement. 18647ec681f3Smrg- freedreno/a5xx: Fix the max texture buffer size. 18657ec681f3Smrg- ci/panfrost: Disable t860/radeonsi testing while the runners are struggling. 18667ec681f3Smrg- ci: Bump deqp-runner to v0.6.3. 18677ec681f3Smrg- ci/freedreno: Switch the piglit testing to the new piglit runner. 18687ec681f3Smrg- ci/bare-metal: Restart a run on intermittent kernel lockups. 18697ec681f3Smrg- ci/freedreno: Mark an a630 piglit flake from async shader compiling. 18707ec681f3Smrg- ci/freedreno: Mark the rest of the glx_arb_sync_control@timing as flakes. 18717ec681f3Smrg- nir_to_tgsi: Respect PIPE_SHADER_CAP_TGSI_SQRT_SUPPORTED. 18727ec681f3Smrg- freedreno/a5xx: Fix stream-output binning handling. 18737ec681f3Smrg- freedreno/ir3: Demote centroid usage to pixel on non-msaa. 18747ec681f3Smrg- ci/freedreno: Mark all of dEQP TF as flaky. 18757ec681f3Smrg- ci/bare-metal: Move the db820c lockup detect to the right boot script. 18767ec681f3Smrg- ci/freedreno: Mark glx-swap-copy as a flake on a630. 18777ec681f3Smrg- freedreno/a6xx: Rename the RB_BLIT_INFO.INTEGER field to SAMPLE_0. 18787ec681f3Smrg- freedreno/a6xx: Disable sample averaging on depth/stencil resolves. 18797ec681f3Smrg- freedreno: Move the ir3 linked shader cache to the context. 18807ec681f3Smrg- freedreno/a3xx: Switch to using ir3_cache for looking up our VS/FS. 18817ec681f3Smrg- freedreno/a4xx: Switch to using ir3_cache for looking up our VS/FS 18827ec681f3Smrg- freedreno/a5xx: Switch to using ir3_cache for looking up our VS/FS 18837ec681f3Smrg- turnip: Fix KGSL build since common dispatch rework. 18847ec681f3Smrg- broadcom: Disbale CLIF dumping when libexpat isn't available. 18857ec681f3Smrg- ci/android: Make sure we don't detect system libexpat. 18867ec681f3Smrg- ci/android: Build the v3dv driver. 18877ec681f3Smrg- iris: Flag for resolves when stencil enable changes, too. 18887ec681f3Smrg- freedreno: Assert that TF prims generated can ignore active_queries. 18897ec681f3Smrg- freedreno/ir3: Move max-tf-vtx calculation to a .c file 18907ec681f3Smrg- freedreno: Move max-tf-vtx calculation to just the HW that needs it. 18917ec681f3Smrg- freedreno: Move the primitives generated/written updates after the draw. 18927ec681f3Smrg- freedreno: Don't count SW TF queries on a6xx. 18937ec681f3Smrg- freedreno: Clamp TF prims written to buffer size pre-a6xx. 18947ec681f3Smrg- ci/freedreno: Mark a630 as flaky on arb_draw_indirect-transform-feedback 18957ec681f3Smrg- nir: Update clip_distance_array_size in clip lowering. 18967ec681f3Smrg- freedreno/a6xx: Use the frontend userclip lowering. 18977ec681f3Smrg- freedreno/a5xx: Add support for clip distances and use them for userclip. 18987ec681f3Smrg- freedreno/a5xx: Use VALIDREG/CONDREG like a6xx do. 18997ec681f3Smrg- ci/freedreno: Demote a630-asan to a manual test for now. 19007ec681f3Smrg- ci: Drop the custom db820c kernel/dtb from the kernel+rootfs. 19017ec681f3Smrg- ci/freedreno: Add more new traces for a630 (minetest, TDM, pioneer, glyphy). 19027ec681f3Smrg- ci/freedreno: Rename a306-test and a530-test to drop "arm64" from the name. 19037ec681f3Smrg- ci/freedreno: Add trace testing on a3xx, a5xx. 19047ec681f3Smrg- freedreno/a5xx: Fix alpha test vs early Z bugs. 19057ec681f3Smrg- freedreno/a6xx: Fix alpha tests. 19067ec681f3Smrg- ci/freedreno: Switch to the trimmed glxgears trace. 19077ec681f3Smrg- ci/freedreno: Fix up the a5xx border color flake annotation. 19087ec681f3Smrg- ci: Uprev deqp runner to 0.6.5. 19097ec681f3Smrg- ci: Uprev piglit to 6a4be9e9946d ("piglit: NOTE! Default branch is now main") 19107ec681f3Smrg- ci: bump bare-metal kernel to bring in an a530 stability fix 19117ec681f3Smrg- freedreno: Fix YUV sampler regression. 19127ec681f3Smrg- nir_to_tgsi: Use ARL instead of UARL in the !native_integers case. 19137ec681f3Smrg- nir: Generate load_ubo_vec4 directly for !PIPE_CAP_NATIVE_INTEGERS 19147ec681f3Smrg- freedreno/a6xx: Don't try to do Z-as-RGBA blits for mismatched formats. 19157ec681f3Smrg- ci/virgl: Mark a couple of new Crash tests as flakes. 19167ec681f3Smrg 19177ec681f3SmrgEric Engestrom (12): 19187ec681f3Smrg 19197ec681f3Smrg- VERSION: bump to 21.1.0-rc1 19207ec681f3Smrg- .pick_status.json: Update to c74d93cf0187e07cdfacc448a947a8cae485eb41 19217ec681f3Smrg- .pick_status.json: Update to 95d9d811c91076d50385b2fbd330335b68688c69 19227ec681f3Smrg- .pick_status.json: Update to fcb5ba58165cd407408f8dd9a102f0c5e16a9956 19237ec681f3Smrg- VERSION: bump for 21.1.0-rc2 19247ec681f3Smrg- .pick_status.json: Mark 8acf361db4190aa5f7c788019d1e42d1df031b81 as denominated 19257ec681f3Smrg- .pick_status.json: Update to 35a28e038107410bb6a733c51cbd267aa79a4b20 19267ec681f3Smrg- .pick_status.json: Update to 7e905bd00f32b4fa48689a8e6266b145662cfc48 19277ec681f3Smrg- .pick_status.json: Update to 72eca47c660b6c6051be5a5a80660ae765ecbaa5 19287ec681f3Smrg- .pick_status.json: Update to f3d2fade82c168a7ffffa4bd7bf22585c45c711b 19297ec681f3Smrg- .pick_status.json: Update to f5d6a1b916fb163ee72e6a6f356937b1fbac53e0 19307ec681f3Smrg- .pick_status.json: Update to 1d418e79b8a0f4270775277b7115b88ac4c77113 19317ec681f3Smrg 19327ec681f3SmrgErico Nunes (15): 19337ec681f3Smrg 19347ec681f3Smrg- lima: introduce fs and vs shader cache 19357ec681f3Smrg- lima/ppir: fix creation of mov node for non-ssa tex dest 19367ec681f3Smrg- lima: set yuv formats as external_only 19377ec681f3Smrg- lima: enable r and rg pixel formats again 19387ec681f3Smrg- lima: always set stride in texture descriptor 19397ec681f3Smrg- lima: implement GL_EXT_texture_swizzle 19407ec681f3Smrg- docs/features: add lima features 19417ec681f3Smrg- lima: fix max sampler views 19427ec681f3Smrg- lima: run nir dce after nir_lower_vec_to_movs 19437ec681f3Smrg- lima/ppir: remove liveness info from blocks 19447ec681f3Smrg- lima/ppir: remove use of live_out 19457ec681f3Smrg- lima/ppir: rework liveness data structures to bitset 19467ec681f3Smrg- lima: fix half float render 19477ec681f3Smrg- lima: enable rg formats for fp16 render 19487ec681f3Smrg- lima: increase epsilon for depthrange near == far 19497ec681f3Smrg 19507ec681f3SmrgErik Faye-Lund (158): 19517ec681f3Smrg 19527ec681f3Smrg- zink: handle NULL views in zink_set_sampler_views 19537ec681f3Smrg- zink: fix vertex-stride wrangling 19547ec681f3Smrg- docs: fix sphinx-warnings due to lacking escaping 19557ec681f3Smrg- docs: fix broken link 19567ec681f3Smrg- docs: turn non-code into comment 19577ec681f3Smrg- docs/features: add missing features for zink 19587ec681f3Smrg- docs/features: remove a few redundant zink mentions 19597ec681f3Smrg- zink: always expose linear float textures 19607ec681f3Smrg- zink: respect feature-cap for robust buffer access 19617ec681f3Smrg- zink: respect feature-cap for independent blending 19627ec681f3Smrg- zink: respect feature-cap for sample-shading 19637ec681f3Smrg- zink: respect feature-cap for multi-draw indirect 19647ec681f3Smrg- zink: check for extension instead of function 19657ec681f3Smrg- zink: require vulkan memory model for tesselation 19667ec681f3Smrg- zink: make all xfb caps depend on extension 19677ec681f3Smrg- zink: respect fragment-shader depth-layout 19687ec681f3Smrg- zink: clone shader before lowering clip_halfz 19697ec681f3Smrg- docs/zink: add missing colon 19707ec681f3Smrg- docs/zink: add two missing required features 19717ec681f3Smrg- docs/zink: document the independentBlend requirement for GL3 19727ec681f3Smrg- docs/zink: fix phrasing of GL 3.3 requirements 19737ec681f3Smrg- docs/zink: add GL 4.0 requirements 19747ec681f3Smrg- docs/zink: add GL 4.1 requirements 19757ec681f3Smrg- docs/zink: add GL 4.2 requirements 19767ec681f3Smrg- docs/features: mark off two more extensions for zink 19777ec681f3Smrg- docs/zink: correct vk version for GL 4.2 19787ec681f3Smrg- mesa/main: remove leftover bumpmap code 19797ec681f3Smrg- compiler/nir: add texcoord replace lowering pass 19807ec681f3Smrg- gallium/st: lower point-sprites if not supported 19817ec681f3Smrg- zink: request texcoord replace lowering 19827ec681f3Smrg- docs/features: mark ssbos as done for zink 19837ec681f3Smrg- zink: remove stale TODO 19847ec681f3Smrg- zink: be more careful about limits when unsupported 19857ec681f3Smrg- zink: correct return-type for function 19867ec681f3Smrg- zink: only emit SpvCapabilityDerivativeControl when needed 19877ec681f3Smrg- zink: only emit cap when needed 19887ec681f3Smrg- zik: correct spir-v caps for textures and images 19897ec681f3Smrg- zink: do not insist shaders come from glsl 19907ec681f3Smrg- zink: add a get_primitive_mode-helper 19917ec681f3Smrg- zink: add a get_spacing-helper 19927ec681f3Smrg- zink: refactor vertex-order emitting 19937ec681f3Smrg- zink: wrap some long lines 19947ec681f3Smrg- docs: fix invalid rst syntax 19957ec681f3Smrg- zink: check for error when calling vkEnumeratePhysicalDevices 19967ec681f3Smrg- zink: explicitly check for VK_NULL_HANDLE 19977ec681f3Smrg- zink: support using lavapipe 19987ec681f3Smrg- CI: always expose docs artifacts 19997ec681f3Smrg- ci: make sure all lava-builders have libvulkan 20007ec681f3Smrg- ci: run piglit on zink with lavapipe 20017ec681f3Smrg- lavapipe: report correct value for minMemoryMapAlignment 20027ec681f3Smrg- ci: document arm oddity in build-rules 20037ec681f3Smrg- zink: correctly handle 64 valid timestamp bits 20047ec681f3Smrg- zink: enable excluded test 20057ec681f3Smrg- ci: enable max texture size tests for zink 20067ec681f3Smrg- lavapipe: handle null-buffers for xfb 20077ec681f3Smrg- ci: disable sporadically failing test 20087ec681f3Smrg- zink: drop extra set of parens 20097ec681f3Smrg- zink: do not use extra staging resource unless needed 20107ec681f3Smrg- zink: don't always require linear display-targets 20117ec681f3Smrg- zink: limit host-visible bind-flags 20127ec681f3Smrg- zink: ignore irrelevant bind-flags 20137ec681f3Smrg- zink: use gallium api to copy to display-target 20147ec681f3Smrg- zink: add X32_S8X24 format 20157ec681f3Smrg- zink: correct inaccurate comment 20167ec681f3Smrg- lavapipe: fix primitive-restart for uint8 indices 20177ec681f3Smrg- zink: fix emulation of no mipfilter 20187ec681f3Smrg- zink: fix free of ralloced pointer 20197ec681f3Smrg- gallium/st: fix shader_has_one_variant 20207ec681f3Smrg- gallium/st: fix shader_has_one_variant 20217ec681f3Smrg- gallium/st: reserve space in default uniform block for lowered constants 20227ec681f3Smrg- docs: remove stray newline 20237ec681f3Smrg- docs: remove excessive wrapping 20247ec681f3Smrg- docs: remove excessive quoting 20257ec681f3Smrg- docs: document zink GL 4.3 requirements 20267ec681f3Smrg- docs: document zink GL 4.4 requirements 20277ec681f3Smrg- docs: document zink GL 4.5 requirements 20287ec681f3Smrg- docs: document zink GL 4.6 requirements 20297ec681f3Smrg- docs: simplify format requirements 20307ec681f3Smrg- zink: factor out interpolation to helper 20317ec681f3Smrg- zink: emit all interpolation modes 20327ec681f3Smrg- zink: check for pipeline statistics feature 20337ec681f3Smrg- zink: check for depth-bias-clamp feature 20347ec681f3Smrg- zink: check for stores and atomics features 20357ec681f3Smrg- zink: add missing required feature 20367ec681f3Smrg- zink: check for mirror-clamp extension 20377ec681f3Smrg- zink: fix vector comparison 20387ec681f3Smrg- zink: drop bool attempt in float vector compares 20397ec681f3Smrg- zink: do not open-code vector-compares 20407ec681f3Smrg- zink: follow spir-v 1.0 spec 20417ec681f3Smrg- docs: Add 21.0.0 hashes 20427ec681f3Smrg- zink: tighten emitted image spir-v caps 20437ec681f3Smrg- zink: remove no-longer-needed clipdist1 patching 20447ec681f3Smrg- frontends/va: correct check for invalid format 20457ec681f3Smrg- zink: handle errors in nir_to_spirv 20467ec681f3Smrg- zink: pre-populate locations in variables 20477ec681f3Smrg- zink: do not depend on shader_slots_reserved for xfb 20487ec681f3Smrg- zink: use pre-populated shader-locations 20497ec681f3Smrg- lavapipe: report correct value for maxTexelBufferElements 20507ec681f3Smrg- docs: do not try to copy missing file 20517ec681f3Smrg- compiler/glsl: avoid null-pointer deref 20527ec681f3Smrg- docs: remove bogus zink-requirement 20537ec681f3Smrg- docs: remove zink incorrect requirement 20547ec681f3Smrg- zink: do not enable unused extension 20557ec681f3Smrg- docs: clarify VK_KHR_external_memory requirement 20567ec681f3Smrg- zink: check base-requirements 20577ec681f3Smrg- zink: assert that pstage is within range 20587ec681f3Smrg- zink: simplify shader-removal 20597ec681f3Smrg- zink: document why we're calling pipe_shader_type_from_mesa 20607ec681f3Smrg- docs: appling -> applying 20617ec681f3Smrg- docs: sytem -> system 20627ec681f3Smrg- docs: ie. -> i.e. 20637ec681f3Smrg- docs: vulkan -> Vulkan 20647ec681f3Smrg- zink: do not request scoped memory barriers 20657ec681f3Smrg- docs: optimisation -> optimization 20667ec681f3Smrg- docs: opencl -> OpenCL 20677ec681f3Smrg- docs: Xorg -> X.Org 20687ec681f3Smrg- docs: nops -> NOPs 20697ec681f3Smrg- docs: lod -> LOD 20707ec681f3Smrg- docs: lex / yacc -> Lex / Yacc 20717ec681f3Smrg- docs: dfsm -> DFSM 20727ec681f3Smrg- docs: fix incorrect possessive form 20737ec681f3Smrg- docs: fix invalid rst 20747ec681f3Smrg- docs: fix rst-quoting issues in release-notes 20757ec681f3Smrg- docs: spell out full name of gitlab instance 20767ec681f3Smrg- docs: spell out development 20777ec681f3Smrg- docs: spell out environment 20787ec681f3Smrg- docs: spell out freedesktop.org 20797ec681f3Smrg- docs: no-op'd -> disabled 20807ec681f3Smrg- docs: fix release notes for 20.3.5 20817ec681f3Smrg- ci: turn sphinx-build warnings into errors 20827ec681f3Smrg- bin/gen_release_notes.py: more robust rST escaping 20837ec681f3Smrg- compiler/glsl: correct the number of string-arguments 20847ec681f3Smrg- compiler/glsl: fix volatile string 20857ec681f3Smrg- compiler/glsl: clean up output 20867ec681f3Smrg- glsl: fix is_integer_16_32 20877ec681f3Smrg- glsl: fix int16 type 20887ec681f3Smrg- glsl: tolerate int16 loop counters 20897ec681f3Smrg- gallium/st: correct range for float16 20907ec681f3Smrg- gallium/st: correct range for int16 20917ec681f3Smrg- zink: document scalarBlockLayout requirement 20927ec681f3Smrg- zink: fix typo in function name 20937ec681f3Smrg- compiler/glsl: drop rogue argument to _mesa_glsl_error 20947ec681f3Smrg- compiler/glsl: do not cast struct to string 20957ec681f3Smrg- lavapipe: do not subtract 8 from enum 20967ec681f3Smrg- lavapipe: check all vertex-stages 20977ec681f3Smrg- lavapipe: check all graphics stages 20987ec681f3Smrg- lavapipe: ask pipe-driver for int16 support 20997ec681f3Smrg- zink: do not clear on cpu 21007ec681f3Smrg- zink: fall back from cached to non-cached memory 21017ec681f3Smrg- zink: do not dereference NULL pointer 21027ec681f3Smrg- zink: verify that src/dst support blitting 21037ec681f3Smrg- zink: verify that source-format support linear-filter 21047ec681f3Smrg- zink: fix stencil-export cap emission 21057ec681f3Smrg- gallivm: handle 16-bit input in i2b32 21067ec681f3Smrg- zink: do not read outside of array 21077ec681f3Smrg- zink: do not require vulkan memory model for shader-images 21087ec681f3Smrg- zink: correct image cap checks 21097ec681f3Smrg- zink: fix shader-image requirements 21107ec681f3Smrg 21117ec681f3SmrgFan Yugang (1): 21127ec681f3Smrg 21137ec681f3Smrg- intel/tools: Show unknown instructions in decoded state. 21147ec681f3Smrg 21157ec681f3SmrgFrancisco Jerez (9): 21167ec681f3Smrg 21177ec681f3Smrg- intel/gen12: Fix memory corruption issues in fused Gen12 parts. 21187ec681f3Smrg- intel/genxml: Fix pixel hashing 3DSTATE_3D_MODE field definitions for Gen12 and Gen12.5. 21197ec681f3Smrg- intel/genxml: Define 3DSTATE_SUBSLICE_HASH_TABLE command for Gen12 and Gen12.5. 21207ec681f3Smrg- intel/dev: Implement pixel pipe subslice counting for Gen12+. 21217ec681f3Smrg- iris/gen11+: Calculate pixel hashing tables instead of hardcoding. 21227ec681f3Smrg- iris/gen12: Implement programming of pixel pipe hashing tables. 21237ec681f3Smrg- anv/gen11+: Calculate pixel hashing tables instead of hardcoding. 21247ec681f3Smrg- anv/gen12: Implement programming of pixel pipe hashing tables. 21257ec681f3Smrg- iris/gen12: Work around push constant corruption on context switch. 21267ec681f3Smrg 21277ec681f3SmrgGeorg Lehmann (1): 21287ec681f3Smrg 21297ec681f3Smrg- vulkan/device_select: Only call vkGetPhysicalDeviceProperties2 if the device supports it. 21307ec681f3Smrg 21317ec681f3SmrgGert Wollny (89): 21327ec681f3Smrg 21337ec681f3Smrg- r600/nir: clone shader before first query to shader key 21347ec681f3Smrg- r600/sfn: fix use of b32all/and 21357ec681f3Smrg- r600: Add flags to INTERP_X and INTERP_Z two-slot ops 21367ec681f3Smrg- r600/sb: Add support for INTERP_X and INTERP_Z ops 21377ec681f3Smrg- r600/nir: pass array info to r600_shader for sb 21387ec681f3Smrg- r600/sfn: update shader array info 21397ec681f3Smrg- r600/sfn: Keep array registers alive for the whole shader 21407ec681f3Smrg- r600/sb: fix boundary assert for mem-instruction decoding 21417ec681f3Smrg- r600/sb: fall back to un-optimized byte code when ra_init fails 21427ec681f3Smrg- r600: Enable sb also for NIR 21437ec681f3Smrg- compiler/nir: Add support for lowering stores with nir_lower_instruction 21447ec681f3Smrg- r600/sfn: Fix use of cnde_int for bcsel 21457ec681f3Smrg- r600/sfn: Set unnormalized flag for z-coordinate when fetching from array 21467ec681f3Smrg- r600/sfn: Add the position input as varying 21477ec681f3Smrg- r600/sfn: Fix FS inputs when reading from the same position 21487ec681f3Smrg- r600/sfn: Fix dual source blend lowered to FRAG_DATA 21497ec681f3Smrg- r600/sfn: Use the constant buffer ID when given 21507ec681f3Smrg- gallium/tgsi_to_nir: Handle SAMPLE_MASK output in FS 21517ec681f3Smrg- gallium/tgsi-to-nir: Take property NUM_CLIPDIST_ENABLED into account 21527ec681f3Smrg- r600/sfn: Handle memory_barrier_atomic_counters 21537ec681f3Smrg- r600/sfn: Fix indirect_file flag for IMAGES 21547ec681f3Smrg- r600/sfn: remove duplicate barriers 21557ec681f3Smrg- r600/sfn: Base instr lowering class on nir_lower_instruction code 21567ec681f3Smrg- nir: Add flag to tex instruction to indicate lowering cube to array 21577ec681f3Smrg- nir: Add r600 specific CUBE opcode to evaluate cube texture coords and face 21587ec681f3Smrg- r600/sfn: Add support for cube_r600 instruction 21597ec681f3Smrg- r600/sfn: add lowering pass for cube textures 21607ec681f3Smrg- r600/sfn: fix gather with cube lowering 21617ec681f3Smrg- r600/sfn: use lower bool to int32 and lower int_tg4 only on shader clone 21627ec681f3Smrg- r600/sfn: use lowering pass for cube textures 21637ec681f3Smrg- r600/sfn: remove old cube texturing code 21647ec681f3Smrg- r600/sfn: Lower FS inputs to temps late and, and lower interpolate at 21657ec681f3Smrg- r600/sfn: set info about using helper_invocation to skip sb 21667ec681f3Smrg- r600/sfn: lower isign and iabs in nir 21677ec681f3Smrg- r600/sfn: Allow any channel for the helper invocation evaluation 21687ec681f3Smrg- r600: unify nir shader options evaluation 21697ec681f3Smrg- r600/sfn: remove code for nir_op_fsign since it is lowered 21707ec681f3Smrg- r600/sfn: remove unused emit_alu_op2_split_src_mods 21717ec681f3Smrg- r600/sfn: remove some old debug output 21727ec681f3Smrg- r600/sfn: encode component in address for local IO 21737ec681f3Smrg- nir: disaallow reordering for r600 shared load and remove component field 21747ec681f3Smrg- r600/sfn: handle querying the number of layers in cube arrays 21757ec681f3Smrg- r600/sfn: Fix loading TES gl_PatchVerticesIn 21767ec681f3Smrg- r600: Don't optimize using source modifiers on literals 21777ec681f3Smrg- r600: Enable GLSL 450 for nir shaders. 21787ec681f3Smrg- r600/sfn: Update status 21797ec681f3Smrg- nir: Add r600 specific intrinsic for loading the tesselation coords 21807ec681f3Smrg- r600/sfn: lower intrinsic_load_tess_coord to driver version 21817ec681f3Smrg- r600/sfn: eliminate loading unused component loads from shared memory 21827ec681f3Smrg- virgl: implement support for PIPE_CAP_STRING_MARKER 21837ec681f3Smrg- r600/sfn: sort alu opcodes in switch statememt 21847ec681f3Smrg- r600/sfn: remove unused code 21857ec681f3Smrg- r600/sfn: fix buffer offset for ssbo writes 21867ec681f3Smrg- r600/sfn: Fix including/not including c++ parts of header 21877ec681f3Smrg- r600/sfn: lower bitfield_extract and bitfield_insert in NIR 21887ec681f3Smrg- r600/sfn: lower idiv, imod, etc in nir 21897ec681f3Smrg- r600/sfn remove some leftover debug output 21907ec681f3Smrg- nir: add opcodes for \*find_msb_rev and lowering 21917ec681f3Smrg- nir: Add opcodes for fused comp + csel and optimizations 21927ec681f3Smrg- nir: Add r600 specific sin and cos variants 21937ec681f3Smrg- r600/sfn: Add algebraic lowering for fsin and fcos 21947ec681f3Smrg- r600/sfn: optimize comp+csel using fused ops 21957ec681f3Smrg- r600/sfn: lower find_msb variants to find_msb_rev 21967ec681f3Smrg- r60/sfn: don't lower scomp 21977ec681f3Smrg- r600: Handle negate of second operator in TGSI_OPCODE_UADD 21987ec681f3Smrg- r600/sfn: Make some value pool functions private 21997ec681f3Smrg- r600/sfn: Add skelton for visitor pattern 22007ec681f3Smrg- r600/sfn: remove extra parameter from alu assemebly emission 22017ec681f3Smrg- r600/sfn: fix some formatting 22027ec681f3Smrg- r600/sfn: switch assembler creation to use visitor 22037ec681f3Smrg- r600/sfn: drop the local register mep 22047ec681f3Smrg- r600/sfn: lower VS IO and drop old deref code 22057ec681f3Smrg- r600/sfn: lowered FS output IO 22067ec681f3Smrg- r600/sfn: remove old deref code 22077ec681f3Smrg- r600/sfn: force dual source blend output handling in some cases 22087ec681f3Smrg- r600/sfn: remove find_msb lowering in driver 22097ec681f3Smrg- r600/sfn: clean up multi-sample texture load 22107ec681f3Smrg- r600/sfn: clean up value pool interface usage in emit_stream 22117ec681f3Smrg- r600/sfn: use get_temp_vec4 directly when fetching 22127ec681f3Smrg- r600/sfn: make allocate_temp_register private to valuepool 22137ec681f3Smrg- r600: Fix texture buffer offset alignment 22147ec681f3Smrg- r600: don't set an index_bias for indirect draw calls 22157ec681f3Smrg- r600: Don't advertise support for scaled int16 vertex formats 22167ec681f3Smrg- r600/sfn: allocate number of VS outputs based on max driver_location 22177ec681f3Smrg- r600/sfn: Only fetch the constants that are needed in fdd* 22187ec681f3Smrg- r600: Enable sb for nir only on specific request 22197ec681f3Smrg- Revert "r600: don't set an index_bias for indirect draw calls" 22207ec681f3Smrg- Revert "r600: Don't advertise support for scaled int16 vertex formats" 22217ec681f3Smrg- r600: don't set an index_bias for indirect draw calls 22227ec681f3Smrg 22237ec681f3SmrgGiovanni Mascellani (2): 22247ec681f3Smrg 22257ec681f3Smrg- disk_cache: Fail creation when cannot inizialize queue. 22267ec681f3Smrg- anv: Allow null handle in DestroyDescriptorUpdateTemplate. 22277ec681f3Smrg 22287ec681f3SmrgGreg V (1): 22297ec681f3Smrg 22307ec681f3Smrg- meson: Add missing lavapipe dep 22317ec681f3Smrg 22327ec681f3SmrgGustavo Padovan (4): 22337ec681f3Smrg 22347ec681f3Smrg- gitlab-ci: extend x86_64 kernel config to suport Intel devices 22357ec681f3Smrg- gitlab-ci: build the iris gallium driver as well 22367ec681f3Smrg- gitlab-ci: add intel APL and GLK devices with manual triggers 22377ec681f3Smrg- traces-iris: fix expectation for Intel GLK 22387ec681f3Smrg 22397ec681f3SmrgHans-Kristian Arntzen (2): 22407ec681f3Smrg 22417ec681f3Smrg- radv: Take image alignment into account when allocating MUTABLE pool. 22427ec681f3Smrg- radv: Allocate buffer list for MUTABLE descriptor types as well. 22437ec681f3Smrg 22447ec681f3SmrgHoe Hao Cheng (19): 22457ec681f3Smrg 22467ec681f3Smrg- zink/codegen: add some new attributes to Extension 22477ec681f3Smrg- zink/codegen: make 'struct' optional in Version 22487ec681f3Smrg- zink/codegen: enable instance extension unconditionally if promoted 22497ec681f3Smrg- zink/codegen: codegen-ize load_instance_extensions() 22507ec681f3Smrg- zink/codegen: adding/fixing comments and copyright notice 22517ec681f3Smrg- zink/codegen: find promotion version using vulkan registry 22527ec681f3Smrg- zink: remove excessive checks for loader version 22537ec681f3Smrg- zink: expose PIPE_CAP_ANISOTROPIC_FILTER 22547ec681f3Smrg- zink: VK_KHR_draw_indirect_count is a device extension 22557ec681f3Smrg- zink/codegen: introduce ExtensionRegistry 22567ec681f3Smrg- zink/codegen: more validation in zink_instance 22577ec681f3Smrg- zink/codegen: introduce notion of non-standard extensions 22587ec681f3Smrg- zink/codegen: make zink_device_info accept vk.xml 22597ec681f3Smrg- zink/codegen: perform basic validation in zink_device_info 22607ec681f3Smrg- zink/codegen: validate has_properties and has_features 22617ec681f3Smrg- zink/codegen: fix type annotations 22627ec681f3Smrg- zink/codegen: do not enable extensions that are now core 22637ec681f3Smrg- zink: enable KHR_shader_draw_parameters on Vulkan <1.2 22647ec681f3Smrg- zink: fix detection of KHR_maintenance1/2 22657ec681f3Smrg 22667ec681f3SmrgHyunjun Ko (5): 22677ec681f3Smrg 22687ec681f3Smrg- turnip: Return correct value of tu6_load_state_size 22697ec681f3Smrg- nir: Set access at lower_ubo_vec4 22707ec681f3Smrg- ir3: fix has_src() to return correctly in ir3_nir_lower_tex_prefetch 22717ec681f3Smrg- ir3: Add nonuniform encodings to ir3 encoder and parser 22727ec681f3Smrg- turnip: Enable nonuniform descriptor indexing 22737ec681f3Smrg 22747ec681f3SmrgIago Toral Quiroga (93): 22757ec681f3Smrg 22767ec681f3Smrg- v3dv: only update uniforms for dirty descriptors if stage has descriptors 22777ec681f3Smrg- v3dv: disable early Z writes if Z writes are disabled 22787ec681f3Smrg- v3dv: don't wait for idle on occlusion query pool resets 22797ec681f3Smrg- v3dv: use PIPE_TIMEOUT_INFINITE 22807ec681f3Smrg- v3dv: refactor checks for subpass attachment clears 22817ec681f3Smrg- v3dv: refactor checks for subpass attachment loading 22827ec681f3Smrg- v3dv: refactor checks for subpass attachment stores 22837ec681f3Smrg- v3dv: do not emit full tile buffers clears to handle Z/S clears 22847ec681f3Smrg- v3dv: enable early Z/S clears 22857ec681f3Smrg- v3dv: fix disabling Early Z for the whole frame 22867ec681f3Smrg- broadcom/compiler: fix indentation with TABs 22877ec681f3Smrg- broadcom/compiler: prepare TMU spilling code to account for TMU pipelining 22887ec681f3Smrg- broadcom/compiler: implement pipelining for general TMU operations 22897ec681f3Smrg- broadcom/compiler: support pipelining of tex instructions 22907ec681f3Smrg- broadcom/compiler: refactor image load/store TMU emission code 22917ec681f3Smrg- broadcom/compiler: support pipelining of image load/store instructions 22927ec681f3Smrg- broadcom/compiler: disable TMU pipelining if we fail to register allocate 22937ec681f3Smrg- broadcom/compiler: disallow spilling if TMU pipelining was enabled 22947ec681f3Smrg- broadcom/compiler: log spilling shaders to perf output 22957ec681f3Smrg- broadcom/compiler: let QPUs stall on TMU input/config overflows 22967ec681f3Smrg- v3dv: handle D/S buffer to image copies with the texel buffer path 22977ec681f3Smrg- v3dv: batch copies in the copy_buffer_to_image_blit path 22987ec681f3Smrg- v3dv: allow a component swizzle in copy_buffer_to_image_shader 22997ec681f3Smrg- v3d/common: use spaces instead of TABs 23007ec681f3Smrg- v3dv: serialize pipeline compilation when debugging shaders 23017ec681f3Smrg- v3dv: add a perf trace when a device is created with robust buffer access 23027ec681f3Smrg- v3d/compiler: fix QPU scheduler TMU sequence shuffling 23037ec681f3Smrg- broadcom/compiler: add V3D_QPU_WADDR_UNIFA 23047ec681f3Smrg- broadcom/compiler: pass a devinfo to check if an instruction writes to TMU 23057ec681f3Smrg- broadcom/compiler: name registers correctly based on V3D version 23067ec681f3Smrg- broadcom/compiler: don't check for GFXH-1633 on V3D 4.2.x 23077ec681f3Smrg- broadcom/compiler: add a helper to check if an instruction writes unifa 23087ec681f3Smrg- broadcom/compiler: disallow unifa overlap with thread switch/end 23097ec681f3Smrg- broadcom/compiler: preserve ordering of unifa/ldunifa sequences 23107ec681f3Smrg- broadcom/compiler: ensure 3-slot delay between unifa and ldunifa 23117ec681f3Smrg- broadcom/compiler: disallow reading two uniforms in the same instruction 23127ec681f3Smrg- broadcom/compiler: do not DCE ldunifa 23137ec681f3Smrg- broadcom/compiler: emit ldunifarf when needed 23147ec681f3Smrg- broadcom/compiler: use unifa for UBO loads from uniform addresses 23157ec681f3Smrg- broadcom/compiler: don't emit redundant ldunif 23167ec681f3Smrg- broadcom/compiler: use a helper function to decide on TMU spilling 23177ec681f3Smrg- broadcom/compiler: don't dump shader-db stats for failed shaders 23187ec681f3Smrg- broadcom/compiler: fix ldunif optimization 23197ec681f3Smrg- broadcom/compiler: allow dead code elimination of unused trailing ldunifa 23207ec681f3Smrg- broadcom/compiler: remove unused leading ldunifa 23217ec681f3Smrg- broadcom/compiler: add a constant alu optimization pass 23227ec681f3Smrg- broadcom/compiler: skip unnecessary unifa writes 23237ec681f3Smrg- broadcom/compiler: use nir_opt_sink 23247ec681f3Smrg- v3dv: fix branching to large secondaries with more than one BCL buffer. 23257ec681f3Smrg- broadcom/compiler: fix DAG pre-remove for merged instructions 23267ec681f3Smrg- broadcom/compiler: fix indentation style 23277ec681f3Smrg- broadcom/compiler: track pipelineable ldvary sequences 23287ec681f3Smrg- broadcom/compiler: pipeline smooth ldvary sequences 23297ec681f3Smrg- broadcom/compiler: allow pipelining of flat and noperspective varyings 23307ec681f3Smrg- broadcom/compiler: ldvary pipelining tracking and documentation clean-ups 23317ec681f3Smrg- broadcom/compiler: drop the destination for unused ldunifa 23327ec681f3Smrg- broadcom/compiler: be more aggressive skipping unifa writes 23337ec681f3Smrg- broadcom/compiler: always restart ldvary pipelining when scheduling ldvary 23347ec681f3Smrg- broadcom/compiler: ldvary doesn't implicitly write to r3 since V3D 4.1 23357ec681f3Smrg- broadcom/compiler: fix flags check for ldvary merge 23367ec681f3Smrg- broadcom/compiler: add an additional sanity check assert to the ldvary fixup 23377ec681f3Smrg- broadcom/compiler: move code block around 23387ec681f3Smrg- broadcom/compiler: simplify ldvary pipelining 23397ec681f3Smrg- broadcom/compiler: disallow ldunif during ldvary sequences if possible 23407ec681f3Smrg- v3dv: call util_cpu_detect() when initializing the instance 23417ec681f3Smrg- broadcom/compiler: flag wrtmuc with a read dependency on last_tmu_config 23427ec681f3Smrg- broadcom/compiler: be more flexible scheduling TMU writes 23437ec681f3Smrg- vulkan/util: call glsl_type_singleton_init_or_ref from vk_instance_init 23447ec681f3Smrg- compiler/glsl: call util_cpu_detect from glsl_type_singleton_init_or_ref 23457ec681f3Smrg- broadcom/compiler: fix end of tmu sequence detection 23467ec681f3Smrg- broadcom/compiler: use nir_opt_load_store_vectorize 23477ec681f3Smrg- broadcom/compiler: use nir_lower_wrmasks to simplify TMU general stores 23487ec681f3Smrg- broadcom/compiler: handle implicit uniform loads when optimizing constant alu 23497ec681f3Smrg- broadcom/compiler: optimize constant vfpack 23507ec681f3Smrg- broadcom/compiler: use nir_lower_undef_to_zero 23517ec681f3Smrg- v3dv/pipeline_cache: fix assert 23527ec681f3Smrg- broadcom/compiler: convert add to mul when possible to allow merge 23537ec681f3Smrg- broadcom/compiler: add a v3d_qpu_writes_accum helper 23547ec681f3Smrg- broadcom/compiler: try to fill up delay slots after a thrsw 23557ec681f3Smrg- broadcom/compiler: flag TMU read dependencies against last TMU config 23567ec681f3Smrg- broadcom/compiler: flag TMU reads with a read dependency on last TMU config 23577ec681f3Smrg- broadcom/compiler: dump instruction index when failing to pack instructions 23587ec681f3Smrg- broadcom/compiler: add a NOP count stat to shader-db 23597ec681f3Smrg- broadcom/compiler: try to fill up delay slots after a branch instruction 23607ec681f3Smrg- broadcom/compiler: try to fill up delay slots after unconditional branch 23617ec681f3Smrg- broadcom/compiler: implement restriction for branch after setmsf 23627ec681f3Smrg- broadcom/compiler: optimize branch emission for uniform break/continue 23637ec681f3Smrg- v3dv: fix index buffer binding 23647ec681f3Smrg- broadcom/compiler: add a definition for the unifa skip distance 23657ec681f3Smrg- broadcom/compiler: allow compilation strategies to limit minimum thread count 23667ec681f3Smrg- broadcom/compiler: sort constant UBO loads by index and offset 23677ec681f3Smrg- broadcom/compiler: rename unifa tracking fields 23687ec681f3Smrg- v3dv: fix descriptor set limits 23697ec681f3Smrg 23707ec681f3SmrgIan Romanick (33): 23717ec681f3Smrg 23727ec681f3Smrg- i965: Don't advertise OpenGL 3.3+ if driconf disables GL_ARB_blend_func_extended 23737ec681f3Smrg- i965: Use allow_higher_compat_version option during screen initialization 23747ec681f3Smrg- i965: Don't parse driconf again 23757ec681f3Smrg- nir/algebraic: Fix a >> #b << #b for sizes other than 32-bit 23767ec681f3Smrg- nir/algebraic: add patterns for a >> #b << #b and a << #b >> #b 23777ec681f3Smrg- nir/algebraic: Partially revert 3f782cdd2591 23787ec681f3Smrg- intel/eu/validate: Add some checks for CMP and CMPN 23797ec681f3Smrg- intel/compiler: Enable the ability to emit CMPN instructions 23807ec681f3Smrg- intel/compiler: Make the CMPN builder work like the CMP builder 23817ec681f3Smrg- intel/compiler: Use CMPN for min / max on Gen4 and Gen5 23827ec681f3Smrg- nir/algebraic: Fix some min/max of b2f replacements 23837ec681f3Smrg- nir/algebraic: Remove some redundant b2f logic-op reduction patterns 23847ec681f3Smrg- nir/algebraic: Add some max/min optimizations with 3 variables 23857ec681f3Smrg- nir/range-analysis: C++ linkage 23867ec681f3Smrg- nir/range_analysis: Handle vectors better in ssa_def_bits_used 23877ec681f3Smrg- intel/compiler: Silence unused parameter warnings in files that include brw_eu.h 23887ec681f3Smrg- intel: Silence unused parameter warnings in files that include gen_device_info.h 23897ec681f3Smrg- intel: Silence unused parameter warnings in files that include genX_pack.h 23907ec681f3Smrg- intel/compiler: Relax some conditions in try_copy_propagate 23917ec681f3Smrg- gallium/dri: Remove dri2_format_mapping::cpp 23927ec681f3Smrg- nir/search: Constify instruction parameter to search helpers 23937ec681f3Smrg- nir/algebraic: Apply addition property of equality more conservatively 23947ec681f3Smrg- nir/algebraic: Apply addition property of equality to the other ordering too 23957ec681f3Smrg- nir/range_analysis: Refactor fsat handling 23967ec681f3Smrg- nir/range_analysis: Add "is finite" range analysis tracking 23977ec681f3Smrg- nir/range_analysis: Add "is a number" range analysis tracking 23987ec681f3Smrg- nir/range_analysis: Fix analysis of fmin, fmax, or fsat with NaN source 23997ec681f3Smrg- nir/search: Use range analysis for is_finite 24007ec681f3Smrg- nir/range_analysis: Simplify analysis of bcsel 24017ec681f3Smrg- mesa: Add anything dynamically indexed before any non-dynamically indexed 24027ec681f3Smrg- mesa: Clean up _mesa_layout_parameters after previous commit 24037ec681f3Smrg- tgsi_exec: Fix NaN behavior of saturate 24047ec681f3Smrg- tgsi_exec: Fix NaN behavior of min and max 24057ec681f3Smrg 24067ec681f3SmrgIcecream95 (60): 24077ec681f3Smrg 24087ec681f3Smrg- pan/bi: Lower 64-bit integers 24097ec681f3Smrg- pan/bi: Handle 64-bit pack and unpack operations 24107ec681f3Smrg- pan/bi: Add some compute intrinsic loads 24117ec681f3Smrg- pan/bi: Set compute lowering options 24127ec681f3Smrg- pan/bi: Improve interoperability of the command-line disassembler 24137ec681f3Smrg- pan/bi: Implement load/store intrinsics 24147ec681f3Smrg- pan/bi: Implement load_kernel_input 24157ec681f3Smrg- panfrost: Set bifrost_props for compute shaders 24167ec681f3Smrg- pan/bi: Improve unknown intrinsic error 24177ec681f3Smrg- panfrost: Use the correct NIR options for OpenCL on Bifrost 24187ec681f3Smrg- pan/bi: Use pan_nir_lower_64bit_intrin 24197ec681f3Smrg- panfrost: Add a sysval for local_group_size 24207ec681f3Smrg- panfrost: Add a sysval for local_work_dim 24217ec681f3Smrg- panfrost: Assert on sysval overflow 24227ec681f3Smrg- pan/mdg: Limit int64 vectorization 24237ec681f3Smrg- pan/mdg: Don't reorder loads/stores past each other 24247ec681f3Smrg- pan/mdg: Allow 64-bit src_bitsize for comparison operations 24257ec681f3Smrg- pan/bi: Add w0 to the 'h01' swizzle bucket 24267ec681f3Smrg- pan/bi: Lower umul_high 24277ec681f3Smrg- panfrost: Set TLS for compute jobs 24287ec681f3Smrg- pan/bi: Implement saturated add/sub operations 24297ec681f3Smrg- pan/bi: Implement ihadd/irhadd operations 24307ec681f3Smrg- pan/bi: Implement packing ops between 32-bit vec1 and 16-bit vec2 24317ec681f3Smrg- pan/mdg: Fix spilling when scratch memory is used 24327ec681f3Smrg- pan/bi: Iterate from zero when setting RA interference 24337ec681f3Smrg- panfrost: Add a function to determine if a resource is 2D 24347ec681f3Smrg- panfrost: Only checksum resources when it makes sense to 24357ec681f3Smrg- panfrost: Add a debug flag to disable checksumming 24367ec681f3Smrg- panfrost: Transaction elimination support 24377ec681f3Smrg- panfrost: Fix the tile size assertion 24387ec681f3Smrg- pan/decode: Free mapped memory objects on BO unreference 24397ec681f3Smrg- panfrost: Add support for INTEL_blackhole_render 24407ec681f3Smrg- panfrost: Use normal malloc/free instead of ralloc for surfaces 24417ec681f3Smrg- panfrost: Add the tiler heap to fragment jobs 24427ec681f3Smrg- pan/bi: Return the size of the last clause from bi_pack 24437ec681f3Smrg- pan/bi: Fix shader prefetch size 24447ec681f3Smrg- panfrost: Fix clear color packing for 12-byte formats 24457ec681f3Smrg- pan/bi: Don't check liveness unless the index is valid 24467ec681f3Smrg- pan/bi: Use the correct size for UBO loads 24477ec681f3Smrg- pan/bi: Remove check for first_ubo_is_default_ubo 24487ec681f3Smrg- pan/bi: Implement image load/store 24497ec681f3Smrg- pan/bi: nir_intrinsic_image_size support 24507ec681f3Smrg- st/mesa: Update constants on alpha test change if it's lowered 24517ec681f3Smrg- panfrost: Disable early-z when alpha test is used 24527ec681f3Smrg- pan/mdg: Rename load/store operations 24537ec681f3Smrg- pan/mdg: Use appropriate sizes for global loads/stores 24547ec681f3Smrg- pipe-loader,gallium/drm: Fix the kmsro pipe_loader target 24557ec681f3Smrg- pipe-loader: Stop trying to use kmsro for vgem 24567ec681f3Smrg- panfrost: Implement panfrost_set_global_binding 24577ec681f3Smrg- panfrost: Flush output after disassembling shaders 24587ec681f3Smrg- panfrost: Only do point coord replacement for PIPE_PRIM_POINTS 24597ec681f3Smrg- panfrost: Only add resource checksum BOs to the batch once 24607ec681f3Smrg- panfrost: Align BO size to 4096 bytes 24617ec681f3Smrg- panfrost: Add fast path for graphics work group computation 24627ec681f3Smrg- panfrost: Unset shared/scanout binding flags for staging resources 24637ec681f3Smrg- pan/bi: Skip nir_opt_move/sink for blend shaders 24647ec681f3Smrg- panfrost: Fix shader texture count 24657ec681f3Smrg- pan/decode: Allow frame shader DCDs to be in another BO than the FBD 24667ec681f3Smrg- pan/mdg: Fix calculation of available work registers 24677ec681f3Smrg- panfrost: Fix viewport scissor for preload draws 24687ec681f3Smrg 24697ec681f3SmrgIlia Mirkin (55): 24707ec681f3Smrg 24717ec681f3Smrg- nv50/ir: ignore FS_BLEND_EQUATION_ADVANCED 24727ec681f3Smrg- nv50,nvc0: explicitly list recently-added caps 24737ec681f3Smrg- st/mesa: fix broken moves for u2i64 and related ops 24747ec681f3Smrg- nv50/ir: clear dnz flag when converting mul/mad to simpler ops 24757ec681f3Smrg- glsl: only expose int64 atomics when extension is enabled 24767ec681f3Smrg- cso: set index_bounds_valid = true for arrays draws 24777ec681f3Smrg- nvc0: index_bias is now only set for indexed draws 24787ec681f3Smrg- nvc0/ir: add fixup to deal with interpolateAtSample with non-MSAA 24797ec681f3Smrg- nv50,nvc0: clear internal vbo masks based on the trailing slots 24807ec681f3Smrg- ci: remove nouveau from shader-db runs 24817ec681f3Smrg- nouveau: reinstate fencing on screen destroy 24827ec681f3Smrg- nv50: add PIPE_CAP_NIR_IMAGES_AS_DEREF to unsupported list 24837ec681f3Smrg- nv50,nvc0: add scissored clear support 24847ec681f3Smrg- st/mesa: do scissored clears on depth/stencil as well when supported 24857ec681f3Smrg- i965: support GL_EXT_color_buffer_half_float 24867ec681f3Smrg- mesa: fix conditions for fp16 render format eligibility 24877ec681f3Smrg- mesa: fix fbo attachment size check for RBs, make it trigger in ES2 24887ec681f3Smrg- mesa: add tracking of reduction mode 24897ec681f3Smrg- st/mesa: add EXT_texture_filter_minmax support 24907ec681f3Smrg- nvc0: enable minmax reductions on gm200+ 24917ec681f3Smrg- docs: add notes about nvc0 support of ARB/EXT_texture_filter_minmax 24927ec681f3Smrg- mesa: only report INCOMPLETE_FORMATS for GLES1 / desktop 24937ec681f3Smrg- gallium,st: add missing viewport swizzles 24947ec681f3Smrg- nv50: initialize target for blit source surfaces 24957ec681f3Smrg- nv50,nvc0: remove explicit target argument from view creation 24967ec681f3Smrg- nv50: add appropriate space check before adding new pushbuffer 24977ec681f3Smrg- nvc0: ensure sufficient push space for indirect data 24987ec681f3Smrg- nvc0: fix reported driver queries for Pascal and later GPUs 24997ec681f3Smrg- mesa: fix restoring of texture attributes for msaa binding points 25007ec681f3Smrg- nv50: adapt texture and constbuf paths for compute shaders 25017ec681f3Smrg- nv50: add resource tracking for shader images and buffers 25027ec681f3Smrg- nv50: implement memory barrier handling 25037ec681f3Smrg- nv50: add texture, constbuf, image, buffer validation 25047ec681f3Smrg- nv50: pass in third axis via user param 25057ec681f3Smrg- nv50/ir: retrieve (n)ctaid.z from first user param 25067ec681f3Smrg- nv50/ir: force shared memory indirect to be an address 25077ec681f3Smrg- nv50/ir: do not use inline offsets for global, ensure indirect access 25087ec681f3Smrg- nv50/ir: fix emission of RED 25097ec681f3Smrg- nv50/ir: lower buffer to global 25107ec681f3Smrg- nv50/ir: fix emitting movs from imm to short registers 25117ec681f3Smrg- nv50/ir: fix emission of cvt with half-reg destinations 25127ec681f3Smrg- nv50/ir: fix emission of logic ops on half-regs 25137ec681f3Smrg- nv50/ir: fix emission of shifts on half-regs 25147ec681f3Smrg- nv50/ir: logic ops on half-regs can't take an immediate 25157ec681f3Smrg- nv50/ir: add support for 16-bit immediates 25167ec681f3Smrg- nv50/ir: fix emission of 16-bit add 25177ec681f3Smrg- nv50/ir: fix emission of cas without a destination 25187ec681f3Smrg- nv50: fix expression for ucp offset 25197ec681f3Smrg- nv50/ir: avoid inlining results of a locked load 25207ec681f3Smrg- nv50/ir: fix emission of ld/st lock/unlock 25217ec681f3Smrg- st/mesa: adapt for the case where buffers are not supported in frag 25227ec681f3Smrg- nv50/ir: fix texture size for msaa textures 25237ec681f3Smrg- nv50: emulate indirect draws 25247ec681f3Smrg- nv50/ir: fake SV_THREAD_KILL support 25257ec681f3Smrg- nv50: enable ARB_framebuffer_no_attachments 25267ec681f3Smrg 25277ec681f3SmrgItalo Nicola (15): 25287ec681f3Smrg 25297ec681f3Smrg- panfrost: fix attribute continuation decoding 25307ec681f3Smrg- panfrost: add 3d attribute buffer continuation to XML 25317ec681f3Smrg- panfrost: decode 3d attribute continuation 25327ec681f3Smrg- panfrost: add resource modifier conversion 25337ec681f3Smrg- panfrost: implement gallium->set_shader_images 25347ec681f3Smrg- panfrost: emit shader image attribute descriptors 25357ec681f3Smrg- panfrost: implement image_size sysval 25367ec681f3Smrg- pan/mdg: create nir pass to lower image coord bitsize 25377ec681f3Smrg- pan/mdg: enable image bitsize lowering pass 25387ec681f3Smrg- pan/mdg: add ld_image opcodes 25397ec681f3Smrg- pan/mdg: rename st_image opcodes and add float16 versions 25407ec681f3Smrg- pan/mdg: implement shader image instructions 25417ec681f3Smrg- pan/mdg: implement nir_intrinsic_image_size 25427ec681f3Smrg- panfrost: advertise images for midgard 25437ec681f3Smrg- pan/mdg: prevent csel_v from being scheduled alongside writeout 25447ec681f3Smrg 25457ec681f3SmrgIván Briano (4): 25467ec681f3Smrg 25477ec681f3Smrg- anv: don't advertise mipmaps for linear 3D surfaces on BDW 25487ec681f3Smrg- anv: move buffer size alignment into helper function 25497ec681f3Smrg- anv: use helper function to get the buffer size 25507ec681f3Smrg- intel, anv: propagate robustness setting to nir_opt_load_store_vectorize 25517ec681f3Smrg 25527ec681f3SmrgJames Jones (4): 25537ec681f3Smrg 25547ec681f3Smrg- nouveau: Stash supported sector layout in screen 25557ec681f3Smrg- nouveau: Use DRM_FORMAT_MOD_NVIDIA_BLOCK_LINEAR_2D 25567ec681f3Smrg- nouveau: no modifier != the invalid modifier 25577ec681f3Smrg- nouveau: Use format modifiers in buffer allocation 25587ec681f3Smrg 25597ec681f3SmrgJames Park (22): 25607ec681f3Smrg 25617ec681f3Smrg- radv: Use linker pragma to simulate weak functions 25627ec681f3Smrg- radv: Remove unnecessary headers 25637ec681f3Smrg- radv: Stub sections that don't have _WIN32 support 25647ec681f3Smrg- radv: Modify radv_descriptor_set for MSVC legality 25657ec681f3Smrg- radv: Pointer arithmetic on char/uint8_t, not void 25667ec681f3Smrg- radv: Disable calibrated timestamps on Windows 25677ec681f3Smrg- radv: Use typed outarray API 25687ec681f3Smrg- radv: Fix struct initialization for MSVC 25697ec681f3Smrg- gallium/tessellator: Fix warning suppression 25707ec681f3Smrg- microsoft: Fix comma in variadic macro for MSVC 25717ec681f3Smrg- ac: Remove unnecessary header 25727ec681f3Smrg- radv: Use typed outarray API 25737ec681f3Smrg- amd: Disable surface modifier test on Windows 25747ec681f3Smrg- radv: Pointer arithmetic without void* 25757ec681f3Smrg- radv: Update JSON generator if Windows 25767ec681f3Smrg- vulkan: Use typed outarray API 25777ec681f3Smrg- vulkan: Fix source list for vulkan_wsi on Windows 25787ec681f3Smrg- vulkan: Update dispatch table gen for Windows 25797ec681f3Smrg- vulkan/util: Use util_bitcount 25807ec681f3Smrg- ac/rgp: BSD elf library compatibility 25817ec681f3Smrg- amd: Hide amdgpu_drm.h on Windows 25827ec681f3Smrg- amd: Hide drm_fourcc.h on Windows 25837ec681f3Smrg 25847ec681f3SmrgJames Zhu (1): 25857ec681f3Smrg 25867ec681f3Smrg- amd: add Aldebaran chip enum 25877ec681f3Smrg 25887ec681f3SmrgJan Beich (2): 25897ec681f3Smrg 25907ec681f3Smrg- ac: prefer system EM_AMDGPU definition 25917ec681f3Smrg- ac/rgp: define EM_AMDGPU if missing for compatibility 25927ec681f3Smrg 25937ec681f3SmrgJason Ekstrand (195): 25947ec681f3Smrg 25957ec681f3Smrg- intel/tools: Decode COMPUTE_WALKER 25967ec681f3Smrg- intel/fs: Allow compute dispatch without a pushed subgroup ID on Gen12-HP 25977ec681f3Smrg- anv: Add a general state pool 25987ec681f3Smrg- intel/fs: Emit code for Gen12-HP indirect compute data 25997ec681f3Smrg- anv: Enable push constants on gen12-hp 26007ec681f3Smrg- intel/genxml,anv,iris: Drop the legacy compute path from gen125.xml 26017ec681f3Smrg- anv: Add a trivial implementation of VK_KHR_deferred_host_operation 26027ec681f3Smrg- anv: Exit early from cmd_buffer_apply_pipe_flushes 26037ec681f3Smrg- anv: Take the set of stages to flush in flush_descriptor_sets 26047ec681f3Smrg- anv: Only flush descriptors used by the pipeline 26057ec681f3Smrg- anv: Early-exit from cmd_buffer_flush_state 26067ec681f3Smrg- nir/lower_int64: Add a level of wrapper functions 26077ec681f3Smrg- nir/lower_int64: Fix lowering of f2[ui]64 for 16-bit float 26087ec681f3Smrg- nir/lower_int64: Add lowering for some 64-bit subgroup ops 26097ec681f3Smrg- nir/lower_int64: Add lowering for 64-bit iadd shuffle/reduce 26107ec681f3Smrg- nir/lower_int64: Lower 64-bit vote_ieq 26117ec681f3Smrg- intel/compiler: Return 1 for immediates in regs_read 26127ec681f3Smrg- intel/compiler: Move brw_reg_type_for_bit_size to brw_reg_type.h 26137ec681f3Smrg- intel/reg,fs: Handle immediates properly in subscript() 26147ec681f3Smrg- intel/fs: QUAD_SWIZZLE requires packed data 26157ec681f3Smrg- intel/fs: Support 64-bit SEL_EXEC on Gen11+ 26167ec681f3Smrg- intel/fs: Support 64-bit SHUFFLE on Gen11+ 26177ec681f3Smrg- intel/fs: Support 64-bit CLUSTER_BROADCAST on Gen11+ 26187ec681f3Smrg- intel/fs: Properly lower 64-bit MUL on 64-bit-incapable platforms 26197ec681f3Smrg- intel/fs: Refactor our shuffle emit code 26207ec681f3Smrg- intel/fs: Implement umin/umax shuffle 26217ec681f3Smrg- anv: Advertise shaderInt64 on Gen11+ 26227ec681f3Smrg- anv: Break SAMPLE_PATTERN and MULTISAMPLE emit into helpers 26237ec681f3Smrg- intel/fs: Add an ex_desc field to fs_inst for SHADER_OPCODE_SEND 26247ec681f3Smrg- anv: Drop anv_dump 26257ec681f3Smrg- anv: Fix an old parameter name in GetDeviceQueue 26267ec681f3Smrg- anv: Refactor anv_queue_finish() 26277ec681f3Smrg- anv: Add an anv_queue_family struct 26287ec681f3Smrg- nir/from_ssa: Respect and populate divergence information 26297ec681f3Smrg- vulkan/meson: Add missing dependencise on vk_extensions_gen.py 26307ec681f3Smrg- anv: Clean up the vk_device on the CreateDevice error path 26317ec681f3Smrg- radv: Properly clean up vk_device 26327ec681f3Smrg- turnip: Properly clean up vk_device 26337ec681f3Smrg- v3dv: Properly clean up vk_device 26347ec681f3Smrg- lavapipe: Properly clean up vk_device 26357ec681f3Smrg- vulkan: Move vk_device to its own file 26367ec681f3Smrg- vulkan: Add a return code to vk_device_init 26377ec681f3Smrg- vulkan: Add common extension tables 26387ec681f3Smrg- anv: Use the common extension table struct 26397ec681f3Smrg- vulkan: Add common dispatch table generation 26407ec681f3Smrg- vulkan: Add dispatch table loading helpers 26417ec681f3Smrg- vulkan-overlay-layer: Use the new dispatch tables 26427ec681f3Smrg- vulkan: Add dispatch table lookup helpers 26437ec681f3Smrg- vulkan: Add common instance and physical device structs 26447ec681f3Smrg- vulkan: Add generators for instance trampoline functions 26457ec681f3Smrg- vulkan: Add entrypoint tables and related helpers 26467ec681f3Smrg- vulkan: Add common Get*ProcAddr implementations 26477ec681f3Smrg- vulkan: Add a common entrypoint table generator 26487ec681f3Smrg- anv: Add a single anv_genX switch macro 26497ec681f3Smrg- anv: Use the common dispatch framework 26507ec681f3Smrg- vulkan: Add framework for common entrypoints 26517ec681f3Smrg- vulkan,anv: Move GetDeviceProcAddr to common code 26527ec681f3Smrg- vulkan,anv: Add common entrypoints for VK_EXT_private_data 26537ec681f3Smrg- anv: Make Get*MemoryRequirements a wrapper 26547ec681f3Smrg- vulkan,anv: Move a bunch of trivial wrappers to common code 26557ec681f3Smrg- vulkan,anv: Move VK_KHR_copy_commands2 wrappers to common code 26567ec681f3Smrg- vulkan: Add a truly common VK_EXT_debug_report implementation 26577ec681f3Smrg- anv: Switch to the common VK_EXT_debug_report 26587ec681f3Smrg- turnip: Use the common dispatch framework 26597ec681f3Smrg- turnip: Use common entrypoints for VK_EXT_private_data 26607ec681f3Smrg- turnip: Drop some legacy wrappers in favor of common code 26617ec681f3Smrg- turnip: Switch to the common VK_EXT_debug_report 26627ec681f3Smrg- lavapipe: Drop some wrappers in favor of common code 26637ec681f3Smrg- v3dv: Drop v3dv_instance::app_info 26647ec681f3Smrg- v3dv: Use common entrypoints for VK_EXT_private_data 26657ec681f3Smrg- v3dv: Switch to the common VK_EXT_debug_report 26667ec681f3Smrg- radv: Use common entrypoints for VK_EXT_private_data 26677ec681f3Smrg- radv: Switch to the common VK_EXT_debug_report 26687ec681f3Smrg- vulkan: Make vk_debug_report_callback derive from vk_object_base 26697ec681f3Smrg- anv: Use vk_object_base::type for debug_report 26707ec681f3Smrg- vulkan: Use vk_object_base::type for debug_report 26717ec681f3Smrg- vulkan: Make the debug_report implementation internal 26727ec681f3Smrg- anv,radv: Use common entrypoints for VK_KHR_deferred_operation 26737ec681f3Smrg- vulkan: Rework vk_device_init and friends 26747ec681f3Smrg- vulkan: Drop the type_prefix parameter from gen_extensions 26757ec681f3Smrg- nir: Add some ssa-only fast-paths for nir_src rewrite 26767ec681f3Smrg- nir: Drop the lower_mem_constant_vars declaration 26777ec681f3Smrg- vulkan: Add a common helper for enumerating instance extension properties 26787ec681f3Smrg- vulkan: Rework extension disabling on Android 26797ec681f3Smrg- anv: Pull the patch version from the XML 26807ec681f3Smrg- anv: Make anv_icd.py more generic and independent 26817ec681f3Smrg- anv,vulkan: Move anv_icd.py to a common location 26827ec681f3Smrg- anv: Move extension tables to anv_device.c 26837ec681f3Smrg- anv: Add fake graphics-only and compute-only queue families 26847ec681f3Smrg- nir: Add a couple helpers for phis and cursors 26857ec681f3Smrg- nir/lower_bit_size: Support phi instructions 26867ec681f3Smrg- intel/nir: Lower 8-bit phis on Gen11+ 26877ec681f3Smrg- nir: Add some range analysis for used bits 26887ec681f3Smrg- nir/algebraic: Clean up up-cast of down-cast when we can 26897ec681f3Smrg- nir/algebraic: Covert up-cast of down-cast to extract on Intel 26907ec681f3Smrg- spirv: Store the nir_function in vtn_function 26917ec681f3Smrg- spirv: Delete the impl for prototype-only functions 26927ec681f3Smrg- nir: Don't optimize bcsel-of-shuffle across blocks 26937ec681f3Smrg- nir: Fix parameter order in the bcsel-of-shuffle optimization 26947ec681f3Smrg- nir/opt_large_constants: Handle generic pointers 26957ec681f3Smrg- intel/fs: Shuffle can't handle source modifiers 26967ec681f3Smrg- anv/formats: Advertise linear sampling on depth formats 26977ec681f3Smrg- anv/android: Re-implement AcquireImageANDROID 26987ec681f3Smrg- intel/mi_builder: Create a context in the tests 26997ec681f3Smrg- intel/mi_builder: Delete a bogus comment 27007ec681f3Smrg- intel/mi_builder: Fix a misleading comment 27017ec681f3Smrg- intel/mi_builder: Short-circuit shifts in more cases 27027ec681f3Smrg- intel/mi_builder: Add constant folding 27037ec681f3Smrg- intel/mi_builder: Rewrite unit tests in terms of constant folding 27047ec681f3Smrg- intel/mi_builder: Add tests for gen_mi_z and gen_mi_nz 27057ec681f3Smrg- intel: Rename gen_mi_builder.h to mi_builder.h 27067ec681f3Smrg- intel/mi_builder: Drop the gen\_ prefix 27077ec681f3Smrg- intel/mi_builder: Use AddCSMMIOStartOffset for LRI 27087ec681f3Smrg- intel/mi_builder: Add ieq/ine helpers 27097ec681f3Smrg- intel/mi_builder: Support inverted values in mi_store 27107ec681f3Smrg- intel/mi_builder: Add load/store_offest on GFX 12.5+ 27117ec681f3Smrg- genxml: Clean up MI_SET_PREDICATE 27127ec681f3Smrg- intel/batch_decoder: Don't follow predicated MI_BATCH_BUFFER_START 27137ec681f3Smrg- intel/mi_builder: Use softpin for tests on gen8+ 27147ec681f3Smrg- intel/mi_builder: Return an address from __gen_get_batch_address 27157ec681f3Smrg- intel/mi_builder: Add control-flow support 27167ec681f3Smrg- nir: Add and use a new nir_ssa_def_rewrite_uses_src helper 27177ec681f3Smrg- nir: Make nir_ssa_def_rewrite_uses take an SSA value 27187ec681f3Smrg- nir: Make nir_ssa_def_rewrite_uses_after take an SSA value 27197ec681f3Smrg- intel/mi_builder: Fix some indentation 27207ec681f3Smrg- intel/mi_builder: Fix a couple of #ifs 27217ec681f3Smrg- anv: Drop anv_extensions.py 27227ec681f3Smrg- turnip: Move the CreateRenderPass wrapper to common code 27237ec681f3Smrg- anv: Move multialloc to common code 27247ec681f3Smrg- vulkan: Use VK_MULTIALLOC in CreateRenderPass 27257ec681f3Smrg- anv: Move vk_format helpers to common code 27267ec681f3Smrg- vulkan: Use correct aspectMask in CreateRenderPass 27277ec681f3Smrg- vulkan: Add some asserts and checks for multiview in CreateRenderPass 27287ec681f3Smrg- vulkan: Preserve preserve attachments in CreateRenderPass 27297ec681f3Smrg- anv: Drop CreateRenderPass 27307ec681f3Smrg- radv/meta: Use CreateRenderPass2 27317ec681f3Smrg- radv: Drop CreateRenderPass 27327ec681f3Smrg- intel/fs: Use INTEL_MASK for pushish constant address masking 27337ec681f3Smrg- intel/fs: Handle payload node interference in destinations 27347ec681f3Smrg- vulkan: Use ALWAYS_INLINE for multialloc 27357ec681f3Smrg- vk/alloc: Handle zero sizes better in vk_multialloc_add 27367ec681f3Smrg- vulkan/alloc: Add VK_MULTIALLOC_DECL macros 27377ec681f3Smrg- vulkan/util: Add a type parameter to vk_multialloc_add 27387ec681f3Smrg- vulkan/alloc: Use char * for pointer arithmetic 27397ec681f3Smrg- anv,genxml: Handle L3SQCREG1_SQGHPCI in GenXML 27407ec681f3Smrg- anv: Add an anv_batch_write_reg macro 27417ec681f3Smrg- iris: Add an iris_write_reg macro 27427ec681f3Smrg- genxml: Make 1-bit L3$ config register fields bool on Gen7 27437ec681f3Smrg- intel/fs,rt: Add a predicate to load_global_const_block 27447ec681f3Smrg- anv: Use load_global_constant for shader constants 27457ec681f3Smrg- anv: Use nir_shader_instructions_pass in apply_pipeline_layout 27467ec681f3Smrg- anv/apply_pipeline_layout: Refactor descriptor chasing code 27477ec681f3Smrg- anv/apply_pipeline_layout: Rework the early pass index/offset helpers 27487ec681f3Smrg- anv/apply_pipeline_layout: Lower UBO loads in the early pass 27497ec681f3Smrg- anv/apply_pipeline_layout: Run DCE between the early and late passes 27507ec681f3Smrg- anv/apply_pipeline_layout: Move bounds checking later for index/offset 27517ec681f3Smrg- anv/apply_pipeline_layout: Plumb through a UBO address format 27527ec681f3Smrg- anv/apply_pipeline_layout: Add some switch statements 27537ec681f3Smrg- nir: Add a new 64+32-bit address format 27547ec681f3Smrg- anv: Use 64bit_global_32bit_offset for SSBOs 27557ec681f3Smrg- anv: Rework the 64bit_bounded_global resource index format 27567ec681f3Smrg- anv: Zero out the last dword of UBO/SSBO descriptors in the shader 27577ec681f3Smrg- anv/apply_pipeline_layout: Apply dynamic offsets in load_ssbo_descriptor 27587ec681f3Smrg- anv/apply_pipeline_layout: Refactor all our descriptor address builders 27597ec681f3Smrg- anv/apply_pipeline_layout: Rework the desc_addr_format helper 27607ec681f3Smrg- anv/apply_pipeline_layout: Use the new helpers for early lowering 27617ec681f3Smrg- anv/apply_pipeline_layout: Use the new helpers for images 27627ec681f3Smrg- nir/lower_io: Support global addresses for UBOs in nir_lower_explicit_io 27637ec681f3Smrg- anv: Add a pass for lowering A64 UBO access 27647ec681f3Smrg- anv: Do UBO loads with global addresses for bindless 27657ec681f3Smrg- anv/apply_pipeline_layout: Add support for A64 descriptor access 27667ec681f3Smrg- nir: Add image atomic_fmin/fmax intrinsics 27677ec681f3Smrg- spirv: Add support for SPV_EXT_shader_atomic_float_min_max 27687ec681f3Smrg- intel/fs: Add support for 16-bit A64 float and integer atomics 27697ec681f3Smrg- intel/genxml: Binding table pointers are 15 bits on GFX version 12.5+ 27707ec681f3Smrg- intel/tools: Handle milti-LRI in the batch decoder 27717ec681f3Smrg- intel/tools: Handle GT_MODE in the batch decoder 27727ec681f3Smrg- intel/genxml: Make BindingTablePoolEnable a bool 27737ec681f3Smrg- intel/tools: Handle BINDING_TABLE_POOL_ALLOC in batch decoding 27747ec681f3Smrg- anv: Align inline uniform data to ANV_UBO_ALIGNMENT 27757ec681f3Smrg- anv: Implement VK_EXT_conservative_rasterization 27767ec681f3Smrg- anv: Fix coverage masks for VK_EXT_conservative_rasterization 27777ec681f3Smrg- intel: Drop gen_device_info::has_resource_streamer 27787ec681f3Smrg- anv: Clean up anv_device_memory::base on failure 27797ec681f3Smrg- anv: Refactor framebuffer creation 27807ec681f3Smrg- anv: Clean up anv_descriptor_pool::base on the error path 27817ec681f3Smrg- anv: Clean up anv_semaphore::base on the error path 27827ec681f3Smrg- vulkan: Add a vk_object_multialloc helper 27837ec681f3Smrg- anv: Use vk_object_alloc/free 27847ec681f3Smrg- anv: Make memory type and queue family pointers const 27857ec681f3Smrg- intel: fix querying mip levels on null surfaces on SKL and prior 27867ec681f3Smrg- intel/compiler: Don't insert barriers for NULL sources 27877ec681f3Smrg- anv: Use the same re-order mode for streamout as for GS 27887ec681f3Smrg- intel/isl: Fix isl_color_value_unpack to match the prototype 27897ec681f3Smrg- intel/nir: Set lower txs with non-zero LOD 27907ec681f3Smrg 27917ec681f3SmrgJeremy Huddleston (5): 27927ec681f3Smrg 27937ec681f3Smrg- darwin: Use the system libexpat 27947ec681f3Smrg- util: Fix pointer to integer conversion error when using libunwind 27957ec681f3Smrg- darwin: Use the system libunwind 27967ec681f3Smrg- Fall back on clock_gettime when timespec_get() is unavailable 27977ec681f3Smrg- Adjust dylib compatibility versions to match what was set by mesa-18.3's autotools-based builds 27987ec681f3Smrg 27997ec681f3SmrgJesse Natalie (76): 28007ec681f3Smrg 28017ec681f3Smrg- nir: Work around MSVC x86 internal compiler error 28027ec681f3Smrg- main: Undefine MemoryBarrier for Windows 28037ec681f3Smrg- glapi: Undefine MemoryBarrier 28047ec681f3Smrg- mapi: Undefine MemoryBarrier 28057ec681f3Smrg- drisw: Disable automatic use of layered drivers with LIBGL_ALWAYS_SOFTWARE 28067ec681f3Smrg- wgl: Refactor screen creation to a function 28077ec681f3Smrg- wgl: Add a loop for screen creation with an ordered list of fallbacks 28087ec681f3Smrg- d3d12: Fail screen creation if a shader validator is needed and can't be created 28097ec681f3Smrg- wgl: Disable automatic use of layered drivers with LIBGL_ALWAYS_SOFTWARE 28107ec681f3Smrg- CI: Use a sha for the Windows SPIRV-LLVM-Translator dependency 28117ec681f3Smrg- microsoft/clc: Add -fgnu89-inline to clang args 28127ec681f3Smrg- microsoft/clc: Add test with inline function 28137ec681f3Smrg- clover: Add -fgnu89-inline to Clang command line 28147ec681f3Smrg- microsoft/clc: Only apply float scaling to 32bit fdiv 28157ec681f3Smrg- microsoft/clc: Let lower_vars_to_explicit_types fill kernel input driver_location 28167ec681f3Smrg- microsoft/clc: Fix wrap modes for inline samplers for integer textures 28177ec681f3Smrg- microsoft/clc: Move inline samplers to the end of the variable list 28187ec681f3Smrg- microsoft/clc: Use driver_location for metadata instead of re-computing offsets 28197ec681f3Smrg- microsoft/clc: Re-order dead variable removal after uniform vars_to_explicit_types 28207ec681f3Smrg- microsoft/clc: Add a test with an unused kernel arg 28217ec681f3Smrg- glapi: Support "ELF" TLS on Windows 28227ec681f3Smrg- docs: Document USE_ELF_TLS can work on Windows too 28237ec681f3Smrg- meson/gallium: Add an option to not use LLVM for gallium draw module 28247ec681f3Smrg- d3d12: Handle null constant buffers 28257ec681f3Smrg- nir: Add a nir_after_instr_and_phis helper 28267ec681f3Smrg- microsoft/compiler: Don't separate phis while inserting upcasts 28277ec681f3Smrg- d3d12: Move descriptor pools to screen, and add lock 28287ec681f3Smrg- d3d12: Handle is_new_style_shadow comparison filtering 28297ec681f3Smrg- d3d12: Really handle null constant buffers 28307ec681f3Smrg- u_format: Add restrict to fn pointer and manual format pack/unpack/fetch 28317ec681f3Smrg- panfrost: Add a Meson dependency on bi_opcodes.h for bifrost_compiler 28327ec681f3Smrg- meson, util: Make zlib optional again 28337ec681f3Smrg- nir: Temporarily disable optimizations for MSVC ARM64 28347ec681f3Smrg- wgl: Fix wglCreatePbufferARB pixel format lookup 28357ec681f3Smrg- d3d12: Use ID3D12Device9::CreateCommandQueue1 when available 28367ec681f3Smrg- d3d12: Use CreateDXGIFactory2 and use the debug flag when appropriate 28377ec681f3Smrg- wgl: Add unit test infrastructure for OpenGL32.dll on Windows 28387ec681f3Smrg- wgl: Add a context to framebuffer destruction 28397ec681f3Smrg- d3d12: Add a constant for num_buffers 28407ec681f3Smrg- d3d12: Clean up swapchains on framebuffer destruction 28417ec681f3Smrg- wgl, d3d12: Add a d3d12-specific test for swapchain leaks 28427ec681f3Smrg- microsoft/compiler: Move blob_init earlier to prevent crash on failure 28437ec681f3Smrg- microsoft/compiler: Add copy_prop_vars to optimization loop 28447ec681f3Smrg- microsoft/compiler: Add a lowering pass to split clip/cull distance compact arrays 28457ec681f3Smrg- microsoft/compiler: Enable dxil_nir.h to be included from C++ 28467ec681f3Smrg- microsoft/compiler: Support compact arrays for clip/cull in nir_to_dxil 28477ec681f3Smrg- d3d12: Use compact arrays for clip/cull distance 28487ec681f3Smrg- microsoft/spirv_to_dxil: Implement TODO for removing dead functions 28497ec681f3Smrg- spirv_to_dxil: Handle clip/cull distance 28507ec681f3Smrg- microsoft/compiler: Fix barrier flag for shared memory 28517ec681f3Smrg- microsoft/spirv_to_dxil: Lower globals to function_temp 28527ec681f3Smrg- microsoft/spirv_to_dxil: Lower io arrays 28537ec681f3Smrg- microsoft/compiler: Support fp16 i/o vars 28547ec681f3Smrg- nir: Add a new opcode for [un]packing doubles 28557ec681f3Smrg- microsoft/compiler: Add a lowering pass to emit double [un]pack instructions 28567ec681f3Smrg- microsoft/compiler: Implement new double pack/unpack alu ops 28577ec681f3Smrg- microsoft/spirv_to_dxil: Support doubles 28587ec681f3Smrg- microsoft/compiler: Add some more float16 support 28597ec681f3Smrg- meson: Refuse to build lavapipe without llvmpipe 28607ec681f3Smrg- vtn: Don't warn about linkage capability if we're creating a NIR library 28617ec681f3Smrg- vtn: Add a cap for CL drivers to support read-write images 28627ec681f3Smrg- microsoft/clc: Update unit test to always use COMMON state for buffers 28637ec681f3Smrg- meson: For MSVC, suppress warnings generated by useless delayloads 28647ec681f3Smrg- driconf: Remove default values from string driconf entries 28657ec681f3Smrg- CI: Enable -werror for Windows 28667ec681f3Smrg- vtn: Support scoped control barriers for OpenCL too 28677ec681f3Smrg- nir_opt_deref: ptr_as_array(deref_cast<T*>(x))[0] isn't the same as x[0] if the cast has alignment 28687ec681f3Smrg- nir: Fix MSVC warning C4334 (32bit shift cast to 64bit) 28697ec681f3Smrg- d3d12: Fix MSVC warning C4334 (32bit shift cast to 64bit) 28707ec681f3Smrg- microsoft/clc: Fix MSVC unreferenced variable warnings 28717ec681f3Smrg- microsoft/clc: Fix undeclared function warning 28727ec681f3Smrg- microsoft/compiler: Fix MSVC warning C4334 (32bit shift cast to 64bit) 28737ec681f3Smrg- shader_enums: Fix MSVC warning C4334 (32bit shift cast to 64bit) 28747ec681f3Smrg- gallium/aux: Fix MSVC warning C4334 (32bit shift cast to 64bit) 28757ec681f3Smrg- llvmpipe: Fix MSVC warning C4334 (32bit shift cast to 64bit) 28767ec681f3Smrg- xmlconfig: Fix MSVC warning C4334 (32bit shift cast to 64bit) 28777ec681f3Smrg 28787ec681f3SmrgJesse Schwartzentruber (1): 28797ec681f3Smrg 28807ec681f3Smrg- glcpp: Fix undefined behaviour in glcpp 28817ec681f3Smrg 28827ec681f3SmrgJoel Linn (2): 28837ec681f3Smrg 28847ec681f3Smrg- freedreno/a2xx: fix/add RBBM perfcounter 28857ec681f3Smrg- freedreno/a2xx: add RB perfcounter 1-3 28867ec681f3Smrg 28877ec681f3SmrgJonathan Marek (13): 28887ec681f3Smrg 28897ec681f3Smrg- turnip: fix logicOp 28907ec681f3Smrg- turnip: delete unused vk_format_parse.py file 28917ec681f3Smrg- turnip: use vk_format_is_int to disable COLOR_ATTACHMENT_BLEND_BIT 28927ec681f3Smrg- turnip: IMAGE_FILTER_{LINEAR,CUBIC}_BIT only for non-integer formats 28937ec681f3Smrg- turnip: don't always use 3d ops for blit_image 28947ec681f3Smrg- turnip: add missing register write to disable dithering 28957ec681f3Smrg- freedreno/registers: use macro instead of inline function for array regs 28967ec681f3Smrg- freedreno/a6xx: update perfcntr registers (declare as arrays) 28977ec681f3Smrg- freedreno/a6xx: always use reg64 for address registers (no LO/HI) 28987ec681f3Smrg- freedreno/a6xx: update some registers 28997ec681f3Smrg- freedreno/a6xx: set SP_PERFCTR_ENABLE in computerator 29007ec681f3Smrg- turnip: implement VK_KHR_shader_float_controls 29017ec681f3Smrg- turnip: enable VK_KHR_shader_float16_int8 29027ec681f3Smrg 29037ec681f3SmrgJordan Justen (28): 29047ec681f3Smrg 29057ec681f3Smrg- intel/genxml/gen125: Add CFE_STATE and COMPUTE_WALKER 29067ec681f3Smrg- intel/compiler: Disable push constants on gen12-hp 29077ec681f3Smrg- anv: Emit CFE_STATE for gen12-hp 29087ec681f3Smrg- anv: Don't use MEDIA_INTERFACE_DESCRIPTOR_LOAD for gen12-hp 29097ec681f3Smrg- anv: Use COMPUTE_WALKER for gen12-hp 29107ec681f3Smrg- iris: Add support for COMPUTE_WALKER 29117ec681f3Smrg- iris: Fix android build due to missing link to libmesa_iris_gen125 29127ec681f3Smrg- anv: Add exec_flags to anv_queue 29137ec681f3Smrg- anv: Turn device->queue into an array 29147ec681f3Smrg- anv: Print queue number with INTEL_DEBUG=bat 29157ec681f3Smrg- anv: Support i915 query (DRM_IOCTL_I915_QUERY) from Linux v4.17 29167ec681f3Smrg- anv: Gather engine info from i915 if available 29177ec681f3Smrg- anv: Add anv_gem_count_engines 29187ec681f3Smrg- anv: Support multiple engines with DRM_IOCTL_I915_GEM_CONTEXT_CREATE_EXT 29197ec681f3Smrg- anv: Add ANV_QUEUE_OVERRIDE env-var to override advertised queues 29207ec681f3Smrg- i965: Call util_cpu_detect() early in screen creation 29217ec681f3Smrg- intel: Use GEN_VERSIONx10 in more places 29227ec681f3Smrg- intel/dev: Add devinfo genx10 field 29237ec681f3Smrg- intel: Use devinfo genx10 field 29247ec681f3Smrg- anv: Restructure mem heap/type init code 29257ec681f3Smrg- anv: Put cache memory type first on non-llc platforms 29267ec681f3Smrg- anv: Add mem heap/type support for local-mem 29277ec681f3Smrg- anv: Drop has_slm in emit_l3_config for gen11+ 29287ec681f3Smrg- anv: Use fallback paths if DRM_I915_QUERY_ENGINE_INFO fails 29297ec681f3Smrg- i965/gen11: Fix must-be-ones bit positions in 3D_MODE 29307ec681f3Smrg- genxml/gen12: 3D_MODE bits 31:16 are no longer must-be-one 29317ec681f3Smrg- Revert "intel/compiler: Silence unused parameter warning in update_inst_scoreboard" 29327ec681f3Smrg- intel/compiler: Fix INTEL_DEBUG=hex 29337ec681f3Smrg 29347ec681f3SmrgJose Maria Casanova Crespo (4): 29357ec681f3Smrg 29367ec681f3Smrg- v3d: Enables DRM_FORMAT_MOD_BROADCOM_SAND128 support 29377ec681f3Smrg- v3d: YUV formats at is_dmabuf_modifier_supported are external_only 29387ec681f3Smrg- v3d: YUV formats at query_dmabuf_modifiers are external_only 29397ec681f3Smrg- v3d: DRM_FORMAT_MOD_BROADCOM_SAND128 only available for NV12 format. 29407ec681f3Smrg 29417ec681f3SmrgJoshua Ashton (3): 29427ec681f3Smrg 29437ec681f3Smrg- lavapipe: handle NULL pStrides in CmdBindVertexBuffers2EXT 29447ec681f3Smrg- lavapipe: implement CmdBindVertexBuffers with CmdBindVertexBuffers2EXT 29457ec681f3Smrg- lavapipe: enable KHR_image_format_list 29467ec681f3Smrg 29477ec681f3SmrgJosé Fonseca (6): 29487ec681f3Smrg 29497ec681f3Smrg- scons: Add u_format_pack.h include path. 29507ec681f3Smrg- wgl: Match opengl32.dll export ordinals. 29517ec681f3Smrg- util: Always use timespec_get on Windows. 29527ec681f3Smrg- appveyor: Remove integration. 29537ec681f3Smrg- scons: Remove. 29547ec681f3Smrg- gitlab-ci: Build meson-mingw32-x86_64 w/o zlib. 29557ec681f3Smrg 29567ec681f3SmrgJuan A. Suarez Romero (40): 29577ec681f3Smrg 29587ec681f3Smrg- v3d: fix dest offset in TFU setup 29597ec681f3Smrg- v3d: use a compatible supported format for TFU-based blit 29607ec681f3Smrg- vc4/ci: Replace expect script by python script 29617ec681f3Smrg- ci/vc4: allow custom timeout values for activity 29627ec681f3Smrg- ci/vc4: rename stage to Broadcom 29637ec681f3Smrg- ci/vc4: Add piglit job 29647ec681f3Smrg- ci: Bump deqp to current vulkan-cts-1.2.5.1 29657ec681f3Smrg- ci: add option to overwrite CPU arch 29667ec681f3Smrg- ci/v3d: Add V3D and V3DV testing 29677ec681f3Smrg- ci/v3d: Update expected resuls for piglit 29687ec681f3Smrg- ci/piglit: allow parallel piglit jobs 29697ec681f3Smrg- ci/vc4/v3d: Parallelize piglit jobs 29707ec681f3Smrg- ci/piglit: fix parallel piglit results 29717ec681f3Smrg- ci/baremetal: highlight message errors 29727ec681f3Smrg- ci/broadcom: retry always when serial log timeout 29737ec681f3Smrg- ci: Bump deqp to vk-gl-cts 1.2.5.2 29747ec681f3Smrg- ci/broadcom: allow custom kernels 29757ec681f3Smrg- vc4: destroy renderonly object if present 29767ec681f3Smrg- ci/armXX: add libgl1-mesa-dev dependency 29777ec681f3Smrg- ci/v3dv: add flaky test in the skip list 29787ec681f3Smrg- ci/vc4/v3d: run piglit testsuite against Xorg 29797ec681f3Smrg- ci/broadcom: use new piglit runner 29807ec681f3Smrg- ci/broadcom: update piglit expected results 29817ec681f3Smrg- ci/v3d: run full GLES3 and GLES31 testsuite 29827ec681f3Smrg- broadcom/compiler: fix unused value 29837ec681f3Smrg- v3dv: fix unused value 29847ec681f3Smrg- ci/v3dv: update flaky tests 29857ec681f3Smrg- broadcom/cle: do not leak spec 29867ec681f3Smrg- ci/broadcom: update expected list 29877ec681f3Smrg- v3d: use uint type in _gen_unpack_uint 29887ec681f3Smrg- broadcom/compiler: fix first_component assertion 29897ec681f3Smrg- broadcom/compiler: use signed pointers for packed condition 29907ec681f3Smrg- ci/broadcom: use SNMP to turn on/off devices 29917ec681f3Smrg- broadcom/compiler: use VPM offsets in GS load_per_vertex input 29927ec681f3Smrg- v3d: use GS_BIN inputs as VS_BIN outputs 29937ec681f3Smrg- v3dv: fix assertion 29947ec681f3Smrg- ci: Update VK-GL-CTS to 1.2.6.0 29957ec681f3Smrg- v3d: do not emit attribute if has no resource 29967ec681f3Smrg- ci/v3dv: skip Vulkan waiver tests 29977ec681f3Smrg- util/hash_table: do not leak u64 struct key 29987ec681f3Smrg 29997ec681f3SmrgJérôme Glisse (3): 30007ec681f3Smrg 30017ec681f3Smrg- gallium: add support for SVM (Share Virtual Memory) migrate 30027ec681f3Smrg- clover: implement clEnqueueSVMMigrateMem 30037ec681f3Smrg- nouveau: add support for SVM migrate 30047ec681f3Smrg 30057ec681f3SmrgKarol Herbst (9): 30067ec681f3Smrg 30077ec681f3Smrg- clover: track allocated svm pointers 30087ec681f3Smrg- clover/api: make use of validate_mem_migration_flags in clEnqueueMigrateMemObjects 30097ec681f3Smrg- nouveau: print warning about unhandled cap only once 30107ec681f3Smrg- clover: simplify image arguments 30117ec681f3Smrg- clover: rework quering image max sizes 30127ec681f3Smrg- clover: Fix build with llvm-12. 30137ec681f3Smrg- clover: Add missing include for llvm-12 build fix 30147ec681f3Smrg- tegra/context: fix regression in tegra_draw_vbo 30157ec681f3Smrg- tegra/context: unwrap indirect_draw_count as well 30167ec681f3Smrg 30177ec681f3SmrgKeith Packard (2): 30187ec681f3Smrg 30197ec681f3Smrg- glx: Provide glvnd wrapper for glXSwapIntervalEXT 30207ec681f3Smrg- wsi/x11: Fix type of target_msc argument to x11_present_to_x11_dri3 30217ec681f3Smrg 30227ec681f3SmrgKenneth Graunke (62): 30237ec681f3Smrg 30247ec681f3Smrg- vbo: Don't set node->min_index = max_index = indices_offset when merging 30257ec681f3Smrg- vbo: Only mark merged line strips as lines when actually converting them 30267ec681f3Smrg- tnl: Try not to botch index buffer munging when start > 0. 30277ec681f3Smrg- tnl: Respect \`start` when converting indices to GLuint 30287ec681f3Smrg- tnl: Reset nr_bos to 0 between map/unmap cycles. 30297ec681f3Smrg- Revert "mesa: allow half float textures based on ARB_half_float_pixel" 30307ec681f3Smrg- iris: Consider resolves after changing a resource's aux state 30317ec681f3Smrg- iris: Drop find_existing_assembly optimization from program cache 30327ec681f3Smrg- iris: Drop iris_print_program_cache(). 30337ec681f3Smrg- iris: Refactor iris_debug_recompile interface to take a shader. 30347ec681f3Smrg- intel: Produce a "constrained" output from gen_get_urb_config() 30357ec681f3Smrg- iris: Reconfigure the URB only if it's necessary or possibly useful 30367ec681f3Smrg- iris: Move the URB size checks into iris_update_compiled_xs 30377ec681f3Smrg- iris: Properly handle new unbind_num_trailing_slots parameters 30387ec681f3Smrg- iris: Use shader_info rather than vs_prog_data for draw parameter checks 30397ec681f3Smrg- iris: Minor code restyling in iris_bind_vs_state 30407ec681f3Smrg- iris: Move VS draw parameter dirty flagging to iris_bind_vs_state 30417ec681f3Smrg- iris: Refcount shader variants 30427ec681f3Smrg- iris: Store a list of shader variants in the shader itself 30437ec681f3Smrg- iris: Enable PIPE_CAP_SHAREABLE_SHADERS. 30447ec681f3Smrg- iris: add hooks to call INTEL_MEASURE 30457ec681f3Smrg- iris: Fill out scratch base address dynamically 30467ec681f3Smrg- iris: Remove context from iris_debug_recompile 30477ec681f3Smrg- iris: Remove context from iris_upload_shader() 30487ec681f3Smrg- iris: Remove context from iris_compile_vs and friends 30497ec681f3Smrg- iris: Remove context from iris_create_uncompiled_shader 30507ec681f3Smrg- iris: Remove context from iris_disk_cache_retrieve 30517ec681f3Smrg- iris: Make a pin_scratch_space() helper 30527ec681f3Smrg- iris: Reference the shader variant for last_vue_map as well 30537ec681f3Smrg- iris: Pin surface state buffers after possibly updating the clear color 30547ec681f3Smrg- i965: Rename use_intel_mipree_map_blit to use_blitter_to_map 30557ec681f3Smrg- i965: Rename intel_batchbuffer_* to brw_batch_*. 30567ec681f3Smrg- i965: Rename intel_screen to brw_screen 30577ec681f3Smrg- i965: Rename intel_texture_{object,image} to brw_texture_{object,image} 30587ec681f3Smrg- i965: Rename intel_renderbuffer to brw_renderbuffer 30597ec681f3Smrg- i965: Rename intel_mip* to brw_mip*. 30607ec681f3Smrg- i965: Use __func__ in blorp perf_debug macros 30617ec681f3Smrg- i965: Rename intel_buffer_object to brw_buffer_object 30627ec681f3Smrg- i965: Rename intel_image_format and intel_buffer to brw_* 30637ec681f3Smrg- i965: Rename the rest of intel_* functions to brw_* 30647ec681f3Smrg- i965: Rename intelInit and brwInit camel-case functions to brw_* 30657ec681f3Smrg- i965: Rename some camel-case local variables 30667ec681f3Smrg- i965: Rename more camel-case functions to brw and underscore style 30677ec681f3Smrg- i965: Rename DRI extension structs to be "brw" instead of "intel" 30687ec681f3Smrg- i965: Eliminate all tabs except in brw_defines.h 30697ec681f3Smrg- tnl: Call _mesa_matrix_analyse to make sure the inverse MVP is updated 30707ec681f3Smrg- glsl/float64: Bump #version to 400 30717ec681f3Smrg- iris: Defer uploading of surface states 30727ec681f3Smrg- iris: Defer stream output target space allocation until set time 30737ec681f3Smrg- iris: Rework zeroing of stream output buffer offsets 30747ec681f3Smrg- iris: Support rebinding of stream output targets 30757ec681f3Smrg- iris: Use different shader uploaders for precompile vs. draw time 30767ec681f3Smrg- iris: Make various classes inherit from u_threaded_context base classes 30777ec681f3Smrg- iris: Use thread safe slab allocators in transfer_map handling 30787ec681f3Smrg- iris: Enable u_threaded_context 30797ec681f3Smrg- vbo: Fix vbo_sw_primitive_restart for start > 0 30807ec681f3Smrg- intel/genxml: Add a partial GT_MODE definition for Gen11+. 30817ec681f3Smrg- iris: Delete stale comment in iris_lost_context_state 30827ec681f3Smrg- intel: Fix release build breakage 30837ec681f3Smrg- Half-revert "gallium/dri2: Pass the resource that corresponds to the plane" 30847ec681f3Smrg- intel: Mark an otherwise unused variable in intel_dump_gpu as ASSERTED 30857ec681f3Smrg- ci: Enable iris testing in meson-release 30867ec681f3Smrg 30877ec681f3SmrgKristian Høgsberg (1): 30887ec681f3Smrg 30897ec681f3Smrg- macros: Add thread-safety annotation macros 30907ec681f3Smrg 30917ec681f3SmrgLeo Liu (12): 30927ec681f3Smrg 30937ec681f3Smrg- radeon/vcn: clean the message buffers and their indexes logic 30947ec681f3Smrg- radeon/vcn: add dynamic dpb interface 30957ec681f3Smrg- radeon/vcn: add dynamic dpb buffer Tier1 support 30967ec681f3Smrg- radeon/vcn: enable dynamic dpb Tier1 support 30977ec681f3Smrg- radeon/vcn: add dynamic dpb Tier2 message buffer interface 30987ec681f3Smrg- radeon/vcn: implement dynamic dpb Tier2 support 30997ec681f3Smrg- radeon/vcn: enable dynamic dpb Tier2 support 31007ec681f3Smrg- meson: bump drm amdgpu version to 2.4.105 31017ec681f3Smrg- ci: Fix meson-i386 build failed after libdrm bump version 31027ec681f3Smrg- include/drm-uapi: bump AMDGPU headers 31037ec681f3Smrg- ac: add function for querying video capabilities 31047ec681f3Smrg- radeonsi: replace the hard coded video decode and encode caps 31057ec681f3Smrg 31067ec681f3SmrgLepton Wu (3): 31077ec681f3Smrg 31087ec681f3Smrg- virgl: Don't destroy resource while it's in use. 31097ec681f3Smrg- virgl: Use atomic operation directly. 31107ec681f3Smrg- virgl: move new added field to the end. 31117ec681f3Smrg 31127ec681f3SmrgLionel Landwerlin (96): 31137ec681f3Smrg 31147ec681f3Smrg- anv: add transfer usage for color/depth/stencil attachments 31157ec681f3Smrg- anv: don't disable KHR_performance_query in debug mode 31167ec681f3Smrg- intel/mi_builder: optimize 64bit immediate register loads & memory stores 31177ec681f3Smrg- intel/mi_builder: fix self modifying batches 31187ec681f3Smrg- intel/perf: restructure i915 perf version checks 31197ec681f3Smrg- intel/perf: add definition for generic perf counters 31207ec681f3Smrg- intel/perf: link queries back to the gen_perf_config object 31217ec681f3Smrg- intel/perf: move gt_frequency to results 31227ec681f3Smrg- anv: Fix stencil layout in render passes 31237ec681f3Smrg- intel: silence unused var warnings in release builds 31247ec681f3Smrg- anv: fix invalid programming of BLEND_STATE 31257ec681f3Smrg- intel/common: store sample position in plain arrays 31267ec681f3Smrg- anv: pass context to reset stats helper 31277ec681f3Smrg- anv: store queue creation flags on anv_queue 31287ec681f3Smrg- genxml: PERFCNT registers are available since HSW 31297ec681f3Smrg- intel/perf: prep work to enable new perf counters 31307ec681f3Smrg- intel/perf: query register descriptions 31317ec681f3Smrg- intel/perf: add performance query layout using MI_SRM 31327ec681f3Smrg- intel/perf: switch query code to use query layout 31337ec681f3Smrg- anv: fix layout comment 31347ec681f3Smrg- anv: remove unused query pool field 31357ec681f3Smrg- intel/perf: rename lkf into ehl 31367ec681f3Smrg- intel/perf: add reorder script 31377ec681f3Smrg- intel/perf: reorder xml files 31387ec681f3Smrg- intel/perf: remove reordering script 31397ec681f3Smrg- intel/perf: update files from IGT 31407ec681f3Smrg- intel/perf: small ICL equation refactor 31417ec681f3Smrg- intel/perf: add async compute metrics 31427ec681f3Smrg- intel/dev: identify tigerlake 31437ec681f3Smrg- intel/perf: break TGL perf configs in GT1/2 31447ec681f3Smrg- intel/dev: identify rocketlake 31457ec681f3Smrg- intel/perf: add RKL support 31467ec681f3Smrg- intel/perf: add DG1 support 31477ec681f3Smrg- intel/perf: drop the special READ_REG operator 31487ec681f3Smrg- anv: compute commands required to implement perf queries 31497ec681f3Smrg- anv: switch khr perf query code to use query layout 31507ec681f3Smrg- anv: switch intel perf queries to query layout 31517ec681f3Smrg- anv: add a comment describing has_relocs field 31527ec681f3Smrg- anv: break up internal queueing function 31537ec681f3Smrg- anv: only signal wsi fence BO on last command buffer 31547ec681f3Smrg- drm-shim: report support for timeline semaphores 31557ec681f3Smrg- intel/stub: plug some gaps in our ioctl faking 31567ec681f3Smrg- anv: print out perf permission warning only once 31577ec681f3Smrg- anv: discard all timeline wait/signal value=0 31587ec681f3Smrg- vulkan: document flags choice for vkGetDeviceQueue 31597ec681f3Smrg- genxml: add MI_SET_APPID on Gen12+ 31607ec681f3Smrg- genxml: Add PIPE_CONTROL protected memory bits 31617ec681f3Smrg- isl: add external parameter to isl_mocs() 31627ec681f3Smrg- anv: track command buffer pool flags 31637ec681f3Smrg- anv: track buffer creation flags 31647ec681f3Smrg- intel/dev: identify alderlake 31657ec681f3Smrg- intel/perf: Add Alderlake metrics 31667ec681f3Smrg- intel/perf: fix roll over PERF_CNT counter accumulation 31677ec681f3Smrg- anv: reset binary syncobj to be signaled before submission 31687ec681f3Smrg- anv: don't wait for completion of work on vkQueuePresent() 31697ec681f3Smrg- anv: Fix wait_count missing increment 31707ec681f3Smrg- anv: make use of new helper function directly in anv_QueueSubmit() 31717ec681f3Smrg- anv: track the end of the command buffers 31727ec681f3Smrg- anv: end command buffer with a potential jump 31737ec681f3Smrg- anv: allow multiple command buffers in anv_queue_submit 31747ec681f3Smrg- anv: group as many command buffers into a single execbuf 31757ec681f3Smrg- anv: fix missing general state pool in validation list 31767ec681f3Smrg- anv: implement INTEL_DEBUG=submit 31777ec681f3Smrg- anv: fix MI_PREDICATE_RESULT write 31787ec681f3Smrg- intel/tools: fix meson warning 31797ec681f3Smrg- intel/dev: add helpers to compute subslice/eu total 31807ec681f3Smrg- intel/dev: add warning on missing kernel uAPI for Gen8+ 31817ec681f3Smrg- iris: use gen_device_info helper to get subslice total 31827ec681f3Smrg- i965: stop using get_param for things queried by gen_device_info 31837ec681f3Smrg- anv: stop using get_param for things queried by gen_device_info 31847ec681f3Smrg- intel/dev: switch over to mesa log infrastructure 31857ec681f3Smrg- anv: move L3 config emission to genX_state.c 31867ec681f3Smrg- anv: move L3 initialization to device init on Gen11+ 31877ec681f3Smrg- intel: install intel_device_info 31887ec681f3Smrg- intel/fs/vec4: add missing dependency in write-on-write fixed GRFs 31897ec681f3Smrg- intel/dev: store size of CS prefetch 31907ec681f3Smrg- intel/mi_builder: use device info to use the right CS prefetch size 31917ec681f3Smrg- anv: use the device size of CS prefetch to pad secondary buffer calls 31927ec681f3Smrg- meson: switch vulkan layer to list of choices 31937ec681f3Smrg- intel: Add null hw layer 31947ec681f3Smrg- gitlab-ci: fix vulkan build layer enabling 31957ec681f3Smrg- intel/nullhw: fix build 31967ec681f3Smrg- etnaviv/drm: only print out fence error on non timeout 31977ec681f3Smrg- intel/fs/copy_prop: check stride constraints with actual final type 31987ec681f3Smrg- intel/fs: implement another copy propagation restriction 31997ec681f3Smrg- intel/compiler: lower bit sizes in NIR postprocessing 32007ec681f3Smrg- anv: put correct number of BT prefetch for compute on XeHP+ 32017ec681f3Smrg- intel/fs: limit OW reads to 8 owords on XeHP+ 32027ec681f3Smrg- microsoft: fixup clc_log() define 32037ec681f3Smrg- anv: bump internal descriptor index fields to 32bits 32047ec681f3Smrg- anv: fix 3DSTATE_MULTISAMPLE emission on gen8+ 32057ec681f3Smrg- anv: disable baked in pipeline bits from dynamic emission path 32067ec681f3Smrg- spirv: fix uToAccelerationStructure handling 32077ec681f3Smrg- spirv: fixup pointer_to/from_ssa with acceleration structures 32087ec681f3Smrg- vulkan/wsi/display: don't report support if there is no drm fd 32097ec681f3Smrg- i965/bufmgr: fix invalid assertion 32107ec681f3Smrg 32117ec681f3SmrgLucas Stach (5): 32127ec681f3Smrg 32137ec681f3Smrg- renderonly: remove layering violations 32147ec681f3Smrg- renderonly: close the gpu fd when destroying renderonly 32157ec681f3Smrg- etnaviv: don't try to copy PIPE_BUFFER with the 3D engine 32167ec681f3Smrg- etnaviv: remove stale comment in etna_resource_copy_region 32177ec681f3Smrg- Revert remaining half of "gallium/dri2: Pass the resource that corresponds to the plane" 32187ec681f3Smrg 32197ec681f3SmrgLukas Feller (2): 32207ec681f3Smrg 32217ec681f3Smrg- v3dv: fix assertion in job_compute_frame_tiling 32227ec681f3Smrg- v3dv: fix stride in buffer copy 32237ec681f3Smrg 32247ec681f3SmrgMarcin Ślusarz (20): 32257ec681f3Smrg 32267ec681f3Smrg- intel/perf: export information about units of performance counters 32277ec681f3Smrg- intel/compiler: cache computed register pressure benefit 32287ec681f3Smrg- intel/tools/aub: print better error message when mmap fails 32297ec681f3Smrg- intel/tools/aub: handle truncated input file 32307ec681f3Smrg- intel/tools/aub: remove superfluous new line from error messages 32317ec681f3Smrg- intel/dump_gpu: mark bo as unmapped if its address changes 32327ec681f3Smrg- anv: fix memory allocation error handling 32337ec681f3Smrg- iris: fix decode_get_bo 32347ec681f3Smrg- i965: fix decode_get_bo 32357ec681f3Smrg- intel/batch_decoder: catch invalid sampler state pointer 32367ec681f3Smrg- intel/batch_decoder: drop bogus check 32377ec681f3Smrg- intel/batch_decoder: fix decoding of sampler states 32387ec681f3Smrg- intel/batch_decoder: assert on invalid sampler pointer 32397ec681f3Smrg- intel/aub_viewer: catch invalid sampler state pointer 32407ec681f3Smrg- intel/aub_viewer: drop bogus check 32417ec681f3Smrg- intel/aub_viewer: fix decoding of sampler states 32427ec681f3Smrg- gallium: add PIPE_CAP_ALLOW_DYNAMIC_VAO_FASTPATH 32437ec681f3Smrg- iris: disable dynamic VAO fastpath on GFX version 9 32447ec681f3Smrg- gallium/u_threaded: implement INTEL_performance_query hooks 32457ec681f3Smrg- gallium/u_threaded: offload begin/end_intel_perf_query 32467ec681f3Smrg 32477ec681f3SmrgMarek Olšák (406): 32487ec681f3Smrg 32497ec681f3Smrg- mesa: always set valid index bounds for non-indexed draws for classic drivers 32507ec681f3Smrg- st/nine: stop using cso_set_sampler_views 32517ec681f3Smrg- st/xa: stop using cso_set_sampler_views 32527ec681f3Smrg- gallium/tests: stop using cso_set_sampler_views 32537ec681f3Smrg- gallium/api: add state invalidate interface as alternative to cso_save/restore 32547ec681f3Smrg- gallium/hud: don't use cso_context to restore VBs, constbuf 0 and sampler views 32557ec681f3Smrg- gallium/pp: don't use cso_context to restore VBs, constbuf 0 and sampler views 32567ec681f3Smrg- st/mesa: don't use cso_context to restore VBs, sampler views for glBitmap 32577ec681f3Smrg- st/mesa: don't use cso_context to restore VBs for glClear 32587ec681f3Smrg- st/mesa: don't use cso_context to restore VBs, sampler views for glDrawPixels 32597ec681f3Smrg- st/mesa: don't use cso_context to restore VBs, sampler views for glDrawTex*OES 32607ec681f3Smrg- st/mesa: don't use cso_context to restore VBs, etc. for PBO glReadPixels 32617ec681f3Smrg- st/mesa: don't use cso_context to restore VBs, etc. for PBO glTexSubImage 32627ec681f3Smrg- st/mesa: don't use cso_context to set const bufs, sampler views and images 32637ec681f3Smrg- st/mesa: replace st->pipe with pipe in a few places 32647ec681f3Smrg- cso_context: remove ability to restore VBs, const bufs, sampler views, images 32657ec681f3Smrg- st/mesa: unbind sampler views, images, and vertex buffers after meta ops 32667ec681f3Smrg- st/mesa: optimize binding and unbinding shader images 32677ec681f3Smrg- radeonsi: constant buffer cleanups 32687ec681f3Smrg- radeonsi: don't clear unaligned bits when unbinding vertex buffers 32697ec681f3Smrg- radeonsi: move emit_cache_flush functions into si_gfx_cs.c 32707ec681f3Smrg- radeonsi: don't pass pipe_draw_info into si_emit_vs_state 32717ec681f3Smrg- radeonsi: don't pass pipe_draw_info into si_emit_ia_multi_vgt_param 32727ec681f3Smrg- radeonsi: translate pipe_prim_type only when it changes 32737ec681f3Smrg- radeonsi: don't pass pipe_draw_info into si_emit_derived_tess_state 32747ec681f3Smrg- radeonsi: don't compute average vertex count in si_draw_vbo 32757ec681f3Smrg- radeonsi: fix si_num_prims_for_vertices for PIPE_PRIM_POLYGON 32767ec681f3Smrg- radeonsi: make cik_emit_prefetch_L2 templated and move it to si_state_draw.cpp 32777ec681f3Smrg- radeonsi: add a specialized function for CP DMA L2 prefetch 32787ec681f3Smrg- radeonsi: make sctx->vertex_elements always non-NULL 32797ec681f3Smrg- radeonsi: remove MRT-draw-calls, spill-draw-calls, spill-compute-calls 32807ec681f3Smrg- radeonsi: get out of si_emit_vs_state early for blit vertex shaders 32817ec681f3Smrg- radeonsi: rearrange condition for streamout workaround on gfx7 and gfx8 32827ec681f3Smrg- radeonsi: don't use si_get_vs_state in most places 32837ec681f3Smrg- radeonsi: trim the size of si_vgt_param_key and si_vgt_stages_key 32847ec681f3Smrg- mesa: fix alpha channel of ETC2_SRGB8 decompression for !bgra 32857ec681f3Smrg- radeonsi: unify uploaders on APUs too 32867ec681f3Smrg- radeonsi: don't pass pipe_draw_info into si_emit_draw_registers 32877ec681f3Smrg- radeonsi: don't set context_roll for non-gfx9 in templated functions 32887ec681f3Smrg- radeonsi: add si_get_user_data_base selecting user data registers 32897ec681f3Smrg- radeonsi: evaluate sh_base in si_emit_vs_state at compile time 32907ec681f3Smrg- radeonsi: inline the last use of si_get_vs_state 32917ec681f3Smrg- radeonsi: evaluate si_get_vs in si_draw_vbo at compile time 32927ec681f3Smrg- radeonsi: enable the GS tri strip adj workaround with primitive_restart 32937ec681f3Smrg- radeonsi: clear dirty_atoms and dirty_states only if we entered the emit loop 32947ec681f3Smrg- radeonsi: move variables closer to their use in most draw state functions 32957ec681f3Smrg- radeonsi: don't validate inlinable uniforms at draw time 32967ec681f3Smrg- radeonsi: allow instance_count == 0 on chips that handle it correctly 32977ec681f3Smrg- glthread: remove marshal="draw" because it doesn't do much 32987ec681f3Smrg- glthread: don't sync with NV_half_float vertex attrib functions 32997ec681f3Smrg- glthread: add specialized versions of unmarshal_Draw funcs without user buffers 33007ec681f3Smrg- glthread: track all matrix stack depths 33017ec681f3Smrg- glthread: implement glGetIntegerv for states that glthread tracks 33027ec681f3Smrg- glthread: rename inside_dlist to ListMode for future use 33037ec681f3Smrg- glthread: remove if (COMPAT) conditions from functions that are GL-compat-only 33047ec681f3Smrg- mesa: add _mesa_get_list helper 33057ec681f3Smrg- glthread: add display list support to fix state tracking with display lists 33067ec681f3Smrg- mesa: remove _mesa_initialize_exec_dispatch from draw.c by autogenerating it 33077ec681f3Smrg- mesa: remove redundant glRect functions for display lists 33087ec681f3Smrg- mesa: optimize glCallLists by using loops inside a switch 33097ec681f3Smrg- mesa: simplify handling OPCODE_CONTINUE for display lists 33107ec681f3Smrg- mesa: simplify terminating display list loops 33117ec681f3Smrg- mesa: remove STATE_INTERNAL 33127ec681f3Smrg- mesa: combine STATE_ENV, STATE_LOCAL enums with STATE_xxx_PROGRAM 33137ec681f3Smrg- mesa: flatten STATE_MATERIAL and STATE_LIGHTPROD tokens 33147ec681f3Smrg- mesa: eliminate the switch statement for STATE_TEXGEN 33157ec681f3Smrg- glsl: remove unused internal builtin gl_CurrentAttribVertMESA 33167ec681f3Smrg- glsl: split gl_CurrentAttribFragMESA into elements 33177ec681f3Smrg- mesa: skip memmove in optimize_state_parameters if it's no-op 33187ec681f3Smrg- mesa: rename STATE_LIGHT_ATTRIBS -> STATE_LIGHT_ARRAY for consistency 33197ec681f3Smrg- mesa: optimize get_local_param_pointer and program_local_parameters4fv 33207ec681f3Smrg- mesa: don't allocate local parameters in fetch_state 33217ec681f3Smrg- mesa: merge local and env program parameters for faster uploads 33227ec681f3Smrg- mesa: sort state vars with constant indexing for ARB programs 33237ec681f3Smrg- mesa: add upper bound to limit program state var iterations 33247ec681f3Smrg- mesa: compute gl_program_parameter_list::UniformBytes accurately 33257ec681f3Smrg- mesa: don't handle STATE_* enums in fetch_state that don't do anything 33267ec681f3Smrg- mesa: sort and tightly pack STATE_* enums to generate better switch code 33277ec681f3Smrg- mesa: merge equivalent switch cases in prog_statevars.c 33287ec681f3Smrg- st/mesa: enable state var merging to improve fetch_state performance 33297ec681f3Smrg- radeonsi: add new possibly faster command submission helpers 33307ec681f3Smrg- radeonsi: clear dirty_states if si_pm4_bind_state is unbinding or no-op 33317ec681f3Smrg- radeonsi: don't mark NULL states as dirty in si_pm4_reset_emitted 33327ec681f3Smrg- radeonsi: optimize translating index_size to index_type 33337ec681f3Smrg- radeonsi: don't use rasterizer_discard to validate draws, only check ps_shader 33347ec681f3Smrg- radeonsi: add internal blitter_running flag 33357ec681f3Smrg- radeonsi: simplify determining whether render condition is enabled at draw time 33367ec681f3Smrg- radeonsi: inline si_blend_color and si_clip_state structures 33377ec681f3Smrg- radeonsi: move y_inverted out of si_viewports 33387ec681f3Smrg- radeonsi: don't set vertex buffer dirty flags when they don't do anything 33397ec681f3Smrg- radeonsi: move if (sctx->vertex_buffers_dirty) into the upload function 33407ec681f3Smrg- radeonsi: rename SI_SGPR_RW_BUFFERS to SI_SGPR_INTERNAL_BINDINGS 33417ec681f3Smrg- radeonsi: skip some code for ALLOW_PRIM_DISCARD_CS if tess or GS is enabled 33427ec681f3Smrg- radeonsi: enable accidentally disabled fast launch with non-indexed tri strips 33437ec681f3Smrg- radeonsi: iterate from draw 1 for total/min_direct_count computation 33447ec681f3Smrg- st/mesa: don't enable smoothing if multisampling is enabled 33457ec681f3Smrg- Revert "gallium/u_upload_mgr: allow use of FLUSH_EXPLICIT with persistent mappings" 33467ec681f3Smrg- gallium: add take_ownership param into set_constant_buffer to eliminate atomics 33477ec681f3Smrg- gallium: add unbind_num_trailing_slots to set_vertex_buffers 33487ec681f3Smrg- gallium: add unbind_num_trailing_slots to set_shader_images 33497ec681f3Smrg- gallium: add unbind_num_trailing_slots to set_sampler_views 33507ec681f3Smrg- gallium: add take_ownership param into set_vertex_buffers to eliminate atomics 33517ec681f3Smrg- cso_context,u_vbuf: add take_ownership param into set_vertex_buffers 33527ec681f3Smrg- st/mesa: eliminate all atomic ops when setting vertex buffers 33537ec681f3Smrg- st/mesa: skip atomics when binding UBOs 33547ec681f3Smrg- gallium/u_upload_mgr: eliminate all atomics for the upload buffer 33557ec681f3Smrg- gallium/u_threaded: add a null constant buffer codepath 33567ec681f3Smrg- gallium/u_threaded: unify user and non-user codepaths in set_constant_buffer 33577ec681f3Smrg- gallium/util: optimize pipe_vertex_buffer_reference binding the same buffer 33587ec681f3Smrg- gallium,u_threaded: add pipe_draw_info::take_index_buffer_ownership 33597ec681f3Smrg- st/mesa: set take_index_buffer_ownership to skip an atomic in u_threaded 33607ec681f3Smrg- gallium/u_vbuf: skip draws with 0 vertices 33617ec681f3Smrg- radeonsi: fix centroid with VRS coarse shading 33627ec681f3Smrg- glthread: fix interpreting vertex size == GL_BGRA for vertex attribs 33637ec681f3Smrg- glthread: fix glVertexAttribDivisor calls not being tracked by non-VBO uploads 33647ec681f3Smrg- glapi: guard against invalid XML definitions for glthread 33657ec681f3Smrg- ac,radeonsi: track memory usage in KB to reduce types from uint64 to uint32 33667ec681f3Smrg- radeonsi: optimize no-op cases in si_upload_shader_descriptors 33677ec681f3Smrg- radeonsi: mark shader_pointers dirty once outside the upload descriptors loop 33687ec681f3Smrg- radeonsi: move si_pm4_delete_state logic into si_pm4_free_state 33697ec681f3Smrg- radeonsi: delete si_pm4_delete_state 33707ec681f3Smrg- radeonsi: don't check for redundant and NULL states in si_emit_all_states 33717ec681f3Smrg- radeonsi: optimize si_emit_prefetch_L2 when it's split 33727ec681f3Smrg- radeonsi: reorganize si_draw_vbo for lower register pressure (part 1) 33737ec681f3Smrg- radeonsi: reorganize si_draw_vbo for lower register pressure (part 2) 33747ec681f3Smrg- radeonsi: set VB user SGPRs in si_upload_vertex_buffer_descriptors 33757ec681f3Smrg- radeonsi: prefetch VB descriptors right after uploading 33767ec681f3Smrg- radeonsi: precompute NGG cull flags in si_create_rs_state 33777ec681f3Smrg- mesa: remove/replace FLUSH_VERTICES when it doesn't do anything 33787ec681f3Smrg- mesa: optimize most _mesa_ActiveTexture calls in glPopAttrib 33797ec681f3Smrg- mesa: optimize glPopAttrib(GL_VIEWPORT_BIT) 33807ec681f3Smrg- mesa: don't push/pop gl_texture_object::Target 33817ec681f3Smrg- mesa: fix glPopAttrib for many texture fields 33827ec681f3Smrg- mesa: flush glBegin/End before changing GL_DEPTH_STENCIL_TEXTURE_MODE 33837ec681f3Smrg- mesa: for every state change, remember states we changed for glPopAttrib 33847ec681f3Smrg- mesa: fix trivial bugs in glPopAttrib 33857ec681f3Smrg- mesa: optimize out _NEW_ALL in glPopAttrib(GL_ENABLE_BIT) 33867ec681f3Smrg- mesa: only pop states in glPopAttrib that have been changed since glPushAttrib 33877ec681f3Smrg- mesa: partially skip glPush/PopAttrib for MSAA textures and texture buffers 33887ec681f3Smrg- mesa: don't save gl_shared_state in glPushAttrib 33897ec681f3Smrg- mesa: don't push/pop default texture attributes redundantly 33907ec681f3Smrg- mesa: pop all textures up to NumCurrentTexUsed, not just MaxTextureUnits 33917ec681f3Smrg- mesa: don't count buffer references for the context that created them 33927ec681f3Smrg- radeonsi: set current_rast_prim at bind time for tess and GS 33937ec681f3Smrg- radeonsi: simplify the NGG culling condition in si_draw_vbo 33947ec681f3Smrg- radeonsi: tune NGG shader culling vertex threshold for each chip 33957ec681f3Smrg- radeon: decrease the size of radeon_cmdbuf by switching prev fields to uint16 33967ec681f3Smrg- Revert "gallium/u_vbuf: skip draws with 0 vertices" 33977ec681f3Smrg- gallium/u_vbuf: skip non-indirect draws with 0 vertices 33987ec681f3Smrg- winsys/amdgpu,radeonsi: add HUD counters for how much memory is wasted by slabs 33997ec681f3Smrg- winsys/amdgpu: clean up slab alignment code, handle small buffers better 34007ec681f3Smrg- winsys/amdgpu,pb_slab: add slabs with 3/4 of power of two sizes to save memory 34017ec681f3Smrg- winsys/amdgpu: expand the slab allocation range to [256 B, 1 MB] 34027ec681f3Smrg- winsys/amdgpu: optimize out conditionals in amdgpu_lookup_buffer 34037ec681f3Smrg- winsys/amdgpu: remove amdgpu_winsys_bo::num_cs_references to remove atomics 34047ec681f3Smrg- winsys/amdgpu: pack amdgpu_winsys_bo::is_shared and protect it by a mutex 34057ec681f3Smrg- winsys/amdgpu: move amdgpu_winsys_bo::cpu_ptr into the u.real union 34067ec681f3Smrg- winsys/amdgpu: move amdgpu_winsys_bo::is_shared to the u.real union 34077ec681f3Smrg- winsys/amdgpu: move amdgpu_winsys_bo::is_user_ptr to the u.real union 34087ec681f3Smrg- winsys/amdgpu: move amdgpu_winsys_bo::use_reusable_pool to the u.real union 34097ec681f3Smrg- winsys/amdgpu: don't inc/dec num_active_ioctls for backing BOs of sparse BOs 34107ec681f3Smrg- winsys/amdgpu: don't set unused usage for backing BOs of sparse BOs 34117ec681f3Smrg- winsys/amdgpu: try not to skip any code with RADEON_NOOP=1 to test CPU perf 34127ec681f3Smrg- tgsi_to_nir: translate SAMPLEID 34137ec681f3Smrg- tgsi_to_nir: translate FBFETCH 34147ec681f3Smrg- gallium/u_tests: test no-op fragment shader instead of NULL fragment shader 34157ec681f3Smrg- winsys/amdgpu: disallow pb_cache for backing buffers of sparse buffers 34167ec681f3Smrg- ac/gpu_info: print use_late_alloc 34177ec681f3Smrg- ac/gpu_info: rename num_tcc_blocks -> max_tcc_blocks 34187ec681f3Smrg- ac/gpu_info: add radeon_info::num_tcc_blocks 34197ec681f3Smrg- ac/gpu_info: remove redundant radeon_info::num_sdp_interfaces 34207ec681f3Smrg- ac/gpu_info: inline get_l2_cache_size and set cache sizes farther down 34217ec681f3Smrg- ac/gpu_info: conceal L2 cache sizes 34227ec681f3Smrg- amd: sort chip enums based on hw revision 34237ec681f3Smrg- radeonsi: skip s_sendmsg(gs_alloc_req) for NGG passthrough on new chips 34247ec681f3Smrg- radeonsi: add debug options nodisplaytiling and nodisplaydcc 34257ec681f3Smrg- amd: update addrlib 34267ec681f3Smrg- mesa: optimize draw index type checking 34277ec681f3Smrg- mesa: precompute all valid primitive types at context creation 34287ec681f3Smrg- mesa: precompute draw time prim validation during state changes 34297ec681f3Smrg- mesa: move check_valid_to_render call into _mesa_valid_prim_mode 34307ec681f3Smrg- mesa: fold most of check_valid_to_render into _mesa_update_valid_to_render_state 34317ec681f3Smrg- mesa: inline check_valid_to_render 34327ec681f3Smrg- mesa: add skeleton code for DrawPixels/CopyPixels/Bitmap precomputed validation 34337ec681f3Smrg- mesa: don't report 1 for GL_VALIDATE_STATUS if user didn't validate pipeline 34347ec681f3Smrg- mesa: move shader pipeline validation from draws to state changes 34357ec681f3Smrg- mesa: move sampler uniform validation from draws to state changes 34367ec681f3Smrg- mesa: move some uniform debug code from draws to state changes 34377ec681f3Smrg- mesa: move FBO completeness checking from draws to state changes 34387ec681f3Smrg- mesa: move ARB program and integer FBO validation from draws to state changes 34397ec681f3Smrg- mesa: move GL_FILL_RECTANGLE validation from draws to state changes 34407ec681f3Smrg- mesa: move blending validation from draws to state changes 34417ec681f3Smrg- mesa: inline _mesa_valid_to_render now that it doesn't do validation 34427ec681f3Smrg- mesa: optimize the dual source blend error checking using a bitmask 34437ec681f3Smrg- mesa: remove VERBOSE_DRAW 34447ec681f3Smrg- mesa: remove optional draw validation code to increase performance 34457ec681f3Smrg- mesa: call _mesa_update_state() before validation 34467ec681f3Smrg- mesa: remove an optional GL error about mapped buffers during execution 34477ec681f3Smrg- mesa: skip MultiDrawArrays with primcount == 0 34487ec681f3Smrg- mesa: don't skip draws with count == 0 or numInstances == 0 34497ec681f3Smrg- mesa: add a separate valid primitive mask just for glDrawElements 34507ec681f3Smrg- mesa: move disallowed TFB in DrawElements on GLES from draws to state changes 34517ec681f3Smrg- mesa: validate numInstances in common functions to unify code 34527ec681f3Smrg- mesa: optimize set_varying_vp_inputs by precomputing the conditions 34537ec681f3Smrg- mesa: move gl_context::varying_vp_inputs into ctx->VertexProgram._VaryingInputs 34547ec681f3Smrg- mesa: set _DrawVAOEnabledAttribs only when it changes 34557ec681f3Smrg- mesa: precompute _mesa_get_vao_vp_inputs 34567ec681f3Smrg- mesa: precompute draw time determination of enabled vertex arrays 34577ec681f3Smrg- mesa: gather errors and call _mesa_error only once in validate_Draw 34587ec681f3Smrg- mesa: inline _mesa_set_draw_vao and set_varying_vp_inputs for draw calls 34597ec681f3Smrg- mesa: inline draw validate functions 34607ec681f3Smrg- mesa: add debug code to catch missing _mesa_update_valid_to_render_state calls 34617ec681f3Smrg- ac/surface: use family_id so as not to crash with SI_FORCE_FAMILY in addrlib 34627ec681f3Smrg- radeonsi: for tess, determine the minimum num_patches before optimizing tg size 34637ec681f3Smrg- radeonsi: improve comments in si_emit_derived_tess_state 34647ec681f3Smrg- radeonsi: allocate filled_size for streamout targets in set_streamout_buffers 34657ec681f3Smrg- radeonsi: do late NIR optimizations after uniform inlining 34667ec681f3Smrg- radeonsi: fix the value of uses_bindless_samplers 34677ec681f3Smrg- radeonsi: gather info about bindless images and memory stores with strstr(intr) 34687ec681f3Smrg- radeonsi: gather shader info about indirect UBO/SSBO/samplers/images 34697ec681f3Smrg- radeonsi: gather shader info about VMEM usage for MEM_ORDERED 34707ec681f3Smrg- radeonsi: set MEM_ORDERED optimally 34717ec681f3Smrg- glthread: assume all parameters are fixed if marshal_sync is present 34727ec681f3Smrg- glthread: don't declare pointers with const in unmarshal functions 34737ec681f3Smrg- glthread: don't sync when using pixel buffer objects 34747ec681f3Smrg- glthread: ignore the return value of glUnmapBuffer, don't sync, and return true 34757ec681f3Smrg- i915: use align_calloc for the context to fix m32 crashes 34767ec681f3Smrg- radeon,r200: use align_calloc for the context to fix m32 crashes 34777ec681f3Smrg- nouveau_vieux: use align_calloc for the context to fix m32 crashes 34787ec681f3Smrg- mesa: remove unnecessary NewState flagging for glPopAttrib(GL_ENABLE_BIT) 34797ec681f3Smrg- mesa: move fixed-func-related _mesa_update_state code closer together 34807ec681f3Smrg- mesa: split _NEW_LIGHT into 3 flags: _NEW_LIGHT_(FF_PROGRAM|CONSTANTS|STATE) 34817ec681f3Smrg- mesa: rework _MESA_NEW_NEED_EYE_COORDS to reduce fixed-func program updates 34827ec681f3Smrg- mesa: don't compute the inverted projection matrix if not used 34837ec681f3Smrg- mesa: don't compute the ModelView * Projection matrix if not used 34847ec681f3Smrg- mesa: add _NEW_MATERIAL to reduce the weight of _NEW_LIGHT_CONSTANTS 34857ec681f3Smrg- mesa: don't update derived material values in _mesa_update_state and elsewhere 34867ec681f3Smrg- mesa: remove _NEW_VARYING_VP_INPUTS in favor of _NEW_FF_(VERT|FRAG)_PROGRAM 34877ec681f3Smrg- mesa: remove _NEW_LIGHT_FF_PROGRAM in favor of _NEW_FF_(VERT|FRAG)_PROGRAM 34887ec681f3Smrg- mesa: don't push/pop ctx->Texture._* derived states 34897ec681f3Smrg- mesa: remove the fixed-func vert prog dependency on all texture states 34907ec681f3Smrg- mesa: sort state parameters for ffvp to enable better parameter merging 34917ec681f3Smrg- mesa: merge STATE_LIGHTPROD parameters 34927ec681f3Smrg- mesa: merge STATE_LIGHT_ATTENUATION and STATE_LIGHT_POSITION_* parameters 34937ec681f3Smrg- vbo: optimize copy_to_current functions 34947ec681f3Smrg- vbo: don't call update_color_material in copy_to_current if it's a no-op 34957ec681f3Smrg- mesa: be precise about when to set _NEW_CURRENT_ATTRIB and _NEW_MATERIAL 34967ec681f3Smrg- mesa: move _mesa_update_pixel out of _mesa_update_state 34977ec681f3Smrg- mesa: only update fixed-func programs on texture matrix enablement changes 34987ec681f3Smrg- mesa: don't update fixed-func vert prog on irrelevant _NEW_TRANSFORM changes 34997ec681f3Smrg- mesa: don't update fixed-func programs on irrelevant _NEW_POINT changes 35007ec681f3Smrg- mesa: don't update fixed-func programs on irrelevant _NEW_FOG changes 35017ec681f3Smrg- mesa: don't update fixed-func programs on irrelevant _NEW_RENDER_MODE changes 35027ec681f3Smrg- mesa: don't update the fixed-func frag prog on irrelevant _NEW_COLOR changes 35037ec681f3Smrg- mesa: don't update tnl spaces on irrelevant _NEW_POINT/TEXTURE_STATE changes 35047ec681f3Smrg- mesa: skip a subset of _mesa_update_state if no relevant flags are set 35057ec681f3Smrg- radeonsi: don't index si_context::shaders with enum gl_shader_stage 35067ec681f3Smrg- ac/llvm: fix ac_build_atomic_rmw with LLVM 13 35077ec681f3Smrg- radeonsi: don't crash on NULL images in si_check_needs_implicit_sync 35087ec681f3Smrg- ac/llvm: add support for 16-bit source operands for samplers 35097ec681f3Smrg- ac/llvm: implement 16-bit and 64-bit fpow correctly 35107ec681f3Smrg- ac/llvm: fix visit_load_ubo_buffer to use SMEM for 16 bits instead of VMEM 35117ec681f3Smrg- ac/llvm: add type parameter into ac_build_buffer_load to fix 16-bit TES inputs 35127ec681f3Smrg- ac/llvm: open code fpow on LLVM 12 using fmul.legacy 35137ec681f3Smrg- driconf: add performance tweaks for viewperf 35147ec681f3Smrg- ac/surface: select best swizzle mode for 3D sampler performance 35157ec681f3Smrg- ac,radeonsi: add sampler changes for Aldebaran 35167ec681f3Smrg- ac: set the TCC line size for Aldebaran 35177ec681f3Smrg- ac/llvm: unpack thread IDs on Aldebaran 35187ec681f3Smrg- ac: handle bigger instruction prefetch for Aldebaran 35197ec681f3Smrg- ac,radeonsi: use correct VGPR granularity on Aldebaran 35207ec681f3Smrg- ac: remove switch cases for pc_lines for compute-only chips 35217ec681f3Smrg- radeonsi: enable RGP on gfx10.3 35227ec681f3Smrg- gallium/u_threaded: don't sync in create_stream_output_target 35237ec681f3Smrg- gallium: add pipe_screen::num_contexts for skipping mutex locking in util_range 35247ec681f3Smrg- radeonsi: update pipe_screen::num_contexts 35257ec681f3Smrg- ac/llvm: handle demote in LLVM 13 that just added support for it 35267ec681f3Smrg- ac/gpu_info: fix more non-coherent RB and GL2 combinations 35277ec681f3Smrg- radeonsi: use pipe_sampler_state::border_color_is_integer to simplify stuff 35287ec681f3Smrg- mesa: fix Blender crash due to optimizations in buffer reference counting 35297ec681f3Smrg- mesa: add assertions for buffer reference counts 35307ec681f3Smrg- mesa: fix a oldNum typo in reallocation in _mesa_reserve_parameter_storage 35317ec681f3Smrg- mesa: don't overallocate ParameterValues 4 times (v2) 35327ec681f3Smrg- mesa: clear reserved parameter storage because it's stored in the shader cache 35337ec681f3Smrg- mesa: fix parameter reservation size 35347ec681f3Smrg- st/mesa: add a driconf option to transcode ETC2 to DXTC 35357ec681f3Smrg- util: add most missing formats with reversed RGB channel order 35367ec681f3Smrg- util: fail assertion in util_format_linear if it can't translate SRGB format 35377ec681f3Smrg- util: add new helper util_format_rgb_to_bgr 35387ec681f3Smrg- radeonsi: select the optimal micro tile mode at clear regardless of fast clear 35397ec681f3Smrg- radeonsi: add a fast path for MSAA resolving with RGB -> BGR swizzling 35407ec681f3Smrg- amd/addrlib: add back the incorrect original DCC checking 35417ec681f3Smrg- amd/addrlib: prevent defining regparm differently 35427ec681f3Smrg- amd/addrlib: define endianess differently 35437ec681f3Smrg- amd: update addrlib 35447ec681f3Smrg- ac/llvm: don't set unsupported xnack options to fix LLVM crashes on gfx6-8 35457ec681f3Smrg- radeonsi: disable sparse buffers on gfx7-8 35467ec681f3Smrg- radeonsi: set the clear/copy cache policy based on L2 cache size 35477ec681f3Smrg- radeonsi: don't insert start/stop pipeline stat events if it has no effect 35487ec681f3Smrg- radeonsi: never set DISABLE_WR_CONFIRM for CP DMA clears and copies 35497ec681f3Smrg- radeonsi: rename internal compute sync flags 35507ec681f3Smrg- radeonsi: remove unused SI_CP_DMA_SKIP_* definitions 35517ec681f3Smrg- radeonsi: merge CP DMA flags with internal compute flags 35527ec681f3Smrg- radeonsi: inline clear_buffer in si_screen_clear_buffer 35537ec681f3Smrg- radeonsi: set compute/cpdma sync flags in the outermost caller 35547ec681f3Smrg- radeonsi: reduce syncing in si_dcc_decompress 35557ec681f3Smrg- radeonsi: reduce syncing for initializing new buffers 35567ec681f3Smrg- radeonsi: reduce syncing in si_compute_expand_fmask when it's already idle 35577ec681f3Smrg- radeonsi: don't do an L2 flush in compute_do_clear_or_copy if we're not syncing 35587ec681f3Smrg- radeonsi: rename and apply SI_OP_CPDMA_SKIP_CACHE_FLUSH to compute as well 35597ec681f3Smrg- radeonsi: use the optimal packet order before draw packets for VS flushes too 35607ec681f3Smrg- radeonsi: add SI_CONTEXT_PFP_SYNC_ME to skip syncing PFP for image operations 35617ec681f3Smrg- radeonsi: return false from si_is_format_supported instead of NULL 35627ec681f3Smrg- radeonsi: don't use constbuf and set cache policy for 12-byte clear shader 35637ec681f3Smrg- radeonsi: don't use a constant buffer for the copy_image compute shader 35647ec681f3Smrg- radeonsi: decrease the maximum variable block size 35657ec681f3Smrg- radeonsi: pack the variable block size in one SGPR, 10 bits per component 35667ec681f3Smrg- amd: fix parsing the last dword of DMA_DATA packets 35677ec681f3Smrg- ac/surface: add CMASK info for level 0 35687ec681f3Smrg- radeonsi: determine accurately whether the framebuffer state has DCC MSAA 35697ec681f3Smrg- radeonsi: remove si_screen::dcc_msaa_allowed 35707ec681f3Smrg- radeonsi: parallelize CMASK and DCC clears 35717ec681f3Smrg- radeonsi: return success/failure from si_alloc_separate_cmask 35727ec681f3Smrg- radeonsi: add num_layers variable into si_do_fast_color_clear 35737ec681f3Smrg- radeonsi: group and parallelize all clears in si_texture_create_object 35747ec681f3Smrg- radeonsi: set better default depth clear value 35757ec681f3Smrg- radeonsi: enable HTILE with mipmapping on gfx9+ 35767ec681f3Smrg- radeonsi: unset PIPE_CLEAR_* flags for non-existent buffers 35777ec681f3Smrg- radeonsi: turn the loops over color buffers into while loops in si_clear 35787ec681f3Smrg- radeonsi: don't use CP DMA for clears/copies except for very small ones 35797ec681f3Smrg- ac/surface/tests: move shareable code into ac_surface_test_common.h 35807ec681f3Smrg- radeonsi: fix si_compute_copy_image if DCC decompression happens before a copy 35817ec681f3Smrg- gallium/pb: pass an optional winsys pointer to the buffer destroy function 35827ec681f3Smrg- winsys/radeon: rename radeon_bo_reference -> radeon_ws_bo_reference 35837ec681f3Smrg- radeon_winsys.h: add a new function radeon_bo_reference that takes a winsys 35847ec681f3Smrg- radeon_winsys.h: add a winsys parameter to most winsys buffer functions 35857ec681f3Smrg- winsys/amdgpu: remove amdgpu_winsys_bo::ws 35867ec681f3Smrg- winsys/amdgpu: add amdgpu_cs::ws to reduce dereferences 35877ec681f3Smrg- gallium/pb: change pb_buffer::alignment to alignment_log2 35887ec681f3Smrg- gallium/pb: remove 8 bytes from pb_buffer by packing variables 35897ec681f3Smrg- winsys/amdgpu: remove another 8 bytes from amdgpu_winsys_bo by packing better 35907ec681f3Smrg- ac/surface: split dcc level info from surface_info to save space 35917ec681f3Smrg- ac/surface: overlap color and Z/S fields using a union in legacy_surf_layout 35927ec681f3Smrg- ac/surface: change legacy_surf_level::offset to 32-bit offset_256B shifted by 8 35937ec681f3Smrg- ac/surface: inline and reorder gfx9_surf_flags for better packing 35947ec681f3Smrg- ac/surface: pack gfx9_surf_meta_flags better 35957ec681f3Smrg- ac/surface: pack gfx9_surf_layout:resource_type better to save 8 bytes 35967ec681f3Smrg- ac/surface: pack radeon_surf::num_htile_levels better 35977ec681f3Smrg- ac/surface: pack alignments by storing log2 in radeon_surf 35987ec681f3Smrg- ac/surface: overlap color and Z/S fields using a union in gfx9_surf_layout 35997ec681f3Smrg- ac/surface: pack radeon_surf better 36007ec681f3Smrg- ac/surface: unify htile_levels and dcc_levels as meta_levels 36017ec681f3Smrg- ac/surface: unify htile_* and dcc_* fields as meta_* fields 36027ec681f3Smrg- ac/surface: use named "color and "zs" structures in unions 36037ec681f3Smrg- radeonsi: don't cache FMASK transactions from CB in L2 36047ec681f3Smrg- radeonsi: restructure DCC disablement into a switch 36057ec681f3Smrg- radeonsi: allow trivial DCC clears for shared textures with DCC constant encode 36067ec681f3Smrg- radeonsi: implement per-level DCC and CMASK fast clears for gfx10+ 36077ec681f3Smrg- radeonsi: implement Z/S fast clear for non-zero mipmap levels 36087ec681f3Smrg- radeonsi: parallelize Z/S conversion into TC-compatible with fast color clears 36097ec681f3Smrg- radeonsi: clean up some mess around htile_stencil_disabled 36107ec681f3Smrg- radeonsi: add si_can_fast_clear_depth/stencil helpers 36117ec681f3Smrg- radeonsi: indent the code for TC-compatibility HTILE transition 36127ec681f3Smrg- radeonsi: implement fast Z/S clears using clear_buffer on HTILE 36137ec681f3Smrg- radeonsi: enable DCC fast clears for non-zero mipmap levels and 0/1 clear values 36147ec681f3Smrg- radeonsi: when transitioning to TC-compat HTILE, try to do a proper clear 36157ec681f3Smrg- radeonsi: do Z-only or S-only HTILE clear using a compute shader doing RMW 36167ec681f3Smrg- radeonsi: refine fast clears for small buffers, always use them for large HTILE 36177ec681f3Smrg- radeonsi: try to fix DCC coherency issues with DCC decompression 36187ec681f3Smrg- radeonsi: allow DCC_DECOMPRESS via CB with MSAA textures 36197ec681f3Smrg- ac/surface: only apply the 3D swizzle mode tuning to gfx10+ 36207ec681f3Smrg- ac/surface/tests: test Sienna Cichlid and Navy Flounder 36217ec681f3Smrg- ac/surface/tests: fix a random segfault in the modifier test 36227ec681f3Smrg- amd/addrlib: expose DCC address equations to drivers 36237ec681f3Smrg- meson: add an optional OpenMP dependency for AMD tests 36247ec681f3Smrg- ac/surface: add a test of DccAddrFromCoord prototype outside of addrlib 36257ec681f3Smrg- ac/surface: limit the number of swizzle modes that can have displayable DCC 36267ec681f3Smrg- ac,radeonsi: rewrite DCC retiling without the DCC retile map 36277ec681f3Smrg- radeonsi: fix and enable full DCC with MSAA 2x on gfx9 36287ec681f3Smrg- radeonsi: implement DCC MSAA 4x/8x fast clear using DCC equations on gfx9 36297ec681f3Smrg- radeonsi: enable DCC for MSAA 4x and 8x on gfx9 36307ec681f3Smrg- radeonsi: move binding the internal compute shader into si_launch_grid_internal 36317ec681f3Smrg- radeonsi: unify internal compute with SSBOs in si_launch_grid_internal_ssbos 36327ec681f3Smrg- compiler: move TEXTURE_COORD/VERTEX_GENERIC_ATTRIB limits into shader_enums.h 36337ec681f3Smrg- nir: add src and dest types to all IO loads and stores for mediump 36347ec681f3Smrg- nir: add new VARYING_SLOTs and shader info for packed 16-bit varyings 36357ec681f3Smrg- nir: add many passes that lower and optimize 16-bit input/outputs and samplers 36367ec681f3Smrg- glsl: pack 16-bit uniforms in the NIR linker 36377ec681f3Smrg- mesa: implement glUniform for packed FP16 uniforms 36387ec681f3Smrg- mesa: implement glGetUniform for FP16 uniforms 36397ec681f3Smrg- mesa: implement glGetActiveUniform for FP16 uniforms 36407ec681f3Smrg- glsl: lower mediump uniforms to FP16 based on an option 36417ec681f3Smrg- gallium: add PIPE_SHADER_CAP_FP16_CONST_BUFFERS for FP16 uniforms 36427ec681f3Smrg- st/mesa: fix nir_lower_io if it's done right after IO vectorization 36437ec681f3Smrg- ac/llvm: implement 16-bit packed VS outputs and FS inputs 36447ec681f3Smrg- radeonsi: implement 16-bit VS->PS varyings 36457ec681f3Smrg- radeonsi: implement 16-bit VS inputs 36467ec681f3Smrg- radeonsi: optimize and legalize 16-bit samplers 36477ec681f3Smrg- radeonsi: kill 16-bit VS outputs if PS doesn't use them or doing Z-only draw 36487ec681f3Smrg- radeonsi: enable FP16 for mediump on gfx9+ if radeonsi_fp16=true 36497ec681f3Smrg- nir: handle mediump varyings in varying compaction helpers 36507ec681f3Smrg- radeonsi: don't decompress DCC for float formats in si_compute_copy_image 36517ec681f3Smrg- radeonsi: fix automatic DCC retiling after DCC clear and DCC decompression 36527ec681f3Smrg- radeonsi: fix automatic DCC retiling after compute image stores 36537ec681f3Smrg- radeonsi: make the gfx9 DCC MSAA clear shader depend on the number of samples 36547ec681f3Smrg- util: fix (re-enable) L3 cache pinning 36557ec681f3Smrg 36567ec681f3SmrgMarek Vasut (2): 36577ec681f3Smrg 36587ec681f3Smrg- compiler/nir: Increment shader input count and mark as used when adding new gl_PointCoord 36597ec681f3Smrg- etnaviv: Fix point sprite Z,W coordinate replacement 36607ec681f3Smrg 36617ec681f3SmrgMark Janes (12): 36627ec681f3Smrg 36637ec681f3Smrg- intel: Print GPU timing data based on INTEL_MEASURE 36647ec681f3Smrg- anv: enable timestamp for INTEL_MEASURE 36657ec681f3Smrg- anv: implement anv layer of INTEL_MEASURE 36667ec681f3Smrg- blorp: add hook for INTEL_MEASURE 36677ec681f3Smrg- anv: add hooks to call INTEL_MEASURE 36687ec681f3Smrg- iris: implement iris layer of INTEL_MEASURE 36697ec681f3Smrg- iris: add a iris_context reference to iris_batch 36707ec681f3Smrg- intel: stop tracking submission state in INTEL_MEASURE 36717ec681f3Smrg- intel: support secondary command buffers in INTEL_MEASURE 36727ec681f3Smrg- intel: combine common gather routines in INTEL_MEASURE 36737ec681f3Smrg- intel: check setuid before writing output file in INTEL_MEASURE 36747ec681f3Smrg- Revert "blorp/gen12: Don't use aux address if implicit CCS" 36757ec681f3Smrg 36767ec681f3SmrgMatt Turner (8): 36777ec681f3Smrg 36787ec681f3Smrg- docs/freedreno: Fix a few typos 36797ec681f3Smrg- turnip: Remove unused TU_DEBUG_IR3 flag 36807ec681f3Smrg- docs: Mark VK_KHR_maintenance1 as done on turnip 36817ec681f3Smrg- ci: Use CI_PROJECT_ROOT_NAMESPACE 36827ec681f3Smrg- tu: Skip tu_tiling_config_update_tile_layout() if not using gmem 36837ec681f3Smrg- ci: Disable panfrost g52 36847ec681f3Smrg- Remove Scons leftovers 36857ec681f3Smrg- ir3: Don't count (nopX) towards the wrong category 36867ec681f3Smrg 36877ec681f3SmrgMatti Hamalainen (2): 36887ec681f3Smrg 36897ec681f3Smrg- gallium: Fix broken trace XML output 36907ec681f3Smrg- gallium/tools: update trace scripts to Python 3 36917ec681f3Smrg 36927ec681f3SmrgMauro Rossi (29): 36937ec681f3Smrg 36947ec681f3Smrg- android: r600/sfn: add sfn_nir_lower_64bit.cpp to Makefile.sources 36957ec681f3Smrg- android: freedreno/hw/isa: Add description of ir3 ISA 36967ec681f3Smrg- android: freedreno/ir3: Switch over to new encoder/decoder 36977ec681f3Smrg- android: pan/mdg: create nir pass to lower image coord bitsize 36987ec681f3Smrg- android: intel: Print GPU timing data based on INTEL_MEASURE 36997ec681f3Smrg- android: anv: implement anv layer of INTEL_MEASURE 37007ec681f3Smrg- android: iris: implement iris layer of INTEL_MEASURE 37017ec681f3Smrg- android: radv: port to using common dispatch code. 37027ec681f3Smrg- android: radv: fix building error in radv_android.c 37037ec681f3Smrg- android: util/fossilize_db: add missing sources to Makefile.sources 37047ec681f3Smrg- android: ac/rgp: fix building error 37057ec681f3Smrg- android: mesa: Move the FXT1 compressor/decompressor to util/ 37067ec681f3Smrg- android: pan/bi: reorder static dependencies in gallium/dri 37077ec681f3Smrg- driconf: avoid Non-ASCII character error in driconf_static.py 37087ec681f3Smrg- android: driconf: Generate a static table when no xmlconfig 37097ec681f3Smrg- android: i965: Rename files with "intel\_" prefix to "brw\_" 37107ec681f3Smrg- android: util: create some standalone compression helpers 37117ec681f3Smrg- android: anv: add libcutils shared dependency 37127ec681f3Smrg- android: r600/sfn: fix sfn_nir_algebraic.c gen rules 37137ec681f3Smrg- android: vulkan/util: add vk_descriptors.{c,h} to Makefile.sources 37147ec681f3Smrg- android: amd/addrlib: define endianess to build 37157ec681f3Smrg- android: panfrost: Use the blend shader cache attached to the device 37167ec681f3Smrg- vulkan/util: Fix implicit declaration of ffs for Android build 37177ec681f3Smrg- android: anv: Remove anv_intel.c from Makefile.sources 37187ec681f3Smrg- android: anv: fix build error in anv_android.c 37197ec681f3Smrg- compiler/glsl: fix include for Android build 37207ec681f3Smrg- android: panfrost/lib: add pan_cs.c to Makefile.sources 37217ec681f3Smrg- android: gallium/radeonsi: add nir include path 37227ec681f3Smrg- android: amd/common: add nir include path 37237ec681f3Smrg 37247ec681f3SmrgMichael Tang (5): 37257ec681f3Smrg 37267ec681f3Smrg- microsoft/compiler: Make resource_state_manager only build with_gallium_d3d12 37277ec681f3Smrg- util: Make os_read_file use O_BINARY on Windows 37287ec681f3Smrg- microsoft/spirv_to_dxil: Fix spirv2dxil I/O to use binary mode 37297ec681f3Smrg- microsoft/spirv_to_dxil: Add lowering pass to handle gl_PerVertex 37307ec681f3Smrg- microsoft/spirv_to_dxil: Add extra lowering functions according to the docs on nir_inline_functions 37317ec681f3Smrg 37327ec681f3SmrgMichel Dänzer (53): 37337ec681f3Smrg 37347ec681f3Smrg- ci: Remove .gitlab-ci/meson-build.bat 37357ec681f3Smrg- ci: Use meson test directly instead of ninja test 37367ec681f3Smrg- wsi/x11: Use get_screen_resources_current in wsi_x11_detect_xwayland 37377ec681f3Smrg- ci: Enable process isolation for softpipe & freedreno piglit jobs 37387ec681f3Smrg- ci: Use GNU time as meson test wrapper 37397ec681f3Smrg- ci: Run 'time' in the background and propagate signals to test process 37407ec681f3Smrg- ci: Fix MESA_TEMPLATES_COMMIT value 37417ec681f3Smrg- ci: Update to newer ci-fairy 37427ec681f3Smrg- ci: Set GALLIVM_PERF=no_filter_hacks for llvmpipe-piglit-quick_shader 37437ec681f3Smrg- ci: Set GALLIVM_PERF=no_filter_hacks for llvmpipe-piglit-quick_gl 37447ec681f3Smrg- ci: Set GALLIVM_PERF=nopt,no_filter_hacks for llvmpipe-gles2 37457ec681f3Smrg- ci: Use MESA\_ namespace for image variables in Windows jobs 37467ec681f3Smrg- ci: Use MESA_IMAGE_TAG everywhere 37477ec681f3Smrg- ci: Move FDO_DISTRIBUTION_TAG assignment to template 37487ec681f3Smrg- ci: Add and use .set-image template to construct docker image name 37497ec681f3Smrg- ci: Incorporate base image tag into dependent image tags 37507ec681f3Smrg- ci: Append build image tag to LAVA tag used for minio path 37517ec681f3Smrg- ci: Add trailing slash to path for documentation preview 37527ec681f3Smrg- ci: Restrict meson-gallium job to gstreamer runners 37537ec681f3Smrg- ci: Disable scons-win64 job 37547ec681f3Smrg- ci: Move meson-build.sh to meson/build.sh 37557ec681f3Smrg- ci: Drop SIGINT handling from meson test wrapper script 37567ec681f3Smrg- ci: Move /usr/bin/time check from meson test wrapper to build script 37577ec681f3Smrg- aco/tests: Use _exit in child process 37587ec681f3Smrg- ci: Add strace to the x86_build docker image 37597ec681f3Smrg- ci: Run meson tests in strace if it's available and can be used 37607ec681f3Smrg- ci: Don't run meson tests in strace for meson-mingw32-x86_64 job 37617ec681f3Smrg- intel/tools: Use subprocess.Popen to read output directly from a pipe 37627ec681f3Smrg- Revert "ci: Restrict meson-gallium job to gstreamer runners" 37637ec681f3Smrg- glcpp: Fully initialize struct gl_context 37647ec681f3Smrg- ci: Disable valgrind in some build jobs 37657ec681f3Smrg- glsl/tests: Bump glcpp valgrind test timeout to 240 seconds 37667ec681f3Smrg- glsl/tests: Don't use tempfiles 37677ec681f3Smrg- glsl/tests: Use exit code 126 to detect valgrind errors 37687ec681f3Smrg- Revert "ci: disable glcpp tests for now" 37697ec681f3Smrg- Revert "meson: add enable-glcpp-tests option" 37707ec681f3Smrg- Revert "glsl/test: Don't run whitespace tests in parallel" 37717ec681f3Smrg- ci: Remove INCLUDE_PIGLIT 37727ec681f3Smrg- ci: Build ARM baremetal rootfs in native container 37737ec681f3Smrg- ci: Merge ARM testing docker images to a single arm_test one 37747ec681f3Smrg- wsi/x11: Wait for fences with IMMEDIATE on Xwayland 37757ec681f3Smrg- ci: Fix HTML summary path for piglit OpenCL job artifacts 37767ec681f3Smrg- intel/blorp: Initialize texture_data[0] 37777ec681f3Smrg- ci: Do not install armhf LLVM packages 37787ec681f3Smrg- ci: Bump LLVM/clang from 10 to 11 37797ec681f3Smrg- ci: Move docker images from Debian buster to bullseye 37807ec681f3Smrg- ci: Install librenderdoc from Debian bullseye 37817ec681f3Smrg- ci: Install spirv-tools from Debian bullseye 37827ec681f3Smrg- ci: Install llvm-spirv from Debian bullseye 37837ec681f3Smrg- ci: Install GLVND from Debian bullseye 37847ec681f3Smrg- ci: Install Rust & cargo from Debian for x86_test* images 37857ec681f3Smrg- ci: Do not append ci-templates commit hash to Windows docker image tag 37867ec681f3Smrg- ci: Update to latest ci-templates 37877ec681f3Smrg 37887ec681f3SmrgMichel Zou (25): 37897ec681f3Smrg 37907ec681f3Smrg- vulkan/lavapipe: add missing VKAPI_ATTR/CALL 37917ec681f3Smrg- vulkan: Fix windows api conflict 37927ec681f3Smrg- zink: Fix win32 build 37937ec681f3Smrg- vulkan: Fix windows api conflict 37947ec681f3Smrg- meson: invalid keyword argument dependencies 37957ec681f3Smrg- zink: fix win32 build 37967ec681f3Smrg- util: fix gcc vsnprintf overflow 37977ec681f3Smrg- glapi: keep declspec(thread) msvc-specific 37987ec681f3Smrg- vulkan: implement wsi_win32 backend 37997ec681f3Smrg- lavapipe: add mingw32 def file 38007ec681f3Smrg- lavapipe: set empty dll prefix 38017ec681f3Smrg- gallium: remove DROP_PIPE_LOADER_MISC 38027ec681f3Smrg- meson/xmlconfig: win32 regex fallback 38037ec681f3Smrg- meson: detect winflex/bison only on native win32 38047ec681f3Smrg- turnip: update features.txt 38057ec681f3Smrg- lavapipe: update features.txt 38067ec681f3Smrg- vulkan: fix CreateRenderPass prototype 38077ec681f3Smrg- swr: extern declaration for win32 intrinsics 38087ec681f3Smrg- swr: fix win32 intrinsics 38097ec681f3Smrg- swr: Fix SWR_CONTEXT pre-declaration 38107ec681f3Smrg- swr: fix unused SplitString warning 38117ec681f3Smrg- swr: fix deprecated llvm 11 declaration warning 38127ec681f3Smrg- swr: fix array-bounds warning 38137ec681f3Smrg- lavapipe: Fix type narrowing 38147ec681f3Smrg- docs: missing lvp win32surface ext in features.txt 38157ec681f3Smrg 38167ec681f3SmrgMike Blumenkrantz (775): 38177ec681f3Smrg 38187ec681f3Smrg- zink: clamp sampler+samplerview limits 38197ec681f3Smrg- util/hash_table: optimize rehash for empty table and no-func clears 38207ec681f3Smrg- util/set: optimize rehash for empty table and no-func clears 38217ec681f3Smrg- util/set: add the found param to search_or_add 38227ec681f3Smrg- util/set: split off create() into an init() function 38237ec681f3Smrg- zink: optimize renderpass hash table 38247ec681f3Smrg- nir/lower_uniforms_to_ubo: set explicit_binding on uniform_0 38257ec681f3Smrg- zink: add spirv builder function for runtime array type 38267ec681f3Smrg- zink: add util function for emitting ntv atomic ops 38277ec681f3Smrg- zink: add set_shader_buffers pipe_context method 38287ec681f3Smrg- zink: hook up ssbo shader bindings 38297ec681f3Smrg- zink: emit ssbo variables in ntv 38307ec681f3Smrg- zink: modify ubo loading in ntv to work for ssbos 38317ec681f3Smrg- zink: start supporting atomic shader ops 38327ec681f3Smrg- zink: split UBOs and samplers into 'read' batch references during draw 38337ec681f3Smrg- zink: flag ssbo buffer resources as having pending writes on batch 38347ec681f3Smrg- zink: add more usage bits for buffer types 38357ec681f3Smrg- zink: partially enable SSBO pipe cap 38367ec681f3Smrg- zink: only emit streamout targets during draw if we have them 38377ec681f3Smrg- zink: rework framebuffer state 38387ec681f3Smrg- zink: add batch flag for checking renderpass state 38397ec681f3Smrg- zink: remove renderpass refcounting 38407ec681f3Smrg- zink: ralloc zink_framebuffer structs 38417ec681f3Smrg- zink: rename param in zink_create_framebuffer 38427ec681f3Smrg- zink: use 'fb' variable name for zink_framebuffer objects in zink_framebuffer.c 38437ec681f3Smrg- zink: decouple renderpass from framebuffer state 38447ec681f3Smrg- zink: move zink_clear to zink_clear.c 38457ec681f3Smrg- zink: start to refactor clearing 38467ec681f3Smrg- zink: handle clears with scissor regions 38477ec681f3Smrg- zink: break out scissor region testing for clear functions 38487ec681f3Smrg- zink: break out color/zs no_rp clear into separate functions 38497ec681f3Smrg- zink: break out some of the u_blitter setup into util function 38507ec681f3Smrg- zink: add a pipe_context::clear_texture hook 38517ec681f3Smrg- zink: enable PIPE_CAP_CLEAR_TEXTURE 38527ec681f3Smrg- zink: reduce blendfactor when alpha_to_one is set 38537ec681f3Smrg- zink: tweak xfb slot mapping in ntv 38547ec681f3Smrg- zink: process ubos with location values set as long as they're actually ubos 38557ec681f3Smrg- zink: add VK_KHR_driver_properties 38567ec681f3Smrg- zink: enable WSI-faking for RADV too 38577ec681f3Smrg- zink: rename zink_context::\*image_views -> sampler_views 38587ec681f3Smrg- zink: add ntv util function for getting image type 38597ec681f3Smrg- zink: rewrite image/sampler glsl -> vk type functions for robustness 38607ec681f3Smrg- zink: add spirv_builder function for hexops 38617ec681f3Smrg- zink: add spirv builder functions for image ops 38627ec681f3Smrg- zink: add ntv function for emitting variable access decorations 38637ec681f3Smrg- zink: verify format caps and add storage image usage when possible in creation 38647ec681f3Smrg- zink: add 'has_draw' flag to batch struct 38657ec681f3Smrg- zink: add a pipe_context::memory_barrier hook 38667ec681f3Smrg- zink: add shader image support to zink_binding() 38677ec681f3Smrg- zink: add new 'sampler_types' variable to ntv_context struct 38687ec681f3Smrg- zink: handle image variable types in ntv 38697ec681f3Smrg- zink: handle more atomic ops in ntv 38707ec681f3Smrg- zink: handle nir_intrinsic_memory_barrier in ntv 38717ec681f3Smrg- zink: add nir_var_uniform case to get_storage_class() 38727ec681f3Smrg- zink: expand ntv array derefs to track image derefs 38737ec681f3Smrg- zink: add handling for all basic image ops in ntv 38747ec681f3Smrg- zink: enable early frag test execution in ntv when necessary 38757ec681f3Smrg- zink: enable image caps in ntv when a shader has images 38767ec681f3Smrg- zink: handle image descriptors during zink_shader creation 38777ec681f3Smrg- zink: break out bufferview creation into separate function 38787ec681f3Smrg- zink: add a pipe_context::set_shader_images hook 38797ec681f3Smrg- zink: handle shader image descriptor updates during draw 38807ec681f3Smrg- zink: check if multisample support exists for shader image formats 38817ec681f3Smrg- zink: export shader image caps using features 38827ec681f3Smrg- zink: GLSL 420 38837ec681f3Smrg- docs/features: mark off GL 4.2 for zink 38847ec681f3Smrg- zink: set PIPE_CAP_COPY_BETWEEN_COMPRESSED_AND_PLAIN_FORMATS 38857ec681f3Smrg- zink: force per-sample interpolation 38867ec681f3Smrg- zink: set PIPE_CAP_FRAMEBUFFER_NO_ATTACHMENT 38877ec681f3Smrg- zink: set PIPE_CAP_ROBUST_BUFFER_ACCESS_BEHAVIOR 38887ec681f3Smrg- zink: support VK_IMAGE_LAYOUT_DEPTH_STENCIL_READ_ONLY_OPTIMAL usage 38897ec681f3Smrg- zink: add layout member to barrier setup in draw 38907ec681f3Smrg- zink: support PIPE_FORMAT_X24S8_UINT 38917ec681f3Smrg- zink: check correct caps for PIPE_CAP_IMAGE_LOAD_FORMATTED 38927ec681f3Smrg- zink: enable PIPE_CAP_SAMPLER_VIEW_TARGET 38937ec681f3Smrg- gallium/u_inlines: add helper for simplifying pipe_context::resource_copy_region 38947ec681f3Smrg- zink: add function for waiting on a specific batch's fence 38957ec681f3Smrg- zink: don't force a renderpass start when setting framebuffer state 38967ec681f3Smrg- spirv: handle NoContraction in GLSL450 alu ops 38977ec681f3Smrg- zink: fix streamout for clipdistance 38987ec681f3Smrg- zink: add a VkExternalMemoryImageCreateInfo for PIPE_BIND_SHARED images 38997ec681f3Smrg- zink: set lower_mul_2x32_64 when 64bit int support is available 39007ec681f3Smrg- zink: enable PIPE_CAP_TEXTURE_MIRROR_CLAMP_TO_EDGE 39017ec681f3Smrg- zink: flag gfx pipeline dirty using newer mechanism 39027ec681f3Smrg- zink: guarantee surface lifetimes for shader images 39037ec681f3Smrg- st/bitmap: use GL_CLAMP_TO_EDGE for bitmap samplers 39047ec681f3Smrg- st/drawpixels: use GL_CLAMP_TO_EDGE instead of GL_CLAMP for samplers 39057ec681f3Smrg- zink: don't export PIPE_CAP_MAX_COMBINED_SHADER_BUFFERS value 39067ec681f3Smrg- zink: add spirv builder function for OpAtomicStore 39077ec681f3Smrg- zink: flag ssbo buffer resources as having pending writes per stage 39087ec681f3Smrg- zink: handle null ssbo attachments without crashing 39097ec681f3Smrg- zink: handle more ssbo ops in ntv 39107ec681f3Smrg- zink: rework ssbo indexing and binding 39117ec681f3Smrg- zink: support nir_intrinsic_store_ssbo 39127ec681f3Smrg- zink: implement get_ssbo_size nir intrinsic 39137ec681f3Smrg- zink: flatten out ssbo/ubo variable decls in ntv 39147ec681f3Smrg- zink: export ssbo caps 39157ec681f3Smrg- Revert "glcpp: disable 'windows' tests" 39167ec681f3Smrg- meson: add enable-glcpp-tests option 39177ec681f3Smrg- ci: disable glcpp tests for now 39187ec681f3Smrg- zink: add barrier helper for buffer resources 39197ec681f3Smrg- zink: add a stage param for buffer resource barriers 39207ec681f3Smrg- zink: add helper function for checking if access flags include write access 39217ec681f3Smrg- zink: improve barrier helper for buffer resources and add check for barrier need 39227ec681f3Smrg- zink: flag previous vertex stages as dirty when toggling a later stage 39237ec681f3Smrg- zink: add shader key for vs shaders 39247ec681f3Smrg- zink: flag shaders as needing update when clip_halfz changes 39257ec681f3Smrg- zink: move tess/geom shader info to vs shader key 39267ec681f3Smrg- glsl: support 64bit integer loop iterators 39277ec681f3Smrg- radv: print image array size in debug mode 39287ec681f3Smrg- zink: move maintenance2 extension to right file 39297ec681f3Smrg- zink: unify shader image unbind codepath 39307ec681f3Smrg- zink: be a little more precise about query types in one conditional 39317ec681f3Smrg- radv: null bo list pointer for null descriptors on update 39327ec681f3Smrg- radv: zero the bo descriptor array when allocating a new set 39337ec681f3Smrg- zink: force 4 component formats for samplerview/render textures 39347ec681f3Smrg- zink: support nir_intrinsic_memory_barrier_buffer 39357ec681f3Smrg- zink: add defines for compute batch and gfx batch count 39367ec681f3Smrg- zink: bump resource usage flags to allow 5 batches 39377ec681f3Smrg- zink: make get_resource_usage() public 39387ec681f3Smrg- zink: make zink_batch_reference_resource_rw return usage info 39397ec681f3Smrg- zink: wait on compute batch when necessary during transfer map 39407ec681f3Smrg- zink: add spirv_builder function for emitting a 3word literal exec mode 39417ec681f3Smrg- zink: handle COMPUTE bindings in compiler/ntv 39427ec681f3Smrg- zink: handle COMPUTE setup in ntv 39437ec681f3Smrg- zink: handle COMPUTE glsl variables 39447ec681f3Smrg- zink: implement shared load/store nir ops in ntv 39457ec681f3Smrg- zink: add handling for shared atomic ops in ntv 39467ec681f3Smrg- zink: handle nir_intrinsic_memory_barrier_shared in ntv 39477ec681f3Smrg- zink: ignore compute batch when starting/ending batches 39487ec681f3Smrg- zink: take a pipe_reference param in zink_batch_reference_program 39497ec681f3Smrg- zink: refactor batch creation 39507ec681f3Smrg- zink: make allocate_descriptor_set() take more params instead of a gfx_program 39517ec681f3Smrg- zink: explicitly get shader stage from shader during binding setup in draw 39527ec681f3Smrg- zink: rename pipeline_cache_entry -> gfx_pipeline_cache_entry 39537ec681f3Smrg- zink: add compute programs and pipelines 39547ec681f3Smrg- zink: break out descriptor updating into separate function 39557ec681f3Smrg- zink: setup compute batch and add handling 39567ec681f3Smrg- zink: handle memory barriers for compute batch 39577ec681f3Smrg- zink: handle descriptor set updates for compute operations 39587ec681f3Smrg- zink: flush gfx/compute batches when the other pipeline needs resource sync 39597ec681f3Smrg- zink: add launch_grid pipe_context hook for compute handling 39607ec681f3Smrg- zink: export compute-specific shader/compute caps 39617ec681f3Smrg- zink: enable compute 39627ec681f3Smrg- zink: GLSL 430 39637ec681f3Smrg- features: mark off GL 4.3 for zink 39647ec681f3Smrg- zink: add spirv_builder wrapper for vote intrinsics 39657ec681f3Smrg- zink: handle vote intrinsics in ntv 39667ec681f3Smrg- zink: rework viewport handling 39677ec681f3Smrg- zink: handle nir_texop_texture_samples 39687ec681f3Smrg- zink: add a texture barrier hook 39697ec681f3Smrg- zink: use = and not \|= for VkMemoryPropertyFlags during resource creation 39707ec681f3Smrg- zink: set HOST_COHERENT bit for coherent resource creation 39717ec681f3Smrg- zink: track persistent, non-coherent, writable transfer map count for resources 39727ec681f3Smrg- zink: slightly refactor batch resource referencing in update_descriptors() 39737ec681f3Smrg- zink: flush all resources with persistent maps on work batch before draw/compute 39747ec681f3Smrg- zink: enable PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT 39757ec681f3Smrg- zink: rename zink_context::dummy_buffer -> dummy_vertex_buffer 39767ec681f3Smrg- zink: create dummy xfb buffer 39777ec681f3Smrg- zink: handle null xfb buffers 39787ec681f3Smrg- zink: use better mapping for PIPE_FORMAT_X24S8_UINT 39797ec681f3Smrg- zink: handle nir_intrinsic_load_helper_invocation 39807ec681f3Smrg- zink: fix xfb buffer refcounting 39817ec681f3Smrg- zink: add PIPE_BIND_QUERY_BUFFER to the all-purpose resource creation path 39827ec681f3Smrg- zink: add a get_query_result_resource hook 39837ec681f3Smrg- zink: enable PIPE_CAP_TGSI_ARRAY_COMPONENTS 39847ec681f3Smrg- zink: enable PIPE_CAP_QUERY_BUFFER_OBJECT 39857ec681f3Smrg- zink: GLSL 440 39867ec681f3Smrg- zink: enable PIPE_CAP_CONDITIONAL_RENDER_INVERTED 39877ec681f3Smrg- zink: enable PIPE_CAP_CLIP_HALFZ 39887ec681f3Smrg- zink: enable PIPE_CAP_TGSI_TXQS 39897ec681f3Smrg- zink: enable PIPE_CAP_TEXTURE_BARRIER 39907ec681f3Smrg- zink: GLSL 450 39917ec681f3Smrg- features: mark off GL 4.5 for zink 39927ec681f3Smrg- zink: add spirv interfaces for bo and image/sampler/push variables 39937ec681f3Smrg- zink: lower flrp64 and ffma64 when in softfp64 mode 39947ec681f3Smrg- zink: always use query->type for starting/stopping xfb queries 39957ec681f3Smrg- zink: make the xfb_query_pool into an array 39967ec681f3Smrg- zink: break out cpu query reading for qbos into separate function 39977ec681f3Smrg- zink: put SO_OVERFLOW queries on the primgen list 39987ec681f3Smrg- zink: support SO_OVERFLOW pipe query types 39997ec681f3Smrg- zink: fix streamout for tess stage 40007ec681f3Smrg- zink: flag exact alu op results in ntv with NoContraction 40017ec681f3Smrg- zink: unset generated TCS if its parent TESS is unset 40027ec681f3Smrg- zink: hook up cs invocation queries to the compute batch 40037ec681f3Smrg- zink: add support for pipeline statistics queries 40047ec681f3Smrg- zink: fix slot mapping for legacy gl io with tess stages 40057ec681f3Smrg- zink: handle 1bit undef values in ntv 40067ec681f3Smrg- zink: add handling for ARB_shader_draw_parameters variables in ntv 40077ec681f3Smrg- zink: create a struct for tracking push constant layout 40087ec681f3Smrg- zink: rework tcs injection to be more compatible with new push const struct 40097ec681f3Smrg- zink: add push constant value to indicate whether the current draw is indexed 40107ec681f3Smrg- zink: wrap shader gl_BaseVertex access with a bcsel based on push constant state 40117ec681f3Smrg- zink: add a draw_id param to vs push constants 40127ec681f3Smrg- zink: add a vs shader key for rewriting gl_DrawID 40137ec681f3Smrg- zink: break out push constant creation in compiler and add drawid value 40147ec681f3Smrg- zink: rewrite drawid based on shader key value 40157ec681f3Smrg- zink: add util function for submitting the compute batch 40167ec681f3Smrg- zink: enable PIPE_CAP_TGSI_VOTE 40177ec681f3Smrg- zink: enable PIPE_CAP_DRAW_PARAMETERS 40187ec681f3Smrg- zink: enable PIPE_CAP_POLYGON_OFFSET_CLAMP 40197ec681f3Smrg- zink: enable PIPE_CAP_QUERY_SO_OVERFLOW 40207ec681f3Smrg- zink: enable pipeline statistics cap 40217ec681f3Smrg- zink: PIPE_CAP_GL_SPIRV 40227ec681f3Smrg- zink: GLSL 460 40237ec681f3Smrg- features: mark off GL 4.6 and ES 3.1 for zink 40247ec681f3Smrg- zink: support nir_intrinsic_group_memory_barrier 40257ec681f3Smrg- zink: fix device codegen extension detection 40267ec681f3Smrg- zink: add nir_intrinsic_memory_barrier_image handling 40277ec681f3Smrg- zink: use nir_shader_instructions_pass for draw params pass 40287ec681f3Smrg- zink: add flag for no-oping fence finish 40297ec681f3Smrg- zink: hook up valid_buffer_range for buffer resources using util_range 40307ec681f3Smrg- zink: create a VkPipelineCache object on the screen and use it 40317ec681f3Smrg- zink: add a disk cache for pipeline objects 40327ec681f3Smrg- gallium/trace: add a pipe_screen::get_compiler_options method 40337ec681f3Smrg- zink: handle dual blending override from driconf 40347ec681f3Smrg- zink: move command pool to the batch 40357ec681f3Smrg- nir/lower_tex: rewrite tex/txb -> txd/txl before saturating srcs 40367ec681f3Smrg- mesa/st: add pipe_sampler_state::border_color_is_integer 40377ec681f3Smrg- mesa/st: add PIPE_CAP_GL_CLAMP 40387ec681f3Smrg- zink: enable GL_CLAMP cap 40397ec681f3Smrg- gallium/trace: remove transfer_map assert 40407ec681f3Smrg- zink: add helper function for getting pipeline stage from shader stage 40417ec681f3Smrg- zink: set buffer resource barriers for descriptor resources in update_descriptors() 40427ec681f3Smrg- zink: rework xfb counter resource barriers 40437ec681f3Smrg- zink: rework xfb barrier transitions when reusing as vertex inputs 40447ec681f3Smrg- zink: remove aspect param from zink_resource_barrier 40457ec681f3Smrg- zink: add a VkPipelineStageFlags param to zink_resource_barrier() 40467ec681f3Smrg- zink: add helper for image resource barriers and avoid unnecessary barriers 40477ec681f3Smrg- zink: use define for max descriptor array size 40487ec681f3Smrg- zink: add generic wrapper for checking whether a resource needs a barrier 40497ec681f3Smrg- zink: avoid emitting unnecessary pipeline barriers during update_descriptors 40507ec681f3Smrg- zink: break out barrier transitioning in update_descriptors 40517ec681f3Smrg- zink: combine resource barriers where possible during update_descriptors 40527ec681f3Smrg- zink: take struct zink_batch param instead of direct cmdbuf in barrier helpers 40537ec681f3Smrg- zink: assert batch is not in a renderpass when emitting pipeline barrier 40547ec681f3Smrg- zink: add barriers for index and draw param buffers 40557ec681f3Smrg- zink: add access param for image resource barriers 40567ec681f3Smrg- zink: add access info for update_descriptor image barriers 40577ec681f3Smrg- zink: add batch references for resources in clear functions 40587ec681f3Smrg- zink: improve barrier usage for clear functions 40597ec681f3Smrg- zink: zink_resource_barrier -> zink_resource_image_barrier 40607ec681f3Smrg- zink: add general zink_resource_barrier() wrapper 40617ec681f3Smrg- zink: be more explicit with image barriers for copy operations 40627ec681f3Smrg- zink: fix surface creation for cube slices 40637ec681f3Smrg- zink: tag some missing ES features 40647ec681f3Smrg- zink: update relnotes 40657ec681f3Smrg- zink: just call context destructor on creation fail 40667ec681f3Smrg- zink: add buffer barriers for resource_copy_region 40677ec681f3Smrg- zink: break out buffer copying into util function with batch param 40687ec681f3Smrg- zink: just end the current renderpass in zink_batch_no_rp() 40697ec681f3Smrg- zink: break out even more of zink_blit state saving 40707ec681f3Smrg- zink: use vkGetFenceStatus when we're obviously checking for status 40717ec681f3Smrg- zink: fix buffer resource usage flags 40727ec681f3Smrg- zink: break out query result buffer copying into util function 40737ec681f3Smrg- zink: simplify some of the qbo direct buffer write code 40747ec681f3Smrg- zink: better handling for availability queries on qbos when query/resource is busy 40757ec681f3Smrg- zink: improve batch flushing for queries when compute batches are involved 40767ec681f3Smrg- zink: always use 64bit flag for query results 40777ec681f3Smrg- zink: handle scissor+viewport states dynamically if extension is available 40787ec681f3Smrg- zink: remove 'scissors' member of viewport state 40797ec681f3Smrg- zink: always set VK_BUFFER_USAGE_UNIFORM_TEXEL_BUFFER_BIT for non-staging resources 40807ec681f3Smrg- zink: add available|visible masks to all barriers in ntv 40817ec681f3Smrg- zink: set conformant ubo/ssbo size limits 40827ec681f3Smrg- zink: destroy renderpass objects on context destroy 40837ec681f3Smrg- zink: rename 'has_draw' flag on batches and set it when the batch is used 40847ec681f3Smrg- zink: move gfx pipeline creation closer to the bind point 40857ec681f3Smrg- zink: only reset pipeline hash conditionally when updating fb state 40867ec681f3Smrg- zink: simplify barrier usage 40877ec681f3Smrg- zink: beef up zink_transfer_flush_region 40887ec681f3Smrg- zink: only wait on last write-batch for resources during transfer_map 40897ec681f3Smrg- zink: change some transfer_map cases of waiting on cs batch to flushing cs 40907ec681f3Smrg- zink: handle PIPE_TRANSFER_DISCARD_WHOLE_RESOURCE in transfer_map for buffers 40917ec681f3Smrg- zink: update ci results 40927ec681f3Smrg- zink: disable some builtin-gl-sample-mask sample shading tests on ci 40937ec681f3Smrg- zink: actually disable sample mask tests on ci 40947ec681f3Smrg- lavapipe: fix color-only renderpass clears 40957ec681f3Smrg- zink: ralloc the main context 40967ec681f3Smrg- zink: create framebuffer and renderpass objects just before vkCmdBeginRenderPass() 40977ec681f3Smrg- zink: defer pipe_context::clear calls when not currently in a renderpass 40987ec681f3Smrg- zink: also defer fb clears when conditional render is active 40997ec681f3Smrg- zink: break out region overlap testing function into helper 41007ec681f3Smrg- zink: add helper for converting pipe_box -> u_rect 41017ec681f3Smrg- zink: add another helper for checking whether one rect covers another 41027ec681f3Smrg- zink: break out fb clear apply into helper function 41037ec681f3Smrg- zink: add helper for applying/discarding clears based on a rect 41047ec681f3Smrg- zink: discard pending clears during blit/copy if we'll overwrite the data 41057ec681f3Smrg- zink: add yet another clear helper, this time for applying overlap regions 41067ec681f3Smrg- zink: optimize the remaining read cases of applying pending clear calls 41077ec681f3Smrg- zink: move all the clear stuff to zink_clear.h 41087ec681f3Smrg- zink: always do full-fb clears in renderpass begin when possible 41097ec681f3Smrg- zink: ci changes 41107ec681f3Smrg- zink: improve descriptor set oom handling 41117ec681f3Smrg- zink: ci updates 41127ec681f3Smrg- zink: set PIPE_CAP_TEXTURE_BORDER_COLOR_QUIRK 41137ec681f3Smrg- zink: force 128 fs input components on intel drivers 41147ec681f3Smrg- zink: add some spirv builder functions for spec constants 41157ec681f3Smrg- zink: support gl_LocalGroupSize 41167ec681f3Smrg- zink: add more ci flakes 41177ec681f3Smrg- util/bitscan: add u_foreach_bit macros 41187ec681f3Smrg- v3dv: remove for_each_bit() macro 41197ec681f3Smrg- radv: for_each_bit -> foreach_bit 41207ec681f3Smrg- freedreno/vulkan: for_each_bit -> foreach_bit 41217ec681f3Smrg- anv: for_each_bit -> foreach_bit 41227ec681f3Smrg- zink: use 0 as default for spec constants 41237ec681f3Smrg- zink: no-op descriptor updating for draws without descriptors 41247ec681f3Smrg- nir/texcoord_replace: add a yinvert param 41257ec681f3Smrg- zink: store prim mode to context during draw 41267ec681f3Smrg- zink: handle point sprite 41277ec681f3Smrg- zink: ci updates 41287ec681f3Smrg- zink: avoid memset during update_descriptors() for resources refs 41297ec681f3Smrg- zink: move samplerview referencing around in update_descriptors() 41307ec681f3Smrg- zink: reorder zink_bind_vertex_buffers() 41317ec681f3Smrg- zink: create a single fence per batch on startup and then reuse 41327ec681f3Smrg- zink: only flush batches in pipe_context::flush if they actually have work 41337ec681f3Smrg- zink: add a define for compute batch count 41347ec681f3Smrg- zink: add util function for returning previous batch 41357ec681f3Smrg- zink: handle PIPE_FLUSH_DEFERRED 41367ec681f3Smrg- zink: handle VK_IMAGE_LAYOUT_PRESENT_SRC_KHR barriers 41377ec681f3Smrg- zink: set VK_IMAGE_LAYOUT_PRESENT_SRC_KHR on fb resources at eof flush 41387ec681f3Smrg- zink: setup CmdBindVertexBuffers2EXT member in screen for dynamic state 41397ec681f3Smrg- zink: make dynamic state usage in pipeline creation more explicit/flexible 41407ec681f3Smrg- zink: use dynamic vertex buffer strides 41417ec681f3Smrg- zink: rename zink_context::buffers -> vertex_buffers (and usage mask) 41427ec681f3Smrg- zink: add zink_program struct as a base class for compute/gfx structs 41437ec681f3Smrg- zink: use zink_program in zink_batch_reference_program() 41447ec681f3Smrg- zink: ralloc zink program structs 41457ec681f3Smrg- zink: unref programs last in batch reset 41467ec681f3Smrg- zink: properly size descriptorset layout binding stack array 41477ec681f3Smrg- zink: increment batch->descs_used during update_descriptors flushing 41487ec681f3Smrg- zink: do batch-program tracking after possibly cycling batch in update_descriptors() 41497ec681f3Smrg- zink: add spirv builder methods for OpImageQueryLevels 41507ec681f3Smrg- zink: hook up nir_texop_query_levels 41517ec681f3Smrg- zink: relax tessellation shader reqs 41527ec681f3Smrg- zink: ci updates 41537ec681f3Smrg- zink: fix dynamic bo lowering for ssbo stores 41547ec681f3Smrg- zink: pre-fetch all format properties during screen init 41557ec681f3Smrg- zink: use pre-fetched format properties everywhere 41567ec681f3Smrg- zink: don't start renderpasses during descriptor update 41577ec681f3Smrg- zink: add more usage bits for buffer resource creation 41587ec681f3Smrg- zink: handle null src for fb refs 41597ec681f3Smrg- zink: track all framebuffers per batch 41607ec681f3Smrg- zink: store total memory size on zink_screen 41617ec681f3Smrg- zink: track resource mem usage per batch 41627ec681f3Smrg- zink: force batch flush if batches are using more than 1/10 total system memory 41637ec681f3Smrg- mesa/st: clamp scissored clear regions to fb size 41647ec681f3Smrg- mesa/st: no-op scissored clear calls with size zero 41657ec681f3Smrg- zink: handle GLSL_SAMPLER_DIM_EXTERNAL in ntv 41667ec681f3Smrg- zink: ci updates 41677ec681f3Smrg- mesa/st: even better no-oping for clears 41687ec681f3Smrg- zink: apply only the pending zs clear bits during deferred clears 41697ec681f3Smrg- zink: enable PIPE_CAP_CLEAR_SCISSORED 41707ec681f3Smrg- zink: export PIPE_CAP_TGSI_VS_LAYER_VIEWPORT 41717ec681f3Smrg- zink: use staging resource for write transfer_map in order to not stall 41727ec681f3Smrg- zink: ci updates 41737ec681f3Smrg- zink: rewrite macro for getting KHR device functions 41747ec681f3Smrg- zink: add vk/spirv caps/extension for shader LAYER variable 41757ec681f3Smrg- zink: remove ntv streamout assert 41767ec681f3Smrg- zink: fix streamout emission for super-enhanced layouts 41777ec681f3Smrg- zink: fix slot mapping for fat io variables 41787ec681f3Smrg- zink: fix location usage for explicit xfb outputs 41797ec681f3Smrg- zink: run more nir passes for tess shaders 41807ec681f3Smrg- zink: stop allocating xfb slot map 41817ec681f3Smrg- zink: handle direct xfb output from output variables 41827ec681f3Smrg- zink: evaluate existing slot map during program init and force new map as needed 41837ec681f3Smrg- zink: rename variable in update_so_info() 41847ec681f3Smrg- zink: use info.has_transform_feedback_varyings to determine xfb enablement 41857ec681f3Smrg- zink: pass so_info directly to update_so_info() 41867ec681f3Smrg- zink: use slightly stricter check for update_so_info() callsite 41877ec681f3Smrg- zink: only export necessary xfb outputs to ntv 41887ec681f3Smrg- zink: don't pass so_info to ntv at all unless it's necessary 41897ec681f3Smrg- zink: unref ctx->framebuffer on context destroy 41907ec681f3Smrg- zink: fix instance/device versioning (for real this time) 41917ec681f3Smrg- zink: simplify some update_descriptor code 41927ec681f3Smrg- zink: move descriptor sets/pools from batches to programs 41937ec681f3Smrg- zink: store and reuse descriptorsets after batch completion 41947ec681f3Smrg- zink: move descriptor set alloc function to zink_program.c 41957ec681f3Smrg- zink: use more precise sizing for descriptor pools 41967ec681f3Smrg- zink: add helper function for cycling a batch 41977ec681f3Smrg- zink: even better handling for descriptor oom 41987ec681f3Smrg- zink: remove flushes for batch descriptor use 41997ec681f3Smrg- zink: add bucket allocating for descriptor sets 42007ec681f3Smrg- zink: add scaling factor for descriptor set bucket allocations 42017ec681f3Smrg- zink: add caching for descriptor sets 42027ec681f3Smrg- zink: add second level cache for descriptor sets 42037ec681f3Smrg- zink: move streamout to draw_vbo 42047ec681f3Smrg- zink: reorder descriptor barrier applying during updating 42057ec681f3Smrg- zink: move surface refs to the end of descriptor updating 42067ec681f3Smrg- zink: split descriptor sets based on usage 42077ec681f3Smrg- zink: use dynamic offsets for first ubo 42087ec681f3Smrg- zink: introduce descriptor states 42097ec681f3Smrg- zink: add a null sampler view descriptor hash to the screen 42107ec681f3Smrg- zink: pre-hash sampler views and states 42117ec681f3Smrg- zink: store last-used descriptor set for each type of set for quick reuse 42127ec681f3Smrg- zink: actually flag all used resources as used during update_descriptors 42137ec681f3Smrg- zink: add program pointer to desc set struct 42147ec681f3Smrg- zink: move descriptor set allocation near the top of update_descriptors 42157ec681f3Smrg- zink: only batch-reference the program in use once per descriptor update 42167ec681f3Smrg- zink: improve descriptor cache invalidation 42177ec681f3Smrg- zink: add flag for recycled descriptor sets 42187ec681f3Smrg- zink: don't double iterate all the per-batch sets on reset 42197ec681f3Smrg- zink: add VkPipelineLayout to zink_program meta struct 42207ec681f3Smrg- zink: split out ubo descriptor updating 42217ec681f3Smrg- zink: break out ssbo descriptor updating 42227ec681f3Smrg- zink: break out sampler descriptor updating 42237ec681f3Smrg- zink: break out image descriptor updating 42247ec681f3Smrg- zink: deduplicate VkWriteDescriptorSet setup 42257ec681f3Smrg- zink: break out descriptor stuff into new files 42267ec681f3Smrg- zink: break out all the descriptor pool/layout stuff into a new struct 42277ec681f3Smrg- zink: change program pointer on struct zink_descriptor_set to pool pointer 42287ec681f3Smrg- zink: track number of sets currently allocated per descriptor pool 42297ec681f3Smrg- zink: move descriptor type to pool object from set 42307ec681f3Smrg- zink: allow reuse of zink_descriptor_pools between programs 42317ec681f3Smrg- zink: remove intermediate func for descriptor set getting 42327ec681f3Smrg- zink: simplify check for knowing whether descriptor updating is needed 42337ec681f3Smrg- zink: pre-size descriptor transition hash table 42347ec681f3Smrg- zink: move descriptor binding out of the update codepath 42357ec681f3Smrg- zink: reuse descriptor barriers across draws 42367ec681f3Smrg- zink: track resource count on descriptor pool object 42377ec681f3Smrg- zink: directly use resource count from pool instead of accumulating every time 42387ec681f3Smrg- zink: remove struct zink_descriptor_resource from descriptor updating 42397ec681f3Smrg- zink: don't create descriptor barrier hash tables for cached descriptor set 42407ec681f3Smrg- zink: always use VK_IMAGE_LAYOUT_SHADER_READ_ONLY_OPTIMAL for sampler bindings 42417ec681f3Smrg- zink: remove image layouts from descriptor states 42427ec681f3Smrg- zink: avoid hashing states without descriptors 42437ec681f3Smrg- zink: fix arrays of texel buffer descriptors 42447ec681f3Smrg- zink: ci updates 42457ec681f3Smrg- zink: move vertex_buffers_enabled_mask to non-hashed part of pipeline state 42467ec681f3Smrg- zink: optimize pipeline hashing 42477ec681f3Smrg- zink: implement an extremely dumb resource memory cache 42487ec681f3Smrg- zink: ci updates 42497ec681f3Smrg- zink: remove debug printf 42507ec681f3Smrg- lavapipe: fix push descriptor set indexing 42517ec681f3Smrg- lavapipe: set more resource bind flags using image/buffer usage bits 42527ec681f3Smrg- zink: move buffer<->image copying to pipe_context::resource_copy_region hook 42537ec681f3Smrg- zink: remove extraneous flush in transfer_map_region_flush 42547ec681f3Smrg- zink: ci updates 42557ec681f3Smrg- zink: optimize resource usage tracking 42567ec681f3Smrg- zink: use _mesa_set_search_and_add() for set management 42577ec681f3Smrg- zink: ralloc screen objects 42587ec681f3Smrg- zink: implement a surface cache 42597ec681f3Smrg- zink: use a safer iteration for fb surfaces during rp init 42607ec681f3Smrg- zink: determine whether the vulkan driver requires mesa flush wsi 42617ec681f3Smrg- zink: force mesa flush implicit fencing on ANV 42627ec681f3Smrg- zink: force explicit fence only on first frame flush 42637ec681f3Smrg- zink: use internal api for first-frame fence 42647ec681f3Smrg- zink: return true from program ref functions upon free 42657ec681f3Smrg- zink: unset ctx->program pointers when an unref destroys the object 42667ec681f3Smrg- zink: stop leaking programs 42677ec681f3Smrg- vk/util: add unified shader module struct/functions 42687ec681f3Smrg- vk/util: add a util macro for initializing stack vk_shader_module structs 42697ec681f3Smrg- lavapipe: use common interfaces for shader modules 42707ec681f3Smrg- radv: use common interfaces for shader modules 42717ec681f3Smrg- v3dv: use common interfaces for shader modules 42727ec681f3Smrg- tu: use common interfaces for shader modules 42737ec681f3Smrg- anv: use common interfaces for shader modules 42747ec681f3Smrg- zink: add batch usage flags for sampler views/states and desc sets 42757ec681f3Smrg- zink: avoid looping for non-ubo descriptor updates based on set usage 42767ec681f3Smrg- zink: break out batch id finding for resource usage into util function 42777ec681f3Smrg- zink: move resource internals to a separate struct 42787ec681f3Smrg- zink: split out backing resource object create/destroy 42797ec681f3Smrg- zink: track resource_object usage instead of resource usage 42807ec681f3Smrg- zink: handle cached descriptor set punting 42817ec681f3Smrg- zink: add some asserts for pipeline barriers to check renderpass state 42827ec681f3Smrg- zink: add util function for checking whether a shader descriptor is a buffer 42837ec681f3Smrg- zink: add util function for checking whether shader descriptor is buffer from program 42847ec681f3Smrg- zink: use an explicit zink_buffer_view struct 42857ec681f3Smrg- zink: explicitly use zink_surface objects for sampler/image view objects 42867ec681f3Smrg- zink: store bufferview hash to bufferview struct 42877ec681f3Smrg- zink: simplify bufferview and imageview descriptor state hashing 42887ec681f3Smrg- zink: add extra batch tracking for sampler views 42897ec681f3Smrg- zink: massively beef up batch tracking for shader images 42907ec681f3Smrg- zink: add helper function for getting a resource for a descriptor 42917ec681f3Smrg- zink: update null descriptor hashes to reflect current descriptor states 42927ec681f3Smrg- zink: improve debug asserts for samplers/images during descriptor updates 42937ec681f3Smrg- zink: properly handle null bufferview descriptor states 42947ec681f3Smrg- zink: cache bufferviews 42957ec681f3Smrg- zink: add missing null check 42967ec681f3Smrg- zink: unset last_vertex_stage_dirty after applying it 42977ec681f3Smrg- zink: run nir_convert_from_ssa last during compile 42987ec681f3Smrg- zink: use intermediate var for glsl non-array type during shader create 42997ec681f3Smrg- zink: break out bo array type construction into ntv util function 43007ec681f3Smrg- zink: also break out whole ntv bo struct pointer construction 43017ec681f3Smrg- zink: add unsized array type to get_glsl_type_element() handling 43027ec681f3Smrg- zink: add debug info about missing atomic ops 43037ec681f3Smrg- zink: add ntv util function for checking if a glsl type is an atomic counter 43047ec681f3Smrg- zink: break out sized uint array construction into util function 43057ec681f3Smrg- zink: flatten binding numbers a bit 43067ec681f3Smrg- zink: directly set nir variable bindings for reuse during ntv 43077ec681f3Smrg- zink: move zink_binding() to compiler.c 43087ec681f3Smrg- zink: apply Delete All The Code methodology to the ubo/ssbo variables 43097ec681f3Smrg- zink: set ntv variable descriptor sets during compile phase 43107ec681f3Smrg- zink: ci updates 43117ec681f3Smrg- ci/panfrost: disable the rest of these jobs temporarily 43127ec681f3Smrg- zink: hook up resource bind history 43137ec681f3Smrg- zink: remove direct samplerview batch-tracking 43147ec681f3Smrg- zink: add a pipe_context::invalidate_resource hook 43157ec681f3Smrg- zink: set valid region for streamout buffers on bind 43167ec681f3Smrg- zink: handle streamout buffer rebinds 43177ec681f3Smrg- zink: invalidate resources on map when discarding range 43187ec681f3Smrg- zink: enable PIPE_CAP_INVALIDATE_BUFFER 43197ec681f3Smrg- zink: switch to deqp-runner for piglit jobs 43207ec681f3Smrg- zink: always use requested format for sampler view creation 43217ec681f3Smrg- zink: ci updates 43227ec681f3Smrg- zink: more consolidation for null sampler/image view hashing 43237ec681f3Smrg- zink: add a pipe_context::fence_server_sync hook 43247ec681f3Smrg- zink: add enum for different queues 43257ec681f3Smrg- zink: refactor resource_sync_writes_from_batch_usage() to manage batch id internally 43267ec681f3Smrg- zink: convert ZINK_RESOURCE_ACCESS defines to enum 43277ec681f3Smrg- zink: abstract zink_get_resource_usage() and move it to be internal 43287ec681f3Smrg- zink: return enum zink_queue from zink_batch_reference_resource_rw() 43297ec681f3Smrg- zink: split out batch resource-set clearing into separate function 43307ec681f3Smrg- zink: move active query pruning to batch reset 43317ec681f3Smrg- zink: move batch init into zink_batch.c 43327ec681f3Smrg- zink: also move batch destructor into zink_batch.c 43337ec681f3Smrg- zink: move other batch-tracking implementations to unified codepath 43347ec681f3Smrg- zink: use macro to streamline batch struct member init 43357ec681f3Smrg- zink: remove query batch-tracking init from begin_query() 43367ec681f3Smrg- zink: move fence reset to zink_fence_init() 43377ec681f3Smrg- zink: clear framebuffer state on context destroy 43387ec681f3Smrg- zink: enable spirv extension for post depth coverage 43397ec681f3Smrg- compiler/spirv: fix image sample queries 43407ec681f3Smrg- zink: handle nir_intrinsic_image_deref_samples 43417ec681f3Smrg- zink: flatten 2d_array surfaces when necessary 43427ec681f3Smrg- lavapipe: support VK_KHR_copy_commands2 43437ec681f3Smrg- lavapipe: rewrite cmdbufs to always do descriptor binds/pushes first 43447ec681f3Smrg- lavapipe: force state updates when beginning queries 43457ec681f3Smrg- llvmpipe/setup: force fs constant updating upon beginning queries 43467ec681f3Smrg- zink: break out surface viewtype clamping into util function 43477ec681f3Smrg- zink: improve surface viewtype clamping 43487ec681f3Smrg- zink: correctly clamp samplerview surface types 43497ec681f3Smrg- ci/lavapipe: split out lavapipe ci into lavapipe dir 43507ec681f3Smrg- llvmpipe/setup: use bigger hammer to force fs constant updating correctly 43517ec681f3Smrg- zink: split off a bunch of batch struct members to new batch state struct 43527ec681f3Smrg- zink: rewrite queue dispatch to use monotonic batch ids instead of hardcoded ones 43537ec681f3Smrg- zink: more accurately check samplecount caps for shader images 43547ec681f3Smrg- zink: make fb ref func return bool on free 43557ec681f3Smrg- zink: add explicit surface/bufferview batch-tracking functions 43567ec681f3Smrg- zink: use surface references for fb attachments 43577ec681f3Smrg- zink: break out surface destroy function into a screen function 43587ec681f3Smrg- zink: use a custom surface referencing function whenever unrefing a surface 43597ec681f3Smrg- zink: implement a global framebuffer cache 43607ec681f3Smrg- vk: consolidate dynamic descriptor binding sorting 43617ec681f3Smrg- ci: update xfails for ppc64le and s390x 43627ec681f3Smrg- zink: break out buffer mapping part of zink_transfer_map 43637ec681f3Smrg- zink: cache transfer maps 43647ec681f3Smrg- zink: unify clear color conversion code 43657ec681f3Smrg- nir: add nir_lower_indirect_builtin_uniform_derefs() 43667ec681f3Smrg- st/glsl_to_nir: lower indirect derefs of builtins in non-packed uniform case 43677ec681f3Smrg- softpipe: ci updates 43687ec681f3Smrg- zink: move 'batch_id' and 'is_compute' members to fence 43697ec681f3Smrg- zink: make batch usage unsetting function public 43707ec681f3Smrg- zink: always reset batch states when finding a new one 43717ec681f3Smrg- zink: move batch-tracked resources to fence object 43727ec681f3Smrg- zink: fix spirv image operand ordering 43737ec681f3Smrg- zink: fix multisampled shader image load/store 43747ec681f3Smrg- zink: force PIPE_SWIZZLE_1 for X channels in samplerviews 43757ec681f3Smrg- zink: handle blitting of color formats with ignored alpha channels 43767ec681f3Smrg- zink: emulate PIPE_FORMAT_R8G8B8X8_UNORM 43777ec681f3Smrg- zink: ci updates 43787ec681f3Smrg- zink: relax unreachable() to debug_printf when waiting on batch 43797ec681f3Smrg- zink: rework public batch flush function to be useful again 43807ec681f3Smrg- zink: move zink_flush_compute() users to zink_flush_queue() 43817ec681f3Smrg- zink: always flag xfb barrier on gfx flush when appropriate 43827ec681f3Smrg- zink: simplify some queue-related query code 43837ec681f3Smrg- zink: refactor clears a little to track a bitfield of enabled clears on the context 43847ec681f3Smrg- zink: trigger pending clears during flush 43857ec681f3Smrg- zink: ci updates 43867ec681f3Smrg- zink: add wrapper to reset batch state structs 43877ec681f3Smrg- zink: call clear() instead of reset() for batch states on context destroy 43887ec681f3Smrg- zink: unify gfx and compute batches 43897ec681f3Smrg- zink: isolate gfx stage bits when updating shader modules 43907ec681f3Smrg- zink: store conditional render predicate to query and split out start/stop 43917ec681f3Smrg- zink: only update conditional render buffer when it needs to be updated 43927ec681f3Smrg- zink: toggle conditional render when beginning/ending a renderpass 43937ec681f3Smrg- zink: ci updates 43947ec681f3Smrg- zink: handle gallium multi draws more effectively 43957ec681f3Smrg- zink: create separate upload mgr for constants 43967ec681f3Smrg- zink: explicitly use stream uploader for staging buffers 43977ec681f3Smrg- zink: add buffer_subdata hook 43987ec681f3Smrg- zink: avoid unnecessary resource refs during descriptor update 43997ec681f3Smrg- zink: remove handling for resource flushing between compute/gfx batches 44007ec681f3Smrg- zink: remove unnecessary flush during image maps 44017ec681f3Smrg- zink: add more rp cache asserts 44027ec681f3Smrg- compiler/spirv: use undefs when extending image coords 44037ec681f3Smrg- zink: don't generate sampled image type for non-sampled images 44047ec681f3Smrg- util/set: stop leaking u32 key sets which pass a mem ctx 44057ec681f3Smrg- lavapipe: fix CmdCopyQueryPoolResults for partial pipeline statistics queries 44067ec681f3Smrg- lavapipe: use the passed offset for CmdCopyQueryPoolResults 44077ec681f3Smrg- lavapipe: stop tracking draw start/count on rendering state 44087ec681f3Smrg- zink: ci updates 44097ec681f3Smrg- lavapipe: ignore templateType when descriptor template isn't for push descriptors 44107ec681f3Smrg- lavapipe: remove lvp_descriptor_update_template::descriptor_set_layout 44117ec681f3Smrg- zink: fix handling for image types in resource_copy_region hook 44127ec681f3Smrg- zink: also fix image buffer layer copying 44137ec681f3Smrg- lavapipe: fix array texture region copies 44147ec681f3Smrg- zink: only do shader updates when relevant stages are dirty 44157ec681f3Smrg- zink: use correct surface ref function for context destroy 44167ec681f3Smrg- zink: stall when we start getting a lot of uncompleted batches 44177ec681f3Smrg- zink: reset all fences when waiting on batch state 44187ec681f3Smrg- zink: fix format support detection for storage texel buffers and shader images 44197ec681f3Smrg- zink: break out image/buffer create info structs into helper funcs 44207ec681f3Smrg- zink: make descriptor state invalidate public 44217ec681f3Smrg- zink: reorder barrier util functions to set up barrier struct before batch 44227ec681f3Smrg- zink: break out barrier struct initializing into helper funcs 44237ec681f3Smrg- zink: create separate vk image/buffer objects for shader image use 44247ec681f3Smrg- zink: incrementally add image usage flags based on device caps 44257ec681f3Smrg- zink: add color output bit and/or use linear tiling for sampled images 44267ec681f3Smrg- zink: check image format props before creating image 44277ec681f3Smrg- zink: toggle between linear/optimal tiling during image creation 44287ec681f3Smrg- zink: flatten out buffer creation usage flags codepath 44297ec681f3Smrg- zink: ralloc shader cache and keys 44307ec681f3Smrg- zink: rework border color handling 44317ec681f3Smrg- zink: clean up query creation failure paths 44327ec681f3Smrg- zink: create result buffers for all query streams 44337ec681f3Smrg- zink: remove flush from query buffer copy 44347ec681f3Smrg- zink: manually handle more bool query types for copying 44357ec681f3Smrg- zink: remove special casing for occlusion qbos 44367ec681f3Smrg- zink: rewrite query internals 44377ec681f3Smrg- zink: bump pools up to 5k queries each 44387ec681f3Smrg- zink: don't use PARTIAL bit for query results with time queries 44397ec681f3Smrg- zink: reorder availability handling for (user) qbos 44407ec681f3Smrg- zink: remove explicit fencing for query results 44417ec681f3Smrg- zink: ci updates 44427ec681f3Smrg- lavapipe: refactor base draw dispatch to handle multidraws 44437ec681f3Smrg- lavapipe: refactor indexed draw dispatch to handle multidraws 44447ec681f3Smrg- aux/draw: stop copying draw params unnecessarily 44457ec681f3Smrg- aux/draw: rewrite PRIM_RESTART_LOOP macro as a function 44467ec681f3Smrg- aux/draw: pass the full draw params through to draw_instances() 44477ec681f3Smrg- aux/draw: pass the full draw params through to draw_pt_arrays_restart() 44487ec681f3Smrg- aux/draw: move draw param sanitization to end of function 44497ec681f3Smrg- aux/draw: track increment_draw_id value from draw info 44507ec681f3Smrg- aux/draw: pass full draw params to draw_pt_arrays() 44517ec681f3Smrg- llvmpipe: stop flattening multidraws 44527ec681f3Smrg- lavapipe: ignore unused clearvalues when beginning renderpass 44537ec681f3Smrg- zink: rework texture_barrier hook 44547ec681f3Smrg- zink: move update_descriptors & related funcs to zink_descriptors.c 44557ec681f3Smrg- zink: move descriptor barrier handling to main update function 44567ec681f3Smrg- zink: simplify some descriptor update function parameters 44577ec681f3Smrg- zink: use GENERAL layout for sampler images that are also bound as shader images 44587ec681f3Smrg- zink: rework some includes 44597ec681f3Smrg- zink: rework memory_barrier hook 44607ec681f3Smrg- zink: add locking for descriptor pools 44617ec681f3Smrg- zink: add locking for resource maps 44627ec681f3Smrg- zink: manually invoke cpu detection during screen init 44637ec681f3Smrg- zink: add locking for batch states 44647ec681f3Smrg- zink: add function for checking whether a batch is done 44657ec681f3Smrg- zink: split fence finish func 44667ec681f3Smrg- zink: add locking for fence resources 44677ec681f3Smrg- zink: explicitly reset a couple more batch state members 44687ec681f3Smrg- zink: assume fence has already completed if a batch state isn't found 44697ec681f3Smrg- zink: rename init_batch_state to get_batch_state 44707ec681f3Smrg- zink: store context to batch state 44717ec681f3Smrg- zink: make a local screen pointer in zink_flush 44727ec681f3Smrg- zink: remove zink_fence_init() 44737ec681f3Smrg- zink: move VkQueue to batch object 44747ec681f3Smrg- zink: break out queue submit into separate functions 44757ec681f3Smrg- zink: also check for device lost reset on flush 44767ec681f3Smrg- zink: remove zink_create_fence() 44777ec681f3Smrg- zink: track coherent resource objects 44787ec681f3Smrg- zink: use cached memory for all resources when possible 44797ec681f3Smrg- radv: stop zeroing radv_draw_info during draw 44807ec681f3Smrg- radv: refactor draw dispatch 44817ec681f3Smrg- radv: track whether gl_BaseInstance is used 44827ec681f3Smrg- radv: simplify vs draw param counting during setup 44837ec681f3Smrg- radv: set gfx pipeline vtx_emit_num to the number of sgprs 44847ec681f3Smrg- radv: track whether drawid is used on the pipeline struct 44857ec681f3Smrg- radv: track whether baseinstance is used on the pipeline struct 44867ec681f3Smrg- radv: break out vertex shader param emission into separate function 44877ec681f3Smrg- radv: make vertex param sgpr count more explicit 44887ec681f3Smrg- radv: reorder vertex shader params 44897ec681f3Smrg- radv: don't emit baseinstance and drawid if neither is used 44907ec681f3Smrg- radv: don't reset vertex state params on pipeline bind if reg layout matches 44917ec681f3Smrg- zink: implement threaded context 44927ec681f3Smrg- zink: ci updates 44937ec681f3Smrg- zink: handle PIPE_MAP_DONTBLOCK for buffer read maps 44947ec681f3Smrg- zink: add set_context_param hook 44957ec681f3Smrg- zink: add batch tracking id for program struct 44967ec681f3Smrg- zink: track last completed batch id to optimize checking states 44977ec681f3Smrg- zink: handle expired deferred fences more reasonably 44987ec681f3Smrg- zink: hook up timeline semaphore signalling during batch submission 44997ec681f3Smrg- zink: add timeline semaphore fastpath for checking/triggering batch completion 45007ec681f3Smrg- zink: optimize batch states for timeline use 45017ec681f3Smrg- zink: enforce device lost status 45027ec681f3Smrg- zink: be more explicit about blit layer/depth usage 45037ec681f3Smrg- zink: use VkSubresourceLayout::depthPitch as layer_stride when mapping 3D imgs 45047ec681f3Smrg- zink: zink_push_constant -> zink_gfx_push_constant 45057ec681f3Smrg- zink: use max_rt to determine number of blend state attachments 45067ec681f3Smrg- zink: emit ImageCubeArray cap when accessing arrayed cube dimension images 45077ec681f3Smrg- zink: fix layercount for array texture blits 45087ec681f3Smrg- zink: add some asserts to avoid zero-sized blit regions 45097ec681f3Smrg- features: mark off ARB_compute_variable_group_size for zink 45107ec681f3Smrg- features: mark off GL_OES_viewport_array for zink 45117ec681f3Smrg- zink: store shader_info to ntv_context struct 45127ec681f3Smrg- zink: only emit SpvCapabilitySampleMaskPostDepthCoverage if the mode is set 45137ec681f3Smrg- zink: enable PIPE_CAP_TGSI_TES_LAYER_VIEWPORT 45147ec681f3Smrg- features: mark off ARB_shader_viewport_layer_array for zink 45157ec681f3Smrg- zink: avoid cached memory allocations when not requested 45167ec681f3Smrg- util/threaded_context: support pipe_context::set_sample_locations 45177ec681f3Smrg- zink: hook up cs push constant for nir_intrinsic_load_work_dim 45187ec681f3Smrg- zink: use better usage flags for staging resources 45197ec681f3Smrg- zink: use vkGetPhysicalDeviceFormatProperties2 when available 45207ec681f3Smrg- zink: use 2 variant to check image format props during create 45217ec681f3Smrg- zink: only use host mem for staging resources with linear tiling 45227ec681f3Smrg- zink: move cmdpool reset to batch state reset 45237ec681f3Smrg- zink: split total_mem off to total_video_mem, use total_mem for tc 45247ec681f3Smrg- zink: relax maybe_flush mem threshold 45257ec681f3Smrg- zink: relax maybe_flush batch count threshold 45267ec681f3Smrg- zink: check last_finished first in fence_finish early out case 45277ec681f3Smrg- zink: defer timestamp query pool resets to end_query 45287ec681f3Smrg- zink: reset queries when suspending if >50% of total pool is used 45297ec681f3Smrg- zink: don't use cached mem for staging resources 45307ec681f3Smrg- zink: flag DYNAMIC resources as coherent 45317ec681f3Smrg- zink: drop VK_PIPELINE_CREATE_DISABLE_OPTIMIZATION_BIT from compute path 45327ec681f3Smrg- aux/trace: add a set_inlinable_constants hook 45337ec681f3Smrg- intel: avoid dumping null cs sampler/binding states 45347ec681f3Smrg- zink: emit WorkgroupSize when not using ExecutionModeLocalSize 45357ec681f3Smrg- lavapipe: add some asserts for blit region extents 45367ec681f3Smrg- zink: export PIPE_CAP_PREFER_BLIT_BASED_TEXTURE_TRANSFER 45377ec681f3Smrg- aux/trace: add screen deduplication for zink+lavapipe tracing 45387ec681f3Smrg- aux/trace: add a bunch of methods for lavapipe 45397ec681f3Smrg- util/set: add macro for destructively iterating set entries 45407ec681f3Smrg- util/hash_table: add macro for destructively iterating entries 45417ec681f3Smrg- aux/trace: add GALLIUM_TRACE_TRIGGER mode 45427ec681f3Smrg- zink: add a pipe_screen::finalize_nir hook 45437ec681f3Smrg- zink: implement uniform inlining 45447ec681f3Smrg- zink: add env var to force uniform inlining 45457ec681f3Smrg- zink: remove atomic usage from batch tracking comparisons 45467ec681f3Smrg- zink: bypass separate stencil path in resource_reference_rw when not a zs image 45477ec681f3Smrg- zink: fix conditional when assigning tess variable io 45487ec681f3Smrg- zink: stop unmapping resources 45497ec681f3Smrg- zink: simplify clear-apply on fb state change 45507ec681f3Smrg- zink: use set_foreach_remove() 45517ec681f3Smrg- zink: use explicit subpass deps 45527ec681f3Smrg- zink: hook up EXT_fragment_shader_interlock 45537ec681f3Smrg- zink: support ARB_fragment_shader_interlock 45547ec681f3Smrg- aux/trace: dump all the blend state members 45557ec681f3Smrg- features: mark off ARB_fragment_shader_interlock for zink 45567ec681f3Smrg- gallium/threaded_context: add another rule for buffer mapping 45577ec681f3Smrg- zink: fix CI flakiness in glx-multithread-clearbuffer 45587ec681f3Smrg- zink: make timeline semaphores per-screen 45597ec681f3Smrg- zink: handle checking batch completion from other contexts without timelines 45607ec681f3Smrg- zink: only unmap PIPE_MAP_ONCE in synchronous mode 45617ec681f3Smrg- zink: don't lose existing pNext when using wsi_image_create_info in image creation 45627ec681f3Smrg- anv: fix debugoptimized build compile 45637ec681f3Smrg- zink: move descriptor state management to descriptors.c 45647ec681f3Smrg- zink: make a bunch of descriptor functions static 45657ec681f3Smrg- zink: create separate linear tiling image for scanout 45667ec681f3Smrg- zink: flag anv for mesa image create wsi 45677ec681f3Smrg- zink: disable mutable formats for zs formats and scanout images 45687ec681f3Smrg- aux/trace: enhance trigger mode to dump context states during bind 45697ec681f3Smrg- aux/trace: dump current fb state on trigger-mode draw if it hasn't been seen yet 45707ec681f3Smrg- aux/trace: do deep dumps of fb state for triggered traces 45717ec681f3Smrg- aux/trace: use ralloc_free for ralloc()ed state pointers 45727ec681f3Smrg- zink: compare against screen batch id when determining which semaphore to use 45737ec681f3Smrg- zink: always copy the nir shader before compiling 45747ec681f3Smrg- zink: fix tcs slot map eval for user vars 45757ec681f3Smrg- zink: fix tcs input reservation for user vars 45767ec681f3Smrg- zink: merge copy-to-scanout path into non-deferred flush path 45777ec681f3Smrg- zink: force scanout sync when mapping scanout resource 45787ec681f3Smrg- zink: use undefined layout for first scanout obj transition 45797ec681f3Smrg- zink: move scanout sync to end of batch 45807ec681f3Smrg- zink: add a flag indicating whether scanout object needs updating 45817ec681f3Smrg- zink: move wsi flush info conditional to queue submission 45827ec681f3Smrg- zink: directly set batch->state->flush_res from flush_resource hook 45837ec681f3Smrg- zink: add clear-on-flush mechanic deeper into flush codepath 45847ec681f3Smrg- Revert "zink: force scanout sync when mapping scanout resource" 45857ec681f3Smrg- softpipe: fix render condition checking 45867ec681f3Smrg- softpipe: fix streamout queries 45877ec681f3Smrg- softpipe: ci updates 45887ec681f3Smrg- zink: track persistent resource objects, not resources 45897ec681f3Smrg- zink: restore previous semaphore (prev_sem) handling 45907ec681f3Smrg- zink: use cached memory for staging resources 45917ec681f3Smrg- zink: only reset query on suspend if the query has previously been stopped 45927ec681f3Smrg- zink: when performing an implicit reset, sync qbos 45937ec681f3Smrg 45947ec681f3SmrgNanley Chery (22): 45957ec681f3Smrg 45967ec681f3Smrg- gallium: Map _DRI_IMAGE_FORMAT_NONE to NULL 45977ec681f3Smrg- gallium: Flush GL API resources in eglCreateImage 45987ec681f3Smrg- iris: Disable aux as needed in iris_flush_resource 45997ec681f3Smrg- blorp: Assert 8x4 alignment for a HiZ op on Gen8-9 46007ec681f3Smrg- i965,iris: Delete misleading HiZ sampling comments 46017ec681f3Smrg- iris: Drop an XXX comment about sampling HiZ arrays 46027ec681f3Smrg- iris: Drop a stale comment about HiZ sampling 46037ec681f3Smrg- iris: Delete redundant assertion in iris_hiz_exec 46047ec681f3Smrg- iris: Drop batch param from iris_resource_prepare_render 46057ec681f3Smrg- iris: Fix the depth aspect aux usage in iris_blit 46067ec681f3Smrg- iris: Keep aux_usage in iris_blorp_surf_for_resource 46077ec681f3Smrg- iris: Fix aux usage of depth buffer prepare/finish 46087ec681f3Smrg- iris: Loosen aux state getter/setter assert on HiZ 46097ec681f3Smrg- iris: Don't avoid aux state getter/setter with HiZ 46107ec681f3Smrg- iris: Drop iris_resource::aux::has_hiz 46117ec681f3Smrg- iris: Call iris_sample_with_depth_aux earlier 46127ec681f3Smrg- iris: Set BO maps to NULL in bo_free 46137ec681f3Smrg- drm-uapi: Update drm_fourcc.h for new TGL modifier 46147ec681f3Smrg- isl: Describe I915_FORMAT_MOD_Y_TILED_GEN12_RC_CCS_CC 46157ec681f3Smrg- iris: Support clear color plane imports for RC_CCS_CC 46167ec681f3Smrg- iris: Support RC_CCS_CC modifier in plane queries 46177ec681f3Smrg- iris: Support I915_FORMAT_MOD_Y_TILED_GEN12_RC_CCS_CC 46187ec681f3Smrg 46197ec681f3SmrgNeha Bhende (3): 46207ec681f3Smrg 46217ec681f3Smrg- mesa: set states in fast path for restoring light attributes 46227ec681f3Smrg- gallium/u_vbuf: use updated pipe_draw_start_count while using draw_vbo 46237ec681f3Smrg- nir_to_tgsi: Fix indices for CMP in nir_to_tgsi for nir_op_fcsel 46247ec681f3Smrg 46257ec681f3SmrgPhilipp Zabel (1): 46267ec681f3Smrg 46277ec681f3Smrg- meson: Fix missing xcb-xrandr dependency for Vulkan X11 WSI 46287ec681f3Smrg 46297ec681f3SmrgPierre Moreau (4): 46307ec681f3Smrg 46317ec681f3Smrg- docs/features: Add OpenCL status 46327ec681f3Smrg- spirv: Ignore WorkgroupSize in non-compute stages 46337ec681f3Smrg- nv50: Replace hardcoded texture/constbuf count with define 46347ec681f3Smrg- nv50: Update texture indices to match stage indices 46357ec681f3Smrg 46367ec681f3SmrgPierre-Eric Pelloux-Prayer (55): 46377ec681f3Smrg 46387ec681f3Smrg- ac: add ifdef __cplusplus guard to header 46397ec681f3Smrg- radeonsi: invalidate compute sgprs in si_rebind_buffer 46407ec681f3Smrg- radeonsi: inhibit clockgating when using SQTT 46417ec681f3Smrg- ci: split src/mesa/\**/* matching rule 46427ec681f3Smrg- radeonsi/sqtt: use more event identifier 46437ec681f3Smrg- radeonsi/sqtt: fix SQTT bo size overflow 46447ec681f3Smrg- radeonsi/sqtt: allow AMD_THREAD_TRACE_TRIGGER to be a frame number 46457ec681f3Smrg- radeonsi/sqtt: forward string markers to sqtt 46467ec681f3Smrg- radeonsi: don't use cp_dma prefetch on GFX6 46477ec681f3Smrg- gallium/u_upload_mgr: lower risk of hitting an assert 46487ec681f3Smrg- radeonsi: fix indentation issue in si_texture.c 46497ec681f3Smrg- radeonsi: store si_context::xxx_shader members in union 46507ec681f3Smrg- radeonsi: fix read from compute / write from draw sync 46517ec681f3Smrg- radeonsi: fix si_check_render_feedback 46527ec681f3Smrg- radeonsi: replace force_cp_dma arg of si_clear_buffer by enum 46537ec681f3Smrg- radeonsi: enable dcc image stores on gfx10+ 46547ec681f3Smrg- radeonsi: force dcc clear to use compute clear 46557ec681f3Smrg- mesa: update vao _EnabledWithMapMode in copy_array_object 46567ec681f3Smrg- radeonsi: properly set SPI_SHADER_PGM_HI_ES 46577ec681f3Smrg- ac/rgp: make the max gap between shader code a warning 46587ec681f3Smrg- ac/rtld: make ac_rtld_upload returns the code size 46597ec681f3Smrg- ac/rgp: move radv/sqtt functions to ac 46607ec681f3Smrg- radeonsi/sqtt: keep a copy of the uploaded shader code 46617ec681f3Smrg- radeonsi/sqtt: remove duplicate token 46627ec681f3Smrg- radeonsi/sqtt: don't always use WGP 0 46637ec681f3Smrg- radeonsi/sqtt: export shader code to RGP 46647ec681f3Smrg- radeonsi/sqtt: fix user event max size 46657ec681f3Smrg- frontends/va: fix protected slice data buffer read size 46667ec681f3Smrg- mesa/st: fix lower_tex_src_plane in multiple samplers scenario 46677ec681f3Smrg- dlist: remove ListExt feature 46687ec681f3Smrg- mesa: remove 2 recursive lock usages of _mesa_HashTable 46697ec681f3Smrg- mesa/hash: make the mtx non-recursive 46707ec681f3Smrg- mesa/hash: switch to simple_mtx 46717ec681f3Smrg- mesa: make _mesa_HashTable InDeleteAll debug only 46727ec681f3Smrg- vbo/dlist: use DrawGallium(Complex) 46737ec681f3Smrg- nir/lower_tex: ignore texture_index if tex_instr has deref src 46747ec681f3Smrg- mesa/st: fix st_nir_lower_tex_src_plane arguments 46757ec681f3Smrg- mesa/st: ignore texture_index if tex_instr has deref src 46767ec681f3Smrg- gallium/u_threaded: split draws that don't fit in a batch 46777ec681f3Smrg- st/draw: remove st_draw_vbo 46787ec681f3Smrg- vbo: inline vbo_primitive_restart in brw_primitive_restart 46797ec681f3Smrg- radeonsi/rgp: export barriers 46807ec681f3Smrg- radeonsi/rgp: export compute shader programs 46817ec681f3Smrg- gallium/u_threaded: skip refcounting only once 46827ec681f3Smrg- driconf: add workarounds for Teardown 46837ec681f3Smrg- amdgpu,radeon: add needs_reset param to ctx_query_reset_status 46847ec681f3Smrg- radeonsi: submit cs to failed context instead of skipping them 46857ec681f3Smrg- radeonsi: use SI_CONTEXT_FLAG_AUX when recreating the aux context 46867ec681f3Smrg- radeonsi: do not recreate the aux context from the aux context 46877ec681f3Smrg- radeonsi: only recreate the aux_context when soft recovery failed 46887ec681f3Smrg- radeonsi: re-create the aux context in si_create_context 46897ec681f3Smrg- amdgpu,radeon: add full_reset_only param to ctx_query_reset_status 46907ec681f3Smrg- radeonsi: avoid querying gpu state if possible 46917ec681f3Smrg- r600/sb: Use assignments for resetting struct r600_sb::literal 46927ec681f3Smrg- driconf: add workaround for Golf With Friends 46937ec681f3Smrg 46947ec681f3SmrgQiang Yu (1): 46957ec681f3Smrg 46967ec681f3Smrg- lima: fix xserver page flip fail for full screen client 46977ec681f3Smrg 46987ec681f3SmrgRhys Perry (141): 46997ec681f3Smrg 47007ec681f3Smrg- nir/loop_unroll: unroll more aggressively if it can improve load scheduling 47017ec681f3Smrg- aco: fix convert_to_SDWA() check in add_subdword_definition() 47027ec681f3Smrg- aco: add test for incorrect convert_to_SDWA() check 47037ec681f3Smrg- radv: fix max_waves estimation on GFX10.3 47047ec681f3Smrg- aco: fix num_waves on GFX10+ 47057ec681f3Smrg- aco: have emit_wqm() take Builder instead of isel_context 47067ec681f3Smrg- aco: add emit_mimg() helper 47077ec681f3Smrg- aco: move VADDR to the end of the operand list 47087ec681f3Smrg- aco: use non-sequential addressing 47097ec681f3Smrg- aco: only require texture coordinates to be in WQM if NSA is used 47107ec681f3Smrg- aco: add affinity for non-sequential MIMG operands 47117ec681f3Smrg- radv,aco: don't use MUBUF for multi-channel loads on GFX8 with robustness2 47127ec681f3Smrg- nir/lower_io: fix array_length lowering if buffer is smaller than offset 47137ec681f3Smrg- radv,aco: use deref_buffer_array_length 47147ec681f3Smrg- radv: use nir_opt_access 47157ec681f3Smrg- nir/sink,nir/move: sink/move reorderable load_ssbo 47167ec681f3Smrg- radv: sink load_ssbo 47177ec681f3Smrg- aco: don't consider a phi trivial if same's register doesn't match the def 47187ec681f3Smrg- aco: remove Format::{VOP3A,VOP3B} 47197ec681f3Smrg- aco: add instruction cast and format-check methods 47207ec681f3Smrg- aco: use instruction cast methods 47217ec681f3Smrg- aco: use format-check methods 47227ec681f3Smrg- aco: return references in instruction cast methods 47237ec681f3Smrg- aco: fix WQM for texture instructions with args before the coordinates 47247ec681f3Smrg- nir/opt_uniform_atomics: recognize more complicated invocation comparisons 47257ec681f3Smrg- nir/opt_uniform_atomics: fix elect detection 47267ec681f3Smrg- aco: disable a*1.0 optimization if the instruction is precise 47277ec681f3Smrg- nir/algebraic: optimize out exact a*1.0 if it's used only as a float 47287ec681f3Smrg- aco: optimize a*0.0 47297ec681f3Smrg- aco: optimize out a*1.0 if it's used as a float 47307ec681f3Smrg- nir/algebraic: optimize out exact a+0.0 if it's used only as a float 47317ec681f3Smrg- nir/algebraic: eliminate exact a*0.0 if float execution mode allow it 47327ec681f3Smrg- aco: don't affect isPrecise() after applying output modifiers 47337ec681f3Smrg- nir,spirv: allow non-uniform OpArrayLength 47347ec681f3Smrg- radv,ac/nir: implement non-uniform get_ssbo_size 47357ec681f3Smrg- aco: implement non-uniform get_ssbo_size 47367ec681f3Smrg- radv: round-up num_records division in radv_flush_vertex_descriptors 47377ec681f3Smrg- radv: correctly enable WGP_MODE for NGG and GS 47387ec681f3Smrg- radv: correctly enable WGP_MODE for tessellation control 47397ec681f3Smrg- aco: add fallback algorithm in get_reg() 47407ec681f3Smrg- aco: always set exec_live=false 47417ec681f3Smrg- aco: optimize AC_FETCH_FORMAT_SNORM alpha adjust 47427ec681f3Smrg- aco: do not flag all blocks WQM to ensure we enter all nested loops in WQM 47437ec681f3Smrg- aco: rewrite setting of Exact_Branch 47447ec681f3Smrg- aco: remove loop to flag loop blocks as WQM 47457ec681f3Smrg- aco: fix adjust_vertex_fetch_alpha 47467ec681f3Smrg- radv: use a more relaxed alignment for upload buffer allocations 47477ec681f3Smrg- radv: fix max_lds_per_simd on GFX10 47487ec681f3Smrg- radv: switch MaxWaves statistic to wave32 waves 47497ec681f3Smrg- ac: split lds_granularity into encode and allocation granularities 47507ec681f3Smrg- radv: use lds_{encode,alloc}_granularity 47517ec681f3Smrg- radv: round up max_lds_per_simd / lds_per_wave 47527ec681f3Smrg- aco: fix waves calculation for wave32 47537ec681f3Smrg- aco: add Program::wgp_mode 47547ec681f3Smrg- radv,aco: add radv_nir_compiler_options::wgp_mode 47557ec681f3Smrg- aco: consider that GFX10.3 allocates LDS in 1024 byte blocks 47567ec681f3Smrg- aco: add DeviceInfo 47577ec681f3Smrg- aco: fix transition_to_{WQM,Exact} if exec.back() is not in exec 47587ec681f3Smrg- radv: relax shared alignment requirements in mem_vectorize_callback 47597ec681f3Smrg- radv,aco: allow unaligned LDS access on GFX9+ 47607ec681f3Smrg- aco/lower_phis: fix all_preds_uniform with continue_or_break 47617ec681f3Smrg- nir/dce: replace instruction worklist with ssa def bitset 47627ec681f3Smrg- nir: inline nir_foreach_{src,dest} 47637ec681f3Smrg- nir/dce: perform DCE for unlooped instructions in a single pass 47647ec681f3Smrg- aco: calculate all p_as_uniform and v_readfirstlane_b32 sources in WQM 47657ec681f3Smrg- aco: use p_as_uniform for get_sampler_desc and convert_pointer_to_64_bit 47667ec681f3Smrg- nir: fix build at -O1 47677ec681f3Smrg- nir: add nir_ssa_def_is_unused() 47687ec681f3Smrg- nir/copy_prop: remove unused copies 47697ec681f3Smrg- nir/copy_prop: visit copies instead of sources 47707ec681f3Smrg- nir/copy_prop: use nir_{instr,if}_rewrite_{src,condition}_ssa 47717ec681f3Smrg- Revert "radv,aco: allow unaligned LDS access on GFX9+" 47727ec681f3Smrg- aco: add missing usable_read2 check 47737ec681f3Smrg- nir/opt_shrink_vectors: add option to skip shrinking image stores 47747ec681f3Smrg- radv: don't shrink image stores for The Surge 2 47757ec681f3Smrg- radv: don't set sx_blend_opt_epsilon for V_028C70_COLOR_10_11_11 47767ec681f3Smrg- radv,aco: remove aco_compiler_statistics 47777ec681f3Smrg- radv: cache pipeline statistics 47787ec681f3Smrg- aco: set compr for fp16 exports 47797ec681f3Smrg- radv/llvm: fix enabled_channels for compressed exports 47807ec681f3Smrg- aco: simplify loop_nest_depth tracking in isel 47817ec681f3Smrg- aco: track divergent and uniform branch depth 47827ec681f3Smrg- aco: move wait_imm to aco_ir.h 47837ec681f3Smrg- aco: lower p_constaddr into separate instructions earlier 47847ec681f3Smrg- aco: add instruction classes 47857ec681f3Smrg- aco: add latency and inverse throughput statistics 47867ec681f3Smrg- aco: add print option to print program without temporary IDs 47877ec681f3Smrg- aco: add ACO_DEBUG=perfinfo 47887ec681f3Smrg- aco: remove vmem/smem score statistics 47897ec681f3Smrg- aco: fix NSA MIMG followed by MUBUF/MTBUF 47907ec681f3Smrg- aco/tests: add test for NSAToVMEMBug 47917ec681f3Smrg- aco: fix NSA following writelane 47927ec681f3Smrg- aco/tests: add test for waNsaCannotFollowWritelane 47937ec681f3Smrg- nir: Don't update base in vectorize_loads() 47947ec681f3Smrg- aco: implement 64-bit VGPR {u,i}find_msb 47957ec681f3Smrg- aco: use uadd32_sat() helper for nir_op_uadd_sat 47967ec681f3Smrg- aco: use a single instruction for uadd32_sat() on GFX8 47977ec681f3Smrg- aco: implement image_deref_samples 47987ec681f3Smrg- aco: add aco_print_program() flag to print kill flags 47997ec681f3Smrg- aco: add aco_print_program() flags to print live_out and register demand 48007ec681f3Smrg- docs: document ACO_DEBUG=perfinfo 48017ec681f3Smrg- aco: add ACO_DEBUG=liveinfo 48027ec681f3Smrg- radv: lower variables to ssa before nir_propagate_invariant 48037ec681f3Smrg- radv: lower view_index to zero if multiview is disabled 48047ec681f3Smrg- ci: add expected fail for RADV 48057ec681f3Smrg- aco: don't optimize min(a*1.0, ...) to min(a, ...) on GFX8 48067ec681f3Smrg- aco: use -1.0*x and 1.0*|x| for fneg/fabs 48077ec681f3Smrg- aco/tests: add tests for denormal-aware propagation 48087ec681f3Smrg- ac: invalidate metadata after hs_emit_write_tess_factors() 48097ec681f3Smrg- aco/tests: fix isel.sparse.clause for LLVM 12+ 48107ec681f3Smrg- lavapipe: fix initialization of pipe_stream_output with unwritten outputs 48117ec681f3Smrg- nir/gather_info: implement partial masking of struct and compact I/O 48127ec681f3Smrg- nir/lower_tex: handle deref casts 48137ec681f3Smrg- nir_to_tgsi: run constant folding after nir_opt_algebraic 48147ec681f3Smrg- aco: fix integer tg4 workaround with unnormalized coordinates 48157ec681f3Smrg- draw: fix pstipple, aaline and aapoint without LLVM 48167ec681f3Smrg- aco: ensure loops nested in a WQM loop are in WQM 48177ec681f3Smrg- nir/gather_info: fix partial masking of compact I/O with location_frac!=0 48187ec681f3Smrg- radv: remove second nir_lower_idiv 48197ec681f3Smrg- nir/lower_idiv: add options to use fp32 for 8-bit division lowering 48207ec681f3Smrg- nir/lower_idiv: make lowered divisions exact 48217ec681f3Smrg- aco: fix 16-bit u2f32 48227ec681f3Smrg- aco: fix 16-bit f2{u8,i8} on GFX6/7 48237ec681f3Smrg- radv: don't use fp16 for 8-bit division lowering before GFX9 48247ec681f3Smrg- nir: add nir_block_get_predecessors_sorted() helper 48257ec681f3Smrg- nir/lcssa: fix nondeterminism in predecessor iteration 48267ec681f3Smrg- nir/loop_unroll: fix is_indirect_load() with load_global 48277ec681f3Smrg- radv: fix conditions for running nir_opt_vectorize 48287ec681f3Smrg- aco/ra: use original names when renaming loop carried phi operands 48297ec681f3Smrg- aco/ra: remove live-in temporary from live_out_per_block when moving it 48307ec681f3Smrg- radv: fix barrier in radv_decompress_dcc_compute shader 48317ec681f3Smrg- radv: fix clearing DCC-compressed e5b9g9r9 images 48327ec681f3Smrg- aco: set TRUNC_COORD=0 for nir_texop_tg4 48337ec681f3Smrg- ac/nir: set TRUNC_COORD=0 for nir_texop_tg4 48347ec681f3Smrg- Revert "radeonsi: set TRUNC_COORD=0 for Total War: WARHAMMER to fix it" 48357ec681f3Smrg- aco: don't update register demand during RA validation 48367ec681f3Smrg- aco: allow SDWA sels smaller than the operand size 48377ec681f3Smrg- radv: disable VK_FORMAT_R64_SFLOAT 48387ec681f3Smrg- vulkan: fix use-after-free in vk_common_DestroyDebugReportCallbackEXT 48397ec681f3Smrg- radv: fix use-after-free upon GS copy shader cache hits 48407ec681f3Smrg- radv,ac/llvm: use a dword alignment for descriptor loads 48417ec681f3Smrg 48427ec681f3SmrgRob Clark (143): 48437ec681f3Smrg 48447ec681f3Smrg- freedreno/ir3: Fix ldg decoding/parsing 48457ec681f3Smrg- freedreno/ir3: Decouple ir3_info collection from assembler 48467ec681f3Smrg- freedreno/ir3: Add some new "logical" opcodes 48477ec681f3Smrg- freedreno/hw: Add isaspec mechanism for documenting/defining an ISA 48487ec681f3Smrg- freedreno/hw/isa: Add description of ir3 ISA 48497ec681f3Smrg- freedreno/hw/isa: Add expression caching 48507ec681f3Smrg- freedreno/ir3/tests: Switch disasm test over to new decoder 48517ec681f3Smrg- freedreno/ir3: Switch over to new encoder/decoder 48527ec681f3Smrg- freedreno/ir3: Small resinfo disasm tweak 48537ec681f3Smrg- freedreno/ir3: Better sstall estimation 48547ec681f3Smrg- freedreno/ir3: Realign disasm shader stats 48557ec681f3Smrg- freedreno/ir3/decode: Switch over to new disasm 48567ec681f3Smrg- freedreno/ir3: Remove legacy packed-struct encoding 48577ec681f3Smrg- frontend/dri: Expose RGB[AX]_SRGB as well 48587ec681f3Smrg- freedreno/isa: Fix branch/jump offset encoding 48597ec681f3Smrg- freedreno/a6xx: Add r2d support for GMEM resolves 48607ec681f3Smrg- gallium/util: Add helpers to determine if z/s is written 48617ec681f3Smrg- freedreno/a6xx: Don't early-z if there are stencil writes 48627ec681f3Smrg- r300: Use util_writes_depth_stencil() helper 48637ec681f3Smrg- radeonsi: Use util_writes_stencil() helper 48647ec681f3Smrg- freedreno: Add perf_warn() trace helper 48657ec681f3Smrg- freedreno: Add fmt/args macros for pipe_resource 48667ec681f3Smrg- freedreno/a6xx: Add helper to check if UBWC is supported 48677ec681f3Smrg- freedreno: Add perf_warn() for missed UBWC opportunities 48687ec681f3Smrg- ci/freedreno/a6xx: Skip vs-output-array-vec2-index-wr-before-gs 48697ec681f3Smrg- freedreno/a6xx: Fix 3dmark misrendering with unwritten MRTs 48707ec681f3Smrg- mesa: Remove _mesa_destroy_context() 48717ec681f3Smrg- freedreno/decode: Fix overflow 48727ec681f3Smrg- freedreno: Put an upper limit on VSC size 48737ec681f3Smrg- freedreno: Misc cleanup 48747ec681f3Smrg- freedreno/a5xx: Drop fd5_compute_stateobj 48757ec681f3Smrg- freedreno/a6xx: Drop fd6_compute_stateobj 48767ec681f3Smrg- freedreno/ir3+a5xx+a6xx: De-duplicate create_compute_state() 48777ec681f3Smrg- freedreno/ir3: Add ir3_shader_state 48787ec681f3Smrg- freedreno/ir3: Move ir3_compiler_create() 48797ec681f3Smrg- freedreno/ir3: Add ir3_screen_fini() 48807ec681f3Smrg- freedreno/ir3: Reshuffle ir3_shader_create() 48817ec681f3Smrg- freedreno/ir3: Reshuffle compute state creation 48827ec681f3Smrg- freedreno/ir3: Async shader compile 48837ec681f3Smrg- freedreno/ir3: Add missing shader prog cache invalidation 48847ec681f3Smrg- freedreno: Quiet fallthrough warnings 48857ec681f3Smrg- freedreno: Split batch_flush_reset_dependencies() 48867ec681f3Smrg- freedreno: driver-thread annotations 48877ec681f3Smrg- freedreno/ir3/print: More sane ssa src/dst display 48887ec681f3Smrg- freedreno/ir3/print: Improve branch printing 48897ec681f3Smrg- util/fossilize_db: Fix compile error with clang 48907ec681f3Smrg- freedreno: Handle InvalidateBufferData() case 48917ec681f3Smrg- freedreno: Add perf_debug logging for bo stalls 48927ec681f3Smrg- freedreno: Workaround for UNSYNC+DISCARD_RANGE 48937ec681f3Smrg- driconf: Generate a static table when no xmlconfig 48947ec681f3Smrg- xmlconfig: Reshuffle to keep attr processing 48957ec681f3Smrg- xmlconfig: Add static driconfig support 48967ec681f3Smrg- freedreno/ir3: Drop foreach_bit() macro 48977ec681f3Smrg- freedreno: Drop foreach_bit() macro 48987ec681f3Smrg- etnaviv: Drop foreach_bit() macro 48997ec681f3Smrg- v3d: Drop foreach_bit() macro 49007ec681f3Smrg- freedreno: Fix think-o in fd_resource_wait() 49017ec681f3Smrg- freedreno/ir3: Fix initial_variants_synchronous() condition 49027ec681f3Smrg- freedreno: Add FD_DBG() macro 49037ec681f3Smrg- freedreno: Slight perf_debug rework 49047ec681f3Smrg- freedreno: Add macro for duration based warns 49057ec681f3Smrg- util/u_queue: Ensure num_cpu_mask_bits is valid 49067ec681f3Smrg- util: Add accessor for util_cpu_caps 49077ec681f3Smrg- freedreno/a6xx: Always pass ctx to fd6_emit_textures() 49087ec681f3Smrg- freedreno/a6xx: Fix uncompressed resource vs stale CSO 49097ec681f3Smrg- freedreno/ir3: Add comments about shader key/gen 49107ec681f3Smrg- freedreno: Deduplicate fixup_shader_state() 49117ec681f3Smrg- freedreno/a6xx: Fix compile warning 49127ec681f3Smrg- driconf: Add ignore_map_unsynchronized option 49137ec681f3Smrg- freedreno: Remove dead-cells MBR workaround 49147ec681f3Smrg- util: Extract thread-id helpers from u_current 49157ec681f3Smrg- gallium/u_threaded: Add helper to assert driver thread 49167ec681f3Smrg- gallium/u_threaded: use mesa_log for debug msgs 49177ec681f3Smrg- freedreno: Fix u_blitter constant-buffer leak 49187ec681f3Smrg- freedreno: Factor out common fd_resource init 49197ec681f3Smrg- freedreno: Split out batch/resource tracking 49207ec681f3Smrg- freedreno: Restructure transfer_map() 49217ec681f3Smrg- freedreno: Extend threaded_resource 49227ec681f3Smrg- freedreno: Extend threaded_transfer 49237ec681f3Smrg- freedreno: Extract out helper for transfer-map flag munging 49247ec681f3Smrg- freedreno: Add fd_replace_buffer_storage() 49257ec681f3Smrg- freedreno: Add transfer_pool_unsync 49267ec681f3Smrg- freedreno/a6xx: Move UBWC demotion to first sampler view bind 49277ec681f3Smrg- freedreno: Check cb0 in rebind_resource() 49287ec681f3Smrg- freedreno: threaded_context support 49297ec681f3Smrg- freedreno: threaded_context async flush support 49307ec681f3Smrg- freedreno: Fix fd_fence_finish() 49317ec681f3Smrg- freedreno/drm: Avoid unitialized timestamp in submit fail 49327ec681f3Smrg- freedreno/drm: Split softpin "reloc" functions 49337ec681f3Smrg- freedreno/drm: Split 64b vs 32b paths 49347ec681f3Smrg- freedreno/drm: Move emit_reloc_tail to head 49357ec681f3Smrg- freedreno/drm: Inline iova calculation 49367ec681f3Smrg- freedreno/ir3: Precompute whether we need driver-params 49377ec681f3Smrg- freedreno: Add helpers to mark dirty state 49387ec681f3Smrg- freedreno: Add mapping to generation specific dirty state 49397ec681f3Smrg- freedreno/a6xx: Convert to dirty_groups 49407ec681f3Smrg- freedreno: Small dirty flag re-org 49417ec681f3Smrg- freedreno: Add dirty bit for state that needs rsc tracking 49427ec681f3Smrg- freedreno: Don't ignore geom/tess stage resources 49437ec681f3Smrg- freedreno: Split out helper for updating sw stats 49447ec681f3Smrg- freedreno: Only collect sw stats when required 49457ec681f3Smrg- freedreno/a6xx/vsc: Be more tolerate of degenerate prims 49467ec681f3Smrg- freedreno: Drop u_trim_pipe_prim() from fast-paths 49477ec681f3Smrg- u_draw: Add helper to emultate multi-draw 49487ec681f3Smrg- freedreno: Use multi-draw helper 49497ec681f3Smrg- freedreno: Handle multi-draw edge cases 49507ec681f3Smrg- freedreno: Push multi-draw closer to backend 49517ec681f3Smrg- freedreno/a6xx: Emit streamout state on every draw 49527ec681f3Smrg- freedreno: Add draw cost estimation 49537ec681f3Smrg- freedreno/batch: Export key/hash fxns 49547ec681f3Smrg- freedreno/batch: Add a way to clone a batch key 49557ec681f3Smrg- freedreno: Add gmem_reason_mask 49567ec681f3Smrg- freedreno/a6xx: Fix sRGB/snorm vs sysmem clear path 49577ec681f3Smrg- freedreno: Autotune bypass vs GMEM rendering decision 49587ec681f3Smrg- freedreno/a6xx: Fix typo 49597ec681f3Smrg- freedreno: Make headers C++ happy 49607ec681f3Smrg- freedreno/fdperf: Use os_read_file() 49617ec681f3Smrg- freedreno: Split out devicetree helpers 49627ec681f3Smrg- ci: Disable panfrost t760 49637ec681f3Smrg- freedreno/a6xx: Fix indirect+patches draws 49647ec681f3Smrg- freedreno/a6xx: Fix obsolete comment 49657ec681f3Smrg- d3d12: Use util_draw_multi() helper 49667ec681f3Smrg- etnaviv: Use util_draw_multi() helper 49677ec681f3Smrg- i915: Use util_draw_multi() helper 49687ec681f3Smrg- iris: Use util_draw_multi() helper 49697ec681f3Smrg- lima: Use util_draw_multi() helper 49707ec681f3Smrg- llvmpipe: Use util_draw_multi() helper 49717ec681f3Smrg- nouveau: Use util_draw_multi() helper 49727ec681f3Smrg- r300: Use util_draw_multi() helper 49737ec681f3Smrg- r600: Use util_draw_multi() helper 49747ec681f3Smrg- softpipe: Use util_draw_multi() helper 49757ec681f3Smrg- svga: Use util_draw_multi() helper 49767ec681f3Smrg- tegra: Use util_draw_multi() helper 49777ec681f3Smrg- vc4: Use util_draw_multi() helper 49787ec681f3Smrg- v3d: Use util_draw_multi() helper 49797ec681f3Smrg- virgl: Use util_draw_multi() helper 49807ec681f3Smrg- freedreno: Don't handle multi-draw in indirect case 49817ec681f3Smrg- util/primconvert: Handle indirect and multi-draw 49827ec681f3Smrg- freedreno: Add .clang-format 49837ec681f3Smrg- freedreno: Some manual reformatting 49847ec681f3Smrg- freedreno: Re-indent 49857ec681f3Smrg- freedreno: Manual fixups 49867ec681f3Smrg- freedreno: Add missing foreach macros and update indentation 49877ec681f3Smrg 49887ec681f3SmrgRohan Garg (8): 49897ec681f3Smrg 49907ec681f3Smrg- virgl: Cache depth and stencil buffers 49917ec681f3Smrg- ci: Ensure that jobs inherting the ci-deqp jobs artifact meson logs 49927ec681f3Smrg- intel/genxml: Free resource before exiting 49937ec681f3Smrg- intel/compiler: Free resources on test teardown 49947ec681f3Smrg- virgl: update headers 49957ec681f3Smrg- virgl: Return total video memory if available 49967ec681f3Smrg- virgl: Add support for querying detailed memory info 49977ec681f3Smrg- virgl: Support the ETC1_RGB8 format as virglrenderer supports it 49987ec681f3Smrg 49997ec681f3SmrgRoman Stratiienko (1): 50007ec681f3Smrg 50017ec681f3Smrg- egl: android: use num_planes param in createImageFromDmaBufs() 50027ec681f3Smrg 50037ec681f3SmrgRuijing Dong (1): 50047ec681f3Smrg 50057ec681f3Smrg- radeon/vcn: release si buffer for encoding at the end. 50067ec681f3Smrg 50077ec681f3SmrgRyan Neph (1): 50087ec681f3Smrg 50097ec681f3Smrg- Revert "virgl: fix BGRA emulation artifacts during window resize" 50107ec681f3Smrg 50117ec681f3SmrgSagar Ghuge (7): 50127ec681f3Smrg 50137ec681f3Smrg- anv: Invalidate the correct AUX-TT entry 50147ec681f3Smrg- anv: Skip CCS ambiguate which preceed fast-clears 50157ec681f3Smrg- intel/mi_builder: Added support for command streamer shift operations 50167ec681f3Smrg- anv: Add anv_memregion structure 50177ec681f3Smrg- Revert "Revert "blorp/gen12: Don't use aux address if implicit CCS"" 50187ec681f3Smrg- intel/blorp: Fix condition to figure out aux_address 50197ec681f3Smrg- anv: Set correct binding table entry count 50207ec681f3Smrg 50217ec681f3SmrgSamuel Iglesias Gonsálvez (9): 50227ec681f3Smrg 50237ec681f3Smrg- turnip: disable UBWC on Z24_S8 MSAA images on A630 50247ec681f3Smrg- turnip: set sparseAddressSpaceSize to zero 50257ec681f3Smrg- turnip: fix UINT64_MAX size wrapping in tu_GetBufferMemoryRequirements() 50267ec681f3Smrg- turnip: fix resolve MSAA D24_UNORM_S8_UINT image to S8_UINT 50277ec681f3Smrg- turnip: fix resolve MSAA D32_SFLOAT_S8_UINT image to S8_UINT 50287ec681f3Smrg- util: fix parsing of /proc/meminfo MemAvailable value 50297ec681f3Smrg- turnip: keep track of memory heap usage, size and flags 50307ec681f3Smrg- turnip: VK_EXT_memory_budget implementation 50317ec681f3Smrg- turnip: set depth plane control zmode to A6XX_LATE_Z when sample mask is written 50327ec681f3Smrg 50337ec681f3SmrgSamuel Pitoiset (218): 50347ec681f3Smrg 50357ec681f3Smrg- radv: do not invalidate the L2 metadata cache on compute queues 50367ec681f3Smrg- ci: mark some sparse CTS as expected failures on RAVEN 50377ec681f3Smrg- radv: flush L2 metadata as part of CB/DB flush instead of CS_DONE on GFX9 50387ec681f3Smrg- radv: add a comment explaining the micro tile mode resolve 50397ec681f3Smrg- radv: enable TC-compat HTILE with D32S8 and MSAA on GFX9+ 50407ec681f3Smrg- radv: enable TC-compat HTILE for D16S8 on GFX9+ 50417ec681f3Smrg- radv: restore invalidating the vector cache for internal meta operations 50427ec681f3Smrg- radv: flush L2 for images affected by the pipe misaligned issue on GFX10+ 50437ec681f3Smrg- ci: exclude one CTS test that timeout most of the time for RADV CI 50447ec681f3Smrg- radv: remove redundant check in radv_process_depth_stencil() 50457ec681f3Smrg- radv: remove unnecessary radv_image::tc_compatible_htile 50467ec681f3Smrg- radv: remove redundant check in depth_view_can_fast_clear() 50477ec681f3Smrg- radv: fix a sync issue with geometry shader primitives query on GFX10+ 50487ec681f3Smrg- radv: fix overflow when computing the SQTT buffer size 50497ec681f3Smrg- radv: inhibit clock gating when tracing with SQTT 50507ec681f3Smrg- ac/rgp: add support for GFX10.3 50517ec681f3Smrg- ac,radv: add SQTT support on GFX10.3 50527ec681f3Smrg- radv: enable SQTT support on GFX10.3 50537ec681f3Smrg- radv: fix separate depth/stencil layout in render pass 50547ec681f3Smrg- radv: add multi-layer support to FMASK color expand 50557ec681f3Smrg- radv: use the range aspect mask in FMASK color expand 50567ec681f3Smrg- radv: use a workgroup size of 8x8 for FMASK color expand 50577ec681f3Smrg- radv: only decompress the depth/stencil aspect that needs to be resolved 50587ec681f3Smrg- radv: enable sparseImageInt64Atomics/sparseImageFloat32Atomics 50597ec681f3Smrg- radv,aco: fix shifting input VGPRs for the LS VGPR init bug on GFX9 50607ec681f3Smrg- radv: synchronize Cmd{Set,Write}Event() using PS_DONE/CS_DONE events 50617ec681f3Smrg- radv: add support for emitting PS_DONE/CS_DONE on GFX6-8 50627ec681f3Smrg- radv: remove radv_util.h 50637ec681f3Smrg- radv: remove stub() macros 50647ec681f3Smrg- radv: remove unused EMPTY constant in radv_descript_set.c 50657ec681f3Smrg- nir/algebraic: mark more optimization with fsat(NaN) as inexact 50667ec681f3Smrg- ac/surface: store HTILE mip info into the surface 50677ec681f3Smrg- radv: use the image view range when fast clearing depth 50687ec681f3Smrg- radv: check if HTILE is enabled per-level instead of the entire image 50697ec681f3Smrg- radv: do not decompress/resummarize levels without HTILE 50707ec681f3Smrg- radv: remove mipmaps related assertions when initializing HTILE 50717ec681f3Smrg- radv: add support for fast clearing levels of the HTILE buffer 50727ec681f3Smrg- radv: teach radv_htile_enabled() about the number of HTILE levels 50737ec681f3Smrg- radv: enable TC-compat HTILE for mipmaps on GFX10+ 50747ec681f3Smrg- radv: re-disable TC-compat HTILE for D32S8 on all generations 50757ec681f3Smrg- radv: fix centroid with VRS coarse shading 50767ec681f3Smrg- radv/winsys: move the initial BO domain to radeon_winsys_bo 50777ec681f3Smrg- radv: prefer CP DMA for GTT buffer copies/clears on dGPUs due to slow PCIe 50787ec681f3Smrg- radv: fix waiting on the last enabled RB for occlusion queries 50797ec681f3Smrg- radv/winsys: use an array for the global BO list instead of a list 50807ec681f3Smrg- radv/winsys: remove the radv_amdgpu_winsys_bo::ws indirection 50817ec681f3Smrg- radv/winsys: remove useless continue preamble CS for IBs path 50827ec681f3Smrg- radv/winsys: remove useless is_local check in radv_amdgpu_cs_add_buffer() 50837ec681f3Smrg- radv/winsys: remove unused radeon_bo_usage enum 50847ec681f3Smrg- radv/winsys: simplify the user fence logic for submission 50857ec681f3Smrg- radv/winsys: remove unused fields in radv_amdgpu_cs_request 50867ec681f3Smrg- radv/winsys: stop zeroing radv_amdgpu_cs_request 50877ec681f3Smrg- radv: use less AMDGPU contexts by creating only one per queue priority 50887ec681f3Smrg- radv: add radeon_winsys_bo::use_global_list 50897ec681f3Smrg- radv: stop using VM_ALWAYS_VALID on APUs 50907ec681f3Smrg- radv/winsys: move the debug_all_bos check outside of the add/del helpers 50917ec681f3Smrg- radv/winsys: set use_global_list to avoid adding a BO twice 50927ec681f3Smrg- radv/winsys: add buffer_make_resident() to the API 50937ec681f3Smrg- radv/winsys: add the resident BOs to the list of BOs at submit time 50947ec681f3Smrg- radv/winsys: enable the global BO list unconditionally 50957ec681f3Smrg- radv: use the global BO list from the winsys 50967ec681f3Smrg- radv: fix printing the debug option names 50977ec681f3Smrg- radv: fix double free when creating a fence failed 50987ec681f3Smrg- radv: stop allocating useless ESGS scratch BO on GFX10+ 50997ec681f3Smrg- radv: fix memory leaks if a submission fails 51007ec681f3Smrg- radv: do not overallocate the SQTT buffer 51017ec681f3Smrg- radv: adjust an error message related to the SQTT buffer size 51027ec681f3Smrg- radv: add support for resizing the SQTT buffer automatically 51037ec681f3Smrg- ac/rgp: append the number of seconds to the generated RGP file 51047ec681f3Smrg- radv: emit pipeline bind markers for SQTT 51057ec681f3Smrg- radv: only make the WSI images resident if the global BO list is used 51067ec681f3Smrg- radv/winsys: set use_global_list inside the critical section 51077ec681f3Smrg- radv: only apply the MRT output NaN fixup to non-meta shaders 51087ec681f3Smrg- radv: create the start/stop CS for SQTT dynamically 51097ec681f3Smrg- radv: move SQTT parameters initialization to radv_thread_trace_init() 51107ec681f3Smrg- radv: remove an outdated TODO about SQTT cache flushes 51117ec681f3Smrg- radv: make sure to allocate enough space when emitting SQTT userdata 51127ec681f3Smrg- radv: stop emitting pipeline bind markers 51137ec681f3Smrg- radv: do not allow to capture SQTT on the compute queue 51147ec681f3Smrg- radv: add support for user event markers with SQTT 51157ec681f3Smrg- radv: only emit pipeline bind markers for application pipelines 51167ec681f3Smrg- radv: use the pipeline key as hash for pipeline bind markers 51177ec681f3Smrg- radv: set correct value for OFFCHIP_BUFFERING on GFX10+ 51187ec681f3Smrg- radv: make the border color BO a resident buffer 51197ec681f3Smrg- radv: make the trace BO a resident buffer 51207ec681f3Smrg- radv: make the TMA/TBA BOs resident buffers 51217ec681f3Smrg- radv: emit the trap handler registers earlier 51227ec681f3Smrg- radv: rework radv_cmd_buffer_resolve_subpass() a bit 51237ec681f3Smrg- radv: emit missing subpass resolve marker for SQTT 51247ec681f3Smrg- ac/rgp: fill CPU info by parsing /proc/cpuinfo 51257ec681f3Smrg- radv: store a pointer to the code in radv_shader_variant 51267ec681f3Smrg- radv: add support for exporting pipelines with RGP 51277ec681f3Smrg- radv: add support for instruction timing with RGP 51287ec681f3Smrg- radv: do not scale the depth bias for D16_UNORM depth surfaces 51297ec681f3Smrg- include/drm-uapi: bump AMDGPU headers 51307ec681f3Smrg- ac/rgp: recognize more memory types 51317ec681f3Smrg- ac/rgp: report LDS size in CU mode on GFX10+ 51327ec681f3Smrg- ac/rgp: report the number of memory operations per clock 51337ec681f3Smrg- ac/rgp: report the number of primitives per clock 51347ec681f3Smrg- radv: remove duplicate REG_INCLUDE_CONTEXT setting for SQTT 51357ec681f3Smrg- radv: always select the first active CU when profiling with SQTT 51367ec681f3Smrg- radv: fix exporting SQTT pipelines with LLVM 51377ec681f3Smrg- radv: exclude perf counters for SQTT also on GFX10.3 51387ec681f3Smrg- Revert "radv: do not overallocate the SQTT buffer" 51397ec681f3Smrg- radeonsi,radv: do not overallocate the SQTT buffer size 51407ec681f3Smrg- radv: remove useless decompression of the DS resolve attachment 51417ec681f3Smrg- radv: do not trace inactive shader engines with SQTT 51427ec681f3Smrg- ac/sqtt: fix determining if the trace is complete on GFX10+ 51437ec681f3Smrg- radv: double the SQTT buffer size when it is resized 51447ec681f3Smrg- radv: trigger a new SQTT capture automatically after resizing the buffer 51457ec681f3Smrg- radv: bump the initial SQTT buffer size to 32MB per SE 51467ec681f3Smrg- radv: fix RGP barrier layout transition for TC-compatible CMASK images 51477ec681f3Smrg- Revert "radv: stop using VM_ALWAYS_VALID on APUs" 51487ec681f3Smrg- radv: cleanup enabling TC-compat HTILE for depth surfaces 51497ec681f3Smrg- radv: remove useless check about mips+layers for TC-compat HTILE images 51507ec681f3Smrg- radv: skip useless FCE when fast-clearing MSAA images with DCC enabled 51517ec681f3Smrg- radv: re-enable TC-compat HTILE for MSAA D32S8 images on GFX9+ 51527ec681f3Smrg- radv: do not declare push constants for DCC decompress on compute 51537ec681f3Smrg- radv: check if dynamic VRS state changed 51547ec681f3Smrg- radv: check if dynamic line stipple state changed 51557ec681f3Smrg- radv: disable sampling with VK_FORMAT_R64_SFLOAT 51567ec681f3Smrg- radv: fix meta save/restore state with non renderable images 51577ec681f3Smrg- radv: fix potential clears with non renderable images on GFX9+ 51587ec681f3Smrg- radv: fix initialization of disable_compression when clearing color image 51597ec681f3Smrg- radv: add missing SQTT events for copy_commands2/create_renderpass2 51607ec681f3Smrg- radv: remove useless DCC disable check for 3D images on GFX10+ 51617ec681f3Smrg- radv: rework radv_use_dcc_for_image() a bit 51627ec681f3Smrg- vulkan: add missing vk_shader_module.c/h includes to Makefile 51637ec681f3Smrg- radv: use common entrypoints for VK_KHR_copy_commands2 51647ec681f3Smrg- radv: do not enable TC-compat CMASK if the image isn't readable by a shader 51657ec681f3Smrg- radv: remove redundant check when enabling TC-compat CMASK 51667ec681f3Smrg- radv: make sure FMASK is enabled for TC-compat CMASK 51677ec681f3Smrg- radv: only configure the CMASK tiling for TC-compat on GFX8 51687ec681f3Smrg- radv: initialize TC-compat CMASK images with the DCC clear code 51697ec681f3Smrg- radv: enable TC-compat CMASK on GFX10+ 51707ec681f3Smrg- radv: add notccompatcmask debug option 51717ec681f3Smrg- radv: extend the dirty bits to 64-bit 51727ec681f3Smrg- ac/surface: init CMASK slice size on GFX9+ 51737ec681f3Smrg- radv: fix clearing CMASK layers on GFX9+ 51747ec681f3Smrg- radv: initialize CMASK with correct clear codes 51757ec681f3Smrg- radv: restore previous MRT CB_SHADER_MASK logic 51767ec681f3Smrg- radv: gather if the FS uses perspective or linear interpolations 51777ec681f3Smrg- radv: determine if a pipeline is candidate for flat shading 51787ec681f3Smrg- radv: enable VRS 2x2 coarse shading for flat shading on GFX10.3+ 51797ec681f3Smrg- radv: add RADV_DEBUG=novrsflatshading option 51807ec681f3Smrg- ci: update list of expected CTS failures for RADV 51817ec681f3Smrg- vulkan: add common entrypoints for VK_KHR_create_renderpass2 51827ec681f3Smrg- radv: use common entrypoints for VK_KHR_create_renderpass2 51837ec681f3Smrg- turnip: use common entrypoints for VK_KHR_create_renderpass2 51847ec681f3Smrg- lavapipe: use common entrypoints for VK_KHR_create_renderpass2 51857ec681f3Smrg- anv: use common entrypoints for VK_KHR_create_renderpass2 51867ec681f3Smrg- radv: report that degenerated triangles are not culled 51877ec681f3Smrg- radv: require DRM 3.35+ 51887ec681f3Smrg- ac/surface: do not allocate FMASK or CMASK for stencil-only surfaces on GFX9+ 51897ec681f3Smrg- radv: do not fixup DCC after compute color resolves if DCC stores enabled 51907ec681f3Smrg- radv: only set WRITE_COMPRESS_ENABLE for storage image descriptors 51917ec681f3Smrg- radv: use a sampled image descriptor for reads for the MSAA color decompress 51927ec681f3Smrg- radv: compress FMASK for all layouts except GENERAL 51937ec681f3Smrg- radv: cleanup FMASK expand transitions 51947ec681f3Smrg- radv: do not force enable FMASK during MSAA blits 51957ec681f3Smrg- radv: use COLOR_ATTACHMENT_OPTIMAL for fast clear/hw resolve operations 51967ec681f3Smrg- ac: add ac_get_family_name() helper 51977ec681f3Smrg- radv: change RADV_FORCE_FAMILY to use family name instead of LLVM processor name 51987ec681f3Smrg- radv: try to keep HTILE compressed with DEPTH_STENCIL_READ_ONLY_OPTIMAL 51997ec681f3Smrg- radv: clean up fence syncobj code 52007ec681f3Smrg- ac: add ac_gpu_info::has_image_load_dcc_bug 52017ec681f3Smrg- aco: fix get_sampler_desc() for image loads 52027ec681f3Smrg- aco: implement a workaround for the image load DCC hw bug on GFX10.3 52037ec681f3Smrg- radv: allow DCC for storage images on GFX10.3 with RADV_PERFTEST=dccstores 52047ec681f3Smrg- radv: handle implicit subpass dependencies per attachment 52057ec681f3Smrg- radv: init CMASK/FMASK/DCC in parallel 52067ec681f3Smrg- radv: perform MSAA color decompression for storage images with DCC 52077ec681f3Smrg- radv: enable DCC stores with MSAA 4x/8x on GFX10+ 52087ec681f3Smrg- radv: simplify a check when enabling DCC for concurrent images 52097ec681f3Smrg- radv: enable DCC for concurrent images on GFX10 52107ec681f3Smrg- radv: make sure FMASK decompress and FCE are performed on gfx queue 52117ec681f3Smrg- radv: add MSAA support to ClearColorImage() on compute queue 52127ec681f3Smrg- radv: do not clamp framebuffer dimensions to the minimum dimension 52137ec681f3Smrg- radv: add MSAA support to CopyImage() on compute queue 52147ec681f3Smrg- radv: use explicit VRS mode when configuring PA_CL_VRS_CNTL 52157ec681f3Smrg- radv: allow to force VRS rates on GFX10.3 with RADV_FORCE_VRS 52167ec681f3Smrg- radv: fix needed dynamic state for VRS 52177ec681f3Smrg- amd/addrlib: expose HTILE address equations to drivers on GFX10+ 52187ec681f3Smrg- ac/surface: rename ac_surface_dcc_address_test.c 52197ec681f3Smrg- ac/surface: add a test of HtileAddrFromCoord prototype outside of addrlib 52207ec681f3Smrg- ac/surface: rename gfx9_dcc_equation to gfx9_meta_equation 52217ec681f3Smrg- ac/surface: increase gfx9_meta_equation::gfx10_bits by 4 elements 52227ec681f3Smrg- ac/surface: copy the HTILE equations to the surface 52237ec681f3Smrg- ac/surface: implement HtileAddrFromCoord in NIR 52247ec681f3Smrg- ac/surface: store the HTILE pitch to the surface 52257ec681f3Smrg- radv: expose R8_UINT as the only supported format for VRS attachments 52267ec681f3Smrg- radv: do not allow MSAA with fragment shading rate attachments 52277ec681f3Smrg- radv: do not enable DCC for fragment shading rate attachments 52287ec681f3Smrg- radv: determine if attachment VRS is enabled 52297ec681f3Smrg- radv: configure the VRS HTILE encoding size 52307ec681f3Smrg- radv: do not use the whole HTILE buffer for depth when VRS is used 52317ec681f3Smrg- radv: update the HTILE clear word when VRS is used 52327ec681f3Smrg- radv: allow HTILE for very small images if VRS attachment is used 52337ec681f3Smrg- radv: create an image for VRS if no depth/stencil attachment is bound 52347ec681f3Smrg- radv: handle the VRS attachment subpass 52357ec681f3Smrg- radv: bind our internal depth buffer when not provided by the app 52367ec681f3Smrg- radv: add support for copying VRS rates into HTILE 52377ec681f3Smrg- radv: copy VRS rates to HTILE when beginning a subpass 52387ec681f3Smrg- radv: configure the VRS combiners when an attachment is used 52397ec681f3Smrg- radv: advertise attachmentFragmentShadingRate on GFX10.3 52407ec681f3Smrg- ac: add missing BUF_DATA_FORMAT_10_11_11 vertex format on GFX10+ 52417ec681f3Smrg- radv: keep DCC compressed for clears on compute with image stores 52427ec681f3Smrg- aco: fix opquantize2f16 on GFX6-7 52437ec681f3Smrg- radv: fix fast clearing depth-only or stencil-only aspects with HTILE 52447ec681f3Smrg- radv: fix emitting depth bias when beginning a command buffer 52457ec681f3Smrg- radv: fix emitting default depth bounds state on GFX6 52467ec681f3Smrg- radv/winsys: fix allocating the number of CS in the sysmem path 52477ec681f3Smrg- radv/winsys: fix resetting the number of padded IB words 52487ec681f3Smrg- radv: make sure CP DMA is idle before executing secondary command buffers 52497ec681f3Smrg- radv: fix various CMASK regressions on GFX9 52507ec681f3Smrg- radv: fix computation of the number of user SGPRS for NGG GS state 52517ec681f3Smrg- radv: check if DCC is enabled when resolving different levels 52527ec681f3Smrg- radv/winsys: fix executing huge secondary command buffers on GFX6 52537ec681f3Smrg 52547ec681f3SmrgSerge Martin (1): 52557ec681f3Smrg 52567ec681f3Smrg- clover: return CL_INVALID_VALUE when origin or region are NULL 52577ec681f3Smrg 52587ec681f3SmrgSimon Ser (15): 52597ec681f3Smrg 52607ec681f3Smrg- nouveau/nvc0: fix linear buffer alignment for scan-out/cursors 52617ec681f3Smrg- nouveau/nv50: fix linear buffer alignment for scan-out/cursors 52627ec681f3Smrg- frontends/va: extract pipe format to DRM format mapping 52637ec681f3Smrg- frontends/va: add support for VA_EXPORT_SURFACE_COMPOSED_LAYERS 52647ec681f3Smrg- frontends/va: add pipe to DRM format mapping for NV12 and P010 52657ec681f3Smrg- radeonsi/uvd: make format modifiers-aware 52667ec681f3Smrg- egl: use render node for wl_drm if available 52677ec681f3Smrg- gbm: fail early when modifier list only contains INVALID 52687ec681f3Smrg- gbm: remove fprintf calls in gbm_dri_bo_create 52697ec681f3Smrg- egl/wayland: avoid unnecessary roundtrip when authenticated 52707ec681f3Smrg- gbm: add gbm_bo_get_fd_for_plane 52717ec681f3Smrg- egl: fix software flag in _eglAddDevice call on DRM 52727ec681f3Smrg- egl: only take render nodes into account when listing DRM devices 52737ec681f3Smrg- Revert "egl: Don't add hardware device if there is no render node v2." 52747ec681f3Smrg- radv: fix format feature reporting for modifiers 52757ec681f3Smrg 52767ec681f3SmrgSimon Zeni (1): 52777ec681f3Smrg 52787ec681f3Smrg- egl/dri2: enable EGL_WL_bind_wayland_display in EGL device platform 52797ec681f3Smrg 52807ec681f3SmrgStéphane Marchesin (1): 52817ec681f3Smrg 52827ec681f3Smrg- virgl: Add simple disk cache 52837ec681f3Smrg 52847ec681f3SmrgSureshGuttula (3): 52857ec681f3Smrg 52867ec681f3Smrg- va/picture : Added failure check for stability 52877ec681f3Smrg- frontends/va: Update conditional checks for code stability. 52887ec681f3Smrg- frontends/va : Fix memory leaks incase of error returns 52897ec681f3Smrg 52907ec681f3SmrgTamara Schmitz (1): 52917ec681f3Smrg 52927ec681f3Smrg- util: add mesa_glthread for Valheim in OpenGL mode. 52937ec681f3Smrg 52947ec681f3SmrgTapani Pälli (17): 52957ec681f3Smrg 52967ec681f3Smrg- mesa: add GL_SR8_EXT, GL_SRG8_EXT for color/srgb format queries 52977ec681f3Smrg- intel/perf: cleanup, remove duplicate function declaration 52987ec681f3Smrg- intel/perf: introduce additional ralloc context parameter 52997ec681f3Smrg- i965: use aligned malloc for context instead of ralloc 53007ec681f3Smrg- mesa: add check that non base level attachment is mipmap complete 53017ec681f3Smrg- gitlab-ci: bump piglit commit for windows 53027ec681f3Smrg- anv: toggle on sample shading if it is set in the shader 53037ec681f3Smrg- anv/android: fix compilation failure 53047ec681f3Smrg- anv: fix compilation due to missing vk_format_from_android 53057ec681f3Smrg- mesa: check cube completeness for cube fbo attachments 53067ec681f3Smrg- anv/android: fix image creation with external format 53077ec681f3Smrg- android: add some more stub functions for cross compilation 53087ec681f3Smrg- intel/common: disable batch decoder on Android platform 53097ec681f3Smrg- loader: prefer iris on Android 53107ec681f3Smrg- iris: clamp PointWidth in 3DSTATE_SF like i965 does 53117ec681f3Smrg- egl: support no error attribute set to false with ES 1.1 53127ec681f3Smrg- glx: revert "Downgrade sRGB-ful fbconfigs" 53137ec681f3Smrg 53147ec681f3SmrgThong Thai (2): 53157ec681f3Smrg 53167ec681f3Smrg- frontends/va/config: Fix check for packed header config 53177ec681f3Smrg- radeon: Add cropping to encoded H.265 when padding is used 53187ec681f3Smrg 53197ec681f3SmrgTimothee Chabat (1): 53207ec681f3Smrg 53217ec681f3Smrg- llvmpipe: increase PIPE_CAP_MAX_TEXTURE_BUFFER_SIZE value 53227ec681f3Smrg 53237ec681f3SmrgTimothy Arceri (23): 53247ec681f3Smrg 53257ec681f3Smrg- util/disk_cache: do crc32 check on compressed data for ZSTD 53267ec681f3Smrg- util/disk_cache: move cache path strdup call back into disk_cache.c 53277ec681f3Smrg- util/disk_cache: use a new cache dir for the single file cache feature 53287ec681f3Smrg- util/mesa_sha1: add helper to reconvert sha1 hex strings 53297ec681f3Smrg- util/fossilize_db: add basic fossilize db util to read/write shader caches 53307ec681f3Smrg- util/disk_cache: make use of single file cache when env var set 53317ec681f3Smrg- nir: handle negatives in ffma reassociation optimisation 53327ec681f3Smrg- util/disk_cache: fix crash in fossilize_db 53337ec681f3Smrg- util/disk_cache: move cache tests to the util directory 53347ec681f3Smrg- util/disk_cache: make MESA_DISK_CACHE_READ_ONLY_FOZ_DBS a relative path 53357ec681f3Smrg- Revert "glsl: default to compat shaders in compat profile" 53367ec681f3Smrg- glsl: fix declarations of gl_MaxVaryingFloats 53377ec681f3Smrg- util: create some standalone compression helpers 53387ec681f3Smrg- util/disk_cache: make use of the new compression helpers 53397ec681f3Smrg- util/fossilize_db: remove compression from foz db helper 53407ec681f3Smrg- util/compress: make compression function inputs const 53417ec681f3Smrg- util/disk_cache: separate file reads from cache item validation 53427ec681f3Smrg- util/disk_cache: detangle cache item creation from disk writing 53437ec681f3Smrg- util/disk_cache: add cache item headers to single file cache entries 53447ec681f3Smrg- glsl: add compilation errors for attribute and varying qualifiers 53457ec681f3Smrg- glsl: enforce restrictions on builtin functions moved to compat 53467ec681f3Smrg- mesa: fix incomplete GL_NV_half_float implementation 53477ec681f3Smrg- util: disable glthread in CSGO 53487ec681f3Smrg 53497ec681f3SmrgTimur Kristóf (55): 53507ec681f3Smrg 53517ec681f3Smrg- radv: Only enable sparse features on Polaris and newer. 53527ec681f3Smrg- tgsi_to_nir: Fix uniform ranges. 53537ec681f3Smrg- aco: Fix LDS statistics of tess control shaders. 53547ec681f3Smrg- radv/llvm: Fix reporting LDS stats of tess control shaders. 53557ec681f3Smrg- aco: Disallow LSHS temp-only I/O when VS output is written indirectly. 53567ec681f3Smrg- pan/bi: Use correct enum type for NIR intrinsics. 53577ec681f3Smrg- aco: Use ASSERTED to avoid unused variable warning. 53587ec681f3Smrg- intel/compiler: Use assume() instead of assert() for array bounds. 53597ec681f3Smrg- intel/compiler: Make room for maximum dest size in nir_emit_texture. 53607ec681f3Smrg- anv: Use unreachable() in anv_genX. 53617ec681f3Smrg- anv: Use ASSERTED for results that are only used in asserts. 53627ec681f3Smrg- nir: Add new nir_builder helpers for iadd with no_unsigned_wrap. 53637ec681f3Smrg- nir: Add nir_builder helper for I/O address offset calculations. 53647ec681f3Smrg- nir: Add a few more algebraic optimizations to help address calculation. 53657ec681f3Smrg- nir: Fix unsigned upper bound of local_invocation_index for non-CS stages. 53667ec681f3Smrg- nir: Shrink vectors for load_shared. 53677ec681f3Smrg- nir: Add unsigned upper bound for TCS load_invocation_id. 53687ec681f3Smrg- nir: Add default unsigned upper bound configuration. 53697ec681f3Smrg- nir: Add AMD-specific buffer load/store intrinsics. 53707ec681f3Smrg- nir: Add nir_opt_offsets to fold const adds into load/store offsets. 53717ec681f3Smrg- nir: Add tessellation related AMD-specific intrinsics. 53727ec681f3Smrg- nir: Add AMD-specific Geometry Shader related intrinsics. 53737ec681f3Smrg- aco: Implement new buffer load/store intrinsics. 53747ec681f3Smrg- aco: Implement the new tessellation I/O related NIR intrinsics. 53757ec681f3Smrg- aco: Implement new Geometry Shader intrinsics. 53767ec681f3Smrg- ac/llvm: Implement AMD-specific buffer load/store intrinsics. 53777ec681f3Smrg- ac/llvm: Implement the new tessellation intrinsics. 53787ec681f3Smrg- ac/llvm: Implement new Geometry Shader intrinsics. 53797ec681f3Smrg- ac/llvm: Make shared loads/stores work correctly for non-CS stages. 53807ec681f3Smrg- ac/llvm: Make sure to always emit integer comparison for nir_op_ieq. 53817ec681f3Smrg- ac/llvm: Add constant offset to load/store_shared. 53827ec681f3Smrg- ac/llvm: Emit more efficient code for load_shared. 53837ec681f3Smrg- ac: Add NIR passes to lower VS->TCS->TES I/O to memory accesses. 53847ec681f3Smrg- ac: Add NIR passes to lower ES->GS I/O to memory accesses. 53857ec681f3Smrg- radv: Lower IO and set driver locations earlier. 53867ec681f3Smrg- radv: Save I/O usage data to both shader infos for merged stages. 53877ec681f3Smrg- radv: Calculate tess patches and LDS use outside the backend compilers. 53887ec681f3Smrg- radv: Determine tcs_in_out_eq in radv_pipeline instead of the compiler. 53897ec681f3Smrg- radv: Fill some tess shader info earlier. 53907ec681f3Smrg- radv: Reorder some NIR optimizations in preparation for the I/O changes. 53917ec681f3Smrg- radv: Use new, NIR-based I/O lowering. 53927ec681f3Smrg- radv/llvm: Only store TCS outputs where they are really needed. 53937ec681f3Smrg- radv/llvm: Delete superfluous tess and ESGS I/O code. 53947ec681f3Smrg- aco: Delete superfluous tess and ESGS I/O code. 53957ec681f3Smrg- aco: Fix constant address offset calculation for ds_read2 instructions. 53967ec681f3Smrg- ac/llvm: Fix alignment of shared load intrinsics. 53977ec681f3Smrg- aco: Optimize workgroup exclusive scan to better avoid bank conflicts. 53987ec681f3Smrg- aco: Align NGG scratch size to 16 so a single ds_read can always read it. 53997ec681f3Smrg- aco: Remove useless s_setprio near gs_alloc_req. 54007ec681f3Smrg- aco: Use s_setprio 3 at the beginning of every VS and TES. 54017ec681f3Smrg- aco: Extract ngg_nogs_export_prim_id to a separate function. 54027ec681f3Smrg- aco: Set block_kind_export_end in create_vs/fs_exports. 54037ec681f3Smrg- aco: Emit fewer branches for NGG VS/TES with late primitive export. 54047ec681f3Smrg- aco: Add a simple heuristic to decide early or late primitive export. 54057ec681f3Smrg- aco: Mark VCC clobbered for iadd8 and iadd16 reductions on GFX6-7. 54067ec681f3Smrg 54077ec681f3SmrgTomeu Vizoso (17): 54087ec681f3Smrg 54097ec681f3Smrg- ci: Fix selection of linker in Android builds 54107ec681f3Smrg- ci: Move container files into their own dir 54117ec681f3Smrg- ci: Move out expect files from .gitlab-ci 54127ec681f3Smrg- ci: Disable two radeonsi jobs 54137ec681f3Smrg- Revert "ci/panfrost: disable the rest of these jobs temporarily" 54147ec681f3Smrg- Revert "ci/panfrost: Disable t860/radeonsi testing while the runners are struggling." 54157ec681f3Smrg- Revert "CI: Disable Panfrost T760" 54167ec681f3Smrg- ci: Fix visibility property of LAVA jobs 54177ec681f3Smrg- ci/fdo: Use trimmed traces for Valve games 54187ec681f3Smrg- gallium/dri2: Pass the resource that corresponds to the plane 54197ec681f3Smrg- ci: Use a single template for LAVA jobs 54207ec681f3Smrg- ci: Set more reasonable timeouts for LAVA jobs 54217ec681f3Smrg- ci: Don't retry failed test runs 54227ec681f3Smrg- ci: Disable t720 LAVA jobs 54237ec681f3Smrg- Revert "ci: Disable t720 LAVA jobs" 54247ec681f3Smrg- Revert "ci: Disable panfrost g52" 54257ec681f3Smrg- Revert "ci: Disable panfrost t760" 54267ec681f3Smrg 54277ec681f3SmrgTony Wasserka (34): 54287ec681f3Smrg 54297ec681f3Smrg- aco/ra: Update register use bounds before recursing in get_regs_for_copies 54307ec681f3Smrg- aco/ra: Introduce PhysRegInterval helper class 54317ec681f3Smrg- aco/ra: Conservatively refactor existing code to use PhysRegInterval 54327ec681f3Smrg- aco/ra: Remove always-false conditions 54337ec681f3Smrg- aco/ra: Add iterator interface for PhysRegInterval 54347ec681f3Smrg- aco/ra: Use std::find_if(_not) to clean up get_reg_simple 54357ec681f3Smrg- aco/ra: Use std::all_of to simplify a loop 54367ec681f3Smrg- aco/ra: Conservatively refactor get_reg_specified to use PhysRegInterval 54377ec681f3Smrg- aco/ra: Move commonly repeated code to a helper function 54387ec681f3Smrg- aco/ra: Add helpers to test for intersection/containment of reg intervals 54397ec681f3Smrg- aco/ra: Use std::all_of to simplify a loop 54407ec681f3Smrg- aco/ra: Remove unused function parameter 54417ec681f3Smrg- aco/ra: Use PhysReg for member functions of PhysRegInterval 54427ec681f3Smrg- aco/ra: Use PhysReg when indexing into RegisterFile's containers 54437ec681f3Smrg- aco/ra: Use PhysRegInterval for collect_vars parameters 54447ec681f3Smrg- aco/ra: Use PhysRegInterval for count_zero 54457ec681f3Smrg- aco/ra: Fix print_regs using the wrong constant to check for blocked slots 54467ec681f3Smrg- aco/ra: Fix build with print_regs enabled 54477ec681f3Smrg- aco/ra: Remove preprocessor guards for print_regs 54487ec681f3Smrg- aco/ra: Add helper to get a PhysRegInterval for the register demand 54497ec681f3Smrg- aco: Fix vector::reserve() being called with the wrong size 54507ec681f3Smrg- radv: Fix improper max_index_count argument for indexed draws 54517ec681f3Smrg- ac: Add has_zero_index_buffer_bug to ac_gpu_info 54527ec681f3Smrg- radv: Skip 0-sized index buffers only when necessary 54537ec681f3Smrg- aco/ra: Avoid unnecessary copying of std::vectors 54547ec681f3Smrg- aco/isel: Don't emit unsupported i16<->f16 conversion opcodes on GFX6/7 54557ec681f3Smrg- aco/isel: Fix i64/u64->float32 conversion for large inputs 54567ec681f3Smrg- aco/isel: Don't request sign extension when truncating signed integers 54577ec681f3Smrg- aco/isel: Add documentation and asserts for convert_int 54587ec681f3Smrg- aco/isel: Fix large inputs being truncated in int32->f16 conversions 54597ec681f3Smrg- aco/isel: Add documentation for (u)int64->f16 conversion 54607ec681f3Smrg- ci: skip pipeline_barrier tests that currently crash on RADV 54617ec681f3Smrg- gitlab: rename RADV bug report template 54627ec681f3Smrg- aco/spill: Fix improper handling of exec phis 54637ec681f3Smrg 54647ec681f3SmrgVasily Khoruzhick (10): 54657ec681f3Smrg 54667ec681f3Smrg- lima: add precompile debug flag 54677ec681f3Smrg- lima/ppir: don't use list_length() in loop in regalloc and liveness analysis 54687ec681f3Smrg- lima: update dEQP fails and skips lists 54697ec681f3Smrg- lima: relax checks of imported BO 54707ec681f3Smrg- lima: rename \*_shader_state to \*_compiled_shader 54717ec681f3Smrg- lima: rename lima_{fs,vs}_bind_state to lima_{fs,vs}_uncompiled_shader 54727ec681f3Smrg- lima: implement shader disk cache 54737ec681f3Smrg- lima: compute nir_sha1 for shader key even if disk cache is disabled 54747ec681f3Smrg- lima: use passed surface to get mipmap level for reload, not cbuf 54757ec681f3Smrg- lima: limit number of draws per job 54767ec681f3Smrg 54777ec681f3SmrgVinson Lee (31): 54787ec681f3Smrg 54797ec681f3Smrg- panfrost: Fix typos. 54807ec681f3Smrg- nouveau: Fix typos. 54817ec681f3Smrg- nv50/ir: Initialize DataArray members in constructor. 54827ec681f3Smrg- r600/sfn: Remove StoreMerger unused member b. 54837ec681f3Smrg- nv50/ir: Add InsertConstraintsPass constructor. 54847ec681f3Smrg- nv50/ir: Initialize CodeEmitter members in constructor. 54857ec681f3Smrg- nv50/ir: Initialize RegAlloc member func in constructor. 54867ec681f3Smrg- clover: Add constructor for global_argument. 54877ec681f3Smrg- lima: Fix typos. 54887ec681f3Smrg- v3dv: Fix assert. 54897ec681f3Smrg- nvc0/ir: Initialize NVC0LoweringPass member gpEmitAddress. 54907ec681f3Smrg- nvc0/ir: Initialize SchedDataCalculator members in constructor. 54917ec681f3Smrg- nv50/ir: Initialize BindArgumentsPass member sub in constructor. 54927ec681f3Smrg- virgl: Convert errno to string. 54937ec681f3Smrg- r600/sfn: Initialize FragmentShaderFromNir member m_pos_input. 54947ec681f3Smrg- etnaviv: Fix memory leak in etna_vertex_elements_state_create. 54957ec681f3Smrg- nv50/ir: Initialize ValueDef member origin in constructors. 54967ec681f3Smrg- nv50/ir: Initialize Instruction members. 54977ec681f3Smrg- aco: Initialize ds_state.front.writeMask. 54987ec681f3Smrg- r600: Fix typos. 54997ec681f3Smrg- llvmpipe: Fix typos. 55007ec681f3Smrg- nir/lower_tex: Change coord type to int. 55017ec681f3Smrg- gv100/ir: Initialize CodeEmitterGV100 members in constructor. 55027ec681f3Smrg- zink: Remove leftover dead code. 55037ec681f3Smrg- nv50/ir: Add constructor for NV50LegalizePostRA. 55047ec681f3Smrg- iris: Fix typos. 55057ec681f3Smrg- clover: Add constructor for sampler_argument. 55067ec681f3Smrg- ac: Fix emit_split_buffer_store modulus operation. 55077ec681f3Smrg- freedreno: Fix file descriptor leak. 55087ec681f3Smrg- glsl: Initialize parcel_out_uniform_storage members. 55097ec681f3Smrg- Remove leftover dead code. 55107ec681f3Smrg 55117ec681f3SmrgVíctor Manuel Jáquez Leal (1): 55127ec681f3Smrg 55137ec681f3Smrg- frontends/va/context: don't set max_references with num_render_targets 55147ec681f3Smrg 55157ec681f3SmrgWitold Baryluk (3): 55167ec681f3Smrg 55177ec681f3Smrg- lavapipe: Defer lavapipe warning to CreateDevice 55187ec681f3Smrg- util: Use explicit relaxed reads for u_queue 55197ec681f3Smrg- radv: memset the alignment hole in cache_entry to 0 55207ec681f3Smrg 55217ec681f3SmrgXin He (1): 55227ec681f3Smrg 55237ec681f3Smrg- virgl: use atomic operations when increase sub_ctx_id 55247ec681f3Smrg 55257ec681f3SmrgYannik Marek (1): 55267ec681f3Smrg 55277ec681f3Smrg- turnip: fix alpha to coverage in no color and unused attachment cases 55287ec681f3Smrg 55297ec681f3SmrgYevhenii Kharchenko (2): 55307ec681f3Smrg 55317ec681f3Smrg- st/mesa: fix PBO download for TEXTURE_1D_ARRAY textures 55327ec681f3Smrg- intel/compiler: remove unused member 'input_vue_map' 55337ec681f3Smrg 55347ec681f3SmrgYevhenii Kolesnikov (3): 55357ec681f3Smrg 55367ec681f3Smrg- iris: only set point sprite overrides if actually using points 55377ec681f3Smrg- nir/from_ssa: consider defs in sibling blocks 55387ec681f3Smrg- nir/from_ssa: don't check for interference within the same set 55397ec681f3Smrg 55407ec681f3SmrgYiwei Zhang (3): 55417ec681f3Smrg 55427ec681f3Smrg- venus: properly enable WSI for different platforms 55437ec681f3Smrg- venus: bring up Android support 55447ec681f3Smrg- venus: implement vn_debug_init_once with os_get_option 55457ec681f3Smrg 55467ec681f3SmrgYogesh Mohan Marimuthu (7): 55477ec681f3Smrg 55487ec681f3Smrg- ac/rgp: add ac_msgpack.h/c 55497ec681f3Smrg- ac/rgp: add rgp co, col, pso data structures 55507ec681f3Smrg- ac/rgp: add helper function to write rgp elf oject 55517ec681f3Smrg- ac/rgp: expose data structure to populate co, col, pso database 55527ec681f3Smrg- ac/rgp,radeonsi,radv: pass struct thread_trace_data to ac_sqtt_dump_data() 55537ec681f3Smrg- ac/rgp: dump co, col, pso database to rgp profile file 55547ec681f3Smrg- ac/rgp: set gfxip in elf_hdr.e_flags 55557ec681f3Smrg 55567ec681f3Smrgchenli (1): 55577ec681f3Smrg 55587ec681f3Smrg- mesa: update oudated members for debug and check 55597ec681f3Smrg 55607ec681f3Smrgcheyang (3): 55617ec681f3Smrg 55627ec681f3Smrg- frontend/dri: fix doesn't support RGBA ordering still expose RGBA in config 55637ec681f3Smrg- glsl: redeclare built-in variable with separate shader 55647ec681f3Smrg- virgl: add astc 2d compressed formats 5565