17ec681f3SmrgMesa 21.1.0 Release Notes / 2021-05-05
27ec681f3Smrg======================================
37ec681f3Smrg
47ec681f3SmrgMesa 21.1.0 is a new development release. People who are concerned
57ec681f3Smrgwith stability and reliability should stick with a previous release or
67ec681f3Smrgwait for Mesa 21.1.1.
77ec681f3Smrg
87ec681f3SmrgMesa 21.1.0 implements the OpenGL 4.6 API, but the version reported by
97ec681f3SmrgglGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
107ec681f3SmrgglGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
117ec681f3SmrgSome drivers don't support all the features required in OpenGL 4.6. OpenGL
127ec681f3Smrg4.6 is **only** available if requested at context creation.
137ec681f3SmrgCompatibility contexts may report a lower version depending on each driver.
147ec681f3Smrg
157ec681f3SmrgMesa 21.1.0 implements the Vulkan 1.2 API, but the version reported by
167ec681f3Smrgthe apiVersion property of the VkPhysicalDeviceProperties struct
177ec681f3Smrgdepends on the particular driver being used.
187ec681f3Smrg
197ec681f3SmrgSHA256 checksum
207ec681f3Smrg---------------
217ec681f3Smrg
227ec681f3Smrg::
237ec681f3Smrg
247ec681f3Smrg    0128f10e22970d3aed3d1034003731f94623015cd9797c07151417649c1b1ff8  mesa-21.1.0.tar.xz
257ec681f3Smrg
267ec681f3Smrg
277ec681f3SmrgNew features
287ec681f3Smrg------------
297ec681f3Smrg
307ec681f3Smrg- VK_KHR_workgroup_memory_explicit_layout on Intel, RADV
317ec681f3Smrg
327ec681f3Smrg- DRM format modifiers for AMD.
337ec681f3Smrg
347ec681f3Smrg- VK_KHR_zero_initialize_workgroup_memory on Intel, RADV
357ec681f3Smrg
367ec681f3Smrg- Zink exposes GL 4.6 and ES 3.1
377ec681f3Smrg
387ec681f3Smrg- GL_EXT_depth_bounds_test on softpipe, zink
397ec681f3Smrg
407ec681f3Smrg- GL_EXT_texture_filter_minmax on nvc0 (gm200+)
417ec681f3Smrg
427ec681f3Smrg- GL_ARB_texture_filter_minmax on nvc0 (gm200+)
437ec681f3Smrg
447ec681f3Smrg- GL_ARB_post_depth_coverage on zink
457ec681f3Smrg
467ec681f3Smrg- VK_KHR_copy_commands2 on lavapipe
477ec681f3Smrg
487ec681f3Smrg- lavapipe exposes Vulkan 1.1
497ec681f3Smrg
507ec681f3Smrg- VRS attachment on RADV
517ec681f3Smrg
527ec681f3Smrg- None
537ec681f3Smrg
547ec681f3Smrg
557ec681f3SmrgBug fixes
567ec681f3Smrg---------
577ec681f3Smrg
587ec681f3Smrg- No sRGB capable visuals/fbconfigs reported in glx
597ec681f3Smrg- Graphics corruption and GPU hang with RADV/LLVM
607ec681f3Smrg- old kernels (4.19) support in radv
617ec681f3Smrg- Elite Dangerous: Odyssey alpha crashes GPU on launch
627ec681f3Smrg- CSGO: Some default variables can cause problems with trust mode
637ec681f3Smrg- mesa git started to break wine + UnrealTournament.exe (old dx6 game)
647ec681f3Smrg- SuperTuxKart artifacting on RK3399
657ec681f3Smrg- [amdgpu]: Golf With Your Friends (431240): ERROR Waiting for fences timed out
667ec681f3Smrg- Strange results when trying to read from VK_FORMAT_R64_SFLOAT in compute shader
677ec681f3Smrg- anv: dEQP-VK.binding_model.buffer_device_address.set3.depth3.basessbo.convertcheck* slow
687ec681f3Smrg- Iris doesn't support INTEL_performance_query anymore
697ec681f3Smrg- RADV: TRUNC_COORD breaks gather operations
707ec681f3Smrg- [RADV] corruption in avatar after dying in Heroes of the Storm
717ec681f3Smrg- Metro Exodus crashing due to memory overflow
727ec681f3Smrg- Sauerbraten shader rendering broken on RV530 (r300g)
737ec681f3Smrg- texture glitches on CS:GO on Tiger Lake
747ec681f3Smrg- Build fail due to "parameter name omitted" on Gallium Nine
757ec681f3Smrg- Non-DRI builds broken by recent cleanups in Mesa core
767ec681f3Smrg- Cinnamon core dump after installing latest oibaf mesa build (165a69d2)
777ec681f3Smrg- yuv sampler lowering regression
787ec681f3Smrg- anv: anv_descriptor_set_binding_layout::array_size overflows u16
797ec681f3Smrg- RADV - Vertex explosion in DIRT 5 on RDNA2
807ec681f3Smrg- ci: Use renderdoc from debian
817ec681f3Smrg- ci: Use debian apitrace in x86 images
827ec681f3Smrg- SIGSEV in v3d_emit_gl_shader_state
837ec681f3Smrg- Xorg crash due to assertion failure after GPU soft reset
847ec681f3Smrg- AMD hevc_vaapi ffmpeg encoding = wrong image width (48px black bar on the right)
857ec681f3Smrg- panfrost: Page fault in glamor when running GIMP with X11 on Mali T860
867ec681f3Smrg- gallium: python trace scripts need updating
877ec681f3Smrg- EGL context creation fails when EGL_KHR_create_context_no_error is mentioned for OpenGL ES 1.1.
887ec681f3Smrg- [spirv-fuzz] NIR validation failed after spirv_to_nir: error: nir_block_dominates
897ec681f3Smrg- [bisected][regression][i965,iris] dEQP-VK.clipping.user_defined.clip_cull_distance.* failing on multiple platforms
907ec681f3Smrg- No Mans Sky GPU hang on Radeon ACO
917ec681f3Smrg- radeonsi: prusa-slicer crashes on mesa 21
927ec681f3Smrg- anv: dEQP-VK.subgroups.ballot_broadcast.compute.subgroupbroadcast_i8vec3_requiredsubgroupsize16 fails on ICL, TGL
937ec681f3Smrg- [radeonsi] Rendering of Firefox UI and website content corrupts randomly and after window geometry changes
947ec681f3Smrg- max_binding inconsistency in vulkan descriptor set drivers
957ec681f3Smrg- anv: conservative rasterization ext question mark
967ec681f3Smrg- Unigine Tropics MSAA failure
977ec681f3Smrg- warning: xnack 'Off' was requested for a processor that does not support it! [AMD VEGAM with LLVM 12.0.0]
987ec681f3Smrg- Compiling some ARB assembly shaders leads to memory corruption
997ec681f3Smrg- Specifying an explicit location for an array output messes up transform feedback
1007ec681f3Smrg- Request for VK_EXT_conservative_rasterization support on Anvil Intel driver for newest DXVK..
1017ec681f3Smrg- radv: dEQP regressions after addrlib update
1027ec681f3Smrg- Up to 30% performance drop (GLBenchmark, GfxBench)
1037ec681f3Smrg- DOTA 2 don't no longer starts since commit ad241b15a9e517dd4c4e8d7b1d5dab7c3a74b37c
1047ec681f3Smrg- Clover doesn't work for kmsro drivers
1057ec681f3Smrg- aco_tests isel.sparse.clause fails with llvm-12
1067ec681f3Smrg- util cpu detection breaks on 128-core AMD machines
1077ec681f3Smrg- util cpu detection breaks on 128-core AMD machines
1087ec681f3Smrg- Default GL_MAX_TEXTURE_BUFFER_SIZE very small
1097ec681f3Smrg- intel_nullhw.c:41:38: error: field ‘vtable’ has incomplete type
1107ec681f3Smrg- ACO error with GCN 1 GPU
1117ec681f3Smrg- kmsro advertises EGL_MESA_device_software
1127ec681f3Smrg- d3d12: Use ID3D12Device9::CreateCommandQueue1 when available
1137ec681f3Smrg- [RADV] Halo: The Master Chief Collection: Crash in Halo Reach Firefight
1147ec681f3Smrg- freedreno: use SAMPLE_COUNT to autotune sysmem vs gmem
1157ec681f3Smrg- freedreno: draw_vbo optimizations
1167ec681f3Smrg- [Bisected][RadeonSI] Mesa crashes when rendering with Eevee in Blender
1177ec681f3Smrg- subgroupBallotFindMSB() broken in RADV/ACO 20.3.4
1187ec681f3Smrg- nir_print: util_cpu_detect() is not called prior to _mesa_half_to_float()
1197ec681f3Smrg- turnip: buffer overflow read on dEQP-VK.ycbcr.query.levels.tess_eval.r8g8b8a8_unorm
1207ec681f3Smrg- RuneScape crashes GLOn12
1217ec681f3Smrg- d3d12: Surfaces need to use shareable descriptors
1227ec681f3Smrg- [RADV][RDNA2] Red Dead Redemption 2 image glitches during menu/overlay menu transitions
1237ec681f3Smrg- "unknown intrinsic" assertion triggered by multiview shader in non-multiview renderpass in Vulkan on intel
1247ec681f3Smrg- [i965][g965,ilk,g33,g45][bisected] dEQP-GLES2.functional.fbo.completeness.attachment_combinations.* failures
1257ec681f3Smrg- radv: VK_FORMAT_FEATURE_SAMPLED_IMAGE_BIT exposed for VK_FORMAT_R64_SFLOAT
1267ec681f3Smrg- anv: android building error after commit 4fb6c05
1277ec681f3Smrg- Compiling mesa with -Dtools=all throws deprecation warnings for intel tools
1287ec681f3Smrg- DXVK is broken in latest master
1297ec681f3Smrg- OpenCLOn12: Affinity Photo CL kernels produce invalid DXIL phis
1307ec681f3Smrg- nir -> tgsi conversion problem
1317ec681f3Smrg- [i965][g965,ilk,g33][bisected] fp16 enablement causes deqp test failures
1327ec681f3Smrg- mesa/st: Uniforms are not updated after lowering alpha test
1337ec681f3Smrg- [i965][bisected][regression] piglit failing primitive-restart-vbo_combined_vertex_and_index on multiple platforms
1347ec681f3Smrg- [RADV] Nioh 2 - The Complete Edition: "Bloom" on lights
1357ec681f3Smrg- [RADV] Oblivion: Poor Performance while MSAA Is Enabled
1367ec681f3Smrg- lima context state bugs with shader compile
1377ec681f3Smrg- [RADV][BISECTED] The Surge 2 (644830) - In-game assets do not render correctly since 20.3.4.
1387ec681f3Smrg- amd clang cannot convert ‘llvm::AtomicOrdering’ to ‘llvm::MaybeAlign’ build failure
1397ec681f3Smrg- [iris][icl,tgl][bisected][regression] failure on piglit.spec.arb_separate_shader_objects.programuniform coverage
1407ec681f3Smrg- opencl build fail
1417ec681f3Smrg- anv: dEQP-VK.glsl.builtin_var.fragcoord_msaa.* fails
1427ec681f3Smrg- Request - depth format feature SAMPLED_IMAGE_FILTER_LINEAR
1437ec681f3Smrg- "radeonsi: Check pitch and offset for validity." is a bad commit
1447ec681f3Smrg- Add OpenCL information to docs/features.txt
1457ec681f3Smrg- [regression] [bisected] piglit.spec.arb_framebuffer_object.fbo-drawbuffers-none gldrawpixels fails
1467ec681f3Smrg- RADV: robustBufferAccessUpdateAfterBind is not exposed
1477ec681f3Smrg- debug build compilation failed: inlining failed in call to ‘always_inline’ ‘_nir_visit_dest_indirect’: indirect function call with a yet undetermined callee
1487ec681f3Smrg- [RADV/DXVK] Shadow artifacts with different games
1497ec681f3Smrg- glxgears segfaults with classic i915
1507ec681f3Smrg- mesa_glthread=true Black Mesa
1517ec681f3Smrg- freedreno: rendering corruption in dead cells
1527ec681f3Smrg- ac/rgp: Android building error after commit 12515d6
1537ec681f3Smrg- d3d12: Assert failures & crashes on latest master
1547ec681f3Smrg- RADV/ACO - DCC causing garbled output on RX570
1557ec681f3Smrg- draw.c:121: _mesa_set_draw_vao: Assertion \`vao->_EnabledWithMapMode == _mesa_vao_enable_to_vp_inputs(vao->_AttributeMapMode, vao->Enabled)' failed.
1567ec681f3Smrg- ANV: Weird jitter in Witcher 1
1577ec681f3Smrg- RADV - Path of Exile: Shimmering outlines where water and other objects meet
1587ec681f3Smrg- ANV: Weird jitter in Witcher 1
1597ec681f3Smrg- ANV: Weird jitter in Witcher 1
1607ec681f3Smrg- meson: meson-built libraries have inconsistent compatability / current versions compared to older autotools-built libraries
1617ec681f3Smrg- device select layer breaks other layers
1627ec681f3Smrg- RADV: Extreme overhead in vkQueueSubmit
1637ec681f3Smrg- Graphical glitch of popupping missing texture on Mesa version >18.0.5 (Padoka Stable + Unstable/Oibaf/ubuntu-x-swat PPAs)
1647ec681f3Smrg- [regression] [bisected] dEQP-GLES2.functional.fbo.render.stencil_clear.rbo_rgb5_a1_stencil_index8 fails
1657ec681f3Smrg- occasional corruption issue with RADV in multiple games, disappears after using amdvlk
1667ec681f3Smrg- panfrost T860 regression
1677ec681f3Smrg- OpenGL on GMA4500MHD
1687ec681f3Smrg- piglit-replay: JUnit file contains wrong links to the tracie dashboard
1697ec681f3Smrg- R8 texture upload / corruption bug on Radeon RX 5700 XT
1707ec681f3Smrg- Ambient Occlusion in Two Point Hospital shows black spot artifacts
1717ec681f3Smrg- freedreno: async background shader compile
1727ec681f3Smrg- AMD VAAPI encoding has ceased to work
1737ec681f3Smrg- Rage 2: Visual corruption on in-game menu with ACO.
1747ec681f3Smrg- ACO doesn't correctly render map in Borderlands 3 vs. LLVM on 5700 XT
1757ec681f3Smrg- Invalid shader under panfrost/wayland
1767ec681f3Smrg- Strange Brigade refuses to load correctly since some recent commits
1777ec681f3Smrg- GLonD3D12: Crashes and suboptimal fallback
1787ec681f3Smrg- GLonD3D12: Crashes and suboptimal fallback
1797ec681f3Smrg- GLonD3D12: Crashes and suboptimal fallback
1807ec681f3Smrg- [RADV][REGRESSION][BISECTED] radv_GetMemoryFdPropertiesKHR returns no valid memory types for vaapi drmbuf
1817ec681f3Smrg- anv: vkQueueSubmit with waitSemaphore value of 0 hangs CPU
1827ec681f3Smrg- ttn: invalid base/range triggering nir_validate assertion
1837ec681f3Smrg- Sampling with mipmapped HiZ behaves unexpectedly on Gen9
1847ec681f3Smrg- zink: ARB_map_buffers issues on CI
1857ec681f3Smrg- u_upload_mgr: assert failure for large uploads
1867ec681f3Smrg- [RADV][ACO] Overwatch game crash: amd/compiler/aco_insert_exec_mask.cpp: Failed Assertion
1877ec681f3Smrg- PRIME render offloading broken
1887ec681f3Smrg- Use out encoding for float immediates
1897ec681f3Smrg- [RADV] Severe performance drop when exceeding VRAM compared to AMDVLK
1907ec681f3Smrg- LIBGL_ALWAYS_SOFTWARE=1 picks zink over actual software rasterizers
1917ec681f3Smrg- crash/assert in fd_set_viewport_states
1927ec681f3Smrg- RADV: Occlusion query hangs Big Navi GPU
1937ec681f3Smrg- "mesa: don't allocate matrices with malloc" cause eglCreateContext problem on android 7.
1947ec681f3Smrg- Metal Gear Solid V: The Phantom Pain: texture issues and vertex stretches
1957ec681f3Smrg- [iris and Navi 10] piglit.spec.arb_multi_draw_indirect.arb_draw_elements_base_vertex-multidrawelements -indirect regression
1967ec681f3Smrg- miscompiled compute shader loop on llvmpipe (and Iris)
1977ec681f3Smrg- ci: minio caching of arm64 artifacts for bare-metal
1987ec681f3Smrg- Graphics glitches after upgrade to mesa 20.3 on Khadas VIM3 Pro (Mali G52 GPU)
1997ec681f3Smrg- glthread crash in _mesa_glthread_upload
2007ec681f3Smrg- freedreno piglit flakes
2017ec681f3Smrg- RADV: NonUniform OpArrayLength on SSBO ignores NonUniform.
2027ec681f3Smrg- Iris driver causing graphics glitch in QEMU spice egl DMA-BUF
2037ec681f3Smrg- [RADV/ACO] Death Stranding cause a GPU hung (\*ERROR* Waiting for fences timed out!)
2047ec681f3Smrg- [TGL] Elder Scrolls Online misrenders
2057ec681f3Smrg- [ANV] System hang with GRVK demos
2067ec681f3Smrg- ci: Fractional deqp runs with valgrind enabled.
2077ec681f3Smrg- Regression: Segfault in cso_destroy_context() regression in 20.2
2087ec681f3Smrg- Rendering artifacts in Barn Finders specifically on Radeon Vega
2097ec681f3Smrg- Graphics regression in Assassins Creed Odyssey
2107ec681f3Smrg- [ANV] Compilation warnings
2117ec681f3Smrg- regression in !8152
2127ec681f3Smrg- [bdw][icl][iris] fails new test \`clearbuffer-depth-cs-probe`
2137ec681f3Smrg- ci: new traces runner needs dashboard links in the job log and junit
2147ec681f3Smrg- zink: car model corruption with game TORCS
2157ec681f3Smrg- glGetInternalformati64v(GL_TEXTURE_2D, GL_SR8_EXT, GL_COLOR_ENCODING) returns GL_NONE
2167ec681f3Smrg- Windows: 32-bit build is broken hard
2177ec681f3Smrg- ANV: Not handling separate stencil layouts properly
2187ec681f3Smrg- [Regression][Intel][OpenGL][Bisected] Copying whole 2D array texture failed on latest driver
2197ec681f3Smrg- turnip: dEQP-VK.tessellation.invariance.outer_triangle_set.quads_fractional_odd_spacing failure
2207ec681f3Smrg- i915 regressions bisected to "vbo/dlist: use a shared index buffer"
2217ec681f3Smrg- intel: Chrome OS "hatch" (cometlake) fails on dEQP-GLES31.functional.geometry_shading.layered.render_with_default_layer_cubemap
2227ec681f3Smrg- radv: dEQP-VK.sparse_resources.* failures on GFX9
2237ec681f3Smrg- freedreno: rendering corruption in War Robots
2247ec681f3Smrg- radv: dEQP-VK.sparse_resources.* failures on GFX9
2257ec681f3Smrg- Mesa 20.3.x crashes pidgin on AMD RX480
2267ec681f3Smrg- timespec_get used unconditionally / build fails when targeting macOS 10.14 or earlier
2277ec681f3Smrg- libunwind not located / used on macOS
2287ec681f3Smrg- libunwind not located / used on macOS
2297ec681f3Smrg- meson fails to locate libexpat on macOS
2307ec681f3Smrg- CarX Drift Racing 2 fails to start
2317ec681f3Smrg- Some games using FNA framework show blank screen
2327ec681f3Smrg- Intel Vulkan regression of angle_end2end_tests
2337ec681f3Smrg- Defer lavapipe warning to queue / command / swapchain buffer creation
2347ec681f3Smrg
2357ec681f3Smrg
2367ec681f3SmrgChanges
2377ec681f3Smrg-------
2387ec681f3Smrg
2397ec681f3SmrgAaron Watry (2):
2407ec681f3Smrg
2417ec681f3Smrg- clover: implement CL_IMAGE_ELEMENT_SIZE
2427ec681f3Smrg- clover: implement CL_IMAGE_NUM_MIP_LEVELS and CL_IMAGE_NUM_SAMPLES
2437ec681f3Smrg
2447ec681f3SmrgAbhishek Kumar (2):
2457ec681f3Smrg
2467ec681f3Smrg- intel: change urb max shader geometry for CML GT1
2477ec681f3Smrg- intel: change urb max shader geometry for KBL GT1
2487ec681f3Smrg
2497ec681f3SmrgAdam Jackson (66):
2507ec681f3Smrg
2517ec681f3Smrg- docs: Mark some non-core zink extensions complete
2527ec681f3Smrg- docs: Mark some ES3 zink features complete
2537ec681f3Smrg- egl: Fix error string returned by eglQueryDeviceAttribEXT
2547ec681f3Smrg- zink: Factor out instance setup a bit more
2557ec681f3Smrg- osmesa: Pacify MSVC in the test code
2567ec681f3Smrg- glx: Fix GLX_SGI_video_sync for the no-current-drawable case
2577ec681f3Smrg- nir: Silence a warning at -Og
2587ec681f3Smrg- softfloat: Silence a warning at -Og
2597ec681f3Smrg- glsl: Silence a warning at -Og
2607ec681f3Smrg- glsl: Silence some warnings at -Og
2617ec681f3Smrg- glsl: Silence a warning at -Og
2627ec681f3Smrg- loader: Silence a warning at -Og
2637ec681f3Smrg- gallivm: Silence a warning at -Og
2647ec681f3Smrg- nir/ttn: Silence some warnings at -Og
2657ec681f3Smrg- vl: Silence a warning at -Og
2667ec681f3Smrg- gallivm: Silence a warning at -Og
2677ec681f3Smrg- nouveau: Silence some warnings at -Og
2687ec681f3Smrg- nouveau: Silence a warning at -Og
2697ec681f3Smrg- xlib: Fix build regression since 99e25d183d9
2707ec681f3Smrg- gallium/xlib: Partial fix for glXCopySubBufferMESA
2717ec681f3Smrg- mesa: Store depth bounds test bounds as GLclampd
2727ec681f3Smrg- softpipe: Fix depth comparison with float Z formats
2737ec681f3Smrg- softpipe: Implement GL_EXT_depth_bounds_test
2747ec681f3Smrg- docs: Document GL_EXT_depth_bounds_test
2757ec681f3Smrg- zink: Enable GL_EXT_depth_bounds_test
2767ec681f3Smrg- zink: more and better debug printfs
2777ec681f3Smrg- zink: Fix a thinko in instance setup
2787ec681f3Smrg- zink: Wire up ARB_post_depth_coverage
2797ec681f3Smrg- glx: Pull use_x_font out of the context vtable
2807ec681f3Smrg- glx: Pull get_proc_address out of the context vtable
2817ec681f3Smrg- glx: Remove windows' stub {bind,release}_text_image context hooks
2827ec681f3Smrg- glx/drisw: Implement WaitX and WaitGL
2837ec681f3Smrg- dri: Explicitly handle all the config attributes
2847ec681f3Smrg- dri: Fold attribMap into the code
2857ec681f3Smrg- mesa: Remove misc pbuffer attributes from struct gl_config
2867ec681f3Smrg- mesa: Remove the texture-from-pixmap state from struct gl_config
2877ec681f3Smrg- mesa: Remove transparency state from struct gl_config
2887ec681f3Smrg- mesa: Remove unused gl_config::level
2897ec681f3Smrg- mesa: Remove the pretense of aux buffer support
2907ec681f3Smrg- mesa: Stop tracking visual rating in gl_config
2917ec681f3Smrg- mesa: Remove redundant gl_config::sampleBuffers
2927ec681f3Smrg- ci: Bump the llvmpipe test timeout to 240 seconds
2937ec681f3Smrg- mesa/st: Remove unused ST_ATTACHMENT_SAMPLE
2947ec681f3Smrg- mesa/st: Check for successful framebuffer allocation in st_api_make_current
2957ec681f3Smrg- gallium: Remove curious st_visual::no_config
2967ec681f3Smrg- radeon: Exchange one curious idiom for another in radeonMakeCurrent
2977ec681f3Smrg- mesa: Remove unused _mesa_create_framebuffer
2987ec681f3Smrg- mesa: Make _mesa_initialize_visual return void
2997ec681f3Smrg- mesa: Remove unused gl_config::mutableRenderBuffer
3007ec681f3Smrg- mesa: gl_config::rgbBits should count alphaBits too
3017ec681f3Smrg- dri: Don't tie the accum buffer's alpha-ness to the color buffer's
3027ec681f3Smrg- glx: Stop pretending to validate the pbuffer fbconfig attributes
3037ec681f3Smrg- glx: Don't downgrade the visual caveat from the server
3047ec681f3Smrg- glx: Downgrade aux-buffer-ful fbconfigs
3057ec681f3Smrg- glx: Downgrade tfp mipmap-capable fbconfigs
3067ec681f3Smrg- glx: Downgrade sRGB-ful fbconfigs
3077ec681f3Smrg- dri: Use __DRI_BUFFER_COUNT consistently internally
3087ec681f3Smrg- glx: Default sRGBCapable in the same place as the other config attribs
3097ec681f3Smrg- glx: Clean up fbconfig attribute handling
3107ec681f3Smrg- glx: Remove some #if 0'd DRI config attribute fetch
3117ec681f3Smrg- glx: Don't pointlesly add -D_REENTRANT to libGL's cflags
3127ec681f3Smrg- glx: Move {Bind,Release}TexImage from context to screen vtable
3137ec681f3Smrg- glx: Be more robust against null fbconfigs
3147ec681f3Smrg- glx: Lift sending the MakeCurrent request to top-level code
3157ec681f3Smrg- Revert "glx: Lift sending the MakeCurrent request to top-level code"
3167ec681f3Smrg- gallium/xlib: Fix for recent gl_config changes
3177ec681f3Smrg
3187ec681f3SmrgAdrian Ratiu (1):
3197ec681f3Smrg
3207ec681f3Smrg- docs: docker: minor stale documentation fix
3217ec681f3Smrg
3227ec681f3SmrgAlejandro Piñeiro (36):
3237ec681f3Smrg
3247ec681f3Smrg- v3dv/pipeline: enable lower_add_sat NIR option
3257ec681f3Smrg- v3d/compiler: enable lower_add_sat NIR option
3267ec681f3Smrg- v3dv/descriptor: assert CrateDescriptorPool receives valid count values
3277ec681f3Smrg- v3dv: drop v3dv_resource definition
3287ec681f3Smrg- v3dv: properly handle two different binding points for cmd_buffers
3297ec681f3Smrg- v3dv: move to subclassing instance/physical device
3307ec681f3Smrg- v3dv: remove reference to v3dv_instance on v3dv_physical_device
3317ec681f3Smrg- v3dv: port to using common dispatch code.
3327ec681f3Smrg- v3dv: support for depthBiasClamp
3337ec681f3Smrg- v3dv/device: clarify that we can't expose textureCompressionBC
3347ec681f3Smrg- v3dv/formats: expose support for BC1-3 compressed formats
3357ec681f3Smrg- v3dv/meta_copy: get tlb compatible BC compressed formats for copies
3367ec681f3Smrg- v3dv/descriptor_set: don't free individual set if not allowed
3377ec681f3Smrg- v3dv: avoid some maybe-uninitialized warnings
3387ec681f3Smrg- v3dv/pipeline_cache: add more details when dumping debug info
3397ec681f3Smrg- v3dv/pipeline: remove pregenerate_variant
3407ec681f3Smrg- v3dv/pipeline: remove pipeline->use_push_constants
3417ec681f3Smrg- broadcom/compiler: add local_size in v3d_compute_prog_data
3427ec681f3Smrg- broadcom/compiler: add driver_location_map at vs prog data
3437ec681f3Smrg- v3dv/pipeline: use driver_location_map instead of nir utilities
3447ec681f3Smrg- v3dv/pipeline: move topology to pipeline
3457ec681f3Smrg- v3dv/pipeline: remove compiled_variant_count field
3467ec681f3Smrg- v3dv/pipeline: remove v3d_key from shader_variant and pipeline stage
3477ec681f3Smrg- v3dv: define broadcom shader stages
3487ec681f3Smrg- v3dv/pipeline: use broadcom_shader_stage as pipeline/variant stage type
3497ec681f3Smrg- v3dv/pipeline: try to get the shader variant directly from the cache
3507ec681f3Smrg- v3dv/pipeline: don't create a variant if compilation failed
3517ec681f3Smrg- v3dv/pipeline: compute sha1 for no-op fragment shaders correctly
3527ec681f3Smrg- v3dv/device: avoid unused-result warning with asprintf
3537ec681f3Smrg- v3dv: Add support for the on-disk shader cache
3547ec681f3Smrg- v3dv/cmd_buffer: return early for draw commands if there is nothing to draw
3557ec681f3Smrg- v3dv: define a default attribute values with float type
3567ec681f3Smrg- vulkan: track number of bindings instead of max binding for CreateDescriptorSetLayout
3577ec681f3Smrg- v3dv/device: do not compute per-pipeline limits multiplying per-stage
3587ec681f3Smrg- v3dv/device: fix and cleanup v3dv limits
3597ec681f3Smrg- v3dv/pipeline: reduce descriptor_map size
3607ec681f3Smrg
3617ec681f3SmrgAlexander Kapshuk (1):
3627ec681f3Smrg
3637ec681f3Smrg- frontends/va/image: Eliminate repetitive code on error paths
3647ec681f3Smrg
3657ec681f3SmrgAlexander Shi (1):
3667ec681f3Smrg
3677ec681f3Smrg- mesa: texparam: Add a clamping macro to handle out-of-range floats returned as integers.
3687ec681f3Smrg
3697ec681f3SmrgAlexander von Gluck IV (1):
3707ec681f3Smrg
3717ec681f3Smrg- egl/haiku: Fix ConfigID naming inline with mesa
3727ec681f3Smrg
3737ec681f3SmrgAlyssa Rosenzweig (345):
3747ec681f3Smrg
3757ec681f3Smrg- pan/bi: Fix assertion
3767ec681f3Smrg- pan/bi: Pipe scratch_size in from NIR
3777ec681f3Smrg- pan/bi: Fix 64-bit SSBO addresses
3787ec681f3Smrg- pan/bi: Fix RA of node 0
3797ec681f3Smrg- pan/bi: Fix printing of node 0
3807ec681f3Smrg- panfrost: Fix TLS sizing if cores are missing
3817ec681f3Smrg- panfrost: Allow waiting on slots 6/7 during preload
3827ec681f3Smrg- pan/bi: Add internal debug flag
3837ec681f3Smrg- pan/bi: Validate format 12 tuple count in disasm
3847ec681f3Smrg- pan/bi: Print FAU index in verbose mode
3857ec681f3Smrg- pan/bi: Refactor PC-relative printing
3867ec681f3Smrg- pan/bi: Lint for infinite loops
3877ec681f3Smrg- pan/bi: Print disasm/stats with DEBUG=internal
3887ec681f3Smrg- pan/bi: Fix IDLE register mode packing
3897ec681f3Smrg- pan/bi: Fix staging register packing
3907ec681f3Smrg- pan/bi: Fix dependency wait calculation
3917ec681f3Smrg- pan/bi: Fix M1/M2 decoding in disassembler
3927ec681f3Smrg- pan/bi: Pull out bi_count_read_registers helper
3937ec681f3Smrg- pan/bi: Move bi_next_clause to bir.c
3947ec681f3Smrg- pan/bi: Pass through wait_{6, 7} flags
3957ec681f3Smrg- pan/bi: Add dead branch elimination pass
3967ec681f3Smrg- pan/bi: Add "soft" mode to DCE
3977ec681f3Smrg- pan/bi: Add bi_{before,after}_clause cursors
3987ec681f3Smrg- pan/bi: Add bi_foreach_clause_in_block_rev
3997ec681f3Smrg- pan/bi: Add bi_foreach_instr_in_tuple helper
4007ec681f3Smrg- pan/bi: Add bi_foreach_instr_in_clause iterators
4017ec681f3Smrg- pan/bi: Add destination iterator macro
4027ec681f3Smrg- pan/bi: Don't open code bi_foreach_dest
4037ec681f3Smrg- pan/bi: Permit multiple destinations in RA
4047ec681f3Smrg- pan/bi: Add interference per clause
4057ec681f3Smrg- pan/bi: Implement spilling at the clause-level
4067ec681f3Smrg- pan/bi: Don't fill garbage
4077ec681f3Smrg- pan/bi: Add CUBEFACE pseudoinstruction
4087ec681f3Smrg- pan/bi: Print multiple destinations if needed
4097ec681f3Smrg- pan/bi: Move init_builder to common code
4107ec681f3Smrg- pan/bi: Add "word equivalence" relation for index
4117ec681f3Smrg- pan/bi: Stub out scheduler unit test
4127ec681f3Smrg- pan/bi: Factor nir_function_impl out of the context
4137ec681f3Smrg- pan/bi: Add bi_can_{fma, add} predicates
4147ec681f3Smrg- pan/bi: Annotate ISA.xml with 'last' parameter
4157ec681f3Smrg- pan/bi: Pipe last flag into opcode tables
4167ec681f3Smrg- pan/bi: Add bi_must_last predicate
4177ec681f3Smrg- pan/bi: Add bi_must_message predicate
4187ec681f3Smrg- pan/bi: Label table instructions
4197ec681f3Smrg- pan/bi: Emit branch and table bits in opcode table
4207ec681f3Smrg- pan/bi: Add various read predicates
4217ec681f3Smrg- pan/bi: Unit test bi_can_{fma, add}
4227ec681f3Smrg- pan/bi: Test bi_must_last
4237ec681f3Smrg- pan/bi: Test bi_must_message
4247ec681f3Smrg- pan/bi: Test read predicates
4257ec681f3Smrg- pan/bi: Move bi_constants to bifrost.h
4267ec681f3Smrg- pan/bi: Use canonical terminology for tuple
4277ec681f3Smrg- pan/bi: Use enum bifrost_message_type
4287ec681f3Smrg- pan/bi: Clarify tuple comment
4297ec681f3Smrg- pan/bi: Amend misleading comment
4307ec681f3Smrg- pan/bi: Pack multiple tuples in-memory
4317ec681f3Smrg- pan/bi: Add clause encodings as a table
4327ec681f3Smrg- pan/bi: Move bi_packed_tuple to compiler.h
4337ec681f3Smrg- pan/bi: Add bi_pack_literal
4347ec681f3Smrg- pan/bi: Add bi_pack_upper
4357ec681f3Smrg- pan/bi: Add bi_pack_tuple_bits
4367ec681f3Smrg- pan/bi: Add bi_pack_sync
4377ec681f3Smrg- pan/bi: Add tuple/embedded constant pack
4387ec681f3Smrg- pan/bi: Add subword 5/6 pack
4397ec681f3Smrg- pan/bi: Add subword 4 or 7 pack
4407ec681f3Smrg- pan/bi: Add pack_format helper
4417ec681f3Smrg- pan/bi: Calculate pos for constant packing
4427ec681f3Smrg- pan/bi: Pack multiple tuples per clause
4437ec681f3Smrg- pan/bi: Add packing unit test group
4447ec681f3Smrg- pan/bi: Test pack_literal
4457ec681f3Smrg- pan/bi: Test pack_upper
4467ec681f3Smrg- pan/bi: Test pack_tuple_bits
4477ec681f3Smrg- pan/bi: Test pack_sync
4487ec681f3Smrg- pan/bi: Add packing format tests
4497ec681f3Smrg- pan/decode: Be explicit when printing invocations
4507ec681f3Smrg- pan/decode: Remove tiler size checks
4517ec681f3Smrg- pan/decode: Remove dependency of decoder on the encoder
4527ec681f3Smrg- pan/decode: Deduplicate SFBD blend printing
4537ec681f3Smrg- pan/decode: Deduplicate shader property printing
4547ec681f3Smrg- pan/decode: Remove unused MEMORY_PROP macro
4557ec681f3Smrg- pan/decode: Simplify tiler printing
4567ec681f3Smrg- pan/decode: Remove pandecode_prop
4577ec681f3Smrg- pan/decode: Remove unused disasm stats
4587ec681f3Smrg- pan/decode: Remove mesa header dependencies
4597ec681f3Smrg- pan/mdg: Drop unused stage parameter to disassembler
4607ec681f3Smrg- pan/decode: Remove tile range validation
4617ec681f3Smrg- pan/decode: Prefer sizeof to ARRAY_SIZE for char
4627ec681f3Smrg- nir/lower_io: Fix grammar errors
4637ec681f3Smrg- pan/bi: Fix NULL deref with empty shader
4647ec681f3Smrg- pan/bi: Add side_effects helper
4657ec681f3Smrg- pan/bi: Respect side effects in DCE
4667ec681f3Smrg- pan/bi: Implement AXCHG
4677ec681f3Smrg- pan/bi: Implement ACMPXCHG
4687ec681f3Smrg- pan/bi: Add bi_fmul_f32 convenience method
4697ec681f3Smrg- pan/bi: Fix FLOG_TABLE modifier handling
4707ec681f3Smrg- pan/bi: Lower frcp to Newton-Raphson
4717ec681f3Smrg- pan/bi: Lower frsq to Newton-Raphson
4727ec681f3Smrg- pan/bi: Lower FEXP2 with a table
4737ec681f3Smrg- pan/bi: Lower flog2 to a table and polynomial
4747ec681f3Smrg- pan/bi: Rename NO_FP32_TRANSCENDENTALS quirk
4757ec681f3Smrg- pan/bi: Fix bi quirks detection
4767ec681f3Smrg- pan/bi: Lower FP32 transcendentals where required
4777ec681f3Smrg- pan/bi: Lower transcendentals on G71
4787ec681f3Smrg- pan/bi: Print program size in shader-db
4797ec681f3Smrg- pan/bi: Cleanup terminal block check
4807ec681f3Smrg- pan/bi: Dead code eliminate per-channel
4817ec681f3Smrg- pan/bi: Include ATEST datum in the instruction
4827ec681f3Smrg- pan/bi: Add scheduler data structures
4837ec681f3Smrg- pan/bi: Add cubeface lowering
4847ec681f3Smrg- pan/bi: Flatten block lists
4857ec681f3Smrg- pan/bi: Stub worklist routines
4867ec681f3Smrg- pan/bi: Add constant count estimates to scheduler
4877ec681f3Smrg- pan/bi: Add FAU update helper
4887ec681f3Smrg- pan/bi: Add bi_tuple_is_new_src
4897ec681f3Smrg- pan/bi: Add bi_count_succ_reads helper
4907ec681f3Smrg- pan/bi: Validate reads_t
4917ec681f3Smrg- pan/bi: Add T0/T1 constraint check
4927ec681f3Smrg- pan/bi: Add writes_reg predicate
4937ec681f3Smrg- pan/bi: Add bi_instr_schedulable predicate
4947ec681f3Smrg- pan/bi: Choose instructions to schedule
4957ec681f3Smrg- pan/bi: Destructively schedule a single instruction
4967ec681f3Smrg- pan/bi: Add passthrough register rewriting helper
4977ec681f3Smrg- pan/bi: Extract bi_ec0_packed helper
4987ec681f3Smrg- pan/bi: Add bi_foreach_instr_and_src_in_tuple
4997ec681f3Smrg- pan/bi: Move bi_constant_field to bifrost.h
5007ec681f3Smrg- pan/bi: Add pcrel_idx to bi_clause
5017ec681f3Smrg- pan/bi: Derive M0 from pcrel_idx while packing
5027ec681f3Smrg- pan/bi: Add trivial rewrite helpers
5037ec681f3Smrg- pan/bi: Add constant to passthrough rewrite
5047ec681f3Smrg- pan/bi: Add constant state constructor
5057ec681f3Smrg- pan/bi: Add constant merging routines
5067ec681f3Smrg- pan/bi: Add constant modifier handling
5077ec681f3Smrg- pan/bi: Schedule blocks
5087ec681f3Smrg- pan/bi: Switch to new scheduler
5097ec681f3Smrg- pan/bi: Remove old FAU assignment code
5107ec681f3Smrg- pan/bi: Remove older cube map lowering
5117ec681f3Smrg- pan/bi: Add nosched debug option
5127ec681f3Smrg- pan/bi: Fix 'last tuple' for terminal-NOP clauses
5137ec681f3Smrg- pan/bi: Fix 2-write pseudo op scheduling
5147ec681f3Smrg- pan/mdg: Fix multithreaded shader-db
5157ec681f3Smrg- pan/mdg: Add MIDGARD_MESA_DEBUG=inorder option
5167ec681f3Smrg- pan/mdg: Optimize UBO offset calculations
5177ec681f3Smrg- pan/mdg: Set lower_uniforms_to_ubo
5187ec681f3Smrg- panfrost: Fix race condition in UBO mapping to CPU
5197ec681f3Smrg- panfrost: Respect buffer_offset when mapping to CPU
5207ec681f3Smrg- panfrost: Move sysvals to dedicated UBO
5217ec681f3Smrg- panfrost: Don't truncate uniform_count
5227ec681f3Smrg- panfrost: Add UBO push data structure
5237ec681f3Smrg- panfrost: Push uniforms required by the program
5247ec681f3Smrg- panfrost: Set FAU count based on program->push
5257ec681f3Smrg- panfrost: Don't store uniform_count on Midgard
5267ec681f3Smrg- pan/mdg: Update UBO promotion comment
5277ec681f3Smrg- pan/mdg: Push uniforms based on UBO analysis
5287ec681f3Smrg- pan/bi: Fix multithreaded shader-db
5297ec681f3Smrg- pan/bi: Add bi_replace_index helper
5307ec681f3Smrg- pan/bi: Add bi_is_ssa helper
5317ec681f3Smrg- pan/bi: Print FAU uniforms in IR
5327ec681f3Smrg- pan/bi: Generalize bi_update_fau with fast zero
5337ec681f3Smrg- pan/bi: Handle modifiers in rewrite_fau_to_pass
5347ec681f3Smrg- pan/bi: Rework FAU lowering
5357ec681f3Smrg- pan/bi: Simplify derivative lowering
5367ec681f3Smrg- pan/bi: Add SSA-based scalar copy propagation
5377ec681f3Smrg- pan/bi: Push UBOs on Bifrost
5387ec681f3Smrg- panfrost: Enable ES3 conformant floating-point
5397ec681f3Smrg- compiler, nir: Add and set barrier metadata
5407ec681f3Smrg- panfrost: Set barriers flag for compute shaders
5417ec681f3Smrg- panfrost: Pass is_blit flag around
5427ec681f3Smrg- pan/bi: Skip ATEST for colour blit shaders
5437ec681f3Smrg- panfrost: Fake shader images for bifrost+deqp
5447ec681f3Smrg- pan/bi: Fix jumps to terminal block again
5457ec681f3Smrg- pan/bi: Fix empty shader handling
5467ec681f3Smrg- nir: Add sample_positions_pan intrinsic
5477ec681f3Smrg- pan/decode: Cleanup sample locations decode
5487ec681f3Smrg- pan/decode: Only print local storage for vertex jobs
5497ec681f3Smrg- panfrost: Preload sample mask if needed
5507ec681f3Smrg- panfrost: Add sample positions sysval
5517ec681f3Smrg- panfrost: Push sample positions sysval for Midgard
5527ec681f3Smrg- panfrost: Refactor sample shading state
5537ec681f3Smrg- panfrost: Respect info.fs.uses_sample_shading
5547ec681f3Smrg- panfrost: Add panfrost_sample_pattern helper
5557ec681f3Smrg- panfrost: Set tiler descriptor sampler pattern
5567ec681f3Smrg- panfrost: Generalize MSAA handling
5577ec681f3Smrg- panfrost: Don't set REQ_MSAA in pan_mfbd
5587ec681f3Smrg- panfrost: Don't use PAN_REQ_MSAA in SFBD
5597ec681f3Smrg- panfrost: Remove PAN_REQ_MSAA
5607ec681f3Smrg- panfrost: Remove PAN_REQ_DEPTH_WRITE
5617ec681f3Smrg- panfrost: Remove batch_is_scanout
5627ec681f3Smrg- panfrost: Set sample count/pattern for tiler FBD
5637ec681f3Smrg- panfrost: Upload sample positions on device init
5647ec681f3Smrg- panfrost: Use sample location LUT
5657ec681f3Smrg- panfrost: Ensure open_device has pandecode initialized
5667ec681f3Smrg- panfrost: Advertise MSAA 8x and 16x
5677ec681f3Smrg- panfrost: Implement get_sample_position
5687ec681f3Smrg- panfrost: Overhaul sysval handling
5697ec681f3Smrg- panfrost: Add MULTISAMPLED sysval
5707ec681f3Smrg- pan/mdg: Assert on bad 64-bit swizzle in disassembly
5717ec681f3Smrg- pan/mdg: Remove unused pack_unorm_4x8 lowering
5727ec681f3Smrg- pan/mdg: Lower bitfield instructions
5737ec681f3Smrg- pan/mdg: Rename bitcount8 to popcnt, fixing the unit
5747ec681f3Smrg- pan/mdg: Implement uclz
5757ec681f3Smrg- pan/mdg: Lower ufind_msb, poorly
5767ec681f3Smrg- pan/mdg: Stub load_barycentric_sample
5777ec681f3Smrg- pan/mdg: Lower stores from helpers
5787ec681f3Smrg- pan/bi: Remove redundant TEXC opcode check
5797ec681f3Smrg- pan/bi: Implement texture gathers
5807ec681f3Smrg- pan/bi: Lower bitfield inserts/extracts
5817ec681f3Smrg- pan/bi: Implement bitfield_reverse
5827ec681f3Smrg- pan/bi: Implement ufind_msb
5837ec681f3Smrg- pan/bi: Lower ifind_msb
5847ec681f3Smrg- pan/bi: Fix gl_SampleID read
5857ec681f3Smrg- pan/bi: Implement load_sample_mask_in
5867ec681f3Smrg- pan/bi: Implement nir_intrinsic_load_sample_positions_pan
5877ec681f3Smrg- pan/bi: Decouple sysval loading from NIR
5887ec681f3Smrg- pan/bi: Implement coverage mask updates
5897ec681f3Smrg- pan/{mdg, bi}: Lower load_helper_invocation
5907ec681f3Smrg- pan/{mdg, bi}: Lower load_sample_pos
5917ec681f3Smrg- panfrost: Simplify bind_compute_state
5927ec681f3Smrg- panfrost: Remove stale TODOs
5937ec681f3Smrg- panfrost: Assert on indirect compute shaders
5947ec681f3Smrg- panfrost: Advertise SAMPLE_SHADING
5957ec681f3Smrg- panfrost: Bump max SSBO count
5967ec681f3Smrg- panfrost: Bump advertised ESSL feature level
5977ec681f3Smrg- panfrost: Advertise OES_standard_derivatives
5987ec681f3Smrg- docs/features: Mark more TBO exts done on panfrost
5997ec681f3Smrg- docs/features: Mark some ES3.1 done on Panfrost
6007ec681f3Smrg- docs/features: Mark sample shading done on Panfrost
6017ec681f3Smrg- docs/features: gl_HelperInvocation on Panfrost
6027ec681f3Smrg- nir: Fix grammar error
6037ec681f3Smrg- panfrost: Fix uniform_count on Midgard
6047ec681f3Smrg- pan/bi: Stub scoreboarding
6057ec681f3Smrg- pan/bi: Implement barriers
6067ec681f3Smrg- pan/decode: Fix tiler printing on Bifrost
6077ec681f3Smrg- pan/decode: Pretty print 22-bit pixel formats
6087ec681f3Smrg- pan/decode: Disambiguate border colours
6097ec681f3Smrg- panfrost: Label groups in GenXML
6107ec681f3Smrg- panfrost: Track coverage, early fragment tests
6117ec681f3Smrg- panfrost: Flesh out pixel kill / zs update
6127ec681f3Smrg- panfrost: Handle PIPE_FORMAT_NONE as 'constant'
6137ec681f3Smrg- panfrost: Flesh out allow_forward_pixel_to_kill check
6147ec681f3Smrg- panfrost: Export bifrost_get_blend_desc with type size
6157ec681f3Smrg- panfrost: Add RT conversion sysval
6167ec681f3Smrg- panfrost: Fix NULL deref for an empty frag shader
6177ec681f3Smrg- panfrost: Spell fix
6187ec681f3Smrg- panfrost: Take panfrost_dev for AFBC selection
6197ec681f3Smrg- panfrost: Set border colour on Bifrost sampler
6207ec681f3Smrg- panfrost: Remove useless check
6217ec681f3Smrg- pan/bi: Fix RA of node 0 again
6227ec681f3Smrg- pan/bi: Don't inline 64-bit constants
6237ec681f3Smrg- pan/bi: Fix LD_GCLK staging count
6247ec681f3Smrg- pan/bi: Don't read alpha out of bounds
6257ec681f3Smrg- pan/bi: Allow @rNULL with tied operands
6267ec681f3Smrg- pan/bi: Add SEG_ADD.i64 pseudoinstruction
6277ec681f3Smrg- pan/bi: Add 32-bit atomic pseudoinstruction
6287ec681f3Smrg- pan/bi: Lower atomic pseudo-instructions
6297ec681f3Smrg- pan/bi: Extract bi_atom_opc from NIR intrinsic
6307ec681f3Smrg- pan/bi: Add ATOM_C1 promotion check
6317ec681f3Smrg- pan/bi: Handle computational atomics
6327ec681f3Smrg- pan/bi: Materialize \*DTSEL_IMM in the scheduler
6337ec681f3Smrg- pan/bi: Implement image_atomic_exchange
6347ec681f3Smrg- pan/bi: Implement image_atomic_comp_swap
6357ec681f3Smrg- pan/bi: Implement shader_clock intrinsic
6367ec681f3Smrg- pan/bi: Fix blend shaders using LD_TILE with MRT
6377ec681f3Smrg- pan/bi: Flesh out LD_TILE emit
6387ec681f3Smrg- pan/bi: Elucidate slot 6/7 operation
6397ec681f3Smrg- pan/bi: Preload sample ID for sample shading
6407ec681f3Smrg- pan/bi: Implement interpolateAtSample
6417ec681f3Smrg- pan/bi: Add imm_f16 helper
6427ec681f3Smrg- pan/bi: Implement interpolateAtOffset
6437ec681f3Smrg- pan/bi: Allow dynamically uniform tex indices
6447ec681f3Smrg- pan/bi: Use explicit move even for RT#0 of MRT
6457ec681f3Smrg- panfrost: Comment on state of ARB_shader_clock
6467ec681f3Smrg- panfrost: Advertise FRAMEBUFFER_NO_ATTACHMENTS
6477ec681f3Smrg- docs/features: Mark atomics/images done on Bifrost
6487ec681f3Smrg- panfrost/ci: Mark flaky test
6497ec681f3Smrg- gallium/tessellator: Remove XBOX 360 code
6507ec681f3Smrg- gallium/tessellator: Remove unused includes
6517ec681f3Smrg- gallium/tessellator: Rename D3D11 defines
6527ec681f3Smrg- pan/bi: Fix more jumps to terminal blocks
6537ec681f3Smrg- pan/bi: Optimize out redundant jumps to #0x0
6547ec681f3Smrg- pan/bi: Fix elimination of repeated branches
6557ec681f3Smrg- panfrost: Fix infinite loop spilling
6567ec681f3Smrg- panfrost: Fix NULL dereference adding cbuf to batch
6577ec681f3Smrg- panfrost: Remove redundant NULL check
6587ec681f3Smrg- panfrost: Fix NULL deref in pan_mfbd.c
6597ec681f3Smrg- panfrost: Fix NULL derefs in pan_cmdstream.c
6607ec681f3Smrg- panfrost: Fix NULL deref in pan_sfbd
6617ec681f3Smrg- panfrost: Raise TEXTURE_BUFFER_OFFSET_ALIGNMENT
6627ec681f3Smrg- panfrost: Hide MSAA 8x/16x support
6637ec681f3Smrg- panfrost: Fix UNORM 16 rendering
6647ec681f3Smrg- panfrost: Reinterpret format for reload blits
6657ec681f3Smrg- panfrost: Fix typo in midgard.xml
6667ec681f3Smrg- panfrost: Don't advertise OES_copy_image
6677ec681f3Smrg- pan/bi: Use nir_opt_sink/move for constants
6687ec681f3Smrg- pan/bi: Reduce liveness calculations in DCE
6697ec681f3Smrg- pan/bi: Inline \`bytemask of read components`
6707ec681f3Smrg- pan/bi: Mark branches as having side effects
6717ec681f3Smrg- pan/bi: Mark DISCARD as having side effects
6727ec681f3Smrg- pan/bi: Make bi_writemask take a destination
6737ec681f3Smrg- pan/bi: Allow spilling with multiple destinations
6747ec681f3Smrg- pan/bi: Annotate instructions by destination count
6757ec681f3Smrg- pan/bi: Adapt builder to dest count
6767ec681f3Smrg- pan/bi: Remove unused definitions
6777ec681f3Smrg- pan/bi: Do copyprop in linear-time
6787ec681f3Smrg- panfrost/lcra: Fix constraint counting
6797ec681f3Smrg- pan/bi: Use replace_index in more places
6807ec681f3Smrg- pan/bi: Allow negating constants
6817ec681f3Smrg- pan/bi: Implement fsin/fcos
6827ec681f3Smrg- pan/bi: Mark message-passing sources/dests live
6837ec681f3Smrg- pan/bi: Set clause_state.message conservatively
6847ec681f3Smrg- pan/bi: Treat +DISCARD.f32 as message-passing
6857ec681f3Smrg- nir/lower_viewport_transform: Allow geom/tess
6867ec681f3Smrg- pan/bi: Implement u{add, sub}_sat
6877ec681f3Smrg- nir: Unify memory atomics
6887ec681f3Smrg- meson: Remove kmsro from gallium-drivers
6897ec681f3Smrg- pan/bi: Document register conventions
6907ec681f3Smrg- pan/bi: Add bi_foreach_block_rev
6917ec681f3Smrg- pan/bi: Handle 16-bit blend sr_count
6927ec681f3Smrg- pan/bi: Only run copyprop once
6937ec681f3Smrg- pan/bi: Only run DCE once
6947ec681f3Smrg- pan/bi: Remove TODO: RA warnings
6957ec681f3Smrg- pan/bi: Remove stale todo/assert
6967ec681f3Smrg- panfrost: Deduplicate UBO count assignment
6977ec681f3Smrg- panfrost: Deduplicate Bifrost fau_count
6987ec681f3Smrg- panfrost: Only check blend work count on midgard
6997ec681f3Smrg- vulkan: Deduplicate mesa stage conversion
7007ec681f3Smrg- pan/bi: Enable all nir_opt_move/sink optimizations
7017ec681f3Smrg- pan/bi: Split writemasks for memory stores
7027ec681f3Smrg- pan/bi: Lower large arrays to scratch
7037ec681f3Smrg- pan/bi: Add bi_swz_16 helper
7047ec681f3Smrg- pan/bi: Optimize MKVEC.v2i16 generation
7057ec681f3Smrg- pan/bi: Lower swizzles
7067ec681f3Smrg- pan/bi: Fill in some more conversions
7077ec681f3Smrg- pan/bi: Generalize f2i16, f2u16
7087ec681f3Smrg- pan/bi: Remove conversion lowering
7097ec681f3Smrg- nir: Add nir_type_convert
7107ec681f3Smrg- nir: Add {i2f, u2f, f2i, f2u} helpers
7117ec681f3Smrg- nir/lower_idiv: Convert to lower_instructions
7127ec681f3Smrg- nir/lower_idiv: Factor out numer/denom load
7137ec681f3Smrg- nir/lower_idiv: Add 8-bit and 16-bit lowering path
7147ec681f3Smrg- pan/bi: Determine block successors correctly
7157ec681f3Smrg- panfrost: Fix AFBC body_size for shared resources
7167ec681f3Smrg- panfrost: Enable AFBC buffer sharing
7177ec681f3Smrg- nir: Add varying precision linking helper (v2)
7187ec681f3Smrg- docs: Add some notes on building for macOS
7197ec681f3Smrg- panfrost: Fix formats converting uninit from AFBC
7207ec681f3Smrg
7217ec681f3SmrgAndreas Bergmeier (1):
7227ec681f3Smrg
7237ec681f3Smrg- v3dv: Output a message if file open fails in physical_device_init
7247ec681f3Smrg
7257ec681f3SmrgAndres Gomez (29):
7267ec681f3Smrg
7277ec681f3Smrg- ci: recover tracie dashboard URLs for failing traces
7287ec681f3Smrg- ci: correct the trace image URLs in the piglit summary
7297ec681f3Smrg- ci: make piglit runner less noisy and show a better failure message
7307ec681f3Smrg- ci: clean paths used in the piglit runner
7317ec681f3Smrg- ci: correct piglit's HTML summary location for artifacts upload
7327ec681f3Smrg- ci: make sure piglit's artifacts are not overwritten
7337ec681f3Smrg- ci: correct artifacts location for piglit's runner messages
7347ec681f3Smrg- ci: tracie dashboard URLs only in the failure after the testcase
7357ec681f3Smrg- ci: piglit runner colors diff output on failures
7367ec681f3Smrg- ci: remove pytest since we don't need it any more
7377ec681f3Smrg- ci: only install piglit dependencies when installing piglit
7387ec681f3Smrg- ci: build gfxreconstruct v0.9.5
7397ec681f3Smrg- ci: add libdrm to the x86_test-vk container
7407ec681f3Smrg- .mailmap: colapse duplicates for Timothy Arceri
7417ec681f3Smrg- .mailmap: resolve duplicates for Icecream95
7427ec681f3Smrg- .mailmap: resolve duplicates for Christopher Li
7437ec681f3Smrg- .mailmap: resolve duplicates for Emmanuel Vadot
7447ec681f3Smrg- .mailmap: resolve duplicates for Indrajit Das
7457ec681f3Smrg- .mailmap: resolve duplicates for James Xiong
7467ec681f3Smrg- .mailmap: resolve duplicates for Jan Zielinski
7477ec681f3Smrg- .mailmap: resolve duplicates for Lin Johnson
7487ec681f3Smrg- .mailmap: resolve duplicates for Mark Menzynski
7497ec681f3Smrg- .mailmap: resolve duplicates for Matthias Hopf
7507ec681f3Smrg- .mailmap: resolve duplicates for Matthias Lorenz
7517ec681f3Smrg- .mailmap: resolve duplicates for Maya Rashish
7527ec681f3Smrg- .mailmap: resolve duplicates for Mun Gwan-gyeong
7537ec681f3Smrg- .mailmap: resolve duplicates for Satyeshwar Singh
7547ec681f3Smrg- .mailmap: resolve duplicates for Yogesh Mohan Marimuthu
7557ec681f3Smrg- .mailmap: add an alias for Eleni Maria Stea
7567ec681f3Smrg
7577ec681f3SmrgAndrew McMahon (1):
7587ec681f3Smrg
7597ec681f3Smrg- util: add mesa_glthread for Half Life 2 and Black Mesa.
7607ec681f3Smrg
7617ec681f3SmrgAndrii Simiklit (7):
7627ec681f3Smrg
7637ec681f3Smrg- st/mesa: fix pbo upload/download for arrays of textures with only 1 layer
7647ec681f3Smrg- iris: don't emit IRIS_DIRTY_VF depending on trash in restart_index
7657ec681f3Smrg- mesa: ensure parameter list capacity before associating uniform storage
7667ec681f3Smrg- glsl/linker: Fix xfb stride alignment for buffers containing 64bit types
7677ec681f3Smrg- gitlab-ci: remove fixed tests
7687ec681f3Smrg- spirv: repair ssa defs for switchs with only default case
7697ec681f3Smrg- nir/spirv: remove unused fields from \`vtn_builder`
7707ec681f3Smrg
7717ec681f3SmrgAntonio Caggiano (4):
7727ec681f3Smrg
7737ec681f3Smrg- zink: check shader stencil output
7747ec681f3Smrg- zink: support stencil-export
7757ec681f3Smrg- zink: fix destroy batch
7767ec681f3Smrg- ci: Use lock file to build deqp-runner
7777ec681f3Smrg
7787ec681f3SmrgAnuj Phogat (32):
7797ec681f3Smrg
7807ec681f3Smrg- intel/anv: Fix condition to set MipModeFilter for YUV surface
7817ec681f3Smrg- intel/anv: Fix condition for planar yuv surface
7827ec681f3Smrg- intel: Rename files with gen\_ prefix in common code to intel\_
7837ec681f3Smrg- intel: Rename "gen\_" prefix used in common code to "intel\_"
7847ec681f3Smrg- intel: Fix broken alignment due to gen\_ prefix renaming
7857ec681f3Smrg- intel: Rename "GEN\_" prefix used in common code to "INTEL\_"
7867ec681f3Smrg- i965: Remove blank line at EOF
7877ec681f3Smrg- i965: Rename files with "intel\_" prefix to "brw\_"
7887ec681f3Smrg- intel/isl: Drop intel\_ prefix in function names
7897ec681f3Smrg- anv: Remove redundant #if checks
7907ec681f3Smrg- intel: Remove GEN_IS_HASWELL macro
7917ec681f3Smrg- intel: Simplify version checks involving haswell
7927ec681f3Smrg- intel: Remove GEN_IS_G4X macro
7937ec681f3Smrg- intel: Simplify few version checks involving G4X
7947ec681f3Smrg- intel: Rename GEN_VERSIONx10 macro to GFX_VERx10
7957ec681f3Smrg- intel: Rename GEN_GEN macro to GFX_VER
7967ec681f3Smrg- intel: Rename ISL_DEV_GEN to ISL_GFX_VER
7977ec681f3Smrg- intel: Rename genx10 field in gen_device_info struct to verx10
7987ec681f3Smrg- intel: Rename gen field in gen_device_info struct to ver
7997ec681f3Smrg- intel: Rename genx keyword in filenames to gfxx
8007ec681f3Smrg- intel: Rename GENx prefix in macros to GFXx in build files
8017ec681f3Smrg- intel: Rename GENx prefix in macros to GFXx in source files
8027ec681f3Smrg- intel: Rename genx keyword to gfxx in build files
8037ec681f3Smrg- intel: Rename genx keyword to gfxx in source files
8047ec681f3Smrg- intel: Rename Genx keyword to Gfxx
8057ec681f3Smrg- intel: Rename GENx keyword to GFXx
8067ec681f3Smrg- intel: Rename IS_GEN* macros to IS_GFX_VER*
8077ec681f3Smrg- intel: Make line wrapping changes due to IS_GFX_VER_BETWEEN
8087ec681f3Smrg- intel: Remove unused MAKE_GEN macro
8097ec681f3Smrg- intel: Rename GEN_{ALL, LT, ..} macros to GFX_{ALL, LT, ..}
8107ec681f3Smrg- intel: Rename GEN:BUG:### to Wa_###
8117ec681f3Smrg- intel: Rename WA_### to Wa_###
8127ec681f3Smrg
8137ec681f3SmrgArcady Goldmints-Orlov (14):
8147ec681f3Smrg
8157ec681f3Smrg- v3dv: Fix uninitialized variable warnings
8167ec681f3Smrg- nir: add more intrinsics to divergence analysis
8177ec681f3Smrg- nir: handle v3d intrinsics in divergence analysis
8187ec681f3Smrg- nir: store the results of divergence analysis on loops
8197ec681f3Smrg- broadcom/compiler: Use ANYA for branches in uniform ifs
8207ec681f3Smrg- broadcom/compiler: Emit uniform loops using uniform control flow
8217ec681f3Smrg- broadcom/compiler: Enable PER_QUAD TMU access only in uniform control flow
8227ec681f3Smrg- v3dv: Only lower local arrays of size up to 2 to if-chains
8237ec681f3Smrg- broadcom/compiler: improve generation of if conditions
8247ec681f3Smrg- Revert "broadcom/compiler: improve generation of if conditions"
8257ec681f3Smrg- v3dv: initialize render_fd at the top of physical_device_init
8267ec681f3Smrg- broadcom/compiler: Add a v3d_compile argument to vir_set_[pu]f
8277ec681f3Smrg- broadcom/compiler: Skip bool_to_cond where possible
8287ec681f3Smrg- broadcom/compiler: Merge instructions more efficiently
8297ec681f3Smrg
8307ec681f3SmrgArno Messiaen (1):
8317ec681f3Smrg
8327ec681f3Smrg- lima/ppir: increase usage of pipeline regs
8337ec681f3Smrg
8347ec681f3SmrgAxel Davy (76):
8357ec681f3Smrg
8367ec681f3Smrg- st/nine: Reduce system memory allocated by D3DUSAGE_AUTOGENMIPMAP
8377ec681f3Smrg- st/nine: Do not allow depth buffer render targets
8387ec681f3Smrg- st/nine: Clamp GetAvailableTextureMem
8397ec681f3Smrg- st/nine: Unmap buffers after full unlock
8407ec681f3Smrg- st/nine: Track formats compatible with FETCH4
8417ec681f3Smrg- st/nine: Implement experimental FETCH4
8427ec681f3Smrg- st/nine: Enable DF24 support
8437ec681f3Smrg- st/nine: Add new debug and error checks
8447ec681f3Smrg- st/nine: Refactor ht_guid_delete
8457ec681f3Smrg- st/nine: Protect \*PrivateData also for Volumes
8467ec681f3Smrg- st/nine: Fix leak at device destruction
8477ec681f3Smrg- driconf: Rename csmt_int back to csmt_force
8487ec681f3Smrg- st/nine: Simplify checks for driconf options
8497ec681f3Smrg- st/nine: Add new function to know if we are the worker
8507ec681f3Smrg- st/nine: Add RAM memory manager for textures
8517ec681f3Smrg- st/nine: Use the texture memory helper
8527ec681f3Smrg- st/nine: Control the memfd virtual limit
8537ec681f3Smrg- st/nine: Add driconf option to limit texture memory
8547ec681f3Smrg- st/nine: Set default dynamic_texture_workaround to true
8557ec681f3Smrg- st/nine: Check memfd_create support
8567ec681f3Smrg- st/nine: Fix compilation issue in nine_debug
8577ec681f3Smrg- st/nine: Optimize EndScene
8587ec681f3Smrg- st/nine: Implement SYSTEMMEM buffers same as MANAGED
8597ec681f3Smrg- st/nine: Refactor DrawPrimitiveUp
8607ec681f3Smrg- st/nine: Optimize DrawPrimitiveUp
8617ec681f3Smrg- st/nine: Use correct bind flag at buffer creation
8627ec681f3Smrg- gallium/util: Add new u_box helpers
8637ec681f3Smrg- st/nine: Track pending MANAGED buffer uploads
8647ec681f3Smrg- st/nine: Optimize dynamic systemmem buffers
8657ec681f3Smrg- st/nine: Force DYNAMIC SYSTEMMEM for sw vertex processing
8667ec681f3Smrg- st/nine: Always use DYNAMIC with SYSTEMMEM
8677ec681f3Smrg- st/nine: Use stream_uploader for bad cases of systemmem
8687ec681f3Smrg- st/nine: detect worker threads syncs for systemmem
8697ec681f3Smrg- radeonsi: Limit the size of the in-memory shader cache
8707ec681f3Smrg- radeonsi: fix leak when the in-memory cache is full
8717ec681f3Smrg- st/nine: Disable fpu exceptions during init
8727ec681f3Smrg- st/nine: Fix crash on texture creation failure
8737ec681f3Smrg- st/nine: Fix cubetexture early destruction
8747ec681f3Smrg- st/nine: Add missing breaks
8757ec681f3Smrg- st/nine: Fix invalid NULL check
8767ec681f3Smrg- st/nine: Prevent use after free on dtor
8777ec681f3Smrg- st/nine: Fix reading invalid pointer
8787ec681f3Smrg- st/nine: Fix compilation warnings
8797ec681f3Smrg- st/nine: Fix read outside bounds for some textures
8807ec681f3Smrg- st/nine: Fix value of pipe_draw_info's max_index vertex
8817ec681f3Smrg- st/nine: Prevent negative reference count
8827ec681f3Smrg- st/nine: Improve Surface GetContainer
8837ec681f3Smrg- st/nine: Fix alpha to coverage states
8847ec681f3Smrg- st/nine: Enable multisampling also without depth buffer
8857ec681f3Smrg- st/nine: Handle D3DFMT_NULL multisampling
8867ec681f3Smrg- st/nine: Remove errors on unsupported lock flags
8877ec681f3Smrg- st/nine: Increase number of constants of vs1_sw
8887ec681f3Smrg- st/nine: Ignore swizzle on samplers
8897ec681f3Smrg- st/nine: Clamp max_anisotropy
8907ec681f3Smrg- st/nine: Refuse depth buffers as rendertargets
8917ec681f3Smrg- st/nine: Fix ps ff BLENDTEXTUREALPHA
8927ec681f3Smrg- st/nine: Fix ff has_aNrm computation
8937ec681f3Smrg- st/nine: Catch redundant scissor and viewport settings
8947ec681f3Smrg- st/nine: Pseudo implement set/getClipstatus
8957ec681f3Smrg- st/nine: Improve Reset on Ex devices
8967ec681f3Smrg- st/nine: Pseudo implement Create*Ex functions
8977ec681f3Smrg- st/nine: Complete \*Ex stubs
8987ec681f3Smrg- st/nine: Add logging to Ex function
8997ec681f3Smrg- st/nine: Have NOOVERWRITE win over DISCARD
9007ec681f3Smrg- st/nine: Do not memset buffers twice
9017ec681f3Smrg- st/nine: Add fallback for YUV formats
9027ec681f3Smrg- st/nine: Use PIPE_MAP_ONCE for persistent buffers
9037ec681f3Smrg- st/nine: Disable buffer_upload when csmt is off
9047ec681f3Smrg- st/nine: Allow to override the vram size
9057ec681f3Smrg- st/nine: Make it optional to use a sw renderer
9067ec681f3Smrg- st/nine: Lower texture_memory_limit default
9077ec681f3Smrg- st/nine: Bump num of backbuffers for tearfree thread_submit
9087ec681f3Smrg- st/nine: Improve performance with thread_submit
9097ec681f3Smrg- st/nine: Default thread_submit to true
9107ec681f3Smrg- st/nine: Default tearfree_discard to true
9117ec681f3Smrg- st/nine: Fix compilation error on non-x86 platforms
9127ec681f3Smrg
9137ec681f3SmrgBas Nieuwenhuizen (87):
9147ec681f3Smrg
9157ec681f3Smrg- ac/surface: Fix GFX9 sparse mip info.
9167ec681f3Smrg- radv: Do not use a pipe offset for aliased sparse images.
9177ec681f3Smrg- radv: Add a trivial implementation of VK_KHR_deferred_host_operation
9187ec681f3Smrg- radv: Use stricter HW resolve swizzle compat check.
9197ec681f3Smrg- radv: Expose VK_KHR_workgroup_memory_explicit_layout.
9207ec681f3Smrg- radv: Do not hash vk_object_base in descriptor set layout.
9217ec681f3Smrg- amd/common: Add modifier size helper.
9227ec681f3Smrg- radv: Extract DCC format support handling.
9237ec681f3Smrg- radv: Use the surface offset from ac_surface instead of a plane offset.
9247ec681f3Smrg- radv: Don't relayout images with modifiers.
9257ec681f3Smrg- radv: Add format modifier format queries.
9267ec681f3Smrg- radv: Add drm format modifier queries.
9277ec681f3Smrg- radv: Add image layout with drm format modifiers.
9287ec681f3Smrg- radv: Enable DRM format modifiers on GFX9+.
9297ec681f3Smrg- radv: Enable modifiers with the WSI.
9307ec681f3Smrg- radv: Add modifier fails for CTS bug.
9317ec681f3Smrg- radv: Fix assert.
9327ec681f3Smrg- radv: Implement VK_KHR_zero_initialize_workgroup_memory.
9337ec681f3Smrg- radv: Improve spilling on discrete GPUs.
9347ec681f3Smrg- radv: Fix vram override with fully visible VRAM.
9357ec681f3Smrg- radv: Remove custom icd json generation.
9367ec681f3Smrg- radv: Define supported extensions in C.
9377ec681f3Smrg- radv: Ignore WC flags for VRAM.
9387ec681f3Smrg- radv: Determine swizzles correctly.
9397ec681f3Smrg- radv: Add plane width/height helpers.
9407ec681f3Smrg- radv: Use u_format helpers when possible.
9417ec681f3Smrg- radv: Remove VK_SWIZZLE_*.
9427ec681f3Smrg- radv: Do no use vk_format for getting divisors.
9437ec681f3Smrg- radv: Do not use generated table for plane formats.
9447ec681f3Smrg- radv: Stop checking for MULTIPLANE layout.
9457ec681f3Smrg- radv: Stop using plane_count.
9467ec681f3Smrg- radv: Only support format with a PIPE_FORMAT.
9477ec681f3Smrg- radv: Start using util_format_description for everything.
9487ec681f3Smrg- radv: Remove the format table.
9497ec681f3Smrg- radv: Remove vk_format_has_stencil/depth helpers.
9507ec681f3Smrg- radv: Properly handle modifier import failure.
9517ec681f3Smrg- radv: Do pipe misalignment check per plane.
9527ec681f3Smrg- radv: Don't use dedicated memory info to indicate sharing.
9537ec681f3Smrg- vulkan/device_select: Stop using device properties 2.
9547ec681f3Smrg- amd/common: constify ac_surface_set_umd_metata.
9557ec681f3Smrg- radv: Handle UMD metadata on import.
9567ec681f3Smrg- radv: Use shared code for setting opaque metadata.
9577ec681f3Smrg- amd/common: Add retile map size helper.
9587ec681f3Smrg- radv: Implement initialization of displayable DCC.
9597ec681f3Smrg- radv: Implement displayable DCC retiling.
9607ec681f3Smrg- radv: Add DCC info to the metadata.
9617ec681f3Smrg- radv: Use ac_surface DCC settings for shareable images.
9627ec681f3Smrg- radv: Enable displayable DCC.
9637ec681f3Smrg- radv: Disable displayable DCC for GFX8 properly.
9647ec681f3Smrg- ac/rgp: Only report double the prims per clock on GFX10.
9657ec681f3Smrg- radv: Expose robustBufferAccessUpdateAfterBind correctly.
9667ec681f3Smrg- frontends/va: Use correct size for secondary planes.
9677ec681f3Smrg- radv: Enable linear sampling for depth textures.
9687ec681f3Smrg- radv: Add sam option.
9697ec681f3Smrg- radv: Add nodisplaydcc option.
9707ec681f3Smrg- radv: Use correct DCC compressed block size for sampling.
9717ec681f3Smrg- radv: Dedupe winsyses per device.
9727ec681f3Smrg- radv: Allow extra planes for DCC.
9737ec681f3Smrg- radv: Enable sharing with DCC with modifiers.
9747ec681f3Smrg- radv: Ensure we never decompress or FCE read-only textures.
9757ec681f3Smrg- radv: Allow DCC for images with modifiers that are read-only.
9767ec681f3Smrg- radv: Use 8x8 meta compute workgroups.
9777ec681f3Smrg- radv: Enable DCC for image stores on GFX10.
9787ec681f3Smrg- radv: Only set WRITE_COMPRESS_ENABLE on supported HW.
9797ec681f3Smrg- vulkan: Fix descriptor set creation with zero bindings.
9807ec681f3Smrg- lavapipe: Free sorted descriptor array.
9817ec681f3Smrg- zink: Remove initialization of some arrays
9827ec681f3Smrg- zink: Only set the needed number of scissors.
9837ec681f3Smrg- radv: Flush caches for shader read operations.
9847ec681f3Smrg- nir: Fix shader calls with nir_opt_dead_write_vars.
9857ec681f3Smrg- nir: Extract shader_info->cs.shared_size out of union.
9867ec681f3Smrg- nir: Remove nir_shader->shared_size.
9877ec681f3Smrg- nir: Do not reset shared_size in nir_lower_io.
9887ec681f3Smrg- radv: Support DCC without a fast clear value.
9897ec681f3Smrg- radv: Support DCC without DCC/FCE predicates.
9907ec681f3Smrg- radv: Add retiling for foreign queues.
9917ec681f3Smrg- radv: Support DCC modifiers fully.
9927ec681f3Smrg- radv: Add clang-format for AMD code.
9937ec681f3Smrg- radv: Format.
9947ec681f3Smrg- radv: Update editorconfig.
9957ec681f3Smrg- radv: Re-enable retiling.
9967ec681f3Smrg- radv: Refactor cs_domain to be a winsys function.
9977ec681f3Smrg- radv: Use VRAM cmdbuffers in more situations.
9987ec681f3Smrg- radv/winsys: Remove use_local_bos
9997ec681f3Smrg- radv: Fix memory leak on descriptor pool reset with  layout_size=0.
10007ec681f3Smrg- amd/common: Use cap to test kernel modifier support.
10017ec681f3Smrg- radv: Only require DRM 3.23.
10027ec681f3Smrg
10037ec681f3SmrgBastian Beranek (1):
10047ec681f3Smrg
10057ec681f3Smrg- glx: Assign unique serial number to GLXBadFBConfig error
10067ec681f3Smrg
10077ec681f3SmrgBen Niu (1):
10087ec681f3Smrg
10097ec681f3Smrg- util: When building 'ARM64EC', don't use x64 intrinsics which need to be emulated
10107ec681f3Smrg
10117ec681f3SmrgBenjamin Tissoires (3):
10127ec681f3Smrg
10137ec681f3Smrg- CI: windows: augment the timeout of building the windows container
10147ec681f3Smrg- CI: windows: split the layers to meet new registry requirements
10157ec681f3Smrg- CI: windows: Force using LLVM 12
10167ec681f3Smrg
10177ec681f3SmrgBillKristiansen (1):
10187ec681f3Smrg
10197ec681f3Smrg- d3d12: fix for upside-down multisample stencil blit
10207ec681f3Smrg
10217ec681f3SmrgBoris Brezillon (91):
10227ec681f3Smrg
10237ec681f3Smrg- panfrost: Don't skip the test with a 4k shader
10247ec681f3Smrg- panfrost: Fix tiler job injection (again)
10257ec681f3Smrg- panfrost: Get rid of IS_BIFROST
10267ec681f3Smrg- panfrost: Don't memset the last attribute buffer entry twice
10277ec681f3Smrg- panfrost: Only allocate the extra attribute buffer entry on Bifrost
10287ec681f3Smrg- panfrost: Set attribs and attrib_bufs to NULL when attrib_count = 0
10297ec681f3Smrg- panfrost: Rename and move pan_render_condition_check()
10307ec681f3Smrg- panfrost: Use dev->arch where appropriate
10317ec681f3Smrg- panfrost: Add a panfrost_compile_shader() helper
10327ec681f3Smrg- panfrost: Update ctx->batch when a fresh batch is requested
10337ec681f3Smrg- panfrost: Fix a polygon list corruption in the multi-context case
10347ec681f3Smrg- panfrost: Don't add the tiler BO when it's not accessed
10357ec681f3Smrg- pan/bi: Add an is_terminal_block() helper
10367ec681f3Smrg- pan/bi: Make sure we never branch to an non-existing clause
10377ec681f3Smrg- pan/bi: Add uclz() support
10387ec681f3Smrg- pan/bi: Support bit_count()
10397ec681f3Smrg- panfrost: Use panfrost_get_shader_options() in panfrost_build_blit_shader()
10407ec681f3Smrg- panfrost: Hide backend compiler internals
10417ec681f3Smrg- panfrost: Prefix shader related helpers with pan_shader\_
10427ec681f3Smrg- panfrost: Move sysval_to_id out of panfrost_sysvals
10437ec681f3Smrg- panfrost: Keep the compiler inputs in the context
10447ec681f3Smrg- panfrost: Move the shader compilation logic out of the gallium driver
10457ec681f3Smrg- panfrost: Provide a helper to prepare the shader related parts of an RSD
10467ec681f3Smrg- panfrost: Use the pan_shader_prepare_rsd() helper
10477ec681f3Smrg- panfrost: Rename pan_blend.h into pan_blend_cso.h
10487ec681f3Smrg- panfrost: Move the blend lowering code out of the gallium driver
10497ec681f3Smrg- panfrost: Move the blend logic out of the gallium driver
10507ec681f3Smrg- Revert "pan/bi: Optimize out redundant jumps to #0x0"
10517ec681f3Smrg- pan/bi: Move int64 lowering before idiv lowering
10527ec681f3Smrg- panfrost: Split the direct and indirect draw logic
10537ec681f3Smrg- panfrost: Add a parameter to suppress next job prefetching
10547ec681f3Smrg- panfrost: Allow passing an explicit global dependency when queuing a job
10557ec681f3Smrg- panfrost: Add a pan_section_offset() helper
10567ec681f3Smrg- panfrost: Move pan_special_varying definition to pan_encoder.h
10577ec681f3Smrg- pan/bi: Extend the bi_builder to support type variants correctly
10587ec681f3Smrg- panfrost: Add a knob to disable the UBO -> push constants optimization
10597ec681f3Smrg- panfrost: Allow passing an explicit UBO index for the sysval UBO
10607ec681f3Smrg- panfrost: Print the correct UBO size when dumping UBO information
10617ec681f3Smrg- panfrost: Don't count the special vertex/instance ID attributes on Bifrost
10627ec681f3Smrg- panfrost: Split the sampler and texture count
10637ec681f3Smrg- panfrost: Expose panfrost_modifier_to_layout()
10647ec681f3Smrg- pan/gen_pack: Parse alignment requirements
10657ec681f3Smrg- panfrost: Specify descriptor alignment requirements
10667ec681f3Smrg- panfrost: Provide various helpers to simplify descriptor allocation
10677ec681f3Smrg- panfrost: Define the Surface and Surface-with-stride descriptors
10687ec681f3Smrg- panfrost: Emit surface descriptors with pan_pack()
10697ec681f3Smrg- panfrost: Use the descriptor allocators where appropriate
10707ec681f3Smrg- panfrost: Get rid of panfrost_pool_alloc()
10717ec681f3Smrg- panfrost: Move the blend shader cache at the device level
10727ec681f3Smrg- panfrost: Use the blend shader cache attached to the device
10737ec681f3Smrg- panfrost: Don't reserve space in the color buffer for disabled RTs
10747ec681f3Smrg- panfrost: Skip disabled RTs when selecting a RT for transaction elimination
10757ec681f3Smrg- panfrost: Stop including pan_device.h from pan_bo.h
10767ec681f3Smrg- panfrost: Add helpers to support indirect draws
10777ec681f3Smrg- panfrost: Prepare things for indirect draws
10787ec681f3Smrg- panfrost: Hook up indirect draw support
10797ec681f3Smrg- panfrost: s/panfrost_slice/pan_image_slice_layout/
10807ec681f3Smrg- panfrost: Move image states out of pan_image_layout
10817ec681f3Smrg- panfrost: Add a format field to pan_image_layout
10827ec681f3Smrg- panfrost: Stop passing a depth > 1 when creating 2D textures
10837ec681f3Smrg- panfrost: Add extra info to the pan_image_layout struct
10847ec681f3Smrg- panfrost: Split pan_image in two
10857ec681f3Smrg- panfrost: Add an offset field so we can attach a sub-buffer to an image
10867ec681f3Smrg- panfrost: Move out-of-band CRC info to pan_image
10877ec681f3Smrg- panfrost: Move special Z32_S8X24 case out of panfrost_setup_layout()
10887ec681f3Smrg- panfrost: Add a pan_image_layout_init() helper
10897ec681f3Smrg- panfrost: Patch the gallium driver to use pan_image_layout_init()
10907ec681f3Smrg- panfrost: Pass an image view to panfrost_new_texture()
10917ec681f3Smrg- panfrost: Provide a helper to calculate the polygon list size
10927ec681f3Smrg- panfrost: Provide a helper to retrieve image surface pointers
10937ec681f3Smrg- panfrost: Pass a const device to panfrost_sample_positions()
10947ec681f3Smrg- pan/midg: Use the sampler index passed to the texture instruction
10957ec681f3Smrg- panfrost: Add various helpers to simplify FB desc emission
10967ec681f3Smrg- panfrost: Add an helper to emit fragment jobs
10977ec681f3Smrg- panfrost: Add align info to the draw and draw padding definitions
10987ec681f3Smrg- panfrost: Add the early ZS pre frame mode
10997ec681f3Smrg- panfrost: s/pandecode_vertex_tiler_postfix_pre/pandecode_dcd/
11007ec681f3Smrg- panfrost: Decode pre/post frame DCDs
11017ec681f3Smrg- panfrost: Extend pan_fb_info to allow passing a tile enable map
11027ec681f3Smrg- panfrost: Extend pan_fb_info to allow passing pre/post frame DCDs
11037ec681f3Smrg- panfrost: Always pass a non-NULL screen to set_damage_region()
11047ec681f3Smrg- panfrost: Create a blitter library to replace the existing preload helpers
11057ec681f3Smrg- panfrost: Fix partial update
11067ec681f3Smrg- panfrost: Use the generic preload and FB helpers in the gallium driver
11077ec681f3Smrg- panfrost: Kill the old tile-buffer preload logic
11087ec681f3Smrg- panfrost: Pass a tile enable map to avoid reloading untouched tiles
11097ec681f3Smrg- panfrost: Fix pan_blitter_get_blit_shader()
11107ec681f3Smrg- panfrost: Don't advertise AFBC mods when the format is not supported
11117ec681f3Smrg- panfrost: Reserve thread storage descriptor in panfrost_launch_grid()
11127ec681f3Smrg- panfrost: Fix indirect draws
11137ec681f3Smrg- panfrost: Fix ZS reloading on Bifrost v6
11147ec681f3Smrg
11157ec681f3SmrgBoyuan Zhang (2):
11167ec681f3Smrg
11177ec681f3Smrg- frontend/va/image: add pipe flush for vlVaPutImage
11187ec681f3Smrg- frontends/omx/h265: search entire dpb list
11197ec681f3Smrg
11207ec681f3SmrgCaio Marcelo de Oliveira Filho (43):
11217ec681f3Smrg
11227ec681f3Smrg- intel/fs: Separate SLM size calculation from encoding
11237ec681f3Smrg- nir: Add a data pointer to the callback in nir_remove_dead_variables
11247ec681f3Smrg- spirv: Don't remove variables used by resource indexing intrinsics
11257ec681f3Smrg- nir/linking: Remove system_value handling from helper
11267ec681f3Smrg- compiler: Use util/bitset.h for system_values_read
11277ec681f3Smrg- ci: Add nouveau chipset 162 to shader-db runs
11287ec681f3Smrg- vulkan: Update XML and headers to 1.2.168
11297ec681f3Smrg- spirv: Update headers and metadata from latest Khronos commit
11307ec681f3Smrg- nir: Two shared memory \*blocks* may alias each other
11317ec681f3Smrg- spirv: Implement SPV_KHR_workgroup_memory_explicit_layout
11327ec681f3Smrg- anv: Implement VK_KHR_workgroup_memory_explicit_layout
11337ec681f3Smrg- spirv: Don't bother counting num_images/num_textures
11347ec681f3Smrg- spirv: Don't remove dead variables in \`create_library` mode
11357ec681f3Smrg- spirv: Store SPIR-V version of the module
11367ec681f3Smrg- spirv: Refactor variable initializer code
11377ec681f3Smrg- spirv: Recognize zero initializers in Workgroup variables
11387ec681f3Smrg- nir: Add nir_zero_initialize_shared_memory
11397ec681f3Smrg- anv: Implement VK_KHR_zero_initialize_workgroup_memory
11407ec681f3Smrg- spirv: Fail when parsing invalid Initializers
11417ec681f3Smrg- spirv: Use OpEntryPoint to identify valid I/O variables
11427ec681f3Smrg- spirv: Count variables \*after* unused ones are removed
11437ec681f3Smrg- spirv: Skip creating unused variables in SPIR-V >= 1.4
11447ec681f3Smrg- spirv: Allow variable pointers pointing to an array of blocks
11457ec681f3Smrg- intel/compiler: Use gl_varying_slot_name_for_stage()
11467ec681f3Smrg- freedreno/ir3: Use gl_varying_slot_name_for_stage()
11477ec681f3Smrg- etnaviv: Use gl_varying_slot_name_for_stage()
11487ec681f3Smrg- st/atifs: Use gl_varying_slot_name_for_stage()
11497ec681f3Smrg- compiler: Drop now unused gl_varying_slot_name()
11507ec681f3Smrg- spirv: Reuse nir_is_per_vertex_io()
11517ec681f3Smrg- spirv: Explicitly break when finished handling SpvDecorationBuiltIn
11527ec681f3Smrg- spirv: Update a couple of comments in variable handling
11537ec681f3Smrg- anv: Lower ViewIndex to zero when multiview is disabled
11547ec681f3Smrg- spirv: Update headers and metadata from latest Khronos commit
11557ec681f3Smrg- nir: Handle deref_atomic_fadd in a couple of passes
11567ec681f3Smrg- intel/compiler: Make vue_map parameter const for brw_compile_fs
11577ec681f3Smrg- intel/compiler: Use a struct for brw_compile_fs parameters
11587ec681f3Smrg- intel/compiler: Use a struct for brw_compile_vs parameters
11597ec681f3Smrg- intel/compiler: Refactor the shader INTEL_DEBUG checks
11607ec681f3Smrg- intel/compiler: Make brw_postprocess_nir take debug_enabled as a parameter
11617ec681f3Smrg- intel/compiler: Make vec4 generator take debug_enabled as a parameter
11627ec681f3Smrg- intel/compiler: Make visitors take debug_enabled as a parameter
11637ec681f3Smrg- intel/compiler: Use INTEL_DEBUG=blorp to dump blorp shaders
11647ec681f3Smrg- intel/compiler: Use a struct for brw_compile_cs parameters
11657ec681f3Smrg
11667ec681f3SmrgChad Versace (30):
11677ec681f3Smrg
11687ec681f3Smrg- anv/image: Replace bo_is_owned with from_gralloc (v2)
11697ec681f3Smrg- anv/image: Rename anv_image_plane::surface -> primary_surface
11707ec681f3Smrg- anv/image: Move vkGetImageMemoryRequirements
11717ec681f3Smrg- anv/image: Drop duplicate 'format' in anv_image_create()
11727ec681f3Smrg- anv/image: Fix interpretation of 'disjoint'
11737ec681f3Smrg- anv/android: Fix size check for imported gralloc bo
11747ec681f3Smrg- anv: Add anv_surface_is_valid()
11757ec681f3Smrg- anv/image: Clean up anv_GetImageMemoryRequirements2
11767ec681f3Smrg- anv: Refactor anv_image_get_compression_state_addr
11777ec681f3Smrg- anv/image: Add anv_image_address()
11787ec681f3Smrg- blorp/gen12: Don't use aux address if implicit CCS
11797ec681f3Smrg- anv/image: Make memory layout more explicit
11807ec681f3Smrg- vulkan: Track dependencies of Python imports
11817ec681f3Smrg- anv/image: Simplify assertions in anv_image_from_swapchain()
11827ec681f3Smrg- anv/image: Fix tiling if VkImageSwapchainCreateInfoKHR
11837ec681f3Smrg- anv/image: In vkCreateDmaBufImageINTEL use modifiers
11847ec681f3Smrg- anv/image: Check that anv_image is compatible with its modifier
11857ec681f3Smrg- anv/image: Refactor check_memory_bindings()
11867ec681f3Smrg- anv/image: Fix cleanup of failed image creation
11877ec681f3Smrg- anv/image: Add ANV_IMAGE_MEMORY_BINDING_PRIVATE
11887ec681f3Smrg- anv/image: Fix Vk*ImagePlaneMemory*Info for modifier images
11897ec681f3Smrg- anv: Move assert in vkGetImageSubresourceLayout
11907ec681f3Smrg- anv/image: Fix vkGetImageSubresourceLayout for modifier images
11917ec681f3Smrg- anv: Implement image acquire/release of modifier images
11927ec681f3Smrg- anv: Declare anv_layout_to_* as pure functions
11937ec681f3Smrg- anv/image: Add 'offset' param to add_surface()
11947ec681f3Smrg- anv/image: Support VkImageDrmFormatModifierExplicitCreateInfoEXT
11957ec681f3Smrg- anv: Enable VK_EXT_image_drm_format_modifier
11967ec681f3Smrg- anv: Remove vkCreateDmaBufINTEL (v4)
11977ec681f3Smrg- anv: Drop unused anv_image_create_info::stride
11987ec681f3Smrg
11997ec681f3SmrgCharmaine Lee (1):
12007ec681f3Smrg
12017ec681f3Smrg- gallivm: increase size of texture target enum bitfield
12027ec681f3Smrg
12037ec681f3SmrgChia-I Wu (38):
12047ec681f3Smrg
12057ec681f3Smrg- virgl: update headers
12067ec681f3Smrg- virgl: add support for VIRGL_CAP_V2_UNTYPED_RESOURCE
12077ec681f3Smrg- targets/libgl-xlib: add support for virgl
12087ec681f3Smrg- virgl: update headers from virglrenderer
12097ec681f3Smrg- venus: add driver skeleton
12107ec681f3Smrg- venus: add generated venus-protocol headers
12117ec681f3Smrg- venus: add experimental renderers
12127ec681f3Smrg- venus: add a CS encoder/decoder
12137ec681f3Smrg- venus: add a ring buffer
12147ec681f3Smrg- venus: initial support for vkCreateInstance
12157ec681f3Smrg- venus: initial support for VkPhysicalDevice commands
12167ec681f3Smrg- venus: initial support for VkDevice commands
12177ec681f3Smrg- venus: initial support for queue/fence/semaphore
12187ec681f3Smrg- venus: initial support for VkDeviceMemory commands
12197ec681f3Smrg- venus: initial support for buffers/images/samplers
12207ec681f3Smrg- venus: initial support for descriptor sets
12217ec681f3Smrg- venus: initial support for render pass and fb
12227ec681f3Smrg- venus: initial support for events and queries
12237ec681f3Smrg- venus: initial support for module and pipelines
12247ec681f3Smrg- venus: initial support for command buffers
12257ec681f3Smrg- venus: advertise extensions promoted to 1.1
12267ec681f3Smrg- venus: advertise extensions promoted to 1.2
12277ec681f3Smrg- venus: initial support for transform feedback
12287ec681f3Smrg- venus: initial support for WSI
12297ec681f3Smrg- venus: update venus-protocol headers
12307ec681f3Smrg- venus: prepare for splitting vn_device.[ch]
12317ec681f3Smrg- venus: split out vn_command_buffer.[ch]
12327ec681f3Smrg- venus: split out vn_pipeline.[ch]
12337ec681f3Smrg- venus: split out vn_query_pool.[ch]
12347ec681f3Smrg- venus: split out vn_render_pass.[ch]
12357ec681f3Smrg- venus: split out vn_descriptor_set.[ch]
12367ec681f3Smrg- venus: split out vn_buffer.[ch]
12377ec681f3Smrg- venus: split out vn_image.[ch]
12387ec681f3Smrg- venus: split out vn_device_memory.[ch]
12397ec681f3Smrg- venus: split out vn_queue.[ch]
12407ec681f3Smrg- venus: include individual venus-protcol headers
12417ec681f3Smrg- ci: enable venus in some meson build jobs
12427ec681f3Smrg- venus: check vn_renderer_info::vk_xml_version
12437ec681f3Smrg
12447ec681f3SmrgChristian Gmeiner (26):
12457ec681f3Smrg
12467ec681f3Smrg- etnaviv: handle NULL views in set_sampler_views
12477ec681f3Smrg- vc4: add drm-shim
12487ec681f3Smrg- ci: Update baremetal kernel to 5.11 plus patches
12497ec681f3Smrg- nir: add load_texture_rect_scaling
12507ec681f3Smrg- nir: add has_txs flag
12517ec681f3Smrg- nir/lower_tex: 'txs free' tex_rect lowering
12527ec681f3Smrg- nir/lower_tex: wider usage of nir_tex_instr_src_index(..)
12537ec681f3Smrg- gallium: add PIPE_CAP_TEXRECT
12547ec681f3Smrg- gallium/st: lower rectangle textures if not supported
12557ec681f3Smrg- ttn: lower rectangle textures if not supported
12567ec681f3Smrg- etnaviv: nir: support nir_intrinsic_load_texture_rect_scaling
12577ec681f3Smrg- etnaviv: let st lower rect tex
12587ec681f3Smrg- vc4: let st lower rect tex
12597ec681f3Smrg- etnaviv: nir: add ubo lowering pass
12607ec681f3Smrg- etnaviv: use nir_lower_uniforms_to_ubo(..)
12617ec681f3Smrg- etnaviv: fix etna_nir_lower_ubo_to_uniform pass
12627ec681f3Smrg- etnaviv: extend lower ubo tests
12637ec681f3Smrg- gallium: call util_cpu_detect()
12647ec681f3Smrg- etnaviv: use nir_lower_idiv(..) before opt loop
12657ec681f3Smrg- ci/bare-metal: fix fastboot
12667ec681f3Smrg- etnaviv: etnaviv: put sampler limit determination into own function
12677ec681f3Smrg- etnaviv: factor out TS state emitting
12687ec681f3Smrg- etnaviv: add support for NTE
12697ec681f3Smrg- etnaviv: rename struct members
12707ec681f3Smrg- ci/bare-metal: no need to use tee
12717ec681f3Smrg- etnaviv: tell the truth if alpha-test is supported
12727ec681f3Smrg
12737ec681f3SmrgConnor Abbott (61):
12747ec681f3Smrg
12757ec681f3Smrg- nir/lower_tex: Handle sized tex destination types
12767ec681f3Smrg- freedreno/ir3: Handle sized tex destination types
12777ec681f3Smrg- ntt: Handle sized tex destination types
12787ec681f3Smrg- nir/lower_bool: Rewrite dest_type for boolean destinations
12797ec681f3Smrg- brw/vec4: Don't convert tex dest type to glsl_type
12807ec681f3Smrg- radv/meta: Use sized types for nir_tex_instr::dest_type
12817ec681f3Smrg- v3dv/meta: Use sized types for nir_tex_instr::dest_type
12827ec681f3Smrg- intel/blorp: Use sized types for nir_tex_instr::dest_type
12837ec681f3Smrg- anv: Use sized types for nir_tex_instr::dest_type
12847ec681f3Smrg- dxil: Use sized types for nir_tex_instr::dest_type
12857ec681f3Smrg- panfrost/blit: Use sized types for nir_tex_instr::dest_type
12867ec681f3Smrg- d3d12/blit: Use sized types for nir_tex_instr::dest_type
12877ec681f3Smrg- nir: Use sized types for nir_tex_instr::dest_type
12887ec681f3Smrg- st/mesa: Use sized types for nir_tex_instr::dest_type
12897ec681f3Smrg- gallium/nir: Use sized types for nir_tex_instr::dest_type
12907ec681f3Smrg- ttn: Use sized types for nir_tex_instr::dest_type
12917ec681f3Smrg- st/atifs: Use sized types for nir_tex_instr::dest_type
12927ec681f3Smrg- glsl/nir: Use sized types for nir_tex_instr::dest_type
12937ec681f3Smrg- vtn: Use sized types for nir_tex_instr::dest_type
12947ec681f3Smrg- ptn: Use sized types for nir_tex_instr::dest_type
12957ec681f3Smrg- nir: Validate nir_tex_instr::dest_type bitsize
12967ec681f3Smrg- nir/lower_tex: Assume that nir_tex_instr::dest_type is sized
12977ec681f3Smrg- panfrost: Assume that nir_tex_instr::dest_type is sized
12987ec681f3Smrg- ir3: Assume that nir_tex_instr::dest_type is sized
12997ec681f3Smrg- ntt: Assume that nir_tex_instr::dest_type is sized
13007ec681f3Smrg- freedreno/a6xx: Document threadsize-related fields
13017ec681f3Smrg- freedreno/cffdec: Use rb trees for tracking buffers
13027ec681f3Smrg- ir3/parser: Fix parsing of "0.0" in @const line
13037ec681f3Smrg- freedreno/computerator: Fix example assembly
13047ec681f3Smrg- ir3/parser: Support labels
13057ec681f3Smrg- ir3/parser: Add ability to specify branchstack
13067ec681f3Smrg- freedreno/computerator: Add branching example
13077ec681f3Smrg- freedreno/computerator: Fix thrsz type
13087ec681f3Smrg- freedreno/a6xx: Fix compute threadsize type
13097ec681f3Smrg- freedreno/registers: Handle typed registers with fields
13107ec681f3Smrg- freedreno/a6xx: Cleanup SP_XS_CTRL_REG0 definitions
13117ec681f3Smrg- freedreno: Add local_size to ir3_shader_variant
13127ec681f3Smrg- ir3: Calcuate max_waves and threadsize
13137ec681f3Smrg- turnip: Use threadsize calculated by ir3
13147ec681f3Smrg- freedreno: Use threadsize calculated by ir3
13157ec681f3Smrg- freedreno/computerator: Use threadsize calculated by ir3
13167ec681f3Smrg- freedreno: Report max_waves in shaderdb output
13177ec681f3Smrg- freedreno/computerator: Add script for finding reg file size
13187ec681f3Smrg- util/bitset: Avoid out-of-bounds reads
13197ec681f3Smrg- freedreno/a3xx: Fix SP_FS_CTRL_REG1_INITIALOUTSTANDING
13207ec681f3Smrg- ir3/legalize: Fix last input (ss) insertion
13217ec681f3Smrg- ir3: Fix valid flags for STIB
13227ec681f3Smrg- ir3/cp_postsched: Set address of uses for relative mov's
13237ec681f3Smrg- ir3: Don't copy propagate arrays in ir3_cp
13247ec681f3Smrg- ir3/postsched: Make sure to schedule inputs before kill
13257ec681f3Smrg- vtn: Handle ZeroExtend/SignExtend image operands
13267ec681f3Smrg- tu: Expose VK_KHR_spirv_1_4 and VK_EXT_scalar_block_layout
13277ec681f3Smrg- tu: Expose VK_KHR_relaxed_block_layout
13287ec681f3Smrg- ir3/sched: Don't penalize uses of already-waited tex/SFU
13297ec681f3Smrg- ir3/sched: Don't schedule too many tex/SFU instructions
13307ec681f3Smrg- ir3: Fix list corruption in legalize_block()
13317ec681f3Smrg- tu: Correctly preserve old push descriptor contents
13327ec681f3Smrg- ir3: Prevent oob writes to inputs/outputs array
13337ec681f3Smrg- nir/lower_clip_disable: Fix store writemask
13347ec681f3Smrg- tu: Fix SP_GS_PRIM_SIZE for large sizes
13357ec681f3Smrg- ir3/postsched: Fix dependencies for a0.x/p0.x
13367ec681f3Smrg
13377ec681f3SmrgCorentin Noël (1):
13387ec681f3Smrg
13397ec681f3Smrg- ci: Use lavacli from master
13407ec681f3Smrg
13417ec681f3SmrgDaniel Schürmann (67):
13427ec681f3Smrg
13437ec681f3Smrg- aco: fix VOP3P assembly, VN and validation
13447ec681f3Smrg- aco/RA: fix subdword operands on VOP3P instructions
13457ec681f3Smrg- aco: allow constants/literals on every src position for VOP3P
13467ec681f3Smrg- aco: allow SGPRs on every src position for VOP3P
13477ec681f3Smrg- aco: change usesModifiers() considering opsel_hi on packed instructions
13487ec681f3Smrg- aco: create helpers to emit vop3p instructions
13497ec681f3Smrg- aco: emit packed 16bit instructions
13507ec681f3Smrg- radv: vectorize 16bit instructions
13517ec681f3Smrg- aco: simplify multiply-add combining
13527ec681f3Smrg- aco: optimize packed mul+add to v_pk_fma_f16
13537ec681f3Smrg- aco: optimize packed clamp
13547ec681f3Smrg- aco: optimize packed fneg
13557ec681f3Smrg- aco: optimize v_pk_fma_f16 -> v_pk_fmac_f16 on GFX10
13567ec681f3Smrg- aco: propagate swizzles when optimizing packed clamp & fma
13577ec681f3Smrg- aco: remove divergent branches which only jump over very few instructions
13587ec681f3Smrg- aco/optimizer: don't copy-prop logical phis
13597ec681f3Smrg- aco/optimizer: don't propagate subdword temps of different size
13607ec681f3Smrg- aco: generalize subdword constant copy lowering
13617ec681f3Smrg- aco/validate: validate that p_create_vector operands are aligned unless they are subdword operands
13627ec681f3Smrg- aco/validate: ensure that Operand and Definition size matches for parallelcopies
13637ec681f3Smrg- aco/validate: relax subdword restrictions
13647ec681f3Smrg- aco: propagate temporaries into PSEUDO instructions if it can take it
13657ec681f3Smrg- aco/optimizer: expand subdword vectors with SGPRs on all generations
13667ec681f3Smrg- aco/optimizer: convert extract_vector with index 0 into parallelcopies if possible
13677ec681f3Smrg- radv: don't vectorize shift operations
13687ec681f3Smrg- aco: fix VCC hint on boolean subgroup operations
13697ec681f3Smrg- aco: fix nir_intrinsic_ballot with wave32
13707ec681f3Smrg- aco: fix shared VGPR allocation on RDNA2
13717ec681f3Smrg- aco: change gpr_alloc_granule to full alignment
13727ec681f3Smrg- aco: refactor GPR limit calculation
13737ec681f3Smrg- aco: don't decrease the vgpr_limit when encountering bpermute
13747ec681f3Smrg- aco: also consider VCC in get_reg_specified()
13757ec681f3Smrg- aco: check get_reg_specified() on register hints
13767ec681f3Smrg- aco: don't abort() if disassembly fails
13777ec681f3Smrg- aco: use VCC as regular SGPR pair on GFX10
13787ec681f3Smrg- aco: don't create unnecessary exec phi on merge blocks
13797ec681f3Smrg- aco: handle non-temp phi definitions and operands
13807ec681f3Smrg- aco: make all exec accesses non-temporaries
13817ec681f3Smrg- aco: remove dead code for the handling of exec temporaries
13827ec681f3Smrg- aco: fix assertion in insert_exec_mask pass
13837ec681f3Smrg- nir: lower load_helper to is_helper if the shader uses demote()
13847ec681f3Smrg- nir: lower is/load_helper to zero if no helper lanes are needed
13857ec681f3Smrg- aco: remove special handling of load_helper_invocation
13867ec681f3Smrg- aco: don't rematerialize exec
13877ec681f3Smrg- aco: value number VOPC instructions with different exec masks
13887ec681f3Smrg- aco/value_numbering: use can_eliminate() function to avoid unnecessary hashmap lookups
13897ec681f3Smrg- aco/optimizer: set VCC hint on new v_cmp_* definitions
13907ec681f3Smrg- aco/ra: allow VCC on SMEM sbase operand on GFX10+
13917ec681f3Smrg- .mailmap: fix email for Daniel Schürmann
13927ec681f3Smrg- aco/ra: split affinity creation into separate function
13937ec681f3Smrg- aco/ra: split register_file initialization into separate function
13947ec681f3Smrg- aco/ra: refactor SSA repairing during register allocation
13957ec681f3Smrg- aco/ra: iterate backwards when coalescing phis
13967ec681f3Smrg- aco/ra: allow m0 in get_reg_specified()
13977ec681f3Smrg- aco/ra: remove exec handling for phis
13987ec681f3Smrg- aco/spill: refactor spill decision taking
13997ec681f3Smrg- aco/spill: reload spilled exec masks directly to exec
14007ec681f3Smrg- aco/spill: spill phi constants and exec directly to VGPR
14017ec681f3Smrg- aco/spill: don't count phis as variable access
14027ec681f3Smrg- aco/spill: refactor some more spill decision taking
14037ec681f3Smrg- aco/spill: refactor live-in registerDemand calculation
14047ec681f3Smrg- aco/spill: use correct next_use_distances at loop header
14057ec681f3Smrg- aco: lower p_spill with constants correctly
14067ec681f3Smrg- aco: fix kill flags on phi operands
14077ec681f3Smrg- aco: add new reindex_ssa() pass
14087ec681f3Smrg- aco/cssa: rewrite lower_to_cssa pass
14097ec681f3Smrg- aco/cssa: don't create parallelcopies for constants and exec
14107ec681f3Smrg
14117ec681f3SmrgDaniel Stone (3):
14127ec681f3Smrg
14137ec681f3Smrg- CI: Try really hard to get updated Windows TLS certs
14147ec681f3Smrg- CI: Trigger Windows builds for llvmpipe & Vulkan changes
14157ec681f3Smrg- CI: Change LAVA job visibility
14167ec681f3Smrg
14177ec681f3SmrgDanylo Piliaiev (50):
14187ec681f3Smrg
14197ec681f3Smrg- turnip/ir3: handle image load/stores produced by AtomicLoad/Store
14207ec681f3Smrg- turnip: make GS use correct varyings size from previous stage
14217ec681f3Smrg- ir3: add debug option to override shader assembly
14227ec681f3Smrg- freedreno/ir3/parser: add cat7 support
14237ec681f3Smrg- turnip: don't emit tess consts if they are not used
14247ec681f3Smrg- freedreno: clamp scissor bounds
14257ec681f3Smrg- freedreno/a2xx: fix scissors clamp bounds
14267ec681f3Smrg- turnip: enable inheritedQueries
14277ec681f3Smrg- turnip: consider HW limit on number of views when apply multipos opt
14287ec681f3Smrg- turnip: consider tile_max_h when calculating tiling config
14297ec681f3Smrg- turnip,freedreno/a6xx: tell hw the size of shared mem used by CS
14307ec681f3Smrg- turnip/ir3: check for bindless IBOs in atomic dests fixup
14317ec681f3Smrg- turnip: fix leak of tu_shader object during compute pipeline creation
14327ec681f3Smrg- ir3: prevent duplication of instruction's dependencies
14337ec681f3Smrg- ir3: make mark_kill_path exit early if instr is already seen
14347ec681f3Smrg- ir3: disallow moving memory writes over discard
14357ec681f3Smrg- freedreno/hw: fix populating branch targets in isa_decode pre-pass
14367ec681f3Smrg- turnip: fix SP_HS_WAVE_INPUT_SIZE value
14377ec681f3Smrg- freedreno/a5xx: port handling of PIPE_BUFFER textures from a6xx
14387ec681f3Smrg- ir3: use OPC_GETBUF to get size of sampler buffers
14397ec681f3Smrg- turnip: lower device index to zero
14407ec681f3Smrg- turnip: fill VkMemoryDedicatedRequirements
14417ec681f3Smrg- turnip: set zmode to A6XX_EARLY_Z if FS forces early fragment test
14427ec681f3Smrg- turnip: implement intrinsic_vulkan_resource_reindex
14437ec681f3Smrg- ci/freedreno: run freedreno jobs on any change in src/freedreno/
14447ec681f3Smrg- ir3: fix oob access to regs array for getbuf,getinfo,rgetinfo
14457ec681f3Smrg- ir3/isa,parser: fix encoding and parsing of bindless s2en SAM
14467ec681f3Smrg- ir3: match mova1 mnemonic when writing to A1
14477ec681f3Smrg- freedreno/isa: assert if field's range is out of bitset's range
14487ec681f3Smrg- ir3: disallow .sat on SEL instructions
14497ec681f3Smrg- ir3: update info about applicability of saturation modifier
14507ec681f3Smrg- turnip: expose several already implemented extensions
14517ec681f3Smrg- nir: add nir_shader_as_str function
14527ec681f3Smrg- turnip: implement VK_KHR_pipeline_executable_properties
14537ec681f3Smrg- turnip: clamp to zero negative upper left corner of viewport
14547ec681f3Smrg- turnip,ir3: account for dispatch group offsets
14557ec681f3Smrg- freedreno/a6xx: copy full 64bit of primitive counter
14567ec681f3Smrg- freedreno/a6xx: fix primitive counters debug output
14577ec681f3Smrg- ir3/isa: account for randomly set by blob lowest bit of ibo atomics
14587ec681f3Smrg- glsl/linker: Fix attempts to split up 64bit varyings between slots
14597ec681f3Smrg- glsl/linker: Fix xfb with explicit locations and 64bit types
14607ec681f3Smrg- ir3: nir_op_f2f16 should round to even
14617ec681f3Smrg- ir3: convert shift amount to 16b for 16b shifts
14627ec681f3Smrg- turnip: enable infinities for f16 math and document the register
14637ec681f3Smrg- turnip: enable VK_KHR_16bit_storage on A650
14647ec681f3Smrg- turnip: handle format list for compressed formats
14657ec681f3Smrg- docs: mark float_controls,float16_int8,16bit_storage as done on Turnip
14667ec681f3Smrg- turnip: fix alignment of non-32b types in workgroup memory
14677ec681f3Smrg- turnip: implement variableMultisampleRate
14687ec681f3Smrg- turnip: support copying both aspects of D32_SFLOAT_S8_UINT
14697ec681f3Smrg
14707ec681f3SmrgDave Airlie (163):
14717ec681f3Smrg
14727ec681f3Smrg- device-select-layer: update for vulkan 1.2
14737ec681f3Smrg- lavapipe: fix missing piece of VK_KHR_get_physical_device_properties2
14747ec681f3Smrg- vk-device-select: add device group support
14757ec681f3Smrg- lavapipe: refactor image surface creation
14767ec681f3Smrg- lavapipe: rewrite attachment clearing for conditional rendering.
14777ec681f3Smrg- gallium: add a cond rendering hook for vulkan.
14787ec681f3Smrg- llvmpipe: handle vulkan conditional rendering
14797ec681f3Smrg- lavapipe: add VK_EXT_conditional_rendering support.
14807ec681f3Smrg- CI: add lavapipe to llvmpipe rules.
14817ec681f3Smrg- lavapipe: add support for external memory/fd/sempahore extensions
14827ec681f3Smrg- llvmpipe: handle firstvertex for vulkan draw parameters
14837ec681f3Smrg- lavapipe: handle shader draw parameters
14847ec681f3Smrg- lavapipe: add missing loader interface negoitation
14857ec681f3Smrg- lavapipe: move to subclassing instance/physical device.
14867ec681f3Smrg- lavapipe: add missing wsi entrypoint.
14877ec681f3Smrg- lavapipe: sort extensions in proper order.
14887ec681f3Smrg- lavapipe: use common dispatch layer.
14897ec681f3Smrg- radv: move queue object to a common base object
14907ec681f3Smrg- radv: remove all entrypoint enabled debug option
14917ec681f3Smrg- radv: move to subclassed instance/physical_device structs
14927ec681f3Smrg- radv: port to using common dispatch code.
14937ec681f3Smrg- zink: don't pick a cpu device ever.
14947ec681f3Smrg- llvmpipe: add a mutex around debug resource tracking
14957ec681f3Smrg- llvmpipe: fix use after free with fs variant cleanup
14967ec681f3Smrg- lavapipe: reset shader constant buffers after execution
14977ec681f3Smrg- glsl: fix leak in gl_nir_link_uniform_blocks
14987ec681f3Smrg- llvmpipe: enable GL spir-v support
14997ec681f3Smrg- util/format: add helper to check if a format is scaled.
15007ec681f3Smrg- llvmpipe: don't support scaled formats outside vertex buffers
15017ec681f3Smrg- lavapipe: add support for 2/10/10/10 scaled formats.
15027ec681f3Smrg- lavapipe: add support for missing 10/10/10/2 formats.
15037ec681f3Smrg- lavapipe: add reference counting to descriptor set layout
15047ec681f3Smrg- lavapipe: avoid pointer to pipeline layout in execution
15057ec681f3Smrg- lavapipe: set viewport state dirty on first execute
15067ec681f3Smrg- lavapipe: implement physical device group enumeration
15077ec681f3Smrg- lavapipe/meson: drop megadrivers build req
15087ec681f3Smrg- lavapipe: fix some void ptr arithmetic
15097ec681f3Smrg- lavapipe: use msvc compatible 0 init
15107ec681f3Smrg- lavepipe: some misc msvc fixes
15117ec681f3Smrg- lavapipe: make OPT macro MSVC compatible
15127ec681f3Smrg- lavapipe: use os_time for timing related things
15137ec681f3Smrg- vulkan/util: add api to reset object magic + private data.
15147ec681f3Smrg- radv: reset object base on recycled command buffers
15157ec681f3Smrg- tu: reset object base on recycled command buffers
15167ec681f3Smrg- lavapipe: reset object base on recycled command buffers
15177ec681f3Smrg- util: add optimised memset64
15187ec681f3Smrg- u_surface: use optimised memset64
15197ec681f3Smrg- llvmpipe: zs clear use 64-bit memset
15207ec681f3Smrg- lavapipe: use clear interface for renderpass clears
15217ec681f3Smrg- glx: proposed fix for setSwapInterval
15227ec681f3Smrg- zink: use extensioned draw indirect functions.
15237ec681f3Smrg- zink/ci: update results now that we are testing zink/lavapipe
15247ec681f3Smrg- lavapipe: add calibrated timestamp support
15257ec681f3Smrg- zink/ci: update results for GL 3.3 testing enables
15267ec681f3Smrg- zink/ci: disable arb_timer_query tests
15277ec681f3Smrg- lavapipe: use the common icd generator
15287ec681f3Smrg- lavapipe: fix msvc initialiser
15297ec681f3Smrg- lavapipe: add dll definition file instead of using PUBLIC
15307ec681f3Smrg- lavapipe: fix icd generation for windows
15317ec681f3Smrg- meson/llvm: add native for gallium swrast
15327ec681f3Smrg- lavapipe: handle tessellation domain winding
15337ec681f3Smrg- lavapipe: enable KHR_maintenance2
15347ec681f3Smrg- lavapipe: enable KHR_maintenace3
15357ec681f3Smrg- lavapipe: fix descriptor set layout freeing.
15367ec681f3Smrg- lavapipe: fix depth texturing swizzle
15377ec681f3Smrg- lavapipe: use null probe path on win32
15387ec681f3Smrg- ci: try building lavapipe on windows
15397ec681f3Smrg- zink/instance: work with vulkan 1.0 and later loader.
15407ec681f3Smrg- lavapipe: expose a 1.0 vulkan API for now.
15417ec681f3Smrg- lavapipe: Define supported extensions in C
15427ec681f3Smrg- lavapipe: VK_EXT_extended_dynamic_state support
15437ec681f3Smrg- lavapipe: reorder descriptor set stages to get correct binding
15447ec681f3Smrg- lavapipe: sort bindings before creating descriptor set
15457ec681f3Smrg- clover: fix array images view creation
15467ec681f3Smrg- lavapipe: fix pipeline vp/scissor mixup.
15477ec681f3Smrg- lavapipe: fix dynamic viewport/scissor pipeline emission
15487ec681f3Smrg- draw: fix uses viewport index for tess eval shader
15497ec681f3Smrg- draw/prim_assembler: write correct decomposed primitive lengths
15507ec681f3Smrg- llvmpipe: add support for shader viewport layer
15517ec681f3Smrg- lavapipe: enable EXT_shader_viewport_index_layer
15527ec681f3Smrg- zink/ci: update results after layer extensions enabled in lavapipe
15537ec681f3Smrg- util/panfrost/glsl: rename BITSET_LAST_BIT to BITSET_LAST_BIT_SIZED
15547ec681f3Smrg- util/bitset: add a new last bit api
15557ec681f3Smrg- shader_info: convert textures_used to a bitset.
15567ec681f3Smrg- gallium: add a sampler reduction cap + settings
15577ec681f3Smrg- gallium: add a view mask to the draw command
15587ec681f3Smrg- gallivm: mark subpass input attachments as 2d arrays
15597ec681f3Smrg- gallivm: add support for load_view_index intrinsic
15607ec681f3Smrg- draw: add interface to notify renderer of the current view index
15617ec681f3Smrg- draw: refactor out the instances drawing code
15627ec681f3Smrg- draw: add view_mask rendering support
15637ec681f3Smrg- draw: pass the view index to the render driver
15647ec681f3Smrg- draw/vs: pass the view index to the vertex shader
15657ec681f3Smrg- draw: add tess/gs support for multiview index
15667ec681f3Smrg- llvmpipe: add the view index callback from draw
15677ec681f3Smrg- llvmpipe: add view index support to rasterizer
15687ec681f3Smrg- lavapipe: add clear support for multiview
15697ec681f3Smrg- lavapipe: add draw support for multiview
15707ec681f3Smrg- lavapipe: add input attachment support for multiview
15717ec681f3Smrg- lavapipe: add render pass support for multiview
15727ec681f3Smrg- lavapipe: enable KHR_multiview
15737ec681f3Smrg- llvmpipe: add reduction mode support
15747ec681f3Smrg- lavapipe: add EXT_sampler_filter_minmax support
15757ec681f3Smrg- lavapipe: add support for VK_KHR_create_renderpass2
15767ec681f3Smrg- lavapipe: move queue to base object
15777ec681f3Smrg- lavapipe: move to the common casting interfaces
15787ec681f3Smrg- lavapipe: move to common create render pass code
15797ec681f3Smrg- lavapipe: add single ssbo variable pointer support.
15807ec681f3Smrg- docs: update lavapipe features.txt
15817ec681f3Smrg- lavapipe: enable KHR_uniform_buffer_standard_layout
15827ec681f3Smrg- lavapipe: enable EXT_scalar_block_layout
15837ec681f3Smrg- lavapipe: add missing break
15847ec681f3Smrg- lavapipe: fix writing availability for queries.
15857ec681f3Smrg- lavapipe: add host query reset
15867ec681f3Smrg- gallivm: convert packing to uint64 not double
15877ec681f3Smrg- lavapipe: only init immutable samplers for correct types.
15887ec681f3Smrg- lavapipe: add support for KHR_buffer_device_address.
15897ec681f3Smrg- lavapipe: bump maxMemoryAllocationCount
15907ec681f3Smrg- lavapipe: fix image format properties
15917ec681f3Smrg- lavapipe: add missing sampler minmax properties
15927ec681f3Smrg- lavapipe: add missing device group api
15937ec681f3Smrg- lavapipe: drop unused vk_format in image struct
15947ec681f3Smrg- lavapipe: fix templated descriptor updates
15957ec681f3Smrg- gallivm: fix non-32bit ubo loads
15967ec681f3Smrg- gallivm/nir: handle bool registers.
15977ec681f3Smrg- nir: port fp16 casting code from dxil
15987ec681f3Smrg- nir: lower 64-bit floats to 32-bit first.
15997ec681f3Smrg- gallivm: use fp16 casts lowering
16007ec681f3Smrg- lavapipe: enable 8/16-bit storage extensions
16017ec681f3Smrg- llvmpipe: fix cube image size query
16027ec681f3Smrg- st/glthread: allow for invalid L3 cache id.
16037ec681f3Smrg- util: rework AMD cpu L3 cache affinity code.
16047ec681f3Smrg- gallivm: add 64-bit atomic support for ssbo/shared.
16057ec681f3Smrg- gallivm: add 64-bit atomic global support
16067ec681f3Smrg- lavapipe: enable KHR_shader_atomic_int64
16077ec681f3Smrg- lavapipe: only reference pCounterBuffers if non-NULL
16087ec681f3Smrg- lavapipe: fail out if spirv->nir fails
16097ec681f3Smrg- lavapipe: fix only clearing depth or stencil paths.
16107ec681f3Smrg- zink/ci: update results after lavapipe clear fixes
16117ec681f3Smrg- lavapipe: add support for KHR_imageless_framebuffer
16127ec681f3Smrg- drisw: move zink down the list below the sw drivers.
16137ec681f3Smrg- zink/ci: handle getting correct drisw driver.
16147ec681f3Smrg- llvmpipe: when depth clamp is disable clamp to 0.0/1.0
16157ec681f3Smrg- llvmpipe: always take depth clamping from state tracker
16167ec681f3Smrg- ci: update zink/virgl results for depth clamping fixes
16177ec681f3Smrg- lavapipe: add vulkan 1.1 properties/features apis
16187ec681f3Smrg- lavapipe: fix missing protected memory properties
16197ec681f3Smrg- gallivm: add subgroup vote 64-bit and feq support.
16207ec681f3Smrg- gallivm: move get_flt_bld to header.
16217ec681f3Smrg- gallivm: add subgroup system values support
16227ec681f3Smrg- gallivm: add subgroup elect intrinsic support.
16237ec681f3Smrg- gallivm: add subgroup reduction + in/ex scan support
16247ec681f3Smrg- gallivm: add subgroup ballot support
16257ec681f3Smrg- gallivm: add subgroup read invocation support
16267ec681f3Smrg- gallivm: add subgroup lowering support
16277ec681f3Smrg- gallivm: add compute shader subgroup system values support
16287ec681f3Smrg- lavapipe: enable subgroups features
16297ec681f3Smrg- lavapipe: enable correct workgroup sizing
16307ec681f3Smrg- lavapipe: enable Vulkan 1.1 support
16317ec681f3Smrg- docs: update lavapipe bits for 1.1
16327ec681f3Smrg- lavapipe: add vk1.1 image swapchain support
16337ec681f3Smrg- lavapipe: add dummy sampler ycbcr conversion
16347ec681f3Smrg- lavapipe: fix mipmapped resolves.
16357ec681f3Smrg
16367ec681f3SmrgDavid McFarland (1):
16377ec681f3Smrg
16387ec681f3Smrg- radv: fix divide by zero with no tesselation params
16397ec681f3Smrg
16407ec681f3SmrgDouglas Anderson (1):
16417ec681f3Smrg
16427ec681f3Smrg- gallium/indices: Use "__restrict" to help the compiler
16437ec681f3Smrg
16447ec681f3SmrgDrew Davenport (1):
16457ec681f3Smrg
16467ec681f3Smrg- radeonsi: Report multi-plane formats as unsupported
16477ec681f3Smrg
16487ec681f3SmrgDylan Baker (33):
16497ec681f3Smrg
16507ec681f3Smrg- VERSION: bump for 21.1.0 cycle
16517ec681f3Smrg- docs: add release notes for 20.3.3
16527ec681f3Smrg- docs: Add sha256sum for 20.3.3
16537ec681f3Smrg- docs: update calendar and link releases notes for 20.3.3
16547ec681f3Smrg- docs: update calendar for 21.0.0-rc1
16557ec681f3Smrg- bin/post_version: convert the csv.reader into a concrete list
16567ec681f3Smrg- docs: add release notes for 20.3.4
16577ec681f3Smrg- docs: Add sha256sum for 20.3.4
16587ec681f3Smrg- docs: update calendar and link releases notes for 20.3.4
16597ec681f3Smrg- docs: update calendar for 21.0.0-rc2
16607ec681f3Smrg- docs: update calendar for 21.0.0-rc3
16617ec681f3Smrg- Scons: check for timespec_get on windows as well as unices
16627ec681f3Smrg- docs: Remove 21.0 features from features_new.txt
16637ec681f3Smrg- docs: add release notes for 21.0.0
16647ec681f3Smrg- docs: update calendar and link releases notes for 21.0.0
16657ec681f3Smrg- docs: Add calendar entries for 21.0 release.
16667ec681f3Smrg- docs: Extend calendar entries for 21.0 by 1 releases.
16677ec681f3Smrg- docs: Add calendar entries for 21.1 release candidates.
16687ec681f3Smrg- docs: add release notes for 20.3.5
16697ec681f3Smrg- docs: Add hashes for 20.3.5
16707ec681f3Smrg- docs: update calendar and link releases notes for 20.3.5
16717ec681f3Smrg- docs: add release notes for 21.0.1
16727ec681f3Smrg- docs: Add 21.0.1 hashes
16737ec681f3Smrg- docs: update calendar and link releases notes for 21.0.1
16747ec681f3Smrg- docs: add release notes for 21.0.2
16757ec681f3Smrg- relnotes: Add sha256sum for 21.0.2
16767ec681f3Smrg- docs: update calendar and link releases notes for 21.0.2
16777ec681f3Smrg- meson: OpenMP is supposed to be optional
16787ec681f3Smrg- .pick_status.json: Update to ee9b744cb5d1466960e78b1de44ad345590e348c
16797ec681f3Smrg- VERSION: bump for 21.1.0-rc3
16807ec681f3Smrg- .pick_status.json: Update to cbd6e5f2e592a9834a03004a473537f25aea4336
16817ec681f3Smrg- .pick_status.json: Update to ede0b3c643279f4126fb10552a2f1d00be27f16d
16827ec681f3Smrg- .pick_status.json: Update to b80720acb13e1014aea89e6bd25f22d43df85356
16837ec681f3Smrg
16847ec681f3SmrgEdward O'Callaghan (1):
16857ec681f3Smrg
16867ec681f3Smrg- clover: Implement CL_MEM_OBJECT_IMAGE1D
16877ec681f3Smrg
16887ec681f3SmrgEleni Maria Stea (7):
16897ec681f3Smrg
16907ec681f3Smrg- anv: Added the VK_EXT_sample_locations extension to the anv_extensions list
16917ec681f3Smrg- anv: Implement physical device properties for VK_EXT_sample_locations
16927ec681f3Smrg- anv/state: Take explicit sample locations in emit helpers
16937ec681f3Smrg- anv: Add support for sample locations
16947ec681f3Smrg- anv: Removed unused header file
16957ec681f3Smrg- anv: Enabled the VK_EXT_sample_locations extension
16967ec681f3Smrg- iris: fix in fences backend for ext_external_objects edge case
16977ec681f3Smrg
16987ec681f3SmrgElla-0 (1):
16997ec681f3Smrg
17007ec681f3Smrg- glsl: build without bison
17017ec681f3Smrg
17027ec681f3SmrgEnrico Galli (2):
17037ec681f3Smrg
17047ec681f3Smrg- microsoft/spirv_to_dxil: Add support for load_vulkan_descriptor
17057ec681f3Smrg- microsoft/spirv_to_dxil: Use non-zero exit code on failed compilations
17067ec681f3Smrg
17077ec681f3SmrgEric Anholt (207):
17087ec681f3Smrg
17097ec681f3Smrg- gallium/ttn: Add support for TGSI_OPCODE_I64NEG/ABS.
17107ec681f3Smrg- gallium/ntt: Stop lowering integer source mods.
17117ec681f3Smrg- gallium/tgsi: Assert that we don't see integer abs modifiers.
17127ec681f3Smrg- gallium/tgsi: Remove support for f64 src modifiers.
17137ec681f3Smrg- gallium/tgsi: Rewrite the docs on source modifiers.
17147ec681f3Smrg- gallium/tgsi: garbage collect unused TGSI_UTIL_SIGN_MODE.
17157ec681f3Smrg- mesa/st: Make a single helper for the NIR-to-TGSI transfer.
17167ec681f3Smrg- mesa/st: Lower shader images before handing off to NIR-to-TGSI.
17177ec681f3Smrg- mesa/st: Dump nir-to-tgsi output when ST_DEBUG=tgsi or nir is set.
17187ec681f3Smrg- gallium/ntt: Don't vectorize IBFE/UBFE/BFI.
17197ec681f3Smrg- gallium/ntt: Add support for store_per_vertex_output.
17207ec681f3Smrg- gallium/ntt: Avoid referencing undefined channels of system values.
17217ec681f3Smrg- ci/freedreno: Mark some a5xx separate_shader tests as xfails.
17227ec681f3Smrg- ci/freedreno: Fix up the xfail/flake handling of a3xx texture functions.
17237ec681f3Smrg- ci/freedreno: Remove a bunch of stale flakes from a3xx.
17247ec681f3Smrg- ci/freedreno: Drop some long-unseen a5xx flakes.
17257ec681f3Smrg- ci/freedreno: Drop skip list stuff from a5xx flakes.
17267ec681f3Smrg- ci/freedreno: Remove some long-unseen a6xx known flakes.
17277ec681f3Smrg- util/format: Fix pack/unpack of A1R5G5B5_UINT.
17287ec681f3Smrg- mesa: Add some little unit tests showing format unpack behavior.
17297ec681f3Smrg- mesa: Drop incorrect statement about Z unpack behavior.
17307ec681f3Smrg- mesa: Replace the float[4] unpack code with util/format's.
17317ec681f3Smrg- mesa: Make _mesa_unpack_rgba_block() use the u_format pack/unpack.
17327ec681f3Smrg- util: Move most of src/mesa/main/format_utils.h to util/format/
17337ec681f3Smrg- util: Fix UBSan failure on _mesa_unorm_to_unorm.
17347ec681f3Smrg- util: Fix rounding of unpack_unorm8 from small unorm formats.
17357ec681f3Smrg- mesa: Reuse util_format's unpack_8unorm.
17367ec681f3Smrg- mesa: Reuse util_format's unpack functions for pure integer formats.
17377ec681f3Smrg- util: Give a reasonable answer when unpacking z32unorm from floats.
17387ec681f3Smrg- mesa: Use a bunch of util functions for Z/S unpacking.
17397ec681f3Smrg- mesa: Move the rest of format_unpack.py out of code generation.
17407ec681f3Smrg- util/format: Simplify the generated unpack code.
17417ec681f3Smrg- swrast: Use util_format_write_4/4ub for the scattered pixel writes.
17427ec681f3Smrg- mesa/main: Replace float pack function with util_format_pack_rgba().
17437ec681f3Smrg- mesa/main: Replace the uint format packing code with util/format's.
17447ec681f3Smrg- ci/piglit: Upgrade to a newer piglit in our containers.
17457ec681f3Smrg- ci/freedreno: Fix xfail setup for sampler3d_float_vertex.
17467ec681f3Smrg- nir_to_tgsi: Store directly to TGSI outputs when possible.
17477ec681f3Smrg- r300,i915g: Report no shader buffers or images on non-TCL HW.
17487ec681f3Smrg- nir_to_tgsi: Fix buffer overflow in atomic image compswap.
17497ec681f3Smrg- swr: Don't report support for shader images.
17507ec681f3Smrg- panfrost: Stub out set_shader_images().
17517ec681f3Smrg- gallium: Fix leak of shader images on context destruction.
17527ec681f3Smrg- util/bitset: Avoid dereferencing the bitset for size == 0.
17537ec681f3Smrg- ci: Add a fractional deqp run of softpipe with asan enabled.
17547ec681f3Smrg- freedreno/a6xx: Skip the body of emit_state if we're clean.
17557ec681f3Smrg- freedreno: Move blend gmem checks to a blend dirty state check.
17567ec681f3Smrg- freedreno: Move framebuffer state checks under a ctx->dirty flag.
17577ec681f3Smrg- freedreno: Skip some batch dependency tracking if !ctx->dirty.
17587ec681f3Smrg- ci/freedreno: Detect cheza HFI errors and restart the run.
17597ec681f3Smrg- ci/freedreno: Ban vs-clip-vertex-enables which flakes in CI.
17607ec681f3Smrg- ci/freedreno: Ban more flaky clip-enables tests.
17617ec681f3Smrg- ci/freedreno: Make a630 piglit_shader run a manual run, too.
17627ec681f3Smrg- freedreno: Use a real type instead of void * for the fd_batch->key.
17637ec681f3Smrg- freedreno: Early-out from the resource write path when we're the writer.
17647ec681f3Smrg- freedreno: Remove duplicate bc invalidate on flush_write_batch().
17657ec681f3Smrg- ci: Update baremetal kernel to 5.11-rc5 plus patches.
17667ec681f3Smrg- mesa/st: Allocate the gl_context with 16-byte alignment.
17677ec681f3Smrg- ci/freedreno: Drop pointless GIT_STRATEGY setting for a630.
17687ec681f3Smrg- ci/freedreno: Use the new nginx cache for trace downloads.
17697ec681f3Smrg- ci/freedreno: Use the http cache for artifacts downloads, too.
17707ec681f3Smrg- ci/docs: Update CI farm requirements suggestions.
17717ec681f3Smrg- docs/ci: Document setting up the http cache for traces.
17727ec681f3Smrg- ci/lava+baremetal: Add an xserver to the root fs.
17737ec681f3Smrg- ci/freedreno: Do our piglit runs against Xorg.
17747ec681f3Smrg- ci/freedreno: Add Valve games and other traces now that we have GLX.
17757ec681f3Smrg- freedreno: Make sure that queries are disabled during shadow blits.
17767ec681f3Smrg- freedreno: rename batch->active_providers to query_providers_used.
17777ec681f3Smrg- freedreno: Backport a5xx/a6xx fix for active query handling.
17787ec681f3Smrg- freedreno: Drop pointless clear of used providers.
17797ec681f3Smrg- freedreno/a6xx: Skip guessing VSC size with indirect TF draw counts.
17807ec681f3Smrg- docs: Document PIPE_CAP_STREAM_OUTPUT_PAUSE_RESUME.
17817ec681f3Smrg- freedreno/a6xx: Add support for glDrawTransformFeedback().
17827ec681f3Smrg- ci/deqp: Bump runner to 0.5.1 for recent runtime perf improvements.
17837ec681f3Smrg- ci/freedreno: bump VK coverage to 1/4 of the CTS.
17847ec681f3Smrg- ci/freedreno: Run a3xx gles3 in parallel and increase coverage.
17857ec681f3Smrg- ci/virgl: Fix GLES31 testing on desktop GL.
17867ec681f3Smrg- freedreno: Force updating active queries on batch reordering.
17877ec681f3Smrg- freedreno: Remove FD_STAGE_* in favor of a "disable_all" flag.
17887ec681f3Smrg- freedreno/a5xx: Don't forget to count our custom blits against queries.
17897ec681f3Smrg- mesa/st: Always precompile the first shader variant.
17907ec681f3Smrg- mesa/st: Assume that the default variant is always first in the list.
17917ec681f3Smrg- vc4: Remove vestiges of alpha test lowering.
17927ec681f3Smrg- vc4: Stop advertising support for PIPE_CAP_TWO_SIDED_COLOR.
17937ec681f3Smrg- vc4: Stop advertising support for VS color clamping.
17947ec681f3Smrg- v3d: Clean up vestiges of alpha test lowering.
17957ec681f3Smrg- v3d: Stop advertising support for PIPE_CAP_TWO_SIDED_COLOR.
17967ec681f3Smrg- v3d: Stop advertising support for PIPE_CAP_*_COLOR_CLAMPED.
17977ec681f3Smrg- v3d: Stop advertising support for flat shading.
17987ec681f3Smrg- gallium: Document behavior of more lowering pipe caps.
17997ec681f3Smrg- freedreno: Add missing dep on freedreno tracepoints.
18007ec681f3Smrg- gallium: Flip the default value of PIPE_CAP_SHAREABLE_SHADERS.
18017ec681f3Smrg- virgl: Drop a context dependency from part of the shader compile path.
18027ec681f3Smrg- v3d/qpu: Avoid leaking memory in the QPU disasm test.
18037ec681f3Smrg- mesa/st: Make sure to unbind cb0 on transition away from gs/tess shaders.
18047ec681f3Smrg- ci: Allow better customization of the name of the artifacts for minio.
18057ec681f3Smrg- ci/freedreno: Add a fractional gles31 run with asan enabled.
18067ec681f3Smrg- ci/freedreno: Drop the "arm64" in front of job names.
18077ec681f3Smrg- ci: Move specific driver testing to separate files in separate dirs.
18087ec681f3Smrg- ci/freedreno: Fix a5xx piglit runs.
18097ec681f3Smrg- ci/freedreno: Remove stray BM_DTB definition.
18107ec681f3Smrg- ci/bare-metal: Use an upstream kernel for db820c.
18117ec681f3Smrg- ci/a5xx: Update the piglit expectations.
18127ec681f3Smrg- ci/a5xx: Increase our dEQP GLES3 fraction by 4x.
18137ec681f3Smrg- ci: Move the dEQP and traces expectations to the per-driver CI dirs.
18147ec681f3Smrg- ci: Move the piglit expectations lists to the per-driver CI dirs.
18157ec681f3Smrg- ci/zink: Add tests of gles2, gles3, and gl33 on lavapipe.
18167ec681f3Smrg- zink: Use mesa_loge() for should-never-be-reached initialization errors.
18177ec681f3Smrg- zink: Remove NULL checks after GET_PROC_ADDR_INSTANCE().
18187ec681f3Smrg- softpipe: Fix the const buffer overflow check.
18197ec681f3Smrg- mesa: Get the FXT1 compressor/decompressor off of GL types.
18207ec681f3Smrg- mesa: Move the FXT1 compressor/decompressor to util/
18217ec681f3Smrg- llvmpipe: Enable FXT1 texture decompression.
18227ec681f3Smrg- v3d: Replace driver lowering of GL_CLAMP with mesa/st's.
18237ec681f3Smrg- ci/piglit: Stop including the test counts at the end of expectations.
18247ec681f3Smrg- ci/iris: Move the traces yml file to the driver-specific dir.
18257ec681f3Smrg- mesa: Always make sure uniform storage doesn't get reallocated.
18267ec681f3Smrg- freedreno: Remove uniform variables after finalizing NIR.
18277ec681f3Smrg- freedreno: Drop custom driver lowering of two-sided color.
18287ec681f3Smrg- freedreno: Drop custom driver lowering of GL's color clamping.
18297ec681f3Smrg- freedreno: Use the mesa/st frontend lowering of GL_CLAMP.
18307ec681f3Smrg- freedreno/a5xx+: Stop recompiling on texture samples changes.
18317ec681f3Smrg- freedreno/a5xx+: Drop the unused no_decode_srgb flag.
18327ec681f3Smrg- freedreno/a5xx: Fix cube image load/stores.
18337ec681f3Smrg- nir: Add a nir_src_is_undef() helper, like nir_src_is_const().
18347ec681f3Smrg- nir/vec_to_movs: Don't generate MOVs for undef channels.
18357ec681f3Smrg- ci: Move deqp-default-skips.txt back to .gitlab-ci/
18367ec681f3Smrg- ci/lava: Move the per-driver gitlab-ci.yml to each driver.
18377ec681f3Smrg- ci/lava: Move the driver expectation files to the per-driver CI dir.
18387ec681f3Smrg- tgsi_exec: Roll the loops for condmask handling.
18397ec681f3Smrg- tgsi_exec: Jump over entirely non-taken THEN or ELSE branches.
18407ec681f3Smrg- ci/freedreno: Also retest when only CI configuration changes.
18417ec681f3Smrg- ci/freedreno: Switch the fastboot boards to using nfsroot.
18427ec681f3Smrg- ci/a5xx: Run all of gles2 in one job.
18437ec681f3Smrg- ci/a3xx: Run all of GLES3 dEQP.
18447ec681f3Smrg- ci/a5xx: Increase the gles3/31 coverage.
18457ec681f3Smrg- ci/a5xx: Update piglit expectations.
18467ec681f3Smrg- ci/zink: Add another primitive restart flake.
18477ec681f3Smrg- ci/turnip: Mark a flaky WSI test.
18487ec681f3Smrg- lima: stop encoding the texture format in the shader key
18497ec681f3Smrg- lima: don't look at dirty bits for setup of FS key
18507ec681f3Smrg- lima: upload the shader to a BO at shader creation
18517ec681f3Smrg- lima: avoid stomping over bound shader state when creating new shaders
18527ec681f3Smrg- nir-to-tgsi: Fix handling of partial writemasks on SSA/REG decls.
18537ec681f3Smrg- docs: Add some documentation of game GL buffer object mapping behavior.
18547ec681f3Smrg- freedreno/a5xx: Introduce an event write helper like a6xx has.
18557ec681f3Smrg- freedreno/a5xx: Flush depth at the end of sysmem, like a6xx does.
18567ec681f3Smrg- ci/freedreno: Mark another a5xx TF flake.
18577ec681f3Smrg- u_format: Mark the generated pack/unpack src/dst args as restrict.
18587ec681f3Smrg- mesa/st: Unify st_get_vp_variant() and st_get_common_variant().
18597ec681f3Smrg- mesa/st: Add perf debug for draw-time variant compiles.
18607ec681f3Smrg- mesa/st: Fix precompile misses on compat GL VSes writing to color outputs.
18617ec681f3Smrg- virgl: Update GLES expectations.
18627ec681f3Smrg- ci/freedreno: Add three more a5xx flakes from the last day.
18637ec681f3Smrg- freedreno/a5xx: Fix the texel buffer alignment requirement.
18647ec681f3Smrg- freedreno/a5xx: Fix the max texture buffer size.
18657ec681f3Smrg- ci/panfrost: Disable t860/radeonsi testing while the runners are struggling.
18667ec681f3Smrg- ci: Bump deqp-runner to v0.6.3.
18677ec681f3Smrg- ci/freedreno: Switch the piglit testing to the new piglit runner.
18687ec681f3Smrg- ci/bare-metal: Restart a run on intermittent kernel lockups.
18697ec681f3Smrg- ci/freedreno: Mark an a630 piglit flake from async shader compiling.
18707ec681f3Smrg- ci/freedreno: Mark the rest of the glx_arb_sync_control@timing as flakes.
18717ec681f3Smrg- nir_to_tgsi: Respect PIPE_SHADER_CAP_TGSI_SQRT_SUPPORTED.
18727ec681f3Smrg- freedreno/a5xx: Fix stream-output binning handling.
18737ec681f3Smrg- freedreno/ir3: Demote centroid usage to pixel on non-msaa.
18747ec681f3Smrg- ci/freedreno: Mark all of dEQP TF as flaky.
18757ec681f3Smrg- ci/bare-metal: Move the db820c lockup detect to the right boot script.
18767ec681f3Smrg- ci/freedreno: Mark glx-swap-copy as a flake on a630.
18777ec681f3Smrg- freedreno/a6xx: Rename the RB_BLIT_INFO.INTEGER field to SAMPLE_0.
18787ec681f3Smrg- freedreno/a6xx: Disable sample averaging on depth/stencil resolves.
18797ec681f3Smrg- freedreno: Move the ir3 linked shader cache to the context.
18807ec681f3Smrg- freedreno/a3xx: Switch to using ir3_cache for looking up our VS/FS.
18817ec681f3Smrg- freedreno/a4xx: Switch to using ir3_cache for looking up our VS/FS
18827ec681f3Smrg- freedreno/a5xx: Switch to using ir3_cache for looking up our VS/FS
18837ec681f3Smrg- turnip: Fix KGSL build since common dispatch rework.
18847ec681f3Smrg- broadcom: Disbale CLIF dumping when libexpat isn't available.
18857ec681f3Smrg- ci/android: Make sure we don't detect system libexpat.
18867ec681f3Smrg- ci/android: Build the v3dv driver.
18877ec681f3Smrg- iris: Flag for resolves when stencil enable changes, too.
18887ec681f3Smrg- freedreno: Assert that TF prims generated can ignore active_queries.
18897ec681f3Smrg- freedreno/ir3: Move max-tf-vtx calculation to a .c file
18907ec681f3Smrg- freedreno: Move max-tf-vtx calculation to just the HW that needs it.
18917ec681f3Smrg- freedreno: Move the primitives generated/written updates after the draw.
18927ec681f3Smrg- freedreno: Don't count SW TF queries on a6xx.
18937ec681f3Smrg- freedreno: Clamp TF prims written to buffer size pre-a6xx.
18947ec681f3Smrg- ci/freedreno: Mark a630 as flaky on arb_draw_indirect-transform-feedback
18957ec681f3Smrg- nir: Update clip_distance_array_size in clip lowering.
18967ec681f3Smrg- freedreno/a6xx: Use the frontend userclip lowering.
18977ec681f3Smrg- freedreno/a5xx: Add support for clip distances and use them for userclip.
18987ec681f3Smrg- freedreno/a5xx: Use VALIDREG/CONDREG like a6xx do.
18997ec681f3Smrg- ci/freedreno: Demote a630-asan to a manual test for now.
19007ec681f3Smrg- ci: Drop the custom db820c kernel/dtb from the kernel+rootfs.
19017ec681f3Smrg- ci/freedreno: Add more new traces for a630 (minetest, TDM, pioneer, glyphy).
19027ec681f3Smrg- ci/freedreno: Rename a306-test and a530-test to drop "arm64" from the name.
19037ec681f3Smrg- ci/freedreno: Add trace testing on a3xx, a5xx.
19047ec681f3Smrg- freedreno/a5xx: Fix alpha test vs early Z bugs.
19057ec681f3Smrg- freedreno/a6xx: Fix alpha tests.
19067ec681f3Smrg- ci/freedreno: Switch to the trimmed glxgears trace.
19077ec681f3Smrg- ci/freedreno: Fix up the a5xx border color flake annotation.
19087ec681f3Smrg- ci: Uprev deqp runner to 0.6.5.
19097ec681f3Smrg- ci: Uprev piglit to 6a4be9e9946d ("piglit: NOTE! Default branch is now main")
19107ec681f3Smrg- ci: bump bare-metal kernel to bring in an a530 stability fix
19117ec681f3Smrg- freedreno: Fix YUV sampler regression.
19127ec681f3Smrg- nir_to_tgsi: Use ARL instead of UARL in the !native_integers case.
19137ec681f3Smrg- nir: Generate load_ubo_vec4 directly for !PIPE_CAP_NATIVE_INTEGERS
19147ec681f3Smrg- freedreno/a6xx: Don't try to do Z-as-RGBA blits for mismatched formats.
19157ec681f3Smrg- ci/virgl: Mark a couple of new Crash tests as flakes.
19167ec681f3Smrg
19177ec681f3SmrgEric Engestrom (12):
19187ec681f3Smrg
19197ec681f3Smrg- VERSION: bump to 21.1.0-rc1
19207ec681f3Smrg- .pick_status.json: Update to c74d93cf0187e07cdfacc448a947a8cae485eb41
19217ec681f3Smrg- .pick_status.json: Update to 95d9d811c91076d50385b2fbd330335b68688c69
19227ec681f3Smrg- .pick_status.json: Update to fcb5ba58165cd407408f8dd9a102f0c5e16a9956
19237ec681f3Smrg- VERSION: bump for 21.1.0-rc2
19247ec681f3Smrg- .pick_status.json: Mark 8acf361db4190aa5f7c788019d1e42d1df031b81 as denominated
19257ec681f3Smrg- .pick_status.json: Update to 35a28e038107410bb6a733c51cbd267aa79a4b20
19267ec681f3Smrg- .pick_status.json: Update to 7e905bd00f32b4fa48689a8e6266b145662cfc48
19277ec681f3Smrg- .pick_status.json: Update to 72eca47c660b6c6051be5a5a80660ae765ecbaa5
19287ec681f3Smrg- .pick_status.json: Update to f3d2fade82c168a7ffffa4bd7bf22585c45c711b
19297ec681f3Smrg- .pick_status.json: Update to f5d6a1b916fb163ee72e6a6f356937b1fbac53e0
19307ec681f3Smrg- .pick_status.json: Update to 1d418e79b8a0f4270775277b7115b88ac4c77113
19317ec681f3Smrg
19327ec681f3SmrgErico Nunes (15):
19337ec681f3Smrg
19347ec681f3Smrg- lima: introduce fs and vs shader cache
19357ec681f3Smrg- lima/ppir: fix creation of mov node for non-ssa tex dest
19367ec681f3Smrg- lima: set yuv formats as external_only
19377ec681f3Smrg- lima: enable r and rg pixel formats again
19387ec681f3Smrg- lima: always set stride in texture descriptor
19397ec681f3Smrg- lima: implement GL_EXT_texture_swizzle
19407ec681f3Smrg- docs/features: add lima features
19417ec681f3Smrg- lima: fix max sampler views
19427ec681f3Smrg- lima: run nir dce after nir_lower_vec_to_movs
19437ec681f3Smrg- lima/ppir: remove liveness info from blocks
19447ec681f3Smrg- lima/ppir: remove use of live_out
19457ec681f3Smrg- lima/ppir: rework liveness data structures to bitset
19467ec681f3Smrg- lima: fix half float render
19477ec681f3Smrg- lima: enable rg formats for fp16 render
19487ec681f3Smrg- lima: increase epsilon for depthrange near == far
19497ec681f3Smrg
19507ec681f3SmrgErik Faye-Lund (158):
19517ec681f3Smrg
19527ec681f3Smrg- zink: handle NULL views in zink_set_sampler_views
19537ec681f3Smrg- zink: fix vertex-stride wrangling
19547ec681f3Smrg- docs: fix sphinx-warnings due to lacking escaping
19557ec681f3Smrg- docs: fix broken link
19567ec681f3Smrg- docs: turn non-code into comment
19577ec681f3Smrg- docs/features: add missing features for zink
19587ec681f3Smrg- docs/features: remove a few redundant zink mentions
19597ec681f3Smrg- zink: always expose linear float textures
19607ec681f3Smrg- zink: respect feature-cap for robust buffer access
19617ec681f3Smrg- zink: respect feature-cap for independent blending
19627ec681f3Smrg- zink: respect feature-cap for sample-shading
19637ec681f3Smrg- zink: respect feature-cap for multi-draw indirect
19647ec681f3Smrg- zink: check for extension instead of function
19657ec681f3Smrg- zink: require vulkan memory model for tesselation
19667ec681f3Smrg- zink: make all xfb caps depend on extension
19677ec681f3Smrg- zink: respect fragment-shader depth-layout
19687ec681f3Smrg- zink: clone shader before lowering clip_halfz
19697ec681f3Smrg- docs/zink: add missing colon
19707ec681f3Smrg- docs/zink: add two missing required features
19717ec681f3Smrg- docs/zink: document the independentBlend requirement for GL3
19727ec681f3Smrg- docs/zink: fix phrasing of GL 3.3 requirements
19737ec681f3Smrg- docs/zink: add GL 4.0 requirements
19747ec681f3Smrg- docs/zink: add GL 4.1 requirements
19757ec681f3Smrg- docs/zink: add GL 4.2 requirements
19767ec681f3Smrg- docs/features: mark off two more extensions for zink
19777ec681f3Smrg- docs/zink: correct vk version for GL 4.2
19787ec681f3Smrg- mesa/main: remove leftover bumpmap code
19797ec681f3Smrg- compiler/nir: add texcoord replace lowering pass
19807ec681f3Smrg- gallium/st: lower point-sprites if not supported
19817ec681f3Smrg- zink: request texcoord replace lowering
19827ec681f3Smrg- docs/features: mark ssbos as done for zink
19837ec681f3Smrg- zink: remove stale TODO
19847ec681f3Smrg- zink: be more careful about limits when unsupported
19857ec681f3Smrg- zink: correct return-type for function
19867ec681f3Smrg- zink: only emit SpvCapabilityDerivativeControl when needed
19877ec681f3Smrg- zink: only emit cap when needed
19887ec681f3Smrg- zik: correct spir-v caps for textures and images
19897ec681f3Smrg- zink: do not insist shaders come from glsl
19907ec681f3Smrg- zink: add a get_primitive_mode-helper
19917ec681f3Smrg- zink: add a get_spacing-helper
19927ec681f3Smrg- zink: refactor vertex-order emitting
19937ec681f3Smrg- zink: wrap some long lines
19947ec681f3Smrg- docs: fix invalid rst syntax
19957ec681f3Smrg- zink: check for error when calling vkEnumeratePhysicalDevices
19967ec681f3Smrg- zink: explicitly check for VK_NULL_HANDLE
19977ec681f3Smrg- zink: support using lavapipe
19987ec681f3Smrg- CI: always expose docs artifacts
19997ec681f3Smrg- ci: make sure all lava-builders have libvulkan
20007ec681f3Smrg- ci: run piglit on zink with lavapipe
20017ec681f3Smrg- lavapipe: report correct value for minMemoryMapAlignment
20027ec681f3Smrg- ci: document arm oddity in build-rules
20037ec681f3Smrg- zink: correctly handle 64 valid timestamp bits
20047ec681f3Smrg- zink: enable excluded test
20057ec681f3Smrg- ci: enable max texture size tests for zink
20067ec681f3Smrg- lavapipe: handle null-buffers for xfb
20077ec681f3Smrg- ci: disable sporadically failing test
20087ec681f3Smrg- zink: drop extra set of parens
20097ec681f3Smrg- zink: do not use extra staging resource unless needed
20107ec681f3Smrg- zink: don't always require linear display-targets
20117ec681f3Smrg- zink: limit host-visible bind-flags
20127ec681f3Smrg- zink: ignore irrelevant bind-flags
20137ec681f3Smrg- zink: use gallium api to copy to display-target
20147ec681f3Smrg- zink: add X32_S8X24 format
20157ec681f3Smrg- zink: correct inaccurate comment
20167ec681f3Smrg- lavapipe: fix primitive-restart for uint8 indices
20177ec681f3Smrg- zink: fix emulation of no mipfilter
20187ec681f3Smrg- zink: fix free of ralloced pointer
20197ec681f3Smrg- gallium/st: fix shader_has_one_variant
20207ec681f3Smrg- gallium/st: fix shader_has_one_variant
20217ec681f3Smrg- gallium/st: reserve space in default uniform block for lowered constants
20227ec681f3Smrg- docs: remove stray newline
20237ec681f3Smrg- docs: remove excessive wrapping
20247ec681f3Smrg- docs: remove excessive quoting
20257ec681f3Smrg- docs: document zink GL 4.3 requirements
20267ec681f3Smrg- docs: document zink GL 4.4 requirements
20277ec681f3Smrg- docs: document zink GL 4.5 requirements
20287ec681f3Smrg- docs: document zink GL 4.6 requirements
20297ec681f3Smrg- docs: simplify format requirements
20307ec681f3Smrg- zink: factor out interpolation to helper
20317ec681f3Smrg- zink: emit all interpolation modes
20327ec681f3Smrg- zink: check for pipeline statistics feature
20337ec681f3Smrg- zink: check for depth-bias-clamp feature
20347ec681f3Smrg- zink: check for stores and atomics features
20357ec681f3Smrg- zink: add missing required feature
20367ec681f3Smrg- zink: check for mirror-clamp extension
20377ec681f3Smrg- zink: fix vector comparison
20387ec681f3Smrg- zink: drop bool attempt in float vector compares
20397ec681f3Smrg- zink: do not open-code vector-compares
20407ec681f3Smrg- zink: follow spir-v 1.0 spec
20417ec681f3Smrg- docs: Add 21.0.0 hashes
20427ec681f3Smrg- zink: tighten emitted image spir-v caps
20437ec681f3Smrg- zink: remove no-longer-needed clipdist1 patching
20447ec681f3Smrg- frontends/va: correct check for invalid format
20457ec681f3Smrg- zink: handle errors in nir_to_spirv
20467ec681f3Smrg- zink: pre-populate locations in variables
20477ec681f3Smrg- zink: do not depend on shader_slots_reserved for xfb
20487ec681f3Smrg- zink: use pre-populated shader-locations
20497ec681f3Smrg- lavapipe: report correct value for maxTexelBufferElements
20507ec681f3Smrg- docs: do not try to copy missing file
20517ec681f3Smrg- compiler/glsl: avoid null-pointer deref
20527ec681f3Smrg- docs: remove bogus zink-requirement
20537ec681f3Smrg- docs: remove zink incorrect requirement
20547ec681f3Smrg- zink: do not enable unused extension
20557ec681f3Smrg- docs: clarify VK_KHR_external_memory requirement
20567ec681f3Smrg- zink: check base-requirements
20577ec681f3Smrg- zink: assert that pstage is within range
20587ec681f3Smrg- zink: simplify shader-removal
20597ec681f3Smrg- zink: document why we're calling pipe_shader_type_from_mesa
20607ec681f3Smrg- docs: appling -> applying
20617ec681f3Smrg- docs: sytem -> system
20627ec681f3Smrg- docs: ie. -> i.e.
20637ec681f3Smrg- docs: vulkan -> Vulkan
20647ec681f3Smrg- zink: do not request scoped memory barriers
20657ec681f3Smrg- docs: optimisation -> optimization
20667ec681f3Smrg- docs: opencl -> OpenCL
20677ec681f3Smrg- docs: Xorg -> X.Org
20687ec681f3Smrg- docs: nops -> NOPs
20697ec681f3Smrg- docs: lod -> LOD
20707ec681f3Smrg- docs: lex / yacc -> Lex / Yacc
20717ec681f3Smrg- docs: dfsm -> DFSM
20727ec681f3Smrg- docs: fix incorrect possessive form
20737ec681f3Smrg- docs: fix invalid rst
20747ec681f3Smrg- docs: fix rst-quoting issues in release-notes
20757ec681f3Smrg- docs: spell out full name of gitlab instance
20767ec681f3Smrg- docs: spell out development
20777ec681f3Smrg- docs: spell out environment
20787ec681f3Smrg- docs: spell out freedesktop.org
20797ec681f3Smrg- docs: no-op'd -> disabled
20807ec681f3Smrg- docs: fix release notes for 20.3.5
20817ec681f3Smrg- ci: turn sphinx-build warnings into errors
20827ec681f3Smrg- bin/gen_release_notes.py: more robust rST escaping
20837ec681f3Smrg- compiler/glsl: correct the number of string-arguments
20847ec681f3Smrg- compiler/glsl: fix volatile string
20857ec681f3Smrg- compiler/glsl: clean up output
20867ec681f3Smrg- glsl: fix is_integer_16_32
20877ec681f3Smrg- glsl: fix int16 type
20887ec681f3Smrg- glsl: tolerate int16 loop counters
20897ec681f3Smrg- gallium/st: correct range for float16
20907ec681f3Smrg- gallium/st: correct range for int16
20917ec681f3Smrg- zink: document scalarBlockLayout requirement
20927ec681f3Smrg- zink: fix typo in function name
20937ec681f3Smrg- compiler/glsl: drop rogue argument to _mesa_glsl_error
20947ec681f3Smrg- compiler/glsl: do not cast struct to string
20957ec681f3Smrg- lavapipe: do not subtract 8 from enum
20967ec681f3Smrg- lavapipe: check all vertex-stages
20977ec681f3Smrg- lavapipe: check all graphics stages
20987ec681f3Smrg- lavapipe: ask pipe-driver for int16 support
20997ec681f3Smrg- zink: do not clear on cpu
21007ec681f3Smrg- zink: fall back from cached to non-cached memory
21017ec681f3Smrg- zink: do not dereference NULL pointer
21027ec681f3Smrg- zink: verify that src/dst support blitting
21037ec681f3Smrg- zink: verify that source-format support linear-filter
21047ec681f3Smrg- zink: fix stencil-export cap emission
21057ec681f3Smrg- gallivm: handle 16-bit input in i2b32
21067ec681f3Smrg- zink: do not read outside of array
21077ec681f3Smrg- zink: do not require vulkan memory model for shader-images
21087ec681f3Smrg- zink: correct image cap checks
21097ec681f3Smrg- zink: fix shader-image requirements
21107ec681f3Smrg
21117ec681f3SmrgFan Yugang (1):
21127ec681f3Smrg
21137ec681f3Smrg- intel/tools: Show unknown instructions in decoded state.
21147ec681f3Smrg
21157ec681f3SmrgFrancisco Jerez (9):
21167ec681f3Smrg
21177ec681f3Smrg- intel/gen12: Fix memory corruption issues in fused Gen12 parts.
21187ec681f3Smrg- intel/genxml: Fix pixel hashing 3DSTATE_3D_MODE field definitions for Gen12 and Gen12.5.
21197ec681f3Smrg- intel/genxml: Define 3DSTATE_SUBSLICE_HASH_TABLE command for Gen12 and Gen12.5.
21207ec681f3Smrg- intel/dev: Implement pixel pipe subslice counting for Gen12+.
21217ec681f3Smrg- iris/gen11+: Calculate pixel hashing tables instead of hardcoding.
21227ec681f3Smrg- iris/gen12: Implement programming of pixel pipe hashing tables.
21237ec681f3Smrg- anv/gen11+: Calculate pixel hashing tables instead of hardcoding.
21247ec681f3Smrg- anv/gen12: Implement programming of pixel pipe hashing tables.
21257ec681f3Smrg- iris/gen12: Work around push constant corruption on context switch.
21267ec681f3Smrg
21277ec681f3SmrgGeorg Lehmann (1):
21287ec681f3Smrg
21297ec681f3Smrg- vulkan/device_select: Only call vkGetPhysicalDeviceProperties2 if the device supports it.
21307ec681f3Smrg
21317ec681f3SmrgGert Wollny (89):
21327ec681f3Smrg
21337ec681f3Smrg- r600/nir: clone shader before first query to shader key
21347ec681f3Smrg- r600/sfn: fix use of b32all/and
21357ec681f3Smrg- r600: Add flags to INTERP_X and INTERP_Z two-slot ops
21367ec681f3Smrg- r600/sb: Add support for INTERP_X and INTERP_Z ops
21377ec681f3Smrg- r600/nir: pass array info to r600_shader for sb
21387ec681f3Smrg- r600/sfn: update shader array info
21397ec681f3Smrg- r600/sfn: Keep array registers alive for the whole shader
21407ec681f3Smrg- r600/sb: fix boundary assert for mem-instruction decoding
21417ec681f3Smrg- r600/sb: fall back to un-optimized byte code when ra_init fails
21427ec681f3Smrg- r600: Enable sb also for NIR
21437ec681f3Smrg- compiler/nir: Add support for lowering stores with nir_lower_instruction
21447ec681f3Smrg- r600/sfn: Fix use of cnde_int for bcsel
21457ec681f3Smrg- r600/sfn: Set unnormalized flag for z-coordinate when fetching from array
21467ec681f3Smrg- r600/sfn: Add the position input as varying
21477ec681f3Smrg- r600/sfn: Fix FS inputs when reading from the same position
21487ec681f3Smrg- r600/sfn: Fix dual source blend lowered to FRAG_DATA
21497ec681f3Smrg- r600/sfn: Use the constant buffer ID when given
21507ec681f3Smrg- gallium/tgsi_to_nir: Handle SAMPLE_MASK output in FS
21517ec681f3Smrg- gallium/tgsi-to-nir: Take property NUM_CLIPDIST_ENABLED into account
21527ec681f3Smrg- r600/sfn: Handle memory_barrier_atomic_counters
21537ec681f3Smrg- r600/sfn: Fix indirect_file flag for IMAGES
21547ec681f3Smrg- r600/sfn: remove duplicate barriers
21557ec681f3Smrg- r600/sfn: Base instr lowering class on nir_lower_instruction code
21567ec681f3Smrg- nir: Add flag to tex instruction to indicate lowering cube to array
21577ec681f3Smrg- nir: Add r600 specific CUBE opcode to evaluate cube texture coords and face
21587ec681f3Smrg- r600/sfn: Add support for cube_r600 instruction
21597ec681f3Smrg- r600/sfn: add lowering pass for cube textures
21607ec681f3Smrg- r600/sfn: fix gather with cube lowering
21617ec681f3Smrg- r600/sfn: use lower bool to int32 and lower int_tg4 only on shader clone
21627ec681f3Smrg- r600/sfn: use lowering pass for cube textures
21637ec681f3Smrg- r600/sfn: remove old cube texturing code
21647ec681f3Smrg- r600/sfn: Lower FS inputs to temps late and, and lower interpolate at
21657ec681f3Smrg- r600/sfn: set info about using helper_invocation to skip sb
21667ec681f3Smrg- r600/sfn: lower isign and iabs in nir
21677ec681f3Smrg- r600/sfn: Allow any channel for the helper invocation evaluation
21687ec681f3Smrg- r600: unify nir shader options evaluation
21697ec681f3Smrg- r600/sfn: remove code for nir_op_fsign since it is lowered
21707ec681f3Smrg- r600/sfn: remove unused emit_alu_op2_split_src_mods
21717ec681f3Smrg- r600/sfn: remove some old debug output
21727ec681f3Smrg- r600/sfn: encode component in address for local IO
21737ec681f3Smrg- nir: disaallow reordering for r600 shared load and remove component field
21747ec681f3Smrg- r600/sfn: handle querying the number of layers in cube arrays
21757ec681f3Smrg- r600/sfn: Fix loading TES gl_PatchVerticesIn
21767ec681f3Smrg- r600: Don't optimize using source modifiers on literals
21777ec681f3Smrg- r600: Enable GLSL 450 for nir shaders.
21787ec681f3Smrg- r600/sfn: Update status
21797ec681f3Smrg- nir: Add r600 specific intrinsic for loading the tesselation coords
21807ec681f3Smrg- r600/sfn: lower intrinsic_load_tess_coord to driver version
21817ec681f3Smrg- r600/sfn: eliminate loading unused component loads from shared memory
21827ec681f3Smrg- virgl: implement support for  PIPE_CAP_STRING_MARKER
21837ec681f3Smrg- r600/sfn: sort alu opcodes in switch statememt
21847ec681f3Smrg- r600/sfn: remove unused code
21857ec681f3Smrg- r600/sfn: fix buffer offset for ssbo writes
21867ec681f3Smrg- r600/sfn: Fix including/not including c++ parts of header
21877ec681f3Smrg- r600/sfn: lower bitfield_extract and bitfield_insert in NIR
21887ec681f3Smrg- r600/sfn: lower idiv, imod, etc in nir
21897ec681f3Smrg- r600/sfn remove some leftover debug output
21907ec681f3Smrg- nir: add opcodes for \*find_msb_rev and lowering
21917ec681f3Smrg- nir: Add opcodes for fused comp + csel and optimizations
21927ec681f3Smrg- nir: Add r600 specific sin and cos variants
21937ec681f3Smrg- r600/sfn: Add algebraic lowering for fsin and  fcos
21947ec681f3Smrg- r600/sfn: optimize comp+csel using fused ops
21957ec681f3Smrg- r600/sfn: lower find_msb variants to find_msb_rev
21967ec681f3Smrg- r60/sfn: don't lower scomp
21977ec681f3Smrg- r600: Handle negate of second operator in TGSI_OPCODE_UADD
21987ec681f3Smrg- r600/sfn: Make some value pool functions private
21997ec681f3Smrg- r600/sfn: Add skelton for visitor pattern
22007ec681f3Smrg- r600/sfn: remove extra parameter from alu assemebly emission
22017ec681f3Smrg- r600/sfn: fix some formatting
22027ec681f3Smrg- r600/sfn: switch assembler creation to use visitor
22037ec681f3Smrg- r600/sfn: drop the local register mep
22047ec681f3Smrg- r600/sfn: lower VS IO and drop old deref code
22057ec681f3Smrg- r600/sfn: lowered FS output IO
22067ec681f3Smrg- r600/sfn: remove old deref code
22077ec681f3Smrg- r600/sfn: force dual source blend output handling in some cases
22087ec681f3Smrg- r600/sfn: remove find_msb lowering in driver
22097ec681f3Smrg- r600/sfn: clean up multi-sample texture load
22107ec681f3Smrg- r600/sfn: clean up value pool interface usage in emit_stream
22117ec681f3Smrg- r600/sfn: use get_temp_vec4 directly when fetching
22127ec681f3Smrg- r600/sfn: make allocate_temp_register private to valuepool
22137ec681f3Smrg- r600: Fix texture buffer offset alignment
22147ec681f3Smrg- r600: don't set an index_bias for indirect draw calls
22157ec681f3Smrg- r600: Don't advertise support for scaled int16 vertex formats
22167ec681f3Smrg- r600/sfn: allocate number of VS outputs based on max driver_location
22177ec681f3Smrg- r600/sfn: Only fetch the constants that are needed in fdd*
22187ec681f3Smrg- r600: Enable sb for nir only on specific request
22197ec681f3Smrg- Revert "r600: don't set an index_bias for indirect draw calls"
22207ec681f3Smrg- Revert "r600: Don't advertise support for scaled int16 vertex formats"
22217ec681f3Smrg- r600: don't set an index_bias for indirect draw calls
22227ec681f3Smrg
22237ec681f3SmrgGiovanni Mascellani (2):
22247ec681f3Smrg
22257ec681f3Smrg- disk_cache: Fail creation when cannot inizialize queue.
22267ec681f3Smrg- anv: Allow null handle in DestroyDescriptorUpdateTemplate.
22277ec681f3Smrg
22287ec681f3SmrgGreg V (1):
22297ec681f3Smrg
22307ec681f3Smrg- meson: Add missing lavapipe dep
22317ec681f3Smrg
22327ec681f3SmrgGustavo Padovan (4):
22337ec681f3Smrg
22347ec681f3Smrg- gitlab-ci: extend x86_64 kernel config to suport Intel devices
22357ec681f3Smrg- gitlab-ci: build the iris gallium driver as well
22367ec681f3Smrg- gitlab-ci: add intel APL and GLK devices with manual triggers
22377ec681f3Smrg- traces-iris: fix expectation for Intel GLK
22387ec681f3Smrg
22397ec681f3SmrgHans-Kristian Arntzen (2):
22407ec681f3Smrg
22417ec681f3Smrg- radv: Take image alignment into account when allocating MUTABLE pool.
22427ec681f3Smrg- radv: Allocate buffer list for MUTABLE descriptor types as well.
22437ec681f3Smrg
22447ec681f3SmrgHoe Hao Cheng (19):
22457ec681f3Smrg
22467ec681f3Smrg- zink/codegen: add some new attributes to Extension
22477ec681f3Smrg- zink/codegen: make 'struct' optional in Version
22487ec681f3Smrg- zink/codegen: enable instance extension unconditionally if promoted
22497ec681f3Smrg- zink/codegen: codegen-ize load_instance_extensions()
22507ec681f3Smrg- zink/codegen: adding/fixing comments and copyright notice
22517ec681f3Smrg- zink/codegen: find promotion version using vulkan registry
22527ec681f3Smrg- zink: remove excessive checks for loader version
22537ec681f3Smrg- zink: expose PIPE_CAP_ANISOTROPIC_FILTER
22547ec681f3Smrg- zink: VK_KHR_draw_indirect_count is a device extension
22557ec681f3Smrg- zink/codegen: introduce ExtensionRegistry
22567ec681f3Smrg- zink/codegen: more validation in zink_instance
22577ec681f3Smrg- zink/codegen: introduce notion of non-standard extensions
22587ec681f3Smrg- zink/codegen: make zink_device_info accept vk.xml
22597ec681f3Smrg- zink/codegen: perform basic validation in zink_device_info
22607ec681f3Smrg- zink/codegen: validate has_properties and has_features
22617ec681f3Smrg- zink/codegen: fix type annotations
22627ec681f3Smrg- zink/codegen: do not enable extensions that are now core
22637ec681f3Smrg- zink: enable KHR_shader_draw_parameters on Vulkan <1.2
22647ec681f3Smrg- zink: fix detection of KHR_maintenance1/2
22657ec681f3Smrg
22667ec681f3SmrgHyunjun Ko (5):
22677ec681f3Smrg
22687ec681f3Smrg- turnip: Return correct value of tu6_load_state_size
22697ec681f3Smrg- nir: Set access at lower_ubo_vec4
22707ec681f3Smrg- ir3: fix has_src() to return correctly in ir3_nir_lower_tex_prefetch
22717ec681f3Smrg- ir3: Add nonuniform encodings to ir3 encoder and parser
22727ec681f3Smrg- turnip: Enable nonuniform descriptor indexing
22737ec681f3Smrg
22747ec681f3SmrgIago Toral Quiroga (93):
22757ec681f3Smrg
22767ec681f3Smrg- v3dv: only update uniforms for dirty descriptors if stage has descriptors
22777ec681f3Smrg- v3dv: disable early Z writes if Z writes are disabled
22787ec681f3Smrg- v3dv: don't wait for idle on occlusion query pool resets
22797ec681f3Smrg- v3dv: use PIPE_TIMEOUT_INFINITE
22807ec681f3Smrg- v3dv: refactor checks for subpass attachment clears
22817ec681f3Smrg- v3dv: refactor checks for subpass attachment loading
22827ec681f3Smrg- v3dv: refactor checks for subpass attachment stores
22837ec681f3Smrg- v3dv: do not emit full tile buffers clears to handle Z/S clears
22847ec681f3Smrg- v3dv: enable early Z/S clears
22857ec681f3Smrg- v3dv: fix disabling Early Z for the whole frame
22867ec681f3Smrg- broadcom/compiler: fix indentation with TABs
22877ec681f3Smrg- broadcom/compiler: prepare TMU spilling code to account for TMU pipelining
22887ec681f3Smrg- broadcom/compiler: implement pipelining for general TMU operations
22897ec681f3Smrg- broadcom/compiler: support pipelining of tex instructions
22907ec681f3Smrg- broadcom/compiler: refactor image load/store TMU emission code
22917ec681f3Smrg- broadcom/compiler: support pipelining of image load/store instructions
22927ec681f3Smrg- broadcom/compiler: disable TMU pipelining if we fail to register allocate
22937ec681f3Smrg- broadcom/compiler: disallow spilling if TMU pipelining was enabled
22947ec681f3Smrg- broadcom/compiler: log spilling shaders to perf output
22957ec681f3Smrg- broadcom/compiler: let QPUs stall on TMU input/config overflows
22967ec681f3Smrg- v3dv: handle D/S buffer to image copies with the texel buffer path
22977ec681f3Smrg- v3dv: batch copies in the copy_buffer_to_image_blit path
22987ec681f3Smrg- v3dv: allow a component swizzle in copy_buffer_to_image_shader
22997ec681f3Smrg- v3d/common: use spaces instead of TABs
23007ec681f3Smrg- v3dv: serialize pipeline compilation when debugging shaders
23017ec681f3Smrg- v3dv: add a perf trace when a device is created with robust buffer access
23027ec681f3Smrg- v3d/compiler: fix QPU scheduler TMU sequence shuffling
23037ec681f3Smrg- broadcom/compiler: add V3D_QPU_WADDR_UNIFA
23047ec681f3Smrg- broadcom/compiler: pass a devinfo to check if an instruction writes to TMU
23057ec681f3Smrg- broadcom/compiler: name registers correctly based on V3D version
23067ec681f3Smrg- broadcom/compiler: don't check for GFXH-1633 on V3D 4.2.x
23077ec681f3Smrg- broadcom/compiler: add a helper to check if an instruction writes unifa
23087ec681f3Smrg- broadcom/compiler: disallow unifa overlap with thread switch/end
23097ec681f3Smrg- broadcom/compiler: preserve ordering of unifa/ldunifa sequences
23107ec681f3Smrg- broadcom/compiler: ensure 3-slot delay between unifa and ldunifa
23117ec681f3Smrg- broadcom/compiler: disallow reading two uniforms in the same instruction
23127ec681f3Smrg- broadcom/compiler: do not DCE ldunifa
23137ec681f3Smrg- broadcom/compiler: emit ldunifarf when needed
23147ec681f3Smrg- broadcom/compiler: use unifa for UBO loads from uniform addresses
23157ec681f3Smrg- broadcom/compiler: don't emit redundant ldunif
23167ec681f3Smrg- broadcom/compiler: use a helper function to decide on TMU spilling
23177ec681f3Smrg- broadcom/compiler: don't dump shader-db stats for failed shaders
23187ec681f3Smrg- broadcom/compiler: fix ldunif optimization
23197ec681f3Smrg- broadcom/compiler: allow dead code elimination of unused trailing ldunifa
23207ec681f3Smrg- broadcom/compiler: remove unused leading ldunifa
23217ec681f3Smrg- broadcom/compiler: add a constant alu optimization pass
23227ec681f3Smrg- broadcom/compiler: skip unnecessary unifa writes
23237ec681f3Smrg- broadcom/compiler: use nir_opt_sink
23247ec681f3Smrg- v3dv: fix branching to large secondaries with more than one BCL buffer.
23257ec681f3Smrg- broadcom/compiler: fix DAG pre-remove for merged instructions
23267ec681f3Smrg- broadcom/compiler: fix indentation style
23277ec681f3Smrg- broadcom/compiler: track pipelineable ldvary sequences
23287ec681f3Smrg- broadcom/compiler: pipeline smooth ldvary sequences
23297ec681f3Smrg- broadcom/compiler: allow pipelining of flat and noperspective varyings
23307ec681f3Smrg- broadcom/compiler: ldvary pipelining tracking and documentation clean-ups
23317ec681f3Smrg- broadcom/compiler: drop the destination for unused ldunifa
23327ec681f3Smrg- broadcom/compiler: be more aggressive skipping unifa writes
23337ec681f3Smrg- broadcom/compiler: always restart ldvary pipelining when scheduling ldvary
23347ec681f3Smrg- broadcom/compiler: ldvary doesn't implicitly write to r3 since V3D 4.1
23357ec681f3Smrg- broadcom/compiler: fix flags check for ldvary merge
23367ec681f3Smrg- broadcom/compiler: add an additional sanity check assert to the ldvary fixup
23377ec681f3Smrg- broadcom/compiler: move code block around
23387ec681f3Smrg- broadcom/compiler: simplify ldvary pipelining
23397ec681f3Smrg- broadcom/compiler: disallow ldunif during ldvary sequences if possible
23407ec681f3Smrg- v3dv: call util_cpu_detect() when initializing the instance
23417ec681f3Smrg- broadcom/compiler: flag wrtmuc with a read dependency on last_tmu_config
23427ec681f3Smrg- broadcom/compiler: be more flexible scheduling TMU writes
23437ec681f3Smrg- vulkan/util: call glsl_type_singleton_init_or_ref from vk_instance_init
23447ec681f3Smrg- compiler/glsl: call util_cpu_detect from glsl_type_singleton_init_or_ref
23457ec681f3Smrg- broadcom/compiler: fix end of tmu sequence detection
23467ec681f3Smrg- broadcom/compiler: use nir_opt_load_store_vectorize
23477ec681f3Smrg- broadcom/compiler: use nir_lower_wrmasks to simplify TMU general stores
23487ec681f3Smrg- broadcom/compiler: handle implicit uniform loads when optimizing constant alu
23497ec681f3Smrg- broadcom/compiler: optimize constant vfpack
23507ec681f3Smrg- broadcom/compiler: use nir_lower_undef_to_zero
23517ec681f3Smrg- v3dv/pipeline_cache: fix assert
23527ec681f3Smrg- broadcom/compiler: convert add to mul when possible to allow merge
23537ec681f3Smrg- broadcom/compiler: add a v3d_qpu_writes_accum helper
23547ec681f3Smrg- broadcom/compiler: try to fill up delay slots after a thrsw
23557ec681f3Smrg- broadcom/compiler: flag TMU read dependencies against last TMU config
23567ec681f3Smrg- broadcom/compiler: flag TMU reads with a read dependency on last TMU config
23577ec681f3Smrg- broadcom/compiler: dump instruction index when failing to pack instructions
23587ec681f3Smrg- broadcom/compiler: add a NOP count stat to shader-db
23597ec681f3Smrg- broadcom/compiler: try to fill up delay slots after a branch instruction
23607ec681f3Smrg- broadcom/compiler: try to fill up delay slots after unconditional branch
23617ec681f3Smrg- broadcom/compiler: implement restriction for branch after setmsf
23627ec681f3Smrg- broadcom/compiler: optimize branch emission for uniform break/continue
23637ec681f3Smrg- v3dv: fix index buffer binding
23647ec681f3Smrg- broadcom/compiler: add a definition for the unifa skip distance
23657ec681f3Smrg- broadcom/compiler: allow compilation strategies to limit minimum thread count
23667ec681f3Smrg- broadcom/compiler: sort constant UBO loads by index and offset
23677ec681f3Smrg- broadcom/compiler: rename unifa tracking fields
23687ec681f3Smrg- v3dv: fix descriptor set limits
23697ec681f3Smrg
23707ec681f3SmrgIan Romanick (33):
23717ec681f3Smrg
23727ec681f3Smrg- i965: Don't advertise OpenGL 3.3+ if driconf disables GL_ARB_blend_func_extended
23737ec681f3Smrg- i965: Use allow_higher_compat_version option during screen initialization
23747ec681f3Smrg- i965: Don't parse driconf again
23757ec681f3Smrg- nir/algebraic: Fix a >> #b << #b for sizes other than 32-bit
23767ec681f3Smrg- nir/algebraic: add patterns for a >> #b << #b and a << #b >> #b
23777ec681f3Smrg- nir/algebraic: Partially revert 3f782cdd2591
23787ec681f3Smrg- intel/eu/validate: Add some checks for CMP and CMPN
23797ec681f3Smrg- intel/compiler: Enable the ability to emit CMPN instructions
23807ec681f3Smrg- intel/compiler: Make the CMPN builder work like the CMP builder
23817ec681f3Smrg- intel/compiler: Use CMPN for min / max on Gen4 and Gen5
23827ec681f3Smrg- nir/algebraic: Fix some min/max of b2f replacements
23837ec681f3Smrg- nir/algebraic: Remove some redundant b2f logic-op reduction patterns
23847ec681f3Smrg- nir/algebraic: Add some max/min optimizations with 3 variables
23857ec681f3Smrg- nir/range-analysis: C++ linkage
23867ec681f3Smrg- nir/range_analysis: Handle vectors better in ssa_def_bits_used
23877ec681f3Smrg- intel/compiler: Silence unused parameter warnings in files that include brw_eu.h
23887ec681f3Smrg- intel: Silence unused parameter warnings in files that include gen_device_info.h
23897ec681f3Smrg- intel: Silence unused parameter warnings in files that include genX_pack.h
23907ec681f3Smrg- intel/compiler: Relax some conditions in try_copy_propagate
23917ec681f3Smrg- gallium/dri: Remove dri2_format_mapping::cpp
23927ec681f3Smrg- nir/search: Constify instruction parameter to search helpers
23937ec681f3Smrg- nir/algebraic: Apply addition property of equality more conservatively
23947ec681f3Smrg- nir/algebraic: Apply addition property of equality to the other ordering too
23957ec681f3Smrg- nir/range_analysis: Refactor fsat handling
23967ec681f3Smrg- nir/range_analysis: Add "is finite" range analysis tracking
23977ec681f3Smrg- nir/range_analysis: Add "is a number" range analysis tracking
23987ec681f3Smrg- nir/range_analysis: Fix analysis of fmin, fmax, or fsat with NaN source
23997ec681f3Smrg- nir/search: Use range analysis for is_finite
24007ec681f3Smrg- nir/range_analysis: Simplify analysis of bcsel
24017ec681f3Smrg- mesa: Add anything dynamically indexed before any non-dynamically indexed
24027ec681f3Smrg- mesa: Clean up _mesa_layout_parameters after previous commit
24037ec681f3Smrg- tgsi_exec: Fix NaN behavior of saturate
24047ec681f3Smrg- tgsi_exec: Fix NaN behavior of min and max
24057ec681f3Smrg
24067ec681f3SmrgIcecream95 (60):
24077ec681f3Smrg
24087ec681f3Smrg- pan/bi: Lower 64-bit integers
24097ec681f3Smrg- pan/bi: Handle 64-bit pack and unpack operations
24107ec681f3Smrg- pan/bi: Add some compute intrinsic loads
24117ec681f3Smrg- pan/bi: Set compute lowering options
24127ec681f3Smrg- pan/bi: Improve interoperability of the command-line disassembler
24137ec681f3Smrg- pan/bi: Implement load/store intrinsics
24147ec681f3Smrg- pan/bi: Implement load_kernel_input
24157ec681f3Smrg- panfrost: Set bifrost_props for compute shaders
24167ec681f3Smrg- pan/bi: Improve unknown intrinsic error
24177ec681f3Smrg- panfrost: Use the correct NIR options for OpenCL on Bifrost
24187ec681f3Smrg- pan/bi: Use pan_nir_lower_64bit_intrin
24197ec681f3Smrg- panfrost: Add a sysval for local_group_size
24207ec681f3Smrg- panfrost: Add a sysval for local_work_dim
24217ec681f3Smrg- panfrost: Assert on sysval overflow
24227ec681f3Smrg- pan/mdg: Limit int64 vectorization
24237ec681f3Smrg- pan/mdg: Don't reorder loads/stores past each other
24247ec681f3Smrg- pan/mdg: Allow 64-bit src_bitsize for comparison operations
24257ec681f3Smrg- pan/bi: Add w0 to the 'h01' swizzle bucket
24267ec681f3Smrg- pan/bi: Lower umul_high
24277ec681f3Smrg- panfrost: Set TLS for compute jobs
24287ec681f3Smrg- pan/bi: Implement saturated add/sub operations
24297ec681f3Smrg- pan/bi: Implement ihadd/irhadd operations
24307ec681f3Smrg- pan/bi: Implement packing ops between 32-bit vec1 and 16-bit vec2
24317ec681f3Smrg- pan/mdg: Fix spilling when scratch memory is used
24327ec681f3Smrg- pan/bi: Iterate from zero when setting RA interference
24337ec681f3Smrg- panfrost: Add a function to determine if a resource is 2D
24347ec681f3Smrg- panfrost: Only checksum resources when it makes sense to
24357ec681f3Smrg- panfrost: Add a debug flag to disable checksumming
24367ec681f3Smrg- panfrost: Transaction elimination support
24377ec681f3Smrg- panfrost: Fix the tile size assertion
24387ec681f3Smrg- pan/decode: Free mapped memory objects on BO unreference
24397ec681f3Smrg- panfrost: Add support for INTEL_blackhole_render
24407ec681f3Smrg- panfrost: Use normal malloc/free instead of ralloc for surfaces
24417ec681f3Smrg- panfrost: Add the tiler heap to fragment jobs
24427ec681f3Smrg- pan/bi: Return the size of the last clause from bi_pack
24437ec681f3Smrg- pan/bi: Fix shader prefetch size
24447ec681f3Smrg- panfrost: Fix clear color packing for 12-byte formats
24457ec681f3Smrg- pan/bi: Don't check liveness unless the index is valid
24467ec681f3Smrg- pan/bi: Use the correct size for UBO loads
24477ec681f3Smrg- pan/bi: Remove check for first_ubo_is_default_ubo
24487ec681f3Smrg- pan/bi: Implement image load/store
24497ec681f3Smrg- pan/bi: nir_intrinsic_image_size support
24507ec681f3Smrg- st/mesa: Update constants on alpha test change if it's lowered
24517ec681f3Smrg- panfrost: Disable early-z when alpha test is used
24527ec681f3Smrg- pan/mdg: Rename load/store operations
24537ec681f3Smrg- pan/mdg: Use appropriate sizes for global loads/stores
24547ec681f3Smrg- pipe-loader,gallium/drm: Fix the kmsro pipe_loader target
24557ec681f3Smrg- pipe-loader: Stop trying to use kmsro for vgem
24567ec681f3Smrg- panfrost: Implement panfrost_set_global_binding
24577ec681f3Smrg- panfrost: Flush output after disassembling shaders
24587ec681f3Smrg- panfrost: Only do point coord replacement for PIPE_PRIM_POINTS
24597ec681f3Smrg- panfrost: Only add resource checksum BOs to the batch once
24607ec681f3Smrg- panfrost: Align BO size to 4096 bytes
24617ec681f3Smrg- panfrost: Add fast path for graphics work group computation
24627ec681f3Smrg- panfrost: Unset shared/scanout binding flags for staging resources
24637ec681f3Smrg- pan/bi: Skip nir_opt_move/sink for blend shaders
24647ec681f3Smrg- panfrost: Fix shader texture count
24657ec681f3Smrg- pan/decode: Allow frame shader DCDs to be in another BO than the FBD
24667ec681f3Smrg- pan/mdg: Fix calculation of available work registers
24677ec681f3Smrg- panfrost: Fix viewport scissor for preload draws
24687ec681f3Smrg
24697ec681f3SmrgIlia Mirkin (55):
24707ec681f3Smrg
24717ec681f3Smrg- nv50/ir: ignore FS_BLEND_EQUATION_ADVANCED
24727ec681f3Smrg- nv50,nvc0: explicitly list recently-added caps
24737ec681f3Smrg- st/mesa: fix broken moves for u2i64 and related ops
24747ec681f3Smrg- nv50/ir: clear dnz flag when converting mul/mad to simpler ops
24757ec681f3Smrg- glsl: only expose int64 atomics when extension is enabled
24767ec681f3Smrg- cso: set index_bounds_valid = true for arrays draws
24777ec681f3Smrg- nvc0: index_bias is now only set for indexed draws
24787ec681f3Smrg- nvc0/ir: add fixup to deal with interpolateAtSample with non-MSAA
24797ec681f3Smrg- nv50,nvc0: clear internal vbo masks based on the trailing slots
24807ec681f3Smrg- ci: remove nouveau from shader-db runs
24817ec681f3Smrg- nouveau: reinstate fencing on screen destroy
24827ec681f3Smrg- nv50: add PIPE_CAP_NIR_IMAGES_AS_DEREF to unsupported list
24837ec681f3Smrg- nv50,nvc0: add scissored clear support
24847ec681f3Smrg- st/mesa: do scissored clears on depth/stencil as well when supported
24857ec681f3Smrg- i965: support GL_EXT_color_buffer_half_float
24867ec681f3Smrg- mesa: fix conditions for fp16 render format eligibility
24877ec681f3Smrg- mesa: fix fbo attachment size check for RBs, make it trigger in ES2
24887ec681f3Smrg- mesa: add tracking of reduction mode
24897ec681f3Smrg- st/mesa: add EXT_texture_filter_minmax support
24907ec681f3Smrg- nvc0: enable minmax reductions on gm200+
24917ec681f3Smrg- docs: add notes about nvc0 support of ARB/EXT_texture_filter_minmax
24927ec681f3Smrg- mesa: only report INCOMPLETE_FORMATS for GLES1 / desktop
24937ec681f3Smrg- gallium,st: add missing viewport swizzles
24947ec681f3Smrg- nv50: initialize target for blit source surfaces
24957ec681f3Smrg- nv50,nvc0: remove explicit target argument from view creation
24967ec681f3Smrg- nv50: add appropriate space check before adding new pushbuffer
24977ec681f3Smrg- nvc0: ensure sufficient push space for indirect data
24987ec681f3Smrg- nvc0: fix reported driver queries for Pascal and later GPUs
24997ec681f3Smrg- mesa: fix restoring of texture attributes for msaa binding points
25007ec681f3Smrg- nv50: adapt texture and constbuf paths for compute shaders
25017ec681f3Smrg- nv50: add resource tracking for shader images and buffers
25027ec681f3Smrg- nv50: implement memory barrier handling
25037ec681f3Smrg- nv50: add texture, constbuf, image, buffer validation
25047ec681f3Smrg- nv50: pass in third axis via user param
25057ec681f3Smrg- nv50/ir: retrieve (n)ctaid.z from first user param
25067ec681f3Smrg- nv50/ir: force shared memory indirect to be an address
25077ec681f3Smrg- nv50/ir: do not use inline offsets for global, ensure indirect access
25087ec681f3Smrg- nv50/ir: fix emission of RED
25097ec681f3Smrg- nv50/ir: lower buffer to global
25107ec681f3Smrg- nv50/ir: fix emitting movs from imm to short registers
25117ec681f3Smrg- nv50/ir: fix emission of cvt with half-reg destinations
25127ec681f3Smrg- nv50/ir: fix emission of logic ops on half-regs
25137ec681f3Smrg- nv50/ir: fix emission of shifts on half-regs
25147ec681f3Smrg- nv50/ir: logic ops on half-regs can't take an immediate
25157ec681f3Smrg- nv50/ir: add support for 16-bit immediates
25167ec681f3Smrg- nv50/ir: fix emission of 16-bit add
25177ec681f3Smrg- nv50/ir: fix emission of cas without a destination
25187ec681f3Smrg- nv50: fix expression for ucp offset
25197ec681f3Smrg- nv50/ir: avoid inlining results of a locked load
25207ec681f3Smrg- nv50/ir: fix emission of ld/st lock/unlock
25217ec681f3Smrg- st/mesa: adapt for the case where buffers are not supported in frag
25227ec681f3Smrg- nv50/ir: fix texture size for msaa textures
25237ec681f3Smrg- nv50: emulate indirect draws
25247ec681f3Smrg- nv50/ir: fake SV_THREAD_KILL support
25257ec681f3Smrg- nv50: enable ARB_framebuffer_no_attachments
25267ec681f3Smrg
25277ec681f3SmrgItalo Nicola (15):
25287ec681f3Smrg
25297ec681f3Smrg- panfrost: fix attribute continuation decoding
25307ec681f3Smrg- panfrost: add 3d attribute buffer continuation to XML
25317ec681f3Smrg- panfrost: decode 3d attribute continuation
25327ec681f3Smrg- panfrost: add resource modifier conversion
25337ec681f3Smrg- panfrost: implement gallium->set_shader_images
25347ec681f3Smrg- panfrost: emit shader image attribute descriptors
25357ec681f3Smrg- panfrost: implement image_size sysval
25367ec681f3Smrg- pan/mdg: create nir pass to lower image coord bitsize
25377ec681f3Smrg- pan/mdg: enable image bitsize lowering pass
25387ec681f3Smrg- pan/mdg: add ld_image opcodes
25397ec681f3Smrg- pan/mdg: rename st_image opcodes and add float16 versions
25407ec681f3Smrg- pan/mdg: implement shader image instructions
25417ec681f3Smrg- pan/mdg: implement nir_intrinsic_image_size
25427ec681f3Smrg- panfrost: advertise images for midgard
25437ec681f3Smrg- pan/mdg: prevent csel_v from being scheduled alongside writeout
25447ec681f3Smrg
25457ec681f3SmrgIván Briano (4):
25467ec681f3Smrg
25477ec681f3Smrg- anv: don't advertise mipmaps for linear 3D surfaces on BDW
25487ec681f3Smrg- anv: move buffer size alignment into helper function
25497ec681f3Smrg- anv: use helper function to get the buffer size
25507ec681f3Smrg- intel, anv: propagate robustness setting to nir_opt_load_store_vectorize
25517ec681f3Smrg
25527ec681f3SmrgJames Jones (4):
25537ec681f3Smrg
25547ec681f3Smrg- nouveau: Stash supported sector layout in screen
25557ec681f3Smrg- nouveau: Use DRM_FORMAT_MOD_NVIDIA_BLOCK_LINEAR_2D
25567ec681f3Smrg- nouveau: no modifier != the invalid modifier
25577ec681f3Smrg- nouveau: Use format modifiers in buffer allocation
25587ec681f3Smrg
25597ec681f3SmrgJames Park (22):
25607ec681f3Smrg
25617ec681f3Smrg- radv: Use linker pragma to simulate weak functions
25627ec681f3Smrg- radv: Remove unnecessary headers
25637ec681f3Smrg- radv: Stub sections that don't have _WIN32 support
25647ec681f3Smrg- radv: Modify radv_descriptor_set for MSVC legality
25657ec681f3Smrg- radv: Pointer arithmetic on char/uint8_t, not void
25667ec681f3Smrg- radv: Disable calibrated timestamps on Windows
25677ec681f3Smrg- radv: Use typed outarray API
25687ec681f3Smrg- radv: Fix struct initialization for MSVC
25697ec681f3Smrg- gallium/tessellator: Fix warning suppression
25707ec681f3Smrg- microsoft: Fix comma in variadic macro for MSVC
25717ec681f3Smrg- ac: Remove unnecessary header
25727ec681f3Smrg- radv: Use typed outarray API
25737ec681f3Smrg- amd: Disable surface modifier test on Windows
25747ec681f3Smrg- radv: Pointer arithmetic without void*
25757ec681f3Smrg- radv: Update JSON generator if Windows
25767ec681f3Smrg- vulkan: Use typed outarray API
25777ec681f3Smrg- vulkan: Fix source list for vulkan_wsi on Windows
25787ec681f3Smrg- vulkan: Update dispatch table gen for Windows
25797ec681f3Smrg- vulkan/util: Use util_bitcount
25807ec681f3Smrg- ac/rgp: BSD elf library compatibility
25817ec681f3Smrg- amd: Hide amdgpu_drm.h on Windows
25827ec681f3Smrg- amd: Hide drm_fourcc.h on Windows
25837ec681f3Smrg
25847ec681f3SmrgJames Zhu (1):
25857ec681f3Smrg
25867ec681f3Smrg- amd: add Aldebaran chip enum
25877ec681f3Smrg
25887ec681f3SmrgJan Beich (2):
25897ec681f3Smrg
25907ec681f3Smrg- ac: prefer system EM_AMDGPU definition
25917ec681f3Smrg- ac/rgp: define EM_AMDGPU if missing for compatibility
25927ec681f3Smrg
25937ec681f3SmrgJason Ekstrand (195):
25947ec681f3Smrg
25957ec681f3Smrg- intel/tools: Decode COMPUTE_WALKER
25967ec681f3Smrg- intel/fs: Allow compute dispatch without a pushed subgroup ID on Gen12-HP
25977ec681f3Smrg- anv: Add a general state pool
25987ec681f3Smrg- intel/fs: Emit code for Gen12-HP indirect compute data
25997ec681f3Smrg- anv: Enable push constants on gen12-hp
26007ec681f3Smrg- intel/genxml,anv,iris: Drop the legacy compute path from gen125.xml
26017ec681f3Smrg- anv: Add a trivial implementation of VK_KHR_deferred_host_operation
26027ec681f3Smrg- anv: Exit early from cmd_buffer_apply_pipe_flushes
26037ec681f3Smrg- anv: Take the set of stages to flush in flush_descriptor_sets
26047ec681f3Smrg- anv: Only flush descriptors used by the pipeline
26057ec681f3Smrg- anv: Early-exit from cmd_buffer_flush_state
26067ec681f3Smrg- nir/lower_int64: Add a level of wrapper functions
26077ec681f3Smrg- nir/lower_int64: Fix lowering of f2[ui]64 for 16-bit float
26087ec681f3Smrg- nir/lower_int64: Add lowering for some 64-bit subgroup ops
26097ec681f3Smrg- nir/lower_int64: Add lowering for 64-bit iadd shuffle/reduce
26107ec681f3Smrg- nir/lower_int64: Lower 64-bit vote_ieq
26117ec681f3Smrg- intel/compiler: Return 1 for immediates in regs_read
26127ec681f3Smrg- intel/compiler: Move brw_reg_type_for_bit_size to brw_reg_type.h
26137ec681f3Smrg- intel/reg,fs: Handle immediates properly in subscript()
26147ec681f3Smrg- intel/fs: QUAD_SWIZZLE requires packed data
26157ec681f3Smrg- intel/fs: Support 64-bit SEL_EXEC on Gen11+
26167ec681f3Smrg- intel/fs: Support 64-bit SHUFFLE on Gen11+
26177ec681f3Smrg- intel/fs: Support 64-bit CLUSTER_BROADCAST on Gen11+
26187ec681f3Smrg- intel/fs: Properly lower 64-bit MUL on 64-bit-incapable platforms
26197ec681f3Smrg- intel/fs: Refactor our shuffle emit code
26207ec681f3Smrg- intel/fs: Implement umin/umax shuffle
26217ec681f3Smrg- anv: Advertise shaderInt64 on Gen11+
26227ec681f3Smrg- anv: Break SAMPLE_PATTERN and MULTISAMPLE emit into helpers
26237ec681f3Smrg- intel/fs: Add an ex_desc field to fs_inst for SHADER_OPCODE_SEND
26247ec681f3Smrg- anv: Drop anv_dump
26257ec681f3Smrg- anv: Fix an old parameter name in GetDeviceQueue
26267ec681f3Smrg- anv: Refactor anv_queue_finish()
26277ec681f3Smrg- anv: Add an anv_queue_family struct
26287ec681f3Smrg- nir/from_ssa: Respect and populate divergence information
26297ec681f3Smrg- vulkan/meson: Add missing dependencise on vk_extensions_gen.py
26307ec681f3Smrg- anv: Clean up the vk_device on the CreateDevice error path
26317ec681f3Smrg- radv: Properly clean up vk_device
26327ec681f3Smrg- turnip: Properly clean up vk_device
26337ec681f3Smrg- v3dv: Properly clean up vk_device
26347ec681f3Smrg- lavapipe: Properly clean up vk_device
26357ec681f3Smrg- vulkan: Move vk_device to its own file
26367ec681f3Smrg- vulkan: Add a return code to vk_device_init
26377ec681f3Smrg- vulkan: Add common extension tables
26387ec681f3Smrg- anv: Use the common extension table struct
26397ec681f3Smrg- vulkan: Add common dispatch table generation
26407ec681f3Smrg- vulkan: Add dispatch table loading helpers
26417ec681f3Smrg- vulkan-overlay-layer: Use the new dispatch tables
26427ec681f3Smrg- vulkan: Add dispatch table lookup helpers
26437ec681f3Smrg- vulkan: Add common instance and physical device structs
26447ec681f3Smrg- vulkan: Add generators for instance trampoline functions
26457ec681f3Smrg- vulkan: Add entrypoint tables and related helpers
26467ec681f3Smrg- vulkan: Add common Get*ProcAddr implementations
26477ec681f3Smrg- vulkan: Add a common entrypoint table generator
26487ec681f3Smrg- anv: Add a single anv_genX switch macro
26497ec681f3Smrg- anv: Use the common dispatch framework
26507ec681f3Smrg- vulkan: Add framework for common entrypoints
26517ec681f3Smrg- vulkan,anv: Move GetDeviceProcAddr to common code
26527ec681f3Smrg- vulkan,anv: Add common entrypoints for VK_EXT_private_data
26537ec681f3Smrg- anv: Make Get*MemoryRequirements a wrapper
26547ec681f3Smrg- vulkan,anv: Move a bunch of trivial wrappers to common code
26557ec681f3Smrg- vulkan,anv: Move VK_KHR_copy_commands2 wrappers to common code
26567ec681f3Smrg- vulkan: Add a truly common VK_EXT_debug_report implementation
26577ec681f3Smrg- anv: Switch to the common VK_EXT_debug_report
26587ec681f3Smrg- turnip: Use the common dispatch framework
26597ec681f3Smrg- turnip: Use common entrypoints for VK_EXT_private_data
26607ec681f3Smrg- turnip: Drop some legacy wrappers in favor of common code
26617ec681f3Smrg- turnip: Switch to the common VK_EXT_debug_report
26627ec681f3Smrg- lavapipe: Drop some wrappers in favor of common code
26637ec681f3Smrg- v3dv: Drop v3dv_instance::app_info
26647ec681f3Smrg- v3dv: Use common entrypoints for VK_EXT_private_data
26657ec681f3Smrg- v3dv: Switch to the common VK_EXT_debug_report
26667ec681f3Smrg- radv: Use common entrypoints for VK_EXT_private_data
26677ec681f3Smrg- radv: Switch to the common VK_EXT_debug_report
26687ec681f3Smrg- vulkan: Make vk_debug_report_callback derive from vk_object_base
26697ec681f3Smrg- anv: Use vk_object_base::type for debug_report
26707ec681f3Smrg- vulkan: Use vk_object_base::type for debug_report
26717ec681f3Smrg- vulkan: Make the debug_report implementation internal
26727ec681f3Smrg- anv,radv: Use common entrypoints for VK_KHR_deferred_operation
26737ec681f3Smrg- vulkan: Rework vk_device_init and friends
26747ec681f3Smrg- vulkan: Drop the type_prefix parameter from gen_extensions
26757ec681f3Smrg- nir: Add some ssa-only fast-paths for nir_src rewrite
26767ec681f3Smrg- nir: Drop the lower_mem_constant_vars declaration
26777ec681f3Smrg- vulkan: Add a common helper for enumerating instance extension properties
26787ec681f3Smrg- vulkan: Rework extension disabling on Android
26797ec681f3Smrg- anv: Pull the patch version from the XML
26807ec681f3Smrg- anv: Make anv_icd.py more generic and independent
26817ec681f3Smrg- anv,vulkan: Move anv_icd.py to a common location
26827ec681f3Smrg- anv: Move extension tables to anv_device.c
26837ec681f3Smrg- anv: Add fake graphics-only and compute-only queue families
26847ec681f3Smrg- nir: Add a couple helpers for phis and cursors
26857ec681f3Smrg- nir/lower_bit_size: Support phi instructions
26867ec681f3Smrg- intel/nir: Lower 8-bit phis on Gen11+
26877ec681f3Smrg- nir: Add some range analysis for used bits
26887ec681f3Smrg- nir/algebraic: Clean up up-cast of down-cast when we can
26897ec681f3Smrg- nir/algebraic: Covert up-cast of down-cast to extract on Intel
26907ec681f3Smrg- spirv: Store the nir_function in vtn_function
26917ec681f3Smrg- spirv: Delete the impl for prototype-only functions
26927ec681f3Smrg- nir: Don't optimize bcsel-of-shuffle across blocks
26937ec681f3Smrg- nir: Fix parameter order in the bcsel-of-shuffle optimization
26947ec681f3Smrg- nir/opt_large_constants: Handle generic pointers
26957ec681f3Smrg- intel/fs: Shuffle can't handle source modifiers
26967ec681f3Smrg- anv/formats: Advertise linear sampling on depth formats
26977ec681f3Smrg- anv/android: Re-implement AcquireImageANDROID
26987ec681f3Smrg- intel/mi_builder: Create a context in the tests
26997ec681f3Smrg- intel/mi_builder: Delete a bogus comment
27007ec681f3Smrg- intel/mi_builder: Fix a misleading comment
27017ec681f3Smrg- intel/mi_builder: Short-circuit shifts in more cases
27027ec681f3Smrg- intel/mi_builder: Add constant folding
27037ec681f3Smrg- intel/mi_builder: Rewrite unit tests in terms of constant folding
27047ec681f3Smrg- intel/mi_builder: Add tests for gen_mi_z and gen_mi_nz
27057ec681f3Smrg- intel: Rename gen_mi_builder.h to mi_builder.h
27067ec681f3Smrg- intel/mi_builder: Drop the gen\_ prefix
27077ec681f3Smrg- intel/mi_builder: Use AddCSMMIOStartOffset for LRI
27087ec681f3Smrg- intel/mi_builder: Add ieq/ine helpers
27097ec681f3Smrg- intel/mi_builder: Support inverted values in mi_store
27107ec681f3Smrg- intel/mi_builder: Add load/store_offest on GFX 12.5+
27117ec681f3Smrg- genxml: Clean up MI_SET_PREDICATE
27127ec681f3Smrg- intel/batch_decoder: Don't follow predicated MI_BATCH_BUFFER_START
27137ec681f3Smrg- intel/mi_builder: Use softpin for tests on gen8+
27147ec681f3Smrg- intel/mi_builder: Return an address from __gen_get_batch_address
27157ec681f3Smrg- intel/mi_builder: Add control-flow support
27167ec681f3Smrg- nir: Add and use a new nir_ssa_def_rewrite_uses_src helper
27177ec681f3Smrg- nir: Make nir_ssa_def_rewrite_uses take an SSA value
27187ec681f3Smrg- nir: Make nir_ssa_def_rewrite_uses_after take an SSA value
27197ec681f3Smrg- intel/mi_builder: Fix some indentation
27207ec681f3Smrg- intel/mi_builder: Fix a couple of #ifs
27217ec681f3Smrg- anv: Drop anv_extensions.py
27227ec681f3Smrg- turnip: Move the CreateRenderPass wrapper to common code
27237ec681f3Smrg- anv: Move multialloc to common code
27247ec681f3Smrg- vulkan: Use VK_MULTIALLOC in CreateRenderPass
27257ec681f3Smrg- anv: Move vk_format helpers to common code
27267ec681f3Smrg- vulkan: Use correct aspectMask in CreateRenderPass
27277ec681f3Smrg- vulkan: Add some asserts and checks for multiview in CreateRenderPass
27287ec681f3Smrg- vulkan: Preserve preserve attachments in CreateRenderPass
27297ec681f3Smrg- anv: Drop CreateRenderPass
27307ec681f3Smrg- radv/meta: Use CreateRenderPass2
27317ec681f3Smrg- radv: Drop CreateRenderPass
27327ec681f3Smrg- intel/fs: Use INTEL_MASK for pushish constant address masking
27337ec681f3Smrg- intel/fs: Handle payload node interference in destinations
27347ec681f3Smrg- vulkan: Use ALWAYS_INLINE for multialloc
27357ec681f3Smrg- vk/alloc: Handle zero sizes better in vk_multialloc_add
27367ec681f3Smrg- vulkan/alloc: Add VK_MULTIALLOC_DECL macros
27377ec681f3Smrg- vulkan/util: Add a type parameter to vk_multialloc_add
27387ec681f3Smrg- vulkan/alloc: Use char * for pointer arithmetic
27397ec681f3Smrg- anv,genxml: Handle L3SQCREG1_SQGHPCI in GenXML
27407ec681f3Smrg- anv: Add an anv_batch_write_reg macro
27417ec681f3Smrg- iris: Add an iris_write_reg macro
27427ec681f3Smrg- genxml: Make 1-bit L3$ config register fields bool on Gen7
27437ec681f3Smrg- intel/fs,rt: Add a predicate to load_global_const_block
27447ec681f3Smrg- anv: Use load_global_constant for shader constants
27457ec681f3Smrg- anv: Use nir_shader_instructions_pass in apply_pipeline_layout
27467ec681f3Smrg- anv/apply_pipeline_layout: Refactor descriptor chasing code
27477ec681f3Smrg- anv/apply_pipeline_layout: Rework the early pass index/offset helpers
27487ec681f3Smrg- anv/apply_pipeline_layout: Lower UBO loads in the early pass
27497ec681f3Smrg- anv/apply_pipeline_layout: Run DCE between the early and late passes
27507ec681f3Smrg- anv/apply_pipeline_layout: Move bounds checking later for index/offset
27517ec681f3Smrg- anv/apply_pipeline_layout: Plumb through a UBO address format
27527ec681f3Smrg- anv/apply_pipeline_layout: Add some switch statements
27537ec681f3Smrg- nir: Add a new 64+32-bit address format
27547ec681f3Smrg- anv: Use 64bit_global_32bit_offset for SSBOs
27557ec681f3Smrg- anv: Rework the 64bit_bounded_global resource index format
27567ec681f3Smrg- anv: Zero out the last dword of UBO/SSBO descriptors in the shader
27577ec681f3Smrg- anv/apply_pipeline_layout: Apply dynamic offsets in load_ssbo_descriptor
27587ec681f3Smrg- anv/apply_pipeline_layout: Refactor all our descriptor address builders
27597ec681f3Smrg- anv/apply_pipeline_layout: Rework the desc_addr_format helper
27607ec681f3Smrg- anv/apply_pipeline_layout: Use the new helpers for early lowering
27617ec681f3Smrg- anv/apply_pipeline_layout: Use the new helpers for images
27627ec681f3Smrg- nir/lower_io: Support global addresses for UBOs in nir_lower_explicit_io
27637ec681f3Smrg- anv: Add a pass for lowering A64 UBO access
27647ec681f3Smrg- anv: Do UBO loads with global addresses for bindless
27657ec681f3Smrg- anv/apply_pipeline_layout: Add support for A64 descriptor access
27667ec681f3Smrg- nir: Add image atomic_fmin/fmax intrinsics
27677ec681f3Smrg- spirv: Add support for SPV_EXT_shader_atomic_float_min_max
27687ec681f3Smrg- intel/fs: Add support for 16-bit A64 float and integer atomics
27697ec681f3Smrg- intel/genxml: Binding table pointers are 15 bits on GFX version 12.5+
27707ec681f3Smrg- intel/tools: Handle milti-LRI in the batch decoder
27717ec681f3Smrg- intel/tools: Handle GT_MODE in the batch decoder
27727ec681f3Smrg- intel/genxml: Make BindingTablePoolEnable a bool
27737ec681f3Smrg- intel/tools: Handle BINDING_TABLE_POOL_ALLOC in batch decoding
27747ec681f3Smrg- anv: Align inline uniform data to ANV_UBO_ALIGNMENT
27757ec681f3Smrg- anv: Implement VK_EXT_conservative_rasterization
27767ec681f3Smrg- anv: Fix coverage masks for VK_EXT_conservative_rasterization
27777ec681f3Smrg- intel: Drop gen_device_info::has_resource_streamer
27787ec681f3Smrg- anv: Clean up anv_device_memory::base on failure
27797ec681f3Smrg- anv: Refactor framebuffer creation
27807ec681f3Smrg- anv: Clean up anv_descriptor_pool::base on the error path
27817ec681f3Smrg- anv: Clean up anv_semaphore::base on the error path
27827ec681f3Smrg- vulkan: Add a vk_object_multialloc helper
27837ec681f3Smrg- anv: Use vk_object_alloc/free
27847ec681f3Smrg- anv: Make memory type and queue family pointers const
27857ec681f3Smrg- intel: fix querying mip levels on null surfaces on SKL and prior
27867ec681f3Smrg- intel/compiler: Don't insert barriers for NULL sources
27877ec681f3Smrg- anv: Use the same re-order mode for streamout as for GS
27887ec681f3Smrg- intel/isl: Fix isl_color_value_unpack to match the prototype
27897ec681f3Smrg- intel/nir: Set lower txs with non-zero LOD
27907ec681f3Smrg
27917ec681f3SmrgJeremy Huddleston (5):
27927ec681f3Smrg
27937ec681f3Smrg- darwin: Use the system libexpat
27947ec681f3Smrg- util: Fix pointer to integer conversion error when using libunwind
27957ec681f3Smrg- darwin: Use the system libunwind
27967ec681f3Smrg- Fall back on clock_gettime when timespec_get() is unavailable
27977ec681f3Smrg- Adjust dylib compatibility versions to match what was set by mesa-18.3's autotools-based builds
27987ec681f3Smrg
27997ec681f3SmrgJesse Natalie (76):
28007ec681f3Smrg
28017ec681f3Smrg- nir: Work around MSVC x86 internal compiler error
28027ec681f3Smrg- main: Undefine MemoryBarrier for Windows
28037ec681f3Smrg- glapi: Undefine MemoryBarrier
28047ec681f3Smrg- mapi: Undefine MemoryBarrier
28057ec681f3Smrg- drisw: Disable automatic use of layered drivers with LIBGL_ALWAYS_SOFTWARE
28067ec681f3Smrg- wgl: Refactor screen creation to a function
28077ec681f3Smrg- wgl: Add a loop for screen creation with an ordered list of fallbacks
28087ec681f3Smrg- d3d12: Fail screen creation if a shader validator is needed and can't be created
28097ec681f3Smrg- wgl: Disable automatic use of layered drivers with LIBGL_ALWAYS_SOFTWARE
28107ec681f3Smrg- CI: Use a sha for the Windows SPIRV-LLVM-Translator dependency
28117ec681f3Smrg- microsoft/clc: Add -fgnu89-inline to clang args
28127ec681f3Smrg- microsoft/clc: Add test with inline function
28137ec681f3Smrg- clover: Add -fgnu89-inline to Clang command line
28147ec681f3Smrg- microsoft/clc: Only apply float scaling to 32bit fdiv
28157ec681f3Smrg- microsoft/clc: Let lower_vars_to_explicit_types fill kernel input driver_location
28167ec681f3Smrg- microsoft/clc: Fix wrap modes for inline samplers for integer textures
28177ec681f3Smrg- microsoft/clc: Move inline samplers to the end of the variable list
28187ec681f3Smrg- microsoft/clc: Use driver_location for metadata instead of re-computing offsets
28197ec681f3Smrg- microsoft/clc: Re-order dead variable removal after uniform vars_to_explicit_types
28207ec681f3Smrg- microsoft/clc: Add a test with an unused kernel arg
28217ec681f3Smrg- glapi: Support "ELF" TLS on Windows
28227ec681f3Smrg- docs: Document USE_ELF_TLS can work on Windows too
28237ec681f3Smrg- meson/gallium: Add an option to not use LLVM for gallium draw module
28247ec681f3Smrg- d3d12: Handle null constant buffers
28257ec681f3Smrg- nir: Add a nir_after_instr_and_phis helper
28267ec681f3Smrg- microsoft/compiler: Don't separate phis while inserting upcasts
28277ec681f3Smrg- d3d12: Move descriptor pools to screen, and add lock
28287ec681f3Smrg- d3d12: Handle is_new_style_shadow comparison filtering
28297ec681f3Smrg- d3d12: Really handle null constant buffers
28307ec681f3Smrg- u_format: Add restrict to fn pointer and manual format pack/unpack/fetch
28317ec681f3Smrg- panfrost: Add a Meson dependency on bi_opcodes.h for bifrost_compiler
28327ec681f3Smrg- meson, util: Make zlib optional again
28337ec681f3Smrg- nir: Temporarily disable optimizations for MSVC ARM64
28347ec681f3Smrg- wgl: Fix wglCreatePbufferARB pixel format lookup
28357ec681f3Smrg- d3d12: Use ID3D12Device9::CreateCommandQueue1 when available
28367ec681f3Smrg- d3d12: Use CreateDXGIFactory2 and use the debug flag when appropriate
28377ec681f3Smrg- wgl: Add unit test infrastructure for OpenGL32.dll on Windows
28387ec681f3Smrg- wgl: Add a context to framebuffer destruction
28397ec681f3Smrg- d3d12: Add a constant for num_buffers
28407ec681f3Smrg- d3d12: Clean up swapchains on framebuffer destruction
28417ec681f3Smrg- wgl, d3d12: Add a d3d12-specific test for swapchain leaks
28427ec681f3Smrg- microsoft/compiler: Move blob_init earlier to prevent crash on failure
28437ec681f3Smrg- microsoft/compiler: Add copy_prop_vars to optimization loop
28447ec681f3Smrg- microsoft/compiler: Add a lowering pass to split clip/cull distance compact arrays
28457ec681f3Smrg- microsoft/compiler: Enable dxil_nir.h to be included from C++
28467ec681f3Smrg- microsoft/compiler: Support compact arrays for clip/cull in nir_to_dxil
28477ec681f3Smrg- d3d12: Use compact arrays for clip/cull distance
28487ec681f3Smrg- microsoft/spirv_to_dxil: Implement TODO for removing dead functions
28497ec681f3Smrg- spirv_to_dxil: Handle clip/cull distance
28507ec681f3Smrg- microsoft/compiler: Fix barrier flag for shared memory
28517ec681f3Smrg- microsoft/spirv_to_dxil: Lower globals to function_temp
28527ec681f3Smrg- microsoft/spirv_to_dxil: Lower io arrays
28537ec681f3Smrg- microsoft/compiler: Support fp16 i/o vars
28547ec681f3Smrg- nir: Add a new opcode for [un]packing doubles
28557ec681f3Smrg- microsoft/compiler: Add a lowering pass to emit double [un]pack instructions
28567ec681f3Smrg- microsoft/compiler: Implement new double pack/unpack alu ops
28577ec681f3Smrg- microsoft/spirv_to_dxil: Support doubles
28587ec681f3Smrg- microsoft/compiler: Add some more float16 support
28597ec681f3Smrg- meson: Refuse to build lavapipe without llvmpipe
28607ec681f3Smrg- vtn: Don't warn about linkage capability if we're creating a NIR library
28617ec681f3Smrg- vtn: Add a cap for CL drivers to support read-write images
28627ec681f3Smrg- microsoft/clc: Update unit test to always use COMMON state for buffers
28637ec681f3Smrg- meson: For MSVC, suppress warnings generated by useless delayloads
28647ec681f3Smrg- driconf: Remove default values from string driconf entries
28657ec681f3Smrg- CI: Enable -werror for Windows
28667ec681f3Smrg- vtn: Support scoped control barriers for OpenCL too
28677ec681f3Smrg- nir_opt_deref: ptr_as_array(deref_cast<T*>(x))[0] isn't the same as x[0] if the cast has alignment
28687ec681f3Smrg- nir: Fix MSVC warning C4334 (32bit shift cast to 64bit)
28697ec681f3Smrg- d3d12: Fix MSVC warning C4334 (32bit shift cast to 64bit)
28707ec681f3Smrg- microsoft/clc: Fix MSVC unreferenced variable warnings
28717ec681f3Smrg- microsoft/clc: Fix undeclared function warning
28727ec681f3Smrg- microsoft/compiler: Fix MSVC warning C4334 (32bit shift cast to 64bit)
28737ec681f3Smrg- shader_enums: Fix MSVC warning C4334 (32bit shift cast to 64bit)
28747ec681f3Smrg- gallium/aux: Fix MSVC warning C4334 (32bit shift cast to 64bit)
28757ec681f3Smrg- llvmpipe: Fix MSVC warning C4334 (32bit shift cast to 64bit)
28767ec681f3Smrg- xmlconfig: Fix MSVC warning C4334 (32bit shift cast to 64bit)
28777ec681f3Smrg
28787ec681f3SmrgJesse Schwartzentruber (1):
28797ec681f3Smrg
28807ec681f3Smrg- glcpp: Fix undefined behaviour in glcpp
28817ec681f3Smrg
28827ec681f3SmrgJoel Linn (2):
28837ec681f3Smrg
28847ec681f3Smrg- freedreno/a2xx: fix/add RBBM perfcounter
28857ec681f3Smrg- freedreno/a2xx: add RB perfcounter 1-3
28867ec681f3Smrg
28877ec681f3SmrgJonathan Marek (13):
28887ec681f3Smrg
28897ec681f3Smrg- turnip: fix logicOp
28907ec681f3Smrg- turnip: delete unused vk_format_parse.py file
28917ec681f3Smrg- turnip: use vk_format_is_int to disable COLOR_ATTACHMENT_BLEND_BIT
28927ec681f3Smrg- turnip: IMAGE_FILTER_{LINEAR,CUBIC}_BIT only for non-integer formats
28937ec681f3Smrg- turnip: don't always use 3d ops for blit_image
28947ec681f3Smrg- turnip: add missing register write to disable dithering
28957ec681f3Smrg- freedreno/registers: use macro instead of inline function for array regs
28967ec681f3Smrg- freedreno/a6xx: update perfcntr registers (declare as arrays)
28977ec681f3Smrg- freedreno/a6xx: always use reg64 for address registers (no LO/HI)
28987ec681f3Smrg- freedreno/a6xx: update some registers
28997ec681f3Smrg- freedreno/a6xx: set SP_PERFCTR_ENABLE in computerator
29007ec681f3Smrg- turnip: implement VK_KHR_shader_float_controls
29017ec681f3Smrg- turnip: enable VK_KHR_shader_float16_int8
29027ec681f3Smrg
29037ec681f3SmrgJordan Justen (28):
29047ec681f3Smrg
29057ec681f3Smrg- intel/genxml/gen125: Add CFE_STATE and COMPUTE_WALKER
29067ec681f3Smrg- intel/compiler: Disable push constants on gen12-hp
29077ec681f3Smrg- anv: Emit CFE_STATE for gen12-hp
29087ec681f3Smrg- anv: Don't use MEDIA_INTERFACE_DESCRIPTOR_LOAD for gen12-hp
29097ec681f3Smrg- anv: Use COMPUTE_WALKER for gen12-hp
29107ec681f3Smrg- iris: Add support for COMPUTE_WALKER
29117ec681f3Smrg- iris: Fix android build due to missing link to libmesa_iris_gen125
29127ec681f3Smrg- anv: Add exec_flags to anv_queue
29137ec681f3Smrg- anv: Turn device->queue into an array
29147ec681f3Smrg- anv: Print queue number with INTEL_DEBUG=bat
29157ec681f3Smrg- anv: Support i915 query (DRM_IOCTL_I915_QUERY) from Linux v4.17
29167ec681f3Smrg- anv: Gather engine info from i915 if available
29177ec681f3Smrg- anv: Add anv_gem_count_engines
29187ec681f3Smrg- anv: Support multiple engines with DRM_IOCTL_I915_GEM_CONTEXT_CREATE_EXT
29197ec681f3Smrg- anv: Add ANV_QUEUE_OVERRIDE env-var to override advertised queues
29207ec681f3Smrg- i965: Call util_cpu_detect() early in screen creation
29217ec681f3Smrg- intel: Use GEN_VERSIONx10 in more places
29227ec681f3Smrg- intel/dev: Add devinfo genx10 field
29237ec681f3Smrg- intel: Use devinfo genx10 field
29247ec681f3Smrg- anv: Restructure mem heap/type init code
29257ec681f3Smrg- anv: Put cache memory type first on non-llc platforms
29267ec681f3Smrg- anv: Add mem heap/type support for local-mem
29277ec681f3Smrg- anv: Drop has_slm in emit_l3_config for gen11+
29287ec681f3Smrg- anv: Use fallback paths if DRM_I915_QUERY_ENGINE_INFO fails
29297ec681f3Smrg- i965/gen11: Fix must-be-ones bit positions in 3D_MODE
29307ec681f3Smrg- genxml/gen12: 3D_MODE bits 31:16 are no longer must-be-one
29317ec681f3Smrg- Revert "intel/compiler: Silence unused parameter warning in update_inst_scoreboard"
29327ec681f3Smrg- intel/compiler: Fix INTEL_DEBUG=hex
29337ec681f3Smrg
29347ec681f3SmrgJose Maria Casanova Crespo (4):
29357ec681f3Smrg
29367ec681f3Smrg- v3d: Enables DRM_FORMAT_MOD_BROADCOM_SAND128 support
29377ec681f3Smrg- v3d: YUV formats at is_dmabuf_modifier_supported are external_only
29387ec681f3Smrg- v3d: YUV formats at query_dmabuf_modifiers are external_only
29397ec681f3Smrg- v3d: DRM_FORMAT_MOD_BROADCOM_SAND128 only available for NV12 format.
29407ec681f3Smrg
29417ec681f3SmrgJoshua Ashton (3):
29427ec681f3Smrg
29437ec681f3Smrg- lavapipe: handle NULL pStrides in CmdBindVertexBuffers2EXT
29447ec681f3Smrg- lavapipe: implement CmdBindVertexBuffers with CmdBindVertexBuffers2EXT
29457ec681f3Smrg- lavapipe: enable KHR_image_format_list
29467ec681f3Smrg
29477ec681f3SmrgJosé Fonseca (6):
29487ec681f3Smrg
29497ec681f3Smrg- scons: Add u_format_pack.h include path.
29507ec681f3Smrg- wgl: Match opengl32.dll export ordinals.
29517ec681f3Smrg- util: Always use timespec_get on Windows.
29527ec681f3Smrg- appveyor: Remove integration.
29537ec681f3Smrg- scons: Remove.
29547ec681f3Smrg- gitlab-ci: Build meson-mingw32-x86_64 w/o zlib.
29557ec681f3Smrg
29567ec681f3SmrgJuan A. Suarez Romero (40):
29577ec681f3Smrg
29587ec681f3Smrg- v3d: fix dest offset in TFU setup
29597ec681f3Smrg- v3d: use a compatible supported format for TFU-based blit
29607ec681f3Smrg- vc4/ci: Replace expect script by python script
29617ec681f3Smrg- ci/vc4: allow custom timeout values for activity
29627ec681f3Smrg- ci/vc4: rename stage to Broadcom
29637ec681f3Smrg- ci/vc4: Add piglit job
29647ec681f3Smrg- ci: Bump deqp to current vulkan-cts-1.2.5.1
29657ec681f3Smrg- ci: add option to overwrite CPU arch
29667ec681f3Smrg- ci/v3d: Add V3D and V3DV testing
29677ec681f3Smrg- ci/v3d: Update expected resuls for piglit
29687ec681f3Smrg- ci/piglit: allow parallel piglit jobs
29697ec681f3Smrg- ci/vc4/v3d: Parallelize piglit jobs
29707ec681f3Smrg- ci/piglit: fix parallel piglit results
29717ec681f3Smrg- ci/baremetal: highlight message errors
29727ec681f3Smrg- ci/broadcom: retry always when serial log timeout
29737ec681f3Smrg- ci: Bump deqp to vk-gl-cts 1.2.5.2
29747ec681f3Smrg- ci/broadcom: allow custom kernels
29757ec681f3Smrg- vc4: destroy renderonly object if present
29767ec681f3Smrg- ci/armXX: add libgl1-mesa-dev dependency
29777ec681f3Smrg- ci/v3dv: add flaky test in the skip list
29787ec681f3Smrg- ci/vc4/v3d: run piglit testsuite against Xorg
29797ec681f3Smrg- ci/broadcom: use new piglit runner
29807ec681f3Smrg- ci/broadcom: update piglit expected results
29817ec681f3Smrg- ci/v3d: run full GLES3 and GLES31 testsuite
29827ec681f3Smrg- broadcom/compiler: fix unused value
29837ec681f3Smrg- v3dv: fix unused value
29847ec681f3Smrg- ci/v3dv: update flaky tests
29857ec681f3Smrg- broadcom/cle: do not leak spec
29867ec681f3Smrg- ci/broadcom: update expected list
29877ec681f3Smrg- v3d: use uint type in _gen_unpack_uint
29887ec681f3Smrg- broadcom/compiler: fix first_component assertion
29897ec681f3Smrg- broadcom/compiler: use signed pointers for packed condition
29907ec681f3Smrg- ci/broadcom: use SNMP to turn on/off devices
29917ec681f3Smrg- broadcom/compiler: use VPM offsets in GS load_per_vertex input
29927ec681f3Smrg- v3d: use GS_BIN inputs as VS_BIN outputs
29937ec681f3Smrg- v3dv: fix assertion
29947ec681f3Smrg- ci: Update VK-GL-CTS to 1.2.6.0
29957ec681f3Smrg- v3d: do not emit attribute if has no resource
29967ec681f3Smrg- ci/v3dv: skip Vulkan waiver tests
29977ec681f3Smrg- util/hash_table: do not leak u64 struct key
29987ec681f3Smrg
29997ec681f3SmrgJérôme Glisse (3):
30007ec681f3Smrg
30017ec681f3Smrg- gallium: add support for SVM (Share Virtual Memory) migrate
30027ec681f3Smrg- clover: implement clEnqueueSVMMigrateMem
30037ec681f3Smrg- nouveau: add support for SVM migrate
30047ec681f3Smrg
30057ec681f3SmrgKarol Herbst (9):
30067ec681f3Smrg
30077ec681f3Smrg- clover: track allocated svm pointers
30087ec681f3Smrg- clover/api: make use of validate_mem_migration_flags in clEnqueueMigrateMemObjects
30097ec681f3Smrg- nouveau: print warning about unhandled cap only once
30107ec681f3Smrg- clover: simplify image arguments
30117ec681f3Smrg- clover: rework quering image max sizes
30127ec681f3Smrg- clover: Fix build with llvm-12.
30137ec681f3Smrg- clover: Add missing include for llvm-12 build fix
30147ec681f3Smrg- tegra/context: fix regression in tegra_draw_vbo
30157ec681f3Smrg- tegra/context: unwrap indirect_draw_count as well
30167ec681f3Smrg
30177ec681f3SmrgKeith Packard (2):
30187ec681f3Smrg
30197ec681f3Smrg- glx: Provide glvnd wrapper for glXSwapIntervalEXT
30207ec681f3Smrg- wsi/x11: Fix type of target_msc argument to x11_present_to_x11_dri3
30217ec681f3Smrg
30227ec681f3SmrgKenneth Graunke (62):
30237ec681f3Smrg
30247ec681f3Smrg- vbo: Don't set node->min_index = max_index = indices_offset when merging
30257ec681f3Smrg- vbo: Only mark merged line strips as lines when actually converting them
30267ec681f3Smrg- tnl: Try not to botch index buffer munging when start > 0.
30277ec681f3Smrg- tnl: Respect \`start` when converting indices to GLuint
30287ec681f3Smrg- tnl: Reset nr_bos to 0 between map/unmap cycles.
30297ec681f3Smrg- Revert "mesa: allow half float textures based on ARB_half_float_pixel"
30307ec681f3Smrg- iris: Consider resolves after changing a resource's aux state
30317ec681f3Smrg- iris: Drop find_existing_assembly optimization from program cache
30327ec681f3Smrg- iris: Drop iris_print_program_cache().
30337ec681f3Smrg- iris: Refactor iris_debug_recompile interface to take a shader.
30347ec681f3Smrg- intel: Produce a "constrained" output from gen_get_urb_config()
30357ec681f3Smrg- iris: Reconfigure the URB only if it's necessary or possibly useful
30367ec681f3Smrg- iris: Move the URB size checks into iris_update_compiled_xs
30377ec681f3Smrg- iris: Properly handle new unbind_num_trailing_slots parameters
30387ec681f3Smrg- iris: Use shader_info rather than vs_prog_data for draw parameter checks
30397ec681f3Smrg- iris: Minor code restyling in iris_bind_vs_state
30407ec681f3Smrg- iris: Move VS draw parameter dirty flagging to iris_bind_vs_state
30417ec681f3Smrg- iris: Refcount shader variants
30427ec681f3Smrg- iris: Store a list of shader variants in the shader itself
30437ec681f3Smrg- iris: Enable PIPE_CAP_SHAREABLE_SHADERS.
30447ec681f3Smrg- iris: add hooks to call INTEL_MEASURE
30457ec681f3Smrg- iris: Fill out scratch base address dynamically
30467ec681f3Smrg- iris: Remove context from iris_debug_recompile
30477ec681f3Smrg- iris: Remove context from iris_upload_shader()
30487ec681f3Smrg- iris: Remove context from iris_compile_vs and friends
30497ec681f3Smrg- iris: Remove context from iris_create_uncompiled_shader
30507ec681f3Smrg- iris: Remove context from iris_disk_cache_retrieve
30517ec681f3Smrg- iris: Make a pin_scratch_space() helper
30527ec681f3Smrg- iris: Reference the shader variant for last_vue_map as well
30537ec681f3Smrg- iris: Pin surface state buffers after possibly updating the clear color
30547ec681f3Smrg- i965: Rename use_intel_mipree_map_blit to use_blitter_to_map
30557ec681f3Smrg- i965: Rename intel_batchbuffer_* to brw_batch_*.
30567ec681f3Smrg- i965: Rename intel_screen to brw_screen
30577ec681f3Smrg- i965: Rename intel_texture_{object,image} to brw_texture_{object,image}
30587ec681f3Smrg- i965: Rename intel_renderbuffer to brw_renderbuffer
30597ec681f3Smrg- i965: Rename intel_mip* to brw_mip*.
30607ec681f3Smrg- i965: Use __func__ in blorp perf_debug macros
30617ec681f3Smrg- i965: Rename intel_buffer_object to brw_buffer_object
30627ec681f3Smrg- i965: Rename intel_image_format and intel_buffer to brw_*
30637ec681f3Smrg- i965: Rename the rest of intel_* functions to brw_*
30647ec681f3Smrg- i965: Rename intelInit and brwInit camel-case functions to brw_*
30657ec681f3Smrg- i965: Rename some camel-case local variables
30667ec681f3Smrg- i965: Rename more camel-case functions to brw and underscore style
30677ec681f3Smrg- i965: Rename DRI extension structs to be "brw" instead of "intel"
30687ec681f3Smrg- i965: Eliminate all tabs except in brw_defines.h
30697ec681f3Smrg- tnl: Call _mesa_matrix_analyse to make sure the inverse MVP is updated
30707ec681f3Smrg- glsl/float64: Bump #version to 400
30717ec681f3Smrg- iris: Defer uploading of surface states
30727ec681f3Smrg- iris: Defer stream output target space allocation until set time
30737ec681f3Smrg- iris: Rework zeroing of stream output buffer offsets
30747ec681f3Smrg- iris: Support rebinding of stream output targets
30757ec681f3Smrg- iris: Use different shader uploaders for precompile vs. draw time
30767ec681f3Smrg- iris: Make various classes inherit from u_threaded_context base classes
30777ec681f3Smrg- iris: Use thread safe slab allocators in transfer_map handling
30787ec681f3Smrg- iris: Enable u_threaded_context
30797ec681f3Smrg- vbo: Fix vbo_sw_primitive_restart for start > 0
30807ec681f3Smrg- intel/genxml: Add a partial GT_MODE definition for Gen11+.
30817ec681f3Smrg- iris: Delete stale comment in iris_lost_context_state
30827ec681f3Smrg- intel: Fix release build breakage
30837ec681f3Smrg- Half-revert "gallium/dri2: Pass the resource that corresponds to the plane"
30847ec681f3Smrg- intel: Mark an otherwise unused variable in intel_dump_gpu as ASSERTED
30857ec681f3Smrg- ci: Enable iris testing in meson-release
30867ec681f3Smrg
30877ec681f3SmrgKristian Høgsberg (1):
30887ec681f3Smrg
30897ec681f3Smrg- macros: Add thread-safety annotation macros
30907ec681f3Smrg
30917ec681f3SmrgLeo Liu (12):
30927ec681f3Smrg
30937ec681f3Smrg- radeon/vcn: clean the message buffers and their indexes logic
30947ec681f3Smrg- radeon/vcn: add dynamic dpb interface
30957ec681f3Smrg- radeon/vcn: add dynamic dpb buffer Tier1 support
30967ec681f3Smrg- radeon/vcn: enable dynamic dpb Tier1 support
30977ec681f3Smrg- radeon/vcn: add dynamic dpb Tier2 message buffer interface
30987ec681f3Smrg- radeon/vcn: implement dynamic dpb Tier2 support
30997ec681f3Smrg- radeon/vcn: enable dynamic dpb Tier2 support
31007ec681f3Smrg- meson: bump drm amdgpu version to 2.4.105
31017ec681f3Smrg- ci: Fix meson-i386 build failed after libdrm bump version
31027ec681f3Smrg- include/drm-uapi: bump AMDGPU headers
31037ec681f3Smrg- ac: add function for querying video capabilities
31047ec681f3Smrg- radeonsi: replace the hard coded video decode and encode caps
31057ec681f3Smrg
31067ec681f3SmrgLepton Wu (3):
31077ec681f3Smrg
31087ec681f3Smrg- virgl: Don't destroy resource while it's in use.
31097ec681f3Smrg- virgl: Use atomic operation directly.
31107ec681f3Smrg- virgl: move new added field to the end.
31117ec681f3Smrg
31127ec681f3SmrgLionel Landwerlin (96):
31137ec681f3Smrg
31147ec681f3Smrg- anv: add transfer usage for color/depth/stencil attachments
31157ec681f3Smrg- anv: don't disable KHR_performance_query in debug mode
31167ec681f3Smrg- intel/mi_builder: optimize 64bit immediate register loads & memory stores
31177ec681f3Smrg- intel/mi_builder: fix self modifying batches
31187ec681f3Smrg- intel/perf: restructure i915 perf version checks
31197ec681f3Smrg- intel/perf: add definition for generic perf counters
31207ec681f3Smrg- intel/perf: link queries back to the gen_perf_config object
31217ec681f3Smrg- intel/perf: move gt_frequency to results
31227ec681f3Smrg- anv: Fix stencil layout in render passes
31237ec681f3Smrg- intel: silence unused var warnings in release builds
31247ec681f3Smrg- anv: fix invalid programming of BLEND_STATE
31257ec681f3Smrg- intel/common: store sample position in plain arrays
31267ec681f3Smrg- anv: pass context to reset stats helper
31277ec681f3Smrg- anv: store queue creation flags on anv_queue
31287ec681f3Smrg- genxml: PERFCNT registers are available since HSW
31297ec681f3Smrg- intel/perf: prep work to enable new perf counters
31307ec681f3Smrg- intel/perf: query register descriptions
31317ec681f3Smrg- intel/perf: add performance query layout using MI_SRM
31327ec681f3Smrg- intel/perf: switch query code to use query layout
31337ec681f3Smrg- anv: fix layout comment
31347ec681f3Smrg- anv: remove unused query pool field
31357ec681f3Smrg- intel/perf: rename lkf into ehl
31367ec681f3Smrg- intel/perf: add reorder script
31377ec681f3Smrg- intel/perf: reorder xml files
31387ec681f3Smrg- intel/perf: remove reordering script
31397ec681f3Smrg- intel/perf: update files from IGT
31407ec681f3Smrg- intel/perf: small ICL equation refactor
31417ec681f3Smrg- intel/perf: add async compute metrics
31427ec681f3Smrg- intel/dev: identify tigerlake
31437ec681f3Smrg- intel/perf: break TGL perf configs in GT1/2
31447ec681f3Smrg- intel/dev: identify rocketlake
31457ec681f3Smrg- intel/perf: add RKL support
31467ec681f3Smrg- intel/perf: add DG1 support
31477ec681f3Smrg- intel/perf: drop the special READ_REG operator
31487ec681f3Smrg- anv: compute commands required to implement perf queries
31497ec681f3Smrg- anv: switch khr perf query code to use query layout
31507ec681f3Smrg- anv: switch intel perf queries to query layout
31517ec681f3Smrg- anv: add a comment describing has_relocs field
31527ec681f3Smrg- anv: break up internal queueing function
31537ec681f3Smrg- anv: only signal wsi fence BO on last command buffer
31547ec681f3Smrg- drm-shim: report support for timeline semaphores
31557ec681f3Smrg- intel/stub: plug some gaps in our ioctl faking
31567ec681f3Smrg- anv: print out perf permission warning only once
31577ec681f3Smrg- anv: discard all timeline wait/signal value=0
31587ec681f3Smrg- vulkan: document flags choice for vkGetDeviceQueue
31597ec681f3Smrg- genxml: add MI_SET_APPID on Gen12+
31607ec681f3Smrg- genxml: Add PIPE_CONTROL protected memory bits
31617ec681f3Smrg- isl: add external parameter to isl_mocs()
31627ec681f3Smrg- anv: track command buffer pool flags
31637ec681f3Smrg- anv: track buffer creation flags
31647ec681f3Smrg- intel/dev: identify alderlake
31657ec681f3Smrg- intel/perf: Add Alderlake metrics
31667ec681f3Smrg- intel/perf: fix roll over PERF_CNT counter accumulation
31677ec681f3Smrg- anv: reset binary syncobj to be signaled before submission
31687ec681f3Smrg- anv: don't wait for completion of work on vkQueuePresent()
31697ec681f3Smrg- anv: Fix wait_count missing increment
31707ec681f3Smrg- anv: make use of new helper function directly in anv_QueueSubmit()
31717ec681f3Smrg- anv: track the end of the command buffers
31727ec681f3Smrg- anv: end command buffer with a potential jump
31737ec681f3Smrg- anv: allow multiple command buffers in anv_queue_submit
31747ec681f3Smrg- anv: group as many command buffers into a single execbuf
31757ec681f3Smrg- anv: fix missing general state pool in validation list
31767ec681f3Smrg- anv: implement INTEL_DEBUG=submit
31777ec681f3Smrg- anv: fix MI_PREDICATE_RESULT write
31787ec681f3Smrg- intel/tools: fix meson warning
31797ec681f3Smrg- intel/dev: add helpers to compute subslice/eu total
31807ec681f3Smrg- intel/dev: add warning on missing kernel uAPI for Gen8+
31817ec681f3Smrg- iris: use gen_device_info helper to get subslice total
31827ec681f3Smrg- i965: stop using get_param for things queried by gen_device_info
31837ec681f3Smrg- anv: stop using get_param for things queried by gen_device_info
31847ec681f3Smrg- intel/dev: switch over to mesa log infrastructure
31857ec681f3Smrg- anv: move L3 config emission to genX_state.c
31867ec681f3Smrg- anv: move L3 initialization to device init on Gen11+
31877ec681f3Smrg- intel: install intel_device_info
31887ec681f3Smrg- intel/fs/vec4: add missing dependency in write-on-write fixed GRFs
31897ec681f3Smrg- intel/dev: store size of CS prefetch
31907ec681f3Smrg- intel/mi_builder: use device info to use the right CS prefetch size
31917ec681f3Smrg- anv: use the device size of CS prefetch to pad secondary buffer calls
31927ec681f3Smrg- meson: switch vulkan layer to list of choices
31937ec681f3Smrg- intel: Add null hw layer
31947ec681f3Smrg- gitlab-ci: fix vulkan build layer enabling
31957ec681f3Smrg- intel/nullhw: fix build
31967ec681f3Smrg- etnaviv/drm: only print out fence error on non timeout
31977ec681f3Smrg- intel/fs/copy_prop: check stride constraints with actual final type
31987ec681f3Smrg- intel/fs: implement another copy propagation restriction
31997ec681f3Smrg- intel/compiler: lower bit sizes in NIR postprocessing
32007ec681f3Smrg- anv: put correct number of BT prefetch for compute on XeHP+
32017ec681f3Smrg- intel/fs: limit OW reads to 8 owords on XeHP+
32027ec681f3Smrg- microsoft: fixup clc_log() define
32037ec681f3Smrg- anv: bump internal descriptor index fields to 32bits
32047ec681f3Smrg- anv: fix 3DSTATE_MULTISAMPLE emission on gen8+
32057ec681f3Smrg- anv: disable baked in pipeline bits from dynamic emission path
32067ec681f3Smrg- spirv: fix uToAccelerationStructure handling
32077ec681f3Smrg- spirv: fixup pointer_to/from_ssa with acceleration structures
32087ec681f3Smrg- vulkan/wsi/display: don't report support if there is no drm fd
32097ec681f3Smrg- i965/bufmgr: fix invalid assertion
32107ec681f3Smrg
32117ec681f3SmrgLucas Stach (5):
32127ec681f3Smrg
32137ec681f3Smrg- renderonly: remove layering violations
32147ec681f3Smrg- renderonly: close the gpu fd when destroying renderonly
32157ec681f3Smrg- etnaviv: don't try to copy PIPE_BUFFER with the 3D engine
32167ec681f3Smrg- etnaviv: remove stale comment in etna_resource_copy_region
32177ec681f3Smrg- Revert remaining half of "gallium/dri2: Pass the resource that corresponds to the plane"
32187ec681f3Smrg
32197ec681f3SmrgLukas Feller (2):
32207ec681f3Smrg
32217ec681f3Smrg- v3dv: fix assertion in job_compute_frame_tiling
32227ec681f3Smrg- v3dv: fix stride in buffer copy
32237ec681f3Smrg
32247ec681f3SmrgMarcin Ślusarz (20):
32257ec681f3Smrg
32267ec681f3Smrg- intel/perf: export information about units of performance counters
32277ec681f3Smrg- intel/compiler: cache computed register pressure benefit
32287ec681f3Smrg- intel/tools/aub: print better error message when mmap fails
32297ec681f3Smrg- intel/tools/aub: handle truncated input file
32307ec681f3Smrg- intel/tools/aub: remove superfluous new line from error messages
32317ec681f3Smrg- intel/dump_gpu: mark bo as unmapped if its address changes
32327ec681f3Smrg- anv: fix memory allocation error handling
32337ec681f3Smrg- iris: fix decode_get_bo
32347ec681f3Smrg- i965: fix decode_get_bo
32357ec681f3Smrg- intel/batch_decoder: catch invalid sampler state pointer
32367ec681f3Smrg- intel/batch_decoder: drop bogus check
32377ec681f3Smrg- intel/batch_decoder: fix decoding of sampler states
32387ec681f3Smrg- intel/batch_decoder: assert on invalid sampler pointer
32397ec681f3Smrg- intel/aub_viewer: catch invalid sampler state pointer
32407ec681f3Smrg- intel/aub_viewer: drop bogus check
32417ec681f3Smrg- intel/aub_viewer: fix decoding of sampler states
32427ec681f3Smrg- gallium: add PIPE_CAP_ALLOW_DYNAMIC_VAO_FASTPATH
32437ec681f3Smrg- iris: disable dynamic VAO fastpath on GFX version 9
32447ec681f3Smrg- gallium/u_threaded: implement INTEL_performance_query hooks
32457ec681f3Smrg- gallium/u_threaded: offload begin/end_intel_perf_query
32467ec681f3Smrg
32477ec681f3SmrgMarek Olšák (406):
32487ec681f3Smrg
32497ec681f3Smrg- mesa: always set valid index bounds for non-indexed draws for classic drivers
32507ec681f3Smrg- st/nine: stop using cso_set_sampler_views
32517ec681f3Smrg- st/xa: stop using cso_set_sampler_views
32527ec681f3Smrg- gallium/tests: stop using cso_set_sampler_views
32537ec681f3Smrg- gallium/api: add state invalidate interface as alternative to cso_save/restore
32547ec681f3Smrg- gallium/hud: don't use cso_context to restore VBs, constbuf 0 and sampler views
32557ec681f3Smrg- gallium/pp: don't use cso_context to restore VBs, constbuf 0 and sampler views
32567ec681f3Smrg- st/mesa: don't use cso_context to restore VBs, sampler views for glBitmap
32577ec681f3Smrg- st/mesa: don't use cso_context to restore VBs for glClear
32587ec681f3Smrg- st/mesa: don't use cso_context to restore VBs, sampler views for glDrawPixels
32597ec681f3Smrg- st/mesa: don't use cso_context to restore VBs, sampler views for glDrawTex*OES
32607ec681f3Smrg- st/mesa: don't use cso_context to restore VBs, etc. for PBO glReadPixels
32617ec681f3Smrg- st/mesa: don't use cso_context to restore VBs, etc. for PBO glTexSubImage
32627ec681f3Smrg- st/mesa: don't use cso_context to set const bufs, sampler views and images
32637ec681f3Smrg- st/mesa: replace st->pipe with pipe in a few places
32647ec681f3Smrg- cso_context: remove ability to restore VBs, const bufs, sampler views, images
32657ec681f3Smrg- st/mesa: unbind sampler views, images, and vertex buffers after meta ops
32667ec681f3Smrg- st/mesa: optimize binding and unbinding shader images
32677ec681f3Smrg- radeonsi: constant buffer cleanups
32687ec681f3Smrg- radeonsi: don't clear unaligned bits when unbinding vertex buffers
32697ec681f3Smrg- radeonsi: move emit_cache_flush functions into si_gfx_cs.c
32707ec681f3Smrg- radeonsi: don't pass pipe_draw_info into si_emit_vs_state
32717ec681f3Smrg- radeonsi: don't pass pipe_draw_info into si_emit_ia_multi_vgt_param
32727ec681f3Smrg- radeonsi: translate pipe_prim_type only when it changes
32737ec681f3Smrg- radeonsi: don't pass pipe_draw_info into si_emit_derived_tess_state
32747ec681f3Smrg- radeonsi: don't compute average vertex count in si_draw_vbo
32757ec681f3Smrg- radeonsi: fix si_num_prims_for_vertices for PIPE_PRIM_POLYGON
32767ec681f3Smrg- radeonsi: make cik_emit_prefetch_L2 templated and move it to si_state_draw.cpp
32777ec681f3Smrg- radeonsi: add a specialized function for CP DMA L2 prefetch
32787ec681f3Smrg- radeonsi: make sctx->vertex_elements always non-NULL
32797ec681f3Smrg- radeonsi: remove MRT-draw-calls, spill-draw-calls, spill-compute-calls
32807ec681f3Smrg- radeonsi: get out of si_emit_vs_state early for blit vertex shaders
32817ec681f3Smrg- radeonsi: rearrange condition for streamout workaround on gfx7 and gfx8
32827ec681f3Smrg- radeonsi: don't use si_get_vs_state in most places
32837ec681f3Smrg- radeonsi: trim the size of si_vgt_param_key and si_vgt_stages_key
32847ec681f3Smrg- mesa: fix alpha channel of ETC2_SRGB8 decompression for !bgra
32857ec681f3Smrg- radeonsi: unify uploaders on APUs too
32867ec681f3Smrg- radeonsi: don't pass pipe_draw_info into si_emit_draw_registers
32877ec681f3Smrg- radeonsi: don't set context_roll for non-gfx9 in templated functions
32887ec681f3Smrg- radeonsi: add si_get_user_data_base selecting user data registers
32897ec681f3Smrg- radeonsi: evaluate sh_base in si_emit_vs_state at compile time
32907ec681f3Smrg- radeonsi: inline the last use of si_get_vs_state
32917ec681f3Smrg- radeonsi: evaluate si_get_vs in si_draw_vbo at compile time
32927ec681f3Smrg- radeonsi: enable the GS tri strip adj workaround with primitive_restart
32937ec681f3Smrg- radeonsi: clear dirty_atoms and dirty_states only if we entered the emit loop
32947ec681f3Smrg- radeonsi: move variables closer to their use in most draw state functions
32957ec681f3Smrg- radeonsi: don't validate inlinable uniforms at draw time
32967ec681f3Smrg- radeonsi: allow instance_count == 0 on chips that handle it correctly
32977ec681f3Smrg- glthread: remove marshal="draw" because it doesn't do much
32987ec681f3Smrg- glthread: don't sync with NV_half_float vertex attrib functions
32997ec681f3Smrg- glthread: add specialized versions of unmarshal_Draw funcs without user buffers
33007ec681f3Smrg- glthread: track all matrix stack depths
33017ec681f3Smrg- glthread: implement glGetIntegerv for states that glthread tracks
33027ec681f3Smrg- glthread: rename inside_dlist to ListMode for future use
33037ec681f3Smrg- glthread: remove if (COMPAT) conditions from functions that are GL-compat-only
33047ec681f3Smrg- mesa: add _mesa_get_list helper
33057ec681f3Smrg- glthread: add display list support to fix state tracking with display lists
33067ec681f3Smrg- mesa: remove _mesa_initialize_exec_dispatch from draw.c by autogenerating it
33077ec681f3Smrg- mesa: remove redundant glRect functions for display lists
33087ec681f3Smrg- mesa: optimize glCallLists by using loops inside a switch
33097ec681f3Smrg- mesa: simplify handling OPCODE_CONTINUE for display lists
33107ec681f3Smrg- mesa: simplify terminating display list loops
33117ec681f3Smrg- mesa: remove STATE_INTERNAL
33127ec681f3Smrg- mesa: combine STATE_ENV, STATE_LOCAL enums with STATE_xxx_PROGRAM
33137ec681f3Smrg- mesa: flatten STATE_MATERIAL and STATE_LIGHTPROD tokens
33147ec681f3Smrg- mesa: eliminate the switch statement for STATE_TEXGEN
33157ec681f3Smrg- glsl: remove unused internal builtin gl_CurrentAttribVertMESA
33167ec681f3Smrg- glsl: split gl_CurrentAttribFragMESA into elements
33177ec681f3Smrg- mesa: skip memmove in optimize_state_parameters if it's no-op
33187ec681f3Smrg- mesa: rename STATE_LIGHT_ATTRIBS -> STATE_LIGHT_ARRAY for consistency
33197ec681f3Smrg- mesa: optimize get_local_param_pointer and program_local_parameters4fv
33207ec681f3Smrg- mesa: don't allocate local parameters in fetch_state
33217ec681f3Smrg- mesa: merge local and env program parameters for faster uploads
33227ec681f3Smrg- mesa: sort state vars with constant indexing for ARB programs
33237ec681f3Smrg- mesa: add upper bound to limit program state var iterations
33247ec681f3Smrg- mesa: compute gl_program_parameter_list::UniformBytes accurately
33257ec681f3Smrg- mesa: don't handle STATE_* enums in fetch_state that don't do anything
33267ec681f3Smrg- mesa: sort and tightly pack STATE_* enums to generate better switch code
33277ec681f3Smrg- mesa: merge equivalent switch cases in prog_statevars.c
33287ec681f3Smrg- st/mesa: enable state var merging to improve fetch_state performance
33297ec681f3Smrg- radeonsi: add new possibly faster command submission helpers
33307ec681f3Smrg- radeonsi: clear dirty_states if si_pm4_bind_state is unbinding or no-op
33317ec681f3Smrg- radeonsi: don't mark NULL states as dirty in si_pm4_reset_emitted
33327ec681f3Smrg- radeonsi: optimize translating index_size to index_type
33337ec681f3Smrg- radeonsi: don't use rasterizer_discard to validate draws, only check ps_shader
33347ec681f3Smrg- radeonsi: add internal blitter_running flag
33357ec681f3Smrg- radeonsi: simplify determining whether render condition is enabled at draw time
33367ec681f3Smrg- radeonsi: inline si_blend_color and si_clip_state structures
33377ec681f3Smrg- radeonsi: move y_inverted out of si_viewports
33387ec681f3Smrg- radeonsi: don't set vertex buffer dirty flags when they don't do anything
33397ec681f3Smrg- radeonsi: move if (sctx->vertex_buffers_dirty) into the upload function
33407ec681f3Smrg- radeonsi: rename SI_SGPR_RW_BUFFERS to SI_SGPR_INTERNAL_BINDINGS
33417ec681f3Smrg- radeonsi: skip some code for ALLOW_PRIM_DISCARD_CS if tess or GS is enabled
33427ec681f3Smrg- radeonsi: enable accidentally disabled fast launch with non-indexed tri strips
33437ec681f3Smrg- radeonsi: iterate from draw 1 for total/min_direct_count computation
33447ec681f3Smrg- st/mesa: don't enable smoothing if multisampling is enabled
33457ec681f3Smrg- Revert "gallium/u_upload_mgr: allow use of FLUSH_EXPLICIT with persistent mappings"
33467ec681f3Smrg- gallium: add take_ownership param into set_constant_buffer to eliminate atomics
33477ec681f3Smrg- gallium: add unbind_num_trailing_slots to set_vertex_buffers
33487ec681f3Smrg- gallium: add unbind_num_trailing_slots to set_shader_images
33497ec681f3Smrg- gallium: add unbind_num_trailing_slots to set_sampler_views
33507ec681f3Smrg- gallium: add take_ownership param into set_vertex_buffers to eliminate atomics
33517ec681f3Smrg- cso_context,u_vbuf: add take_ownership param into set_vertex_buffers
33527ec681f3Smrg- st/mesa: eliminate all atomic ops when setting vertex buffers
33537ec681f3Smrg- st/mesa: skip atomics when binding UBOs
33547ec681f3Smrg- gallium/u_upload_mgr: eliminate all atomics for the upload buffer
33557ec681f3Smrg- gallium/u_threaded: add a null constant buffer codepath
33567ec681f3Smrg- gallium/u_threaded: unify user and non-user codepaths in set_constant_buffer
33577ec681f3Smrg- gallium/util: optimize pipe_vertex_buffer_reference binding the same buffer
33587ec681f3Smrg- gallium,u_threaded: add pipe_draw_info::take_index_buffer_ownership
33597ec681f3Smrg- st/mesa: set take_index_buffer_ownership to skip an atomic in u_threaded
33607ec681f3Smrg- gallium/u_vbuf: skip draws with 0 vertices
33617ec681f3Smrg- radeonsi: fix centroid with VRS coarse shading
33627ec681f3Smrg- glthread: fix interpreting vertex size == GL_BGRA for vertex attribs
33637ec681f3Smrg- glthread: fix glVertexAttribDivisor calls not being tracked by non-VBO uploads
33647ec681f3Smrg- glapi: guard against invalid XML definitions for glthread
33657ec681f3Smrg- ac,radeonsi: track memory usage in KB to reduce types from uint64 to uint32
33667ec681f3Smrg- radeonsi: optimize no-op cases in si_upload_shader_descriptors
33677ec681f3Smrg- radeonsi: mark shader_pointers dirty once outside the upload descriptors loop
33687ec681f3Smrg- radeonsi: move si_pm4_delete_state logic into si_pm4_free_state
33697ec681f3Smrg- radeonsi: delete si_pm4_delete_state
33707ec681f3Smrg- radeonsi: don't check for redundant and NULL states in si_emit_all_states
33717ec681f3Smrg- radeonsi: optimize si_emit_prefetch_L2 when it's split
33727ec681f3Smrg- radeonsi: reorganize si_draw_vbo for lower register pressure (part 1)
33737ec681f3Smrg- radeonsi: reorganize si_draw_vbo for lower register pressure (part 2)
33747ec681f3Smrg- radeonsi: set VB user SGPRs in si_upload_vertex_buffer_descriptors
33757ec681f3Smrg- radeonsi: prefetch VB descriptors right after uploading
33767ec681f3Smrg- radeonsi: precompute NGG cull flags in si_create_rs_state
33777ec681f3Smrg- mesa: remove/replace FLUSH_VERTICES when it doesn't do anything
33787ec681f3Smrg- mesa: optimize most _mesa_ActiveTexture calls in glPopAttrib
33797ec681f3Smrg- mesa: optimize glPopAttrib(GL_VIEWPORT_BIT)
33807ec681f3Smrg- mesa: don't push/pop gl_texture_object::Target
33817ec681f3Smrg- mesa: fix glPopAttrib for many texture fields
33827ec681f3Smrg- mesa: flush glBegin/End before changing GL_DEPTH_STENCIL_TEXTURE_MODE
33837ec681f3Smrg- mesa: for every state change, remember states we changed for glPopAttrib
33847ec681f3Smrg- mesa: fix trivial bugs in glPopAttrib
33857ec681f3Smrg- mesa: optimize out _NEW_ALL in glPopAttrib(GL_ENABLE_BIT)
33867ec681f3Smrg- mesa: only pop states in glPopAttrib that have been changed since glPushAttrib
33877ec681f3Smrg- mesa: partially skip glPush/PopAttrib for MSAA textures and texture buffers
33887ec681f3Smrg- mesa: don't save gl_shared_state in glPushAttrib
33897ec681f3Smrg- mesa: don't push/pop default texture attributes redundantly
33907ec681f3Smrg- mesa: pop all textures up to NumCurrentTexUsed, not just MaxTextureUnits
33917ec681f3Smrg- mesa: don't count buffer references for the context that created them
33927ec681f3Smrg- radeonsi: set current_rast_prim at bind time for tess and GS
33937ec681f3Smrg- radeonsi: simplify the NGG culling condition in si_draw_vbo
33947ec681f3Smrg- radeonsi: tune NGG shader culling vertex threshold for each chip
33957ec681f3Smrg- radeon: decrease the size of radeon_cmdbuf by switching prev fields to uint16
33967ec681f3Smrg- Revert "gallium/u_vbuf: skip draws with 0 vertices"
33977ec681f3Smrg- gallium/u_vbuf: skip non-indirect draws with 0 vertices
33987ec681f3Smrg- winsys/amdgpu,radeonsi: add HUD counters for how much memory is wasted by slabs
33997ec681f3Smrg- winsys/amdgpu: clean up slab alignment code, handle small buffers better
34007ec681f3Smrg- winsys/amdgpu,pb_slab: add slabs with 3/4 of power of two sizes to save memory
34017ec681f3Smrg- winsys/amdgpu: expand the slab allocation range to [256 B, 1 MB]
34027ec681f3Smrg- winsys/amdgpu: optimize out conditionals in amdgpu_lookup_buffer
34037ec681f3Smrg- winsys/amdgpu: remove amdgpu_winsys_bo::num_cs_references to remove atomics
34047ec681f3Smrg- winsys/amdgpu: pack amdgpu_winsys_bo::is_shared and protect it by a mutex
34057ec681f3Smrg- winsys/amdgpu: move amdgpu_winsys_bo::cpu_ptr into the u.real union
34067ec681f3Smrg- winsys/amdgpu: move amdgpu_winsys_bo::is_shared to the u.real union
34077ec681f3Smrg- winsys/amdgpu: move amdgpu_winsys_bo::is_user_ptr to the u.real union
34087ec681f3Smrg- winsys/amdgpu: move amdgpu_winsys_bo::use_reusable_pool to the u.real union
34097ec681f3Smrg- winsys/amdgpu: don't inc/dec num_active_ioctls for backing BOs of sparse BOs
34107ec681f3Smrg- winsys/amdgpu: don't set unused usage for backing BOs of sparse BOs
34117ec681f3Smrg- winsys/amdgpu: try not to skip any code with RADEON_NOOP=1 to test CPU perf
34127ec681f3Smrg- tgsi_to_nir: translate SAMPLEID
34137ec681f3Smrg- tgsi_to_nir: translate FBFETCH
34147ec681f3Smrg- gallium/u_tests: test no-op fragment shader instead of NULL fragment shader
34157ec681f3Smrg- winsys/amdgpu: disallow pb_cache for backing buffers of sparse buffers
34167ec681f3Smrg- ac/gpu_info: print use_late_alloc
34177ec681f3Smrg- ac/gpu_info: rename num_tcc_blocks -> max_tcc_blocks
34187ec681f3Smrg- ac/gpu_info: add radeon_info::num_tcc_blocks
34197ec681f3Smrg- ac/gpu_info: remove redundant radeon_info::num_sdp_interfaces
34207ec681f3Smrg- ac/gpu_info: inline get_l2_cache_size and set cache sizes farther down
34217ec681f3Smrg- ac/gpu_info: conceal L2 cache sizes
34227ec681f3Smrg- amd: sort chip enums based on hw revision
34237ec681f3Smrg- radeonsi: skip s_sendmsg(gs_alloc_req) for NGG passthrough on new chips
34247ec681f3Smrg- radeonsi: add debug options nodisplaytiling and nodisplaydcc
34257ec681f3Smrg- amd: update addrlib
34267ec681f3Smrg- mesa: optimize draw index type checking
34277ec681f3Smrg- mesa: precompute all valid primitive types at context creation
34287ec681f3Smrg- mesa: precompute draw time prim validation during state changes
34297ec681f3Smrg- mesa: move check_valid_to_render call into _mesa_valid_prim_mode
34307ec681f3Smrg- mesa: fold most of check_valid_to_render into _mesa_update_valid_to_render_state
34317ec681f3Smrg- mesa: inline check_valid_to_render
34327ec681f3Smrg- mesa: add skeleton code for DrawPixels/CopyPixels/Bitmap precomputed validation
34337ec681f3Smrg- mesa: don't report 1 for GL_VALIDATE_STATUS if user didn't validate pipeline
34347ec681f3Smrg- mesa: move shader pipeline validation from draws to state changes
34357ec681f3Smrg- mesa: move sampler uniform validation from draws to state changes
34367ec681f3Smrg- mesa: move some uniform debug code from draws to state changes
34377ec681f3Smrg- mesa: move FBO completeness checking from draws to state changes
34387ec681f3Smrg- mesa: move ARB program and integer FBO validation from draws to state changes
34397ec681f3Smrg- mesa: move GL_FILL_RECTANGLE validation from draws to state changes
34407ec681f3Smrg- mesa: move blending validation from draws to state changes
34417ec681f3Smrg- mesa: inline _mesa_valid_to_render now that it doesn't do validation
34427ec681f3Smrg- mesa: optimize the dual source blend error checking using a bitmask
34437ec681f3Smrg- mesa: remove VERBOSE_DRAW
34447ec681f3Smrg- mesa: remove optional draw validation code to increase performance
34457ec681f3Smrg- mesa: call _mesa_update_state() before validation
34467ec681f3Smrg- mesa: remove an optional GL error about mapped buffers during execution
34477ec681f3Smrg- mesa: skip MultiDrawArrays with primcount == 0
34487ec681f3Smrg- mesa: don't skip draws with count == 0 or numInstances == 0
34497ec681f3Smrg- mesa: add a separate valid primitive mask just for glDrawElements
34507ec681f3Smrg- mesa: move disallowed TFB in DrawElements on GLES from draws to state changes
34517ec681f3Smrg- mesa: validate numInstances in common functions to unify code
34527ec681f3Smrg- mesa: optimize set_varying_vp_inputs by precomputing the conditions
34537ec681f3Smrg- mesa: move gl_context::varying_vp_inputs into ctx->VertexProgram._VaryingInputs
34547ec681f3Smrg- mesa: set _DrawVAOEnabledAttribs only when it changes
34557ec681f3Smrg- mesa: precompute _mesa_get_vao_vp_inputs
34567ec681f3Smrg- mesa: precompute draw time determination of enabled vertex arrays
34577ec681f3Smrg- mesa: gather errors and call _mesa_error only once in validate_Draw
34587ec681f3Smrg- mesa: inline _mesa_set_draw_vao and set_varying_vp_inputs for draw calls
34597ec681f3Smrg- mesa: inline draw validate functions
34607ec681f3Smrg- mesa: add debug code to catch missing _mesa_update_valid_to_render_state calls
34617ec681f3Smrg- ac/surface: use family_id so as not to crash with SI_FORCE_FAMILY in addrlib
34627ec681f3Smrg- radeonsi: for tess, determine the minimum num_patches before optimizing tg size
34637ec681f3Smrg- radeonsi: improve comments in si_emit_derived_tess_state
34647ec681f3Smrg- radeonsi: allocate filled_size for streamout targets in set_streamout_buffers
34657ec681f3Smrg- radeonsi: do late NIR optimizations after uniform inlining
34667ec681f3Smrg- radeonsi: fix the value of uses_bindless_samplers
34677ec681f3Smrg- radeonsi: gather info about bindless images and memory stores with strstr(intr)
34687ec681f3Smrg- radeonsi: gather shader info about indirect UBO/SSBO/samplers/images
34697ec681f3Smrg- radeonsi: gather shader info about VMEM usage for MEM_ORDERED
34707ec681f3Smrg- radeonsi: set MEM_ORDERED optimally
34717ec681f3Smrg- glthread: assume all parameters are fixed if marshal_sync is present
34727ec681f3Smrg- glthread: don't declare pointers with const in unmarshal functions
34737ec681f3Smrg- glthread: don't sync when using pixel buffer objects
34747ec681f3Smrg- glthread: ignore the return value of glUnmapBuffer, don't sync, and return true
34757ec681f3Smrg- i915: use align_calloc for the context to fix m32 crashes
34767ec681f3Smrg- radeon,r200: use align_calloc for the context to fix m32 crashes
34777ec681f3Smrg- nouveau_vieux: use align_calloc for the context to fix m32 crashes
34787ec681f3Smrg- mesa: remove unnecessary NewState flagging for glPopAttrib(GL_ENABLE_BIT)
34797ec681f3Smrg- mesa: move fixed-func-related _mesa_update_state code closer together
34807ec681f3Smrg- mesa: split _NEW_LIGHT into 3 flags: _NEW_LIGHT_(FF_PROGRAM|CONSTANTS|STATE)
34817ec681f3Smrg- mesa: rework _MESA_NEW_NEED_EYE_COORDS to reduce fixed-func program updates
34827ec681f3Smrg- mesa: don't compute the inverted projection matrix if not used
34837ec681f3Smrg- mesa: don't compute the ModelView * Projection matrix if not used
34847ec681f3Smrg- mesa: add _NEW_MATERIAL to reduce the weight of _NEW_LIGHT_CONSTANTS
34857ec681f3Smrg- mesa: don't update derived material values in _mesa_update_state and elsewhere
34867ec681f3Smrg- mesa: remove _NEW_VARYING_VP_INPUTS in favor of _NEW_FF_(VERT|FRAG)_PROGRAM
34877ec681f3Smrg- mesa: remove _NEW_LIGHT_FF_PROGRAM in favor of _NEW_FF_(VERT|FRAG)_PROGRAM
34887ec681f3Smrg- mesa: don't push/pop ctx->Texture._* derived states
34897ec681f3Smrg- mesa: remove the fixed-func vert prog dependency on all texture states
34907ec681f3Smrg- mesa: sort state parameters for ffvp to enable better parameter merging
34917ec681f3Smrg- mesa: merge STATE_LIGHTPROD parameters
34927ec681f3Smrg- mesa: merge STATE_LIGHT_ATTENUATION and STATE_LIGHT_POSITION_* parameters
34937ec681f3Smrg- vbo: optimize copy_to_current functions
34947ec681f3Smrg- vbo: don't call update_color_material in copy_to_current if it's a no-op
34957ec681f3Smrg- mesa: be precise about when to set _NEW_CURRENT_ATTRIB and _NEW_MATERIAL
34967ec681f3Smrg- mesa: move _mesa_update_pixel out of _mesa_update_state
34977ec681f3Smrg- mesa: only update fixed-func programs on texture matrix enablement changes
34987ec681f3Smrg- mesa: don't update fixed-func vert prog on irrelevant _NEW_TRANSFORM changes
34997ec681f3Smrg- mesa: don't update fixed-func programs on irrelevant _NEW_POINT changes
35007ec681f3Smrg- mesa: don't update fixed-func programs on irrelevant _NEW_FOG changes
35017ec681f3Smrg- mesa: don't update fixed-func programs on irrelevant _NEW_RENDER_MODE changes
35027ec681f3Smrg- mesa: don't update the fixed-func frag prog on irrelevant _NEW_COLOR changes
35037ec681f3Smrg- mesa: don't update tnl spaces on irrelevant _NEW_POINT/TEXTURE_STATE changes
35047ec681f3Smrg- mesa: skip a subset of _mesa_update_state if no relevant flags are set
35057ec681f3Smrg- radeonsi: don't index si_context::shaders with enum gl_shader_stage
35067ec681f3Smrg- ac/llvm: fix ac_build_atomic_rmw with LLVM 13
35077ec681f3Smrg- radeonsi: don't crash on NULL images in si_check_needs_implicit_sync
35087ec681f3Smrg- ac/llvm: add support for 16-bit source operands for samplers
35097ec681f3Smrg- ac/llvm: implement 16-bit and 64-bit fpow correctly
35107ec681f3Smrg- ac/llvm: fix visit_load_ubo_buffer to use SMEM for 16 bits instead of VMEM
35117ec681f3Smrg- ac/llvm: add type parameter into ac_build_buffer_load to fix 16-bit TES inputs
35127ec681f3Smrg- ac/llvm: open code fpow on LLVM 12 using fmul.legacy
35137ec681f3Smrg- driconf: add performance tweaks for viewperf
35147ec681f3Smrg- ac/surface: select best swizzle mode for 3D sampler performance
35157ec681f3Smrg- ac,radeonsi: add sampler changes for Aldebaran
35167ec681f3Smrg- ac: set the TCC line size for Aldebaran
35177ec681f3Smrg- ac/llvm: unpack thread IDs on Aldebaran
35187ec681f3Smrg- ac: handle bigger instruction prefetch for Aldebaran
35197ec681f3Smrg- ac,radeonsi: use correct VGPR granularity on Aldebaran
35207ec681f3Smrg- ac: remove switch cases for pc_lines for compute-only chips
35217ec681f3Smrg- radeonsi: enable RGP on gfx10.3
35227ec681f3Smrg- gallium/u_threaded: don't sync in create_stream_output_target
35237ec681f3Smrg- gallium: add pipe_screen::num_contexts for skipping mutex locking in util_range
35247ec681f3Smrg- radeonsi: update pipe_screen::num_contexts
35257ec681f3Smrg- ac/llvm: handle demote in LLVM 13 that just added support for it
35267ec681f3Smrg- ac/gpu_info: fix more non-coherent RB and GL2 combinations
35277ec681f3Smrg- radeonsi: use pipe_sampler_state::border_color_is_integer to simplify stuff
35287ec681f3Smrg- mesa: fix Blender crash due to optimizations in buffer reference counting
35297ec681f3Smrg- mesa: add assertions for buffer reference counts
35307ec681f3Smrg- mesa: fix a oldNum typo in reallocation in _mesa_reserve_parameter_storage
35317ec681f3Smrg- mesa: don't overallocate ParameterValues 4 times (v2)
35327ec681f3Smrg- mesa: clear reserved parameter storage because it's stored in the shader cache
35337ec681f3Smrg- mesa: fix parameter reservation size
35347ec681f3Smrg- st/mesa: add a driconf option to transcode ETC2 to DXTC
35357ec681f3Smrg- util: add most missing formats with reversed RGB channel order
35367ec681f3Smrg- util: fail assertion in util_format_linear if it can't translate SRGB format
35377ec681f3Smrg- util: add new helper util_format_rgb_to_bgr
35387ec681f3Smrg- radeonsi: select the optimal micro tile mode at clear regardless of fast clear
35397ec681f3Smrg- radeonsi: add a fast path for MSAA resolving with RGB -> BGR swizzling
35407ec681f3Smrg- amd/addrlib: add back the incorrect original DCC checking
35417ec681f3Smrg- amd/addrlib: prevent defining regparm differently
35427ec681f3Smrg- amd/addrlib: define endianess differently
35437ec681f3Smrg- amd: update addrlib
35447ec681f3Smrg- ac/llvm: don't set unsupported xnack options to fix LLVM crashes on gfx6-8
35457ec681f3Smrg- radeonsi: disable sparse buffers on gfx7-8
35467ec681f3Smrg- radeonsi: set the clear/copy cache policy based on L2 cache size
35477ec681f3Smrg- radeonsi: don't insert start/stop pipeline stat events if it has no effect
35487ec681f3Smrg- radeonsi: never set DISABLE_WR_CONFIRM for CP DMA clears and copies
35497ec681f3Smrg- radeonsi: rename internal compute sync flags
35507ec681f3Smrg- radeonsi: remove unused SI_CP_DMA_SKIP_* definitions
35517ec681f3Smrg- radeonsi: merge CP DMA flags with internal compute flags
35527ec681f3Smrg- radeonsi: inline clear_buffer in si_screen_clear_buffer
35537ec681f3Smrg- radeonsi: set compute/cpdma sync flags in the outermost caller
35547ec681f3Smrg- radeonsi: reduce syncing in si_dcc_decompress
35557ec681f3Smrg- radeonsi: reduce syncing for initializing new buffers
35567ec681f3Smrg- radeonsi: reduce syncing in si_compute_expand_fmask when it's already idle
35577ec681f3Smrg- radeonsi: don't do an L2 flush in compute_do_clear_or_copy if we're not syncing
35587ec681f3Smrg- radeonsi: rename and apply SI_OP_CPDMA_SKIP_CACHE_FLUSH to compute as well
35597ec681f3Smrg- radeonsi: use the optimal packet order before draw packets for VS flushes too
35607ec681f3Smrg- radeonsi: add SI_CONTEXT_PFP_SYNC_ME to skip syncing PFP for image operations
35617ec681f3Smrg- radeonsi: return false from si_is_format_supported instead of NULL
35627ec681f3Smrg- radeonsi: don't use constbuf and set cache policy for 12-byte clear shader
35637ec681f3Smrg- radeonsi: don't use a constant buffer for the copy_image compute shader
35647ec681f3Smrg- radeonsi: decrease the maximum variable block size
35657ec681f3Smrg- radeonsi: pack the variable block size in one SGPR, 10 bits per component
35667ec681f3Smrg- amd: fix parsing the last dword of DMA_DATA packets
35677ec681f3Smrg- ac/surface: add CMASK info for level 0
35687ec681f3Smrg- radeonsi: determine accurately whether the framebuffer state has DCC MSAA
35697ec681f3Smrg- radeonsi: remove si_screen::dcc_msaa_allowed
35707ec681f3Smrg- radeonsi: parallelize CMASK and DCC clears
35717ec681f3Smrg- radeonsi: return success/failure from si_alloc_separate_cmask
35727ec681f3Smrg- radeonsi: add num_layers variable into si_do_fast_color_clear
35737ec681f3Smrg- radeonsi: group and parallelize all clears in si_texture_create_object
35747ec681f3Smrg- radeonsi: set better default depth clear value
35757ec681f3Smrg- radeonsi: enable HTILE with mipmapping on gfx9+
35767ec681f3Smrg- radeonsi: unset PIPE_CLEAR_* flags for non-existent buffers
35777ec681f3Smrg- radeonsi: turn the loops over color buffers into while loops in si_clear
35787ec681f3Smrg- radeonsi: don't use CP DMA for clears/copies except for very small ones
35797ec681f3Smrg- ac/surface/tests: move shareable code into ac_surface_test_common.h
35807ec681f3Smrg- radeonsi: fix si_compute_copy_image if DCC decompression happens before a copy
35817ec681f3Smrg- gallium/pb: pass an optional winsys pointer to the buffer destroy function
35827ec681f3Smrg- winsys/radeon: rename radeon_bo_reference -> radeon_ws_bo_reference
35837ec681f3Smrg- radeon_winsys.h: add a new function radeon_bo_reference that takes a winsys
35847ec681f3Smrg- radeon_winsys.h: add a winsys parameter to most winsys buffer functions
35857ec681f3Smrg- winsys/amdgpu: remove amdgpu_winsys_bo::ws
35867ec681f3Smrg- winsys/amdgpu: add amdgpu_cs::ws to reduce dereferences
35877ec681f3Smrg- gallium/pb: change pb_buffer::alignment to alignment_log2
35887ec681f3Smrg- gallium/pb: remove 8 bytes from pb_buffer by packing variables
35897ec681f3Smrg- winsys/amdgpu: remove another 8 bytes from amdgpu_winsys_bo by packing better
35907ec681f3Smrg- ac/surface: split dcc level info from surface_info to save space
35917ec681f3Smrg- ac/surface: overlap color and Z/S fields using a union in legacy_surf_layout
35927ec681f3Smrg- ac/surface: change legacy_surf_level::offset to 32-bit offset_256B shifted by 8
35937ec681f3Smrg- ac/surface: inline and reorder gfx9_surf_flags for better packing
35947ec681f3Smrg- ac/surface: pack gfx9_surf_meta_flags better
35957ec681f3Smrg- ac/surface: pack gfx9_surf_layout:resource_type better to save 8 bytes
35967ec681f3Smrg- ac/surface: pack radeon_surf::num_htile_levels better
35977ec681f3Smrg- ac/surface: pack alignments by storing log2 in radeon_surf
35987ec681f3Smrg- ac/surface: overlap color and Z/S fields using a union in gfx9_surf_layout
35997ec681f3Smrg- ac/surface: pack radeon_surf better
36007ec681f3Smrg- ac/surface: unify htile_levels and dcc_levels as meta_levels
36017ec681f3Smrg- ac/surface: unify htile_* and dcc_* fields as meta_* fields
36027ec681f3Smrg- ac/surface: use named "color and "zs" structures in unions
36037ec681f3Smrg- radeonsi: don't cache FMASK transactions from CB in L2
36047ec681f3Smrg- radeonsi: restructure DCC disablement into a switch
36057ec681f3Smrg- radeonsi: allow trivial DCC clears for shared textures with DCC constant encode
36067ec681f3Smrg- radeonsi: implement per-level DCC and CMASK fast clears for gfx10+
36077ec681f3Smrg- radeonsi: implement Z/S fast clear for non-zero mipmap levels
36087ec681f3Smrg- radeonsi: parallelize Z/S conversion into TC-compatible with fast color clears
36097ec681f3Smrg- radeonsi: clean up some mess around htile_stencil_disabled
36107ec681f3Smrg- radeonsi: add si_can_fast_clear_depth/stencil helpers
36117ec681f3Smrg- radeonsi: indent the code for TC-compatibility HTILE transition
36127ec681f3Smrg- radeonsi: implement fast Z/S clears using clear_buffer on HTILE
36137ec681f3Smrg- radeonsi: enable DCC fast clears for non-zero mipmap levels and 0/1 clear values
36147ec681f3Smrg- radeonsi: when transitioning to TC-compat HTILE, try to do a proper clear
36157ec681f3Smrg- radeonsi: do Z-only or S-only HTILE clear using a compute shader doing RMW
36167ec681f3Smrg- radeonsi: refine fast clears for small buffers, always use them for large HTILE
36177ec681f3Smrg- radeonsi: try to fix DCC coherency issues with DCC decompression
36187ec681f3Smrg- radeonsi: allow DCC_DECOMPRESS via CB with MSAA textures
36197ec681f3Smrg- ac/surface: only apply the 3D swizzle mode tuning to gfx10+
36207ec681f3Smrg- ac/surface/tests: test Sienna Cichlid and Navy Flounder
36217ec681f3Smrg- ac/surface/tests: fix a random segfault in the modifier test
36227ec681f3Smrg- amd/addrlib: expose DCC address equations to drivers
36237ec681f3Smrg- meson: add an optional OpenMP dependency for AMD tests
36247ec681f3Smrg- ac/surface: add a test of DccAddrFromCoord prototype outside of addrlib
36257ec681f3Smrg- ac/surface: limit the number of swizzle modes that can have displayable DCC
36267ec681f3Smrg- ac,radeonsi: rewrite DCC retiling without the DCC retile map
36277ec681f3Smrg- radeonsi: fix and enable full DCC with MSAA 2x on gfx9
36287ec681f3Smrg- radeonsi: implement DCC MSAA 4x/8x fast clear using DCC equations on gfx9
36297ec681f3Smrg- radeonsi: enable DCC for MSAA 4x and 8x on gfx9
36307ec681f3Smrg- radeonsi: move binding the internal compute shader into si_launch_grid_internal
36317ec681f3Smrg- radeonsi: unify internal compute with SSBOs in si_launch_grid_internal_ssbos
36327ec681f3Smrg- compiler: move TEXTURE_COORD/VERTEX_GENERIC_ATTRIB limits into shader_enums.h
36337ec681f3Smrg- nir: add src and dest types to all IO loads and stores for mediump
36347ec681f3Smrg- nir: add new VARYING_SLOTs and shader info for packed 16-bit varyings
36357ec681f3Smrg- nir: add many passes that lower and optimize 16-bit input/outputs and samplers
36367ec681f3Smrg- glsl: pack 16-bit uniforms in the NIR linker
36377ec681f3Smrg- mesa: implement glUniform for packed FP16 uniforms
36387ec681f3Smrg- mesa: implement glGetUniform for FP16 uniforms
36397ec681f3Smrg- mesa: implement glGetActiveUniform for FP16 uniforms
36407ec681f3Smrg- glsl: lower mediump uniforms to FP16 based on an option
36417ec681f3Smrg- gallium: add PIPE_SHADER_CAP_FP16_CONST_BUFFERS for FP16 uniforms
36427ec681f3Smrg- st/mesa: fix nir_lower_io if it's done right after IO vectorization
36437ec681f3Smrg- ac/llvm: implement 16-bit packed VS outputs and FS inputs
36447ec681f3Smrg- radeonsi: implement 16-bit VS->PS varyings
36457ec681f3Smrg- radeonsi: implement 16-bit VS inputs
36467ec681f3Smrg- radeonsi: optimize and legalize 16-bit samplers
36477ec681f3Smrg- radeonsi: kill 16-bit VS outputs if PS doesn't use them or doing Z-only draw
36487ec681f3Smrg- radeonsi: enable FP16 for mediump on gfx9+ if radeonsi_fp16=true
36497ec681f3Smrg- nir: handle mediump varyings in varying compaction helpers
36507ec681f3Smrg- radeonsi: don't decompress DCC for float formats in si_compute_copy_image
36517ec681f3Smrg- radeonsi: fix automatic DCC retiling after DCC clear and DCC decompression
36527ec681f3Smrg- radeonsi: fix automatic DCC retiling after compute image stores
36537ec681f3Smrg- radeonsi: make the gfx9 DCC MSAA clear shader depend on the number of samples
36547ec681f3Smrg- util: fix (re-enable) L3 cache pinning
36557ec681f3Smrg
36567ec681f3SmrgMarek Vasut (2):
36577ec681f3Smrg
36587ec681f3Smrg- compiler/nir: Increment shader input count and mark as used when adding new gl_PointCoord
36597ec681f3Smrg- etnaviv: Fix point sprite Z,W coordinate replacement
36607ec681f3Smrg
36617ec681f3SmrgMark Janes (12):
36627ec681f3Smrg
36637ec681f3Smrg- intel: Print GPU timing data based on INTEL_MEASURE
36647ec681f3Smrg- anv: enable timestamp for INTEL_MEASURE
36657ec681f3Smrg- anv: implement anv layer of INTEL_MEASURE
36667ec681f3Smrg- blorp: add hook for INTEL_MEASURE
36677ec681f3Smrg- anv: add hooks to call INTEL_MEASURE
36687ec681f3Smrg- iris: implement iris layer of INTEL_MEASURE
36697ec681f3Smrg- iris: add a iris_context reference to iris_batch
36707ec681f3Smrg- intel: stop tracking submission state in INTEL_MEASURE
36717ec681f3Smrg- intel: support secondary command buffers in INTEL_MEASURE
36727ec681f3Smrg- intel: combine common gather routines in INTEL_MEASURE
36737ec681f3Smrg- intel: check setuid before writing output file in INTEL_MEASURE
36747ec681f3Smrg- Revert "blorp/gen12: Don't use aux address if implicit CCS"
36757ec681f3Smrg
36767ec681f3SmrgMatt Turner (8):
36777ec681f3Smrg
36787ec681f3Smrg- docs/freedreno: Fix a few typos
36797ec681f3Smrg- turnip: Remove unused TU_DEBUG_IR3 flag
36807ec681f3Smrg- docs: Mark VK_KHR_maintenance1 as done on turnip
36817ec681f3Smrg- ci: Use CI_PROJECT_ROOT_NAMESPACE
36827ec681f3Smrg- tu: Skip tu_tiling_config_update_tile_layout() if not using gmem
36837ec681f3Smrg- ci: Disable panfrost g52
36847ec681f3Smrg- Remove Scons leftovers
36857ec681f3Smrg- ir3: Don't count (nopX) towards the wrong category
36867ec681f3Smrg
36877ec681f3SmrgMatti Hamalainen (2):
36887ec681f3Smrg
36897ec681f3Smrg- gallium: Fix broken trace XML output
36907ec681f3Smrg- gallium/tools: update trace scripts to Python 3
36917ec681f3Smrg
36927ec681f3SmrgMauro Rossi (29):
36937ec681f3Smrg
36947ec681f3Smrg- android: r600/sfn: add sfn_nir_lower_64bit.cpp to Makefile.sources
36957ec681f3Smrg- android: freedreno/hw/isa: Add description of ir3 ISA
36967ec681f3Smrg- android: freedreno/ir3: Switch over to new encoder/decoder
36977ec681f3Smrg- android: pan/mdg: create nir pass to lower image coord bitsize
36987ec681f3Smrg- android: intel: Print GPU timing data based on INTEL_MEASURE
36997ec681f3Smrg- android: anv: implement anv layer of INTEL_MEASURE
37007ec681f3Smrg- android: iris: implement iris layer of INTEL_MEASURE
37017ec681f3Smrg- android: radv: port to using common dispatch code.
37027ec681f3Smrg- android: radv: fix building error in radv_android.c
37037ec681f3Smrg- android: util/fossilize_db: add missing sources to Makefile.sources
37047ec681f3Smrg- android: ac/rgp: fix building error
37057ec681f3Smrg- android: mesa: Move the FXT1 compressor/decompressor to util/
37067ec681f3Smrg- android: pan/bi: reorder static dependencies in gallium/dri
37077ec681f3Smrg- driconf: avoid Non-ASCII character error in driconf_static.py
37087ec681f3Smrg- android: driconf: Generate a static table when no xmlconfig
37097ec681f3Smrg- android: i965: Rename files with "intel\_" prefix to "brw\_"
37107ec681f3Smrg- android: util: create some standalone compression helpers
37117ec681f3Smrg- android: anv: add libcutils shared dependency
37127ec681f3Smrg- android: r600/sfn: fix sfn_nir_algebraic.c gen rules
37137ec681f3Smrg- android: vulkan/util: add vk_descriptors.{c,h} to Makefile.sources
37147ec681f3Smrg- android: amd/addrlib: define endianess to build
37157ec681f3Smrg- android: panfrost: Use the blend shader cache attached to the device
37167ec681f3Smrg- vulkan/util: Fix implicit declaration of ffs for Android build
37177ec681f3Smrg- android: anv: Remove anv_intel.c from Makefile.sources
37187ec681f3Smrg- android: anv: fix build error in anv_android.c
37197ec681f3Smrg- compiler/glsl: fix include for Android build
37207ec681f3Smrg- android: panfrost/lib: add pan_cs.c to Makefile.sources
37217ec681f3Smrg- android: gallium/radeonsi: add nir include path
37227ec681f3Smrg- android: amd/common: add nir include path
37237ec681f3Smrg
37247ec681f3SmrgMichael Tang (5):
37257ec681f3Smrg
37267ec681f3Smrg- microsoft/compiler: Make resource_state_manager only build with_gallium_d3d12
37277ec681f3Smrg- util: Make os_read_file use O_BINARY on Windows
37287ec681f3Smrg- microsoft/spirv_to_dxil: Fix spirv2dxil I/O to use binary mode
37297ec681f3Smrg- microsoft/spirv_to_dxil: Add lowering pass to handle gl_PerVertex
37307ec681f3Smrg- microsoft/spirv_to_dxil: Add extra lowering functions according to the docs on nir_inline_functions
37317ec681f3Smrg
37327ec681f3SmrgMichel Dänzer (53):
37337ec681f3Smrg
37347ec681f3Smrg- ci: Remove .gitlab-ci/meson-build.bat
37357ec681f3Smrg- ci: Use meson test directly instead of ninja test
37367ec681f3Smrg- wsi/x11: Use get_screen_resources_current in wsi_x11_detect_xwayland
37377ec681f3Smrg- ci: Enable process isolation for softpipe & freedreno piglit jobs
37387ec681f3Smrg- ci: Use GNU time as meson test wrapper
37397ec681f3Smrg- ci: Run 'time' in the background and propagate signals to test process
37407ec681f3Smrg- ci: Fix MESA_TEMPLATES_COMMIT value
37417ec681f3Smrg- ci: Update to newer ci-fairy
37427ec681f3Smrg- ci: Set GALLIVM_PERF=no_filter_hacks for llvmpipe-piglit-quick_shader
37437ec681f3Smrg- ci: Set GALLIVM_PERF=no_filter_hacks for llvmpipe-piglit-quick_gl
37447ec681f3Smrg- ci: Set GALLIVM_PERF=nopt,no_filter_hacks for llvmpipe-gles2
37457ec681f3Smrg- ci: Use MESA\_ namespace for image variables in Windows jobs
37467ec681f3Smrg- ci: Use MESA_IMAGE_TAG everywhere
37477ec681f3Smrg- ci: Move FDO_DISTRIBUTION_TAG assignment to template
37487ec681f3Smrg- ci: Add and use .set-image template to construct docker image name
37497ec681f3Smrg- ci: Incorporate base image tag into dependent image tags
37507ec681f3Smrg- ci: Append build image tag to LAVA tag used for minio path
37517ec681f3Smrg- ci: Add trailing slash to path for documentation preview
37527ec681f3Smrg- ci: Restrict meson-gallium job to gstreamer runners
37537ec681f3Smrg- ci: Disable scons-win64 job
37547ec681f3Smrg- ci: Move meson-build.sh to meson/build.sh
37557ec681f3Smrg- ci: Drop SIGINT handling from meson test wrapper script
37567ec681f3Smrg- ci: Move /usr/bin/time check from meson test wrapper to build script
37577ec681f3Smrg- aco/tests: Use _exit in child process
37587ec681f3Smrg- ci: Add strace to the x86_build docker image
37597ec681f3Smrg- ci: Run meson tests in strace if it's available and can be used
37607ec681f3Smrg- ci: Don't run meson tests in strace for meson-mingw32-x86_64 job
37617ec681f3Smrg- intel/tools: Use subprocess.Popen to read output directly from a pipe
37627ec681f3Smrg- Revert "ci: Restrict meson-gallium job to gstreamer runners"
37637ec681f3Smrg- glcpp: Fully initialize struct gl_context
37647ec681f3Smrg- ci: Disable valgrind in some build jobs
37657ec681f3Smrg- glsl/tests: Bump glcpp valgrind test timeout to 240 seconds
37667ec681f3Smrg- glsl/tests: Don't use tempfiles
37677ec681f3Smrg- glsl/tests: Use exit code 126 to detect valgrind errors
37687ec681f3Smrg- Revert "ci: disable glcpp tests for now"
37697ec681f3Smrg- Revert "meson: add enable-glcpp-tests option"
37707ec681f3Smrg- Revert "glsl/test: Don't run whitespace tests in parallel"
37717ec681f3Smrg- ci: Remove INCLUDE_PIGLIT
37727ec681f3Smrg- ci: Build ARM baremetal rootfs in native container
37737ec681f3Smrg- ci: Merge ARM testing docker images to a single arm_test one
37747ec681f3Smrg- wsi/x11: Wait for fences with IMMEDIATE on Xwayland
37757ec681f3Smrg- ci: Fix HTML summary path for piglit OpenCL job artifacts
37767ec681f3Smrg- intel/blorp: Initialize texture_data[0]
37777ec681f3Smrg- ci: Do not install armhf LLVM packages
37787ec681f3Smrg- ci: Bump LLVM/clang from 10 to 11
37797ec681f3Smrg- ci: Move docker images from Debian buster to bullseye
37807ec681f3Smrg- ci: Install librenderdoc from Debian bullseye
37817ec681f3Smrg- ci: Install spirv-tools from Debian bullseye
37827ec681f3Smrg- ci: Install llvm-spirv from Debian bullseye
37837ec681f3Smrg- ci: Install GLVND from Debian bullseye
37847ec681f3Smrg- ci: Install Rust & cargo from Debian for x86_test* images
37857ec681f3Smrg- ci: Do not append ci-templates commit hash to Windows docker image tag
37867ec681f3Smrg- ci: Update to latest ci-templates
37877ec681f3Smrg
37887ec681f3SmrgMichel Zou (25):
37897ec681f3Smrg
37907ec681f3Smrg- vulkan/lavapipe: add missing VKAPI_ATTR/CALL
37917ec681f3Smrg- vulkan: Fix windows api conflict
37927ec681f3Smrg- zink: Fix win32 build
37937ec681f3Smrg- vulkan: Fix windows api conflict
37947ec681f3Smrg- meson: invalid keyword argument dependencies
37957ec681f3Smrg- zink: fix win32 build
37967ec681f3Smrg- util: fix gcc vsnprintf overflow
37977ec681f3Smrg- glapi: keep declspec(thread) msvc-specific
37987ec681f3Smrg- vulkan: implement wsi_win32 backend
37997ec681f3Smrg- lavapipe: add mingw32 def file
38007ec681f3Smrg- lavapipe: set empty dll prefix
38017ec681f3Smrg- gallium: remove DROP_PIPE_LOADER_MISC
38027ec681f3Smrg- meson/xmlconfig: win32 regex fallback
38037ec681f3Smrg- meson: detect winflex/bison only on native win32
38047ec681f3Smrg- turnip: update features.txt
38057ec681f3Smrg- lavapipe: update features.txt
38067ec681f3Smrg- vulkan: fix CreateRenderPass prototype
38077ec681f3Smrg- swr: extern declaration for win32 intrinsics
38087ec681f3Smrg- swr: fix win32 intrinsics
38097ec681f3Smrg- swr: Fix SWR_CONTEXT pre-declaration
38107ec681f3Smrg- swr: fix unused SplitString warning
38117ec681f3Smrg- swr: fix deprecated llvm 11 declaration warning
38127ec681f3Smrg- swr: fix array-bounds warning
38137ec681f3Smrg- lavapipe: Fix type narrowing
38147ec681f3Smrg- docs: missing lvp win32surface ext in features.txt
38157ec681f3Smrg
38167ec681f3SmrgMike Blumenkrantz (775):
38177ec681f3Smrg
38187ec681f3Smrg- zink: clamp sampler+samplerview limits
38197ec681f3Smrg- util/hash_table: optimize rehash for empty table and no-func clears
38207ec681f3Smrg- util/set: optimize rehash for empty table and no-func clears
38217ec681f3Smrg- util/set: add the found param to search_or_add
38227ec681f3Smrg- util/set: split off create() into an init() function
38237ec681f3Smrg- zink: optimize renderpass hash table
38247ec681f3Smrg- nir/lower_uniforms_to_ubo: set explicit_binding on uniform_0
38257ec681f3Smrg- zink: add spirv builder function for runtime array type
38267ec681f3Smrg- zink: add util function for emitting ntv atomic ops
38277ec681f3Smrg- zink: add set_shader_buffers pipe_context method
38287ec681f3Smrg- zink: hook up ssbo shader bindings
38297ec681f3Smrg- zink: emit ssbo variables in ntv
38307ec681f3Smrg- zink: modify ubo loading in ntv to work for ssbos
38317ec681f3Smrg- zink: start supporting atomic shader ops
38327ec681f3Smrg- zink: split UBOs and samplers into 'read' batch references during draw
38337ec681f3Smrg- zink: flag ssbo buffer resources as having pending writes on batch
38347ec681f3Smrg- zink: add more usage bits for buffer types
38357ec681f3Smrg- zink: partially enable SSBO pipe cap
38367ec681f3Smrg- zink: only emit streamout targets during draw if we have them
38377ec681f3Smrg- zink: rework framebuffer state
38387ec681f3Smrg- zink: add batch flag for checking renderpass state
38397ec681f3Smrg- zink: remove renderpass refcounting
38407ec681f3Smrg- zink: ralloc zink_framebuffer structs
38417ec681f3Smrg- zink: rename param in zink_create_framebuffer
38427ec681f3Smrg- zink: use 'fb' variable name for zink_framebuffer objects in zink_framebuffer.c
38437ec681f3Smrg- zink: decouple renderpass from framebuffer state
38447ec681f3Smrg- zink: move zink_clear to zink_clear.c
38457ec681f3Smrg- zink: start to refactor clearing
38467ec681f3Smrg- zink: handle clears with scissor regions
38477ec681f3Smrg- zink: break out scissor region testing for clear functions
38487ec681f3Smrg- zink: break out color/zs no_rp clear into separate functions
38497ec681f3Smrg- zink: break out some of the u_blitter setup into util function
38507ec681f3Smrg- zink: add a pipe_context::clear_texture hook
38517ec681f3Smrg- zink: enable PIPE_CAP_CLEAR_TEXTURE
38527ec681f3Smrg- zink: reduce blendfactor when alpha_to_one is set
38537ec681f3Smrg- zink: tweak xfb slot mapping in ntv
38547ec681f3Smrg- zink: process ubos with location values set as long as they're actually ubos
38557ec681f3Smrg- zink: add VK_KHR_driver_properties
38567ec681f3Smrg- zink: enable WSI-faking for RADV too
38577ec681f3Smrg- zink: rename zink_context::\*image_views -> sampler_views
38587ec681f3Smrg- zink: add ntv util function for getting image type
38597ec681f3Smrg- zink: rewrite image/sampler glsl -> vk type functions for robustness
38607ec681f3Smrg- zink: add spirv_builder function for hexops
38617ec681f3Smrg- zink: add spirv builder functions for image ops
38627ec681f3Smrg- zink: add ntv function for emitting variable access decorations
38637ec681f3Smrg- zink: verify format caps and add storage image usage when possible in creation
38647ec681f3Smrg- zink: add 'has_draw' flag to batch struct
38657ec681f3Smrg- zink: add a pipe_context::memory_barrier hook
38667ec681f3Smrg- zink: add shader image support to zink_binding()
38677ec681f3Smrg- zink: add new 'sampler_types' variable to ntv_context struct
38687ec681f3Smrg- zink: handle image variable types in ntv
38697ec681f3Smrg- zink: handle more atomic ops in ntv
38707ec681f3Smrg- zink: handle nir_intrinsic_memory_barrier in ntv
38717ec681f3Smrg- zink: add nir_var_uniform case to get_storage_class()
38727ec681f3Smrg- zink: expand ntv array derefs to track image derefs
38737ec681f3Smrg- zink: add handling for all basic image ops in ntv
38747ec681f3Smrg- zink: enable early frag test execution in ntv when necessary
38757ec681f3Smrg- zink: enable image caps in ntv when a shader has images
38767ec681f3Smrg- zink: handle image descriptors during zink_shader creation
38777ec681f3Smrg- zink: break out bufferview creation into separate function
38787ec681f3Smrg- zink: add a pipe_context::set_shader_images hook
38797ec681f3Smrg- zink: handle shader image descriptor updates during draw
38807ec681f3Smrg- zink: check if multisample support exists for shader image formats
38817ec681f3Smrg- zink: export shader image caps using features
38827ec681f3Smrg- zink: GLSL 420
38837ec681f3Smrg- docs/features: mark off GL 4.2 for zink
38847ec681f3Smrg- zink: set PIPE_CAP_COPY_BETWEEN_COMPRESSED_AND_PLAIN_FORMATS
38857ec681f3Smrg- zink: force per-sample interpolation
38867ec681f3Smrg- zink: set PIPE_CAP_FRAMEBUFFER_NO_ATTACHMENT
38877ec681f3Smrg- zink: set PIPE_CAP_ROBUST_BUFFER_ACCESS_BEHAVIOR
38887ec681f3Smrg- zink: support VK_IMAGE_LAYOUT_DEPTH_STENCIL_READ_ONLY_OPTIMAL usage
38897ec681f3Smrg- zink: add layout member to barrier setup in draw
38907ec681f3Smrg- zink: support PIPE_FORMAT_X24S8_UINT
38917ec681f3Smrg- zink: check correct caps for PIPE_CAP_IMAGE_LOAD_FORMATTED
38927ec681f3Smrg- zink: enable PIPE_CAP_SAMPLER_VIEW_TARGET
38937ec681f3Smrg- gallium/u_inlines: add helper for simplifying pipe_context::resource_copy_region
38947ec681f3Smrg- zink: add function for waiting on a specific batch's fence
38957ec681f3Smrg- zink: don't force a renderpass start when setting framebuffer state
38967ec681f3Smrg- spirv: handle NoContraction in GLSL450 alu ops
38977ec681f3Smrg- zink: fix streamout for clipdistance
38987ec681f3Smrg- zink: add a VkExternalMemoryImageCreateInfo for PIPE_BIND_SHARED images
38997ec681f3Smrg- zink: set lower_mul_2x32_64 when 64bit int support is available
39007ec681f3Smrg- zink: enable PIPE_CAP_TEXTURE_MIRROR_CLAMP_TO_EDGE
39017ec681f3Smrg- zink: flag gfx pipeline dirty using newer mechanism
39027ec681f3Smrg- zink: guarantee surface lifetimes for shader images
39037ec681f3Smrg- st/bitmap: use GL_CLAMP_TO_EDGE for bitmap samplers
39047ec681f3Smrg- st/drawpixels: use GL_CLAMP_TO_EDGE instead of GL_CLAMP for samplers
39057ec681f3Smrg- zink: don't export PIPE_CAP_MAX_COMBINED_SHADER_BUFFERS value
39067ec681f3Smrg- zink: add spirv builder function for OpAtomicStore
39077ec681f3Smrg- zink: flag ssbo buffer resources as having pending writes per stage
39087ec681f3Smrg- zink: handle null ssbo attachments without crashing
39097ec681f3Smrg- zink: handle more ssbo ops in ntv
39107ec681f3Smrg- zink: rework ssbo indexing and binding
39117ec681f3Smrg- zink: support nir_intrinsic_store_ssbo
39127ec681f3Smrg- zink: implement get_ssbo_size nir intrinsic
39137ec681f3Smrg- zink: flatten out ssbo/ubo variable decls in ntv
39147ec681f3Smrg- zink: export ssbo caps
39157ec681f3Smrg- Revert "glcpp: disable 'windows' tests"
39167ec681f3Smrg- meson: add enable-glcpp-tests option
39177ec681f3Smrg- ci: disable glcpp tests for now
39187ec681f3Smrg- zink: add barrier helper for buffer resources
39197ec681f3Smrg- zink: add a stage param for buffer resource barriers
39207ec681f3Smrg- zink: add helper function for checking if access flags include write access
39217ec681f3Smrg- zink: improve barrier helper for buffer resources and add check for barrier need
39227ec681f3Smrg- zink: flag previous vertex stages as dirty when toggling a later stage
39237ec681f3Smrg- zink: add shader key for vs shaders
39247ec681f3Smrg- zink: flag shaders as needing update when clip_halfz changes
39257ec681f3Smrg- zink: move tess/geom shader info to vs shader key
39267ec681f3Smrg- glsl: support 64bit integer loop iterators
39277ec681f3Smrg- radv: print image array size in debug mode
39287ec681f3Smrg- zink: move maintenance2 extension to right file
39297ec681f3Smrg- zink: unify shader image unbind codepath
39307ec681f3Smrg- zink: be a little more precise about query types in one conditional
39317ec681f3Smrg- radv: null bo list pointer for null descriptors on update
39327ec681f3Smrg- radv: zero the bo descriptor array when allocating a new set
39337ec681f3Smrg- zink: force 4 component formats for samplerview/render textures
39347ec681f3Smrg- zink: support nir_intrinsic_memory_barrier_buffer
39357ec681f3Smrg- zink: add defines for compute batch and gfx batch count
39367ec681f3Smrg- zink: bump resource usage flags to allow 5 batches
39377ec681f3Smrg- zink: make get_resource_usage() public
39387ec681f3Smrg- zink: make zink_batch_reference_resource_rw return usage info
39397ec681f3Smrg- zink: wait on compute batch when necessary during transfer map
39407ec681f3Smrg- zink: add spirv_builder function for emitting a 3word literal exec mode
39417ec681f3Smrg- zink: handle COMPUTE bindings in compiler/ntv
39427ec681f3Smrg- zink: handle COMPUTE setup in ntv
39437ec681f3Smrg- zink: handle COMPUTE glsl variables
39447ec681f3Smrg- zink: implement shared load/store nir ops in ntv
39457ec681f3Smrg- zink: add handling for shared atomic ops in ntv
39467ec681f3Smrg- zink: handle nir_intrinsic_memory_barrier_shared in ntv
39477ec681f3Smrg- zink: ignore compute batch when starting/ending batches
39487ec681f3Smrg- zink: take a pipe_reference param in zink_batch_reference_program
39497ec681f3Smrg- zink: refactor batch creation
39507ec681f3Smrg- zink: make allocate_descriptor_set() take more params instead of a gfx_program
39517ec681f3Smrg- zink: explicitly get shader stage from shader during binding setup in draw
39527ec681f3Smrg- zink: rename pipeline_cache_entry -> gfx_pipeline_cache_entry
39537ec681f3Smrg- zink: add compute programs and pipelines
39547ec681f3Smrg- zink: break out descriptor updating into separate function
39557ec681f3Smrg- zink: setup compute batch and add handling
39567ec681f3Smrg- zink: handle memory barriers for compute batch
39577ec681f3Smrg- zink: handle descriptor set updates for compute operations
39587ec681f3Smrg- zink: flush gfx/compute batches when the other pipeline needs resource sync
39597ec681f3Smrg- zink: add launch_grid pipe_context hook for compute handling
39607ec681f3Smrg- zink: export compute-specific shader/compute caps
39617ec681f3Smrg- zink: enable compute
39627ec681f3Smrg- zink: GLSL 430
39637ec681f3Smrg- features: mark off GL 4.3 for zink
39647ec681f3Smrg- zink: add spirv_builder wrapper for vote intrinsics
39657ec681f3Smrg- zink: handle vote intrinsics in ntv
39667ec681f3Smrg- zink: rework viewport handling
39677ec681f3Smrg- zink: handle nir_texop_texture_samples
39687ec681f3Smrg- zink: add a texture barrier hook
39697ec681f3Smrg- zink: use = and not \|= for VkMemoryPropertyFlags during resource creation
39707ec681f3Smrg- zink: set HOST_COHERENT bit for coherent resource creation
39717ec681f3Smrg- zink: track persistent, non-coherent, writable transfer map count for resources
39727ec681f3Smrg- zink: slightly refactor batch resource referencing in update_descriptors()
39737ec681f3Smrg- zink: flush all resources with persistent maps on work batch before draw/compute
39747ec681f3Smrg- zink: enable PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT
39757ec681f3Smrg- zink: rename zink_context::dummy_buffer -> dummy_vertex_buffer
39767ec681f3Smrg- zink: create dummy xfb buffer
39777ec681f3Smrg- zink: handle null xfb buffers
39787ec681f3Smrg- zink: use better mapping for PIPE_FORMAT_X24S8_UINT
39797ec681f3Smrg- zink: handle nir_intrinsic_load_helper_invocation
39807ec681f3Smrg- zink: fix xfb buffer refcounting
39817ec681f3Smrg- zink: add PIPE_BIND_QUERY_BUFFER to the all-purpose resource creation path
39827ec681f3Smrg- zink: add a get_query_result_resource hook
39837ec681f3Smrg- zink: enable PIPE_CAP_TGSI_ARRAY_COMPONENTS
39847ec681f3Smrg- zink: enable PIPE_CAP_QUERY_BUFFER_OBJECT
39857ec681f3Smrg- zink: GLSL 440
39867ec681f3Smrg- zink: enable PIPE_CAP_CONDITIONAL_RENDER_INVERTED
39877ec681f3Smrg- zink: enable PIPE_CAP_CLIP_HALFZ
39887ec681f3Smrg- zink: enable PIPE_CAP_TGSI_TXQS
39897ec681f3Smrg- zink: enable PIPE_CAP_TEXTURE_BARRIER
39907ec681f3Smrg- zink: GLSL 450
39917ec681f3Smrg- features: mark off GL 4.5 for zink
39927ec681f3Smrg- zink: add spirv interfaces for bo and image/sampler/push variables
39937ec681f3Smrg- zink: lower flrp64 and ffma64 when in softfp64 mode
39947ec681f3Smrg- zink: always use query->type for starting/stopping xfb queries
39957ec681f3Smrg- zink: make the xfb_query_pool into an array
39967ec681f3Smrg- zink: break out cpu query reading for qbos into separate function
39977ec681f3Smrg- zink: put SO_OVERFLOW queries on the primgen list
39987ec681f3Smrg- zink: support SO_OVERFLOW pipe query types
39997ec681f3Smrg- zink: fix streamout for tess stage
40007ec681f3Smrg- zink: flag exact alu op results in ntv with NoContraction
40017ec681f3Smrg- zink: unset generated TCS if its parent TESS is unset
40027ec681f3Smrg- zink: hook up cs invocation queries to the compute batch
40037ec681f3Smrg- zink: add support for pipeline statistics queries
40047ec681f3Smrg- zink: fix slot mapping for legacy gl io with tess stages
40057ec681f3Smrg- zink: handle 1bit undef values in ntv
40067ec681f3Smrg- zink: add handling for ARB_shader_draw_parameters variables in ntv
40077ec681f3Smrg- zink: create a struct for tracking push constant layout
40087ec681f3Smrg- zink: rework tcs injection to be more compatible with new push const struct
40097ec681f3Smrg- zink: add push constant value to indicate whether the current draw is indexed
40107ec681f3Smrg- zink: wrap shader gl_BaseVertex access with a bcsel based on push constant state
40117ec681f3Smrg- zink: add a draw_id param to vs push constants
40127ec681f3Smrg- zink: add a vs shader key for rewriting gl_DrawID
40137ec681f3Smrg- zink: break out push constant creation in compiler and add drawid value
40147ec681f3Smrg- zink: rewrite drawid based on shader key value
40157ec681f3Smrg- zink: add util function for submitting the compute batch
40167ec681f3Smrg- zink: enable PIPE_CAP_TGSI_VOTE
40177ec681f3Smrg- zink: enable PIPE_CAP_DRAW_PARAMETERS
40187ec681f3Smrg- zink: enable PIPE_CAP_POLYGON_OFFSET_CLAMP
40197ec681f3Smrg- zink: enable PIPE_CAP_QUERY_SO_OVERFLOW
40207ec681f3Smrg- zink: enable pipeline statistics cap
40217ec681f3Smrg- zink: PIPE_CAP_GL_SPIRV
40227ec681f3Smrg- zink: GLSL 460
40237ec681f3Smrg- features: mark off GL 4.6 and ES 3.1 for zink
40247ec681f3Smrg- zink: support nir_intrinsic_group_memory_barrier
40257ec681f3Smrg- zink: fix device codegen extension detection
40267ec681f3Smrg- zink: add nir_intrinsic_memory_barrier_image handling
40277ec681f3Smrg- zink: use nir_shader_instructions_pass for draw params pass
40287ec681f3Smrg- zink: add flag for no-oping fence finish
40297ec681f3Smrg- zink: hook up valid_buffer_range for buffer resources using util_range
40307ec681f3Smrg- zink: create a VkPipelineCache object on the screen and use it
40317ec681f3Smrg- zink: add a disk cache for pipeline objects
40327ec681f3Smrg- gallium/trace: add a pipe_screen::get_compiler_options method
40337ec681f3Smrg- zink: handle dual blending override from driconf
40347ec681f3Smrg- zink: move command pool to the batch
40357ec681f3Smrg- nir/lower_tex: rewrite tex/txb -> txd/txl before saturating srcs
40367ec681f3Smrg- mesa/st: add pipe_sampler_state::border_color_is_integer
40377ec681f3Smrg- mesa/st: add PIPE_CAP_GL_CLAMP
40387ec681f3Smrg- zink: enable GL_CLAMP cap
40397ec681f3Smrg- gallium/trace: remove transfer_map assert
40407ec681f3Smrg- zink: add helper function for getting pipeline stage from shader stage
40417ec681f3Smrg- zink: set buffer resource barriers for descriptor resources in update_descriptors()
40427ec681f3Smrg- zink: rework xfb counter resource barriers
40437ec681f3Smrg- zink: rework xfb barrier transitions when reusing as vertex inputs
40447ec681f3Smrg- zink: remove aspect param from zink_resource_barrier
40457ec681f3Smrg- zink: add a VkPipelineStageFlags param to zink_resource_barrier()
40467ec681f3Smrg- zink: add helper for image resource barriers and avoid unnecessary barriers
40477ec681f3Smrg- zink: use define for max descriptor array size
40487ec681f3Smrg- zink: add generic wrapper for checking whether a resource needs a barrier
40497ec681f3Smrg- zink: avoid emitting unnecessary pipeline barriers during update_descriptors
40507ec681f3Smrg- zink: break out barrier transitioning in update_descriptors
40517ec681f3Smrg- zink: combine resource barriers where possible during update_descriptors
40527ec681f3Smrg- zink: take struct zink_batch param instead of direct cmdbuf in barrier helpers
40537ec681f3Smrg- zink: assert batch is not in a renderpass when emitting pipeline barrier
40547ec681f3Smrg- zink: add barriers for index and draw param buffers
40557ec681f3Smrg- zink: add access param for image resource barriers
40567ec681f3Smrg- zink: add access info for update_descriptor image barriers
40577ec681f3Smrg- zink: add batch references for resources in clear functions
40587ec681f3Smrg- zink: improve barrier usage for clear functions
40597ec681f3Smrg- zink: zink_resource_barrier -> zink_resource_image_barrier
40607ec681f3Smrg- zink: add general zink_resource_barrier() wrapper
40617ec681f3Smrg- zink: be more explicit with image barriers for copy operations
40627ec681f3Smrg- zink: fix surface creation for cube slices
40637ec681f3Smrg- zink: tag some missing ES features
40647ec681f3Smrg- zink: update relnotes
40657ec681f3Smrg- zink: just call context destructor on creation fail
40667ec681f3Smrg- zink: add buffer barriers for resource_copy_region
40677ec681f3Smrg- zink: break out buffer copying into util function with batch param
40687ec681f3Smrg- zink: just end the current renderpass in zink_batch_no_rp()
40697ec681f3Smrg- zink: break out even more of zink_blit state saving
40707ec681f3Smrg- zink: use vkGetFenceStatus when we're obviously checking for status
40717ec681f3Smrg- zink: fix buffer resource usage flags
40727ec681f3Smrg- zink: break out query result buffer copying into util function
40737ec681f3Smrg- zink: simplify some of the qbo direct buffer write code
40747ec681f3Smrg- zink: better handling for availability queries on qbos when query/resource is busy
40757ec681f3Smrg- zink: improve batch flushing for queries when compute batches are involved
40767ec681f3Smrg- zink: always use 64bit flag for query results
40777ec681f3Smrg- zink: handle scissor+viewport states dynamically if extension is available
40787ec681f3Smrg- zink: remove 'scissors' member of viewport state
40797ec681f3Smrg- zink: always set VK_BUFFER_USAGE_UNIFORM_TEXEL_BUFFER_BIT for non-staging resources
40807ec681f3Smrg- zink: add available|visible masks to all barriers in ntv
40817ec681f3Smrg- zink: set conformant ubo/ssbo size limits
40827ec681f3Smrg- zink: destroy renderpass objects on context destroy
40837ec681f3Smrg- zink: rename 'has_draw' flag on batches and set it when the batch is used
40847ec681f3Smrg- zink: move gfx pipeline creation closer to the bind point
40857ec681f3Smrg- zink: only reset pipeline hash conditionally when updating fb state
40867ec681f3Smrg- zink: simplify barrier usage
40877ec681f3Smrg- zink: beef up zink_transfer_flush_region
40887ec681f3Smrg- zink: only wait on last write-batch for resources during transfer_map
40897ec681f3Smrg- zink: change some transfer_map cases of waiting on cs batch to flushing cs
40907ec681f3Smrg- zink: handle PIPE_TRANSFER_DISCARD_WHOLE_RESOURCE in transfer_map for buffers
40917ec681f3Smrg- zink: update ci results
40927ec681f3Smrg- zink: disable some builtin-gl-sample-mask sample shading tests on ci
40937ec681f3Smrg- zink: actually disable sample mask tests on ci
40947ec681f3Smrg- lavapipe: fix color-only renderpass clears
40957ec681f3Smrg- zink: ralloc the main context
40967ec681f3Smrg- zink: create framebuffer and renderpass objects just before vkCmdBeginRenderPass()
40977ec681f3Smrg- zink: defer pipe_context::clear calls when not currently in a renderpass
40987ec681f3Smrg- zink: also defer fb clears when conditional render is active
40997ec681f3Smrg- zink: break out region overlap testing function into helper
41007ec681f3Smrg- zink: add helper for converting pipe_box -> u_rect
41017ec681f3Smrg- zink: add another helper for checking whether one rect covers another
41027ec681f3Smrg- zink: break out fb clear apply into helper function
41037ec681f3Smrg- zink: add helper for applying/discarding clears based on a rect
41047ec681f3Smrg- zink: discard pending clears during blit/copy if we'll overwrite the data
41057ec681f3Smrg- zink: add yet another clear helper, this time for applying overlap regions
41067ec681f3Smrg- zink: optimize the remaining read cases of applying pending clear calls
41077ec681f3Smrg- zink: move all the clear stuff to zink_clear.h
41087ec681f3Smrg- zink: always do full-fb clears in renderpass begin when possible
41097ec681f3Smrg- zink: ci changes
41107ec681f3Smrg- zink: improve descriptor set oom handling
41117ec681f3Smrg- zink: ci updates
41127ec681f3Smrg- zink: set PIPE_CAP_TEXTURE_BORDER_COLOR_QUIRK
41137ec681f3Smrg- zink: force 128 fs input components on intel drivers
41147ec681f3Smrg- zink: add some spirv builder functions for spec constants
41157ec681f3Smrg- zink: support gl_LocalGroupSize
41167ec681f3Smrg- zink: add more ci flakes
41177ec681f3Smrg- util/bitscan: add u_foreach_bit macros
41187ec681f3Smrg- v3dv: remove for_each_bit() macro
41197ec681f3Smrg- radv: for_each_bit -> foreach_bit
41207ec681f3Smrg- freedreno/vulkan: for_each_bit -> foreach_bit
41217ec681f3Smrg- anv: for_each_bit -> foreach_bit
41227ec681f3Smrg- zink: use 0 as default for spec constants
41237ec681f3Smrg- zink: no-op descriptor updating for draws without descriptors
41247ec681f3Smrg- nir/texcoord_replace: add a yinvert param
41257ec681f3Smrg- zink: store prim mode to context during draw
41267ec681f3Smrg- zink: handle point sprite
41277ec681f3Smrg- zink: ci updates
41287ec681f3Smrg- zink: avoid memset during update_descriptors() for resources refs
41297ec681f3Smrg- zink: move samplerview referencing around in update_descriptors()
41307ec681f3Smrg- zink: reorder zink_bind_vertex_buffers()
41317ec681f3Smrg- zink: create a single fence per batch on startup and then reuse
41327ec681f3Smrg- zink: only flush batches in pipe_context::flush if they actually have work
41337ec681f3Smrg- zink: add a define for compute batch count
41347ec681f3Smrg- zink: add util function for returning previous batch
41357ec681f3Smrg- zink: handle PIPE_FLUSH_DEFERRED
41367ec681f3Smrg- zink: handle VK_IMAGE_LAYOUT_PRESENT_SRC_KHR barriers
41377ec681f3Smrg- zink: set VK_IMAGE_LAYOUT_PRESENT_SRC_KHR on fb resources at eof flush
41387ec681f3Smrg- zink: setup CmdBindVertexBuffers2EXT member in screen for dynamic state
41397ec681f3Smrg- zink: make dynamic state usage in pipeline creation more explicit/flexible
41407ec681f3Smrg- zink: use dynamic vertex buffer strides
41417ec681f3Smrg- zink: rename zink_context::buffers -> vertex_buffers (and usage mask)
41427ec681f3Smrg- zink: add zink_program struct as a base class for compute/gfx structs
41437ec681f3Smrg- zink: use zink_program in zink_batch_reference_program()
41447ec681f3Smrg- zink: ralloc zink program structs
41457ec681f3Smrg- zink: unref programs last in batch reset
41467ec681f3Smrg- zink: properly size descriptorset layout binding stack array
41477ec681f3Smrg- zink: increment batch->descs_used during update_descriptors flushing
41487ec681f3Smrg- zink: do batch-program tracking after possibly cycling batch in update_descriptors()
41497ec681f3Smrg- zink: add spirv builder methods for OpImageQueryLevels
41507ec681f3Smrg- zink: hook up nir_texop_query_levels
41517ec681f3Smrg- zink: relax tessellation shader reqs
41527ec681f3Smrg- zink: ci updates
41537ec681f3Smrg- zink: fix dynamic bo lowering for ssbo stores
41547ec681f3Smrg- zink: pre-fetch all format properties during screen init
41557ec681f3Smrg- zink: use pre-fetched format properties everywhere
41567ec681f3Smrg- zink: don't start renderpasses during descriptor update
41577ec681f3Smrg- zink: add more usage bits for buffer resource creation
41587ec681f3Smrg- zink: handle null src for fb refs
41597ec681f3Smrg- zink: track all framebuffers per batch
41607ec681f3Smrg- zink: store total memory size on zink_screen
41617ec681f3Smrg- zink: track resource mem usage per batch
41627ec681f3Smrg- zink: force batch flush if batches are using more than 1/10 total system memory
41637ec681f3Smrg- mesa/st: clamp scissored clear regions to fb size
41647ec681f3Smrg- mesa/st: no-op scissored clear calls with size zero
41657ec681f3Smrg- zink: handle GLSL_SAMPLER_DIM_EXTERNAL in ntv
41667ec681f3Smrg- zink: ci updates
41677ec681f3Smrg- mesa/st: even better no-oping for clears
41687ec681f3Smrg- zink: apply only the pending zs clear bits during deferred clears
41697ec681f3Smrg- zink: enable PIPE_CAP_CLEAR_SCISSORED
41707ec681f3Smrg- zink: export PIPE_CAP_TGSI_VS_LAYER_VIEWPORT
41717ec681f3Smrg- zink: use staging resource for write transfer_map in order to not stall
41727ec681f3Smrg- zink: ci updates
41737ec681f3Smrg- zink: rewrite macro for getting KHR device functions
41747ec681f3Smrg- zink: add vk/spirv caps/extension for shader LAYER variable
41757ec681f3Smrg- zink: remove ntv streamout assert
41767ec681f3Smrg- zink: fix streamout emission for super-enhanced layouts
41777ec681f3Smrg- zink: fix slot mapping for fat io variables
41787ec681f3Smrg- zink: fix location usage for explicit xfb outputs
41797ec681f3Smrg- zink: run more nir passes for tess shaders
41807ec681f3Smrg- zink: stop allocating xfb slot map
41817ec681f3Smrg- zink: handle direct xfb output from output variables
41827ec681f3Smrg- zink: evaluate existing slot map during program init and force new map as needed
41837ec681f3Smrg- zink: rename variable in update_so_info()
41847ec681f3Smrg- zink: use info.has_transform_feedback_varyings to determine xfb enablement
41857ec681f3Smrg- zink: pass so_info directly to update_so_info()
41867ec681f3Smrg- zink: use slightly stricter check for update_so_info() callsite
41877ec681f3Smrg- zink: only export necessary xfb outputs to ntv
41887ec681f3Smrg- zink: don't pass so_info to ntv at all unless it's necessary
41897ec681f3Smrg- zink: unref ctx->framebuffer on context destroy
41907ec681f3Smrg- zink: fix instance/device versioning (for real this time)
41917ec681f3Smrg- zink: simplify some update_descriptor code
41927ec681f3Smrg- zink: move descriptor sets/pools from batches to programs
41937ec681f3Smrg- zink: store and reuse descriptorsets after batch completion
41947ec681f3Smrg- zink: move descriptor set alloc function to zink_program.c
41957ec681f3Smrg- zink: use more precise sizing for descriptor pools
41967ec681f3Smrg- zink: add helper function for cycling a batch
41977ec681f3Smrg- zink: even better handling for descriptor oom
41987ec681f3Smrg- zink: remove flushes for batch descriptor use
41997ec681f3Smrg- zink: add bucket allocating for descriptor sets
42007ec681f3Smrg- zink: add scaling factor for descriptor set bucket allocations
42017ec681f3Smrg- zink: add caching for descriptor sets
42027ec681f3Smrg- zink: add second level cache for descriptor sets
42037ec681f3Smrg- zink: move streamout to draw_vbo
42047ec681f3Smrg- zink: reorder descriptor barrier applying during updating
42057ec681f3Smrg- zink: move surface refs to the end of descriptor updating
42067ec681f3Smrg- zink: split descriptor sets based on usage
42077ec681f3Smrg- zink: use dynamic offsets for first ubo
42087ec681f3Smrg- zink: introduce descriptor states
42097ec681f3Smrg- zink: add a null sampler view descriptor hash to the screen
42107ec681f3Smrg- zink: pre-hash sampler views and states
42117ec681f3Smrg- zink: store last-used descriptor set for each type of set for quick reuse
42127ec681f3Smrg- zink: actually flag all used resources as used during update_descriptors
42137ec681f3Smrg- zink: add program pointer to desc set struct
42147ec681f3Smrg- zink: move descriptor set allocation near the top of update_descriptors
42157ec681f3Smrg- zink: only batch-reference the program in use once per descriptor update
42167ec681f3Smrg- zink: improve descriptor cache invalidation
42177ec681f3Smrg- zink: add flag for recycled descriptor sets
42187ec681f3Smrg- zink: don't double iterate all the per-batch sets on reset
42197ec681f3Smrg- zink: add VkPipelineLayout to zink_program meta struct
42207ec681f3Smrg- zink: split out ubo descriptor updating
42217ec681f3Smrg- zink: break out ssbo descriptor updating
42227ec681f3Smrg- zink: break out sampler descriptor updating
42237ec681f3Smrg- zink: break out image descriptor updating
42247ec681f3Smrg- zink: deduplicate VkWriteDescriptorSet setup
42257ec681f3Smrg- zink: break out descriptor stuff into new files
42267ec681f3Smrg- zink: break out all the descriptor pool/layout stuff into a new struct
42277ec681f3Smrg- zink: change program pointer on struct zink_descriptor_set to pool pointer
42287ec681f3Smrg- zink: track number of sets currently allocated per descriptor pool
42297ec681f3Smrg- zink: move descriptor type to pool object from set
42307ec681f3Smrg- zink: allow reuse of zink_descriptor_pools between programs
42317ec681f3Smrg- zink: remove intermediate func for descriptor set getting
42327ec681f3Smrg- zink: simplify check for knowing whether descriptor updating is needed
42337ec681f3Smrg- zink: pre-size descriptor transition hash table
42347ec681f3Smrg- zink: move descriptor binding out of the update codepath
42357ec681f3Smrg- zink: reuse descriptor barriers across draws
42367ec681f3Smrg- zink: track resource count on descriptor pool object
42377ec681f3Smrg- zink: directly use resource count from pool instead of accumulating every time
42387ec681f3Smrg- zink: remove struct zink_descriptor_resource from descriptor updating
42397ec681f3Smrg- zink: don't create descriptor barrier hash tables for cached descriptor set
42407ec681f3Smrg- zink: always use VK_IMAGE_LAYOUT_SHADER_READ_ONLY_OPTIMAL for sampler bindings
42417ec681f3Smrg- zink: remove image layouts from descriptor states
42427ec681f3Smrg- zink: avoid hashing states without descriptors
42437ec681f3Smrg- zink: fix arrays of texel buffer descriptors
42447ec681f3Smrg- zink: ci updates
42457ec681f3Smrg- zink: move vertex_buffers_enabled_mask to non-hashed part of pipeline state
42467ec681f3Smrg- zink: optimize pipeline hashing
42477ec681f3Smrg- zink: implement an extremely dumb resource memory cache
42487ec681f3Smrg- zink: ci updates
42497ec681f3Smrg- zink: remove debug printf
42507ec681f3Smrg- lavapipe: fix push descriptor set indexing
42517ec681f3Smrg- lavapipe: set more resource bind flags using image/buffer usage bits
42527ec681f3Smrg- zink: move buffer<->image copying to pipe_context::resource_copy_region hook
42537ec681f3Smrg- zink: remove extraneous flush in transfer_map_region_flush
42547ec681f3Smrg- zink: ci updates
42557ec681f3Smrg- zink: optimize resource usage tracking
42567ec681f3Smrg- zink: use _mesa_set_search_and_add() for set management
42577ec681f3Smrg- zink: ralloc screen objects
42587ec681f3Smrg- zink: implement a surface cache
42597ec681f3Smrg- zink: use a safer iteration for fb surfaces during rp init
42607ec681f3Smrg- zink: determine whether the vulkan driver requires mesa flush wsi
42617ec681f3Smrg- zink: force mesa flush implicit fencing on ANV
42627ec681f3Smrg- zink: force explicit fence only on first frame flush
42637ec681f3Smrg- zink: use internal api for first-frame fence
42647ec681f3Smrg- zink: return true from program ref functions upon free
42657ec681f3Smrg- zink: unset ctx->program pointers when an unref destroys the object
42667ec681f3Smrg- zink: stop leaking programs
42677ec681f3Smrg- vk/util: add unified shader module struct/functions
42687ec681f3Smrg- vk/util: add a util macro for initializing stack vk_shader_module structs
42697ec681f3Smrg- lavapipe: use common interfaces for shader modules
42707ec681f3Smrg- radv: use common interfaces for shader modules
42717ec681f3Smrg- v3dv: use common interfaces for shader modules
42727ec681f3Smrg- tu: use common interfaces for shader modules
42737ec681f3Smrg- anv: use common interfaces for shader modules
42747ec681f3Smrg- zink: add batch usage flags for sampler views/states and desc sets
42757ec681f3Smrg- zink: avoid looping for non-ubo descriptor updates based on set usage
42767ec681f3Smrg- zink: break out batch id finding for resource usage into util function
42777ec681f3Smrg- zink: move resource internals to a separate struct
42787ec681f3Smrg- zink: split out backing resource object create/destroy
42797ec681f3Smrg- zink: track resource_object usage instead of resource usage
42807ec681f3Smrg- zink: handle cached descriptor set punting
42817ec681f3Smrg- zink: add some asserts for pipeline barriers to check renderpass state
42827ec681f3Smrg- zink: add util function for checking whether a shader descriptor is a buffer
42837ec681f3Smrg- zink: add util function for checking whether shader descriptor is buffer from program
42847ec681f3Smrg- zink: use an explicit zink_buffer_view struct
42857ec681f3Smrg- zink: explicitly use zink_surface objects for sampler/image view objects
42867ec681f3Smrg- zink: store bufferview hash to bufferview struct
42877ec681f3Smrg- zink: simplify bufferview and imageview descriptor state hashing
42887ec681f3Smrg- zink: add extra batch tracking for sampler views
42897ec681f3Smrg- zink: massively beef up batch tracking for shader images
42907ec681f3Smrg- zink: add helper function for getting a resource for a descriptor
42917ec681f3Smrg- zink: update null descriptor hashes to reflect current descriptor states
42927ec681f3Smrg- zink: improve debug asserts for samplers/images during descriptor updates
42937ec681f3Smrg- zink: properly handle null bufferview descriptor states
42947ec681f3Smrg- zink: cache bufferviews
42957ec681f3Smrg- zink: add missing null check
42967ec681f3Smrg- zink: unset last_vertex_stage_dirty after applying it
42977ec681f3Smrg- zink: run nir_convert_from_ssa last during compile
42987ec681f3Smrg- zink: use intermediate var for glsl non-array type during shader create
42997ec681f3Smrg- zink: break out bo array type construction into ntv util function
43007ec681f3Smrg- zink: also break out whole ntv bo struct pointer construction
43017ec681f3Smrg- zink: add unsized array type to get_glsl_type_element() handling
43027ec681f3Smrg- zink: add debug info about missing atomic ops
43037ec681f3Smrg- zink: add ntv util function for checking if a glsl type is an atomic counter
43047ec681f3Smrg- zink: break out sized uint array construction into util function
43057ec681f3Smrg- zink: flatten binding numbers a bit
43067ec681f3Smrg- zink: directly set nir variable bindings for reuse during ntv
43077ec681f3Smrg- zink: move zink_binding() to compiler.c
43087ec681f3Smrg- zink: apply Delete All The Code methodology to the ubo/ssbo variables
43097ec681f3Smrg- zink: set ntv variable descriptor sets during compile phase
43107ec681f3Smrg- zink: ci updates
43117ec681f3Smrg- ci/panfrost: disable the rest of these jobs temporarily
43127ec681f3Smrg- zink: hook up resource bind history
43137ec681f3Smrg- zink: remove direct samplerview batch-tracking
43147ec681f3Smrg- zink: add a pipe_context::invalidate_resource hook
43157ec681f3Smrg- zink: set valid region for streamout buffers on bind
43167ec681f3Smrg- zink: handle streamout buffer rebinds
43177ec681f3Smrg- zink: invalidate resources on map when discarding range
43187ec681f3Smrg- zink: enable PIPE_CAP_INVALIDATE_BUFFER
43197ec681f3Smrg- zink: switch to deqp-runner for piglit jobs
43207ec681f3Smrg- zink: always use requested format for sampler view creation
43217ec681f3Smrg- zink: ci updates
43227ec681f3Smrg- zink: more consolidation for null sampler/image view hashing
43237ec681f3Smrg- zink: add a pipe_context::fence_server_sync hook
43247ec681f3Smrg- zink: add enum for different queues
43257ec681f3Smrg- zink: refactor resource_sync_writes_from_batch_usage() to manage batch id internally
43267ec681f3Smrg- zink: convert ZINK_RESOURCE_ACCESS defines to enum
43277ec681f3Smrg- zink: abstract zink_get_resource_usage() and move it to be internal
43287ec681f3Smrg- zink: return enum zink_queue from zink_batch_reference_resource_rw()
43297ec681f3Smrg- zink: split out batch resource-set clearing into separate function
43307ec681f3Smrg- zink: move active query pruning to batch reset
43317ec681f3Smrg- zink: move batch init into zink_batch.c
43327ec681f3Smrg- zink: also move batch destructor into zink_batch.c
43337ec681f3Smrg- zink: move other batch-tracking implementations to unified codepath
43347ec681f3Smrg- zink: use macro to streamline batch struct member init
43357ec681f3Smrg- zink: remove query batch-tracking init from begin_query()
43367ec681f3Smrg- zink: move fence reset to zink_fence_init()
43377ec681f3Smrg- zink: clear framebuffer state on context destroy
43387ec681f3Smrg- zink: enable spirv extension for post depth coverage
43397ec681f3Smrg- compiler/spirv: fix image sample queries
43407ec681f3Smrg- zink: handle nir_intrinsic_image_deref_samples
43417ec681f3Smrg- zink: flatten 2d_array surfaces when necessary
43427ec681f3Smrg- lavapipe: support VK_KHR_copy_commands2
43437ec681f3Smrg- lavapipe: rewrite cmdbufs to always do descriptor binds/pushes first
43447ec681f3Smrg- lavapipe: force state updates when beginning queries
43457ec681f3Smrg- llvmpipe/setup: force fs constant updating upon beginning queries
43467ec681f3Smrg- zink: break out surface viewtype clamping into util function
43477ec681f3Smrg- zink: improve surface viewtype clamping
43487ec681f3Smrg- zink: correctly clamp samplerview surface types
43497ec681f3Smrg- ci/lavapipe: split out lavapipe ci into lavapipe dir
43507ec681f3Smrg- llvmpipe/setup: use bigger hammer to force fs constant updating correctly
43517ec681f3Smrg- zink: split off a bunch of batch struct members to new batch state struct
43527ec681f3Smrg- zink: rewrite queue dispatch to use monotonic batch ids instead of hardcoded ones
43537ec681f3Smrg- zink: more accurately check samplecount caps for shader images
43547ec681f3Smrg- zink: make fb ref func return bool on free
43557ec681f3Smrg- zink: add explicit surface/bufferview batch-tracking functions
43567ec681f3Smrg- zink: use surface references for fb attachments
43577ec681f3Smrg- zink: break out surface destroy function into a screen function
43587ec681f3Smrg- zink: use a custom surface referencing function whenever unrefing a surface
43597ec681f3Smrg- zink: implement a global framebuffer cache
43607ec681f3Smrg- vk: consolidate dynamic descriptor binding sorting
43617ec681f3Smrg- ci: update xfails for ppc64le and s390x
43627ec681f3Smrg- zink: break out buffer mapping part of zink_transfer_map
43637ec681f3Smrg- zink: cache transfer maps
43647ec681f3Smrg- zink: unify clear color conversion code
43657ec681f3Smrg- nir: add nir_lower_indirect_builtin_uniform_derefs()
43667ec681f3Smrg- st/glsl_to_nir: lower indirect derefs of builtins in non-packed uniform case
43677ec681f3Smrg- softpipe: ci updates
43687ec681f3Smrg- zink: move 'batch_id' and 'is_compute' members to fence
43697ec681f3Smrg- zink: make batch usage unsetting function public
43707ec681f3Smrg- zink: always reset batch states when finding a new one
43717ec681f3Smrg- zink: move batch-tracked resources to fence object
43727ec681f3Smrg- zink: fix spirv image operand ordering
43737ec681f3Smrg- zink: fix multisampled shader image load/store
43747ec681f3Smrg- zink: force PIPE_SWIZZLE_1 for X channels in samplerviews
43757ec681f3Smrg- zink: handle blitting of color formats with ignored alpha channels
43767ec681f3Smrg- zink: emulate PIPE_FORMAT_R8G8B8X8_UNORM
43777ec681f3Smrg- zink: ci updates
43787ec681f3Smrg- zink: relax unreachable() to debug_printf when waiting on batch
43797ec681f3Smrg- zink: rework public batch flush function to be useful again
43807ec681f3Smrg- zink: move zink_flush_compute() users to zink_flush_queue()
43817ec681f3Smrg- zink: always flag xfb barrier on gfx flush when appropriate
43827ec681f3Smrg- zink: simplify some queue-related query code
43837ec681f3Smrg- zink: refactor clears a little to track a bitfield of enabled clears on the context
43847ec681f3Smrg- zink: trigger pending clears during flush
43857ec681f3Smrg- zink: ci updates
43867ec681f3Smrg- zink: add wrapper to reset batch state structs
43877ec681f3Smrg- zink: call clear() instead of reset() for batch states on context destroy
43887ec681f3Smrg- zink: unify gfx and compute batches
43897ec681f3Smrg- zink: isolate gfx stage bits when updating shader modules
43907ec681f3Smrg- zink: store conditional render predicate to query and split out start/stop
43917ec681f3Smrg- zink: only update conditional render buffer when it needs to be updated
43927ec681f3Smrg- zink: toggle conditional render when beginning/ending a renderpass
43937ec681f3Smrg- zink: ci updates
43947ec681f3Smrg- zink: handle gallium multi draws more effectively
43957ec681f3Smrg- zink: create separate upload mgr for constants
43967ec681f3Smrg- zink: explicitly use stream uploader for staging buffers
43977ec681f3Smrg- zink: add buffer_subdata hook
43987ec681f3Smrg- zink: avoid unnecessary resource refs during descriptor update
43997ec681f3Smrg- zink: remove handling for resource flushing between compute/gfx batches
44007ec681f3Smrg- zink: remove unnecessary flush during image maps
44017ec681f3Smrg- zink: add more rp cache asserts
44027ec681f3Smrg- compiler/spirv: use undefs when extending image coords
44037ec681f3Smrg- zink: don't generate sampled image type for non-sampled images
44047ec681f3Smrg- util/set: stop leaking u32 key sets which pass a mem ctx
44057ec681f3Smrg- lavapipe: fix CmdCopyQueryPoolResults for partial pipeline statistics queries
44067ec681f3Smrg- lavapipe: use the passed offset for CmdCopyQueryPoolResults
44077ec681f3Smrg- lavapipe: stop tracking draw start/count on rendering state
44087ec681f3Smrg- zink: ci updates
44097ec681f3Smrg- lavapipe: ignore templateType when descriptor template isn't for push descriptors
44107ec681f3Smrg- lavapipe: remove lvp_descriptor_update_template::descriptor_set_layout
44117ec681f3Smrg- zink: fix handling for image types in resource_copy_region hook
44127ec681f3Smrg- zink: also fix image buffer layer copying
44137ec681f3Smrg- lavapipe: fix array texture region copies
44147ec681f3Smrg- zink: only do shader updates when relevant stages are dirty
44157ec681f3Smrg- zink: use correct surface ref function for context destroy
44167ec681f3Smrg- zink: stall when we start getting a lot of uncompleted batches
44177ec681f3Smrg- zink: reset all fences when waiting on batch state
44187ec681f3Smrg- zink: fix format support detection for storage texel buffers and shader images
44197ec681f3Smrg- zink: break out image/buffer create info structs into helper funcs
44207ec681f3Smrg- zink: make descriptor state invalidate public
44217ec681f3Smrg- zink: reorder barrier util functions to set up barrier struct before batch
44227ec681f3Smrg- zink: break out barrier struct initializing into helper funcs
44237ec681f3Smrg- zink: create separate vk image/buffer objects for shader image use
44247ec681f3Smrg- zink: incrementally add image usage flags based on device caps
44257ec681f3Smrg- zink: add color output bit and/or use linear tiling for sampled images
44267ec681f3Smrg- zink: check image format props before creating image
44277ec681f3Smrg- zink: toggle between linear/optimal tiling during image creation
44287ec681f3Smrg- zink: flatten out buffer creation usage flags codepath
44297ec681f3Smrg- zink: ralloc shader cache and keys
44307ec681f3Smrg- zink: rework border color handling
44317ec681f3Smrg- zink: clean up query creation failure paths
44327ec681f3Smrg- zink: create result buffers for all query streams
44337ec681f3Smrg- zink: remove flush from query buffer copy
44347ec681f3Smrg- zink: manually handle more bool query types for copying
44357ec681f3Smrg- zink: remove special casing for occlusion qbos
44367ec681f3Smrg- zink: rewrite query internals
44377ec681f3Smrg- zink: bump pools up to 5k queries each
44387ec681f3Smrg- zink: don't use PARTIAL bit for query results with time queries
44397ec681f3Smrg- zink: reorder availability handling for (user) qbos
44407ec681f3Smrg- zink: remove explicit fencing for query results
44417ec681f3Smrg- zink: ci updates
44427ec681f3Smrg- lavapipe: refactor base draw dispatch to handle multidraws
44437ec681f3Smrg- lavapipe: refactor indexed draw dispatch to handle multidraws
44447ec681f3Smrg- aux/draw: stop copying draw params unnecessarily
44457ec681f3Smrg- aux/draw: rewrite PRIM_RESTART_LOOP macro as a function
44467ec681f3Smrg- aux/draw: pass the full draw params through to draw_instances()
44477ec681f3Smrg- aux/draw: pass the full draw params through to draw_pt_arrays_restart()
44487ec681f3Smrg- aux/draw: move draw param sanitization to end of function
44497ec681f3Smrg- aux/draw: track increment_draw_id value from draw info
44507ec681f3Smrg- aux/draw: pass full draw params to draw_pt_arrays()
44517ec681f3Smrg- llvmpipe: stop flattening multidraws
44527ec681f3Smrg- lavapipe: ignore unused clearvalues when beginning renderpass
44537ec681f3Smrg- zink: rework texture_barrier hook
44547ec681f3Smrg- zink: move update_descriptors & related funcs to zink_descriptors.c
44557ec681f3Smrg- zink: move descriptor barrier handling to main update function
44567ec681f3Smrg- zink: simplify some descriptor update function parameters
44577ec681f3Smrg- zink: use GENERAL layout for sampler images that are also bound as shader images
44587ec681f3Smrg- zink: rework some includes
44597ec681f3Smrg- zink: rework memory_barrier hook
44607ec681f3Smrg- zink: add locking for descriptor pools
44617ec681f3Smrg- zink: add locking for resource maps
44627ec681f3Smrg- zink: manually invoke cpu detection during screen init
44637ec681f3Smrg- zink: add locking for batch states
44647ec681f3Smrg- zink: add function for checking whether a batch is done
44657ec681f3Smrg- zink: split fence finish func
44667ec681f3Smrg- zink: add locking for fence resources
44677ec681f3Smrg- zink: explicitly reset a couple more batch state members
44687ec681f3Smrg- zink: assume fence has already completed if a batch state isn't found
44697ec681f3Smrg- zink: rename init_batch_state to get_batch_state
44707ec681f3Smrg- zink: store context to batch state
44717ec681f3Smrg- zink: make a local screen pointer in zink_flush
44727ec681f3Smrg- zink: remove zink_fence_init()
44737ec681f3Smrg- zink: move VkQueue to batch object
44747ec681f3Smrg- zink: break out queue submit into separate functions
44757ec681f3Smrg- zink: also check for device lost reset on flush
44767ec681f3Smrg- zink: remove zink_create_fence()
44777ec681f3Smrg- zink: track coherent resource objects
44787ec681f3Smrg- zink: use cached memory for all resources when possible
44797ec681f3Smrg- radv: stop zeroing radv_draw_info during draw
44807ec681f3Smrg- radv: refactor draw dispatch
44817ec681f3Smrg- radv: track whether gl_BaseInstance is used
44827ec681f3Smrg- radv: simplify vs draw param counting during setup
44837ec681f3Smrg- radv: set gfx pipeline vtx_emit_num to the number of sgprs
44847ec681f3Smrg- radv: track whether drawid is used on the pipeline struct
44857ec681f3Smrg- radv: track whether baseinstance is used on the pipeline struct
44867ec681f3Smrg- radv: break out vertex shader param emission into separate function
44877ec681f3Smrg- radv: make vertex param sgpr count more explicit
44887ec681f3Smrg- radv: reorder vertex shader params
44897ec681f3Smrg- radv: don't emit baseinstance and drawid if neither is used
44907ec681f3Smrg- radv: don't reset vertex state params on pipeline bind if reg layout matches
44917ec681f3Smrg- zink: implement threaded context
44927ec681f3Smrg- zink: ci updates
44937ec681f3Smrg- zink: handle PIPE_MAP_DONTBLOCK for buffer read maps
44947ec681f3Smrg- zink: add set_context_param hook
44957ec681f3Smrg- zink: add batch tracking id for program struct
44967ec681f3Smrg- zink: track last completed batch id to optimize checking states
44977ec681f3Smrg- zink: handle expired deferred fences more reasonably
44987ec681f3Smrg- zink: hook up timeline semaphore signalling during batch submission
44997ec681f3Smrg- zink: add timeline semaphore fastpath for checking/triggering batch completion
45007ec681f3Smrg- zink: optimize batch states for timeline use
45017ec681f3Smrg- zink: enforce device lost status
45027ec681f3Smrg- zink: be more explicit about blit layer/depth usage
45037ec681f3Smrg- zink: use VkSubresourceLayout::depthPitch as layer_stride when mapping 3D imgs
45047ec681f3Smrg- zink: zink_push_constant -> zink_gfx_push_constant
45057ec681f3Smrg- zink: use max_rt to determine number of blend state attachments
45067ec681f3Smrg- zink: emit ImageCubeArray cap when accessing arrayed cube dimension images
45077ec681f3Smrg- zink: fix layercount for array texture blits
45087ec681f3Smrg- zink: add some asserts to avoid zero-sized blit regions
45097ec681f3Smrg- features: mark off ARB_compute_variable_group_size for zink
45107ec681f3Smrg- features: mark off GL_OES_viewport_array for zink
45117ec681f3Smrg- zink: store shader_info to ntv_context struct
45127ec681f3Smrg- zink: only emit SpvCapabilitySampleMaskPostDepthCoverage if the mode is set
45137ec681f3Smrg- zink: enable PIPE_CAP_TGSI_TES_LAYER_VIEWPORT
45147ec681f3Smrg- features: mark off ARB_shader_viewport_layer_array for zink
45157ec681f3Smrg- zink: avoid cached memory allocations when not requested
45167ec681f3Smrg- util/threaded_context: support pipe_context::set_sample_locations
45177ec681f3Smrg- zink: hook up cs push constant for nir_intrinsic_load_work_dim
45187ec681f3Smrg- zink: use better usage flags for staging resources
45197ec681f3Smrg- zink: use vkGetPhysicalDeviceFormatProperties2 when available
45207ec681f3Smrg- zink: use 2 variant to check image format props during create
45217ec681f3Smrg- zink: only use host mem for staging resources with linear tiling
45227ec681f3Smrg- zink: move cmdpool reset to batch state reset
45237ec681f3Smrg- zink: split total_mem off to total_video_mem, use total_mem for tc
45247ec681f3Smrg- zink: relax maybe_flush mem threshold
45257ec681f3Smrg- zink: relax maybe_flush batch count threshold
45267ec681f3Smrg- zink: check last_finished first in fence_finish early out case
45277ec681f3Smrg- zink: defer timestamp query pool resets to end_query
45287ec681f3Smrg- zink: reset queries when suspending if >50% of total pool is used
45297ec681f3Smrg- zink: don't use cached mem for staging resources
45307ec681f3Smrg- zink: flag DYNAMIC resources as coherent
45317ec681f3Smrg- zink: drop VK_PIPELINE_CREATE_DISABLE_OPTIMIZATION_BIT from compute path
45327ec681f3Smrg- aux/trace: add a set_inlinable_constants hook
45337ec681f3Smrg- intel: avoid dumping null cs sampler/binding states
45347ec681f3Smrg- zink: emit WorkgroupSize when not using ExecutionModeLocalSize
45357ec681f3Smrg- lavapipe: add some asserts for blit region extents
45367ec681f3Smrg- zink: export PIPE_CAP_PREFER_BLIT_BASED_TEXTURE_TRANSFER
45377ec681f3Smrg- aux/trace: add screen deduplication for zink+lavapipe tracing
45387ec681f3Smrg- aux/trace: add a bunch of methods for lavapipe
45397ec681f3Smrg- util/set: add macro for destructively iterating set entries
45407ec681f3Smrg- util/hash_table: add macro for destructively iterating entries
45417ec681f3Smrg- aux/trace: add GALLIUM_TRACE_TRIGGER mode
45427ec681f3Smrg- zink: add a pipe_screen::finalize_nir hook
45437ec681f3Smrg- zink: implement uniform inlining
45447ec681f3Smrg- zink: add env var to force uniform inlining
45457ec681f3Smrg- zink: remove atomic usage from batch tracking comparisons
45467ec681f3Smrg- zink: bypass separate stencil path in resource_reference_rw when not a zs image
45477ec681f3Smrg- zink: fix conditional when assigning tess variable io
45487ec681f3Smrg- zink: stop unmapping resources
45497ec681f3Smrg- zink: simplify clear-apply on fb state change
45507ec681f3Smrg- zink: use set_foreach_remove()
45517ec681f3Smrg- zink: use explicit subpass deps
45527ec681f3Smrg- zink: hook up EXT_fragment_shader_interlock
45537ec681f3Smrg- zink: support ARB_fragment_shader_interlock
45547ec681f3Smrg- aux/trace: dump all the blend state members
45557ec681f3Smrg- features: mark off ARB_fragment_shader_interlock for zink
45567ec681f3Smrg- gallium/threaded_context: add another rule for buffer mapping
45577ec681f3Smrg- zink: fix CI flakiness in glx-multithread-clearbuffer
45587ec681f3Smrg- zink: make timeline semaphores per-screen
45597ec681f3Smrg- zink: handle checking batch completion from other contexts without timelines
45607ec681f3Smrg- zink: only unmap PIPE_MAP_ONCE in synchronous mode
45617ec681f3Smrg- zink: don't lose existing pNext when using wsi_image_create_info in image creation
45627ec681f3Smrg- anv: fix debugoptimized build compile
45637ec681f3Smrg- zink: move descriptor state management to descriptors.c
45647ec681f3Smrg- zink: make a bunch of descriptor functions static
45657ec681f3Smrg- zink: create separate linear tiling image for scanout
45667ec681f3Smrg- zink: flag anv for mesa image create wsi
45677ec681f3Smrg- zink: disable mutable formats for zs formats and scanout images
45687ec681f3Smrg- aux/trace: enhance trigger mode to dump context states during bind
45697ec681f3Smrg- aux/trace: dump current fb state on trigger-mode draw if it hasn't been seen yet
45707ec681f3Smrg- aux/trace: do deep dumps of fb state for triggered traces
45717ec681f3Smrg- aux/trace: use ralloc_free for ralloc()ed state pointers
45727ec681f3Smrg- zink: compare against screen batch id when determining which semaphore to use
45737ec681f3Smrg- zink: always copy the nir shader before compiling
45747ec681f3Smrg- zink: fix tcs slot map eval for user vars
45757ec681f3Smrg- zink: fix tcs input reservation for user vars
45767ec681f3Smrg- zink: merge copy-to-scanout path into non-deferred flush path
45777ec681f3Smrg- zink: force scanout sync when mapping scanout resource
45787ec681f3Smrg- zink: use undefined layout for first scanout obj transition
45797ec681f3Smrg- zink: move scanout sync to end of batch
45807ec681f3Smrg- zink: add a flag indicating whether scanout object needs updating
45817ec681f3Smrg- zink: move wsi flush info conditional to queue submission
45827ec681f3Smrg- zink: directly set batch->state->flush_res from flush_resource hook
45837ec681f3Smrg- zink: add clear-on-flush mechanic deeper into flush codepath
45847ec681f3Smrg- Revert "zink: force scanout sync when mapping scanout resource"
45857ec681f3Smrg- softpipe: fix render condition checking
45867ec681f3Smrg- softpipe: fix streamout queries
45877ec681f3Smrg- softpipe: ci updates
45887ec681f3Smrg- zink: track persistent resource objects, not resources
45897ec681f3Smrg- zink: restore previous semaphore (prev_sem) handling
45907ec681f3Smrg- zink: use cached memory for staging resources
45917ec681f3Smrg- zink: only reset query on suspend if the query has previously been stopped
45927ec681f3Smrg- zink: when performing an implicit reset, sync qbos
45937ec681f3Smrg
45947ec681f3SmrgNanley Chery (22):
45957ec681f3Smrg
45967ec681f3Smrg- gallium: Map _DRI_IMAGE_FORMAT_NONE to NULL
45977ec681f3Smrg- gallium: Flush GL API resources in eglCreateImage
45987ec681f3Smrg- iris: Disable aux as needed in iris_flush_resource
45997ec681f3Smrg- blorp: Assert 8x4 alignment for a HiZ op on Gen8-9
46007ec681f3Smrg- i965,iris: Delete misleading HiZ sampling comments
46017ec681f3Smrg- iris: Drop an XXX comment about sampling HiZ arrays
46027ec681f3Smrg- iris: Drop a stale comment about HiZ sampling
46037ec681f3Smrg- iris: Delete redundant assertion in iris_hiz_exec
46047ec681f3Smrg- iris: Drop batch param from iris_resource_prepare_render
46057ec681f3Smrg- iris: Fix the depth aspect aux usage in iris_blit
46067ec681f3Smrg- iris: Keep aux_usage in iris_blorp_surf_for_resource
46077ec681f3Smrg- iris: Fix aux usage of depth buffer prepare/finish
46087ec681f3Smrg- iris: Loosen aux state getter/setter assert on HiZ
46097ec681f3Smrg- iris: Don't avoid aux state getter/setter with HiZ
46107ec681f3Smrg- iris: Drop iris_resource::aux::has_hiz
46117ec681f3Smrg- iris: Call iris_sample_with_depth_aux earlier
46127ec681f3Smrg- iris: Set BO maps to NULL in bo_free
46137ec681f3Smrg- drm-uapi: Update drm_fourcc.h for new TGL modifier
46147ec681f3Smrg- isl: Describe I915_FORMAT_MOD_Y_TILED_GEN12_RC_CCS_CC
46157ec681f3Smrg- iris: Support clear color plane imports for RC_CCS_CC
46167ec681f3Smrg- iris: Support RC_CCS_CC modifier in plane queries
46177ec681f3Smrg- iris: Support I915_FORMAT_MOD_Y_TILED_GEN12_RC_CCS_CC
46187ec681f3Smrg
46197ec681f3SmrgNeha Bhende (3):
46207ec681f3Smrg
46217ec681f3Smrg- mesa: set states in fast path for restoring light attributes
46227ec681f3Smrg- gallium/u_vbuf: use updated pipe_draw_start_count while using draw_vbo
46237ec681f3Smrg- nir_to_tgsi: Fix indices for CMP in nir_to_tgsi for nir_op_fcsel
46247ec681f3Smrg
46257ec681f3SmrgPhilipp Zabel (1):
46267ec681f3Smrg
46277ec681f3Smrg- meson: Fix missing xcb-xrandr dependency for Vulkan X11 WSI
46287ec681f3Smrg
46297ec681f3SmrgPierre Moreau (4):
46307ec681f3Smrg
46317ec681f3Smrg- docs/features: Add OpenCL status
46327ec681f3Smrg- spirv: Ignore WorkgroupSize in non-compute stages
46337ec681f3Smrg- nv50: Replace hardcoded texture/constbuf count with define
46347ec681f3Smrg- nv50: Update texture indices to match stage indices
46357ec681f3Smrg
46367ec681f3SmrgPierre-Eric Pelloux-Prayer (55):
46377ec681f3Smrg
46387ec681f3Smrg- ac: add ifdef __cplusplus guard to header
46397ec681f3Smrg- radeonsi: invalidate compute sgprs in si_rebind_buffer
46407ec681f3Smrg- radeonsi: inhibit clockgating when using SQTT
46417ec681f3Smrg- ci: split src/mesa/\**/* matching rule
46427ec681f3Smrg- radeonsi/sqtt: use more event identifier
46437ec681f3Smrg- radeonsi/sqtt: fix SQTT bo size overflow
46447ec681f3Smrg- radeonsi/sqtt: allow AMD_THREAD_TRACE_TRIGGER to be a frame number
46457ec681f3Smrg- radeonsi/sqtt: forward string markers to sqtt
46467ec681f3Smrg- radeonsi: don't use cp_dma prefetch on GFX6
46477ec681f3Smrg- gallium/u_upload_mgr: lower risk of hitting an assert
46487ec681f3Smrg- radeonsi: fix indentation issue in si_texture.c
46497ec681f3Smrg- radeonsi: store si_context::xxx_shader members in union
46507ec681f3Smrg- radeonsi: fix read from compute / write from draw sync
46517ec681f3Smrg- radeonsi: fix si_check_render_feedback
46527ec681f3Smrg- radeonsi: replace force_cp_dma arg of si_clear_buffer by enum
46537ec681f3Smrg- radeonsi: enable dcc image stores on gfx10+
46547ec681f3Smrg- radeonsi: force dcc clear to use compute clear
46557ec681f3Smrg- mesa: update vao _EnabledWithMapMode in copy_array_object
46567ec681f3Smrg- radeonsi: properly set SPI_SHADER_PGM_HI_ES
46577ec681f3Smrg- ac/rgp: make the max gap between shader code a warning
46587ec681f3Smrg- ac/rtld: make ac_rtld_upload returns the code size
46597ec681f3Smrg- ac/rgp: move radv/sqtt functions to ac
46607ec681f3Smrg- radeonsi/sqtt: keep a copy of the uploaded shader code
46617ec681f3Smrg- radeonsi/sqtt: remove duplicate token
46627ec681f3Smrg- radeonsi/sqtt: don't always use WGP 0
46637ec681f3Smrg- radeonsi/sqtt: export shader code to RGP
46647ec681f3Smrg- radeonsi/sqtt: fix user event max size
46657ec681f3Smrg- frontends/va: fix protected slice data buffer read size
46667ec681f3Smrg- mesa/st: fix lower_tex_src_plane in multiple samplers scenario
46677ec681f3Smrg- dlist: remove ListExt feature
46687ec681f3Smrg- mesa: remove 2 recursive lock usages of _mesa_HashTable
46697ec681f3Smrg- mesa/hash: make the mtx non-recursive
46707ec681f3Smrg- mesa/hash: switch to simple_mtx
46717ec681f3Smrg- mesa: make _mesa_HashTable InDeleteAll debug only
46727ec681f3Smrg- vbo/dlist: use DrawGallium(Complex)
46737ec681f3Smrg- nir/lower_tex: ignore texture_index if tex_instr has deref src
46747ec681f3Smrg- mesa/st: fix st_nir_lower_tex_src_plane arguments
46757ec681f3Smrg- mesa/st: ignore texture_index if tex_instr has deref src
46767ec681f3Smrg- gallium/u_threaded: split draws that don't fit in a batch
46777ec681f3Smrg- st/draw: remove st_draw_vbo
46787ec681f3Smrg- vbo: inline vbo_primitive_restart in brw_primitive_restart
46797ec681f3Smrg- radeonsi/rgp: export barriers
46807ec681f3Smrg- radeonsi/rgp: export compute shader programs
46817ec681f3Smrg- gallium/u_threaded: skip refcounting only once
46827ec681f3Smrg- driconf: add workarounds for Teardown
46837ec681f3Smrg- amdgpu,radeon: add needs_reset param to ctx_query_reset_status
46847ec681f3Smrg- radeonsi: submit cs to failed context instead of skipping them
46857ec681f3Smrg- radeonsi: use SI_CONTEXT_FLAG_AUX when recreating the aux context
46867ec681f3Smrg- radeonsi: do not recreate the aux context from the aux context
46877ec681f3Smrg- radeonsi: only recreate the aux_context when soft recovery failed
46887ec681f3Smrg- radeonsi: re-create the aux context in si_create_context
46897ec681f3Smrg- amdgpu,radeon: add full_reset_only param to ctx_query_reset_status
46907ec681f3Smrg- radeonsi: avoid querying gpu state if possible
46917ec681f3Smrg- r600/sb: Use assignments for resetting struct r600_sb::literal
46927ec681f3Smrg- driconf: add workaround for Golf With Friends
46937ec681f3Smrg
46947ec681f3SmrgQiang Yu (1):
46957ec681f3Smrg
46967ec681f3Smrg- lima: fix xserver page flip fail for full screen client
46977ec681f3Smrg
46987ec681f3SmrgRhys Perry (141):
46997ec681f3Smrg
47007ec681f3Smrg- nir/loop_unroll: unroll more aggressively if it can improve load scheduling
47017ec681f3Smrg- aco: fix convert_to_SDWA() check in add_subdword_definition()
47027ec681f3Smrg- aco: add test for incorrect convert_to_SDWA() check
47037ec681f3Smrg- radv: fix max_waves estimation on GFX10.3
47047ec681f3Smrg- aco: fix num_waves on GFX10+
47057ec681f3Smrg- aco: have emit_wqm() take Builder instead of isel_context
47067ec681f3Smrg- aco: add emit_mimg() helper
47077ec681f3Smrg- aco: move VADDR to the end of the operand list
47087ec681f3Smrg- aco: use non-sequential addressing
47097ec681f3Smrg- aco: only require texture coordinates to be in WQM if NSA is used
47107ec681f3Smrg- aco: add affinity for non-sequential MIMG operands
47117ec681f3Smrg- radv,aco: don't use MUBUF for multi-channel loads on GFX8 with robustness2
47127ec681f3Smrg- nir/lower_io: fix array_length lowering if buffer is smaller than offset
47137ec681f3Smrg- radv,aco: use deref_buffer_array_length
47147ec681f3Smrg- radv: use nir_opt_access
47157ec681f3Smrg- nir/sink,nir/move: sink/move reorderable load_ssbo
47167ec681f3Smrg- radv: sink load_ssbo
47177ec681f3Smrg- aco: don't consider a phi trivial if same's register doesn't match the def
47187ec681f3Smrg- aco: remove Format::{VOP3A,VOP3B}
47197ec681f3Smrg- aco: add instruction cast and format-check methods
47207ec681f3Smrg- aco: use instruction cast methods
47217ec681f3Smrg- aco: use format-check methods
47227ec681f3Smrg- aco: return references in instruction cast methods
47237ec681f3Smrg- aco: fix WQM for texture instructions with args before the coordinates
47247ec681f3Smrg- nir/opt_uniform_atomics: recognize more complicated invocation comparisons
47257ec681f3Smrg- nir/opt_uniform_atomics: fix elect detection
47267ec681f3Smrg- aco: disable a*1.0 optimization if the instruction is precise
47277ec681f3Smrg- nir/algebraic: optimize out exact a*1.0 if it's used only as a float
47287ec681f3Smrg- aco: optimize a*0.0
47297ec681f3Smrg- aco: optimize out a*1.0 if it's used as a float
47307ec681f3Smrg- nir/algebraic: optimize out exact a+0.0 if it's used only as a float
47317ec681f3Smrg- nir/algebraic: eliminate exact a*0.0 if float execution mode allow it
47327ec681f3Smrg- aco: don't affect isPrecise() after applying output modifiers
47337ec681f3Smrg- nir,spirv: allow non-uniform OpArrayLength
47347ec681f3Smrg- radv,ac/nir: implement non-uniform get_ssbo_size
47357ec681f3Smrg- aco: implement non-uniform get_ssbo_size
47367ec681f3Smrg- radv: round-up num_records division in radv_flush_vertex_descriptors
47377ec681f3Smrg- radv: correctly enable WGP_MODE for NGG and GS
47387ec681f3Smrg- radv: correctly enable WGP_MODE for tessellation control
47397ec681f3Smrg- aco: add fallback algorithm in get_reg()
47407ec681f3Smrg- aco: always set exec_live=false
47417ec681f3Smrg- aco: optimize AC_FETCH_FORMAT_SNORM alpha adjust
47427ec681f3Smrg- aco: do not flag all blocks WQM to ensure we enter all nested loops in WQM
47437ec681f3Smrg- aco: rewrite setting of Exact_Branch
47447ec681f3Smrg- aco: remove loop to flag loop blocks as WQM
47457ec681f3Smrg- aco: fix adjust_vertex_fetch_alpha
47467ec681f3Smrg- radv: use a more relaxed alignment for upload buffer allocations
47477ec681f3Smrg- radv: fix max_lds_per_simd on GFX10
47487ec681f3Smrg- radv: switch MaxWaves statistic to wave32 waves
47497ec681f3Smrg- ac: split lds_granularity into encode and allocation granularities
47507ec681f3Smrg- radv: use lds_{encode,alloc}_granularity
47517ec681f3Smrg- radv: round up max_lds_per_simd / lds_per_wave
47527ec681f3Smrg- aco: fix waves calculation for wave32
47537ec681f3Smrg- aco: add Program::wgp_mode
47547ec681f3Smrg- radv,aco: add radv_nir_compiler_options::wgp_mode
47557ec681f3Smrg- aco: consider that GFX10.3 allocates LDS in 1024 byte blocks
47567ec681f3Smrg- aco: add DeviceInfo
47577ec681f3Smrg- aco: fix transition_to_{WQM,Exact} if exec.back() is not in exec
47587ec681f3Smrg- radv: relax shared alignment requirements in mem_vectorize_callback
47597ec681f3Smrg- radv,aco: allow unaligned LDS access on GFX9+
47607ec681f3Smrg- aco/lower_phis: fix all_preds_uniform with continue_or_break
47617ec681f3Smrg- nir/dce: replace instruction worklist with ssa def bitset
47627ec681f3Smrg- nir: inline nir_foreach_{src,dest}
47637ec681f3Smrg- nir/dce: perform DCE for unlooped instructions in a single pass
47647ec681f3Smrg- aco: calculate all p_as_uniform and v_readfirstlane_b32 sources in WQM
47657ec681f3Smrg- aco: use p_as_uniform for get_sampler_desc and convert_pointer_to_64_bit
47667ec681f3Smrg- nir: fix build at -O1
47677ec681f3Smrg- nir: add nir_ssa_def_is_unused()
47687ec681f3Smrg- nir/copy_prop: remove unused copies
47697ec681f3Smrg- nir/copy_prop: visit copies instead of sources
47707ec681f3Smrg- nir/copy_prop: use nir_{instr,if}_rewrite_{src,condition}_ssa
47717ec681f3Smrg- Revert "radv,aco: allow unaligned LDS access on GFX9+"
47727ec681f3Smrg- aco: add missing usable_read2 check
47737ec681f3Smrg- nir/opt_shrink_vectors: add option to skip shrinking image stores
47747ec681f3Smrg- radv: don't shrink image stores for The Surge 2
47757ec681f3Smrg- radv: don't set sx_blend_opt_epsilon for V_028C70_COLOR_10_11_11
47767ec681f3Smrg- radv,aco: remove aco_compiler_statistics
47777ec681f3Smrg- radv: cache pipeline statistics
47787ec681f3Smrg- aco: set compr for fp16 exports
47797ec681f3Smrg- radv/llvm: fix enabled_channels for compressed exports
47807ec681f3Smrg- aco: simplify loop_nest_depth tracking in isel
47817ec681f3Smrg- aco: track divergent and uniform branch depth
47827ec681f3Smrg- aco: move wait_imm to aco_ir.h
47837ec681f3Smrg- aco: lower p_constaddr into separate instructions earlier
47847ec681f3Smrg- aco: add instruction classes
47857ec681f3Smrg- aco: add latency and inverse throughput statistics
47867ec681f3Smrg- aco: add print option to print program without temporary IDs
47877ec681f3Smrg- aco: add ACO_DEBUG=perfinfo
47887ec681f3Smrg- aco: remove vmem/smem score statistics
47897ec681f3Smrg- aco: fix NSA MIMG followed by MUBUF/MTBUF
47907ec681f3Smrg- aco/tests: add test for NSAToVMEMBug
47917ec681f3Smrg- aco: fix NSA following writelane
47927ec681f3Smrg- aco/tests: add test for waNsaCannotFollowWritelane
47937ec681f3Smrg- nir: Don't update base in vectorize_loads()
47947ec681f3Smrg- aco: implement 64-bit VGPR {u,i}find_msb
47957ec681f3Smrg- aco: use uadd32_sat() helper for nir_op_uadd_sat
47967ec681f3Smrg- aco: use a single instruction for uadd32_sat() on GFX8
47977ec681f3Smrg- aco: implement image_deref_samples
47987ec681f3Smrg- aco: add aco_print_program() flag to print kill flags
47997ec681f3Smrg- aco: add aco_print_program() flags to print live_out and register demand
48007ec681f3Smrg- docs: document ACO_DEBUG=perfinfo
48017ec681f3Smrg- aco: add ACO_DEBUG=liveinfo
48027ec681f3Smrg- radv: lower variables to ssa before nir_propagate_invariant
48037ec681f3Smrg- radv: lower view_index to zero if multiview is disabled
48047ec681f3Smrg- ci: add expected fail for RADV
48057ec681f3Smrg- aco: don't optimize min(a*1.0, ...) to min(a, ...) on GFX8
48067ec681f3Smrg- aco: use -1.0*x and 1.0*|x| for fneg/fabs
48077ec681f3Smrg- aco/tests: add tests for denormal-aware propagation
48087ec681f3Smrg- ac: invalidate metadata after hs_emit_write_tess_factors()
48097ec681f3Smrg- aco/tests: fix isel.sparse.clause for LLVM 12+
48107ec681f3Smrg- lavapipe: fix initialization of pipe_stream_output with unwritten outputs
48117ec681f3Smrg- nir/gather_info: implement partial masking of struct and compact I/O
48127ec681f3Smrg- nir/lower_tex: handle deref casts
48137ec681f3Smrg- nir_to_tgsi: run constant folding after nir_opt_algebraic
48147ec681f3Smrg- aco: fix integer tg4 workaround with unnormalized coordinates
48157ec681f3Smrg- draw: fix pstipple, aaline and aapoint without LLVM
48167ec681f3Smrg- aco: ensure loops nested in a WQM loop are in WQM
48177ec681f3Smrg- nir/gather_info: fix partial masking of compact I/O with location_frac!=0
48187ec681f3Smrg- radv: remove second nir_lower_idiv
48197ec681f3Smrg- nir/lower_idiv: add options to use fp32 for 8-bit division lowering
48207ec681f3Smrg- nir/lower_idiv: make lowered divisions exact
48217ec681f3Smrg- aco: fix 16-bit u2f32
48227ec681f3Smrg- aco: fix 16-bit f2{u8,i8} on GFX6/7
48237ec681f3Smrg- radv: don't use fp16 for 8-bit division lowering before GFX9
48247ec681f3Smrg- nir: add nir_block_get_predecessors_sorted() helper
48257ec681f3Smrg- nir/lcssa: fix nondeterminism in predecessor iteration
48267ec681f3Smrg- nir/loop_unroll: fix is_indirect_load() with load_global
48277ec681f3Smrg- radv: fix conditions for running nir_opt_vectorize
48287ec681f3Smrg- aco/ra: use original names when renaming loop carried phi operands
48297ec681f3Smrg- aco/ra: remove live-in temporary from live_out_per_block when moving it
48307ec681f3Smrg- radv: fix barrier in radv_decompress_dcc_compute shader
48317ec681f3Smrg- radv: fix clearing DCC-compressed e5b9g9r9 images
48327ec681f3Smrg- aco: set TRUNC_COORD=0 for nir_texop_tg4
48337ec681f3Smrg- ac/nir: set TRUNC_COORD=0 for nir_texop_tg4
48347ec681f3Smrg- Revert "radeonsi: set TRUNC_COORD=0 for Total War: WARHAMMER to fix it"
48357ec681f3Smrg- aco: don't update register demand during RA validation
48367ec681f3Smrg- aco: allow SDWA sels smaller than the operand size
48377ec681f3Smrg- radv: disable VK_FORMAT_R64_SFLOAT
48387ec681f3Smrg- vulkan: fix use-after-free in vk_common_DestroyDebugReportCallbackEXT
48397ec681f3Smrg- radv: fix use-after-free upon GS copy shader cache hits
48407ec681f3Smrg- radv,ac/llvm: use a dword alignment for descriptor loads
48417ec681f3Smrg
48427ec681f3SmrgRob Clark (143):
48437ec681f3Smrg
48447ec681f3Smrg- freedreno/ir3: Fix ldg decoding/parsing
48457ec681f3Smrg- freedreno/ir3: Decouple ir3_info collection from assembler
48467ec681f3Smrg- freedreno/ir3: Add some new "logical" opcodes
48477ec681f3Smrg- freedreno/hw: Add isaspec mechanism for documenting/defining an ISA
48487ec681f3Smrg- freedreno/hw/isa: Add description of ir3 ISA
48497ec681f3Smrg- freedreno/hw/isa: Add expression caching
48507ec681f3Smrg- freedreno/ir3/tests: Switch disasm test over to new decoder
48517ec681f3Smrg- freedreno/ir3: Switch over to new encoder/decoder
48527ec681f3Smrg- freedreno/ir3: Small resinfo disasm tweak
48537ec681f3Smrg- freedreno/ir3: Better sstall estimation
48547ec681f3Smrg- freedreno/ir3: Realign disasm shader stats
48557ec681f3Smrg- freedreno/ir3/decode: Switch over to new disasm
48567ec681f3Smrg- freedreno/ir3: Remove legacy packed-struct encoding
48577ec681f3Smrg- frontend/dri: Expose RGB[AX]_SRGB as well
48587ec681f3Smrg- freedreno/isa: Fix branch/jump offset encoding
48597ec681f3Smrg- freedreno/a6xx: Add r2d support for GMEM resolves
48607ec681f3Smrg- gallium/util: Add helpers to determine if z/s is written
48617ec681f3Smrg- freedreno/a6xx: Don't early-z if there are stencil writes
48627ec681f3Smrg- r300: Use util_writes_depth_stencil() helper
48637ec681f3Smrg- radeonsi: Use util_writes_stencil() helper
48647ec681f3Smrg- freedreno: Add perf_warn() trace helper
48657ec681f3Smrg- freedreno: Add fmt/args macros for pipe_resource
48667ec681f3Smrg- freedreno/a6xx: Add helper to check if UBWC is supported
48677ec681f3Smrg- freedreno: Add perf_warn() for missed UBWC opportunities
48687ec681f3Smrg- ci/freedreno/a6xx: Skip vs-output-array-vec2-index-wr-before-gs
48697ec681f3Smrg- freedreno/a6xx: Fix 3dmark misrendering with unwritten MRTs
48707ec681f3Smrg- mesa: Remove _mesa_destroy_context()
48717ec681f3Smrg- freedreno/decode: Fix overflow
48727ec681f3Smrg- freedreno: Put an upper limit on VSC size
48737ec681f3Smrg- freedreno: Misc cleanup
48747ec681f3Smrg- freedreno/a5xx: Drop fd5_compute_stateobj
48757ec681f3Smrg- freedreno/a6xx: Drop fd6_compute_stateobj
48767ec681f3Smrg- freedreno/ir3+a5xx+a6xx: De-duplicate create_compute_state()
48777ec681f3Smrg- freedreno/ir3: Add ir3_shader_state
48787ec681f3Smrg- freedreno/ir3: Move ir3_compiler_create()
48797ec681f3Smrg- freedreno/ir3: Add ir3_screen_fini()
48807ec681f3Smrg- freedreno/ir3: Reshuffle ir3_shader_create()
48817ec681f3Smrg- freedreno/ir3: Reshuffle compute state creation
48827ec681f3Smrg- freedreno/ir3: Async shader compile
48837ec681f3Smrg- freedreno/ir3: Add missing shader prog cache invalidation
48847ec681f3Smrg- freedreno: Quiet fallthrough warnings
48857ec681f3Smrg- freedreno: Split batch_flush_reset_dependencies()
48867ec681f3Smrg- freedreno: driver-thread annotations
48877ec681f3Smrg- freedreno/ir3/print: More sane ssa src/dst display
48887ec681f3Smrg- freedreno/ir3/print: Improve branch printing
48897ec681f3Smrg- util/fossilize_db: Fix compile error with clang
48907ec681f3Smrg- freedreno: Handle InvalidateBufferData() case
48917ec681f3Smrg- freedreno: Add perf_debug logging for bo stalls
48927ec681f3Smrg- freedreno: Workaround for UNSYNC+DISCARD_RANGE
48937ec681f3Smrg- driconf: Generate a static table when no xmlconfig
48947ec681f3Smrg- xmlconfig: Reshuffle to keep attr processing
48957ec681f3Smrg- xmlconfig: Add static driconfig support
48967ec681f3Smrg- freedreno/ir3: Drop foreach_bit() macro
48977ec681f3Smrg- freedreno: Drop foreach_bit() macro
48987ec681f3Smrg- etnaviv: Drop foreach_bit() macro
48997ec681f3Smrg- v3d: Drop foreach_bit() macro
49007ec681f3Smrg- freedreno: Fix think-o in fd_resource_wait()
49017ec681f3Smrg- freedreno/ir3: Fix initial_variants_synchronous() condition
49027ec681f3Smrg- freedreno: Add FD_DBG() macro
49037ec681f3Smrg- freedreno: Slight perf_debug rework
49047ec681f3Smrg- freedreno: Add macro for duration based warns
49057ec681f3Smrg- util/u_queue: Ensure num_cpu_mask_bits is valid
49067ec681f3Smrg- util: Add accessor for util_cpu_caps
49077ec681f3Smrg- freedreno/a6xx: Always pass ctx to fd6_emit_textures()
49087ec681f3Smrg- freedreno/a6xx: Fix uncompressed resource vs stale CSO
49097ec681f3Smrg- freedreno/ir3: Add comments about shader key/gen
49107ec681f3Smrg- freedreno: Deduplicate fixup_shader_state()
49117ec681f3Smrg- freedreno/a6xx: Fix compile warning
49127ec681f3Smrg- driconf: Add ignore_map_unsynchronized option
49137ec681f3Smrg- freedreno: Remove dead-cells MBR workaround
49147ec681f3Smrg- util: Extract thread-id helpers from u_current
49157ec681f3Smrg- gallium/u_threaded: Add helper to assert driver thread
49167ec681f3Smrg- gallium/u_threaded: use mesa_log for debug msgs
49177ec681f3Smrg- freedreno: Fix u_blitter constant-buffer leak
49187ec681f3Smrg- freedreno: Factor out common fd_resource init
49197ec681f3Smrg- freedreno: Split out batch/resource tracking
49207ec681f3Smrg- freedreno: Restructure transfer_map()
49217ec681f3Smrg- freedreno: Extend threaded_resource
49227ec681f3Smrg- freedreno: Extend threaded_transfer
49237ec681f3Smrg- freedreno: Extract out helper for transfer-map flag munging
49247ec681f3Smrg- freedreno: Add fd_replace_buffer_storage()
49257ec681f3Smrg- freedreno: Add transfer_pool_unsync
49267ec681f3Smrg- freedreno/a6xx: Move UBWC demotion to first sampler view bind
49277ec681f3Smrg- freedreno: Check cb0 in rebind_resource()
49287ec681f3Smrg- freedreno: threaded_context support
49297ec681f3Smrg- freedreno: threaded_context async flush support
49307ec681f3Smrg- freedreno: Fix fd_fence_finish()
49317ec681f3Smrg- freedreno/drm: Avoid unitialized timestamp in submit fail
49327ec681f3Smrg- freedreno/drm: Split softpin "reloc" functions
49337ec681f3Smrg- freedreno/drm: Split 64b vs 32b paths
49347ec681f3Smrg- freedreno/drm: Move emit_reloc_tail to head
49357ec681f3Smrg- freedreno/drm: Inline iova calculation
49367ec681f3Smrg- freedreno/ir3: Precompute whether we need driver-params
49377ec681f3Smrg- freedreno: Add helpers to mark dirty state
49387ec681f3Smrg- freedreno: Add mapping to generation specific dirty state
49397ec681f3Smrg- freedreno/a6xx: Convert to dirty_groups
49407ec681f3Smrg- freedreno: Small dirty flag re-org
49417ec681f3Smrg- freedreno: Add dirty bit for state that needs rsc tracking
49427ec681f3Smrg- freedreno: Don't ignore geom/tess stage resources
49437ec681f3Smrg- freedreno: Split out helper for updating sw stats
49447ec681f3Smrg- freedreno: Only collect sw stats when required
49457ec681f3Smrg- freedreno/a6xx/vsc: Be more tolerate of degenerate prims
49467ec681f3Smrg- freedreno: Drop u_trim_pipe_prim() from fast-paths
49477ec681f3Smrg- u_draw: Add helper to emultate multi-draw
49487ec681f3Smrg- freedreno: Use multi-draw helper
49497ec681f3Smrg- freedreno: Handle multi-draw edge cases
49507ec681f3Smrg- freedreno: Push multi-draw closer to backend
49517ec681f3Smrg- freedreno/a6xx: Emit streamout state on every draw
49527ec681f3Smrg- freedreno: Add draw cost estimation
49537ec681f3Smrg- freedreno/batch: Export key/hash fxns
49547ec681f3Smrg- freedreno/batch: Add a way to clone a batch key
49557ec681f3Smrg- freedreno: Add gmem_reason_mask
49567ec681f3Smrg- freedreno/a6xx: Fix sRGB/snorm vs sysmem clear path
49577ec681f3Smrg- freedreno: Autotune bypass vs GMEM rendering decision
49587ec681f3Smrg- freedreno/a6xx: Fix typo
49597ec681f3Smrg- freedreno: Make headers C++ happy
49607ec681f3Smrg- freedreno/fdperf: Use os_read_file()
49617ec681f3Smrg- freedreno: Split out devicetree helpers
49627ec681f3Smrg- ci: Disable panfrost t760
49637ec681f3Smrg- freedreno/a6xx: Fix indirect+patches draws
49647ec681f3Smrg- freedreno/a6xx: Fix obsolete comment
49657ec681f3Smrg- d3d12: Use util_draw_multi() helper
49667ec681f3Smrg- etnaviv: Use util_draw_multi() helper
49677ec681f3Smrg- i915: Use util_draw_multi() helper
49687ec681f3Smrg- iris: Use util_draw_multi() helper
49697ec681f3Smrg- lima: Use util_draw_multi() helper
49707ec681f3Smrg- llvmpipe: Use util_draw_multi() helper
49717ec681f3Smrg- nouveau: Use util_draw_multi() helper
49727ec681f3Smrg- r300: Use util_draw_multi() helper
49737ec681f3Smrg- r600: Use util_draw_multi() helper
49747ec681f3Smrg- softpipe: Use util_draw_multi() helper
49757ec681f3Smrg- svga: Use util_draw_multi() helper
49767ec681f3Smrg- tegra: Use util_draw_multi() helper
49777ec681f3Smrg- vc4: Use util_draw_multi() helper
49787ec681f3Smrg- v3d: Use util_draw_multi() helper
49797ec681f3Smrg- virgl: Use util_draw_multi() helper
49807ec681f3Smrg- freedreno: Don't handle multi-draw in indirect case
49817ec681f3Smrg- util/primconvert: Handle indirect and multi-draw
49827ec681f3Smrg- freedreno: Add .clang-format
49837ec681f3Smrg- freedreno: Some manual reformatting
49847ec681f3Smrg- freedreno: Re-indent
49857ec681f3Smrg- freedreno: Manual fixups
49867ec681f3Smrg- freedreno: Add missing foreach macros and update indentation
49877ec681f3Smrg
49887ec681f3SmrgRohan Garg (8):
49897ec681f3Smrg
49907ec681f3Smrg- virgl: Cache depth and stencil buffers
49917ec681f3Smrg- ci: Ensure that jobs inherting the ci-deqp jobs artifact meson logs
49927ec681f3Smrg- intel/genxml: Free resource before exiting
49937ec681f3Smrg- intel/compiler: Free resources on test teardown
49947ec681f3Smrg- virgl: update headers
49957ec681f3Smrg- virgl: Return total video memory if available
49967ec681f3Smrg- virgl: Add support for querying detailed memory info
49977ec681f3Smrg- virgl: Support the ETC1_RGB8 format as virglrenderer supports it
49987ec681f3Smrg
49997ec681f3SmrgRoman Stratiienko (1):
50007ec681f3Smrg
50017ec681f3Smrg- egl: android: use num_planes param in createImageFromDmaBufs()
50027ec681f3Smrg
50037ec681f3SmrgRuijing Dong (1):
50047ec681f3Smrg
50057ec681f3Smrg- radeon/vcn: release si buffer for encoding at the end.
50067ec681f3Smrg
50077ec681f3SmrgRyan Neph (1):
50087ec681f3Smrg
50097ec681f3Smrg- Revert "virgl: fix BGRA emulation artifacts during window resize"
50107ec681f3Smrg
50117ec681f3SmrgSagar Ghuge (7):
50127ec681f3Smrg
50137ec681f3Smrg- anv: Invalidate the correct AUX-TT entry
50147ec681f3Smrg- anv: Skip CCS ambiguate which preceed fast-clears
50157ec681f3Smrg- intel/mi_builder: Added support for command streamer shift operations
50167ec681f3Smrg- anv: Add anv_memregion structure
50177ec681f3Smrg- Revert "Revert "blorp/gen12: Don't use aux address if implicit CCS""
50187ec681f3Smrg- intel/blorp: Fix condition to figure out aux_address
50197ec681f3Smrg- anv: Set correct binding table entry count
50207ec681f3Smrg
50217ec681f3SmrgSamuel Iglesias Gonsálvez (9):
50227ec681f3Smrg
50237ec681f3Smrg- turnip: disable UBWC on Z24_S8 MSAA images on A630
50247ec681f3Smrg- turnip: set sparseAddressSpaceSize to zero
50257ec681f3Smrg- turnip: fix UINT64_MAX size wrapping in tu_GetBufferMemoryRequirements()
50267ec681f3Smrg- turnip: fix resolve MSAA D24_UNORM_S8_UINT image to S8_UINT
50277ec681f3Smrg- turnip: fix resolve MSAA D32_SFLOAT_S8_UINT image to S8_UINT
50287ec681f3Smrg- util: fix parsing of /proc/meminfo MemAvailable value
50297ec681f3Smrg- turnip: keep track of memory heap usage, size and flags
50307ec681f3Smrg- turnip: VK_EXT_memory_budget implementation
50317ec681f3Smrg- turnip: set depth plane control zmode to A6XX_LATE_Z when sample mask is written
50327ec681f3Smrg
50337ec681f3SmrgSamuel Pitoiset (218):
50347ec681f3Smrg
50357ec681f3Smrg- radv: do not invalidate the L2 metadata cache on compute queues
50367ec681f3Smrg- ci: mark some sparse CTS as expected failures on RAVEN
50377ec681f3Smrg- radv: flush L2 metadata as part of CB/DB flush instead of CS_DONE on GFX9
50387ec681f3Smrg- radv: add a comment explaining the micro tile mode resolve
50397ec681f3Smrg- radv: enable TC-compat HTILE with D32S8 and MSAA on GFX9+
50407ec681f3Smrg- radv: enable TC-compat HTILE for D16S8 on GFX9+
50417ec681f3Smrg- radv: restore invalidating the vector cache for internal meta operations
50427ec681f3Smrg- radv: flush L2 for images affected by the pipe misaligned issue on GFX10+
50437ec681f3Smrg- ci: exclude one CTS test that timeout most of the time for RADV CI
50447ec681f3Smrg- radv: remove redundant check in radv_process_depth_stencil()
50457ec681f3Smrg- radv: remove unnecessary radv_image::tc_compatible_htile
50467ec681f3Smrg- radv: remove redundant check in depth_view_can_fast_clear()
50477ec681f3Smrg- radv: fix a sync issue with geometry shader primitives query on GFX10+
50487ec681f3Smrg- radv: fix overflow when computing the SQTT buffer size
50497ec681f3Smrg- radv: inhibit clock gating when tracing with SQTT
50507ec681f3Smrg- ac/rgp: add support for GFX10.3
50517ec681f3Smrg- ac,radv: add SQTT support on GFX10.3
50527ec681f3Smrg- radv: enable SQTT support on GFX10.3
50537ec681f3Smrg- radv: fix separate depth/stencil layout in render pass
50547ec681f3Smrg- radv: add multi-layer support to FMASK color expand
50557ec681f3Smrg- radv: use the range aspect mask in FMASK color expand
50567ec681f3Smrg- radv: use a workgroup size of 8x8 for FMASK color expand
50577ec681f3Smrg- radv: only decompress the depth/stencil aspect that needs to be resolved
50587ec681f3Smrg- radv: enable sparseImageInt64Atomics/sparseImageFloat32Atomics
50597ec681f3Smrg- radv,aco: fix shifting input VGPRs for the LS VGPR init bug on GFX9
50607ec681f3Smrg- radv: synchronize Cmd{Set,Write}Event() using PS_DONE/CS_DONE events
50617ec681f3Smrg- radv: add support for emitting PS_DONE/CS_DONE on GFX6-8
50627ec681f3Smrg- radv: remove radv_util.h
50637ec681f3Smrg- radv: remove stub() macros
50647ec681f3Smrg- radv: remove unused EMPTY constant in radv_descript_set.c
50657ec681f3Smrg- nir/algebraic: mark more optimization with fsat(NaN) as inexact
50667ec681f3Smrg- ac/surface: store HTILE mip info into the surface
50677ec681f3Smrg- radv: use the image view range when fast clearing depth
50687ec681f3Smrg- radv: check if HTILE is enabled per-level instead of the entire image
50697ec681f3Smrg- radv: do not decompress/resummarize levels without HTILE
50707ec681f3Smrg- radv: remove mipmaps related assertions when initializing HTILE
50717ec681f3Smrg- radv: add support for fast clearing levels of the HTILE buffer
50727ec681f3Smrg- radv: teach radv_htile_enabled() about the number of HTILE levels
50737ec681f3Smrg- radv: enable TC-compat HTILE for mipmaps on GFX10+
50747ec681f3Smrg- radv: re-disable TC-compat HTILE for D32S8 on all generations
50757ec681f3Smrg- radv: fix centroid with VRS coarse shading
50767ec681f3Smrg- radv/winsys: move the initial BO domain to radeon_winsys_bo
50777ec681f3Smrg- radv: prefer CP DMA for GTT buffer copies/clears on dGPUs due to slow PCIe
50787ec681f3Smrg- radv: fix waiting on the last enabled RB for occlusion queries
50797ec681f3Smrg- radv/winsys: use an array for the global BO list instead of a list
50807ec681f3Smrg- radv/winsys: remove the radv_amdgpu_winsys_bo::ws indirection
50817ec681f3Smrg- radv/winsys: remove useless continue preamble CS for IBs path
50827ec681f3Smrg- radv/winsys: remove useless is_local check in radv_amdgpu_cs_add_buffer()
50837ec681f3Smrg- radv/winsys: remove unused radeon_bo_usage enum
50847ec681f3Smrg- radv/winsys: simplify the user fence logic for submission
50857ec681f3Smrg- radv/winsys: remove unused fields in radv_amdgpu_cs_request
50867ec681f3Smrg- radv/winsys: stop zeroing radv_amdgpu_cs_request
50877ec681f3Smrg- radv: use less AMDGPU contexts by creating only one per queue priority
50887ec681f3Smrg- radv: add radeon_winsys_bo::use_global_list
50897ec681f3Smrg- radv: stop using VM_ALWAYS_VALID on APUs
50907ec681f3Smrg- radv/winsys: move the debug_all_bos check outside of the add/del helpers
50917ec681f3Smrg- radv/winsys: set use_global_list to avoid adding a BO twice
50927ec681f3Smrg- radv/winsys: add buffer_make_resident() to the API
50937ec681f3Smrg- radv/winsys: add the resident BOs to the list of BOs at submit time
50947ec681f3Smrg- radv/winsys: enable the global BO list unconditionally
50957ec681f3Smrg- radv: use the global BO list from the winsys
50967ec681f3Smrg- radv: fix printing the debug option names
50977ec681f3Smrg- radv: fix double free when creating a fence failed
50987ec681f3Smrg- radv: stop allocating useless ESGS scratch BO on GFX10+
50997ec681f3Smrg- radv: fix memory leaks if a submission fails
51007ec681f3Smrg- radv: do not overallocate the SQTT buffer
51017ec681f3Smrg- radv: adjust an error message related to the SQTT buffer size
51027ec681f3Smrg- radv: add support for resizing the SQTT buffer automatically
51037ec681f3Smrg- ac/rgp: append the number of seconds to the generated RGP file
51047ec681f3Smrg- radv: emit pipeline bind markers for SQTT
51057ec681f3Smrg- radv: only make the WSI images resident if the global BO list is used
51067ec681f3Smrg- radv/winsys: set use_global_list inside the critical section
51077ec681f3Smrg- radv: only apply the MRT output NaN fixup to non-meta shaders
51087ec681f3Smrg- radv: create the start/stop CS for SQTT dynamically
51097ec681f3Smrg- radv: move SQTT parameters initialization to radv_thread_trace_init()
51107ec681f3Smrg- radv: remove an outdated TODO about SQTT cache flushes
51117ec681f3Smrg- radv: make sure to allocate enough space when emitting SQTT userdata
51127ec681f3Smrg- radv: stop emitting pipeline bind markers
51137ec681f3Smrg- radv: do not allow to capture SQTT on the compute queue
51147ec681f3Smrg- radv: add support for user event markers with SQTT
51157ec681f3Smrg- radv: only emit pipeline bind markers for application pipelines
51167ec681f3Smrg- radv: use the pipeline key as hash for pipeline bind markers
51177ec681f3Smrg- radv: set correct value for OFFCHIP_BUFFERING on GFX10+
51187ec681f3Smrg- radv: make the border color BO a resident buffer
51197ec681f3Smrg- radv: make the trace BO a resident buffer
51207ec681f3Smrg- radv: make the TMA/TBA BOs resident buffers
51217ec681f3Smrg- radv: emit the trap handler registers earlier
51227ec681f3Smrg- radv: rework radv_cmd_buffer_resolve_subpass() a bit
51237ec681f3Smrg- radv: emit missing subpass resolve marker for SQTT
51247ec681f3Smrg- ac/rgp: fill CPU info by parsing /proc/cpuinfo
51257ec681f3Smrg- radv: store a pointer to the code in radv_shader_variant
51267ec681f3Smrg- radv: add support for exporting pipelines with RGP
51277ec681f3Smrg- radv: add support for instruction timing with RGP
51287ec681f3Smrg- radv: do not scale the depth bias for D16_UNORM depth surfaces
51297ec681f3Smrg- include/drm-uapi: bump AMDGPU headers
51307ec681f3Smrg- ac/rgp: recognize more memory types
51317ec681f3Smrg- ac/rgp: report LDS size in CU mode on GFX10+
51327ec681f3Smrg- ac/rgp: report the number of memory operations per clock
51337ec681f3Smrg- ac/rgp: report the number of primitives per clock
51347ec681f3Smrg- radv: remove duplicate REG_INCLUDE_CONTEXT setting for SQTT
51357ec681f3Smrg- radv: always select the first active CU when profiling with SQTT
51367ec681f3Smrg- radv: fix exporting SQTT pipelines with LLVM
51377ec681f3Smrg- radv: exclude perf counters for SQTT also on GFX10.3
51387ec681f3Smrg- Revert "radv: do not overallocate the SQTT buffer"
51397ec681f3Smrg- radeonsi,radv: do not overallocate the SQTT buffer size
51407ec681f3Smrg- radv: remove useless decompression of the DS resolve attachment
51417ec681f3Smrg- radv: do not trace inactive shader engines with SQTT
51427ec681f3Smrg- ac/sqtt: fix determining if the trace is complete on GFX10+
51437ec681f3Smrg- radv: double the SQTT buffer size when it is resized
51447ec681f3Smrg- radv: trigger a new SQTT capture automatically after resizing the buffer
51457ec681f3Smrg- radv: bump the initial SQTT buffer size to 32MB per SE
51467ec681f3Smrg- radv: fix RGP barrier layout transition for TC-compatible CMASK images
51477ec681f3Smrg- Revert "radv: stop using VM_ALWAYS_VALID on APUs"
51487ec681f3Smrg- radv: cleanup enabling TC-compat HTILE for depth surfaces
51497ec681f3Smrg- radv: remove useless check about mips+layers for TC-compat HTILE images
51507ec681f3Smrg- radv: skip useless FCE when fast-clearing MSAA images with DCC enabled
51517ec681f3Smrg- radv: re-enable TC-compat HTILE for MSAA D32S8 images on GFX9+
51527ec681f3Smrg- radv: do not declare push constants for DCC decompress on compute
51537ec681f3Smrg- radv: check if dynamic VRS state changed
51547ec681f3Smrg- radv: check if dynamic line stipple state changed
51557ec681f3Smrg- radv: disable sampling with VK_FORMAT_R64_SFLOAT
51567ec681f3Smrg- radv: fix meta save/restore state with non renderable images
51577ec681f3Smrg- radv: fix potential clears with non renderable images on GFX9+
51587ec681f3Smrg- radv: fix initialization of disable_compression when clearing color image
51597ec681f3Smrg- radv: add missing SQTT events for copy_commands2/create_renderpass2
51607ec681f3Smrg- radv: remove useless DCC disable check for 3D images on GFX10+
51617ec681f3Smrg- radv: rework radv_use_dcc_for_image() a bit
51627ec681f3Smrg- vulkan: add missing vk_shader_module.c/h includes to Makefile
51637ec681f3Smrg- radv: use common entrypoints for VK_KHR_copy_commands2
51647ec681f3Smrg- radv: do not enable TC-compat CMASK if the image isn't readable by a shader
51657ec681f3Smrg- radv: remove redundant check when enabling TC-compat CMASK
51667ec681f3Smrg- radv: make sure FMASK is enabled for TC-compat CMASK
51677ec681f3Smrg- radv: only configure the CMASK tiling for TC-compat on GFX8
51687ec681f3Smrg- radv: initialize TC-compat CMASK images with the DCC clear code
51697ec681f3Smrg- radv: enable TC-compat CMASK on GFX10+
51707ec681f3Smrg- radv: add notccompatcmask debug option
51717ec681f3Smrg- radv: extend the dirty bits to 64-bit
51727ec681f3Smrg- ac/surface: init CMASK slice size on GFX9+
51737ec681f3Smrg- radv: fix clearing CMASK layers on GFX9+
51747ec681f3Smrg- radv: initialize CMASK with correct clear codes
51757ec681f3Smrg- radv: restore previous MRT CB_SHADER_MASK logic
51767ec681f3Smrg- radv: gather if the FS uses perspective or linear interpolations
51777ec681f3Smrg- radv: determine if a pipeline is candidate for flat shading
51787ec681f3Smrg- radv: enable VRS 2x2 coarse shading for flat shading on GFX10.3+
51797ec681f3Smrg- radv: add RADV_DEBUG=novrsflatshading option
51807ec681f3Smrg- ci: update list of expected CTS failures for RADV
51817ec681f3Smrg- vulkan: add common entrypoints for VK_KHR_create_renderpass2
51827ec681f3Smrg- radv: use common entrypoints for VK_KHR_create_renderpass2
51837ec681f3Smrg- turnip: use common entrypoints for VK_KHR_create_renderpass2
51847ec681f3Smrg- lavapipe: use common entrypoints for VK_KHR_create_renderpass2
51857ec681f3Smrg- anv: use common entrypoints for VK_KHR_create_renderpass2
51867ec681f3Smrg- radv: report that degenerated triangles are not culled
51877ec681f3Smrg- radv: require DRM 3.35+
51887ec681f3Smrg- ac/surface: do not allocate FMASK or CMASK for stencil-only surfaces on GFX9+
51897ec681f3Smrg- radv: do not fixup DCC after compute color resolves if DCC stores enabled
51907ec681f3Smrg- radv: only set WRITE_COMPRESS_ENABLE for storage image descriptors
51917ec681f3Smrg- radv: use a sampled image descriptor for reads for the MSAA color decompress
51927ec681f3Smrg- radv: compress FMASK for all layouts except GENERAL
51937ec681f3Smrg- radv: cleanup FMASK expand transitions
51947ec681f3Smrg- radv: do not force enable FMASK during MSAA blits
51957ec681f3Smrg- radv: use COLOR_ATTACHMENT_OPTIMAL for fast clear/hw resolve operations
51967ec681f3Smrg- ac: add ac_get_family_name() helper
51977ec681f3Smrg- radv: change RADV_FORCE_FAMILY to use family name instead of LLVM processor name
51987ec681f3Smrg- radv: try to keep HTILE compressed with DEPTH_STENCIL_READ_ONLY_OPTIMAL
51997ec681f3Smrg- radv: clean up fence syncobj code
52007ec681f3Smrg- ac: add ac_gpu_info::has_image_load_dcc_bug
52017ec681f3Smrg- aco: fix get_sampler_desc() for image loads
52027ec681f3Smrg- aco: implement a workaround for the image load DCC hw bug on GFX10.3
52037ec681f3Smrg- radv: allow DCC for storage images on GFX10.3 with RADV_PERFTEST=dccstores
52047ec681f3Smrg- radv: handle implicit subpass dependencies per attachment
52057ec681f3Smrg- radv: init CMASK/FMASK/DCC in parallel
52067ec681f3Smrg- radv: perform MSAA color decompression for storage images with DCC
52077ec681f3Smrg- radv: enable DCC stores with MSAA 4x/8x on GFX10+
52087ec681f3Smrg- radv: simplify a check when enabling DCC for concurrent images
52097ec681f3Smrg- radv: enable DCC for concurrent images on GFX10
52107ec681f3Smrg- radv: make sure FMASK decompress and FCE are performed on gfx queue
52117ec681f3Smrg- radv: add MSAA support to ClearColorImage() on compute queue
52127ec681f3Smrg- radv: do not clamp framebuffer dimensions to the minimum dimension
52137ec681f3Smrg- radv: add MSAA support to CopyImage() on compute queue
52147ec681f3Smrg- radv: use explicit VRS mode when configuring PA_CL_VRS_CNTL
52157ec681f3Smrg- radv: allow to force VRS rates on GFX10.3 with RADV_FORCE_VRS
52167ec681f3Smrg- radv: fix needed dynamic state for VRS
52177ec681f3Smrg- amd/addrlib: expose HTILE address equations to drivers on GFX10+
52187ec681f3Smrg- ac/surface: rename ac_surface_dcc_address_test.c
52197ec681f3Smrg- ac/surface: add a test of HtileAddrFromCoord prototype outside of addrlib
52207ec681f3Smrg- ac/surface: rename gfx9_dcc_equation to gfx9_meta_equation
52217ec681f3Smrg- ac/surface: increase gfx9_meta_equation::gfx10_bits by 4 elements
52227ec681f3Smrg- ac/surface: copy the HTILE equations to the surface
52237ec681f3Smrg- ac/surface: implement HtileAddrFromCoord in NIR
52247ec681f3Smrg- ac/surface: store the HTILE pitch to the surface
52257ec681f3Smrg- radv: expose R8_UINT as the only supported format for VRS attachments
52267ec681f3Smrg- radv: do not allow MSAA with fragment shading rate attachments
52277ec681f3Smrg- radv: do not enable DCC for fragment shading rate attachments
52287ec681f3Smrg- radv: determine if attachment VRS is enabled
52297ec681f3Smrg- radv: configure the VRS HTILE encoding size
52307ec681f3Smrg- radv: do not use the whole HTILE buffer for depth when VRS is used
52317ec681f3Smrg- radv: update the HTILE clear word when VRS is used
52327ec681f3Smrg- radv: allow HTILE for very small images if VRS attachment is used
52337ec681f3Smrg- radv: create an image for VRS if no depth/stencil attachment is bound
52347ec681f3Smrg- radv: handle the VRS attachment subpass
52357ec681f3Smrg- radv: bind our internal depth buffer when not provided by the app
52367ec681f3Smrg- radv: add support for copying VRS rates into HTILE
52377ec681f3Smrg- radv: copy VRS rates to HTILE when beginning a subpass
52387ec681f3Smrg- radv: configure the VRS combiners when an attachment is used
52397ec681f3Smrg- radv: advertise attachmentFragmentShadingRate on GFX10.3
52407ec681f3Smrg- ac: add missing BUF_DATA_FORMAT_10_11_11 vertex format on GFX10+
52417ec681f3Smrg- radv: keep DCC compressed for clears on compute with image stores
52427ec681f3Smrg- aco: fix opquantize2f16 on GFX6-7
52437ec681f3Smrg- radv: fix fast clearing depth-only or stencil-only aspects with HTILE
52447ec681f3Smrg- radv: fix emitting depth bias when beginning a command buffer
52457ec681f3Smrg- radv: fix emitting default depth bounds state on GFX6
52467ec681f3Smrg- radv/winsys: fix allocating the number of CS in the sysmem path
52477ec681f3Smrg- radv/winsys: fix resetting the number of padded IB words
52487ec681f3Smrg- radv: make sure CP DMA is idle before executing secondary command buffers
52497ec681f3Smrg- radv: fix various CMASK regressions on GFX9
52507ec681f3Smrg- radv: fix computation of the number of user SGPRS for NGG GS state
52517ec681f3Smrg- radv: check if DCC is enabled when resolving different levels
52527ec681f3Smrg- radv/winsys: fix executing huge secondary command buffers on GFX6
52537ec681f3Smrg
52547ec681f3SmrgSerge Martin (1):
52557ec681f3Smrg
52567ec681f3Smrg- clover: return CL_INVALID_VALUE when origin or region are NULL
52577ec681f3Smrg
52587ec681f3SmrgSimon Ser (15):
52597ec681f3Smrg
52607ec681f3Smrg- nouveau/nvc0: fix linear buffer alignment for scan-out/cursors
52617ec681f3Smrg- nouveau/nv50: fix linear buffer alignment for scan-out/cursors
52627ec681f3Smrg- frontends/va: extract pipe format to DRM format mapping
52637ec681f3Smrg- frontends/va: add support for VA_EXPORT_SURFACE_COMPOSED_LAYERS
52647ec681f3Smrg- frontends/va: add pipe to DRM format mapping for NV12 and P010
52657ec681f3Smrg- radeonsi/uvd: make format modifiers-aware
52667ec681f3Smrg- egl: use render node for wl_drm if available
52677ec681f3Smrg- gbm: fail early when modifier list only contains INVALID
52687ec681f3Smrg- gbm: remove fprintf calls in gbm_dri_bo_create
52697ec681f3Smrg- egl/wayland: avoid unnecessary roundtrip when authenticated
52707ec681f3Smrg- gbm: add gbm_bo_get_fd_for_plane
52717ec681f3Smrg- egl: fix software flag in _eglAddDevice call on DRM
52727ec681f3Smrg- egl: only take render nodes into account when listing DRM devices
52737ec681f3Smrg- Revert "egl: Don't add hardware device if there is no render node v2."
52747ec681f3Smrg- radv: fix format feature reporting for modifiers
52757ec681f3Smrg
52767ec681f3SmrgSimon Zeni (1):
52777ec681f3Smrg
52787ec681f3Smrg- egl/dri2: enable EGL_WL_bind_wayland_display in EGL device platform
52797ec681f3Smrg
52807ec681f3SmrgStéphane Marchesin (1):
52817ec681f3Smrg
52827ec681f3Smrg- virgl: Add simple disk cache
52837ec681f3Smrg
52847ec681f3SmrgSureshGuttula (3):
52857ec681f3Smrg
52867ec681f3Smrg- va/picture : Added failure check for stability
52877ec681f3Smrg- frontends/va: Update conditional checks for code stability.
52887ec681f3Smrg- frontends/va : Fix memory leaks incase of error returns
52897ec681f3Smrg
52907ec681f3SmrgTamara Schmitz (1):
52917ec681f3Smrg
52927ec681f3Smrg- util: add mesa_glthread for Valheim in OpenGL mode.
52937ec681f3Smrg
52947ec681f3SmrgTapani Pälli (17):
52957ec681f3Smrg
52967ec681f3Smrg- mesa: add GL_SR8_EXT, GL_SRG8_EXT for color/srgb format queries
52977ec681f3Smrg- intel/perf: cleanup, remove duplicate function declaration
52987ec681f3Smrg- intel/perf: introduce additional ralloc context parameter
52997ec681f3Smrg- i965: use aligned malloc for context instead of ralloc
53007ec681f3Smrg- mesa: add check that non base level attachment is mipmap complete
53017ec681f3Smrg- gitlab-ci: bump piglit commit for windows
53027ec681f3Smrg- anv: toggle on sample shading if it is set in the shader
53037ec681f3Smrg- anv/android: fix compilation failure
53047ec681f3Smrg- anv: fix compilation due to missing vk_format_from_android
53057ec681f3Smrg- mesa: check cube completeness for cube fbo attachments
53067ec681f3Smrg- anv/android: fix image creation with external format
53077ec681f3Smrg- android: add some more stub functions for cross compilation
53087ec681f3Smrg- intel/common: disable batch decoder on Android platform
53097ec681f3Smrg- loader: prefer iris on Android
53107ec681f3Smrg- iris: clamp PointWidth in 3DSTATE_SF like i965 does
53117ec681f3Smrg- egl: support no error attribute set to false with ES 1.1
53127ec681f3Smrg- glx: revert "Downgrade sRGB-ful fbconfigs"
53137ec681f3Smrg
53147ec681f3SmrgThong Thai (2):
53157ec681f3Smrg
53167ec681f3Smrg- frontends/va/config: Fix check for packed header config
53177ec681f3Smrg- radeon: Add cropping to encoded H.265 when padding is used
53187ec681f3Smrg
53197ec681f3SmrgTimothee Chabat (1):
53207ec681f3Smrg
53217ec681f3Smrg- llvmpipe: increase PIPE_CAP_MAX_TEXTURE_BUFFER_SIZE value
53227ec681f3Smrg
53237ec681f3SmrgTimothy Arceri (23):
53247ec681f3Smrg
53257ec681f3Smrg- util/disk_cache: do crc32 check on compressed data for ZSTD
53267ec681f3Smrg- util/disk_cache: move cache path strdup call back into disk_cache.c
53277ec681f3Smrg- util/disk_cache: use a new cache dir for the single file cache feature
53287ec681f3Smrg- util/mesa_sha1: add helper to reconvert sha1 hex strings
53297ec681f3Smrg- util/fossilize_db: add basic fossilize db util to read/write shader caches
53307ec681f3Smrg- util/disk_cache: make use of single file cache when env var set
53317ec681f3Smrg- nir: handle negatives in ffma reassociation optimisation
53327ec681f3Smrg- util/disk_cache: fix crash in fossilize_db
53337ec681f3Smrg- util/disk_cache: move cache tests to the util directory
53347ec681f3Smrg- util/disk_cache: make MESA_DISK_CACHE_READ_ONLY_FOZ_DBS a relative path
53357ec681f3Smrg- Revert "glsl: default to compat shaders in compat profile"
53367ec681f3Smrg- glsl: fix declarations of gl_MaxVaryingFloats
53377ec681f3Smrg- util: create some standalone compression helpers
53387ec681f3Smrg- util/disk_cache: make use of the new compression helpers
53397ec681f3Smrg- util/fossilize_db: remove compression from foz db helper
53407ec681f3Smrg- util/compress: make compression function inputs const
53417ec681f3Smrg- util/disk_cache: separate file reads from cache item validation
53427ec681f3Smrg- util/disk_cache: detangle cache item creation from disk writing
53437ec681f3Smrg- util/disk_cache: add cache item headers to single file cache entries
53447ec681f3Smrg- glsl: add compilation errors for attribute and varying qualifiers
53457ec681f3Smrg- glsl: enforce restrictions on builtin functions moved to compat
53467ec681f3Smrg- mesa: fix incomplete GL_NV_half_float implementation
53477ec681f3Smrg- util: disable glthread in CSGO
53487ec681f3Smrg
53497ec681f3SmrgTimur Kristóf (55):
53507ec681f3Smrg
53517ec681f3Smrg- radv: Only enable sparse features on Polaris and newer.
53527ec681f3Smrg- tgsi_to_nir: Fix uniform ranges.
53537ec681f3Smrg- aco: Fix LDS statistics of tess control shaders.
53547ec681f3Smrg- radv/llvm: Fix reporting LDS stats of tess control shaders.
53557ec681f3Smrg- aco: Disallow LSHS temp-only I/O when VS output is written indirectly.
53567ec681f3Smrg- pan/bi: Use correct enum type for NIR intrinsics.
53577ec681f3Smrg- aco: Use ASSERTED to avoid unused variable warning.
53587ec681f3Smrg- intel/compiler: Use assume() instead of assert() for array bounds.
53597ec681f3Smrg- intel/compiler: Make room for maximum dest size in nir_emit_texture.
53607ec681f3Smrg- anv: Use unreachable() in anv_genX.
53617ec681f3Smrg- anv: Use ASSERTED for results that are only used in asserts.
53627ec681f3Smrg- nir: Add new nir_builder helpers for iadd with no_unsigned_wrap.
53637ec681f3Smrg- nir: Add nir_builder helper for I/O address offset calculations.
53647ec681f3Smrg- nir: Add a few more algebraic optimizations to help address calculation.
53657ec681f3Smrg- nir: Fix unsigned upper bound of local_invocation_index for non-CS stages.
53667ec681f3Smrg- nir: Shrink vectors for load_shared.
53677ec681f3Smrg- nir: Add unsigned upper bound for TCS load_invocation_id.
53687ec681f3Smrg- nir: Add default unsigned upper bound configuration.
53697ec681f3Smrg- nir: Add AMD-specific buffer load/store intrinsics.
53707ec681f3Smrg- nir: Add nir_opt_offsets to fold const adds into load/store offsets.
53717ec681f3Smrg- nir: Add tessellation related AMD-specific intrinsics.
53727ec681f3Smrg- nir: Add AMD-specific Geometry Shader related intrinsics.
53737ec681f3Smrg- aco: Implement new buffer load/store intrinsics.
53747ec681f3Smrg- aco: Implement the new tessellation I/O related NIR intrinsics.
53757ec681f3Smrg- aco: Implement new Geometry Shader intrinsics.
53767ec681f3Smrg- ac/llvm: Implement AMD-specific buffer load/store intrinsics.
53777ec681f3Smrg- ac/llvm: Implement the new tessellation intrinsics.
53787ec681f3Smrg- ac/llvm: Implement new Geometry Shader intrinsics.
53797ec681f3Smrg- ac/llvm: Make shared loads/stores work correctly for non-CS stages.
53807ec681f3Smrg- ac/llvm: Make sure to always emit integer comparison for nir_op_ieq.
53817ec681f3Smrg- ac/llvm: Add constant offset to load/store_shared.
53827ec681f3Smrg- ac/llvm: Emit more efficient code for load_shared.
53837ec681f3Smrg- ac: Add NIR passes to lower VS->TCS->TES I/O to memory accesses.
53847ec681f3Smrg- ac: Add NIR passes to lower ES->GS I/O to memory accesses.
53857ec681f3Smrg- radv: Lower IO and set driver locations earlier.
53867ec681f3Smrg- radv: Save I/O usage data to both shader infos for merged stages.
53877ec681f3Smrg- radv: Calculate tess patches and LDS use outside the backend compilers.
53887ec681f3Smrg- radv: Determine tcs_in_out_eq in radv_pipeline instead of the compiler.
53897ec681f3Smrg- radv: Fill some tess shader info earlier.
53907ec681f3Smrg- radv: Reorder some NIR optimizations in preparation for the I/O changes.
53917ec681f3Smrg- radv: Use new, NIR-based I/O lowering.
53927ec681f3Smrg- radv/llvm: Only store TCS outputs where they are really needed.
53937ec681f3Smrg- radv/llvm: Delete superfluous tess and ESGS I/O code.
53947ec681f3Smrg- aco: Delete superfluous tess and ESGS I/O code.
53957ec681f3Smrg- aco: Fix constant address offset calculation for ds_read2 instructions.
53967ec681f3Smrg- ac/llvm: Fix alignment of shared load intrinsics.
53977ec681f3Smrg- aco: Optimize workgroup exclusive scan to better avoid bank conflicts.
53987ec681f3Smrg- aco: Align NGG scratch size to 16 so a single ds_read can always read it.
53997ec681f3Smrg- aco: Remove useless s_setprio near gs_alloc_req.
54007ec681f3Smrg- aco: Use s_setprio 3 at the beginning of every VS and TES.
54017ec681f3Smrg- aco: Extract ngg_nogs_export_prim_id to a separate function.
54027ec681f3Smrg- aco: Set block_kind_export_end in create_vs/fs_exports.
54037ec681f3Smrg- aco: Emit fewer branches for NGG VS/TES with late primitive export.
54047ec681f3Smrg- aco: Add a simple heuristic to decide early or late primitive export.
54057ec681f3Smrg- aco: Mark VCC clobbered for iadd8 and iadd16 reductions on GFX6-7.
54067ec681f3Smrg
54077ec681f3SmrgTomeu Vizoso (17):
54087ec681f3Smrg
54097ec681f3Smrg- ci: Fix selection of linker in Android builds
54107ec681f3Smrg- ci: Move container files into their own dir
54117ec681f3Smrg- ci: Move out expect files from .gitlab-ci
54127ec681f3Smrg- ci: Disable two radeonsi jobs
54137ec681f3Smrg- Revert "ci/panfrost: disable the rest of these jobs temporarily"
54147ec681f3Smrg- Revert "ci/panfrost: Disable t860/radeonsi testing while the runners are struggling."
54157ec681f3Smrg- Revert "CI: Disable Panfrost T760"
54167ec681f3Smrg- ci: Fix visibility property of LAVA jobs
54177ec681f3Smrg- ci/fdo: Use trimmed traces for Valve games
54187ec681f3Smrg- gallium/dri2: Pass the resource that corresponds to the plane
54197ec681f3Smrg- ci: Use a single template for LAVA jobs
54207ec681f3Smrg- ci: Set more reasonable timeouts for LAVA jobs
54217ec681f3Smrg- ci: Don't retry failed test runs
54227ec681f3Smrg- ci: Disable t720 LAVA jobs
54237ec681f3Smrg- Revert "ci: Disable t720 LAVA jobs"
54247ec681f3Smrg- Revert "ci: Disable panfrost g52"
54257ec681f3Smrg- Revert "ci: Disable panfrost t760"
54267ec681f3Smrg
54277ec681f3SmrgTony Wasserka (34):
54287ec681f3Smrg
54297ec681f3Smrg- aco/ra: Update register use bounds before recursing in get_regs_for_copies
54307ec681f3Smrg- aco/ra: Introduce PhysRegInterval helper class
54317ec681f3Smrg- aco/ra: Conservatively refactor existing code to use PhysRegInterval
54327ec681f3Smrg- aco/ra: Remove always-false conditions
54337ec681f3Smrg- aco/ra: Add iterator interface for PhysRegInterval
54347ec681f3Smrg- aco/ra: Use std::find_if(_not) to clean up get_reg_simple
54357ec681f3Smrg- aco/ra: Use std::all_of to simplify a loop
54367ec681f3Smrg- aco/ra: Conservatively refactor get_reg_specified to use PhysRegInterval
54377ec681f3Smrg- aco/ra: Move commonly repeated code to a helper function
54387ec681f3Smrg- aco/ra: Add helpers to test for intersection/containment of reg intervals
54397ec681f3Smrg- aco/ra: Use std::all_of to simplify a loop
54407ec681f3Smrg- aco/ra: Remove unused function parameter
54417ec681f3Smrg- aco/ra: Use PhysReg for member functions of PhysRegInterval
54427ec681f3Smrg- aco/ra: Use PhysReg when indexing into RegisterFile's containers
54437ec681f3Smrg- aco/ra: Use PhysRegInterval for collect_vars parameters
54447ec681f3Smrg- aco/ra: Use PhysRegInterval for count_zero
54457ec681f3Smrg- aco/ra: Fix print_regs using the wrong constant to check for blocked slots
54467ec681f3Smrg- aco/ra: Fix build with print_regs enabled
54477ec681f3Smrg- aco/ra: Remove preprocessor guards for print_regs
54487ec681f3Smrg- aco/ra: Add helper to get a PhysRegInterval for the register demand
54497ec681f3Smrg- aco: Fix vector::reserve() being called with the wrong size
54507ec681f3Smrg- radv: Fix improper max_index_count argument for indexed draws
54517ec681f3Smrg- ac: Add has_zero_index_buffer_bug to ac_gpu_info
54527ec681f3Smrg- radv: Skip 0-sized index buffers only when necessary
54537ec681f3Smrg- aco/ra: Avoid unnecessary copying of std::vectors
54547ec681f3Smrg- aco/isel: Don't emit unsupported i16<->f16 conversion opcodes on GFX6/7
54557ec681f3Smrg- aco/isel: Fix i64/u64->float32 conversion for large inputs
54567ec681f3Smrg- aco/isel: Don't request sign extension when truncating signed integers
54577ec681f3Smrg- aco/isel: Add documentation and asserts for convert_int
54587ec681f3Smrg- aco/isel: Fix large inputs being truncated in int32->f16 conversions
54597ec681f3Smrg- aco/isel: Add documentation for (u)int64->f16 conversion
54607ec681f3Smrg- ci: skip pipeline_barrier tests that currently crash on RADV
54617ec681f3Smrg- gitlab: rename RADV bug report template
54627ec681f3Smrg- aco/spill: Fix improper handling of exec phis
54637ec681f3Smrg
54647ec681f3SmrgVasily Khoruzhick (10):
54657ec681f3Smrg
54667ec681f3Smrg- lima: add precompile debug flag
54677ec681f3Smrg- lima/ppir: don't use list_length() in loop in regalloc and liveness analysis
54687ec681f3Smrg- lima: update dEQP fails and skips lists
54697ec681f3Smrg- lima: relax checks of imported BO
54707ec681f3Smrg- lima: rename \*_shader_state to \*_compiled_shader
54717ec681f3Smrg- lima: rename lima_{fs,vs}_bind_state to lima_{fs,vs}_uncompiled_shader
54727ec681f3Smrg- lima: implement shader disk cache
54737ec681f3Smrg- lima: compute nir_sha1 for shader key even if disk cache is disabled
54747ec681f3Smrg- lima: use passed surface to get mipmap level for reload, not cbuf
54757ec681f3Smrg- lima: limit number of draws per job
54767ec681f3Smrg
54777ec681f3SmrgVinson Lee (31):
54787ec681f3Smrg
54797ec681f3Smrg- panfrost: Fix typos.
54807ec681f3Smrg- nouveau: Fix typos.
54817ec681f3Smrg- nv50/ir: Initialize DataArray members in constructor.
54827ec681f3Smrg- r600/sfn: Remove StoreMerger unused member b.
54837ec681f3Smrg- nv50/ir: Add InsertConstraintsPass constructor.
54847ec681f3Smrg- nv50/ir: Initialize CodeEmitter members in constructor.
54857ec681f3Smrg- nv50/ir: Initialize RegAlloc member func in constructor.
54867ec681f3Smrg- clover: Add constructor for global_argument.
54877ec681f3Smrg- lima: Fix typos.
54887ec681f3Smrg- v3dv: Fix assert.
54897ec681f3Smrg- nvc0/ir: Initialize NVC0LoweringPass member gpEmitAddress.
54907ec681f3Smrg- nvc0/ir: Initialize SchedDataCalculator members in constructor.
54917ec681f3Smrg- nv50/ir: Initialize BindArgumentsPass member sub in constructor.
54927ec681f3Smrg- virgl: Convert errno to string.
54937ec681f3Smrg- r600/sfn: Initialize FragmentShaderFromNir member m_pos_input.
54947ec681f3Smrg- etnaviv: Fix memory leak in etna_vertex_elements_state_create.
54957ec681f3Smrg- nv50/ir: Initialize ValueDef member origin in constructors.
54967ec681f3Smrg- nv50/ir: Initialize Instruction members.
54977ec681f3Smrg- aco: Initialize ds_state.front.writeMask.
54987ec681f3Smrg- r600: Fix typos.
54997ec681f3Smrg- llvmpipe: Fix typos.
55007ec681f3Smrg- nir/lower_tex: Change coord type to int.
55017ec681f3Smrg- gv100/ir: Initialize CodeEmitterGV100 members in constructor.
55027ec681f3Smrg- zink: Remove leftover dead code.
55037ec681f3Smrg- nv50/ir: Add constructor for NV50LegalizePostRA.
55047ec681f3Smrg- iris: Fix typos.
55057ec681f3Smrg- clover: Add constructor for sampler_argument.
55067ec681f3Smrg- ac: Fix emit_split_buffer_store modulus operation.
55077ec681f3Smrg- freedreno: Fix file descriptor leak.
55087ec681f3Smrg- glsl: Initialize parcel_out_uniform_storage members.
55097ec681f3Smrg- Remove leftover dead code.
55107ec681f3Smrg
55117ec681f3SmrgVíctor Manuel Jáquez Leal (1):
55127ec681f3Smrg
55137ec681f3Smrg- frontends/va/context: don't set max_references with num_render_targets
55147ec681f3Smrg
55157ec681f3SmrgWitold Baryluk (3):
55167ec681f3Smrg
55177ec681f3Smrg- lavapipe: Defer lavapipe warning to CreateDevice
55187ec681f3Smrg- util: Use explicit relaxed reads for u_queue
55197ec681f3Smrg- radv: memset the alignment hole in cache_entry to 0
55207ec681f3Smrg
55217ec681f3SmrgXin He (1):
55227ec681f3Smrg
55237ec681f3Smrg- virgl: use atomic operations when increase sub_ctx_id
55247ec681f3Smrg
55257ec681f3SmrgYannik Marek (1):
55267ec681f3Smrg
55277ec681f3Smrg- turnip: fix alpha to coverage in no color and unused attachment cases
55287ec681f3Smrg
55297ec681f3SmrgYevhenii Kharchenko (2):
55307ec681f3Smrg
55317ec681f3Smrg- st/mesa: fix PBO download for TEXTURE_1D_ARRAY textures
55327ec681f3Smrg- intel/compiler: remove unused member 'input_vue_map'
55337ec681f3Smrg
55347ec681f3SmrgYevhenii Kolesnikov (3):
55357ec681f3Smrg
55367ec681f3Smrg- iris: only set point sprite overrides if actually using points
55377ec681f3Smrg- nir/from_ssa: consider defs in sibling blocks
55387ec681f3Smrg- nir/from_ssa: don't check for interference within the same set
55397ec681f3Smrg
55407ec681f3SmrgYiwei Zhang (3):
55417ec681f3Smrg
55427ec681f3Smrg- venus: properly enable WSI for different platforms
55437ec681f3Smrg- venus: bring up Android support
55447ec681f3Smrg- venus: implement vn_debug_init_once with os_get_option
55457ec681f3Smrg
55467ec681f3SmrgYogesh Mohan Marimuthu (7):
55477ec681f3Smrg
55487ec681f3Smrg- ac/rgp: add ac_msgpack.h/c
55497ec681f3Smrg- ac/rgp: add rgp co, col, pso data structures
55507ec681f3Smrg- ac/rgp: add helper function to write rgp elf oject
55517ec681f3Smrg- ac/rgp: expose data structure to populate co, col, pso database
55527ec681f3Smrg- ac/rgp,radeonsi,radv: pass struct thread_trace_data to ac_sqtt_dump_data()
55537ec681f3Smrg- ac/rgp: dump co, col, pso database to rgp profile file
55547ec681f3Smrg- ac/rgp: set gfxip in elf_hdr.e_flags
55557ec681f3Smrg
55567ec681f3Smrgchenli (1):
55577ec681f3Smrg
55587ec681f3Smrg- mesa: update oudated members for debug and check
55597ec681f3Smrg
55607ec681f3Smrgcheyang (3):
55617ec681f3Smrg
55627ec681f3Smrg- frontend/dri: fix doesn't support RGBA ordering still expose RGBA in config
55637ec681f3Smrg- glsl: redeclare built-in variable with separate shader
55647ec681f3Smrg- virgl: add astc 2d compressed formats
5565