17ec681f3SmrgMesa 20.0.0 Release Notes / 2020-02-19 27ec681f3Smrg====================================== 37ec681f3Smrg 47ec681f3SmrgMesa 20.0.0 is a new development release. People who are concerned with 57ec681f3Smrgstability and reliability should stick with a previous release or wait 67ec681f3Smrgfor Mesa 20.0.1. 77ec681f3Smrg 87ec681f3SmrgMesa 20.0.0 implements the OpenGL 4.6 API, but the version reported by 97ec681f3SmrgglGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / 107ec681f3SmrgglGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being 117ec681f3Smrgused. Some drivers don't support all the features required in OpenGL 127ec681f3Smrg4.6. OpenGL 4.6 is **only** available if requested at context creation. 137ec681f3SmrgCompatibility contexts may report a lower version depending on each 147ec681f3Smrgdriver. 157ec681f3Smrg 167ec681f3SmrgMesa 20.0.0 implements the Vulkan 1.2 API, but the version reported by 177ec681f3Smrgthe apiVersion property of the VkPhysicalDeviceProperties struct depends 187ec681f3Smrgon the particular driver being used. 197ec681f3Smrg 207ec681f3SmrgSHA256 checksum 217ec681f3Smrg--------------- 227ec681f3Smrg 237ec681f3Smrg:: 247ec681f3Smrg 257ec681f3Smrg bb6db3e54b608d2536d4000b3de7dd3ae115fc114e8acbb5afff4b3bbed04b34 mesa-20.0.0.tar.xz 267ec681f3Smrg 277ec681f3SmrgNew features 287ec681f3Smrg------------ 297ec681f3Smrg 307ec681f3Smrg- OpenGL 4.6 on radeonsi. 317ec681f3Smrg- GL_ARB_gl_spirv on radeonsi. 327ec681f3Smrg- GL_ARB_spirv_extensions on radeonsi. 337ec681f3Smrg- GL_EXT_direct_state_access for compatibility profile. 347ec681f3Smrg- VK_AMD_device_coherent_memory on RADV. 357ec681f3Smrg- VK_AMD_mixed_attachment_samples on RADV. 367ec681f3Smrg- VK_AMD_shader_explicit_vertex_parameter on RADV. 377ec681f3Smrg- VK_AMD_shader_image_load_store_lod on RADV. 387ec681f3Smrg- VK_AMD_shader_fragment_mask on RADV. 397ec681f3Smrg- VK_EXT_subgroup_size_control on RADV/LLVM. 407ec681f3Smrg- VK_KHR_separate_depth_stencil_layouts on Intel, RADV. 417ec681f3Smrg- VK_KHR_shader_subgroup_extended_types on RADV. 427ec681f3Smrg- VK_KHR_swapchain_mutable_format on RADV. 437ec681f3Smrg- VK_KHR_shader_float_controls on RADV/ACO. 447ec681f3Smrg- GFX6 (Southern Islands) and GFX7 (Sea Islands) support on RADV/ACO. 457ec681f3Smrg- Wave32 support for GFX10 (Navi) on RADV/ACO. 467ec681f3Smrg- Compilation of Geometry Shaders on RADV/ACO. 477ec681f3Smrg- Vulkan 1.2 on Intel, RADV. 487ec681f3Smrg- GL_INTEL_shader_integer_functions2 and 497ec681f3Smrg VK_INTEL_shader_integer_functions2 on Intel. 507ec681f3Smrg 517ec681f3SmrgBug fixes 527ec681f3Smrg--------- 537ec681f3Smrg 547ec681f3Smrg- drisw crashes on calling NULL putImage on EGL surfaceless platform 557ec681f3Smrg (pbuffer EGLSurface) 567ec681f3Smrg- [radeonsi][vaapi][bisected] invalid VASurfaceID when playing 577ec681f3Smrg interlaced DVB stream in Kodi 587ec681f3Smrg- [RADV] GPU hangs while the cutscene plays in the game Assassin's 597ec681f3Smrg Creed Origins 607ec681f3Smrg- ACO: The Elder Scrolls Online crashes on startup (Navi) 617ec681f3Smrg- Broken rendering of glxgears on S/390 architecture (64bit, BigEndian) 627ec681f3Smrg- aco: sun flickering with Assassins Creeds Origins 637ec681f3Smrg- !1896 broke ext_image_dma_buf_import piglit tests with radeonsi 647ec681f3Smrg- aco: wrong geometry with Assassins Creed Origins on GFX6 657ec681f3Smrg- valgrind errors since commit a8ec4082a41 667ec681f3Smrg- OSMesa osmesa_choose_format returns a format not supported by 677ec681f3Smrg st_new_renderbuffer_fb 687ec681f3Smrg- Build error with VS on WIN 697ec681f3Smrg- Using EGL_KHR_surfaceless_context causes spurious "libEGL warning: 707ec681f3Smrg FIXME: egl/x11 doesn't support front buffer rendering." 717ec681f3Smrg- !3460 broke texsubimage test with piglit on zink+anv 727ec681f3Smrg- The screen is black when using ACO 737ec681f3Smrg- [Regression] JavaFX unbounded VRAM+RAM usage 747ec681f3Smrg- radv: implement VK_AMD_shader_explicit_vertex_parameter 757ec681f3Smrg- Civilization VI crashes when loading game (AMD Vega Mobile) 767ec681f3Smrg- [radeonsi] X-Server crashes when trying to start Guild Wars 2 with 777ec681f3Smrg the commits from !3421 787ec681f3Smrg- aco: implement GFX6 support 797ec681f3Smrg- Add support for VK_KHR_swapchain_mutable_format 807ec681f3Smrg- radv: The Surge 2 crashes in ac_get_elem_bits() 817ec681f3Smrg- [Regression] JavaFX unbounded VRAM+RAM usage 827ec681f3Smrg- Use the OpenCL dispatch defnitions from OpenCL_Headers 837ec681f3Smrg- [regression][ilk,g965,g45] various dEQP-GLES2.functional.shaders.\* 847ec681f3Smrg failures 857ec681f3Smrg- aco: Dead Rising 4 crashes in lower_to_hw_instr() on GFX6-GFX7 867ec681f3Smrg- libvulkan_radeon.so crash with \`free(): double free detected in 877ec681f3Smrg tcache 2\` 887ec681f3Smrg- Commit be08e6a causes crash in com.android.launcher3 (Launcher) 897ec681f3Smrg- anv: Regression causing issues for radv when there are no Intel 907ec681f3Smrg devices 917ec681f3Smrg- Mesa no longer compiles with GCC 10 927ec681f3Smrg- [Navi/aco] Guild Wars 2 - ring gfx timeout with commit 3bca0af2 937ec681f3Smrg- [radv/aco] Regression is causing a soft crash in The Witcher 3 947ec681f3Smrg- [bisected] [radeonsi] GPU hangs/resets while playing interlaced 957ec681f3Smrg content on Kodi with VAAPI 967ec681f3Smrg- [radeonsi] MSAA image not copied properly after image store through 977ec681f3Smrg texture view 987ec681f3Smrg- T-Rex and Manhattan onscreen performance issue on Android 997ec681f3Smrg- VkSamplerCreateInfo compareEnable not respected 1007ec681f3Smrg- VkSamplerCreateInfo compareEnable not respected 1017ec681f3Smrg- Freedreno drm softpin driver implementation leaks memory 1027ec681f3Smrg- [POLARIS10] VRAM leak involving glTexImage2D with non-NULL data 1037ec681f3Smrg argument 1047ec681f3Smrg- [regression][bisected][ivb/byt] crucible test 1057ec681f3Smrg func.push-constants.basic.q0 causes gpu hang 1067ec681f3Smrg- MR 3096 broke lots of piglit ext_framebuffer_object tests on Raven 1077ec681f3Smrg- Rise of the Tomb Raider benchmark crash on Dell XPS 7390 2-in-1 w/ 1087ec681f3Smrg Iris Plus Graphics (Ice Lake 8x8 GT2) 1097ec681f3Smrg- Raven Ridge (2400G): Resident Evil 2 crashes my machine 1107ec681f3Smrg- Common practice of glGetActiveUniform leads to O(N²) behavior in Mesa 1117ec681f3Smrg- Rocket League ingame artifacts 1127ec681f3Smrg- [radv] SteamVR direct mode no longer works 1137ec681f3Smrg- [ANV] unused create parameters not properly ignored 1147ec681f3Smrg- [Bisected] Mesa fails to start alacritty with the wayland backend 1157ec681f3Smrg (AMD Vega). 1167ec681f3Smrg- [iris] piglit test clip-distance-vs-gs-out fails due to VUE map 1177ec681f3Smrg mismatch between VS <-> GS stages 1187ec681f3Smrg- [radv] SteamVR direct mode no longer works 1197ec681f3Smrg- Blocky corruption in The Surge 2 1207ec681f3Smrg- radeonsi: Floating point exception on R9 270 gpu for a set of traces 1217ec681f3Smrg- [RADV] [Navi] LOD artifacting in Halo - The Master Chief Collection 1227ec681f3Smrg (Halo Reach) 1237ec681f3Smrg- [CTS] 1247ec681f3Smrg dEQP-VK.api.image_clearing.core.clear_color_image.2d.linear.single_layer.r32g32b32\_\* 1257ec681f3Smrg fail on GFX6-GFX8 1267ec681f3Smrg- Vulkan: Please consider adding another sample count to 1277ec681f3Smrg sampledImageIntegerSampleCounts 1287ec681f3Smrg- Navi10: Bitrate based encoding with VAAPI/RadeonSI unusable 1297ec681f3Smrg- [RADV] create parameters not properly ignored 1307ec681f3Smrg- [regression][bdw,gen9,hsw,icl][iris] gltcs failures on 1317ec681f3Smrg mesa=8172b1fa03f 1327ec681f3Smrg- Bugs in RadeonSI VAAPI implementation 1337ec681f3Smrg- [GFX10] Glitch rendering Custom Avatars in Beat Saber 1347ec681f3Smrg- intel/fs: Check for 16-bit immediates in 1357ec681f3Smrg fs_visitor::lower_mul_dword_inst is too strict 1367ec681f3Smrg- i965/iris: assert when destroy GL context with active query 1377ec681f3Smrg- Visuals without alpha bits are not sRGB-capable 1387ec681f3Smrg- swapchain throttling: wait for fence has 1ns timeout 1397ec681f3Smrg- radeonsi: OpenGL app always produces page fault in gfxhub on Navi 10 1407ec681f3Smrg- [regression] 1417ec681f3Smrg KHR-GLES31.core.geometry_shader.api.program_pipeline_vs_gs_capture 1427ec681f3Smrg fails for various drivers 1437ec681f3Smrg- [CTS] 1447ec681f3Smrg dEQP-VK.spirv_assembly.instruction.spirv1p4.entrypoint.tess_con_pc_entry_point 1457ec681f3Smrg hangs on GFX10 1467ec681f3Smrg- [RADV] SPIR-V warning when compiling shader using storage 1477ec681f3Smrg multisampled image array 1487ec681f3Smrg- [RADV] The Dead Rising 4 is causing a GPU hang with LLVM backend 1497ec681f3Smrg- macOS u_thread.h:156:4: error: implicit declaration of function 1507ec681f3Smrg 'pthread_getcpuclockid' 1517ec681f3Smrg- [Wine / Vulkan] Doom 2016 Hangs on Main Menu 1527ec681f3Smrg- NULL resource when playing VP9 video through VDPAU on RX 570 1537ec681f3Smrg- radeonsi: mpv --vo=vaapi incorrect rendering on gfx9+ 1547ec681f3Smrg- [BSW/BDW] skia lcdblendmode & lcdoverlap test failure 1557ec681f3Smrg- Create a way to prefer iris vs i965 via driconf 1567ec681f3Smrg- [Bisected] i965: CS:GO crashes in emit_deref_copy_load_store with 1577ec681f3Smrg debug Mesa 1587ec681f3Smrg- radv/aco Jedi Fallen Order hair rendering buggy 1597ec681f3Smrg- Inaccurate information on https://www.mesa3d.org/repository.html 1607ec681f3Smrg about how to get git write access. 1617ec681f3Smrg- [RADV] VK_KHR_timeline_semaphore balloons in runtime 1627ec681f3Smrg- Shadow of Mordor has randomly dancing black shadows on Talion's face 1637ec681f3Smrg- gen7 crucible failures func.push-constants.basic.q0 and 1647ec681f3Smrg func.shader-subgroup-vote.basic.q0 1657ec681f3Smrg- GL_EXT_disjoint_timer_query failing with GL_INVALID_ENUM 1667ec681f3Smrg- Unreal 4 Elemental and MatineeFightScene demos misrender 1677ec681f3Smrg- gputest gimark has unwanted black liquorice flakes 1687ec681f3Smrg- triangle strip clipping with GL_FIRST_VERTEX_CONVENTION causes wrong 1697ec681f3Smrg vertex's attribute to be broadcasted for flat interpolation 1707ec681f3Smrg- [bisected][regression][g45,g965,ilk] piglit arb_fragment_program kil 1717ec681f3Smrg failures 1727ec681f3Smrg- glcts crashes since the enablement of ARB_shading_language_include 1737ec681f3Smrg- Android build broken 1747ec681f3Smrg- ld.lld: error: duplicate symbol (mesa-19.3.0-rc1) 1757ec681f3Smrg- Divinity: Original Sin Enhanced Edition(Native) crash on start 1767ec681f3Smrg- HSW. Tropico 6 and SuperTuxKart have shadows flickering 1777ec681f3Smrg- GL_EXT_disjoint_timer_query failing with GL_INVALID_ENUM 1787ec681f3Smrg- glxgears segfaults on POWER / Xvnc 1797ec681f3Smrg- [regression][bdw,gen9,icl][iris] piglit failures on mesa 1807ec681f3Smrg f9fd04aca15fd00889caa666ba38007268e67f5c 1817ec681f3Smrg- Redundant builds of libmesa_classic and libmesa_gallium 1827ec681f3Smrg- [IVB,BYT] [Regression] [Bisected] Core dump at launching 1837ec681f3Smrg arb_compute_shader/linker/bug-93840.shader_test 1847ec681f3Smrg- Vulkan drivers need access to format utils of gallium 1857ec681f3Smrg- Disabling lower_fragdata_array causes shader-db to crash for some 1867ec681f3Smrg drivers 1877ec681f3Smrg- GL_EXT_disjoint_timer_query failing with GL_INVALID_ENUM 1887ec681f3Smrg- Android build broken by commit 9020f51 "util/u_endian: Add error 1897ec681f3Smrg checks" 1907ec681f3Smrg- radv secure compile feature breaks compilation of RADV on armhf EABI 1917ec681f3Smrg (19.3-rc1) 1927ec681f3Smrg- radv_debug.c warnings when compiling on 32 bits : cast to pointer 1937ec681f3Smrg from integer of different size 1947ec681f3Smrg- Meson: Mesa3D build failure with standalone Mingw-w64 multilib 1957ec681f3Smrg- [regression][bisected] KHR46 VertexArrayAttribFormat has unexpectedly 1967ec681f3Smrg generated GL_INVALID_OPERATION 1977ec681f3Smrg- textureSize(samplerExternalOES, int) missing in desktop mesa 19.1.7 1987ec681f3Smrg implementation 1997ec681f3Smrg- zink: implicly casting integers to pointers, warnings on 32-bit 2007ec681f3Smrg compile 2017ec681f3Smrg- Objects leaving trails in Firefox with antialias and 2027ec681f3Smrg preserveDrawingBuffer in three.js WebGLRednerer with mesa 19.2 2037ec681f3Smrg 2047ec681f3SmrgChanges 2057ec681f3Smrg------- 2067ec681f3Smrg 2077ec681f3SmrgAaron Watry (1): 2087ec681f3Smrg 2097ec681f3Smrg- clover/llvm: fix build after llvm 10 commit 1dfede3122ee 2107ec681f3Smrg 2117ec681f3SmrgAdam Jackson (1): 2127ec681f3Smrg 2137ec681f3Smrg- drisw: Cache the depth of the X drawable 2147ec681f3Smrg 2157ec681f3SmrgAfonso Bordado (4): 2167ec681f3Smrg 2177ec681f3Smrg- pan/midgard: Optimize comparisions with similar operations 2187ec681f3Smrg- pan/midgard: Move midgard_is_branch_unit to helpers 2197ec681f3Smrg- pan/midgard: Optimize branches with inverted arguments 2207ec681f3Smrg- pan/midgard: Fix midgard_compile.h includes 2217ec681f3Smrg 2227ec681f3SmrgAlan Coopersmith (1): 2237ec681f3Smrg 2247ec681f3Smrg- intel/perf: adapt to platforms like Solaris without d_type in struct 2257ec681f3Smrg dirent 2267ec681f3Smrg 2277ec681f3SmrgAlejandro Piñeiro (4): 2287ec681f3Smrg 2297ec681f3Smrg- v3d: adds an extra MOV for any sig.ld\* 2307ec681f3Smrg- mesa/main/util: moving gallium u_mm to util, remove main/mm 2317ec681f3Smrg- nir/opt_peephole_select: remove unused variables 2327ec681f3Smrg- turnip: remove unused descriptor state dirty 2337ec681f3Smrg 2347ec681f3SmrgAlexander van der Grinten (1): 2357ec681f3Smrg 2367ec681f3Smrg- egl: Fix \_eglPointerIsDereferencable w/o mincore() 2377ec681f3Smrg 2387ec681f3SmrgAlexander von Gluck IV (1): 2397ec681f3Smrg 2407ec681f3Smrg- haiku/hgl: Fix build via header reordering 2417ec681f3Smrg 2427ec681f3SmrgAlyssa Rosenzweig (223): 2437ec681f3Smrg 2447ec681f3Smrg- pipe-loader: Build kmsro loader for with all kmsro targets 2457ec681f3Smrg- pan/midgard: Remove OP_IS_STORE_VARY 2467ec681f3Smrg- pan/midgard: Add a dummy source for loads 2477ec681f3Smrg- pan/midgard: Refactor swizzles 2487ec681f3Smrg- pan/midgard: Eliminate blank_alu_src 2497ec681f3Smrg- pan/midgard: Use fp32 blend shaders 2507ec681f3Smrg- pan/midgard: Validate tags when branching 2517ec681f3Smrg- pan/midgard: Fix quadword_count handling 2527ec681f3Smrg- pan/midgard: Compute bundle interference 2537ec681f3Smrg- pan/midgard: Add bizarre corner case 2547ec681f3Smrg- pan/midgard: offset_swizzle doesn't need dstsize 2557ec681f3Smrg- pan/midgard: Extend offset_swizzle to non-32-bit 2567ec681f3Smrg- pan/midgard: Extend swizzle packing for vec4/16-bit 2577ec681f3Smrg- pan/midgard: Extend default_phys_reg to !32-bit 2587ec681f3Smrg- panfrost/ci: Update T760 expectations 2597ec681f3Smrg- pan/midgard: Fix printing of half-registers in texture ops 2607ec681f3Smrg- pan/midgard: Disassemble half-steps correctly 2617ec681f3Smrg- pan/midgard: Pass shader stage to disassembler 2627ec681f3Smrg- pan/midgard: Switch base for vertex texturing on T720 2637ec681f3Smrg- nir: Add load_output_u8_as_fp16_pan intrinsic 2647ec681f3Smrg- pan/midgard: Identify ld_color_buffer_u8_as_fp16\* 2657ec681f3Smrg- pan/midgard: Implement nir_intrinsic_load_output_u8_as_fp16_pan 2667ec681f3Smrg- pan/midgard: Pack load/store masks 2677ec681f3Smrg- panfrost: Select format-specific blending intrinsics 2687ec681f3Smrg- pan/midgard: Add blend shader selection bits for MRT 2697ec681f3Smrg- pan/midgard: Implement linearly-constrained register allocation 2707ec681f3Smrg- pan/midgard: Integrate LCRA 2717ec681f3Smrg- pan/midgard: Remove util/ra support 2727ec681f3Smrg- pan/midgard: Compute spill costs 2737ec681f3Smrg- pan/lcra: Use Chaitin's spilling heuristic 2747ec681f3Smrg- pan/midgard: Copypropagate vector creation 2757ec681f3Smrg- pan/midgard: Fix copypropagation for textures 2767ec681f3Smrg- pan/midgard: Generalize texture registers across GPUs 2777ec681f3Smrg- pan/midgard: Fix vertex texturing on early Midgard 2787ec681f3Smrg- pan/midgard: Use texture, not textureLod, on early Midgard 2797ec681f3Smrg- pan/midgard: Disassemble with old pipeline always on T720 2807ec681f3Smrg- pan/midgard: Prioritize texture registers 2817ec681f3Smrg- pan/midgard: Expand 64-bit writemasks 2827ec681f3Smrg- pan/midgard: Implement i2i64 and u2u64 2837ec681f3Smrg- pan/midgard: Fix mir_round_bytemask_down for !32b 2847ec681f3Smrg- pan/midgard: Pack 64-bit swizzles 2857ec681f3Smrg- pan/midgard: Use generic constant packing for 8/64-bit 2867ec681f3Smrg- pan/midgard: Implement non-aligned UBOs 2877ec681f3Smrg- pan/midgard: Expose more typesize helpers 2887ec681f3Smrg- pan/midgard: Fix masks/alignment for 64-bit loads 2897ec681f3Smrg- pan/midgard: Represent ld/st offset unpacked 2907ec681f3Smrg- pan/midgard: Use shader stage in mir_op_computes_derivative 2917ec681f3Smrg- panfrost: Stub out clover callbacks 2927ec681f3Smrg- panfrost: Pass kernel inputs as uniforms 2937ec681f3Smrg- panfrost: Disable tiling for GLOBAL resources 2947ec681f3Smrg- panfrost: Set PIPE_COMPUTE_CAP_ADDRESS_BITS to 64 2957ec681f3Smrg- pan/midgard: Introduce quirks checks 2967ec681f3Smrg- panfrost: Add the lod_bias field 2977ec681f3Smrg- nir: Add load_sampler_lod_paramaters_pan intrinsic 2987ec681f3Smrg- pan/midgard: Implement load_sampler_lod_paramaters_pan 2997ec681f3Smrg- pan/midgard: Add LOD bias/clamp lowering 3007ec681f3Smrg- pan/midgard: Describe quirk MIDGARD_BROKEN_LOD 3017ec681f3Smrg- pan/midgard: Enable LOD lowering only on buggy chips 3027ec681f3Smrg- panfrost: Add lcra.c to Android.mk 3037ec681f3Smrg- pan/midgard: Use lower_tex_without_implicit_lod 3047ec681f3Smrg- panfrost: Add information about T720 tiling 3057ec681f3Smrg- panfrost: Implement pan_tiler for non-hierarchy GPUs 3067ec681f3Smrg- panfrost: Simplify draw_flags 3077ec681f3Smrg- pan/midgard: Splatter on fragment out 3087ec681f3Smrg- gitlab-ci: Remove non-default skips from Panfrost 3097ec681f3Smrg- panfrost: Remove blend shader hack 3107ec681f3Smrg- panfrost: Update SET_VALUE with information from igt 3117ec681f3Smrg- panfrost: Rename SET_VALUE to WRITE_VALUE 3127ec681f3Smrg- gallium/util: Support POLYGON in u_stream_outputs_for_vertices 3137ec681f3Smrg- pan/midgard: Move spilling code out of scheduler 3147ec681f3Smrg- pan/midgard: Split spill node selection/spilling 3157ec681f3Smrg- pan/midgard: Simplify spillability test 3167ec681f3Smrg- pan/midgard: Remove spill cost heuristic 3177ec681f3Smrg- pan/midgard: Move bounds checking into LCRA 3187ec681f3Smrg- pan/midgard: Remove consecutive_skip code 3197ec681f3Smrg- pan/midgard: Remove code marked "TODO: remove me" 3207ec681f3Smrg- pan/midgard: Dynamically allocate r26/27 for spills 3217ec681f3Smrg- pan/midgard: Use no_spill bitmask 3227ec681f3Smrg- pan/midgard: Don't use no_spill for memory spill src 3237ec681f3Smrg- pan/midgard: Force alignment for csel_v 3247ec681f3Smrg- pan/midgard: Don't try to free NULL in LCRA 3257ec681f3Smrg- pan/midgard: Simplify and fix vector copyprop 3267ec681f3Smrg- pan/midgard: Fix shift for TLS access 3277ec681f3Smrg- panfrost: Describe thread local storage sizing rules 3287ec681f3Smrg- panfrost: Rename unknown_address_0 -> scratchpad 3297ec681f3Smrg- panfrost: Split stack_shift nibble from unk0 3307ec681f3Smrg- panfrost: Add routines to calculate stack size/shift 3317ec681f3Smrg- panfrost: Factor out panfrost_query_raw 3327ec681f3Smrg- panfrost: Query core count and thread tls alloc 3337ec681f3Smrg- panfrost: Route stack_size from compiler 3347ec681f3Smrg- panfrost: Emit SFBD/MFBD after a batch, instead of before 3357ec681f3Smrg- panfrost: Handle minor cppcheck issues 3367ec681f3Smrg- pan/midgard: Remove unused ld/st packing hepers 3377ec681f3Smrg- pan/midgard: Handle misc. cppcheck warnings 3387ec681f3Smrg- panfrost: Calculate maximum stack_size per batch 3397ec681f3Smrg- panfrost: Pass size to panfrost_batch_get_scratchpad 3407ec681f3Smrg- pandecode: Add cast 3417ec681f3Smrg- panfrost: Move nir_undef_to_zero to Midgard compiler 3427ec681f3Smrg- panfrost: Move property queries to \_encoder 3437ec681f3Smrg- panfrost: Add panfrost_model_name helper 3447ec681f3Smrg- panfrost: Report GPU name in es2_info 3457ec681f3Smrg- ci: Remove T760/T860 from CI temporarily 3467ec681f3Smrg- panfrost: Pass blend RT number through 3477ec681f3Smrg- pan/midgard: Add schedule barrier after fragment writeout 3487ec681f3Smrg- pan/midgard: Writeout per render target 3497ec681f3Smrg- pan/midgard: Fix liveness analysis with multiple epilogues 3507ec681f3Smrg- pan/midgard: Set r1.w magic 3517ec681f3Smrg- panfrost: Fix FBD issue 3527ec681f3Smrg- ci: Reinstate Panfrost CI 3537ec681f3Smrg- panfrost: Remove fbd_type enum 3547ec681f3Smrg- panfrost: Pack invocation_shifts manually instead of a bit field 3557ec681f3Smrg- panfrost: Remove asserts in panfrost_pack_work_groups_compute 3567ec681f3Smrg- panfrost: Simplify sampler upload condition 3577ec681f3Smrg- panfrost: Don't double-create scratchpad 3587ec681f3Smrg- panfrost: Add PAN_MESA_DEBUG=precompile for shader-db 3597ec681f3Smrg- panfrost: Let precompile imply shaderdb 3607ec681f3Smrg- panfrost: Handle empty shaders 3617ec681f3Smrg- pan/midgard: Use a reg temporary for mutiple writes 3627ec681f3Smrg- pan/midgard: Hoist temporary coordinate for cubemaps 3637ec681f3Smrg- pan/midgard: Set .shadow for shadow samplers 3647ec681f3Smrg- pan/midgard: Set Z to shadow comparator for 2D 3657ec681f3Smrg- pan/midgard: Add uniform/work heuristic 3667ec681f3Smrg- pan/midgard: Implement textureOffset for 2D textures 3677ec681f3Smrg- pan/midgard: Fix crash with txs 3687ec681f3Smrg- pan/midgard: Lower txd with lower_tex 3697ec681f3Smrg- panfrost: Decode shader types in pantrace shader-db 3707ec681f3Smrg- pan/decode: Skip COMPUTE in blobber-db 3717ec681f3Smrg- pan/decode: Prefix blobberdb with MESA_SHADER\_\* 3727ec681f3Smrg- pan/decode: Append 0:0 spills:fills to blobber-db 3737ec681f3Smrg- pan/midgard: Fix disassembler cycle/quadword counting 3747ec681f3Smrg- pan/midgard: Bounds check lcra_restrict_range 3757ec681f3Smrg- pan/midgard: Extend IS_VEC4_ONLY to arguments 3767ec681f3Smrg- pan/midgard: Clamp LOD register swizzle 3777ec681f3Smrg- pan/midgard: Expand swizzle for texelFetch 3787ec681f3Smrg- pan/midgard: Fix fallthrough from offset to comparator 3797ec681f3Smrg- pan/midgard: Do witchcraft on texture offsets 3807ec681f3Smrg- pan/midgard: Generalize temp coordinate to non-2D 3817ec681f3Smrg- pan/midgard: Implement shadow cubemaps 3827ec681f3Smrg- pan/midgard: Enable lower_(un)pack\_\* lowering 3837ec681f3Smrg- pan/midgard: Support loads from R11G11B10 in a blend shader 3847ec681f3Smrg- pan/midgard: Add mir_upper_override helper 3857ec681f3Smrg- pan/midgard: Compute destination override 3867ec681f3Smrg- panfrost: Rename pan_instancing.c -> pan_attributes.c 3877ec681f3Smrg- panfrost: Factor batch/resource out of instancing routines 3887ec681f3Smrg- panfrost: Move instancing routines to encoder/ 3897ec681f3Smrg- panfrost: Factor out panfrost_compute_magic_divisor 3907ec681f3Smrg- panfrost: Fix off-by-one in pan_invocation.c 3917ec681f3Smrg- pan/decode: Fix reference computation for invocations 3927ec681f3Smrg- panfrost: Slight cleanup of Gallium's pan_attribute.c 3937ec681f3Smrg- panfrost: Remove pan_shift_odd 3947ec681f3Smrg- pan/decode: Handle gl_VertexID/gl_InstanceID 3957ec681f3Smrg- panfrost: Unset vertex_id_zero_based 3967ec681f3Smrg- pan/midgard: Factor out emit_attr_read 3977ec681f3Smrg- pan/midgard: Lower gl_VertexID/gl_InstanceID to attributes 3987ec681f3Smrg- panfrost: Extend attribute_count for vertex builtins 3997ec681f3Smrg- panfrost: Route gl_VertexID through cmdstream 4007ec681f3Smrg- pan/midgard: Fix minor typo 4017ec681f3Smrg- panfrost: Remove MALI_SPECIAL_ATTRIBUTE_BASE defines 4027ec681f3Smrg- panfrost: Update information on fixed attributes/varyings 4037ec681f3Smrg- panfrost: Remove MALI_ATTR_INTERNAL 4047ec681f3Smrg- panfrost: Inline away MALI_NEGATIVE 4057ec681f3Smrg- panfrost: Implement remaining texture wrap modes 4067ec681f3Smrg- panfrost: Add pan_attributes.c to Android.mk 4077ec681f3Smrg- panfrost: Add missing #include in common header 4087ec681f3Smrg- panfrost: Remove mali_alt_func 4097ec681f3Smrg- panfrost; Update comment about work/uniform_count 4107ec681f3Smrg- panfrost: Remove 32-bit next_job path 4117ec681f3Smrg- glsl: Set .flat for gl_FrontFacing 4127ec681f3Smrg- pan/midgard: Promote tilebuffer reads to 32-bit 4137ec681f3Smrg- pan/midgard: Use type-appropriate st_vary 4147ec681f3Smrg- pan/midgard: Implement flat shading 4157ec681f3Smrg- panfrost: Identify glProvokingVertex flag 4167ec681f3Smrg- panfrost: Disable some CAPs we want lowered 4177ec681f3Smrg- panfrost: Implement integer varyings 4187ec681f3Smrg- panfrost: Remove MRT indirection in blend shaders 4197ec681f3Smrg- panfrost: Respect glPointSize() 4207ec681f3Smrg- pan/midgard: Convert fragment writeout to proper branches 4217ec681f3Smrg- pan/midgard: Remove prepacked_branch 4227ec681f3Smrg- panfrost: Handle RGB16F colour clear 4237ec681f3Smrg- panfrost: Pack MRT blend shaders into a single BO 4247ec681f3Smrg- pan/midgard: Fix memory corruption in constant combining 4257ec681f3Smrg- pan/midgard: Use better heuristic for shader termination 4267ec681f3Smrg- pan/midgard: Generalize IS_ALU and quadword_size 4277ec681f3Smrg- pan/midgard: Generate MRT writeout loops 4287ec681f3Smrg- pan/midgard: Remove old comment 4297ec681f3Smrg- pan/midgard: Identity ld_color_buffer as 32-bit 4307ec681f3Smrg- pan/midgard: Use upper ALU tags for MFBD writeout 4317ec681f3Smrg- panfrost: Texture from Z32F_S8 as R32F 4327ec681f3Smrg- panfrost: Support rendering to non-zero Z/S layers 4337ec681f3Smrg- panfrost: Implement sRGB blend shaders 4347ec681f3Smrg- panfrost: Cleanup tiling selection logic 4357ec681f3Smrg- panfrost: Report MSAA 4x supported for dEQP 4367ec681f3Smrg- panfrost: Handle PIPE_FORMAT_R10G10B10A2_USCALED 4377ec681f3Smrg- panfrost: Respect constant buffer_offset 4387ec681f3Smrg- panfrost: Adjust for mismatch between hardware/Gallium in arrays/cube 4397ec681f3Smrg- pan/midgard: Account for z/w flip in texelFetch 4407ec681f3Smrg- panfrost: Don't double-flip Z/W for 2D arrays 4417ec681f3Smrg- pan/midgard: Support indirect UBO offsets 4427ec681f3Smrg- panfrost: Fix linear depth textures 4437ec681f3Smrg- pan/midgard: Bytemasks should round up, not round down 4447ec681f3Smrg- panfrost: Identify un/pack colour opcodes 4457ec681f3Smrg- pan/midgard: Fix recursive csel scheduling 4467ec681f3Smrg- panfrost: Expose some functionality with dEQP flag 4477ec681f3Smrg- panfrost: Compile tiling routines with -O3 4487ec681f3Smrg- panfrost,lima: De-Galliumize tiling routines 4497ec681f3Smrg- panfrost: Rework linear<--->tiled conversions 4507ec681f3Smrg- panfrost: Add pandecode entries for ASTC/ETC formats 4517ec681f3Smrg- panfrost: Fix crash in compute variant allocation 4527ec681f3Smrg- panfrost: Drop mysterious zero=0xFFFF field 4537ec681f3Smrg- panfrost: Don't use implicit mali_exception_status enum 4547ec681f3Smrg- pan/decode: Remove last_size 4557ec681f3Smrg- pan/midgard: Remove pack_color define 4567ec681f3Smrg- pan/decode: Remove SHORT_SLIDE indirection 4577ec681f3Smrg- panfrost: Fix 32-bit warning for \`indices\` 4587ec681f3Smrg- pan/decode: Drop MFBD compute shader stuff 4597ec681f3Smrg- pan/midgard: Record TEXTURE_OP_BARRIER 4607ec681f3Smrg- pan/midgard: Disassemble barrier instructions 4617ec681f3Smrg- pan/midgard: Validate barriers use a barrier tag 4627ec681f3Smrg- pan/midgard: Handle tag 0x4 as texture 4637ec681f3Smrg- pan/midgard: Remove float_bitcast 4647ec681f3Smrg- pan/midgard: Fix missing prefixes 4657ec681f3Smrg- pan/midgard: Don't crash with constants on unknown ops 4667ec681f3Smrg- pan/midgard: Use fprintf instead of printf for constants 4677ec681f3Smrg 4687ec681f3SmrgAndreas Baierl (14): 4697ec681f3Smrg 4707ec681f3Smrg- lima: Beautify stream dumps 4717ec681f3Smrg- lima: Parse VS and PLBU command stream while making a dump 4727ec681f3Smrg- lima/streamparser: Fix typo in vs semaphore parser 4737ec681f3Smrg- lima/streamparser: Add findings introduced with gl_PointSize 4747ec681f3Smrg- lima/parser: Some fixes and cleanups 4757ec681f3Smrg- lima/parser: Add RSW parsing 4767ec681f3Smrg- lima/parser: Add texture descriptor parser 4777ec681f3Smrg- lima: Rotate dump files after each finished pp frame 4787ec681f3Smrg- lima: Fix dump file creation 4797ec681f3Smrg- lima/parser: Fix rsw parser 4807ec681f3Smrg- lima/parser: Fix VS cmd stream parser 4817ec681f3Smrg- lima/parser: Make rsw alpha blend parsing more readable 4827ec681f3Smrg- lima: Add stencil support 4837ec681f3Smrg- lima: Fix alpha blending 4847ec681f3Smrg 4857ec681f3SmrgAndres Rodriguez (1): 4867ec681f3Smrg 4877ec681f3Smrg- vulkan/wsi: disable the hardware cursor 4887ec681f3Smrg 4897ec681f3SmrgAndrii Simiklit (5): 4907ec681f3Smrg 4917ec681f3Smrg- main: fix several 'may be used uninitialized' warnings 4927ec681f3Smrg- glsl: fix an incorrect max_array_access after optimization of 4937ec681f3Smrg ssbo/ubo 4947ec681f3Smrg- glsl: fix a binding points assignment for ssbo/ubo arrays 4957ec681f3Smrg- glsl/nir: do not change an element index to have correct block name 4967ec681f3Smrg- mesa/st: fix a memory leak in get_version 4977ec681f3Smrg 4987ec681f3SmrgAnthony Pesch (5): 4997ec681f3Smrg 5007ec681f3Smrg- util: import xxhash 5017ec681f3Smrg- util: move fnv1a hash implementation into its own header 5027ec681f3Smrg- util/hash_table: replace \_mesa_hash_data's fnv1a hash function with 5037ec681f3Smrg xxhash 5047ec681f3Smrg- util/hash_table: added hash functions for integer types 5057ec681f3Smrg- util/hash_table: update users to use new optimal integer hash 5067ec681f3Smrg functions 5077ec681f3Smrg 5087ec681f3SmrgAnuj Phogat (2): 5097ec681f3Smrg 5107ec681f3Smrg- intel: Add device info for 1x4x6 Jasper Lake 5117ec681f3Smrg- intel: Add pci-ids for Jasper Lake 5127ec681f3Smrg 5137ec681f3SmrgArno Messiaen (5): 5147ec681f3Smrg 5157ec681f3Smrg- lima: fix stride in texture descriptor 5167ec681f3Smrg- lima: add layer_stride field to lima_resource struct 5177ec681f3Smrg- lima: introduce ppir_op_load_coords_reg to differentiate between 5187ec681f3Smrg loading texture coordinates straight from a varying vs loading them 5197ec681f3Smrg from a register 5207ec681f3Smrg- lima: add cubemap support 5217ec681f3Smrg- lima/ppir: add lod-bias support 5227ec681f3Smrg 5237ec681f3SmrgBas Nieuwenhuizen (33): 5247ec681f3Smrg 5257ec681f3Smrg- radv: Fix timeout handling in syncobj wait. 5267ec681f3Smrg- radv: Remove \_mesa_locale_init/fini calls. 5277ec681f3Smrg- turnip: Remove \_mesa_locale_init/fini calls. 5287ec681f3Smrg- anv: Remove \_mesa_locale_init/fini calls. 5297ec681f3Smrg- radv: Fix disk_cache_get size argument. 5307ec681f3Smrg- radv: Close all unnecessary fds in secure compile. 5317ec681f3Smrg- radv: Do not change scratch settings while shaders are active. 5327ec681f3Smrg- radv: Allocate cmdbuffer space for buffer marker write. 5337ec681f3Smrg- radv: Enable VK_KHR_buffer_device_address. 5347ec681f3Smrg- amd/llvm: Refactor ac_build_scan. 5357ec681f3Smrg- radv: Unify max_descriptor_set_size. 5367ec681f3Smrg- radv: Fix timeline semaphore refcounting. 5377ec681f3Smrg- radv: Fix RGBX Android<->Vulkan format correspondence. 5387ec681f3Smrg- amd/common: Fix tcCompatible degradation on Stoney. 5397ec681f3Smrg- amd/common: Always use addrlib for HTILE tc-compat. 5407ec681f3Smrg- radv: Limit workgroup size to 1024. 5417ec681f3Smrg- radv: Expose all sample counts for integer formats as well. 5427ec681f3Smrg- amd/common: Handle alignment of 96-bit formats. 5437ec681f3Smrg- nir: Add clone/hash/serialize support for non-uniform tex 5447ec681f3Smrg instructions. 5457ec681f3Smrg- nir: print non-uniform tex fields. 5467ec681f3Smrg- amd/common: Always initialize gfx9 mipmap offset/pitch. 5477ec681f3Smrg- turnip: Use VK_NULL_HANDLE instead of NULL. 5487ec681f3Smrg- meson: Enable -Werror=int-conversion. 5497ec681f3Smrg- Revert "amd/common: Always initialize gfx9 mipmap offset/pitch." 5507ec681f3Smrg- radv: Only use the gfx mipmap level offset/pitch for linear textures. 5517ec681f3Smrg- spirv: Fix glsl type assert in spir2nir. 5527ec681f3Smrg- radv: Emit a BATCH_BREAK when changing pixel shaders or 5537ec681f3Smrg CB_TARGET_MASK. 5547ec681f3Smrg- radv: Use new scanout gfx9 metadata flag. 5557ec681f3Smrg- radv: Disable VK_EXT_sample_locations on GFX10. 5567ec681f3Smrg- radv: Remove syncobj_handle variable in header. 5577ec681f3Smrg- radv: Expose VK_KHR_swapchain_mutable_format. 5587ec681f3Smrg- radv: Allow DCC & TC-compat HTILE with 5597ec681f3Smrg VK_IMAGE_CREATE_EXTENDED_USAGE_BIT. 5607ec681f3Smrg- radv: Do not set SX DISABLE bits for RB+ with unused surfaces. 5617ec681f3Smrg 5627ec681f3SmrgBen Crocker (1): 5637ec681f3Smrg 5647ec681f3Smrg- llvmpipe: use ppc64le/ppc64 Large code model for JIT-compiled shaders 5657ec681f3Smrg 5667ec681f3SmrgBernd Kuhls (1): 5677ec681f3Smrg 5687ec681f3Smrg- util/os_socket: Include unistd.h to fix build error 5697ec681f3Smrg 5707ec681f3SmrgBoris Brezillon (21): 5717ec681f3Smrg 5727ec681f3Smrg- panfrost: MALI_DEPTH_TEST is actually MALI_DEPTH_WRITEMASK 5737ec681f3Smrg- panfrost: Destroy the upload manager allocated in 5747ec681f3Smrg panfrost_create_context() 5757ec681f3Smrg- panfrost: Release the ctx->pipe_framebuffer ref 5767ec681f3Smrg- panfrost: Move BO cache related fields to a sub-struct 5777ec681f3Smrg- panfrost: Try to evict unused BOs from the cache 5787ec681f3Smrg- gallium: Fix the ->set_damage_region() implementation 5797ec681f3Smrg- panfrost: Make sure we reset the damage region of RTs at flush time 5807ec681f3Smrg- panfrost: Remove unneeded phi nodes 5817ec681f3Smrg- panfrost/midgard: Fix swizzle for store instructions 5827ec681f3Smrg- panfrost/midgard: Print the actual source register for store 5837ec681f3Smrg operations 5847ec681f3Smrg- panfrost/midgard: Use a union to manipulate embedded constants 5857ec681f3Smrg- panfrost/midgard: Rework mir_adjust_constants() to make it type/size 5867ec681f3Smrg agnostic 5877ec681f3Smrg- panfrost/midgard: Make sure promote_fmov() only promotes 32-bit imovs 5887ec681f3Smrg- panfrost/midgard: Factorize f2f and u2u handling 5897ec681f3Smrg- panfrost/midgard: Add f2f64 support 5907ec681f3Smrg- panfrost/midgard: Fix mir_print_instruction() for branch instructions 5917ec681f3Smrg- panfrost/midgard: Add 64 bits float <-> int converters 5927ec681f3Smrg- panfrost/midgard: Add missing lowering passes for type/size 5937ec681f3Smrg conversion ops 5947ec681f3Smrg- panfrost/midgard: Add a condense_writemask() helper 5957ec681f3Smrg- panfrost/midgard: Prettify embedded constant prints 5967ec681f3Smrg- panfrost: Fix the damage box clamping logic 5977ec681f3Smrg 5987ec681f3SmrgBrian Ho (14): 5997ec681f3Smrg 6007ec681f3Smrg- turnip: Update tu_query_pool with turnip-specific fields 6017ec681f3Smrg- turnip: Implement vkCreateQueryPool for occlusion queries 6027ec681f3Smrg- turnip: Implement vkCmdBeginQuery for occlusion queries 6037ec681f3Smrg- turnip: Implement vkCmdEndQuery for occlusion queries 6047ec681f3Smrg- turnip: Update query availability on render pass end 6057ec681f3Smrg- turnip: Implement vkGetQueryPoolResults for occlusion queries 6067ec681f3Smrg- turnip: Implement vkCmdResetQueryPool 6077ec681f3Smrg- turnip: Implement vkCmdCopyQueryPoolResults for occlusion queries 6087ec681f3Smrg- anv: Properly fetch partial results in vkGetQueryPoolResults 6097ec681f3Smrg- anv: Handle unavailable queries in vkCmdCopyQueryPoolResults 6107ec681f3Smrg- turnip: Enable occlusionQueryPrecise 6117ec681f3Smrg- turnip: Free event->bo on vkDestroyEvent 6127ec681f3Smrg- turnip: Fix vkGetQueryPoolResults with available flag 6137ec681f3Smrg- turnip: Fix vkCmdCopyQueryPoolResults with available flag 6147ec681f3Smrg 6157ec681f3SmrgBrian Paul (4): 6167ec681f3Smrg 6177ec681f3Smrg- s/APIENTRY/GLAPIENTRY/ in teximage.c 6187ec681f3Smrg- nir: fix a couple signed/unsigned comparison warnings in 6197ec681f3Smrg nir_builder.h 6207ec681f3Smrg- Call shmget() with permission 0600 instead of 0777 6217ec681f3Smrg- nir: no-op C99 \_Pragma() with MSVC 6227ec681f3Smrg 6237ec681f3SmrgC Stout (1): 6247ec681f3Smrg 6257ec681f3Smrg- util/vector: Fix u_vector_foreach when head rolls over 6267ec681f3Smrg 6277ec681f3SmrgCaio Marcelo de Oliveira Filho (24): 6287ec681f3Smrg 6297ec681f3Smrg- spirv: Don't leak GS initialization to other stages 6307ec681f3Smrg- glsl: Check earlier for MaxShaderStorageBlocks and MaxUniformBlocks 6317ec681f3Smrg- glsl: Check earlier for MaxTextureImageUnits and MaxImageUniforms 6327ec681f3Smrg- anv: Initialize depth_bounds_test_enable when not explicitly set 6337ec681f3Smrg- spirv: Consider the sampled_image case in wa_glslang_179 workaround 6347ec681f3Smrg- intel/fs: Lower 64-bit MOVs after lower_load_payload() 6357ec681f3Smrg- intel/fs: Fix lowering of dword multiplication by 16-bit constant 6367ec681f3Smrg- intel/vec4: Fix lowering of multiplication by 16-bit constant 6377ec681f3Smrg- anv/gen12: Temporarily disable VK_KHR_buffer_device_address (and EXT) 6387ec681f3Smrg- spirv: Implement SPV_KHR_non_semantic_info 6397ec681f3Smrg- panfrost: Fix Makefile.sources 6407ec681f3Smrg- anv: Drop unused function parameter 6417ec681f3Smrg- anv: Ignore some CreateInfo structs when rasterization is disabled 6427ec681f3Smrg- intel/fs: Only use SLM fence in compute shaders 6437ec681f3Smrg- spirv: Drop EXT for PhysicalStorageBuffer symbols 6447ec681f3Smrg- spirv: Handle PhysicalStorageBuffer in memory barriers 6457ec681f3Smrg- nir: Add missing nir_var_mem_global to various passes 6467ec681f3Smrg- intel/fs: Add FS_OPCODE_SCHEDULING_FENCE 6477ec681f3Smrg- intel/fs: Add workgroup_size() helper 6487ec681f3Smrg- intel/fs: Don't emit fence for shared memory if only one thread is 6497ec681f3Smrg used 6507ec681f3Smrg- intel/fs: Don't emit control barrier if only one thread is used 6517ec681f3Smrg- anv: Always initialize target_stencil_layout 6527ec681f3Smrg- intel/compiler: Add names for SHADER_OPCODE_[IU]SUB_SAT 6537ec681f3Smrg- nir: Make nir_deref_path_init skip trivial casts 6547ec681f3Smrg 6557ec681f3SmrgChris Wilson (1): 6567ec681f3Smrg 6577ec681f3Smrg- egl: Mention if swrast is being forced 6587ec681f3Smrg 6597ec681f3SmrgChristian Gmeiner (24): 6607ec681f3Smrg 6617ec681f3Smrg- drm-shim: fix EOF case 6627ec681f3Smrg- etnaviv: rs: upsampling is not supported 6637ec681f3Smrg- etnaviv: add drm-shim 6647ec681f3Smrg- etnaviv: drop not used config_out function param 6657ec681f3Smrg- etnaviv: use a more self-explanatory param name 6667ec681f3Smrg- etnaviv: handle 8 byte block in tiling 6677ec681f3Smrg- etnaviv: add support for extended pe formats 6687ec681f3Smrg- etnaviv: fix integer vertex formats 6697ec681f3Smrg- etnaviv: use NORMALIZE_SIGN_EXTEND 6707ec681f3Smrg- etnaviv: fix R10G10B10A2 vertex format entries 6717ec681f3Smrg- etnaviv: handle integer case for GENERIC_ATTRIB_SCALE 6727ec681f3Smrg- etnaviv: remove dead code 6737ec681f3Smrg- etnaviv: remove not used etna_bits_ones(..) 6747ec681f3Smrg- etnaviv: drop compiled_rs_state forward declaration 6757ec681f3Smrg- etnaviv: update resource status after flushing 6767ec681f3Smrg- gallium: add PIPE_CAP_MAX_VERTEX_BUFFERS 6777ec681f3Smrg- etnaviv: check if MSAA is supported 6787ec681f3Smrg- etnaviv: gc400 does not support any vertex sampler 6797ec681f3Smrg- etnaviv: use a better name for FE_VERTEX_STREAM_UNK14680 6807ec681f3Smrg- etnaviv: move state based texture structs 6817ec681f3Smrg- etnaviv: move descriptor based texture structs 6827ec681f3Smrg- etnaviv: add deqp debug option 6837ec681f3Smrg- etnaviv: drop default state for PE_STENCIL_CONFIG_EXT2 6847ec681f3Smrg- etnaviv: drm-shim: add GC400 6857ec681f3Smrg 6867ec681f3SmrgConnor Abbott (19): 6877ec681f3Smrg 6887ec681f3Smrg- nir: Fix non-determinism in lower_global_vars_to_local 6897ec681f3Smrg- radv: Rename ac_arg_regfile 6907ec681f3Smrg- ac: Add a shared interface between radv, radeonsi, LLVM and ACO 6917ec681f3Smrg- ac/nir, radv, radeonsi: Switch to using ac_shader_args 6927ec681f3Smrg- radv: Move argument declaration out of nir_to_llvm 6937ec681f3Smrg- aco: Constify radv_nir_compiler_options in isel 6947ec681f3Smrg- aco: Use radv_shader_args in aco_compile_shader() 6957ec681f3Smrg- aco: Split vector arguments at the beginning 6967ec681f3Smrg- aco: Make num_workgroups and local_invocation_ids one argument each 6977ec681f3Smrg- radv: Replace supports_spill with explict_scratch_args 6987ec681f3Smrg- aco: Use common argument handling 6997ec681f3Smrg- aco: Make unused workgroup id's 0 7007ec681f3Smrg- nir: Maintain the algebraic automaton's state as we work. 7017ec681f3Smrg- a6xx: Add more CP packets 7027ec681f3Smrg- freedreno: Use new macros for CP_WAIT_REG_MEM and CP_WAIT_MEM_GTE 7037ec681f3Smrg- freedreno: Fix CP_MEM_TO_REG flag definitions 7047ec681f3Smrg- freedreno: Document CP_COND_REG_EXEC more 7057ec681f3Smrg- freedreno: Document CP_UNK_A6XX_55 7067ec681f3Smrg- freedreno: Document CP_INDIRECT_BUFFER_CHAIN 7077ec681f3Smrg 7087ec681f3SmrgDaniel Ogorchock (2): 7097ec681f3Smrg 7107ec681f3Smrg- panfrost: Fix panfrost_bo_access memory leak 7117ec681f3Smrg- panfrost: Fix headers and gpu_headers memory leak 7127ec681f3Smrg 7137ec681f3SmrgDaniel Schürmann (58): 7147ec681f3Smrg 7157ec681f3Smrg- aco: fix immediate offset for spills if scratch is used 7167ec681f3Smrg- aco: only use single-dword loads/stores for spilling 7177ec681f3Smrg- aco: fix accidential reordering of instructions when scheduling 7187ec681f3Smrg- aco: workaround Tonga/Iceland hardware bug 7197ec681f3Smrg- aco: fix invalid access on Pseudo_instructions 7207ec681f3Smrg- aco: preserve kill flag on moved operands during RA 7217ec681f3Smrg- aco: rematerialize s_movk instructions 7227ec681f3Smrg- aco: check if SALU instructions are predeceeded by exec when 7237ec681f3Smrg calculating WQM needs 7247ec681f3Smrg- aco: value number instructions using the execution mask 7257ec681f3Smrg- aco: use s_and_b64 exec to reduce uniform booleans to one bit 7267ec681f3Smrg- amd/llvm: Add Subgroup Scan functions for SI 7277ec681f3Smrg- radv: Enable Subgroup Arithmetic and Clustered for SI 7287ec681f3Smrg- aco: don't value-number instructions from within a loop with ones 7297ec681f3Smrg after the loop. 7307ec681f3Smrg- aco: don't split live-ranges of linear VGPRs 7317ec681f3Smrg- aco: fix a couple of value numbering issues 7327ec681f3Smrg- aco: refactor visit_store_fs_output() to use the Builder 7337ec681f3Smrg- aco: Initial GFX7 Support 7347ec681f3Smrg- aco: SI/CI - fix sampler aniso 7357ec681f3Smrg- aco: fix SMEM offsets for SI/CI 7367ec681f3Smrg- aco: implement nir_op_fquantize2f16 for SI/CI 7377ec681f3Smrg- aco: only use scalar loads for readonly buffers on SI/CI 7387ec681f3Smrg- aco: implement nir_op_isign on SI/CI 7397ec681f3Smrg- aco: move buffer_store data to VGPR if needed 7407ec681f3Smrg- aco: implement quad swizzles for SI/CI 7417ec681f3Smrg- aco: recognize SI/CI SMRD hazards 7427ec681f3Smrg- aco: fix disassembly of writelane instructions. 7437ec681f3Smrg- aco: split read/writelane opcode into VOP2/VOP3 version for SI/CI 7447ec681f3Smrg- aco: implement 64bit VGPR shifts for SI/CI 7457ec681f3Smrg- aco: make 1/2*PI a literal constant on SI/CI 7467ec681f3Smrg- aco: implement 64bit i2b for SI /CI 7477ec681f3Smrg- aco: implement 64bit ine/ieq for SI/CI 7487ec681f3Smrg- aco: disable disassembly for SI/CI due to lack of support by LLVM 7497ec681f3Smrg- radv: only flush scalar cache for SSBO writes with ACO on GFX8+ 7507ec681f3Smrg- aco: flush denorms after fmin/fmax on pre-GFX9 7517ec681f3Smrg- aco: don't use a scalar temporary for reductions on GFX10 7527ec681f3Smrg- aco: implement (clustered) reductions for SI/CI 7537ec681f3Smrg- aco: implement inclusive_scan for SI/CI 7547ec681f3Smrg- aco: implement exclusive scan for SI/CI 7557ec681f3Smrg- radv: disable Youngblood app profile if ACO is used 7567ec681f3Smrg- aco: return to loop_active mask at continue_or_break blocks 7577ec681f3Smrg- radv: Enable ACO on GFX7 (Sea Islands) 7587ec681f3Smrg- aco: use soffset for MUBUF instructions on SI/CI 7597ec681f3Smrg- aco: improve readfirstlane after uniform ssbo loads on GFX7 7607ec681f3Smrg- aco: propagate temporaries into expanded vectors 7617ec681f3Smrg- nir: fix printing of var_decl with more than 4 components. 7627ec681f3Smrg- aco: compact various Instruction classes 7637ec681f3Smrg- aco: compact aco::span<T> to use uint16_t offset and size instead of 7647ec681f3Smrg pointer and size_t. 7657ec681f3Smrg- aco: fix unconditional demote_to_helper 7667ec681f3Smrg- aco: rework lower_to_cssa() 7677ec681f3Smrg- aco: handle phi affinities transitively through parallelcopies 7687ec681f3Smrg- aco: ignore parallelcopies to the same register on jump threading 7697ec681f3Smrg- aco: fix combine_salu_not_bitwise() when SCC is used 7707ec681f3Smrg- aco: reorder VMEM operands in ACO IR 7717ec681f3Smrg- aco: fix register allocation with multiple live-range splits 7727ec681f3Smrg- aco: simplify adjust_sample_index_using_fmask() & get_image_coords() 7737ec681f3Smrg- aco: simplify gathering of MIMG address components 7747ec681f3Smrg- docs: add new features for RADV/ACO. 7757ec681f3Smrg- aco: fix image_atomic_cmp_swap 7767ec681f3Smrg 7777ec681f3SmrgDaniel Stone (2): 7787ec681f3Smrg 7797ec681f3Smrg- Revert "st/dri: do FLUSH_VERTICES before calling flush_resource" 7807ec681f3Smrg- Revert "gallium: add st_context_iface::flush_resource to call 7817ec681f3Smrg FLUSH_VERTICES" 7827ec681f3Smrg 7837ec681f3SmrgDanylo Piliaiev (12): 7847ec681f3Smrg 7857ec681f3Smrg- intel/blorp: Fix usage of uninitialized memory in key hashing 7867ec681f3Smrg- i965/program_cache: Lift restriction on shader key size 7877ec681f3Smrg- intel/blorp: Fix usage of uninitialized memory in key hashing 7887ec681f3Smrg- intel/fs: Do not lower large local arrays to scratch on gen7 7897ec681f3Smrg- i965: Unify CC_STATE and BLEND_STATE atoms on Haswell as a workaround 7907ec681f3Smrg- glsl: Add varyings to "zero-init of uninitialized vars" workaround 7917ec681f3Smrg- drirc: Add glsl_zero_init workaround for GpuTest 7927ec681f3Smrg- iris/query: Implement PIPE_QUERY_GPU_FINISHED 7937ec681f3Smrg- iris: Fix value of out-of-bounds accesses for vertex attributes 7947ec681f3Smrg- i965: Do not set front_buffer_dirty if there is no front buffer 7957ec681f3Smrg- st/mesa: Handle the rest renderbuffer formats from OSMesa 7967ec681f3Smrg- st/nir: Unify inputs_read/outputs_written before serializing NIR 7977ec681f3Smrg 7987ec681f3SmrgDave Airlie (74): 7997ec681f3Smrg 8007ec681f3Smrg- nir/serialize: pack function has name and entry point into flags. 8017ec681f3Smrg- nir/serialize: fix serializing functions with no implementations. 8027ec681f3Smrg- spirv: don't store 0 to cs.ptr_size for non kernel stages. 8037ec681f3Smrg- spirv: get the correct type for function returns. 8047ec681f3Smrg- spirv/nir/opencl: handle some multiply instructions. 8057ec681f3Smrg- nir: add 64-bit ufind_msb lowering support. (v2) 8067ec681f3Smrg- nouveau: request ufind_msb64 lowering in the frontend. 8077ec681f3Smrg- vtn/opencl: add clz support 8087ec681f3Smrg- nir: fix deref offset builder 8097ec681f3Smrg- llvmpipe: initial query buffer object support. (v2) 8107ec681f3Smrg- docs: add llvmpipe to ARB_query_buffer_object. 8117ec681f3Smrg- gallivm: split out the flow control ir to a common file. 8127ec681f3Smrg- gallivm: nir->tgsi info convertor (v2) 8137ec681f3Smrg- gallivm: add popcount intrinsic wrapper 8147ec681f3Smrg- gallivm: add cttz wrapper 8157ec681f3Smrg- gallivm: add selection for non-32 bit types 8167ec681f3Smrg- gallivm: add nir->llvm translation (v2) 8177ec681f3Smrg- draw: add nir info gathering and building support 8187ec681f3Smrg- gallium: add nir lowering passes for the draw pipe stages. (v2) 8197ec681f3Smrg- gallivm: add swizzle support where one channel isn't defined. 8207ec681f3Smrg- llvmpipe: add initial nir support 8217ec681f3Smrg- nir/samplers: don't zero samplers_used/txf. 8227ec681f3Smrg- llvmpipe/images: handle undefined atomic without crashing 8237ec681f3Smrg- gallivm/llvmpipe: add support for front facing in sysval. 8247ec681f3Smrg- llvmpipe: enable texcoord semantics 8257ec681f3Smrg- gallium/scons: fix graw-xlib build on OSX. 8267ec681f3Smrg- llvmpipe: add queries disabled flag 8277ec681f3Smrg- llvmpipe: disable occlusion queries when requested by state tracker 8287ec681f3Smrg- draw: add support for collecting primitives generated outside 8297ec681f3Smrg streamout 8307ec681f3Smrg- llvmpipe: enable support for primitives generated outside streamout 8317ec681f3Smrg- aco: handle gfx7 int8/10 clamping on exports 8327ec681f3Smrg- gallivm: add bitfield reverse and ufind_msb 8337ec681f3Smrg- llvmpipe/nir: handle texcoord requirements 8347ec681f3Smrg- gallivm: fix transpose for when first channel isn't created 8357ec681f3Smrg- gallivm: fix perspective enable if usage_mask doesn't have 0 bit set 8367ec681f3Smrg- gallivm/nir: cleanup code and call cmp wrapper 8377ec681f3Smrg- gallivm/nir: copy compare ordering code from tgsi 8387ec681f3Smrg- gallivm: add base instance sysval support 8397ec681f3Smrg- gallivm/draw: add support for draw_id system value. 8407ec681f3Smrg- gallivm: fixup base_vertex support 8417ec681f3Smrg- llvmpipe: enable ARB_shader_draw_parameters. 8427ec681f3Smrg- vtn: convert vload/store to single value loops 8437ec681f3Smrg- vtn/opencl: add shuffle/shuffle support 8447ec681f3Smrg- gallivm/nir: wrap idiv to avoid divide by 0 (v2) 8457ec681f3Smrg- llvmpipe: switch to NIR by default 8467ec681f3Smrg- nir: sanitize work group intrinsics to always be 32-bit. 8477ec681f3Smrg- gallivm: add 64-bit const int creator. 8487ec681f3Smrg- llvmpipe/gallivm: add kernel inputs 8497ec681f3Smrg- gallivm: add support for 8-bit/16-bit integer builders 8507ec681f3Smrg- gallivm: pick integer builders for alu instructions. 8517ec681f3Smrg- gallivm/nir: allow 8/16-bit conversion and comparison. 8527ec681f3Smrg- tgsi/mesa: handle KERNEL case 8537ec681f3Smrg- gallivm/llvmpipe: add support for work dimension intrinsic. 8547ec681f3Smrg- gallivm/llvmpipe: add support for block size intrinsic 8557ec681f3Smrg- gallivm/llvmpipe: add support for global operations. 8567ec681f3Smrg- llvmpipe: handle serialized nir as a shader type. 8577ec681f3Smrg- llvmpipe: add support for compute shader params 8587ec681f3Smrg- llvmpipe/nir: use nir_max_vec_components in more places 8597ec681f3Smrg- gallivm: handle non-32 bit undefined 8607ec681f3Smrg- llvmpipe: lower hadd/add_sat 8617ec681f3Smrg- gallivm/nir: lower packing 8627ec681f3Smrg- gallivm/nir: add vec8/16 support 8637ec681f3Smrg- llvmpipe: add debug option to enable OpenCL support. 8647ec681f3Smrg- gallivm: fixup const int64 builder. 8657ec681f3Smrg- llvmpipe: enable ARB_shader_group_vote. 8667ec681f3Smrg- gallium/util: add multi_draw_indirect to util_draw_indirect. 8677ec681f3Smrg- llvmpipe: enable driver side multi draw indirect 8687ec681f3Smrg- llvmpipe: add support for ARB_indirect_parameters. 8697ec681f3Smrg- llvmpipe: add ARB_derivative_control support 8707ec681f3Smrg- gallivm: fix gather component handling. 8717ec681f3Smrg- llvmpipe: fix some integer instruction lowering. 8727ec681f3Smrg- galllivm: fix gather offset casting 8737ec681f3Smrg- gallivm: fix find lsb 8747ec681f3Smrg- gallivm/nir: add missing break for isub. 8757ec681f3Smrg 8767ec681f3SmrgDavid Heidelberg (1): 8777ec681f3Smrg 8787ec681f3Smrg- .mailmap: use correct email address 8797ec681f3Smrg 8807ec681f3SmrgDavid Stevens (1): 8817ec681f3Smrg 8827ec681f3Smrg- virgl: support emulating planar image sampling 8837ec681f3Smrg 8847ec681f3SmrgDenis Pauk (2): 8857ec681f3Smrg 8867ec681f3Smrg- gallium/swr: Enable support bptc format. 8877ec681f3Smrg- docs/features: mark GL_ARB_texture_compression_bptc as done for 8887ec681f3Smrg llvmpipe, softpipe, swr 8897ec681f3Smrg 8907ec681f3SmrgDongwon Kim (3): 8917ec681f3Smrg 8927ec681f3Smrg- gallium: enable INTEL_PERFORMANCE_QUERY 8937ec681f3Smrg- iris: INTEL performance query implementation 8947ec681f3Smrg- gallium: check all planes' pipe formats in case of multi-samplers 8957ec681f3Smrg 8967ec681f3SmrgDrew Davenport (1): 8977ec681f3Smrg 8987ec681f3Smrg- radeonsi: Clear uninitialized variable 8997ec681f3Smrg 9007ec681f3SmrgDrew DeVault (1): 9017ec681f3Smrg 9027ec681f3Smrg- st_get_external_sampler_key: improve error message 9037ec681f3Smrg 9047ec681f3SmrgDuncan Hopkins (1): 9057ec681f3Smrg 9067ec681f3Smrg- zink: make sure src image is transfer-src-optimal 9077ec681f3Smrg 9087ec681f3SmrgDylan Baker (69): 9097ec681f3Smrg 9107ec681f3Smrg- Bump VERSION to 20.0.0-devel 9117ec681f3Smrg- docs/new_features: Empty the feature list for the 20.0 cycle 9127ec681f3Smrg- nir: correct use of identity check in python 9137ec681f3Smrg- r200: use preprocessor for big vs little endian checks 9147ec681f3Smrg- r100: Use preprocessor to select big vs little endian paths 9157ec681f3Smrg- dri/osmesa: use preprocessor for selecting endian code paths 9167ec681f3Smrg- util/u_endian: Use \_WIN32 instead of \_MSC_VER 9177ec681f3Smrg- util/u_endian: set PIPE_ARCH_*_ENDIAN to 1 9187ec681f3Smrg- mesa/main: replace uses of \_mesa_little_endian with preprocessor 9197ec681f3Smrg- mesa/swrast: replace instances of \_mesa_little_endian with 9207ec681f3Smrg preprocessor 9217ec681f3Smrg- mesa/main: delete now unused \_mesa_little_endian 9227ec681f3Smrg- gallium/osmesa: Use PIPE_ARCH_*_ENDIAN instead of little_endian 9237ec681f3Smrg function 9247ec681f3Smrg- util: rename PIPE_ARCH_*_ENDIAN to UTIL_ARCH_*_ENDIAN 9257ec681f3Smrg- util/u_endian: Add error checks 9267ec681f3Smrg- meson: Add dep_glvnd to egl deps when building with glvnd 9277ec681f3Smrg- docs: add release notes for 19.2.3 9287ec681f3Smrg- docs: add sha256 sum to 19.2.3 release notes 9297ec681f3Smrg- docs: update calendar, add news item and link release notes for 9307ec681f3Smrg 19.2.2 9317ec681f3Smrg- meson: gtest needs pthreads 9327ec681f3Smrg- gallium/osmesa: Convert osmesa test to gtest 9337ec681f3Smrg- osmesa/tests: Extend render test to cover other working cases 9347ec681f3Smrg- util: Use ZSTD for shader cache if possible 9357ec681f3Smrg- docs: Add release notes for 19.2.4 9367ec681f3Smrg- docs: Add SHA256 sum for for 19.2.4 9377ec681f3Smrg- docs: update calendar, add news item and link release notes for 9387ec681f3Smrg 19.2.4 9397ec681f3Smrg- docs: Add relnotes for 19.2.5 9407ec681f3Smrg- docs/relnotes/19.2.5: Add SHA256 sum 9417ec681f3Smrg- docs: update calendar, add news item and link release notes for 9427ec681f3Smrg 19.2.5 9437ec681f3Smrg- docs/release-calendar: Update for extended 19.3 rc period 9447ec681f3Smrg- docs: Add release notes for 19.2.6 9457ec681f3Smrg- docs: Add SHA256 sum for 19.2.6 9467ec681f3Smrg- docs: update calendar, add news item and link release notes for 9477ec681f3Smrg 19.2.6 9487ec681f3Smrg- gallium/auxiliary: Fix uses of gnu struct = {} extension 9497ec681f3Smrg- meson: Add -Werror=gnu-empty-initializer to MSVC compat args 9507ec681f3Smrg- docs: Add release notes for 19.2.7 9517ec681f3Smrg- docs: Add SHA256 sums for 19.2.7 9527ec681f3Smrg- docs: update calendar, add news item and link release notes for 9537ec681f3Smrg 19.2.7 9547ec681f3Smrg- docs: Update mesa 19.3 release calendar 9557ec681f3Smrg- meson/broadcom: libbroadcom_cle needs expat headers 9567ec681f3Smrg- meson/broadcom: libbroadcom_cle also needs zlib 9577ec681f3Smrg- docs: add release notes for 19.3.0 9587ec681f3Smrg- docs/19.3.0: Add SHA256 sums 9597ec681f3Smrg- docs: Update release notes, index, and calendar for 19.3.0 9607ec681f3Smrg- dcos: add releanse notes for 19.3.1 9617ec681f3Smrg- docs: Add release notes, update calendar, and add news for 19.3.1 9627ec681f3Smrg- docs: add relnotes for 19.2.8 9637ec681f3Smrg- docs/relnotes/19.2.8: Add SHA256 sum 9647ec681f3Smrg- docs: Add release notes, news, and update calendar for 19.2.8 9657ec681f3Smrg- docs: Add release notes for 19.3.2 9667ec681f3Smrg- docs: add SHA256 sums for 19.3.2 9677ec681f3Smrg- docs: Add release notes for 19.3.2, update calendar and home page 9687ec681f3Smrg- docs: Update release calendar for 20.0 9697ec681f3Smrg- docs: Add relnotes for 19.3.3 release 9707ec681f3Smrg- docs: Add SHA 256 sums for 19.3.3 9717ec681f3Smrg- docs: update news, calendar, and link release notes for 19.3.3 9727ec681f3Smrg- VERSION: bump to 20.0.0-rc1 9737ec681f3Smrg- bin/pick-ui: Add a new maintainer script for picking patches 9747ec681f3Smrg- .pick_status.json: Update to 0d14f41625fa00187f690f283c1eb6a22e354a71 9757ec681f3Smrg- .pick_status.json: Update to b550b7ef3b8d12f533b67b1a03159a127a3ff34a 9767ec681f3Smrg- .pick_status.json: Update to 9afdcd64f2c96f3fcc1a28912987f2e8066aa995 9777ec681f3Smrg- .pick_status.json: Update to 7eaf21cb6f67adbe0e79b80b4feb8c816a98a720 9787ec681f3Smrg- VERSION: bump to 20.0-rc2 9797ec681f3Smrg- .pick_status.json: Update to d8bae10bfe0f487dcaec721743cd51441bcc12f5 9807ec681f3Smrg- .pick_status.json: Update to 689817c9dfde9a0852f2b2489cb0fa93ffbcb215 9817ec681f3Smrg- .pick_status.json: Update to 23037627359e739c42b194dec54875aefbb9d00b 9827ec681f3Smrg- VERSION: bump for 20.0.0-rc3 9837ec681f3Smrg- .pick_status.json: Update to 2a98cf3b2ecea43cea148df7f77d2abadfd1c9db 9847ec681f3Smrg- .pick_status.json: Update to 946eacbafb47c8b94d47e7c9d2a8b02fff5a22fa 9857ec681f3Smrg- .pick_status.json: Update to bee5c9b0dc13dbae0ccf124124eaccebf7f2a435 9867ec681f3Smrg 9877ec681f3SmrgEduardo Lima Mitev (2): 9887ec681f3Smrg 9897ec681f3Smrg- turnip: Remove failed command buffer from pool 9907ec681f3Smrg- turnip: Fix issues in tu_compute_pipeline_create() that may lead to 9917ec681f3Smrg crash 9927ec681f3Smrg 9937ec681f3SmrgElie Tournier (4): 9947ec681f3Smrg 9957ec681f3Smrg- Docs: remove duplicate meson docs for windows 9967ec681f3Smrg- docs: fix ascii html representation 9977ec681f3Smrg- nir/algebraic: i2f(f2i()) -> trunc() 9987ec681f3Smrg- nir/algebraic: sqrt(x)*sqrt(x) -> fabs(x) 9997ec681f3Smrg 10007ec681f3SmrgEmmanuel Gil Peyrot (1): 10017ec681f3Smrg 10027ec681f3Smrg- intel/compiler: Return early if read() failed 10037ec681f3Smrg 10047ec681f3SmrgEric Anholt (102): 10057ec681f3Smrg 10067ec681f3Smrg- ci: Make lava inherit the ccache setup of the .build script. 10077ec681f3Smrg- ci: Switch over to an autoscaling GKE cluster for builds. 10087ec681f3Smrg- Revert "ci: Switch over to an autoscaling GKE cluster for builds." 10097ec681f3Smrg- mesa/st: Add mapping of MESA_FORMAT_RGB_SNORM16 to gallium. 10107ec681f3Smrg- gallium: Add defines for FXT1 texture compression. 10117ec681f3Smrg- gallium: Add some more channel orderings of packed formats. 10127ec681f3Smrg- gallium: Add an equivalent of MESA_FORMAT_BGR_UNORM8. 10137ec681f3Smrg- gallium: Add equivalents of packed MESA_FORMAT_*UINT formats. 10147ec681f3Smrg- mesa: Stop defining a full separate format for RGBA_UINT8. 10157ec681f3Smrg- mesa/st: Test round-tripping of all compressed formats. 10167ec681f3Smrg- mesa: Prepare for the MESA_FORMAT\_\* enum to be sparse. 10177ec681f3Smrg- mesa: Redefine MESA_FORMAT\_\* in terms of PIPE_FORMAT_*. 10187ec681f3Smrg- mesa/st: Gut most of st_mesa_format_to_pipe_format(). 10197ec681f3Smrg- mesa/st: Make st_pipe_format_to_mesa_format an effective no-op. 10207ec681f3Smrg- u_format: Fix swizzle of A1R5G5B5. 10217ec681f3Smrg- ci: Use several debian buster packages instead of hand-building. 10227ec681f3Smrg- ci: Make the skip list regexes match the full test name. 10237ec681f3Smrg- ci: Use cts_runner for our dEQP runs. 10247ec681f3Smrg- ci: Enable all of GLES3/3.1 testing for softpipe. 10257ec681f3Smrg- ci: Remove old commented copy of freedreno artifacts. 10267ec681f3Smrg- ci: Disable flappy blit tests on a630. 10277ec681f3Smrg- ci: Expand the freedreno blit skip regex to cover more cases. 10287ec681f3Smrg- util: Move gallium's PIPE_FORMAT utils to /util/format/ 10297ec681f3Smrg- mesa: Move compile of common Mesa core files to a static lib. 10307ec681f3Smrg- mesa/st: Simplify st_choose_matching_format(). 10317ec681f3Smrg- mesa: Don't put sRGB formats in the array format table. 10327ec681f3Smrg- mesa/st: Reuse st_choose_matching_format from st_choose_format(). 10337ec681f3Smrg- util: Add a mapping from VkFormat to PIPE_FORMAT. 10347ec681f3Smrg- turnip: Drop the copy of the formats table. 10357ec681f3Smrg- ci: Move freedreno's parallelism to the runner instead of gitlab-ci 10367ec681f3Smrg jobs. 10377ec681f3Smrg- ci: Use a tag from the parallel-deqp-runner repo. 10387ec681f3Smrg- nir: Add a scheduler pass to reduce maximum register pressure. 10397ec681f3Smrg- nir: Refactor algebraic's block walk 10407ec681f3Smrg- nir: Make algebraic backtrack and reprocess after a replacement. 10417ec681f3Smrg- freedreno: Introduce a fd_resource_layer_stride() helper. 10427ec681f3Smrg- freedreno: Introduce a fd_resource_tile_mode() helper. 10437ec681f3Smrg- freedreno: Introduce a resource layout header. 10447ec681f3Smrg- freedreno: Convert the slice struct to the new resource header. 10457ec681f3Smrg- freedreno/a6xx: Log the tiling mode in resource layout debug. 10467ec681f3Smrg- turnip: Disable timestamp queries for now. 10477ec681f3Smrg- turnip: Fix unused variable warnings. 10487ec681f3Smrg- turnip: Drop redefinition of VALIDREG now that it's in ir3.h. 10497ec681f3Smrg- turnip: Reuse tu6_stage2opcode() more. 10507ec681f3Smrg- turnip: Add basic SSBO support. 10517ec681f3Smrg- turnip: Refactor the graphics pipeline create implementation. 10527ec681f3Smrg- turnip: Add a helper function for getting tu_buffer iovas. 10537ec681f3Smrg- turnip: Sanity check that we're adding valid BOs to the list. 10547ec681f3Smrg- turnip: Move pipeline BO list adding to BindPipeline. 10557ec681f3Smrg- turnip: Add support for compute shaders. 10567ec681f3Smrg- ci: Disable egl_ext_device_drm tests in piglit. 10577ec681f3Smrg- freedreno: Enable texture upload memory throttling. 10587ec681f3Smrg- freedreno: Stop forcing ALLOW_MAPPED_BUFFERS_DURING_EXEC off. 10597ec681f3Smrg- freedreno: Track the set of UBOs to be uploaded in UBO analysis. 10607ec681f3Smrg- freedreno: Drop the extra offset field for mipmap slices. 10617ec681f3Smrg- freedreno: Refactor the UBWC flags registers emission. 10627ec681f3Smrg- freedreno: Move UBWC layout into a slices array like the non-UBWC 10637ec681f3Smrg slices. 10647ec681f3Smrg- tu: Move our image layout into a freedreno_layout struct. 10657ec681f3Smrg- freedreno: Move a6xx's setup_slices() to a shareable helper function. 10667ec681f3Smrg- freedreno: Switch the 16-bit workaround to match what turnip does. 10677ec681f3Smrg- tu: Move UBWC layout into fdl6_layout() and use that function. 10687ec681f3Smrg- turnip: Lower usub_borrow. 10697ec681f3Smrg- turnip: Drop unused variable. 10707ec681f3Smrg- turnip: Add support for descriptor arrays. 10717ec681f3Smrg- turnip: Fix support for immutable samplers. 10727ec681f3Smrg- ci: Fix caselist results archiving after parallel-deqp-runner rename. 10737ec681f3Smrg- mesa: Fix detection of invalidating both depth and stencil. 10747ec681f3Smrg- mesa/st: Deduplicate the NIR uniform lowering code. 10757ec681f3Smrg- mesa/st: Move the vec4 type size function into core GLSL types. 10767ec681f3Smrg- mesa/prog: Reuse count_vec4_slots() from ir_to_mesa. 10777ec681f3Smrg- mesa/st: Move the dword slot counting function to glsl_types as well. 10787ec681f3Smrg- i965: Reuse the new core glsl_count_dword_slots(). 10797ec681f3Smrg- nir: Fix printing of ~0 .locations. 10807ec681f3Smrg- turnip: Refactor linkage state setup. 10817ec681f3Smrg- mesa: Make atomic lowering put atomics above SSBOs. 10827ec681f3Smrg- gallium: Pack the atomic counters just above the SSBOs. 10837ec681f3Smrg- nir: Drop the ssbo_offset to atomic lowering. 10847ec681f3Smrg- compiler: Add a note about how num_ssbos works in the program info. 10857ec681f3Smrg- freedreno: Stop scattered remapping of SSBOs/images to IBOs. 10867ec681f3Smrg- radeonsi: Remove a bunch of default handling of pipe caps. 10877ec681f3Smrg- r600: Remove a bunch of default handling of pipe caps. 10887ec681f3Smrg- r300: Remove a bunch of default handling of pipe caps. 10897ec681f3Smrg- radeonsi: Drop PIPE_CAP_TGSI_ANY_REG_AS_ADDRESS. 10907ec681f3Smrg- turnip: Fix some whitespace around binary operators. 10917ec681f3Smrg- turnip: Refactor the intrinsic lowering. 10927ec681f3Smrg- turnip: Add limited support for storage images. 10937ec681f3Smrg- turnip: Disable UBWC on images used as storage images. 10947ec681f3Smrg- turnip: Add support for non-zero (still constant) UBO buffer indices. 10957ec681f3Smrg- turnip: Add support for uniform texel buffers. 10967ec681f3Smrg- freedreno/ir3: Plumb the ir3_shader_variant into legalize. 10977ec681f3Smrg- turnip: Add support for fine derivatives. 10987ec681f3Smrg- turnip: Fix execution of secondary cmd bufs with nothing in primary. 10997ec681f3Smrg- freedreno: Add some missing a6xx address declarations. 11007ec681f3Smrg- freedreno: Fix OUT_REG() on address regs without a .bo supplied. 11017ec681f3Smrg- turnip: Port krh's packing macros from freedreno to tu. 11027ec681f3Smrg- turnip: Convert renderpass setup to the new register packing macros. 11037ec681f3Smrg- turnip: Convert the rest of tu_cmd_buffer.c over to the new pack 11047ec681f3Smrg macros. 11057ec681f3Smrg- vulkan/wsi: Fix compiler warning when no WSI platforms are enabled. 11067ec681f3Smrg- iris: Silence warning about AUX_USAGE_MC. 11077ec681f3Smrg- mesa/st: Fix compiler warnings from INTEL_shader_integer_functions. 11087ec681f3Smrg- ci: Enable -Werror on the meson-i386 build. 11097ec681f3Smrg- tu: Fix binning address setup after pack macros change. 11107ec681f3Smrg- Revert "gallium: Fix big-endian addressing of non-bitmask array 11117ec681f3Smrg formats." 11127ec681f3Smrg 11137ec681f3SmrgEric Engestrom (58): 11147ec681f3Smrg 11157ec681f3Smrg- meson: split out idep_xmlconfig_headers from idep_xmlconfig 11167ec681f3Smrg- anv: add missing xmlconfig headers dependency 11177ec681f3Smrg- radv: drop unnecessary xmlpool_options_h 11187ec681f3Smrg- pipe-loader: drop unnecessary xmlpool_options_h 11197ec681f3Smrg- loader: replace xmlpool_options_h with idep_xmlconfig_headers 11207ec681f3Smrg- targets/omx: replace xmlpool_options_h with idep_xmlconfig_headers 11217ec681f3Smrg- targets/va: replace xmlpool_options_h with idep_xmlconfig_headers 11227ec681f3Smrg- targets/vdpau: replace xmlpool_options_h with idep_xmlconfig_headers 11237ec681f3Smrg- targets/xa: replace xmlpool_options_h with idep_xmlconfig_headers 11247ec681f3Smrg- targets/xvmc: replace xmlpool_options_h with idep_xmlconfig_headers 11257ec681f3Smrg- dri: replace xmlpool_options_h with idep_xmlconfig_headers 11267ec681f3Smrg- i915: replace xmlpool_options_h with idep_xmlconfig_headers 11277ec681f3Smrg- nouveau: replace xmlpool_options_h with idep_xmlconfig_headers 11287ec681f3Smrg- r200: replace xmlpool_options_h with idep_xmlconfig_headers 11297ec681f3Smrg- radeon: replace xmlpool_options_h with idep_xmlconfig_headers 11307ec681f3Smrg- meson: move idep_xmlconfig_headers to xmlpool/ 11317ec681f3Smrg- gitlab-ci: build a recent enough version of GLVND (ie. 1.2.0) 11327ec681f3Smrg- meson: require glvnd 1.2.0 11337ec681f3Smrg- meson: revert glvnd workaround 11347ec681f3Smrg- meson: add variable to control the symbols checks 11357ec681f3Smrg- meson: move the generic symbols check arguments to a common variable 11367ec681f3Smrg- meson: add windows support to symbols checks 11377ec681f3Smrg- meson: require \`nm\` again on Unix systems 11387ec681f3Smrg- mesa/imports: let the build system detect strtok_r() 11397ec681f3Smrg- egl: fix \_EGL_NATIVE_PLATFORM fallback 11407ec681f3Smrg- egl: move #include of local headers out of Khronos headers 11417ec681f3Smrg- gitlab-ci: build libdrm using meson instead of autotools 11427ec681f3Smrg- gitlab-ci: auto-cancel CI runs when a newer commit is pushed to the 11437ec681f3Smrg same branch 11447ec681f3Smrg- CL: sync C headers with Khronos 11457ec681f3Smrg- CL: sync C++ headers with Khronos 11467ec681f3Smrg- vulkan: delete typo'd header 11477ec681f3Smrg- egl: use EGL_CAST() macro in eglmesaext.h 11487ec681f3Smrg- anv: add missing "fall-through" annotation 11497ec681f3Smrg- vk_util: drop duplicate formats in vk_format_map[] 11507ec681f3Smrg- meson: drop duplicate \`lib\` prefix on libiris_gen\* 11517ec681f3Smrg- meson: drop \`intel_\` prefix on imgui_core 11527ec681f3Smrg- docs: reword a bit and list HTTPS before FTP 11537ec681f3Smrg- intel: add mi_builder_test for gen12 11547ec681f3Smrg- intel/compiler: add ASSERTED annotation to avoid "unused variable" 11557ec681f3Smrg warning 11567ec681f3Smrg- intel/compiler: replace \`0\` pointer with \`NULL\` 11577ec681f3Smrg- util/simple_mtx: don't set the canary when it can't be checked 11587ec681f3Smrg- anv: drop unused #include 11597ec681f3Smrg- travis: autodetect python version instead of hard-coding it 11607ec681f3Smrg- util/format: remove left-over util_format_description_table 11617ec681f3Smrg declaration 11627ec681f3Smrg- util/format: add PIPE_FORMAT_ASTC_*x*x*_SRGB to 11637ec681f3Smrg util_format_{srgb,linear}() 11647ec681f3Smrg- util/format: add trivial srgb<->linear conversion test 11657ec681f3Smrg- u_format: move format tests to util/tests/ 11667ec681f3Smrg- amd: fix empty-body issues 11677ec681f3Smrg- nine: fix empty-body-issues 11687ec681f3Smrg- meson: simplify install_megadrivers.py invocation 11697ec681f3Smrg- mesa: avoid returning a value in a void function 11707ec681f3Smrg- meson: use github URL for wraps instead of completely unreliable 11717ec681f3Smrg wrapdb 11727ec681f3Smrg- egl: drop confusing mincore() error message 11737ec681f3Smrg- llvmpipe: drop LLVM < 3.4 support 11747ec681f3Smrg- util/atomic: fix return type of p_atomic_add_return() fallback 11757ec681f3Smrg- util/os_socket: fix header unavailable on windows 11767ec681f3Smrg- freedreno/perfcntrs: fix fd leak 11777ec681f3Smrg- util/disk_cache: check for write() failure in the zstd path 11787ec681f3Smrg 11797ec681f3SmrgErico Nunes (17): 11807ec681f3Smrg 11817ec681f3Smrg- lima: fix nir shader memory leak 11827ec681f3Smrg- lima: fix bo submit memory leak 11837ec681f3Smrg- lima/ppir: enable lower_fdph 11847ec681f3Smrg- gallium/util: add alignment parameter to util_upload_index_buffer 11857ec681f3Smrg- lima: allocate separate bo to store varyings 11867ec681f3Smrg- lima: refactor indexed draw indices upload 11877ec681f3Smrg- vc4: move the draw splitting routine to shared code 11887ec681f3Smrg- lima: split draw calls on 64k vertices 11897ec681f3Smrg- lima/ppir: fix lod bias src 11907ec681f3Smrg- lima/ppir: remove assert on ppir_emit_tex unsupported feature 11917ec681f3Smrg- lima: set shader caps to optimize control flow 11927ec681f3Smrg- lima/ppir: remove orphan load node after cloning 11937ec681f3Smrg- lima/ppir: implement full liveness analysis for regalloc 11947ec681f3Smrg- lima/ppir: handle write to dead registers in ppir 11957ec681f3Smrg- lima/ppir: fix ssa undef emit 11967ec681f3Smrg- lima/ppir: split ppir_op_undef into undef and dummy again 11977ec681f3Smrg- lima/ppir: fix src read mask swizzling 11987ec681f3Smrg 11997ec681f3SmrgErik Faye-Lund (82): 12007ec681f3Smrg 12017ec681f3Smrg- zink: heap-allocate samplers objects 12027ec681f3Smrg- zink: emit line-width when using polygon line-mode 12037ec681f3Smrg- anv: remove incorrect polygonMode=point early-out 12047ec681f3Smrg- zink: use actual format for render-pass 12057ec681f3Smrg- zink: always allow mutating the format 12067ec681f3Smrg- zink: do not advertize coherent mapping 12077ec681f3Smrg- zink: disable fragment-shader texture-lod 12087ec681f3Smrg- zink: transition resources before resolving 12097ec681f3Smrg- zink: always allow sampling of images 12107ec681f3Smrg- zink: use u_blitter when format-reinterpreting 12117ec681f3Smrg- zink/spirv: drop temp-array for component-count 12127ec681f3Smrg- zink/spirv: support loading bool constants 12137ec681f3Smrg- zink/spirv: implement bany_fnequal[2-4] 12147ec681f3Smrg- zink/spirv: implement bany_inequal[2-4] 12157ec681f3Smrg- zink/spirv: implement ball_iequal[2-4] 12167ec681f3Smrg- zink/spirv: implement ball_fequal[2-4] 12177ec681f3Smrg- zink: do advertize integer support in shaders 12187ec681f3Smrg- zink/spirv: add support for nir_op_flrp 12197ec681f3Smrg- zink: correct depth-stencil format 12207ec681f3Smrg- nir: patch up deref-vars when lowering clip-planes 12217ec681f3Smrg- zink: always allow transfer to/from buffers 12227ec681f3Smrg- zink: implement buffer-to-buffer copies 12237ec681f3Smrg- zink: remove no-longer-needed hack 12247ec681f3Smrg- zink: move format-checking to separate source 12257ec681f3Smrg- zink: move filter-helper to separate helper-header 12267ec681f3Smrg- zink: move blitting to separate source 12277ec681f3Smrg- zink: move drawing separate source 12287ec681f3Smrg- st/mesa: unmap pbo after updating cache 12297ec681f3Smrg- zink: use true/false instead of TRUE/FALSE 12307ec681f3Smrg- zink: reject invalid sample-counts 12317ec681f3Smrg- zink: fix crash when restoring sampler-states 12327ec681f3Smrg- zink: delete query rather than allocating a new one 12337ec681f3Smrg- zink: do not try to destroy NULL-fence 12347ec681f3Smrg- zink: handle calloc-failure 12357ec681f3Smrg- zink: avoid NULL-deref 12367ec681f3Smrg- zink: avoid NULL-deref 12377ec681f3Smrg- zink: avoid NULL-deref 12387ec681f3Smrg- zink: error-check right variable 12397ec681f3Smrg- zink: silence coverity error 12407ec681f3Smrg- zink: enable PIPE_CAP_MIXED_COLORBUFFER_FORMATS 12417ec681f3Smrg- zink: implement nir_texop_txd 12427ec681f3Smrg- zink: implement txf 12437ec681f3Smrg- zink: implement some more trivial opcodes 12447ec681f3Smrg- zink: simplify front-face type 12457ec681f3Smrg- zink: factor out builtin-var creation 12467ec681f3Smrg- zink: implement load_vertex_id 12477ec681f3Smrg- zink: use nir_fmul_imm 12487ec681f3Smrg- zink: remove unused code-path in lower_pos_write 12497ec681f3Smrg- nir/zink: move clip_halfz-lowering to common code 12507ec681f3Smrg- etnaviv: use nir_lower_clip_halfz instead of open-coding 12517ec681f3Smrg- st/mesa: use uint-samplers for sampling stencil buffers 12527ec681f3Smrg- zink: fixup initialization of operand_mask / num_extra_operands 12537ec681f3Smrg- util: initialize float-array with float-literals 12547ec681f3Smrg- st/wgl: eliminate implicit cast warning 12557ec681f3Smrg- gallium: fix a warning 12567ec681f3Smrg- mesa/st: use float literals 12577ec681f3Smrg- docs: fix typo in html tag name 12587ec681f3Smrg- docs: fix paragraphs 12597ec681f3Smrg- docs: open paragraph before closing it 12607ec681f3Smrg- docs: use code-tag instead of pre-tag 12617ec681f3Smrg- docs: use code-tags instead of pre-tags 12627ec681f3Smrg- docs: use code-tags instead of pre-tags 12637ec681f3Smrg- docs: move paragraph closing tag 12647ec681f3Smrg- docs: remove double-closed definition-list 12657ec681f3Smrg- docs: do not double-close link tag 12667ec681f3Smrg- docs: do not use definition-list for sub-topics 12677ec681f3Smrg- docs: use figure/figcaption instead of tables 12687ec681f3Smrg- docs: remove trailing header 12697ec681f3Smrg- docs: remove leading spaces 12707ec681f3Smrg- docs: remove trailing newlines 12717ec681f3Smrg- docs: use [1] instead of asterisk for footnote 12727ec681f3Smrg- docs: remove pointless, stray newline 12737ec681f3Smrg- docs: fixup indentation 12747ec681f3Smrg- zink: implement nir_texop_txs 12757ec681f3Smrg- zink: support offset-variants of texturing 12767ec681f3Smrg- zink: avoid incorrect vector-construction 12777ec681f3Smrg- zink: store image-type per texture 12787ec681f3Smrg- zink: support sampling non-float textures 12797ec681f3Smrg- zink: support arrays of samplers 12807ec681f3Smrg- zink: set compareEnable when setting compareOp 12817ec681f3Smrg- st/mesa: use uint-result for sampling stencil buffers 12827ec681f3Smrg- Revert "nir: Add a couple trivial abs optimizations" 12837ec681f3Smrg 12847ec681f3SmrgFlorian Will (1): 12857ec681f3Smrg 12867ec681f3Smrg- radv/winsys: set IB flags prior to submit in the sysmem path 12877ec681f3Smrg 12887ec681f3SmrgFrancisco Jerez (26): 12897ec681f3Smrg 12907ec681f3Smrg- glsl: Fix software 64-bit integer to 32-bit float conversions. 12917ec681f3Smrg- intel/fs/gen11+: Handle ROR/ROL in lower_simd_width(). 12927ec681f3Smrg- intel/fs/gen8+: Fix r127 dst/src overlap RA workaround for EOT 12937ec681f3Smrg message payload. 12947ec681f3Smrg- intel/fs: Fix nir_intrinsic_load_barycentric_at_sample for SIMD32. 12957ec681f3Smrg- intel/fs/cse: Fix non-deterministic behavior due to inaccurate 12967ec681f3Smrg liveness calculation. 12977ec681f3Smrg- intel/fs: Make implied_mrf_writes() an fs_inst method. 12987ec681f3Smrg- intel/fs: Try to vectorize header setup in lower_load_payload(). 12997ec681f3Smrg- intel/fs: Generalize fs_reg::is_contiguous() to register files other 13007ec681f3Smrg than VGRF. 13017ec681f3Smrg- intel/fs: Rework fs_inst::is_copy_payload() into multiple 13027ec681f3Smrg classification helpers. 13037ec681f3Smrg- intel/fs: Extend copy propagation dataflow analysis to copies with 13047ec681f3Smrg FIXED_GRF source. 13057ec681f3Smrg- intel/fs: Add partial support for copy-propagating FIXED_GRFs. 13067ec681f3Smrg- intel/fs: Add support for copy-propagating a block of multiple 13077ec681f3Smrg FIXED_GRFs. 13087ec681f3Smrg- intel/fs: Allow limited copy propagation of a LOAD_PAYLOAD into 13097ec681f3Smrg another. 13107ec681f3Smrg- intel/fs/gen4-6: Allocate registers from aligned_pairs_class based on 13117ec681f3Smrg LINTERP use. 13127ec681f3Smrg- intel/fs/gen6: Constrain barycentric source of LINTERP during bank 13137ec681f3Smrg conflict mitigation. 13147ec681f3Smrg- intel/fs/gen6: Generalize aligned_pairs_class to SIMD16 aligned 13157ec681f3Smrg barycentrics. 13167ec681f3Smrg- intel/fs/gen6: Use SEL instead of bashing thread payload for unlit 13177ec681f3Smrg centroid workaround. 13187ec681f3Smrg- intel/fs: Split fetch_payload_reg() into separate helper for 13197ec681f3Smrg barycentrics. 13207ec681f3Smrg- intel/fs: Introduce barycentric layout lowering pass. 13217ec681f3Smrg- intel/fs: Switch to standard vector layout for barycentrics at 13227ec681f3Smrg optimization time. 13237ec681f3Smrg- intel/fs/cse: Make HALT instruction act as CSE barrier. 13247ec681f3Smrg- intel/fs/gen7: Fix fs_inst::flags_written() for 13257ec681f3Smrg SHADER_OPCODE_FIND_LIVE_CHANNEL. 13267ec681f3Smrg- intel/fs: Add virtual instruction to load mask of live channels into 13277ec681f3Smrg flag register. 13287ec681f3Smrg- intel/fs/gen12: Workaround unwanted SEND execution due to broken 13297ec681f3Smrg NoMask control flow. 13307ec681f3Smrg- intel/fs/gen12: Fixup/simplify SWSB annotations of SIMD32 scratch 13317ec681f3Smrg writes. 13327ec681f3Smrg- intel/fs/gen12: Workaround data coherency issues due to broken NoMask 13337ec681f3Smrg control flow. 13347ec681f3Smrg 13357ec681f3SmrgFritz Koenig (1): 13367ec681f3Smrg 13377ec681f3Smrg- freedreno: reorder format check 13387ec681f3Smrg 13397ec681f3SmrgGeorg Lehmann (3): 13407ec681f3Smrg 13417ec681f3Smrg- Correctly wait in the fragment stage until all semaphores are 13427ec681f3Smrg signaled 13437ec681f3Smrg- Vulkan Overlay: Don't try to change the image layout to present twice 13447ec681f3Smrg- Vulkan overlay: use the corresponding image index for each swapchain 13457ec681f3Smrg 13467ec681f3SmrgGert Wollny (12): 13477ec681f3Smrg 13487ec681f3Smrg- r600: Disable eight bit three channel formats 13497ec681f3Smrg- virgl: Increase the shader transfer buffer by doubling the size 13507ec681f3Smrg- gallium/tgsi_from_mesa: Add 'extern "C"' to be able to include from 13517ec681f3Smrg C++ 13527ec681f3Smrg- nir: make nir_get_texture_size/lod available outside nir_lower_tex 13537ec681f3Smrg- gallium: tgsi_from_mesa - handle VARYING_SLOT_FACE 13547ec681f3Smrg- r600: Add functions to dump the shader info 13557ec681f3Smrg- r600: Make it possible to include r600_asm.h in a C++ file 13567ec681f3Smrg- r600/sb: Correct SB disassambler for better debugging 13577ec681f3Smrg- r600: Fix maximum line width 13587ec681f3Smrg- r600: Make SID and unsigned value 13597ec681f3Smrg- r600: Delete vertex buffer only if there is actually a shader state 13607ec681f3Smrg- mesa/st: glsl_to_nir: don't lower atomics to SSBOs if driver supports 13617ec681f3Smrg HW atomics 13627ec681f3Smrg 13637ec681f3SmrgGuido Günther (2): 13647ec681f3Smrg 13657ec681f3Smrg- etnaviv: drm: Don't miscalculate timeout 13667ec681f3Smrg- freedreno/drm: Don't miscalculate timeout 13677ec681f3Smrg 13687ec681f3SmrgGurchetan Singh (11): 13697ec681f3Smrg 13707ec681f3Smrg- drirc: set allow_higher_compat_version for Faster Than Light 13717ec681f3Smrg- virgl/drm: update UAPI 13727ec681f3Smrg- teximage: split out helper from EGLImageTargetTexture2DOES 13737ec681f3Smrg- glapi / teximage: implement EGLImageTargetTexStorageEXT 13747ec681f3Smrg- dri_util: add driImageFormatToSizedInternalGLFormat function 13757ec681f3Smrg- i965: track if image is created by a dmabuf 13767ec681f3Smrg- i965: refactor intel_image_target_texture_2d 13777ec681f3Smrg- i965: support EXT_EGL_image_storage 13787ec681f3Smrg- st/dri: track if image is created by a dmabuf 13797ec681f3Smrg- st/mesa: refactor egl image binding a bit 13807ec681f3Smrg- st/mesa: implement EGLImageTargetTexStorage 13817ec681f3Smrg 13827ec681f3SmrgHyunjun Ko (7): 13837ec681f3Smrg 13847ec681f3Smrg- freedreno/ir3: cleanup by removing repeated code 13857ec681f3Smrg- freedreno: support 16b for the sampler opcode 13867ec681f3Smrg- freedreno/ir3: fix printing output registers of FS. 13877ec681f3Smrg- freedreno/ir3: fixup when changing to mad.f16 13887ec681f3Smrg- freedreno/ir3: enable half precision for pre-fs texture fetch 13897ec681f3Smrg- turnip: fix invalid VK_ERROR_OUT_OF_POOL_MEMORY 13907ec681f3Smrg- freedreno/ir3: put the conversion back for half const to the right 13917ec681f3Smrg place. 13927ec681f3Smrg 13937ec681f3SmrgIago Toral Quiroga (32): 13947ec681f3Smrg 13957ec681f3Smrg- v3d: rename vertex shader key (num)_fs_inputs fields 13967ec681f3Smrg- mesa/st: make sure we remove dead IO variables before handing NIR to 13977ec681f3Smrg backends 13987ec681f3Smrg- glsl: add missing initialization of the location path field 13997ec681f3Smrg- v3d: fix indirect BO allocation for uniforms 14007ec681f3Smrg- v3d: actually root the first BO in a command list in the job 14017ec681f3Smrg- v3d: add missing plumbing for VPM load instructions 14027ec681f3Smrg- v3d: add debug assert 14037ec681f3Smrg- v3d: enable debug options for geometry shader dumps 14047ec681f3Smrg- v3d: remove unused variable 14057ec681f3Smrg- v3d: add initial compiler plumbing for geometry shaders 14067ec681f3Smrg- v3d: fix packet descriptions for geometry and tessellation shaders 14077ec681f3Smrg- v3d: emit geometry shader state commands 14087ec681f3Smrg- v3d: implement geometry shader instancing 14097ec681f3Smrg- v3d: add 1-way SIMD packing definition 14107ec681f3Smrg- v3d: compute appropriate VPM memory configuration for geometry shader 14117ec681f3Smrg workloads 14127ec681f3Smrg- v3d: we always have at least one output segment 14137ec681f3Smrg- v3d: add support for adjacency primitives 14147ec681f3Smrg- v3d: don't try to render if shaders failed to compile 14157ec681f3Smrg- v3d: predicate geometry shader outputs inside non-uniform control 14167ec681f3Smrg flow 14177ec681f3Smrg- v3d: save geometry shader state for blitting 14187ec681f3Smrg- v3d: support transform feedback with geometry shaders 14197ec681f3Smrg- v3d: remove obsolete assertion 14207ec681f3Smrg- v3d: do not limit new CL space allocations with branch to 4096 bytes 14217ec681f3Smrg- v3d: support rendering to multi-layered framebuffers 14227ec681f3Smrg- v3d: move layer rendering to a separate helper 14237ec681f3Smrg- v3d: handle writes to gl_Layer from geometry shaders 14247ec681f3Smrg- v3d: fix primitive queries for geometry shaders 14257ec681f3Smrg- v3d: disable lowering of indirect inputs 14267ec681f3Smrg- v3d: support precompiling geometry shaders 14277ec681f3Smrg- v3d: expose OES_geometry_shader 14287ec681f3Smrg- u_vbuf: don't try to delete NULL driver CSO 14297ec681f3Smrg- v3d: fix bug when checking result of syncobj fence import 14307ec681f3Smrg 14317ec681f3SmrgIan Romanick (39): 14327ec681f3Smrg 14337ec681f3Smrg- intel/compiler: Report the number of non-spill/fill SEND messages on 14347ec681f3Smrg vec4 too 14357ec681f3Smrg- nir/algebraic: Add the ability to mark a replacement as exact 14367ec681f3Smrg- nir/algebraic: Mark other comparison exact when removing a == a 14377ec681f3Smrg- intel/fs: Disable conditional discard optimization on Gen4 and Gen5 14387ec681f3Smrg- nir/range-analysis: Add pragmas to help loop unrolling 14397ec681f3Smrg- nir/range_analysis: Make sure the table validation only occurs once 14407ec681f3Smrg- nir/opt_peephole_select: Don't count some unary operations 14417ec681f3Smrg- intel/compiler: Increase nir_opt_peephole_select threshold 14427ec681f3Smrg- nir/algebraic: Simplify some Inf and NaN avoidance code 14437ec681f3Smrg- nir/algebraic: Rearrange bcsel sequences generated by 14447ec681f3Smrg nir_opt_peephole_select 14457ec681f3Smrg- intel/compiler: Fix 'comparison is always true' warning 14467ec681f3Smrg- mesa: Silence 'left shift of negative value' warning in BPTC 14477ec681f3Smrg compression code 14487ec681f3Smrg- mesa: Silence unused parameter warning 14497ec681f3Smrg- anv: Fix error message format string 14507ec681f3Smrg- mesa: Extension boilerplate for INTEL_shader_integer_functions2 14517ec681f3Smrg- glsl: Add new expressions for INTEL_shader_integer_functions2 14527ec681f3Smrg- glsl_types: Add function to get an unsigned base type from a signed 14537ec681f3Smrg type 14547ec681f3Smrg- glsl: Add built-in functions for INTEL_shader_integer_functions2 14557ec681f3Smrg- nir: Add new instructions for INTEL_shader_integer_functions2 14567ec681f3Smrg- nir/algebraic: Add lowering for uabs_usub and uabs_isub 14577ec681f3Smrg- nir/algebraic: Add lowering for 64-bit hadd and rhadd 14587ec681f3Smrg- nir/algebraic: Add lowering for 64-bit usub_sat 14597ec681f3Smrg- nir/algebraic: Add lowering for 64-bit uadd_sat 14607ec681f3Smrg- nir/algebraic: Add lowering for 64-bit iadd_sat and isub_sat 14617ec681f3Smrg- compiler: Translate GLSL IR to NIR for new 14627ec681f3Smrg INTEL_shader_integer_functions2 expressions 14637ec681f3Smrg- intel/fs: Don't lower integer multiplies that don't need lowering 14647ec681f3Smrg- intel/fs: Add SHADER_OPCODE_[IU]SUB_SAT pseudo-ops 14657ec681f3Smrg- intel/fs: Implement support for NIR opcodes for 14667ec681f3Smrg INTEL_shader_integer_functions2 14677ec681f3Smrg- nir/spirv: Translate SPIR-V to NIR for new 14687ec681f3Smrg INTEL_shader_integer_functions2 opcodes 14697ec681f3Smrg- spirv: Silence a bunch of unused parameter warnings 14707ec681f3Smrg- spirv: Add support for IntegerFunctions2INTEL capability 14717ec681f3Smrg- i965: Enable INTEL_shader_integer_functions2 on Gen8+ 14727ec681f3Smrg- gallium: Add a cap bit for OpenCL-style extended integer functions 14737ec681f3Smrg- gallium: Add a cap bit for integer multiplication between 32-bit and 14747ec681f3Smrg 16-bit 14757ec681f3Smrg- iris: Enable INTEL_shader_integer_functions2 14767ec681f3Smrg- anv: Enable SPV_INTEL_shader_integer_functions2 and 14777ec681f3Smrg VK_INTEL_shader_integer_functions2 14787ec681f3Smrg- nir/algebraic: Optimize some 64-bit integer comparisons involving 14797ec681f3Smrg zero 14807ec681f3Smrg- relnotes: Add GL_INTEL_shader_integer_functions2 and 14817ec681f3Smrg VK_INTEL_shader_integer_functions2 14827ec681f3Smrg- intel/fs: Don't count integer instructions as being possibly coissue 14837ec681f3Smrg 14847ec681f3SmrgIcecream95 (16): 14857ec681f3Smrg 14867ec681f3Smrg- gallium/auxiliary: Reduce conversions in 14877ec681f3Smrg u_vbuf_get_minmax_index_mapped 14887ec681f3Smrg- gallium/auxiliary: Handle count == 0 in 14897ec681f3Smrg u_vbuf_get_minmax_index_mapped 14907ec681f3Smrg- panfrost: Add negative lod bias support 14917ec681f3Smrg- panfrost: Compact the bo_access readers array 14927ec681f3Smrg- panfrost: Dynamically allocate shader variants 14937ec681f3Smrg- panfrost: Add ETC1/ETC2 texture formats 14947ec681f3Smrg- panfrost: Add ASTC texture formats 14957ec681f3Smrg- pan/midgard: Fix bundle dynarray leak 14967ec681f3Smrg- pan/midgard: Fix a memory leak in the disassembler 14977ec681f3Smrg- pan/midgard: Support disassembling to a file 14987ec681f3Smrg- pan/bifrost: Support disassembling to a file 14997ec681f3Smrg- pan/decode: Support dumping to a file 15007ec681f3Smrg- pan/decode: Dump to a file 15017ec681f3Smrg- pan/decode: Rotate trace files 15027ec681f3Smrg- panfrost: Don't copy uniforms when the size is zero 15037ec681f3Smrg- pan/midgard: Fix a liveness info leak 15047ec681f3Smrg 15057ec681f3SmrgIcenowy Zheng (2): 15067ec681f3Smrg 15077ec681f3Smrg- lima: support indexed draw with bias 15087ec681f3Smrg- lima: fix lima_set_vertex_buffers() 15097ec681f3Smrg 15107ec681f3SmrgIlia Mirkin (7): 15117ec681f3Smrg 15127ec681f3Smrg- gm107/ir: fix loading z offset for layered 3d image bindings 15137ec681f3Smrg- nv50/ir: mark STORE destination inputs as used 15147ec681f3Smrg- nv50,nvc0: fix destination coordinates of blit 15157ec681f3Smrg- nvc0: add dummy reset status support 15167ec681f3Smrg- gm107/ir: avoid combining geometry shader stores at 0x60 15177ec681f3Smrg- nvc0: treat all draws without color0 broadcast as MRT 15187ec681f3Smrg- nvc0: disable xfb's which don't have a stride 15197ec681f3Smrg 15207ec681f3SmrgItalo Nicola (1): 15217ec681f3Smrg 15227ec681f3Smrg- intel/compiler: remove old comment 15237ec681f3Smrg 15247ec681f3SmrgIván Briano (4): 15257ec681f3Smrg 15267ec681f3Smrg- intel/compiler: Don't change hstride if not needed 15277ec681f3Smrg- anv: Export filter_minmax support only when it's really supported 15287ec681f3Smrg- anv: Export VK_KHR_buffer_device_address only when really supported 15297ec681f3Smrg- anv: Enable Vulkan 1.2 support 15307ec681f3Smrg 15317ec681f3SmrgJames Xiong (3): 15327ec681f3Smrg 15337ec681f3Smrg- iris: try to set the specified tiling when importing a dmabuf 15347ec681f3Smrg- gallium: dmabuf support for yuv formats that are not natively 15357ec681f3Smrg supported 15367ec681f3Smrg- gallium: let the pipe drivers decide the supported modifiers 15377ec681f3Smrg 15387ec681f3SmrgJan Vesely (2): 15397ec681f3Smrg 15407ec681f3Smrg- clover: Initialize Asm Parsers 15417ec681f3Smrg- clover: Use explicit conversion from llvm::StringRef to std::string 15427ec681f3Smrg 15437ec681f3SmrgJan Zielinski (8): 15447ec681f3Smrg 15457ec681f3Smrg- gallium/swr: Fix depth values for blit scenario 15467ec681f3Smrg- swr/rasterizer: Add tessellator implementation to the rasterizer 15477ec681f3Smrg- gallium/swr: Fix Windows build 15487ec681f3Smrg- gallium/gallivm/tgsi: enable tessellation shaders 15497ec681f3Smrg- gallium/gallivm: enable linking lp_bld_printf function with C++ code 15507ec681f3Smrg- gallium/swr: implementation of tessellation shaders compilation 15517ec681f3Smrg- gallium/swr: fix tessellation state save/restore 15527ec681f3Smrg- docs: Update SWR tessellation support 15537ec681f3Smrg 15547ec681f3SmrgJason Ekstrand (212): 15557ec681f3Smrg 15567ec681f3Smrg- util: Add a util_sparse_array data structure 15577ec681f3Smrg- anv: Move refcount to anv_bo 15587ec681f3Smrg- anv: Use a util_sparse_array for the GEM handle -> BO map 15597ec681f3Smrg- anv: Fix a relocation race condition 15607ec681f3Smrg- anv: Stop storing the GEM handle in anv_reloc_list_add 15617ec681f3Smrg- anv: Declare the bo in the anv_block_pool_foreach_bo loop 15627ec681f3Smrg- anv: Inline anv_block_pool_get_bo 15637ec681f3Smrg- anv: Replace ANV_BO_EXTERNAL with anv_bo::is_external 15647ec681f3Smrg- anv: Handle state pool relocations using "wrapper" BOs 15657ec681f3Smrg- anv: Fix a potential BO handle leak 15667ec681f3Smrg- anv: Rework anv_block_pool_expand_range 15677ec681f3Smrg- anv: Use anv_block_pool_foreach_bo in get_bo_from_pool 15687ec681f3Smrg- anv: Rework the internal BO allocation API 15697ec681f3Smrg- anv: Choose BO flags internally in anv_block_pool 15707ec681f3Smrg- anv/tests: Zero-initialize instances 15717ec681f3Smrg- anv/tests: Initialize the BO cache and device mutex 15727ec681f3Smrg- anv: Allocate block pool BOs from the cache 15737ec681f3Smrg- anv: Use the query_slot helper in vkResetQueryPoolEXT 15747ec681f3Smrg- anv: Allocate query pool BOs from the cache 15757ec681f3Smrg- anv: Set more flags on descriptor pool buffers 15767ec681f3Smrg- anv: Allocate descriptor buffers from the BO cache 15777ec681f3Smrg- util: Add a free list structure for use with util_sparse_array 15787ec681f3Smrg- anv: Allocate batch and fence buffers from the cache 15797ec681f3Smrg- anv: Allocate scratch BOs from the cache 15807ec681f3Smrg- anv: Allocate misc BOs from the cache 15817ec681f3Smrg- anv: Drop anv_bo_init and anv_bo_init_new 15827ec681f3Smrg- anv: Add a device parameter to anv_execbuf_add_bo 15837ec681f3Smrg- anv: Set the batch allocator for compute pipelines 15847ec681f3Smrg- anv: Use a bitset for tracking residency 15857ec681f3Smrg- anv: Zero released anv_bo structs 15867ec681f3Smrg- anv: Use the new BO alloc API for Android 15877ec681f3Smrg- anv: Don't delete fragment shaders that write sample mask 15887ec681f3Smrg- anv: Don't claim the null RT as a valid color target 15897ec681f3Smrg- anv: Stop compacting render targets in the binding table 15907ec681f3Smrg- anv: Move the RT BTI flush workaround to begin_subpass 15917ec681f3Smrg- spirv: Remove the type from sampled_image 15927ec681f3Smrg- spirv: Add a vtn_decorate_pointer helper 15937ec681f3Smrg- spirv: Sort out the mess that is sampled image 15947ec681f3Smrg- nir/builder: Add a nir_extract_bits helper 15957ec681f3Smrg- nir: Add tests for nir_extract_bits 15967ec681f3Smrg- intel/nir: Use nir_extract_bits in lower_mem_access_bit_sizes 15977ec681f3Smrg- intel/fs: Add DWord scattered read/write opcodes 15987ec681f3Smrg- intel/fs: refactor surface header setup 15997ec681f3Smrg- intel/nir: Plumb devinfo through lower_mem_access_bit_sizes 16007ec681f3Smrg- intel/fs: Implement the new load/store_scratch intrinsics 16017ec681f3Smrg- intel/fs: Lower large local arrays to scratch 16027ec681f3Smrg- anv: Lock around fetching sync file FDs from semaphores 16037ec681f3Smrg- anv: Plumb timeline semaphore signal/wait values through from the API 16047ec681f3Smrg- spirv: Fix the MSVC build 16057ec681f3Smrg- anv/pipeline: Assume layout != NULL 16067ec681f3Smrg- genxml: Mark everything in genX_pack.h always_inline 16077ec681f3Smrg- anv: Input attachments are always single-plane 16087ec681f3Smrg- anv: Flatten descriptor bindings in anv_nir_apply_pipeline_layout 16097ec681f3Smrg- anv: Delete dead shader constant pushing code 16107ec681f3Smrg- anv: Stop bounds-checking pushed UBOs 16117ec681f3Smrg- anv: Pre-compute push ranges for graphics pipelines 16127ec681f3Smrg- intel/compiler: Add a flag to avoid compacting push constants 16137ec681f3Smrg- anv: Re-arrange push constant data a bit 16147ec681f3Smrg- anv: Rework push constant handling 16157ec681f3Smrg- anv: Use a switch statement for binding table setup 16167ec681f3Smrg- anv: More carefully dirty state in BindDescriptorSets 16177ec681f3Smrg- anv: More carefully dirty state in BindPipeline 16187ec681f3Smrg- anv: Use an anv_state for the next binding table 16197ec681f3Smrg- anv: Emit a NULL vertex for zero base_vertex/instance 16207ec681f3Smrg- nir: Validate that variables are in the right lists 16217ec681f3Smrg- iris: Re-enable param compaction 16227ec681f3Smrg- Revert "i965/fs: Merge CMP and SEL into CSEL on Gen8+" 16237ec681f3Smrg- vulkan/enum_to_str: Handle out-of-order aliases 16247ec681f3Smrg- anv/entrypoints: Better handle promoted extensions 16257ec681f3Smrg- vulkan: Update the XML and headers to 1.1.129 16267ec681f3Smrg- anv: Push constants are relative to dynamic state on IVB 16277ec681f3Smrg- anv: Set up SBE_SWIZ properly for gl_Viewport 16287ec681f3Smrg- anv: Respect the always_flush_cache driconf option 16297ec681f3Smrg- iris: Stop setting up fake params 16307ec681f3Smrg- anv: Drop bo_flags from anv_bo_pool 16317ec681f3Smrg- anv: Add a has_softpin boolean 16327ec681f3Smrg- blorp: Pass the VB size to the VF cache workaround 16337ec681f3Smrg- anv: Always invalidate the VF cache in BeginCommandBuffer 16347ec681f3Smrg- anv: Apply cache flushes after setting index/draw VBs 16357ec681f3Smrg- anv: Use PIPE_CONTROL flushes to implement the gen8 VF cache WA 16367ec681f3Smrg- anv: Don't leak when set_tiling fails 16377ec681f3Smrg- util/atomic: Add a \_return variant of p_atomic_add 16387ec681f3Smrg- anv: Disallow allocating above heap sizes 16397ec681f3Smrg- anv: Stop tracking VMA allocations 16407ec681f3Smrg- anv: Set up VMA heaps independently from memory heaps 16417ec681f3Smrg- anv: Stop advertising two heaps just for the VF cache WA 16427ec681f3Smrg- anv: Add an explicit_address parameter to anv_device_alloc_bo 16437ec681f3Smrg- util/vma: Factor out the hole splitting part of util_vma_heap_alloc 16447ec681f3Smrg- util/vma: Add a function to allocate a particular address range 16457ec681f3Smrg- anv: Add allocator support for client-visible addresses 16467ec681f3Smrg- anv: Use a pNext loop in AllocateMemory 16477ec681f3Smrg- anv: Implement VK_KHR_buffer_device_address 16487ec681f3Smrg- util/atomic: Add p_atomic_add_return for the unlocked path 16497ec681f3Smrg- vulkan/wsi: Provide the implicitly synchronized BO to vkQueueSubmit 16507ec681f3Smrg- vulkan/wsi: Add a hooks for signaling semaphores and fences 16517ec681f3Smrg- anv: Always add in EXEC_OBJECT_WRITE when specified in extra_flags 16527ec681f3Smrg- anv: Use submit-time implicit sync instead of allocate-time 16537ec681f3Smrg- anv: Add a fence_reset_reset_temporary helper 16547ec681f3Smrg- anv: Use BO fences/semaphores for AcquireNextImage 16557ec681f3Smrg- anv: Return VK_ERROR_OUT_OF_DEVICE_MEMORY for too-large buffers 16567ec681f3Smrg- anv: Re-capture all batch and state buffers 16577ec681f3Smrg- anv: Re-emit all compute state on pipeline switch 16587ec681f3Smrg- ANV: Stop advertising smoothLines support on gen10+ 16597ec681f3Smrg- anv: Flush the queue on DeviceWaitIdle 16607ec681f3Smrg- anv: Unconditionally advertise Vulkan 1.1 16617ec681f3Smrg- anv: Bump the advertised patch version to 129 16627ec681f3Smrg- i965: Enable GL_EXT_gpu_shader4 on Gen6+ 16637ec681f3Smrg- anv: Properly advertise sampledImageIntegerSampleCounts 16647ec681f3Smrg- anv: Drop unneeded struct keywords 16657ec681f3Smrg- blorp: Stop whacking Z24 depth to BGRA8 16667ec681f3Smrg- blorp: Allow reading with HiZ 16677ec681f3Smrg- i965/blorp: Don't resolve HiZ unless we're reinterpreting 16687ec681f3Smrg- intel/blorp: Use the source format when using blorp_copy with HiZ 16697ec681f3Smrg- anv: Allow HiZ in TRANSFER_SRC_OPTIMAL on Gen8-9 16707ec681f3Smrg- i965: Allow HiZ for glCopyImageSubData sources 16717ec681f3Smrg- intel/nir: Add a memory barrier before barrier() 16727ec681f3Smrg- intel/disasm: Fix decoding of src0 of SENDS 16737ec681f3Smrg- genxml: Remove a non-existant HW bit 16747ec681f3Smrg- anv: Don't add dynamic state base address to push constants on Gen7 16757ec681f3Smrg- anv: Flag descriptors dirty when gl_NumWorkgroups is used 16767ec681f3Smrg- anv: Re-use flush_descriptor_sets in flush_compute_state 16777ec681f3Smrg- intel/vec4: Support scoped_memory_barrier 16787ec681f3Smrg- nir: Handle more barriers in dead_write and copy_prop 16797ec681f3Smrg- nir: Handle barriers with more granularity in combine_stores 16807ec681f3Smrg- llmvpipe: No-op implement more barriers 16817ec681f3Smrg- nir: Add a new memory_barrier_tcs_patch intrinsic 16827ec681f3Smrg- spirv: Add a workaround for OpControlBarrier on old GLSLang 16837ec681f3Smrg- spirv: Add output memory semantics to OpControlBarrier in TCS 16847ec681f3Smrg- nir/glsl: Emit memory barriers as part of barrier() 16857ec681f3Smrg- intel/nir: Stop adding redundant barriers 16867ec681f3Smrg- nir: Rename nir_intrinsic_barrier to control_barrier 16877ec681f3Smrg- nir/lower_atomics_to_ssbo: Also lower barriers 16887ec681f3Smrg- anv: Drop an unused variable 16897ec681f3Smrg- intel/blorp: Fill out all the dwords of MI_ATOMIC 16907ec681f3Smrg- anv: Don't over-advertise descriptor indexing features 16917ec681f3Smrg- anv: Memset array properties 16927ec681f3Smrg- vulkan/wsi: Add a driconf option to force WSI to advertise 16937ec681f3Smrg BGRA8_UNORM first 16947ec681f3Smrg- vulkan: Update the XML and headers to 1.2.131 16957ec681f3Smrg- turnip: Pretend to support Vulkan 1.2 16967ec681f3Smrg- anv: Bump the patch version to 131 16977ec681f3Smrg- anv,nir: Lower quad_broadcast with dynamic index in NIR 16987ec681f3Smrg- anv: Implement the new core version feature queries 16997ec681f3Smrg- anv: Implement the new core version property queries 17007ec681f3Smrg- relnotes: Add Vulkan 1.2 17017ec681f3Smrg- anv: Drop some VK_IMAGE_TILING_OPTIMAL checks 17027ec681f3Smrg- anv: Support modifiers in GetImageFormatProperties2 17037ec681f3Smrg- vulkan/wsi: Move the ImageCreateInfo higher up 17047ec681f3Smrg- vulkan/wsi: Use the interface from the real modifiers extension 17057ec681f3Smrg- vulkan/wsi: Filter modifiers with ImageFormatProperties 17067ec681f3Smrg- vulkan/wsi: Implement VK_KHR_swapchain_mutable_format 17077ec681f3Smrg- anv/blorp: Rename buffer image stride parameters 17087ec681f3Smrg- anv: Canonicalize buffer formats for image/buffer copies 17097ec681f3Smrg- anv: Add an anv_physical_device field to anv_device 17107ec681f3Smrg- anv: Take an anv_device in vk_errorf 17117ec681f3Smrg- anv: Take a device in anv_perf_warn 17127ec681f3Smrg- anv: Stop allocating WSI event fences off the instance 17137ec681f3Smrg- anv: Drop the instance pointer from anv_device 17147ec681f3Smrg- anv: Move the physical device dispatch table to anv_instance 17157ec681f3Smrg- anv: Drop separate chipset_id fields 17167ec681f3Smrg- anv: Re-arrange physical_device_init 17177ec681f3Smrg- anv: Allow enumerating multiple physical devices 17187ec681f3Smrg- anv/apply_pipeline_layout: Initialize the nir_builder before use 17197ec681f3Smrg- intel/blorp: resize src and dst surfaces separately 17207ec681f3Smrg- anv: Use TRANSFER_SRC_OPTIMAL for depth/stencil MSAA resolves 17217ec681f3Smrg- anv: Add a layout_to_aux_state helper 17227ec681f3Smrg- anv: Use isl_aux_state for HiZ resolves 17237ec681f3Smrg- anv: Add a usage parameter to anv_layout_to_aux_usage 17247ec681f3Smrg- anv: Allow HiZ in read-only depth layouts 17257ec681f3Smrg- anv: Improve BTI change cache flushing 17267ec681f3Smrg- intel/fs: Don't unnecessarily fall back to indirect sends on Gen12 17277ec681f3Smrg- intel/disasm: Properly disassemble indirect SENDs 17287ec681f3Smrg- intel/isl: Plumb devinfo into isl_genX(buffer_fill_state_s) 17297ec681f3Smrg- intel/isl: Add a hack for the Gen12 A0 texture buffer bug 17307ec681f3Smrg- anv: Rework the meaning of anv_image::planes[]::aux_usage 17317ec681f3Smrg- anv: Replace aux_surface.isl.size_B checks with aux_usage checks 17327ec681f3Smrg- intel/aux-map: Add some #defines 17337ec681f3Smrg- intel/aux-map: Factor out some useful helpers 17347ec681f3Smrg- anv: Delete a redundant calculation 17357ec681f3Smrg- isl: Add a helper for calculating subimage memory ranges 17367ec681f3Smrg- anv: Add another align_down helper 17377ec681f3Smrg- anv: Make AUX table invalidate a PIPE\_\* bit 17387ec681f3Smrg- anv: Make anv_vma_alloc/free a lot dumber 17397ec681f3Smrg- anv: Rework CCS memory handling on TGL-LP 17407ec681f3Smrg- intel/blorp: Add support for CCS_E copies with UNORM formats 17417ec681f3Smrg- intel/isl: Allow CCS_E on more formats 17427ec681f3Smrg- intel/genxml: Make SO_DECL::"Hole Flag" a Boolean 17437ec681f3Smrg- anv: Insert holes for non-existant XFB varyings 17447ec681f3Smrg- intel/blorp: Handle bit-casting UNORM and BGRA formats 17457ec681f3Smrg- anv: Replace one more aux_surface.isl.size_B check 17467ec681f3Smrg- intel/mi_builder: Force write completion on Gen12+ 17477ec681f3Smrg- anv: Set actual state pool sizes when we have softpin 17487ec681f3Smrg- anv: Re-use one old BT block in reset_batch_bo_chain 17497ec681f3Smrg- anv/block_pool: Ensure allocations have contiguous maps 17507ec681f3Smrg- anv: Rename a variable 17517ec681f3Smrg- genxml: Add a new 3DSTATE_SF field on gen12 17527ec681f3Smrg- anv,iris: Set 3DSTATE_SF::DerefBlockSize to per-poly on Gen12+ 17537ec681f3Smrg- intel/genxml: Drop SLMEnable from L3CNTLREG on Gen11 17547ec681f3Smrg- iris: Set SLMEnable based on the L3$ config 17557ec681f3Smrg- iris: Store the L3$ configs in the screen 17567ec681f3Smrg- iris: Use the URB size from the L3$ config 17577ec681f3Smrg- i965: Re-emit l3 state before BLORP executes 17587ec681f3Smrg- intel: Take a gen_l3_config in gen_get_urb_config 17597ec681f3Smrg- intel/blorp: Always emit URB config on Gen7+ 17607ec681f3Smrg- iris: Consolodate URB emit 17617ec681f3Smrg- anv: Emit URB setup earlier 17627ec681f3Smrg- intel/common: Return the block size from get_urb_config 17637ec681f3Smrg- intel/blorp: Plumb deref block size through to 3DSTATE_SF 17647ec681f3Smrg- anv: Plumb deref block size through to 3DSTATE_SF 17657ec681f3Smrg- iris: Plumb deref block size through to 3DSTATE_SF 17667ec681f3Smrg- anv: Always fill out the AUX table even if CCS is disabled 17677ec681f3Smrg- intel/fs: Write the address register with NoMask for MOV_INDIRECT 17687ec681f3Smrg- anv/blorp: Use the correct size for vkCmdCopyBufferToImage 17697ec681f3Smrg 17707ec681f3SmrgJonathan Gray (4): 17717ec681f3Smrg 17727ec681f3Smrg- winsys/amdgpu: avoid double simple_mtx_unlock() 17737ec681f3Smrg- i965: update Makefile.sources for perf changes 17747ec681f3Smrg- util/futex: use futex syscall on OpenBSD 17757ec681f3Smrg- util/u_thread: don't restrict u_thread_get_time_nano() to \__linux_\_ 17767ec681f3Smrg 17777ec681f3SmrgJonathan Marek (98): 17787ec681f3Smrg 17797ec681f3Smrg- freedreno: add Adreno 640 ID 17807ec681f3Smrg- freedreno/ir3: disable texture prefetch for 1d array textures 17817ec681f3Smrg- freedreno/registers: fix a6xx_2d_blit_cntl ROTATE 17827ec681f3Smrg- etnaviv: blt: use only for tiling, and add missing formats 17837ec681f3Smrg- etnaviv: separate PE and RS formats, use only RS only for tiling 17847ec681f3Smrg- etnaviv: blt: set TS dirty after clear 17857ec681f3Smrg- turnip: add display wsi 17867ec681f3Smrg- turnip: add x11 wsi 17877ec681f3Smrg- turnip: implement CmdClearColorImage/CmdClearDepthStencilImage 17887ec681f3Smrg- turnip: fix sRGB GMEM clear 17897ec681f3Smrg- util: add missing R8G8B8A8_SRGB format to vk_format_map 17907ec681f3Smrg- freedreno/regs: update UBWC related bits 17917ec681f3Smrg- turnip: implement UBWC 17927ec681f3Smrg- etnaviv: avoid using RS for 64bpp formats 17937ec681f3Smrg- etnaviv: implement 64bpp clear 17947ec681f3Smrg- etnaviv: blt: fix partial ZS clears with TS 17957ec681f3Smrg- etnaviv: support 3d/array/integer formats in texture descriptors 17967ec681f3Smrg- turnip: fix integer render targets 17977ec681f3Smrg- freedreno/registers: add missing MH perfcounter enum for a2xx 17987ec681f3Smrg- freedreno/perfcntrs: add a2xx MH counters 17997ec681f3Smrg- freedreno/perfcntrs/fdperf: fix u64 print on 32-bit builds 18007ec681f3Smrg- freedreno/perfcntrs/fdperf: add missing a20x compatible 18017ec681f3Smrg- freedreno/perfcntrs/fdperf: add missing a2xx case in select_counter 18027ec681f3Smrg- turnip: fix display wsi fence timing out 18037ec681f3Smrg- turnip: don't skip unused attachments when setting up tiling config 18047ec681f3Smrg- turnip: implement CmdClearAttachments 18057ec681f3Smrg- turnip: don't set unused BLIT_DST_INFO bits for GMEM clear 18067ec681f3Smrg- turnip: MSAA resolve directly from GMEM 18077ec681f3Smrg- turnip: allow writes to draw_cs outside of render pass 18087ec681f3Smrg- turnip: add function to allocate aligned memory in a substream cs 18097ec681f3Smrg- turnip: improve emit_textures 18107ec681f3Smrg- turnip: implement border color 18117ec681f3Smrg- turnip: add hw binning 18127ec681f3Smrg- turnip: fix incorrectly failing assert 18137ec681f3Smrg- freedreno/ir3: add GLSL_SAMPLER_DIM_SUBPASS to tex_info 18147ec681f3Smrg- freedreno/registers: add a6xx texture format for stencil sampler 18157ec681f3Smrg- turnip: fix hw binning render area 18167ec681f3Smrg- turnip: fix tile layout logic 18177ec681f3Smrg- turnip: update tile_align_w/tile_align_h 18187ec681f3Smrg- turnip: set load_layer_id to zero 18197ec681f3Smrg- turnip: set FRAG_WRITES_SAMPMASK bit 18207ec681f3Smrg- turnip: fix VK_IMAGE_ASPECT_STENCIL_BIT image view 18217ec681f3Smrg- turnip: no 8x msaa on 128bpp formats 18227ec681f3Smrg- turnip: add dirty bit for push constants 18237ec681f3Smrg- turnip: subpass rework 18247ec681f3Smrg- turnip: CmdClearAttachments fixes 18257ec681f3Smrg- turnip: implement subpass input attachments 18267ec681f3Smrg- etnaviv: remove sRGB formats from format table 18277ec681f3Smrg- etnaviv: sRGB render target support 18287ec681f3Smrg- etnaviv: set output mode and saturate bits 18297ec681f3Smrg- etnaviv: update INT_FILTER choice for GLES3 formats 18307ec681f3Smrg- etnaviv: disable integer vertex formats on pre-HALTI2 hardware 18317ec681f3Smrg- etnaviv: remove swizzle from format table 18327ec681f3Smrg- etnaviv: add missing formats 18337ec681f3Smrg- etnaviv: add missing vs_needs_z_div handling to NIR backend 18347ec681f3Smrg- turnip: use single substream cs 18357ec681f3Smrg- turnip: use common blit path for buffer copy 18367ec681f3Smrg- turnip: don't require src image to be set for clear blits 18377ec681f3Smrg- turnip: implement CmdFillBuffer/CmdUpdateBuffer 18387ec681f3Smrg- freedreno/ir3: lower mul_2x32_64 18397ec681f3Smrg- turnip: fix emit_textures for compute shaders 18407ec681f3Smrg- turnip: remove compute emit_border_color 18417ec681f3Smrg- turnip: fix emit_ibo 18427ec681f3Smrg- turnip: change emit_ibo to be like emit_textures 18437ec681f3Smrg- turnip: remove duplicate A6XX_SP_CS_CONFIG_NIBO 18447ec681f3Smrg- nir: add option to lower half packing opcodes 18457ec681f3Smrg- freedreno/ir3: lower pack/unpack ops 18467ec681f3Smrg- turnip: don't set LRZ enable at end of renderpass 18477ec681f3Smrg- freedreno/ir3: update prefetch input_offset when packing inlocs 18487ec681f3Smrg- turnip: add cache invalidate to fix input attachment cases 18497ec681f3Smrg- turnip: don't set SP_FS_CTRL_REG0_VARYING if only fragcoord is used 18507ec681f3Smrg- freedreno/ir3: fix vertex shader sysvals with pre_assign_inputs 18517ec681f3Smrg- freedreno/registers: document vertex/instance id offset bits 18527ec681f3Smrg- freedreno/ir3: support load_base_instance 18537ec681f3Smrg- turnip: emit base instance vs driver param 18547ec681f3Smrg- turnip: emit_compute_driver_params fixes 18557ec681f3Smrg- turnip: compute gmem offsets at renderpass creation time 18567ec681f3Smrg- turnip: implement secondary command buffers 18577ec681f3Smrg- nir: fix assign_io_var_locations for vertex inputs 18587ec681f3Smrg- turnip: minor warning fixes 18597ec681f3Smrg- util/format: add missing vulkan formats 18607ec681f3Smrg- turnip: disable B8G8R8 vertex formats 18617ec681f3Smrg- etnaviv: fix incorrectly failing vertex size assert 18627ec681f3Smrg- etnaviv: update headers from rnndb 18637ec681f3Smrg- etnaviv: HALTI2+ instanced draw 18647ec681f3Smrg- etnaviv: implement gl_VertexID/gl_InstanceID 18657ec681f3Smrg- etnaviv: remove unnecessary vertex_elements_state_create error 18667ec681f3Smrg checking 18677ec681f3Smrg- st/mesa: don't lower YUV when driver supports it natively 18687ec681f3Smrg- st/mesa: run st_nir_lower_tex_src_plane for lowered xyuv/ayuv 18697ec681f3Smrg- freedreno/ir3: allow inputs with the same location 18707ec681f3Smrg- turnip: remove tu_sort_variables_by_location 18717ec681f3Smrg- turnip: fix array/matrix varyings 18727ec681f3Smrg- turnip: hook up GetImageDrmFormatModifierPropertiesEXT 18737ec681f3Smrg- turnip: set linear tiling for scanout images 18747ec681f3Smrg- vulkan/wsi: remove unused image_get_modifier 18757ec681f3Smrg- turnip: simplify tu_physical_device_get_format_properties 18767ec681f3Smrg- etnaviv: implement UBOs 18777ec681f3Smrg- turnip: hook up cmdbuffer event set/wait 18787ec681f3Smrg 18797ec681f3SmrgJordan Justen (7): 18807ec681f3Smrg 18817ec681f3Smrg- iris: Add IRIS_DIRTY_RENDER_BUFFER state flag 18827ec681f3Smrg- iris/gen11+: Move flush for render target change 18837ec681f3Smrg- iris: Allow max dynamic pool size of 2GB for gen12 18847ec681f3Smrg- intel: Remove unused Tigerlake PCI ID 18857ec681f3Smrg- iris: Fix some indentation in iris_init_render_context 18867ec681f3Smrg- iris: Emit CS Stall before Instruction Cache flush for gen12 WA 18877ec681f3Smrg- anv: Emit CS Stall before Instruction Cache flush for gen12 WA 18887ec681f3Smrg 18897ec681f3SmrgJose Maria Casanova Crespo (1): 18907ec681f3Smrg 18917ec681f3Smrg- v3d: Fix predication with atomic image operations 18927ec681f3Smrg 18937ec681f3SmrgJuan A. Suarez Romero (3): 18947ec681f3Smrg 18957ec681f3Smrg- nir/lower_double_ops: relax lower mod() 18967ec681f3Smrg- Revert "nir/lower_double_ops: relax lower mod()" 18977ec681f3Smrg- nir/spirv: skip unreachable blocks in Phi second pass 18987ec681f3Smrg 18997ec681f3SmrgKai Wasserbäch (4): 19007ec681f3Smrg 19017ec681f3Smrg- nir: fix unused variable warning in nir_lower_vars_to_explicit_types 19027ec681f3Smrg- nir: fix unused variable warning in 19037ec681f3Smrg find_and_update_previous_uniform_storage 19047ec681f3Smrg- nir: fix unused function warning in src/compiler/nir/nir.c 19057ec681f3Smrg- intel/gen_decoder: Fix unused-but-set-variable warning 19067ec681f3Smrg 19077ec681f3SmrgKarol Herbst (14): 19087ec681f3Smrg 19097ec681f3Smrg- nv50/ir: fix crash in isUniform for undefined values 19107ec681f3Smrg- nir/validate: validate num_components on registers and intrinsics 19117ec681f3Smrg- nir/serialize: fix vec8 and vec16 19127ec681f3Smrg- nir/tests: add serializer tests 19137ec681f3Smrg- nir/tests: MSVC build fix 19147ec681f3Smrg- spirv: handle UniformConstant for OpenCL kernels 19157ec681f3Smrg- clover/nir: treat UniformConstant as global memory 19167ec681f3Smrg- clover/nir: set spirv environment to OpenCL 19177ec681f3Smrg- clover/spirv: allow Int64 Atomics for supported devices 19187ec681f3Smrg- nir: handle nir_deref_type_ptr_as_array in 19197ec681f3Smrg rematerialize_deref_in_block 19207ec681f3Smrg- nv50/ir: implement global atomics and handle it for nir 19217ec681f3Smrg- nir/serialize: cast swizzle before shifting 19227ec681f3Smrg- aco: use NIR_MAX_VEC_COMPONENTS instead of 4 19237ec681f3Smrg- nv50ir/nir: support vec8 and vec16 19247ec681f3Smrg 19257ec681f3SmrgKenneth Graunke (57): 19267ec681f3Smrg 19277ec681f3Smrg- iris: Fix "Force Zero RTA Index Enable" setting again 19287ec681f3Smrg- nir: Handle image arrays when setting variable data 19297ec681f3Smrg- Revert "intel/blorp: Fix usage of uninitialized memory in key 19307ec681f3Smrg hashing" 19317ec681f3Smrg- iris: Properly move edgeflag_out from output list to global list 19327ec681f3Smrg- iris: Wrap iris_fix_edge_flags in NIR_PASS 19337ec681f3Smrg- mesa: Handle GL_COLOR_INDEX in \_mesa_format_from_format_and_type(). 19347ec681f3Smrg- iris: Change keybox parenting 19357ec681f3Smrg- iris: Stop mutating the resource in get_rt_read_isl_surf(). 19367ec681f3Smrg- iris: Drop 'old_address' parameter from iris_rebind_buffer 19377ec681f3Smrg- iris: Create an "iris_surface_state" wrapper struct 19387ec681f3Smrg- iris: Maintain CPU-side SURFACE_STATE copies for views and surfaces. 19397ec681f3Smrg- iris: Update SURFACE_STATE addresses when setting sampler views 19407ec681f3Smrg- iris: Disable VF cache partial address workaround on Gen11+ 19417ec681f3Smrg- driconf, glsl: Add a vs_position_always_invariant option 19427ec681f3Smrg- drirc: Set vs_position_always_invariant for Shadow of Mordor on Intel 19437ec681f3Smrg- st/mesa: Add GL_TDFX_texture_compression_FXT1 support 19447ec681f3Smrg- iris: Map FXT1 texture formats 19457ec681f3Smrg- meson: Add a "prefer_iris" build option 19467ec681f3Smrg- main: Change u_mmAllocMem align2 from bytes (old API) to bits (new 19477ec681f3Smrg API) 19487ec681f3Smrg- meson: Include iris in default gallium-drivers for x86/x86_64 19497ec681f3Smrg- util: Detect use-after-destroy in simple_mtx 19507ec681f3Smrg- intel/genxml: Add a partial TCCNTLREG definition 19517ec681f3Smrg- iris: Enable Gen11 Color/Z write merging optimization 19527ec681f3Smrg- anv: Enable Gen11 Color/Z write merging optimization 19537ec681f3Smrg- intel/decoder: Make get_state_size take a full 64-bit address and a 19547ec681f3Smrg base 19557ec681f3Smrg- iris: Create smaller program keys without legacy features 19567ec681f3Smrg- iris: Default to X-tiling for scanout buffers without modifiers 19577ec681f3Smrg- iris: Alphabetize source files after iris_perf.c was added 19587ec681f3Smrg- drirc: Final Fantasy VIII: Remastered needs 19597ec681f3Smrg allow_higher_compat_version 19607ec681f3Smrg- iris: Make helper functions to turn iris shader keys into brw keys. 19617ec681f3Smrg- iris: Fix shader recompile debug printing 19627ec681f3Smrg- iris: Avoid replacing backing storage for buffers with no contents 19637ec681f3Smrg- intel: Drop Gen11 WaBTPPrefetchDisable workaround 19647ec681f3Smrg- st/nir: Optionally unify inputs_read/outputs_written when linking. 19657ec681f3Smrg- iris: Set nir_shader_compiler_options::unify_interfaces. 19667ec681f3Smrg- st/mesa: Allow ASTC5x5 fallbacks separately from other ASTC LDR 19677ec681f3Smrg formats. 19687ec681f3Smrg- iris: Disable ASTC 5x5 support on Gen9 for now. 19697ec681f3Smrg- iris: Delete remnants of the unimplemented ASTC 5x5 workaround 19707ec681f3Smrg- iris: Allow HiZ for copy_region sources 19717ec681f3Smrg- anv: Only enable EWA LOD algorithm when doing anisotropic filtering. 19727ec681f3Smrg- Revert "nir: assert that nir_lower_tex runs after lowering derefs" 19737ec681f3Smrg- i965: Simplify brw_get_renderer_string() 19747ec681f3Smrg- iris: Simplify iris_get_renderer_string() 19757ec681f3Smrg- intel: Use similar brand strings to the Windows drivers 19767ec681f3Smrg- intel/compiler: Fix illegal mutation in get_nir_image_intrinsic_image 19777ec681f3Smrg- iris: Fix export of fences that have already completed. 19787ec681f3Smrg- st/mesa: Allocate full miplevels if MaxLevel is explicitly set 19797ec681f3Smrg- iris: Drop some workarounds which are no longer necessary 19807ec681f3Smrg- anv: Drop some workarounds that are no longer necessary 19817ec681f3Smrg- intel: Fix aux map alignments on 32-bit builds. 19827ec681f3Smrg- meson: Prefer 'iris' by default over 'i965'. 19837ec681f3Smrg- loader: Check if the kernel driver is i915 before loading iris 19847ec681f3Smrg- iris: Drop 'engine' from iris_batch. 19857ec681f3Smrg- iris: Make iris_emit_default_l3_config pull devinfo from the batch 19867ec681f3Smrg- iris: Support multiple chained batches. 19877ec681f3Smrg- i965: Use brw_batch_references in tex_busy check 19887ec681f3Smrg- loader: Fix leak of kernel driver name 19897ec681f3Smrg 19907ec681f3SmrgKristian Høgsberg (62): 19917ec681f3Smrg 19927ec681f3Smrg- freedreno/registers: Fix typo 19937ec681f3Smrg- freedreno/registers: Move SP_PRIMITIVE_CNTL and SP_VS_VPC_DST 19947ec681f3Smrg- freedreno/registers: Add comments about primitive counters 19957ec681f3Smrg- freedreno/a6xx: Fix primitive counters again 19967ec681f3Smrg- freedreno/a6xx: Clear sysmem with CP_BLIT 19977ec681f3Smrg- freedreno: Add nogmem debug option to force bypass rendering 19987ec681f3Smrg- freedreno/a6xx: Fix layered texture type enum 19997ec681f3Smrg- freedreno/a6x: Rename z/s formats 20007ec681f3Smrg- freedreno/a6xx: Add register offset for STG/LDG 20017ec681f3Smrg- freedreno/ir3: Emit link map as byte or dwords offsets as needed 20027ec681f3Smrg- freedreno/ir3: Add load and store intrinsics for global io 20037ec681f3Smrg- freedreno: Don't count primitives for patches 20047ec681f3Smrg- freedreno/ir3: Add ir3 intrinsics for tessellation 20057ec681f3Smrg- freedreno/ir3: Use imul24 in offset calculations 20067ec681f3Smrg- freedreno/ir3: Add tessellation field to shader key 20077ec681f3Smrg- freedreno/ir3: Extend geometry lowering pass to handle tessellation 20087ec681f3Smrg- freedreno/ir3: Add new synchronization opcodes 20097ec681f3Smrg- freedreno/ir3: End TES with chsh when using GS 20107ec681f3Smrg- freedreno/ir3: Implement tess coord intrinsic 20117ec681f3Smrg- freedreno/ir3: Implement TCS synchronization intrinsics 20127ec681f3Smrg- freedreno/ir3: Setup inputs and outputs for tessellation stages 20137ec681f3Smrg- freedreno/ir3: Don't assume binning shader is always VS 20147ec681f3Smrg- freedreno/ir3: Pre-color TCS header and primitive ID inputs 20157ec681f3Smrg- freedreno/ir3: Allocate const space for tessellation parameters 20167ec681f3Smrg- freedreno/a6xx: Build the right draw command for tessellation 20177ec681f3Smrg- freedreno/a6xx: Allocate and program tessellation buffer 20187ec681f3Smrg- freedreno/a6xx: Emit constant parameters for tessellation stages 20197ec681f3Smrg- freedreno/a6xx: Program state for tessellation stages 20207ec681f3Smrg- freedreno: Use bypass rendering for tessellation 20217ec681f3Smrg- freedreno/a6xx: Only set emit.hs/ds when we're drawing patches 20227ec681f3Smrg- freedreno/blitter: Save tessellation state 20237ec681f3Smrg- freedreno/a6xx: Only use merged regs and four quads for VS+FS 20247ec681f3Smrg- freedreno/a6xx: Turn on tessellation shaders 20257ec681f3Smrg- freedreno/ir3: Use regid() helper when setting up precolor regs 20267ec681f3Smrg- freedreno/registers: Remove duplicate register definitions 20277ec681f3Smrg- freedreno: New struct packing macros 20287ec681f3Smrg- freedreno/registers: Add 64 bit address registers 20297ec681f3Smrg- freedreno/a6xx: Drop stale include 20307ec681f3Smrg- freedreno/a6xx: Include fd6_pack.h in a few files 20317ec681f3Smrg- freedreno/a6xx: Convert emit_mrt() to OUT_REG() 20327ec681f3Smrg- freedreno/a6xx: Convert emit_zs() to OUT_REG() 20337ec681f3Smrg- freedreno/a6xx: Convert VSC pipe setup to OUT_REG() 20347ec681f3Smrg- freedreno/a6xx: Convert gmem blits to OUT_REG() 20357ec681f3Smrg- freedreno/a6xx: Convert some tile setup to OUT_REG() 20367ec681f3Smrg- freedreno/a6xx: Silence warning for unused perf counters 20377ec681f3Smrg- freedreno/a6xx: Document the CP_SET_DRAW_STATE enable bits 20387ec681f3Smrg- freedreno/a6xx: Make DEBUG_BLIT_FALLBACK only dump fallbacks 20397ec681f3Smrg- freedreno: Add debug flag for forcing linear layouts 20407ec681f3Smrg- freedreno/a6xx: Program sampler swap based on resource tiling 20417ec681f3Smrg- freedreno/a6xx: Pick blitter swap based on resource tiling 20427ec681f3Smrg- freedreno/a6xx: Add fd_resource_swap() helper 20437ec681f3Smrg- freedreno/a6xx: Use blitter for resolve blits 20447ec681f3Smrg- freedreno/a6xx: RB6_R8G8B8 is actually 32 bit RGBX 20457ec681f3Smrg- freedreno/a6xx: Use A6XX_SP_2D_SRC_FORMAT_MASK macro 20467ec681f3Smrg- freedreno/a6xx: Handle srgb blits on the blitter 20477ec681f3Smrg- freedreno/a6xx: Move handle_rgba_blit() up 20487ec681f3Smrg- freedreno/a6xx: Rewrite compressed blits in a helper function 20497ec681f3Smrg- freedreno/a6xx: Set up multisample sysmem MRTs correctly 20507ec681f3Smrg- st/mesa: Lower vars to ssa and constant prop before 20517ec681f3Smrg gl_nir_lower_buffers 20527ec681f3Smrg- ir3: Set up full/half register conflicts correctly 20537ec681f3Smrg- iris: Advertise PIPE_CAP_NATIVE_FENCE_FD 20547ec681f3Smrg- iris: Print warning and return \*out = NULL when fd to syncobj fails 20557ec681f3Smrg 20567ec681f3SmrgKrzysztof Raszkowski (10): 20577ec681f3Smrg 20587ec681f3Smrg- gallium/swr: Fix GS invocation issues - Fixed proper setting 20597ec681f3Smrg gl_InvocationID. - Fixed GS vertices output memory overflow. 20607ec681f3Smrg- gallium/swr: Enable some ARB_gpu_shader5 extensions Enable / add to 20617ec681f3Smrg features.txt: - Enhanced textureGather. - Geometry shader instancing. 20627ec681f3Smrg - Geometry shader multiple streams. 20637ec681f3Smrg- gallium/swr: Fix crash when use GL_TDFX_texture_compression_FXT1 20647ec681f3Smrg format. 20657ec681f3Smrg- gallivm: add TGSI bit arithmetic opcodes support 20667ec681f3Smrg- gallium/swr: Fix glVertexPointer race condition. 20677ec681f3Smrg- gallium/swr: Disable showing detected arch message. 20687ec681f3Smrg- docs/GL4: update gallium/swr features 20697ec681f3Smrg- gallium/swr: add option for static link 20707ec681f3Smrg- gallium/swr: Fix gcc 4.8.5 compile error 20717ec681f3Smrg- gallium/swr: simplify environmental variabled expansion code 20727ec681f3Smrg 20737ec681f3SmrgLasse Lopperi (1): 20747ec681f3Smrg 20757ec681f3Smrg- freedreno/drm: Fix memory leak in softpin implementation 20767ec681f3Smrg 20777ec681f3SmrgLaurent Carlier (1): 20787ec681f3Smrg 20797ec681f3Smrg- egl: avoid local modifications for eglext.h Khronos standard header 20807ec681f3Smrg file 20817ec681f3Smrg 20827ec681f3SmrgLeo Liu (1): 20837ec681f3Smrg 20847ec681f3Smrg- ac: add missing Arcturus to the info of pc lines 20857ec681f3Smrg 20867ec681f3SmrgLepton Wu (2): 20877ec681f3Smrg 20887ec681f3Smrg- gallium: dri2: Use index as plane number. 20897ec681f3Smrg- android: mesa: Revert "android: mesa: revert "Enable asm 20907ec681f3Smrg unconditionally"" 20917ec681f3Smrg 20927ec681f3SmrgLionel Landwerlin (60): 20937ec681f3Smrg 20947ec681f3Smrg- intel/dev: set default num_eu_per_subslice on gen12 20957ec681f3Smrg- intel/perf: add TGL support 20967ec681f3Smrg- intel/perf: fix Android build 20977ec681f3Smrg- mesa: check draw buffer completeness on 20987ec681f3Smrg glClearBufferfi/glClearBufferiv 20997ec681f3Smrg- vulkan: bump headers/registry to 1.1.127 21007ec681f3Smrg- anv: Properly handle host query reset of performance queries 21017ec681f3Smrg- anv: implement VK_KHR_separate_depth_stencil_layouts 21027ec681f3Smrg- mesa: check framebuffer completeness only after state update 21037ec681f3Smrg- anv: invalidate file descriptor of semaphore sync fd at vkQueueSubmit 21047ec681f3Smrg- anv: remove list items on batch fini 21057ec681f3Smrg- anv: detach batch emission allocation from device 21067ec681f3Smrg- anv: expose timeout helpers outside of anv_queue.c 21077ec681f3Smrg- anv: move queue init/finish to anv_queue.c 21087ec681f3Smrg- anv: allow NULL batch parameter to anv_queue_submit_simple_batch 21097ec681f3Smrg- anv: prepare driver to report submission error through queues 21107ec681f3Smrg- anv: refcount semaphores 21117ec681f3Smrg- anv: prepare the driver for delayed submissions 21127ec681f3Smrg- anv/wsi: signal the semaphore in the acquireNextImage 21137ec681f3Smrg- anv: implement VK_KHR_timeline_semaphore 21147ec681f3Smrg- intel/dev: flag the Elkhart Lake platform 21157ec681f3Smrg- intel/perf: add EHL performance query support 21167ec681f3Smrg- intel/perf: fix invalid hw_id in query results 21177ec681f3Smrg- intel/perf: set read buffer len to 0 to identify empty buffer 21187ec681f3Smrg- intel/perf: take into account that reports read can be fairly old 21197ec681f3Smrg- intel/perf: simplify the processing of OA reports 21207ec681f3Smrg- intel/perf: fix improper pointer access 21217ec681f3Smrg- anv: fix missing gen12 handling 21227ec681f3Smrg- anv: fix incorrect VMA alignment for CCS main surfaces 21237ec681f3Smrg- anv: fix fence underlying primitive checks 21247ec681f3Smrg- anv: fix assumptions about temporary fence payload 21257ec681f3Smrg- intel/perf: drop batchbuffer flushing at query begin 21267ec681f3Smrg- i965/iris: perf-queries: don't invalidate/flush 3d pipeline 21277ec681f3Smrg- anv: constify pipeline layout in nir passes 21287ec681f3Smrg- anv: drop unused parameter from apply layout pass 21297ec681f3Smrg- vulkan/wsi: error out when image fence doesn't signal 21307ec681f3Smrg- mesa: avoid triggering assert in implementation 21317ec681f3Smrg- i965/iris/perf: factor out frequency register capture 21327ec681f3Smrg- loader: fix close on uninitialized file descriptor value 21337ec681f3Smrg- anv: don't close invalid syncfd semaphore 21347ec681f3Smrg- anv: fix intel perf queries availability writes 21357ec681f3Smrg- anv: set stencil layout for input attachments 21367ec681f3Smrg- iris: Implement Gen12 workaround for non pipelined state 21377ec681f3Smrg- anv: Implement Gen12 workaround for non pipelined state 21387ec681f3Smrg- anv: only use VkSamplerCreateInfo::compareOp if enabled 21397ec681f3Smrg- anv: fix pipeline switch back for non pipelined states 21407ec681f3Smrg- genxml: add new Gen11+ PIPE_CONTROL field 21417ec681f3Smrg- iris: handle new PIPE_CONTROL field 21427ec681f3Smrg- iris: implement another workaround for non pipelined states 21437ec681f3Smrg- anv: implement another workaround for non pipelined states 21447ec681f3Smrg- intel/perf: expose timestamp begin for mdapi 21457ec681f3Smrg- intel/perf: report query split for mdapi 21467ec681f3Smrg- anv: enable VK_KHR_swapchain_mutable_format 21477ec681f3Smrg- anv: don't report error with other vendor DRM devices 21487ec681f3Smrg- anv: ensure prog params are initialized with 0s 21497ec681f3Smrg- anv/iris: warn gen12 3DSTATE_HS restriction 21507ec681f3Smrg- intel: Implement Gen12 workaround for array textures of size 1 21517ec681f3Smrg- isl: drop CCS row pitch requirement for linear surfaces 21527ec681f3Smrg- isl: add gen12 comment about CCS for linear tiling 21537ec681f3Smrg- anv: implement gen9 post sync pipe control workaround 21547ec681f3Smrg- anv: set MOCS on push constants 21557ec681f3Smrg 21567ec681f3SmrgLuis Mendes (1): 21577ec681f3Smrg 21587ec681f3Smrg- radv: fix radv secure compile feature breaks compilation on armhf 21597ec681f3Smrg EABI and aarch64 21607ec681f3Smrg 21617ec681f3SmrgMarco Felsch (1): 21627ec681f3Smrg 21637ec681f3Smrg- etnaviv: Fix assert when try to accumulate an invalid fd 21647ec681f3Smrg 21657ec681f3SmrgMarek Olšák (245): 21667ec681f3Smrg 21677ec681f3Smrg- glsl: encode/decode types using a union with bitfields for 21687ec681f3Smrg readability 21697ec681f3Smrg- glsl: encode vector_elements and matrix_columns better 21707ec681f3Smrg- glsl: encode explicit_stride for basic types better 21717ec681f3Smrg- glsl: encode array types better 21727ec681f3Smrg- glsl: encode struct/interface types better 21737ec681f3Smrg- st/mesa: call nir_opt_access only once 21747ec681f3Smrg- st/mesa: call nir_lower_flrp only once per shader 21757ec681f3Smrg- compiler: make variable::data::binding unsigned 21767ec681f3Smrg- nir: pack nir_variable::data::stream 21777ec681f3Smrg- nir: pack nir_variable::data::xfb\_\* 21787ec681f3Smrg- radeonsi: use IR SHA1 as the cache key for the in-memory shader cache 21797ec681f3Smrg- radeonsi: don't keep compute shader IR after compilation 21807ec681f3Smrg- radeonsi: keep serialized NIR instead of nir_shader in 21817ec681f3Smrg si_shader_selector 21827ec681f3Smrg- nir: pack the rest of nir_variable::data 21837ec681f3Smrg- nir/serialize: don't expand 16-bit variable state slots to 32 bits 21847ec681f3Smrg- nir/serialize: store 32-bit object IDs instead of 64-bit 21857ec681f3Smrg- nir/serialize: pack nir_variable flags 21867ec681f3Smrg- mesa: expose SPIR-V extensions in the Compatibility profile too 21877ec681f3Smrg- util: add blob_finish_get_buffer 21887ec681f3Smrg- radeonsi/nir: call nir_serialize only once per shader 21897ec681f3Smrg- radeonsi/nir: fix compute shader crash due to nir_binary == NULL 21907ec681f3Smrg- glsl/linker: pass shader_info to analyze_clip_cull_usage directly 21917ec681f3Smrg- compiler: pack shader_info from 160 bytes to 96 bytes 21927ec681f3Smrg- st/mesa: fix Sanctuary and Tropics by disabling ARB_gpu_shader5 for 21937ec681f3Smrg them 21947ec681f3Smrg- st/mesa: rename DEBUG_TGSI -> DEBUG_PRINT_IR 21957ec681f3Smrg- st/mesa: remove \\n being only printed in debug builds after printed 21967ec681f3Smrg TGSI 21977ec681f3Smrg- st/mesa: print TCS/TES/GS/CS TGSI in the right place & keep disk 21987ec681f3Smrg cache enabled 21997ec681f3Smrg- st/mesa: add ST_DEBUG=nir to print NIR shaders 22007ec681f3Smrg- st/mesa: remove unused TGSI-only debug printing functions 22017ec681f3Smrg- gallium/noop: call finalize_nir 22027ec681f3Smrg- radeonsi/nir: remove dead function temps 22037ec681f3Smrg- radeonsi/nir: call nir_lower_flrp only once per shader 22047ec681f3Smrg- radeonsi/nir: don't lower fma, instead, fuse fma 22057ec681f3Smrg- mesa: enable glthread for 7 Days To Die 22067ec681f3Smrg- st/mesa: rename delete_basic_variant -> delete_common_variant 22077ec681f3Smrg- st/mesa: decrease the size of st_fp_variant_key from 48 to 40 bytes 22087ec681f3Smrg- st/mesa: start deduplicating some program code 22097ec681f3Smrg- st/mesa: initialize affected_states and uniform storage earlier in 22107ec681f3Smrg deserialize 22117ec681f3Smrg- st/mesa: consolidate and simplify code flagging 22127ec681f3Smrg program::affected_states 22137ec681f3Smrg- st/mesa: trivially merge st_vertex_program into st_common_program 22147ec681f3Smrg- st/mesa: rename st_common_program to st_program 22157ec681f3Smrg- st/mesa: cleanups after unification of st_vertex/common program 22167ec681f3Smrg- st/mesa: rename occurences of stcp to stp to correspond to st_program 22177ec681f3Smrg- st/mesa: more cleanups after unification of st_vertex/common_program 22187ec681f3Smrg- st/mesa: subclass st_vertex_program for VP-specific members 22197ec681f3Smrg- st/mesa: call nir_sweep in st_finalize_nir 22207ec681f3Smrg- st/mesa: keep serialized NIR instead of nir_shader in st_program 22217ec681f3Smrg- st/mesa: call nir_serialize only once per shader 22227ec681f3Smrg- nir: move data.image.access to data.access 22237ec681f3Smrg- nir/print: only print image.format for image variables 22247ec681f3Smrg- glsl_to_nir: rename image_access to mem_access 22257ec681f3Smrg- nir: move data.descriptor_set above data.index for better packing 22267ec681f3Smrg- nir: don't use GLenum16 in nir.h 22277ec681f3Smrg- ac: add radeon_info::num_rings and move ring_type to amd_family.h 22287ec681f3Smrg- ac: fill num_rings for remaining IPs 22297ec681f3Smrg- winsys/amdgpu: detect noop dependencies on the same ring correctly 22307ec681f3Smrg- nir: strip as we serialize to remove the nir_shader_clone call 22317ec681f3Smrg- nir/serialize: do ctx = {0} instead of manual initializations 22327ec681f3Smrg- util/blob: add 8-bit and 16-bit reads and writes 22337ec681f3Smrg- nir/serialize: pack instructions better 22347ec681f3Smrg- nir/serialize: pack src better and limit the object count to 1M from 22357ec681f3Smrg 1G 22367ec681f3Smrg- nir/serialize: don't serialize var->data for temporaries 22377ec681f3Smrg- nir/serialize: deduplicate serialized var types by reusing the last 22387ec681f3Smrg unique one 22397ec681f3Smrg- nir/serialize: try to store a diff in var data locations instead of 22407ec681f3Smrg var data 22417ec681f3Smrg- nir/serialize: pack load_const with non-64-bit constants better 22427ec681f3Smrg- nir/serialize: pack 1-component constants into 20 bits if possible 22437ec681f3Smrg- nir/serialize: pack nir_intrinsic_instr::const_index[] better 22447ec681f3Smrg- nir/serialize: try to pack two alu srcs into 1 uint32 22457ec681f3Smrg- nir/serialize: don't store deref types if not needed 22467ec681f3Smrg- nir/serialize: don't serialize mode for deref non-cast instructions 22477ec681f3Smrg- nir/serialize: try to put deref->var index into the unused bits of 22487ec681f3Smrg the header 22497ec681f3Smrg- nir/serialize: cleanup - fold nir_deref_type_var cases into switches 22507ec681f3Smrg- nir/serialize: try to pack both deref array src into 32 bits 22517ec681f3Smrg- nir/serialize: remove up to 3 consecutive equal ALU instruction 22527ec681f3Smrg headers 22537ec681f3Smrg- nir/serialize: reuse the writemask field for 2 src X swizzles of SSA 22547ec681f3Smrg ALU 22557ec681f3Smrg- nir/serialize: serialize swizzles for vec8 and vec16 22567ec681f3Smrg- nir/serialize: serialize writemask for vec8 and vec16 22577ec681f3Smrg- nir/serialize: don't serialize redundant 22587ec681f3Smrg nir_intrinsic_instr::num_components 22597ec681f3Smrg- nir/serialize: use 3 unused bits in intrinsic for 22607ec681f3Smrg packed_const_indices 22617ec681f3Smrg- nir/serialize: support any num_components for remaining instructions 22627ec681f3Smrg- ac: set swizzled bit in cache policy as a hint not to merge 22637ec681f3Smrg loads/stores 22647ec681f3Smrg- radeonsi: initialize the per-context compiler on demand 22657ec681f3Smrg- radeonsi/nir: don't run si_nir_opts again if there is no change 22667ec681f3Smrg- st/mesa: don't serialize all streamout state if there are no SO 22677ec681f3Smrg outputs 22687ec681f3Smrg- st/mesa: don't use redundant stp->state.ir.nir 22697ec681f3Smrg- st/mesa: don't call ProgramStringNotify in glsl_to_nir 22707ec681f3Smrg- st/mesa: propagate gl_PatchVerticesIn from TCS to TES before linking 22717ec681f3Smrg for NIR 22727ec681f3Smrg- st/mesa: simplify looping over linked shaders when linking NIR 22737ec681f3Smrg- st/mesa: don't use \*\* in the st_nir_link_shaders signature 22747ec681f3Smrg- st/mesa: add st_variant base class to simplify code for shader 22757ec681f3Smrg variants 22767ec681f3Smrg- ac/nir: don't rely on data.patch for tess factors 22777ec681f3Smrg- radeonsi/nir: implement subgroup system values for SPIR-V 22787ec681f3Smrg- radeonsi: simplify the interface of 22797ec681f3Smrg get_dw_address_from_generic_indices 22807ec681f3Smrg- radeonsi: simplify get_tcs_tes_buffer_address_from_generic_indices 22817ec681f3Smrg- radeonsi/nir: validate is_patch because SPIR-V doesn't set it for 22827ec681f3Smrg tess factors 22837ec681f3Smrg- radeonsi/nir: don't rely on data.patch for tess factors 22847ec681f3Smrg- radeonsi/nir: fix location_frac handling for TCS outputs 22857ec681f3Smrg- radeonsi/nir: support interface output types to fix SPIR-V xfb 22867ec681f3Smrg piglits 22877ec681f3Smrg- radeonsi: enable SPIR-V and GL 4.6 for NIR 22887ec681f3Smrg- util/driconfig: print ATTENTION if MESA_DEBUG=silent is not set 22897ec681f3Smrg- radeonsi/gfx10: simplify some duplicated NGG GS code 22907ec681f3Smrg- radeonsi/gfx10: fix the vertex order for triangle strips emitted by a 22917ec681f3Smrg GS 22927ec681f3Smrg- llvmpipe: implement TEX_LZ and TXF_LZ opcodes 22937ec681f3Smrg- gallivm: implement LOAD with CONSTBUF but don't enable it for 22947ec681f3Smrg llvmpipe 22957ec681f3Smrg- st/mesa: support UBOs for Selection/Feedback/RasterPos 22967ec681f3Smrg- st/mesa: save currently bound vertex samplers and sampler views in 22977ec681f3Smrg st_context 22987ec681f3Smrg- st/mesa: support samplers for Selection/Feedback/RasterPos 22997ec681f3Smrg- st/mesa: support SSBOs for Selection/Feedback/RasterPos 23007ec681f3Smrg- st/mesa: support shader images for Selection/Feedback/RasterPos 23017ec681f3Smrg- st/mesa: use a separate VS variant for the draw module 23027ec681f3Smrg- st/mesa: remove st_vp_variant::num_inputs 23037ec681f3Smrg- st/mesa: remove struct st_vp_variant in favor of st_common_variant 23047ec681f3Smrg- st/mesa: don't generate VS TGSI if NIR is enabled 23057ec681f3Smrg- draw, st/mesa: generate TGSI for ffvp/ARB_vp if draw lacks LLVM 23067ec681f3Smrg- st/mesa: release the draw shader properly to fix driver crashes 23077ec681f3Smrg (iris) 23087ec681f3Smrg- st/dri: assume external consumers of back buffers can write to the 23097ec681f3Smrg buffers 23107ec681f3Smrg- radeonsi: enable NIR by default and document GL 4.6 support 23117ec681f3Smrg- radeonsi/gfx10: disable vertex grouping 23127ec681f3Smrg- radeonsi/gfx10: simplify the tess_turns_off_ngg condition 23137ec681f3Smrg- radeonsi: don't rely on CLEAR_STATE to set PA_SC_GENERIC_SCISSOR\_\* 23147ec681f3Smrg- ac: fix ac_get_i1_sgpr_mask for Wave32 23157ec681f3Smrg- ac: fix the return value in cull_bbox when bbox culling is disabled 23167ec681f3Smrg- radeonsi: deduplicate ES and GS thread enablement code 23177ec681f3Smrg- radeonsi: disallow compute-based culling if polygon mode is enabled 23187ec681f3Smrg- radeonsi: set is_monolithic for VS prologs when the shader is really 23197ec681f3Smrg monolithic 23207ec681f3Smrg- radeonsi: don't wrap the VS prolog in if (ES thread) .. endif 23217ec681f3Smrg- radeonsi/gfx10: don't insert NGG streamout atomics if they are never 23227ec681f3Smrg used 23237ec681f3Smrg- radeonsi: allow generating VS prologs with 0 inputs 23247ec681f3Smrg- radeonsi: fix determining whether the VS prolog is needed 23257ec681f3Smrg- radeonsi: reset more fields in si_llvm_context_set_ir to fix reusing 23267ec681f3Smrg ctx 23277ec681f3Smrg- radeonsi/gfx10: fix ngg_get_ordered_id 23287ec681f3Smrg- amd/addrlib: update to the latest version 23297ec681f3Smrg- ac/surface: fix an assertion failure on gfx9 in CMASK computation 23307ec681f3Smrg- radeonsi/gfx10: don't declare any LDS for NGG if it's not used 23317ec681f3Smrg- radeonsi/gfx10: enable NGG passthrough for eligible shaders 23327ec681f3Smrg- radeonsi/gfx10: improve performance for TES using PrimID but not 23337ec681f3Smrg exporting it 23347ec681f3Smrg- Revert "u_vbuf: Regard non-constant vbufs with non-instance elements 23357ec681f3Smrg as free" 23367ec681f3Smrg- winsys/radeon: initialize pte_fragment_size 23377ec681f3Smrg- radeonsi: preserve the scanout flag for shared resources on gfx9 and 23387ec681f3Smrg gfx10 23397ec681f3Smrg- radeonsi: ignore PIPE_BIND_SCANOUT for imported textures 23407ec681f3Smrg- radeonsi: remove the "display_dcc_offset == 0" assertion 23417ec681f3Smrg- radeonsi: rename SDMA debug flags 23427ec681f3Smrg- radeonsi: remove broken and unused SI SDMA image copy code 23437ec681f3Smrg- radeonsi: add AMD_DEBUG=nodmaclear for debugging 23447ec681f3Smrg- radeonsi: add AMD_DEBUG=nodmacopyimage for debugging 23457ec681f3Smrg- radeonsi: rename dma_cs -> sdma_cs 23467ec681f3Smrg- radeonsi: move SI and CIK+ SDMA code into 1 common function for 23477ec681f3Smrg cleanups 23487ec681f3Smrg- radeonsi: disable SDMA on gfx8 to fix corruption on RX 580 23497ec681f3Smrg- radeonsi: remove TGSI 23507ec681f3Smrg- gallium: put u_vbuf_get_caps return values into u_vbuf_caps 23517ec681f3Smrg- gallium/cso_context: move non-vbuf vertex buffer and element code 23527ec681f3Smrg into helpers 23537ec681f3Smrg- gallium: bypass u_vbuf if it's not needed (no fallbacks and no user 23547ec681f3Smrg VBOs) 23557ec681f3Smrg- ac/gpu_info: always use distributed tessellation on gfx10 23567ec681f3Smrg- radeonsi: fix monolithic pixel shaders with two-sided colors and 23577ec681f3Smrg SampleMaskIn 23587ec681f3Smrg- radeonsi: fix context roll tracking in si_emit_shader_vs 23597ec681f3Smrg- radeonsi: test polygon mode enablement accurately 23607ec681f3Smrg- radeonsi: determine accurately if line stippling is enabled for 23617ec681f3Smrg performance 23627ec681f3Smrg- radeonsi: clean up messy si_emit_rasterizer_prim_state 23637ec681f3Smrg- ac: unify build_sendmsg_gs_alloc_req 23647ec681f3Smrg- ac: unify primitive export code 23657ec681f3Smrg- ac/gpu_info: add pc_lines and use it in radeonsi 23667ec681f3Smrg- ac: add 128-bit bitcount 23677ec681f3Smrg- ac: add ac_build_s_endpgm 23687ec681f3Smrg- radeonsi/gfx9: force the micro tile mode for MSAA resolve correctly 23697ec681f3Smrg on gfx9 23707ec681f3Smrg- radeonsi: rename desc_list_byte_size -> vb_desc_list_alloc_size 23717ec681f3Smrg- radeonsi: add si_context::num_vertex_elements 23727ec681f3Smrg- radeonsi: don't allow draw calls with uninitialized VS inputs 23737ec681f3Smrg- radeonsi: simplify si_set_vertex_buffers 23747ec681f3Smrg- ac,radeonsi: increase the maximum number of shader args and return 23757ec681f3Smrg values 23767ec681f3Smrg- radeonsi: put up to 5 VBO descriptors into user SGPRs 23777ec681f3Smrg- radeonsi: don't enable VBOs in user SGPRs if compute-based culling 23787ec681f3Smrg can be used 23797ec681f3Smrg- radeonsi: fix assertion and other failures in 23807ec681f3Smrg si_emit_graphics_shader_pointers 23817ec681f3Smrg- radeonsi: actually enable VBOs in user SGPRs 23827ec681f3Smrg- radeonsi: don't adjust depth and stencil PS output locations 23837ec681f3Smrg- radeonsi: rename DBG_NO_TGSI -> DBG_NO_NIR 23847ec681f3Smrg- radeonsi: remove TGSI from comments 23857ec681f3Smrg- radeonsi: rename si_shader_info -> si_shader_binary_info 23867ec681f3Smrg- radeonsi: fork tgsi_shader_info and tgsi_tessctrl_info 23877ec681f3Smrg- radeonsi: merge si_tessctrl_info into si_shader_info 23887ec681f3Smrg- radeonsi: clean up si_shader_info 23897ec681f3Smrg- radeonsi: rename si_compile_tgsi_main -> si_build_main_function 23907ec681f3Smrg- radeonsi: rename si_shader_create -> si_create_shader_variant for 23917ec681f3Smrg clarity 23927ec681f3Smrg- radeonsi: fold si_create_function into si_llvm_create_func 23937ec681f3Smrg- radeonsi: remove always constant ballot_mask_bits from 23947ec681f3Smrg si_llvm_context_init 23957ec681f3Smrg- radeonsi: move PS LLVM code into si_shader_llvm_ps.c 23967ec681f3Smrg- radeonsi: separate code computing info for small primitive culling 23977ec681f3Smrg- ac/cull: don't read Position.Z if it's not needed for culling 23987ec681f3Smrg- radeonsi: make si_insert_input\_\* functions non-static 23997ec681f3Smrg- radeonsi: move VS_STATE.LS_OUT_PATCH_SIZE a few bits higher to make 24007ec681f3Smrg space there 24017ec681f3Smrg- radeonsi/gfx10: separate code for getting edgeflags from the 24027ec681f3Smrg gs_invocation_id VGPR 24037ec681f3Smrg- radeonsi/gfx10: separate code for determining the number of vertices 24047ec681f3Smrg for NGG 24057ec681f3Smrg- radeonsi: fix si_build_wrapper_function for compute-based primitive 24067ec681f3Smrg culling 24077ec681f3Smrg- radeonsi: work around an LLVM crash when using 24087ec681f3Smrg llvm.amdgcn.icmp.i64.i1 24097ec681f3Smrg- radeonsi: move si_insert_input\_\* functions 24107ec681f3Smrg- radeonsi: move tessellation shader code into si_shader_llvm_tess.c 24117ec681f3Smrg- radeonsi: remove llvm_type_is_64bit 24127ec681f3Smrg- radeonsi: move geometry shader code into si_shader_llvm_gs.c 24137ec681f3Smrg- radeonsi: move code for shader resources into 24147ec681f3Smrg si_shader_llvm_resources.c 24157ec681f3Smrg- radeonsi: remove useless #includes 24167ec681f3Smrg- radeonsi: merge si_compile_llvm and si_llvm_compile functions 24177ec681f3Smrg- gallium: add st_context_iface::flush_resource to call FLUSH_VERTICES 24187ec681f3Smrg- st/dri: do FLUSH_VERTICES before calling flush_resource 24197ec681f3Smrg- Revert "radeonsi: unbind image before compute clear" 24207ec681f3Smrg- radeonsi: clean up how internal compute dispatches are handled 24217ec681f3Smrg- radeonsi: don't invoke decompression inside internal launch_grid 24227ec681f3Smrg- radeonsi: fix doubles and int64 24237ec681f3Smrg- radeonsi: turn an assertion into return in si_nir_store_output_tcs 24247ec681f3Smrg- ac: add prefix bitcount functions 24257ec681f3Smrg- ac: add ac_build_readlane without optimization barrier 24267ec681f3Smrg- radeonsi/gfx10: update comments and remove invalid TODOs 24277ec681f3Smrg- radeonsi/gfx10: correct VS PrimitiveID implementation for NGG 24287ec681f3Smrg- radeonsi/gfx10: move s_sendmsg gs_alloc_req to the beginning of 24297ec681f3Smrg shaders 24307ec681f3Smrg- radeonsi/gfx10: export primitives at the beginning of VS/TES 24317ec681f3Smrg- radeonsi/gfx10: merge main and pos/param export IF blocks into one if 24327ec681f3Smrg possible 24337ec681f3Smrg- radeonsi/gfx10: don't initialize VGPRs not used by NGG passthrough 24347ec681f3Smrg- radeonsi/gfx10: move GE_PC_ALLOC setting to shader states 24357ec681f3Smrg- radeonsi/gfx10: implement NGG culling for 4x wave32 subgroups 24367ec681f3Smrg- ac: add helper ac_build_triangle_strip_indices_to_triangle 24377ec681f3Smrg- radeonsi/gfx10: rewrite late alloc computation 24387ec681f3Smrg- radeonsi/gfx10: enable GS fast launch for triangles and strips with 24397ec681f3Smrg NGG culling 24407ec681f3Smrg- radeonsi: use ctx->ac. for types and integer constants 24417ec681f3Smrg- radeonsi: move non-LLVM code out of si_shader_llvm.c 24427ec681f3Smrg- radeonsi: move VS shader code into si_shader_llvm_vs.c 24437ec681f3Smrg- radeonsi: move si_shader_llvm_build.c content into si_shader_llvm.c 24447ec681f3Smrg- radeonsi: minor cleanup in si_shader_internal.h 24457ec681f3Smrg- radeonsi: move si_nir_build_llvm into si_shader_llvm.c 24467ec681f3Smrg- radeonsi: fold si_shader_context_set_ir into si_build_main_function 24477ec681f3Smrg- radeonsi: move more LLVM functions into si_shader_llvm.c 24487ec681f3Smrg- radeonsi: make si_compile_llvm return bool 24497ec681f3Smrg- radeonsi: make si_compile_shader return bool 24507ec681f3Smrg- radeonsi: change prototypes of si_is_multi_part_shader & 24517ec681f3Smrg si_is_merged_shader 24527ec681f3Smrg- radeonsi: separate LLVM compilation from non-LLVM code 24537ec681f3Smrg- util/simple_mtx: add a missing include to get ASSERTED 24547ec681f3Smrg- gallium/util: add a cache of live shaders for shader CSO 24557ec681f3Smrg deduplication 24567ec681f3Smrg- radeonsi: use the live shader cache 24577ec681f3Smrg- radeonsi: restructure si_shader_cache_load_shader 24587ec681f3Smrg- radeonsi: print shader cache stats with AMD_DEBUG=cache_stats 24597ec681f3Smrg- radeonsi: expose shader cache stats to the HUD 24607ec681f3Smrg- radeonsi: make screen available to shader part compilation 24617ec681f3Smrg- radeonsi: fix a regression since the addition of si_shader_llvm_vs.c 24627ec681f3Smrg- Revert "winsys/amdgpu: Close KMS handles for other DRM file 24637ec681f3Smrg descriptions" 24647ec681f3Smrg- Revert "winsys/amdgpu: Re-use amdgpu_screen_winsys when possible" 24657ec681f3Smrg- radeonsi: don't report that multi-plane formats are supported 24667ec681f3Smrg- radeonsi: fix the DCC MSAA bug workaround 24677ec681f3Smrg- radeonsi: don't wait for shader compilation to finish when destroying 24687ec681f3Smrg a context 24697ec681f3Smrg 24707ec681f3SmrgMarek Vasut (5): 24717ec681f3Smrg 24727ec681f3Smrg- etnaviv: Replace bitwise OR with logical OR 24737ec681f3Smrg- etnaviv: tgsi: Fix gl_FrontFacing support 24747ec681f3Smrg- etnaviv: Report correct number of vertex buffers 24757ec681f3Smrg- etnaviv: Do not filter out PIPE_FORMAT_S8_UINT_Z24_UNORM on 24767ec681f3Smrg pre-HALTI2 24777ec681f3Smrg- etnaviv: Destroy rsc->pending_ctx set in etna_resource_destroy() 24787ec681f3Smrg 24797ec681f3SmrgMark Janes (3): 24807ec681f3Smrg 24817ec681f3Smrg- Revert "st/mesa: call nir_serialize only once per shader" 24827ec681f3Smrg- Revert "st/mesa: keep serialized NIR instead of nir_shader in 24837ec681f3Smrg st_program" 24847ec681f3Smrg- iris: separating out common perf code 24857ec681f3Smrg 24867ec681f3SmrgMarkus Wick (3): 24877ec681f3Smrg 24887ec681f3Smrg- mapi/glapi: Generate sizeof() helpers instead of fixed sizes. 24897ec681f3Smrg- mesa/glthread: Implement ARB_multi_bind. 24907ec681f3Smrg- drirc: Enable glthread for dolphin/citra/yuzu. 24917ec681f3Smrg 24927ec681f3SmrgMartin Fuzzey (1): 24937ec681f3Smrg 24947ec681f3Smrg- etnaviv: update Android build files 24957ec681f3Smrg 24967ec681f3SmrgMathias Fröhlich (1): 24977ec681f3Smrg 24987ec681f3Smrg- egl: Implement getImage/putImage on pbuffer swrast. 24997ec681f3Smrg 25007ec681f3SmrgMatt Turner (19): 25017ec681f3Smrg 25027ec681f3Smrg- intel/compiler: Use ARRAY_SIZE() 25037ec681f3Smrg- intel/compiler: Extract GEN\_\* macros into separate file 25047ec681f3Smrg- intel/compiler: Split has_64bit_types into float/int 25057ec681f3Smrg- intel/compiler: Don't disassemble align1 3-src operands on Gen < 10 25067ec681f3Smrg- intel/compiler: Limit compaction unit tests to specific gens 25077ec681f3Smrg- intel/compiler: Add NF some more places 25087ec681f3Smrg- intel/compiler: Add a INVALID_{,HW_}REG_TYPE macros 25097ec681f3Smrg- intel/compiler: Split hw_type tables 25107ec681f3Smrg- intel/compiler: Handle invalid inputs to brw_reg_type_to_*() 25117ec681f3Smrg- intel/compiler: Handle invalid compacted immediates 25127ec681f3Smrg- intel/compiler: Factor out brw_validate_instruction() 25137ec681f3Smrg- intel/compiler: Validate some instruction word encodings 25147ec681f3Smrg- intel/compiler: Add unit tests for new EU validation checks 25157ec681f3Smrg- intel/compiler: Validate fuzzed instructions 25167ec681f3Smrg- intel/compiler: Test compaction on Gen <= 12 25177ec681f3Smrg- gitlab-ci: Skip ext_timer_query/time-elapsed 25187ec681f3Smrg- intel/compiler: Move Gen4/5 rounding to visitor 25197ec681f3Smrg- util: Explain BITSET_FOREACH_SET params 25207ec681f3Smrg- util: Remove tmp argument from BITSET_FOREACH_SET macro 25217ec681f3Smrg 25227ec681f3SmrgMauro Rossi (9): 25237ec681f3Smrg 25247ec681f3Smrg- android: aco: fix Lower to CSSA 25257ec681f3Smrg- android: radeonsi: fix build error due to wrong u_format.csv file 25267ec681f3Smrg path 25277ec681f3Smrg- android: util/format: fix include path list 25287ec681f3Smrg- android: radeonsi: fix build after vl refactoring (v2) 25297ec681f3Smrg- android: nir: add a load/store vectorization pass 25307ec681f3Smrg- android: util: Add a mapping from VkFormat to PIPE_FORMAT. 25317ec681f3Smrg- android: radv: fix vk_format_table.c generated source build 25327ec681f3Smrg- android: radeonsi,ac: fix building error due to ac changes 25337ec681f3Smrg- android: radv: build radv_shader_args.c 25347ec681f3Smrg 25357ec681f3SmrgMichel Dänzer (36): 25367ec681f3Smrg 25377ec681f3Smrg- gitlab-ci: Set arm job CCACHE_DIR properly 25387ec681f3Smrg- gitlab-ci: Use separate arm64 build/test docker images 25397ec681f3Smrg- gitlab-ci: Don't build libdrm for ARM 25407ec681f3Smrg- gitlab-ci: Use ninja -j4 for building dEQP 25417ec681f3Smrg- gitlab-ci: Move artifact preparation to separate script 25427ec681f3Smrg- gitlab-ci: Share dEQP build process between x86 & ARM test image 25437ec681f3Smrg scripts 25447ec681f3Smrg- gitlab-ci: Sort packages in debian-install.sh 25457ec681f3Smrg- gitlab-ci: Run piglit tests with llvmpipe 25467ec681f3Smrg- gitlab-ci: Use separate docker images for x86 build/test jobs 25477ec681f3Smrg- gitlab-ci: Delete install/bin from artifacts as well 25487ec681f3Smrg- gitlab-ci: Document that ci-templates refs must be in sync 25497ec681f3Smrg- gitlab-ci: Use functional container job names 25507ec681f3Smrg- gitlab-ci: Rename container install scripts to match job names 25517ec681f3Smrg (better) 25527ec681f3Smrg- gitlab-ci: Organize images using new REPO_SUFFIX templates feature 25537ec681f3Smrg- gitlab-ci: Directly use host-mapped directory for ccache 25547ec681f3Smrg- gitlab-ci: Stop reporting piglit test results via JUnit 25557ec681f3Smrg- gitlab-ci: Stop storing piglit test results as JUnit 25567ec681f3Smrg- gitlab-ci: Put HTML summary in artifacts for failed piglit jobs 25577ec681f3Smrg- gitlab-ci: Update to current ci-templates master 25587ec681f3Smrg- gitlab-ci: Run piglit glslparser & quick_shader tests separately 25597ec681f3Smrg- glsl/tests: Use splitlines() instead of strip() 25607ec681f3Smrg- gitlab-ci: Use the common run policy for LAVA jobs as well again 25617ec681f3Smrg- gitlab-ci: Overhaul job run policy 25627ec681f3Smrg- gitlab-ci: Don't exclude any piglit quick_shader tests 25637ec681f3Smrg- gitlab-ci: Test against LLVM / clang 9 on x86 25647ec681f3Smrg- gitlab-ci: Stop using manual jobs for merge requests 25657ec681f3Smrg- gitlab-ci: Set GIT_STRATEGY to none for the dummy job 25667ec681f3Smrg- gitlab-ci: Use single if for manual job rules entry 25677ec681f3Smrg- winsys/amdgpu: Keep a list of amdgpu_screen_winsyses in amdgpu_winsys 25687ec681f3Smrg- winsys/amdgpu: Keep track of retrieved KMS handles using hash tables 25697ec681f3Smrg- winsys/amdgpu: Only re-export KMS handles for different DRM FDs 25707ec681f3Smrg- util: Add os_same_file_description helper 25717ec681f3Smrg- winsys/amdgpu: Re-use amdgpu_screen_winsys when possible 25727ec681f3Smrg- winsys/amdgpu: Close KMS handles for other DRM file descriptions 25737ec681f3Smrg- winsys/amdgpu: Re-use amdgpu_screen_winsys when possible 25747ec681f3Smrg- winsys/amdgpu: Close KMS handles for other DRM file descriptions 25757ec681f3Smrg 25767ec681f3SmrgMichel Zou (3): 25777ec681f3Smrg 25787ec681f3Smrg- Meson: Check for dladdr with MinGW 25797ec681f3Smrg- disk_cache_get_function_timestamp: check for dladdr 25807ec681f3Smrg- Meson: Add llvm>=9 modules 25817ec681f3Smrg 25827ec681f3SmrgMiguel Casas-Sanchez (1): 25837ec681f3Smrg 25847ec681f3Smrg- i965: Ensure that all 2101010 image imports can pass framebuffer 25857ec681f3Smrg completeness. 25867ec681f3Smrg 25877ec681f3SmrgNanley Chery (3): 25887ec681f3Smrg 25897ec681f3Smrg- gallium/dri2: Fix creation of multi-planar modifier images 25907ec681f3Smrg- gallium: Store the image format in winsys_handle 25917ec681f3Smrg- iris: Fix import of multi-planar surfaces with modifiers 25927ec681f3Smrg 25937ec681f3SmrgNataraj Deshpande (1): 25947ec681f3Smrg 25957ec681f3Smrg- egl/android: Restrict minimum triple buffering for android 25967ec681f3Smrg color_buffers 25977ec681f3Smrg 25987ec681f3SmrgNathan Kidd (1): 25997ec681f3Smrg 26007ec681f3Smrg- llvmpipe: Check thread creation errors 26017ec681f3Smrg 26027ec681f3SmrgNeha Bhende (3): 26037ec681f3Smrg 26047ec681f3Smrg- st/mesa: release tgsi tokens for shader states 26057ec681f3Smrg- svga: fix size of format_conversion_table[] 26067ec681f3Smrg- svga: Use pipe_shader_state_from_tgsi to set shader state 26077ec681f3Smrg 26087ec681f3SmrgNeil Armstrong (3): 26097ec681f3Smrg 26107ec681f3Smrg- Add support for T820 CI Jobs 26117ec681f3Smrg- ci: Remove T820 from CI temporarily 26127ec681f3Smrg- gitlab-ci/lava: add pipeline information in the lava job name 26137ec681f3Smrg 26147ec681f3SmrgNeil Roberts (9): 26157ec681f3Smrg 26167ec681f3Smrg- nir/opcodes: Add a helper function to generate the comparison binops 26177ec681f3Smrg- nir/opcodes: Add a helper function to generate reduce opcodes 26187ec681f3Smrg- nir: Add a 16-bit bool type 26197ec681f3Smrg- nir: Add a 8-bit bool type 26207ec681f3Smrg- nir/lower_alu_to_scalar: Support lowering 8- and 16-bit reduce ops 26217ec681f3Smrg- freedreno/ir3: Support 16-bit comparison instructions 26227ec681f3Smrg- freedreno/ir3: Add implementation of nir_op_b16csel 26237ec681f3Smrg- freedreno/ir3: Implement f2b16 and i2b16 26247ec681f3Smrg- freedreno/ir3: Enabling lowering 16-bit flrp 26257ec681f3Smrg 26267ec681f3SmrgPaul Cercueil (5): 26277ec681f3Smrg 26287ec681f3Smrg- kmsro: Extend to include ingenic-drm 26297ec681f3Smrg- u_vbuf: Mark vbufs incompatible if more were requested than HW 26307ec681f3Smrg supports 26317ec681f3Smrg- u_vbuf: Only create driver CSO if no incompatible elements 26327ec681f3Smrg- u_vbuf: Regard non-constant vbufs with non-instance elements as free 26337ec681f3Smrg- u_vbuf: Return true in u_vbuf_get_caps if nb of vbufs is below 26347ec681f3Smrg minimum 26357ec681f3Smrg 26367ec681f3SmrgPaul Gofman (1): 26377ec681f3Smrg 26387ec681f3Smrg- state_tracker: Handle texture view min level in st_generate_mipmap() 26397ec681f3Smrg 26407ec681f3SmrgPaulo Zanoni (2): 26417ec681f3Smrg 26427ec681f3Smrg- intel/compiler: remove the operand restriction for src1 on GLK 26437ec681f3Smrg- intel/compiler: fix nir_op_{i,u}*32 on ICL 26447ec681f3Smrg 26457ec681f3SmrgPeng Huang (1): 26467ec681f3Smrg 26477ec681f3Smrg- radeonsi: make si_fence_server_signal flush pipe without work 26487ec681f3Smrg 26497ec681f3SmrgPhilipp Sieweck (1): 26507ec681f3Smrg 26517ec681f3Smrg- svga: check return value of define_query_vgpu{9,10} 26527ec681f3Smrg 26537ec681f3SmrgPierre Moreau (4): 26547ec681f3Smrg 26557ec681f3Smrg- compiler/spirv: Fix uses of gnu struct = {} extension 26567ec681f3Smrg- include/CL: Update OpenCL headers to latest 26577ec681f3Smrg- clover: Use the dispatch table type from the OpenCL headers 26587ec681f3Smrg- clover/meson: Define OpenCL header macros 26597ec681f3Smrg 26607ec681f3SmrgPierre-Eric Pelloux-Prayer (54): 26617ec681f3Smrg 26627ec681f3Smrg- radeonsi: tell the shader disk cache what IR is used 26637ec681f3Smrg- mesa: enable msaa in clear_with_quad if needed 26647ec681f3Smrg- mesa: pass vao as a function paramter 26657ec681f3Smrg- mesa: add EXT_dsa glVertexArray\* functions declarations 26667ec681f3Smrg- mesa: rework \_mesa_lookup_vao_err to allow usage from EXT_dsa 26677ec681f3Smrg- mesa: add vao/vbo lookup helper for EXT_dsa 26687ec681f3Smrg- mesa: add EXT_dsa glVertexArray\* functions implementation 26697ec681f3Smrg- mesa: add gl_vertex_array_object parameter to client state helpers 26707ec681f3Smrg- mesa: add EXT_dsa glEnableVertexArrayEXT / glDisableVertexArrayEXT 26717ec681f3Smrg- mesa: add EXT_dsa EnableVertexArrayAttribEXT / 26727ec681f3Smrg DisableVertexArrayAttribEXT 26737ec681f3Smrg- mesa: extract helper function from \_mesa_GetPointerv 26747ec681f3Smrg- mesa: add EXT_dsa glGetVertexArray\* 4 functions 26757ec681f3Smrg- mesa: fix call to \_mesa_lookup_vao_err 26767ec681f3Smrg- radeonsi: fix shader disk cache key 26777ec681f3Smrg- radeonsi: enable mesa_glthread for GfxBench 26787ec681f3Smrg- mesa: update features.txt to reflect EXT_dsa status 26797ec681f3Smrg- mesa: add ARB_framebuffer_no_attachments named functions 26807ec681f3Smrg- mesa: add ARB_vertex_attrib_64bit VertexArrayVertexAttribLOffsetEXT 26817ec681f3Smrg- mesa: add ARB_clear_buffer_object named functions 26827ec681f3Smrg- mesa: add ARB_gpu_shader_fp64 selector-less functions 26837ec681f3Smrg- mesa: add ARB_instanced_arrays EXT_dsa function 26847ec681f3Smrg- mesa: add ARB_texture_buffer_range glTextureBufferRangeEXT function 26857ec681f3Smrg- mesa: implement ARB_texture_storage_multisample + EXT_dsa functions 26867ec681f3Smrg- mesa: extend vertex_array_attrib_format to support EXT_dsa 26877ec681f3Smrg- mesa: add ARB_vertex_attrib_binding glVertexArray\* functions 26887ec681f3Smrg- mesa: add ARB_sparse_buffer NamedBufferPageCommitmentEXT function 26897ec681f3Smrg- mesa: enable EXT_direct_state_access 26907ec681f3Smrg- mesa: fix warning in 32 bits build 26917ec681f3Smrg- radeonsi: implement sdma for GFX9 26927ec681f3Smrg- radeonsi: display cs blit count for AMD_DEBUG=testdma 26937ec681f3Smrg- radeonsi: use gfx9.surf_offset to compute texture offset 26947ec681f3Smrg- radeonsi: fix multi plane buffers creation 26957ec681f3Smrg- radeonsi: dcc dirty flag 26967ec681f3Smrg- st/mesa: add a notify_before_flush callback param to flush 26977ec681f3Smrg- st/dri: use st->flush callback to flush the backbuffer 26987ec681f3Smrg- radeonsi: disable dcc for 2x MSAA surface and bpe < 4 26997ec681f3Smrg- gallium: refuse to create buffers larger than UINT32_MAX 27007ec681f3Smrg- radeon/vcn2: enable rate control for hevc encoding 27017ec681f3Smrg- radeonsi: check ctx->sdma_cs before using it 27027ec681f3Smrg- radeonsi: release saved resources in si_retile_dcc 27037ec681f3Smrg- radeonsi: release saved resources in si_compute_expand_fmask 27047ec681f3Smrg- radeonsi: release saved resources in si_compute_clear_render_target 27057ec681f3Smrg- radeonsi: release saved resources in si_compute_copy_image 27067ec681f3Smrg- radeonsi: release saved resources in si_compute_clear_12bytes_buffer 27077ec681f3Smrg- radeonsi: release saved resources in si_compute_do_clear_or_copy 27087ec681f3Smrg- radeonsi: fix fmask expand compute shader 27097ec681f3Smrg- radeonsi: make sure fmask expand is done if needed 27107ec681f3Smrg- radeonsi: unbind image before compute clear 27117ec681f3Smrg- radeonsi: drop the negation from fmask_is_not_identity 27127ec681f3Smrg- util: call bind_sampler_states before setting sampler_views 27137ec681f3Smrg- radeonsi: move AMD_DEBUG tests to AMD_TEST 27147ec681f3Smrg- docs: document AMD_DEBUG variable 27157ec681f3Smrg- radeonsi: stop using the VM_ALWAYS_VALID flag 27167ec681f3Smrg- radeonsi/ngg: add VGT_FLUSH when enabling fast launch 27177ec681f3Smrg 27187ec681f3SmrgProdea Alexandru-Liviu (2): 27197ec681f3Smrg 27207ec681f3Smrg- Meson: Remove lib prefix from graw and osmesa when building with 27217ec681f3Smrg Mingw. Also remove version sufix from osmesa swrast on Windows. 27227ec681f3Smrg- Appveyor: Quickly fix meson build. As this required use of Python 27237ec681f3Smrg 3.8, mako module also had to be updated. 27247ec681f3Smrg 27257ec681f3SmrgQiang Yu (3): 27267ec681f3Smrg 27277ec681f3Smrg- lima: sync lima_drm.h with kernel 27287ec681f3Smrg- lima: create heap buffer with new interface if available 27297ec681f3Smrg- lima: add noheap debug option 27307ec681f3Smrg 27317ec681f3SmrgRafael Antognolli (23): 27327ec681f3Smrg 27337ec681f3Smrg- intel/isl: Add MOCS settings to isl_device. 27347ec681f3Smrg- anv: Use mocs settings from isl_dev. 27357ec681f3Smrg- iris: Use mocs from isl_dev. 27367ec681f3Smrg- intel: Add workaround for stencil state. 27377ec681f3Smrg- intel/genxml: Add 3DSTATE_CONSTANT_ALL packet. 27387ec681f3Smrg- intel/aubinator: Decode 3DSTATE_CONSTANT_ALL. 27397ec681f3Smrg- intel/blorp: Use 3DSTATE_CONSTANT_ALL to setup push constants. 27407ec681f3Smrg- iris: Rework push constants emitting code. 27417ec681f3Smrg- iris: Use 3DSTATE_CONSTANT_ALL when possible. 27427ec681f3Smrg- anv: Move gen8+ push constant packet workaround. 27437ec681f3Smrg- anv: Add get_push_range_address() helper. 27447ec681f3Smrg- anv: Move code for emitting push constants into its own function. 27457ec681f3Smrg- anv: Use 3DSTATE_CONSTANT_ALL when possible. 27467ec681f3Smrg- iris: Add restriction to 3DSTATE_CONSTANT\_ packets. 27477ec681f3Smrg- util/os_socket: Add socket related functions. 27487ec681f3Smrg- vulkan/overlay: Add a control socket. 27497ec681f3Smrg- vulkan/overlay: Add support for a control socket. 27507ec681f3Smrg- vulkan/overlay: Add a command to start capturing data to a file. 27517ec681f3Smrg- vulkan/overlay: Add basic overlay control script. 27527ec681f3Smrg- vulkan/overlay: Update docs. 27537ec681f3Smrg- iris: Implement WA for push constants. 27547ec681f3Smrg- utils/os_socket: Define ssize_t on windows. 27557ec681f3Smrg- intel: Load the driver even if I915_PARAM_REVISION is not found. 27567ec681f3Smrg 27577ec681f3SmrgRhys Perry (131): 27587ec681f3Smrg 27597ec681f3Smrg- radv: adjust loop unrolling heuristics for int64 27607ec681f3Smrg- aco: add Instruction::usesModifiers() and add more checks in the 27617ec681f3Smrg optimizer 27627ec681f3Smrg- radv: fix radv_nir_get_max_workgroup_size when nir=NULL 27637ec681f3Smrg- aco: use DPP instead of exec modification when lowering GFX10 27647ec681f3Smrg shuffles 27657ec681f3Smrg- aco: fix shuffle with uniform operands 27667ec681f3Smrg- nir/divergence: improve DA of shuffle 27677ec681f3Smrg- aco: fix read_invocation with VGPR lane index 27687ec681f3Smrg- aco: don't propagate vgprs into v_readlane/v_writelane 27697ec681f3Smrg- aco: combine read_invocation and shuffle implementations 27707ec681f3Smrg- radv: enable FP16/FP64 denormals earlier and only for LLVM 27717ec681f3Smrg- aco: don't combine literals into v_cndmask_b32/v_subb/v_addc 27727ec681f3Smrg- aco: fix 64-bit fsign with 0 27737ec681f3Smrg- aco: implement VK_KHR_shader_float_controls 27747ec681f3Smrg- aco: refactor reduction lowering helpers 27757ec681f3Smrg- aco: implement 64-bit integer reductions 27767ec681f3Smrg- radv/aco: enable VK_KHR_shader_subgroup_extended_types 27777ec681f3Smrg- nir: make nir_variable::{num_members,num_state_slots} a uint16_t 27787ec681f3Smrg- nir: add nir_variable::index and nir_index_vars 27797ec681f3Smrg- nir/large_constants: use nir_index_vars and nir_variable::index 27807ec681f3Smrg- docs: update features.txt for RADV 27817ec681f3Smrg- aco: improve waitcnt insertion around loops 27827ec681f3Smrg- aco: fix copy+paste error 27837ec681f3Smrg- aco: fix waitcnts for barriers at block ends 27847ec681f3Smrg- nir: add nir_num_variable_modes and nir_var_mem_push_const 27857ec681f3Smrg- radv: set alignment for load_ssbo/store_ssbo in meta shaders 27867ec681f3Smrg- nir: add a load/store vectorization pass 27877ec681f3Smrg- nir: add load/store vectorizer tests 27887ec681f3Smrg- aco: enable load/store vectorizer 27897ec681f3Smrg- aco: allow constant offsets for global/scratch instructions on GFX10 27907ec681f3Smrg- aco: set dlc/glc correctly for image loads 27917ec681f3Smrg- aco: propagate p_wqm on an image_sample's coordinate p_create_vector 27927ec681f3Smrg- aco: fix i2i64 27937ec681f3Smrg- aco: fix incorrect cast in parse_wait_instr() 27947ec681f3Smrg- aco: add v_nop inbetween exec write and VMEM/DS/FLAT 27957ec681f3Smrg- aco: improve WAR hazard workaround with >64bit stores 27967ec681f3Smrg- aco: fix GFX10 opcodes for some global/flat atomics 27977ec681f3Smrg- aco: fix assembly of FLAT/GLOBAL atomics 27987ec681f3Smrg- aco: fix SADDR with FLAT on GFX10 27997ec681f3Smrg- aco: don't enable store_global for helper invocations 28007ec681f3Smrg- aco: improve FLAT/GLOBAL scheduling 28017ec681f3Smrg- aco: implement global atomics 28027ec681f3Smrg- ac/llvm: fix pointer type for global atomics 28037ec681f3Smrg- ac/llvm: improve sync scope for global atomics 28047ec681f3Smrg- radv: set writes_memory for global memory stores/atomics 28057ec681f3Smrg- aco: validate the CFG 28067ec681f3Smrg- aco: handle loop exit and IF merge phis with break/discard 28077ec681f3Smrg- aco: fix block_kind_discard s_andn2 definition to exec 28087ec681f3Smrg- nir/lower_io_to_vector: don't create arrays when not needed 28097ec681f3Smrg- nir/load_store_vectorize: fix combining stores with aliasing loads 28107ec681f3Smrg between 28117ec681f3Smrg- aco/wave32: fix comparison optimizations 28127ec681f3Smrg- aco: improve jump threading with wave32 28137ec681f3Smrg- aco: fix vgpr alloc granule with wave32 28147ec681f3Smrg- aco: limit register usage for large work groups 28157ec681f3Smrg- aco: set vm for pos0 exports on GFX10 28167ec681f3Smrg- aco: fix imageSize()/textureSize() with large buffers on GFX8 28177ec681f3Smrg- aco: fix uninitialized data in the binary 28187ec681f3Smrg- aco: handle VOP3 modifiers when combining a constant comparison's NaN 28197ec681f3Smrg test 28207ec681f3Smrg- aco: handle omod successors with the constant in the first operand 28217ec681f3Smrg- aco: check usesModifiers() when identifying a neg/abs 28227ec681f3Smrg- aco: better handle neg/abs of sgprs 28237ec681f3Smrg- aco: set exec_potentially_empty for demotes 28247ec681f3Smrg- aco: don't DCE atomics with return values 28257ec681f3Smrg- aco: disable add combining for ds_swizzle_b32 28267ec681f3Smrg- aco: check if multiplication/clamp is live when applying output 28277ec681f3Smrg modifier 28287ec681f3Smrg- nir/divergence: handle load_primitive_id in GS 28297ec681f3Smrg- nir/lower_gs_intrinsics: add option for per-stream counts 28307ec681f3Smrg- aco: update IR validator 28317ec681f3Smrg- aco: apply literals to split mads 28327ec681f3Smrg- aco: combine two sgprs into a VALU if they're the same 28337ec681f3Smrg- aco: improve can_use_VOP3() 28347ec681f3Smrg- aco: rewrite literal combining 28357ec681f3Smrg- aco: rewrite apply_sgprs() 28367ec681f3Smrg- aco: add check_vop3_operands() 28377ec681f3Smrg- aco: be more careful with literals in combine_salu_{n2,lshl_add} 28387ec681f3Smrg- aco: follow through temporary when merging tests into constant 28397ec681f3Smrg comparisons 28407ec681f3Smrg- aco: allow applying two sgprs to an instruction 28417ec681f3Smrg- aco: allow an extra SGPR with multiple uses to be applied to VOP3 28427ec681f3Smrg- aco: take advantage of GFX10's constant bus limit and VOP3 literals 28437ec681f3Smrg- aco: improve creation of v_madmk_f32/v_madak_f32 28447ec681f3Smrg- aco: fix clamp optimization 28457ec681f3Smrg- aco: improve clamp optimization 28467ec681f3Smrg- aco: add min(-max(), ) and max(-min(), ) optimization 28477ec681f3Smrg- aco: don't move literal to reg when making an instruction VOP3 on 28487ec681f3Smrg GFX10 28497ec681f3Smrg- aco: allow input modifiers on v_cndmask_b32 28507ec681f3Smrg- aco: replace extract_vector with copies 28517ec681f3Smrg- aco: improve readfirstlane after uniform LDS loads 28527ec681f3Smrg- aco: add integer min/max to can_swap_operands 28537ec681f3Smrg- nir/sink,nir/move: move/sink load_per_vertex_input 28547ec681f3Smrg- nir/sink,nir/move: move/sink nir_op_mov 28557ec681f3Smrg- nir/algebraic: a & ~(a >> 31) -> imax(a, 0) 28567ec681f3Smrg- aco: fix stack buffer overflow in apply_sgprs() 28577ec681f3Smrg- aco: fix fall-through test in try_remove_simple_block() with 28587ec681f3Smrg back-edges 28597ec681f3Smrg- aco: fix operand kill flags when a temporary is used more than once 28607ec681f3Smrg- aco: fix off-by-one error when initializing sgpr_live_in 28617ec681f3Smrg- radv: move gs copy shader creation before other variants 28627ec681f3Smrg- aco: improve support for s_sendmsg 28637ec681f3Smrg- radv/aco,aco: implement GS on GFX9+ 28647ec681f3Smrg- aco: implement GS on GFX7-8 28657ec681f3Smrg- radv/aco: allow ACO for GS 28667ec681f3Smrg- aco: explicitly mark end blocks for exports 28677ec681f3Smrg- aco: remove needs_instance_id 28687ec681f3Smrg- aco: implement GS copy shaders 28697ec681f3Smrg- radv/aco: use ACO for GS copy shaders 28707ec681f3Smrg- aco: use nir_move_copies 28717ec681f3Smrg- aco: fix WaR check for >64-bit FLAT/GLOBAL instructions 28727ec681f3Smrg- aco: fix operand to scc when selecting SGPR ufind_msb/ifind_msb 28737ec681f3Smrg- aco: always add sgprs to sgpr_ids when choosing literals 28747ec681f3Smrg- aco: fix literal application with v_cndmask_b32/v_addc_co_u32/etc 28757ec681f3Smrg- amd/common,radv: move vertex_format_table to ac_shader_util.{h,c} 28767ec681f3Smrg- aco: rework vertex fetching a bit 28777ec681f3Smrg- aco: skip unused channels at the start when fetching vertices 28787ec681f3Smrg- aco: handle unaligned vertex fetch on GFX10 28797ec681f3Smrg- aco: value-number MUBUF instructions 28807ec681f3Smrg- aco: use MUBUF in some situations instead of splitting vertex fetches 28817ec681f3Smrg- aco: fix rebase error from GS copy shader support 28827ec681f3Smrg- aco: ensure predecessors' p_logical_end is in WQM when a p_phi is in 28837ec681f3Smrg WQM 28847ec681f3Smrg- aco: run p_wqm instructions in WQM 28857ec681f3Smrg- nir/algebraic: add patterns for a >> #b << #b 28867ec681f3Smrg- nir/algebraic: add some half packing optimizations 28877ec681f3Smrg- aco: fix target calculation when vgpr spilling introduces sgpr 28887ec681f3Smrg spilling 28897ec681f3Smrg- aco: don't consider loop header blocks branch blocks in 28907ec681f3Smrg add_coupling_code 28917ec681f3Smrg- aco: don't update demand in add_coupling_code() for loop headers 28927ec681f3Smrg- aco: only create parallelcopy to restore exec at loop exit if needed 28937ec681f3Smrg- aco: don't always add logical edges from continue_break blocks to 28947ec681f3Smrg headers 28957ec681f3Smrg- aco: error when block has no logical preds but VGPRs are live at the 28967ec681f3Smrg start 28977ec681f3Smrg- aco: set exec_potentially_empty after continues/breaks in nested IFs 28987ec681f3Smrg- aco: improve assertion at the end of spiller 28997ec681f3Smrg- aco: fill reg_demand with sensible information in add_coupling_code() 29007ec681f3Smrg- aco: parallelcopy exec mask before s_wqm 29017ec681f3Smrg- aco: fix exec mask consistency issues 29027ec681f3Smrg- aco: fix gfx10_wave64_bpermute 29037ec681f3Smrg 29047ec681f3SmrgRicardo Garcia (1): 29057ec681f3Smrg 29067ec681f3Smrg- anv: Unify GetDeviceQueue and GetDeviceQueue2 29077ec681f3Smrg 29087ec681f3SmrgRob Clark (89): 29097ec681f3Smrg 29107ec681f3Smrg- freedreno/ir3: split pre-coloring to it's own function 29117ec681f3Smrg- freedreno/ir3: use SSA flag on dest register too 29127ec681f3Smrg- freedreno/ir3: ir3_print tweaks 29137ec681f3Smrg- freedreno/ir3/ra: move regs_count==0 check 29147ec681f3Smrg- freedreno/ir3/ra: remove ir print after livein/out 29157ec681f3Smrg- freedreno/ir3: remove obsolete comment 29167ec681f3Smrg- freedreno/a3xx: fix SP_FS_MRT_REG.HALF_PRECISION 29177ec681f3Smrg- freedreno/a4xx: fix SP_FS_MRT_REG.HALF_PRECISION 29187ec681f3Smrg- freedreno/ir3: sync disasm changes from envytools 29197ec681f3Smrg- freedreno/ir3: also track # of nops for shader-db 29207ec681f3Smrg- freedreno: fix eglDupNativeFenceFD error 29217ec681f3Smrg- freedreno/ir3: fix valgrind complaint with STLW 29227ec681f3Smrg- freedreno/ir3: remove half-precision output 29237ec681f3Smrg- freedreno/ir3: rename fanin/fanout to collect/split 29247ec681f3Smrg- freedreno/ir3: remove impossible condition 29257ec681f3Smrg- freedreno/ir3: add input/output iterators 29267ec681f3Smrg- freedreno/ir3: show input/output wrmask's in disasm 29277ec681f3Smrg- freedreno/ir3: helper to print ir if debug enabled 29287ec681f3Smrg- freedreno/ir3: remove first-vertex sysval 29297ec681f3Smrg- freedreno/ir3: simplify creating sysval inputs 29307ec681f3Smrg- freedreno/ir3: re-work shader inputs/outputs 29317ec681f3Smrg- freedreno/ir3: only tex instructions have wrmask 29327ec681f3Smrg- freedreno/ir3: fix gpu hang with pre-fs-tex-fetch 29337ec681f3Smrg- freedreno/ir3: legalize cleanups 29347ec681f3Smrg- freedreno/ir3: remove unused parameter 29357ec681f3Smrg- freedreno/perfcntrs: small cleanup 29367ec681f3Smrg- freedreno/perfcntrs: remove gallium dependencies 29377ec681f3Smrg- freedreno/perfcntrs: move to shared location 29387ec681f3Smrg- freedreno/perfcntrs: add accessor to get per-gen tables 29397ec681f3Smrg- freedreno/perfctrs/a2xx: move CP to be first group 29407ec681f3Smrg- freedreno/perfcntrs/a6xx: remove RBBM counters 29417ec681f3Smrg- freedreno/perfcntrs: add fdperf 29427ec681f3Smrg- freedreno/perfctrs/fdperf: periodically restore counters 29437ec681f3Smrg- gitlab-ci: update deqp build so we can generate xml 29447ec681f3Smrg- gitlab-ci/deqp: preserve full list of unexpected results 29457ec681f3Smrg- gitlab-ci/deqp: preserve caselists for blocks with fails 29467ec681f3Smrg- gitlab-ci/deqp: detect and report flakes 29477ec681f3Smrg- gitlab-ci: bump arm test container 29487ec681f3Smrg- gitlab-ci/deqp: generate xml results for fails/flakes 29497ec681f3Smrg- gitlab-ci/deqp: generate junit results 29507ec681f3Smrg- gitlab-ci/freedreno/a6xx: remove most of the flakes 29517ec681f3Smrg- freedreno: use rsc->slice accessor everywhere 29527ec681f3Smrg- freedreno: switch to layout helper 29537ec681f3Smrg- gitlab-ci: disable junit results for deqp 29547ec681f3Smrg- freedreno/ir3: remove store_output lowered to store_shared_ir3 29557ec681f3Smrg- freedreno/ir3: fix neverball assert in case of unused VS inputs 29567ec681f3Smrg- nir/lower_clip: Fix incorrect driver loc for clipdist outputs 29577ec681f3Smrg- freedreno/fdperf: use drmOpen() 29587ec681f3Smrg- freedreno/a6xx: disable LRZ when blending 29597ec681f3Smrg- freedreno/a5xx+a6xx: split LRZ layout to per-gen 29607ec681f3Smrg- freedreno/a6xx: fix LRZ layout 29617ec681f3Smrg- freedreno/a6xx: fix LRZ logic 29627ec681f3Smrg- freedreno/a6xx: enable LRZ by default 29637ec681f3Smrg- spirv: add OpLifetime\* 29647ec681f3Smrg- freedreno/ir3: add last-baryf shaderdb stat 29657ec681f3Smrg- freedreno/ir3: add scheduler traces 29667ec681f3Smrg- freedreno/ir3: add iterator macros 29677ec681f3Smrg- freedreno/a6xx: fix OUT_REG() vs growable cmdstream 29687ec681f3Smrg- nir+vtn: vec8+vec16 support 29697ec681f3Smrg- freedreno/ir3: fix flat shading again 29707ec681f3Smrg- nir: assert that nir_lower_tex runs after lowering derefs 29717ec681f3Smrg- mesa/st: lower samplers before nir_lower_tex 29727ec681f3Smrg- freedreno/ir3: rename instructions 29737ec681f3Smrg- gitlab-ci: fix missing caselist.css/xsl 29747ec681f3Smrg- freedreno/a6xx: limit scratch/debug markers to debug builds 29757ec681f3Smrg- freedreno/a6xx: cleanup rasterizer state 29767ec681f3Smrg- freedreno/a6xx: separate rast stateobj for prim restart 29777ec681f3Smrg- freedreno/a6xx: drop a few more per-draw registers 29787ec681f3Smrg- freedreno/a6xx: move dynamic program state to streaming stateobj 29797ec681f3Smrg- freedreno/a6xx: add PROG_FB_RAST stateobj 29807ec681f3Smrg- freedreno/drm: fix invalid-cmdstream-size with older kernels 29817ec681f3Smrg- freedreno: use PIPE_CAP_RGB_OVERRIDE_DST_ALPHA_BLEND 29827ec681f3Smrg- mesa/st: random whitespace cleanup 29837ec681f3Smrg- freedreno/a6xx: remove special handling based on MRT format 29847ec681f3Smrg- freedreno/a6xx: convert blend state to stateobj 29857ec681f3Smrg- freedreno: extract vsc pipe bo from GMEM state 29867ec681f3Smrg- freedreno: consolidate GMEM state 29877ec681f3Smrg- freedreno: constify fd_tile 29887ec681f3Smrg- freedreno: constify fd_vsc_pipe 29897ec681f3Smrg- freedreno/a6xx: constify gmem state 29907ec681f3Smrg- freedreno/a5xx: constify gmem state 29917ec681f3Smrg- freedreno/a4xx: constify gmem state 29927ec681f3Smrg- freedreno/a3xx: constify gmem state 29937ec681f3Smrg- freedreno/a2xx: constify gmem state 29947ec681f3Smrg- freedreno: get GMEM state from batch 29957ec681f3Smrg- freedreno: add gmem state cache 29967ec681f3Smrg- freedreno: add gmem_lock 29977ec681f3Smrg- freedreno: remove flush-queue 29987ec681f3Smrg- freedreno: allow ctx->batch to be NULL 29997ec681f3Smrg 30007ec681f3SmrgRobert Foss (5): 30017ec681f3Smrg 30027ec681f3Smrg- nir: Build nir_lower_point_size.c in libmesa_nir 30037ec681f3Smrg- android: Add panfrost support to build scripts 30047ec681f3Smrg- android: Fix u_format_table.c being generated twice 30057ec681f3Smrg- panfrost: Prefix schedule_program to prevent collision 30067ec681f3Smrg- android: Fix whitespace issue 30077ec681f3Smrg 30087ec681f3SmrgRohan Garg (1): 30097ec681f3Smrg 30107ec681f3Smrg- gitlab-ci: Use lavacli from packages 30117ec681f3Smrg 30127ec681f3SmrgRoland Scheidegger (3): 30137ec681f3Smrg 30147ec681f3Smrg- gallium/scons: fix graw_gdi build 30157ec681f3Smrg- util/atomic: Fix p_atomic_add for unlocked and msvc paths 30167ec681f3Smrg- winsys/svga: use new ioctl for logging 30177ec681f3Smrg 30187ec681f3SmrgRoman Stratiienko (2): 30197ec681f3Smrg 30207ec681f3Smrg- Android: Fix build issue without LLVM 30217ec681f3Smrg- panfrost: Fix Android build 30227ec681f3Smrg 30237ec681f3SmrgRoss Zwisler (1): 30247ec681f3Smrg 30257ec681f3Smrg- intel: limit shader geometry on BDW GT1 30267ec681f3Smrg 30277ec681f3SmrgSagar Ghuge (1): 30287ec681f3Smrg 30297ec681f3Smrg- intel/compiler: Clear accumulator register before EOT 30307ec681f3Smrg 30317ec681f3SmrgSamuel Iglesias Gonsálvez (1): 30327ec681f3Smrg 30337ec681f3Smrg- main: fix coverity error in \_mesa_program_resource_find_name() 30347ec681f3Smrg 30357ec681f3SmrgSamuel Pitoiset (202): 30367ec681f3Smrg 30377ec681f3Smrg- radv: declare NGG scratch for VS or TES and only on GFX10 30387ec681f3Smrg- radv: fix compute pipeline keys when optimizations are disabled 30397ec681f3Smrg- docs: document all RADV environment variables 30407ec681f3Smrg- radv: add a note about perftest/debug options 30417ec681f3Smrg- radv: fix 32-bit compiler warnings 30427ec681f3Smrg- nir: fix packing of nir_variable 30437ec681f3Smrg- radv/gfx10: enable wave32 for compute based on shader's wavesize 30447ec681f3Smrg- radv: hardcode the number of waves for the GFX6 LS-HS bug 30457ec681f3Smrg- radv: determine shaders wavesize at pipeline level 30467ec681f3Smrg- radv: rely on shader's wavesize when computing NGG info 30477ec681f3Smrg- radv: implement VK_EXT_subgroup_size_control 30487ec681f3Smrg- radv/gfx10: fix primitive indices orientation for NGG GS 30497ec681f3Smrg- ac: handle pointer types to LDS in ac_get_elem_bits() 30507ec681f3Smrg- gitlab-ci: build a specific libdrm version for ARM64 30517ec681f3Smrg- gitlab-ci: build RADV on ARM64 30527ec681f3Smrg- ac: fix build with recent LLVM 30537ec681f3Smrg- radv: remove useless RADV_DEBUG=unsafemath debug option 30547ec681f3Smrg- radv: make sure to not clear the ds attachment after resolves 30557ec681f3Smrg- ac: add radeon_info::has_l2_uncached 30567ec681f3Smrg- radv: implement VK_AMD_device_coherent_memory 30577ec681f3Smrg- spirv: fix lowering of OpGroupNonUniformAllEqual 30587ec681f3Smrg- ac: remove useless cast in ac_build_set_inactive() 30597ec681f3Smrg- ac: add 8-bit and 16-bit supports to ac_build_shuffle() 30607ec681f3Smrg- ac: add 8-bit and 16-bit supports to ac_build_readlane() 30617ec681f3Smrg- ac: add 8-bit and 16-bit supports to ac_build_set_inactive() 30627ec681f3Smrg- ac: add 8-bit and 16-bit supports to ac_build_dpp() 30637ec681f3Smrg- ac: add 8-bit and 16-bit supports to ac_build_swizzle() 30647ec681f3Smrg- ac: add 8-bit and 16-bit supports to get_reduction_identity() 30657ec681f3Smrg- ac: add 8-bit and 16-bit supports to ac_build_wwm() 30667ec681f3Smrg- ac: add 8-bit and 16-bit supports to ac_build_optimization_barrier() 30677ec681f3Smrg- ac: add 16-bit float support to ac_build_alu_op() 30687ec681f3Smrg- radv: advertise VK_KHR_shader_subgroup_extended_types on GFX8-GFX9 30697ec681f3Smrg- radv: enable VK_KHR_shader_subgroup_extended_types on GFX6-GFX7 30707ec681f3Smrg- docs: add missing new features for RADV 30717ec681f3Smrg- pipe-loader: check that the pointer to driconf_xml isn't NULL 30727ec681f3Smrg- gitlab-ci: move building piglit into a separate script 30737ec681f3Smrg- gitlab-ci: fix ldd check for Vulkan drivers 30747ec681f3Smrg- gitlab-ci: add a job that only build things needed for testing 30757ec681f3Smrg- gitlab-ci: do not build with debugoptimized for meson-main 30767ec681f3Smrg- gitlab-ci: build swr in meson-main 30777ec681f3Smrg- gitlab-ci: build GLVND in meson-clang 30787ec681f3Smrg- gitlab-ci: remove now useless meson-swr-glvnd build job 30797ec681f3Smrg- gitlab-ci: reduce the number of scons build 30807ec681f3Smrg- radv: disable subgroup shuffle operations on GFX10 30817ec681f3Smrg- ac/llvm: fix the local invocation index for wave32 30827ec681f3Smrg- meson: only build imgui when needed 30837ec681f3Smrg- radv: set the image view aspect mask during subpass transitions 30847ec681f3Smrg- radv: set the image view aspect mask before resolves 30857ec681f3Smrg- radv: rework creation of decompress/resummarize meta pipelines 30867ec681f3Smrg- radv: create decompress pipelines for separate depth/stencil layouts 30877ec681f3Smrg- radv: select the depth decompress path based on the aspect mask 30887ec681f3Smrg- ac/llvm: fix warning in ac_build_canonicalize() 30897ec681f3Smrg- radv: fix reporting subgroup size with 30907ec681f3Smrg VK_KHR_pipeline_executable_properties 30917ec681f3Smrg- radv: fix enabling sample shading with SampleID/SamplePosition 30927ec681f3Smrg- radv/gfx10: fix implementation of exclusive scans 30937ec681f3Smrg- ac: add 8-bit and 16-bit supports to ac_build_permlane16() 30947ec681f3Smrg- radv: enable VK_KHR_shader_subgroup_extended_types on GFX10 30957ec681f3Smrg- ac/llvm: convert src operands to pointers if necessary 30967ec681f3Smrg- radv: add more constants to avoid using magic numbers 30977ec681f3Smrg- radv,ac/nir: lower deref operations for shared memory 30987ec681f3Smrg- aco: drop useless lowering of deref operations for shared memory 30997ec681f3Smrg- ac/llvm: fix atomic var operations if source isn't a deref 31007ec681f3Smrg- radv: remove dead shader input/output variables 31017ec681f3Smrg- radv: simplify a check in radv_fixup_vertex_input_fetches() 31027ec681f3Smrg- radv/gfx10: fix the vertex order for triangle strips emitted by a GS 31037ec681f3Smrg- gitlab-ci: rename build-deqp.sh to build-deqp-gl.sh 31047ec681f3Smrg- gitlab-ci: add a gl suffix to the x86 test image and all test jobs 31057ec681f3Smrg- gitlab-ci: add a new job that builds a base test image for VK 31067ec681f3Smrg- gitlab-ci: build cts_runner in the x86 test image for VK 31077ec681f3Smrg- gitlab-ci: build dEQP VK 1.1.6 in the x86 test image for VK 31087ec681f3Smrg- gitlab-ci: add a new base test job for VK 31097ec681f3Smrg- gitlab-ci: allow to run dEQP Vulkan with DEQP_VER 31107ec681f3Smrg- gitlab-ci: configure the Vulkan ICD export with VK_DRIVER 31117ec681f3Smrg- gitlab-ci: build RADV in meson-testing 31127ec681f3Smrg- gitlab-ci: add a job that runs Vulkan CTS with RADV conditionally 31137ec681f3Smrg- radv: do not use VK_TRUE/VK_FALSE 31147ec681f3Smrg- radv: move emission of two PA_SC\_\* registers to the pipeline CS 31157ec681f3Smrg- radv: fix possibly wrong PA_SC_AA_CONFIG value for conservative rast 31167ec681f3Smrg- radv: synchronize after performing a separate depth/stencil fast 31177ec681f3Smrg clears 31187ec681f3Smrg- radv: do not init HTILE as compressed state when dst layout allows it 31197ec681f3Smrg- radv: initialize HTILE for separate depth/stencil aspects 31207ec681f3Smrg- radv: implement VK_KHR_separate_depth_stencil_layouts 31217ec681f3Smrg- gitlab-ci: set RADV_DEBUG=checkir for RADV test jobs 31227ec681f3Smrg- ac/nir: fix out-of-bound access when loading constants from global 31237ec681f3Smrg- radv: enable SpvCapabilityImageMSArray 31247ec681f3Smrg- radv: handle unaligned vertex fetches on GFX6/GFX10 31257ec681f3Smrg- radv/gfx10: fix ngg_get_ordered_id 31267ec681f3Smrg- radv/gfx10: fix the out-of-bounds check for vertex descriptors 31277ec681f3Smrg- ac: declare an enum for the OOB select field on GFX10 31287ec681f3Smrg- radv: init a default multisample state for the resolve FS path 31297ec681f3Smrg- radv: ignore pMultisampleState if rasterization is disabled 31307ec681f3Smrg- radv: ignore pTessellationState if the pipeline doesn't use tess 31317ec681f3Smrg- radv: ignore pDepthStencilState if rasterization is disabled 31327ec681f3Smrg- radv: tidy up radv_pipeline_init_blend_state() 31337ec681f3Smrg- radv: ignore pColorBlendState if rasterization is disabled 31347ec681f3Smrg- radv: rely on pipeline layout when creating push descriptors with 31357ec681f3Smrg template 31367ec681f3Smrg- radv: return the correct pitch for linear mipmaps on GFX10 31377ec681f3Smrg- radv: record number of color/depth samples for each subpass 31387ec681f3Smrg- radv: implement VK_AMD_mixed_attachment_samples 31397ec681f3Smrg- ac/surface: use uint16_t for mipmap level pitches 31407ec681f3Smrg- radv: do not fill keys from fragment shader twice 31417ec681f3Smrg- spirv: add SpvCapabilityImageReadWriteLodAMD 31427ec681f3Smrg- spirv,nir: add new lod parameter to image_{load,store} intrinsics 31437ec681f3Smrg- amd/llvm: handle nir_intrinsic_image_deref_{load,store} with lod 31447ec681f3Smrg- aco: handle nir_intrinsic_image_deref_{load,store} with lod 31457ec681f3Smrg- radv: advertise VK_AMD_shader_image_load_store_lod 31467ec681f3Smrg- radv/gfx10: disable vertex grouping 31477ec681f3Smrg- radv/gfx10: determine if a pipeline is eligible for NGG passthrough 31487ec681f3Smrg- radv/gfx10: do not declare LDS for NGG if useless 31497ec681f3Smrg- radv/gfx10: add support for NGG passthrough mode 31507ec681f3Smrg- radv/gfx10: improve performance for TES using PrimID but not 31517ec681f3Smrg exporting it 31527ec681f3Smrg- radv: only use VkSamplerCreateInfo::compareOp if enabled 31537ec681f3Smrg- radv/gfx10: enable all CUs if NGG is never used 31547ec681f3Smrg- radv/gfx10: simplify some duplicated NGG GS code 31557ec681f3Smrg- vulkan/overlay: Fix for Vulkan 1.2 31567ec681f3Smrg- radv: update VK_EXT_descriptor_indexing for Vulkan 1.2 31577ec681f3Smrg- radv: update VK_EXT_host_query_reset for Vulkan 1.2 31587ec681f3Smrg- radv: update VK_EXT_sampler_filter_minmax for Vulkan 1.2 31597ec681f3Smrg- radv: update VK_EXT_scalar_block_layout for Vulkan 1.2 31607ec681f3Smrg- radv: update VK_KHR_8bit_storage for Vulkan 1.2 31617ec681f3Smrg- radv: update VK_KHR_buffer_device_address for Vulkan 1.2 31627ec681f3Smrg- radv: update VK_KHR_create_renderpass2 for Vulkan 1.2 31637ec681f3Smrg- radv: update VK_KHR_depth_stencil_resolve for Vulkan 1.2 31647ec681f3Smrg- radv: update VK_KHR_draw_indirect_count for Vulkan 1.2 31657ec681f3Smrg- radv: update VK_KHR_driver_properties for Vulkan 1.2 31667ec681f3Smrg- radv: update VK_KHR_image_format_list for Vulkan 1.2 31677ec681f3Smrg- radv: update VK_KHR_imageless_framebuffer for Vulkan 1.2 31687ec681f3Smrg- radv: update VK_KHR_shader_atomic_int64 for Vulkan 1.2 31697ec681f3Smrg- radv: update VK_KHR_shader_float16_int8 for Vulkan 1.2 31707ec681f3Smrg- radv: update VK_KHR_shader_float_controls for Vulkan 1.2 31717ec681f3Smrg- radv: update VK_KHR_shader_subgroup_extended_types for Vulkan 1.2 31727ec681f3Smrg- radv: update VK_KHR_uniform_buffer_standard_layout for Vulkan 1.2 31737ec681f3Smrg- radv: update VK_KHR_timeline_semaphore for Vulkan 1.2 31747ec681f3Smrg- radv: implement Vulkan 1.1 features and properties 31757ec681f3Smrg- radv: implement Vulkan 1.2 features and properties 31767ec681f3Smrg- radv: enable Vulkan 1.2 31777ec681f3Smrg- aco: fix emitting SMEM instructions with no operands on GFX6-GFX7 31787ec681f3Smrg- aco: do not select 96-bit/128-bit variants for ds_read/ds_write on 31797ec681f3Smrg GFX6 31807ec681f3Smrg- aco: do not combine additions of DS instructions on GFX6 31817ec681f3Smrg- aco: implement stream output with vec3 on GFX6 31827ec681f3Smrg- aco: fix emitting slc for MUBUF instructions on GFX6-GFX7 31837ec681f3Smrg- aco: print assembly with CLRXdisasm for GFX6-GFX7 if found on the 31847ec681f3Smrg system 31857ec681f3Smrg- aco: fix constant folding of SMRD instructions on GFX6 31867ec681f3Smrg- aco: do not use the vec3 variant for stores on GFX6 31877ec681f3Smrg- aco: do not use the vec3 variant for loads on GFX6 31887ec681f3Smrg- aco: add new addr64 bit to MUBUF instructions on GFX6-GFX7 31897ec681f3Smrg- aco: implement nir_intrinsic_load_barycentric_at_sample on GFX6 31907ec681f3Smrg- radv: fix double free corruption in radv_alloc_memory() 31917ec681f3Smrg- radv: add explicit external subpass dependencies to meta operations 31927ec681f3Smrg- radv: handle missing implicit subpass dependencies 31937ec681f3Smrg- spirv: add SpvCapabilityFragmentMaskAMD 31947ec681f3Smrg- nir: add two new texture ops for multisample fragment color/mask 31957ec681f3Smrg fetches 31967ec681f3Smrg- spirv: add support for SpvOpFragment{Mask}FetchAMD operations 31977ec681f3Smrg- nir/lower_input_attachments: lower nir_texop_fragment_{mask}_fetch 31987ec681f3Smrg- ac/nir: add support for nir_texop_fragment_{mask}_fetch 31997ec681f3Smrg- aco: add support for nir_texop_fragment_{mask}_fetch 32007ec681f3Smrg- radv: advertise VK_AMD_shader_fragment_mask 32017ec681f3Smrg- aco: fix printing assembly with CLRXdisasm on GFX6 32027ec681f3Smrg- aco: fix wrong IR in nir_intrinsic_load_barycentric_at_sample 32037ec681f3Smrg- aco: implement nir_intrinsic_store_global on GFX6 32047ec681f3Smrg- aco: implement nir_intrinsic_load_global on GFX6 32057ec681f3Smrg- aco: implement nir_intrinsic_global_atomic\_\* on GFX6 32067ec681f3Smrg- aco: implement 64-bit nir_op_ftrunc on GFX6 32077ec681f3Smrg- aco: implement 64-bit nir_op_fceil on GFX6 32087ec681f3Smrg- aco: implement 64-bit nir_op_fround_even on GFX6 32097ec681f3Smrg- aco: implement 64-bit nir_op_ffloor on GFX6 32107ec681f3Smrg- aco: implement nir_op_f2i64/nir_op_f2u64 on GFX6 32117ec681f3Smrg- ac/llvm: fix missing casts in ac_build_readlane() 32127ec681f3Smrg- aco: combine MRTZ (depth, stencil, sample mask) exports 32137ec681f3Smrg- aco: fix a hardware bug for MRTZ exports on GFX6 32147ec681f3Smrg- aco: fix a hazard with v_interp\_\* and v_{read,readfirst}lane\_\* on 32157ec681f3Smrg GFX6 32167ec681f3Smrg- aco: copy the literal offset of SMEM instructions to a temporary 32177ec681f3Smrg- radv: enable ACO support for GFX6 32187ec681f3Smrg- radv: print NIR shaders after lowering FS inputs/outputs 32197ec681f3Smrg- radv: do not allow sparse resources with multi-planar formats 32207ec681f3Smrg- radv: enable VK_AMD_shader_fragment_mask on GFX6-GFX7 32217ec681f3Smrg- compiler: add a new explicit interpolation mode 32227ec681f3Smrg- spirv: add support for SpvDecorationExplicitInterpAMD 32237ec681f3Smrg- compiler: add PERSP to the existing barycentric system values 32247ec681f3Smrg- compiler: add new SYSTEM_VALUE_BARYCENTRIC\_\* 32257ec681f3Smrg- spirv: add support for SpvBuiltInBaryCoord\* 32267ec681f3Smrg- nir: add nir_intrinsic_load_barycentric_model 32277ec681f3Smrg- nir: lower SYSTEM_VALUE_BARYCENTRIC\_\* to nir_load_barycentric() 32287ec681f3Smrg- nir: add nir_intrinsic_interp_deref_at_vertex 32297ec681f3Smrg- nir: lower interp_deref_at_vertex to load_input_vertex 32307ec681f3Smrg- spirv: implement SPV_AMD_shader_explicit_vertex_parameter 32317ec681f3Smrg- ac/llvm: implement VK_AMD_shader_explicit_vertex_parameter 32327ec681f3Smrg- aco: implement VK_AMD_shader_explicit_vertex_parameter 32337ec681f3Smrg- radv: gather which input PS variables use an explicit interpolation 32347ec681f3Smrg mode 32357ec681f3Smrg- radv: implement VK_AMD_shader_explicit_vertex_parameter 32367ec681f3Smrg- radv: bump conformance version to 1.2.0.0 32377ec681f3Smrg- radv: remove the non conformant VK implementation warning on GFX10 32387ec681f3Smrg- aco: fix VS input loads with MUBUF on GFX6 32397ec681f3Smrg- radv/gfx10: add a separate flag for creating a GDS OA buffer 32407ec681f3Smrg- radv/gfx10: implement NGG GS queries 32417ec681f3Smrg- radv/gfx10: re-enable NGG GS 32427ec681f3Smrg- radv: refactor physical device properties 32437ec681f3Smrg- aco: fix MUBUF VS input loads when expanding vec3 to vec4 on GFX6 32447ec681f3Smrg- aco: do not use ds_{read,write}2 on GFX6 32457ec681f3Smrg- aco: fix waiting for scalar stores before "writing back" data on 32467ec681f3Smrg GFX8-GFX9 32477ec681f3Smrg- aco: fix creating v_madak if v_mad_f32 has two sgpr literals 32487ec681f3Smrg- nir: do not use De Morgan's Law rules for flt and fge 32497ec681f3Smrg 32507ec681f3SmrgSamuel Thibault (3): 32517ec681f3Smrg 32527ec681f3Smrg- loader: #define PATH_MAX when undefined (eg. Hurd) 32537ec681f3Smrg- util: Do not fail to build on unknown pthread_setname_np 32547ec681f3Smrg- meson: Do not require libdrm for DRI2 on hurd 32557ec681f3Smrg 32567ec681f3SmrgSatyajit Sahu (1): 32577ec681f3Smrg 32587ec681f3Smrg- radeon/vcn: Handle crop parameters for encoder 32597ec681f3Smrg 32607ec681f3SmrgSonny Jiang (1): 32617ec681f3Smrg 32627ec681f3Smrg- radeonsi: use compute shader for clear 12-byte buffer 32637ec681f3Smrg 32647ec681f3SmrgStephan Gerhold (1): 32657ec681f3Smrg 32667ec681f3Smrg- kmsro: Add "mcde" entry point 32677ec681f3Smrg 32687ec681f3SmrgTapani Pälli (33): 32697ec681f3Smrg 32707ec681f3Smrg- nir: fix couple of compile warnings 32717ec681f3Smrg- util/android: fix android build errors 32727ec681f3Smrg- Revert "egl: implement new functions from 32737ec681f3Smrg EGL_EXT_image_flush_external" 32747ec681f3Smrg- Revert "egl: handle EGL_IMAGE_EXTERNAL_FLUSH_EXT" 32757ec681f3Smrg- Revert "st/dri: add support for EGL_EXT_image_flush_external" 32767ec681f3Smrg- Revert "st/dri: assume external consumers of back buffers can write 32777ec681f3Smrg to the buffers" 32787ec681f3Smrg- Revert "dri_interface: add interface for 32797ec681f3Smrg EGL_EXT_image_flush_external" 32807ec681f3Smrg- mesa: allow bit queries for EXT_disjoint_timer_query 32817ec681f3Smrg- Revert "mesa: allow bit queries for EXT_disjoint_timer_query" 32827ec681f3Smrg- mesa: allow bit queries for EXT_disjoint_timer_query 32837ec681f3Smrg- gitlab-ci: update Piglit commit, update skips 32847ec681f3Smrg- mapi: add GetInteger64vEXT with EXT_disjoint_timer_query 32857ec681f3Smrg- glsl: handle max uniform limits with lower_const_arrays_to_uniforms 32867ec681f3Smrg- gitlab-ci: bump piglit checkout commit 32877ec681f3Smrg- glsl: additional interface redeclaration check for SSO programs 32887ec681f3Smrg- intel/compiler: add newline to limit_dispatch_width message 32897ec681f3Smrg- intel/compiler: force simd8 when dual src blending on gen8 32907ec681f3Smrg- dri: add \__DRI_IMAGE_FORMAT_SXRGB8 32917ec681f3Smrg- i965: expose MESA_FORMAT_B8G8R8X8_SRGB visual 32927ec681f3Smrg- mesa/st/i965: add a ProgramResourceHash for quicker resource lookup 32937ec681f3Smrg- mesa: create program resource hash in a single place 32947ec681f3Smrg- iris: set depth stall enabled when depth flush enabled on gen12 32957ec681f3Smrg- anv: set depth stall enabled when depth flush enabled on gen12 32967ec681f3Smrg- isl/gen12: add reminder comment about missing WA with 3D surfaces 32977ec681f3Smrg- anv: fix assert in GetImageDrmFormatModifierPropertiesEXT 32987ec681f3Smrg- anv: add assert for isl_mod_info in choose_isl_tiling_flags 32997ec681f3Smrg- anv: initialize clear_color_is_zero_one 33007ec681f3Smrg- egl/android: fix buffer_count for applications setting max count 33017ec681f3Smrg- anv/android: setup gralloc1 usage from gralloc0 usage manually 33027ec681f3Smrg- anv/android: make format_supported_with_usage static 33037ec681f3Smrg- intel/vec4: fix valgrind errors with vf_values array 33047ec681f3Smrg- glsl: fix a memory leak with resource_set 33057ec681f3Smrg- iris: fix aux buf map failure in 32bits app on Android 33067ec681f3Smrg 33077ec681f3SmrgThomas Hellstrom (4): 33087ec681f3Smrg 33097ec681f3Smrg- winsys/svga: Enable transhuge pages for buffer objects 33107ec681f3Smrg- svga: Avoid discard DMA uploads 33117ec681f3Smrg- gallium/util: Increase the debug_flush map depth 33127ec681f3Smrg- svga: Fix banded DMA upload 33137ec681f3Smrg 33147ec681f3SmrgThong Thai (8): 33157ec681f3Smrg 33167ec681f3Smrg- st/va: Convert interlaced NV12 to progressive 33177ec681f3Smrg- util/format: Add the P010 format used for 10-bit videos 33187ec681f3Smrg- gallium: Add PIPE_FORMAT_P010 support 33197ec681f3Smrg- st/va: Add support for P010, used for 10-bit videos 33207ec681f3Smrg- radeon: Use P010 for decoding of 10-bit videos 33217ec681f3Smrg- r600: Remove HEVC related code since HEVC is not supported 33227ec681f3Smrg- mesa: Prevent \_MaxLevel from being less than zero 33237ec681f3Smrg- Revert "st/va: Convert interlaced NV12 to progressive" 33247ec681f3Smrg 33257ec681f3SmrgTimothy Arceri (66): 33267ec681f3Smrg 33277ec681f3Smrg- glsl: just use NIR to lower outputs when driver can't read outputs 33287ec681f3Smrg- glsl: disable lower_fragdata_array() for NIR drivers 33297ec681f3Smrg- mesa: add ARB_shading_language_include stubs 33307ec681f3Smrg- glsl: add infrastructure for ARB_shading_language_include 33317ec681f3Smrg- mesa: add ARB_shading_language_include infrastructure to 33327ec681f3Smrg gl_shared_state 33337ec681f3Smrg- mesa: add helper to validate tokenise shader include path 33347ec681f3Smrg- mesa: add \_mesa_lookup_shader_include() helper 33357ec681f3Smrg- mesa: add copy_string() helper 33367ec681f3Smrg- mesa: add glNamedStringARB() support 33377ec681f3Smrg- mesa: implement glGetNamedStringARB() 33387ec681f3Smrg- mesa: make error checking optional in \_mesa_lookup_shader_include() 33397ec681f3Smrg- mesa: implement glIsNamedStringARB() 33407ec681f3Smrg- mesa: implement glGetNamedStringivARB() 33417ec681f3Smrg- mesa: split \_mesa_lookup_shader_include() in two 33427ec681f3Smrg- mesa: implement glDeleteNamedStringARB() 33437ec681f3Smrg- glsl: add ARB_shading_language_include support to #line 33447ec681f3Smrg- glsl: pass gl_context to glcpp_parser_create() 33457ec681f3Smrg- glsl: add preprocessor #include support 33467ec681f3Smrg- glsl: error if #include used while extension is disabled 33477ec681f3Smrg- glsl: add can_skip_compile() helper 33487ec681f3Smrg- glsl: delay compilation skip if shader contains an include 33497ec681f3Smrg- mesa: add support cursor support for relative path shader includes 33507ec681f3Smrg- mesa: add shader include lookup support for relative paths 33517ec681f3Smrg- mesa: implement glCompileShaderIncludeARB() 33527ec681f3Smrg- mesa: enable ARB_shading_language_include 33537ec681f3Smrg- gitlab-ci: bump piglit checkout commit 33547ec681f3Smrg- gitlab-ci: update for arb_shading_language_include 33557ec681f3Smrg- compiler: move build definition of pp_standalone_scaffolding.c 33567ec681f3Smrg- radv: add some infrastructure for fresh forks for each secure compile 33577ec681f3Smrg- radv: add a secure_compile_open_fifo_fds() helper 33587ec681f3Smrg- radv: create a fresh fork for each pipeline compile 33597ec681f3Smrg- docs: update source code repository documentation 33607ec681f3Smrg- glsl: move calculate_array_size_and_stride() to link_uniforms.cpp 33617ec681f3Smrg- glsl: don't set uniform block as used when its not 33627ec681f3Smrg- glsl: make use of active_shader_mask when building resource list 33637ec681f3Smrg- glsl/nir: iterate the system values list when adding varyings 33647ec681f3Smrg- docs: remove mailing list as way of submitting patches 33657ec681f3Smrg- glsl: move nir_remap_dual_slot_attributes() call out of glsl_to_nir() 33667ec681f3Smrg- glsl: copy the how_declared field when converting to nir 33677ec681f3Smrg- nir: add some fields to nir_variable_data 33687ec681f3Smrg- glsl: copy the new data fields when converting to nir 33697ec681f3Smrg- glsl: add support for named varyings in 33707ec681f3Smrg nir_build_program_resource_list() 33717ec681f3Smrg- glsl: add subroutine support to nir_build_program_resource_list() 33727ec681f3Smrg- st/glsl_to_nir: call gl_nir_lower_buffers() a little later 33737ec681f3Smrg- st/glsl_to_nir: use nir based program resource list builder 33747ec681f3Smrg- st/glsl_to_nir: fix SSO validation regression 33757ec681f3Smrg- glsl: rename gl_nir_link() to gl_nir_link_spirv() 33767ec681f3Smrg- glsl: add gl_nir_link_check_atomic_counter_resources() 33777ec681f3Smrg- glsl: add new gl_nir_link_glsl() helper 33787ec681f3Smrg- glsl: reorder link_and_validate_uniforms() calls 33797ec681f3Smrg- mesa: add new UseNIRGLSLLinker constant 33807ec681f3Smrg- glsl: use nir linker to link atomics 33817ec681f3Smrg- glsl: add check_image_resources() for the nir linker 33827ec681f3Smrg- glsl: use nir version of check_image_resources() for nir linker 33837ec681f3Smrg- glsl: move check_subroutine_resources() into the shared util code 33847ec681f3Smrg- glsl: call check_subroutine_resources() from the nir linker 33857ec681f3Smrg- glsl: move uniform resource checks into the common linker code 33867ec681f3Smrg- glsl: call uniform resource checks from the nir linker 33877ec681f3Smrg- glsl: move calculate_subroutine_compat() to shared linker code 33887ec681f3Smrg- glsl: call calculate_subroutine_compat() from the nir linker 33897ec681f3Smrg- glsl: fix potential bug in nir uniform linker 33907ec681f3Smrg- glsl: remove bogus assert in nir uniform linking 33917ec681f3Smrg- glsl: fix check for matrices in blocks when using nir uniform linker 33927ec681f3Smrg- glsl: count uniform components and storage better in nir linking 33937ec681f3Smrg- glsl_to_nir: update interface type properly 33947ec681f3Smrg- glsl: fix gl_nir_set_uniform_initializers() for image arrays 33957ec681f3Smrg 33967ec681f3SmrgTimur Kristóf (39): 33977ec681f3Smrg 33987ec681f3Smrg- ac: Handle invalid GFX10 format correctly in ac_get_tbuffer_format. 33997ec681f3Smrg- aco: Make sure not to mistakenly propagate 64-bit constants. 34007ec681f3Smrg- aco: Treat all booleans as per-lane. 34017ec681f3Smrg- aco: Optimize out trivial code from uniform bools. 34027ec681f3Smrg- aco: Fix operand of s_bcnt1_i32_b64 in emit_boolean_reduce. 34037ec681f3Smrg- aco: Remove superfluous argument from emit_boolean_logic. 34047ec681f3Smrg- aco: Remove lower_linear_bool_phi, it is not needed anymore. 34057ec681f3Smrg- aco: Optimize load_subgroup_id to one bit field extract instruction. 34067ec681f3Smrg- aco/wave32: Change uniform bool optimization to work with wave32. 34077ec681f3Smrg- aco/wave32: Replace hardcoded numbers in spiller with wave size. 34087ec681f3Smrg- aco/wave32: Introduce emit_mbcnt which takes wave size into account. 34097ec681f3Smrg- aco/wave32: Add wave size specific opcodes to aco_builder. 34107ec681f3Smrg- aco/wave32: Use lane mask regclass for exec/vcc. 34117ec681f3Smrg- aco/wave32: Fix load_local_invocation_index to support wave32. 34127ec681f3Smrg- aco/wave32: Use wave_size for barrier intrinsic. 34137ec681f3Smrg- aco/wave32: Allow setting the subgroup ballot size to 64-bit. 34147ec681f3Smrg- aco/wave32: Fix reductions. 34157ec681f3Smrg- aco: Fix uniform i2i64. 34167ec681f3Smrg- ac/llvm: Fix ac_build_reduce in wave32 mode. 34177ec681f3Smrg- aco/wave32: Set the definitions of v_cmp instructions to the lane 34187ec681f3Smrg mask. 34197ec681f3Smrg- aco: Implement 64-bit constant propagation. 34207ec681f3Smrg- aco: Allow optimizing vote_all and nir_op_iand. 34217ec681f3Smrg- aco: Don't skip combine_instruction when definitions[1] is used. 34227ec681f3Smrg- aco: Optimize out s_and with exec, when used on uniform bitwise 34237ec681f3Smrg values. 34247ec681f3Smrg- aco: Flip s_cbranch / s_cselect to optimize out an s_not if possible. 34257ec681f3Smrg- nouveau/nvc0: add extern keyword to nvc0_miptree_vtbl. 34267ec681f3Smrg- intel/compiler: Fix array bounds warning on GCC 10. 34277ec681f3Smrg- radeon: Move si_get_pic_param to radeon_vce.c 34287ec681f3Smrg- r600: Move get_pic_param to radeon_vce.c 34297ec681f3Smrg- gallium: Fix a couple of multiple definition warnings. 34307ec681f3Smrg- radeon: Fix multiple definition error with radeon_debug 34317ec681f3Smrg- aco: Fix -Wstringop-overflow warnings in aco_span. 34327ec681f3Smrg- aco: Fix maybe-uninitialized warnings. 34337ec681f3Smrg- aco: Fix signedness compare warning. 34347ec681f3Smrg- aco: Make a better guess at which instructions need the VCC hint. 34357ec681f3Smrg- aco: Transform uniform bitwise instructions to 32-bit if possible. 34367ec681f3Smrg- aco/gfx10: Fix VcmpxExecWARHazard mitigation. 34377ec681f3Smrg- aco: Fix the meaning of is_atomic. 34387ec681f3Smrg- aco/optimizer: Don't combine uniform bool s_and to s_andn2. 34397ec681f3Smrg 34407ec681f3SmrgTomasz Pyra (1): 34417ec681f3Smrg 34427ec681f3Smrg- gallium/swr: Fix arb_transform_feedback2 34437ec681f3Smrg 34447ec681f3SmrgTomeu Vizoso (38): 34457ec681f3Smrg 34467ec681f3Smrg- gitlab-ci: Disable lima jobs 34477ec681f3Smrg- gitlab-ci: Run only LAVA jobs in special-named branches 34487ec681f3Smrg- panfrost: Add checksum fields to SFBD descriptor 34497ec681f3Smrg- panfrost: Set 0x10 bit on mali_shader_meta.unknown2_4 on T720 34507ec681f3Smrg- panfrost: Rework format encoding on SFBD 34517ec681f3Smrg- panfrost: Take into account texture layers in SFBD 34527ec681f3Smrg- panfrost: Decode blend shaders for SFBD 34537ec681f3Smrg- panfrost: Generate polygon list manually for SFBD 34547ec681f3Smrg- panfrost: Print the right zero field 34557ec681f3Smrg- panfrost: Pipe the GPU ID into compiler and disassembler 34567ec681f3Smrg- panfrost: Set depth and stencil for SFBD based on the format 34577ec681f3Smrg- panfrost: Multiply offset_units by 2 34587ec681f3Smrg- panfrost: Make sure the shader descriptor is in sync with the GL 34597ec681f3Smrg state 34607ec681f3Smrg- gitlab-ci: Remove limit on kernel logging 34617ec681f3Smrg- panfrost: Just print tiler fields as-is for Tx20 34627ec681f3Smrg- panfrost: Rework buffers in SFBD 34637ec681f3Smrg- gitlab-ci: Fix dir name for VK-GL-CTS sources 34647ec681f3Smrg- panfrost: Don't print the midgard_blend_rt structs on SFBD 34657ec681f3Smrg- panfrost: Add quirks system to cmdstream 34667ec681f3Smrg- panfrost: Simplify shader patching 34677ec681f3Smrg- panfrost: White list the Mali T720 34687ec681f3Smrg- gitlab-ci: Test Panfrost on T720 GPUs 34697ec681f3Smrg- panfrost: Add PAN_MESA_DEBUG=sync 34707ec681f3Smrg- panfrost: Hold a reference to sampler views 34717ec681f3Smrg- pan/midgard: Remove undefined behavior 34727ec681f3Smrg- nir: Don't copy empty array 34737ec681f3Smrg- util: Don't access members of NULL pointers 34747ec681f3Smrg- panfrost: Don't lose bits! 34757ec681f3Smrg- st/mesa: Don't access members of NULL pointers 34767ec681f3Smrg- panfrost: Handle Z24_UNORM_S8_UINT as MALI_Z32_UNORM 34777ec681f3Smrg- panfrost: Increase PIPE_SHADER_CAP_MAX_OUTPUTS to 16 34787ec681f3Smrg- panfrost: Dynamically allocate array of texture pointers 34797ec681f3Smrg- panfrost: Map with size of first layer for 3D textures 34807ec681f3Smrg- panfrost: Store internal format 34817ec681f3Smrg- gitlab-ci: Update kernel for LAVA to 5.5-rc1 plus fixes 34827ec681f3Smrg- gitlab-ci: Switch LAVA jobs to use shared dEQP runner 34837ec681f3Smrg- gitlab-ci: Upgrade kernel for LAVA jobs to v5.5-rc5 34847ec681f3Smrg- gitlab-ci: Consolidate container and build stages for LAVA 34857ec681f3Smrg 34867ec681f3SmrgUrja Rannikko (4): 34877ec681f3Smrg 34887ec681f3Smrg- panfrost: free last_read/write tables in mir_create_dependency_graph 34897ec681f3Smrg- panfrost: free allocations in schedule_block 34907ec681f3Smrg- panfrost: add lcra_free() to free lcra state 34917ec681f3Smrg- panfrost: free spill cost table in mir_spill_register 34927ec681f3Smrg 34937ec681f3SmrgVasily Khoruzhick (31): 34947ec681f3Smrg 34957ec681f3Smrg- lima: add debug prints for BO cache 34967ec681f3Smrg- lima: align size before trying to fetch BO from cache 34977ec681f3Smrg- lima: ignore flags while looking for BO in cache 34987ec681f3Smrg- lima: set dithering flag when necessary 34997ec681f3Smrg- lima: add support for gl_PointSize 35007ec681f3Smrg- lima: enable tiling 35017ec681f3Smrg- lima: handle DRM_FORMAT_MOD_INVALID in resource_from_handle() 35027ec681f3Smrg- lima: expose tiled format modifier in query_dmabuf_modifiers() 35037ec681f3Smrg- lima: use single BO for GP outputs 35047ec681f3Smrg- lima: drop suballocator 35057ec681f3Smrg- lima: fix allocation of GP outputs storage for indexed draw 35067ec681f3Smrg- lima: postpone PP stream generation 35077ec681f3Smrg- lima: don't reload and redraw tiles that were not updated 35087ec681f3Smrg- lima: fix PP stream terminator size 35097ec681f3Smrg- lima: use linear layout for shared buffers if modifier is not 35107ec681f3Smrg specified 35117ec681f3Smrg- lima: add debug flag to disable tiling 35127ec681f3Smrg- lima: drop support for R8G8B8 format 35137ec681f3Smrg- lima: fix PLBU_CMD_PRIMITIVE_SETUP command 35147ec681f3Smrg- lima: fix viewport clipping 35157ec681f3Smrg- lima: implement polygon offset 35167ec681f3Smrg- lima: fix PIPE_CAP\_\* to mark features that aren't supported yet 35177ec681f3Smrg- lima: add new findings to texture descriptor 35187ec681f3Smrg- lima: fix handling of reverse depth range 35197ec681f3Smrg- ci: lava: pass CI_NODE_INDEX and CI_NODE_TOTAL to lava jobs 35207ec681f3Smrg- ci: Re-enable CI for lima on mali450 35217ec681f3Smrg- lima: implement invalidate_resource() 35227ec681f3Smrg- nir: don't emit ishl in \_nir_mul_imm() if backend doesn't support 35237ec681f3Smrg bitops 35247ec681f3Smrg- lima: use imul for calculations with intrinsic src 35257ec681f3Smrg- lima: ppir: don't delete root ld_tex nodes without successors in 35267ec681f3Smrg current block 35277ec681f3Smrg- lima: ppir: always create move and update ld_tex successors for all 35287ec681f3Smrg blocks 35297ec681f3Smrg- lima: disable early-z if fragment shader uses discard 35307ec681f3Smrg 35317ec681f3SmrgVinson Lee (9): 35327ec681f3Smrg 35337ec681f3Smrg- swr: Fix build with llvm-10.0. 35347ec681f3Smrg- panfrost: Fix gnu-empty-initializer build errors. 35357ec681f3Smrg- scons: Bump C standard to gnu11 on macOS 10.15. 35367ec681f3Smrg- util/u_thread: Restrict u_thread_get_time_nano on macOS. 35377ec681f3Smrg- swr: Fix build with llvm-10.0. 35387ec681f3Smrg- swr: Fix build with llvm-10.0. 35397ec681f3Smrg- lima: Fix build with GCC 10. 35407ec681f3Smrg- swr: Fix GCC 4.9 checks. 35417ec681f3Smrg- panfrost: Remove unused anonymous enum variables. 35427ec681f3Smrg 35437ec681f3SmrgWladimir J. van der Laan (2): 35447ec681f3Smrg 35457ec681f3Smrg- u_vbuf: add logic to use a limited number of vbufs 35467ec681f3Smrg- u_vbuf: use single vertex buffer if it's not possible to have 35477ec681f3Smrg multiple 35487ec681f3Smrg 35497ec681f3SmrgX512 (1): 35507ec681f3Smrg 35517ec681f3Smrg- util/u_thread: Fix build under Haiku 35527ec681f3Smrg 35537ec681f3SmrgYevhenii Kolesnikov (5): 35547ec681f3Smrg 35557ec681f3Smrg- glsl: Enable textureSize for samplerExternalOES 35567ec681f3Smrg- meson: Fix linkage of libgallium_nine with libgalliumvl 35577ec681f3Smrg- meta: Cleanup function for DrawTex 35587ec681f3Smrg- main: allow external textures for BindImageTexture 35597ec681f3Smrg- meta: Add cleanup function for Bitmap 35607ec681f3Smrg 35617ec681f3SmrgZebediah Figura (1): 35627ec681f3Smrg 35637ec681f3Smrg- Revert "draw: revert using correct order for prim decomposition." 35647ec681f3Smrg 35657ec681f3Smrgluc (1): 35667ec681f3Smrg 35677ec681f3Smrg- zink: confused compilation macro usage for zink in target helpers. 3568