17ec681f3SmrgMesa 20.2.0 Release Notes / 2020-09-28 27ec681f3Smrg====================================== 37ec681f3Smrg 47ec681f3SmrgMesa 20.2.0 is a new development release. People who are concerned 57ec681f3Smrgwith stability and reliability should stick with a previous release or 67ec681f3Smrgwait for Mesa 20.2.1. 77ec681f3Smrg 87ec681f3SmrgMesa 20.2.0 implements the OpenGL 4.6 API, but the version reported by 97ec681f3SmrgglGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / 107ec681f3SmrgglGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. 117ec681f3SmrgSome drivers don't support all the features required in OpenGL 4.6. OpenGL 127ec681f3Smrg4.6 is **only** available if requested at context creation. 137ec681f3SmrgCompatibility contexts may report a lower version depending on each driver. 147ec681f3Smrg 157ec681f3SmrgMesa 20.2.0 implements the Vulkan 1.2 API, but the version reported by 167ec681f3Smrgthe apiVersion property of the VkPhysicalDeviceProperties struct 177ec681f3Smrgdepends on the particular driver being used. 187ec681f3Smrg 197ec681f3SmrgSHA256 checksum 207ec681f3Smrg--------------- 217ec681f3Smrg 227ec681f3Smrg:: 237ec681f3Smrg 247ec681f3Smrg 63f0359575d558ef98dd78adffc0df4c66b76964ebf603b778b7004964191d30 mesa-20.2.0.tar.xz 257ec681f3Smrg 267ec681f3Smrg 277ec681f3SmrgNew features 287ec681f3Smrg------------ 297ec681f3Smrg 307ec681f3Smrg- GL_ARB_compute_variable_group_size on Iris. 317ec681f3Smrg 327ec681f3Smrg- GL_ARB_gpu_shader5 on llvmpipe 337ec681f3Smrg 347ec681f3Smrg- GL_ARB_post_depth_coverage on llvmpipe 357ec681f3Smrg 367ec681f3Smrg- GLES 3.2 on llvmpipe 377ec681f3Smrg 387ec681f3Smrg- GL_EXT_shader_group_vote on GLES3. 397ec681f3Smrg 407ec681f3Smrg- GL_EXT_texture_shadow_lod on llvmpipe 417ec681f3Smrg 427ec681f3Smrg- VK_AMD_texture_gather_bias_lod on RADV. 437ec681f3Smrg 447ec681f3Smrg- VK_AMD_gpu_shader_half_float on RADV/ACO. 457ec681f3Smrg 467ec681f3Smrg- VK_AMD_gpu_shader_int16 on RADV/ACO. 477ec681f3Smrg 487ec681f3Smrg- VK_EXT_extended_dynamic_state on ANV and RADV. 497ec681f3Smrg 507ec681f3Smrg- VK_EXT_image_robustness on RADV. 517ec681f3Smrg 527ec681f3Smrg- VK_EXT_private_data on ANV and RADV. 537ec681f3Smrg 547ec681f3Smrg- VK_EXT_custom_border_color on ANV and RADV. 557ec681f3Smrg 567ec681f3Smrg- VK_EXT_pipeline_creation_cache_control on ANV and RADV. 577ec681f3Smrg 587ec681f3Smrg- VK_EXT_shader_demote_to_helper_invocation on RADV/LLVM. 597ec681f3Smrg 607ec681f3Smrg- VK_EXT_subgroup_size_control on RADV/ACO. 617ec681f3Smrg 627ec681f3Smrg- VK_GOOGLE_user_type on ANV and RADV. 637ec681f3Smrg 647ec681f3Smrg- VK_KHR_shader_subgroup_extended_types on RADV/ACO. 657ec681f3Smrg 667ec681f3Smrg- GL_ARB_gl_spirv on nvc0/nir. 677ec681f3Smrg 687ec681f3Smrg- GL_ARB_spirv_extensions on nvc0/nir. 697ec681f3Smrg 707ec681f3Smrg- RADV now uses ACO per default as backend 717ec681f3Smrg 727ec681f3Smrg- RADV_DEBUG=llvm option to enable LLVM backend for RADV 737ec681f3Smrg 747ec681f3Smrg- VK_EXT_image_robustness for ANV 757ec681f3Smrg 767ec681f3Smrg- VK_EXT_shader_atomic_float on ANV 777ec681f3Smrg 787ec681f3Smrg- VK_EXT_4444_formats on ANV and RADV. 797ec681f3Smrg 807ec681f3Smrg- VK_KHR_memory_model on RADV. 817ec681f3Smrg 827ec681f3Smrg- GL 4.5 on llvmpipe 837ec681f3Smrg 847ec681f3Smrg- EGL_KHR_swap_buffers_with_damage on X11 (DRI3) 857ec681f3Smrg 867ec681f3Smrg 877ec681f3SmrgBug fixes 887ec681f3Smrg--------- 897ec681f3Smrg 907ec681f3Smrg- [Regression][Bisected][20.2][radeonsi] American Truck Simulator continually allocates memory until OOM 917ec681f3Smrg- anv: dEQP-VK.robustness.robustness2.* failures on gen12 927ec681f3Smrg- [RADV] Problems reading primitive ID in fragment shader after tessellation 937ec681f3Smrg- Massive memory leak (at least AMD, others unknown) 947ec681f3Smrg- Substance Painter 6.1.3 black glitches on Radeon RX570 957ec681f3Smrg- vkCmdCopyImage broadcasts subsample 0 of MSAA src into all subsamples of dst on RADV 967ec681f3Smrg- Crash in ruvd_end_frame when calling vaBeginPicture/vaEndPicture without rendering anything 977ec681f3Smrg- X-Plane 11 Installer crashes on startup since `glsl: declare gl_Layer/gl_ViewportIndex/gl_ViewportMask as vs builtins` 987ec681f3Smrg- Horizon Zero Dawn graphics corruption with with radv 997ec681f3Smrg- Amber test opt_peel_loop_initial_if: Assertion failed 1007ec681f3Smrg- Dirt Rally: Flickering glitches on certain foliage since Mesa 20.1.0 caused by MSAA 1017ec681f3Smrg- [BRW] WRC 5 asserts with gallium nine and iris. 1027ec681f3Smrg- radv: Corruption in "The Surge 2" 1037ec681f3Smrg- [RADV] Detroit: Become Human Demo game lock-ups with RADV 1047ec681f3Smrg- Road Redemption certain graphic effects rendered white color 1057ec681f3Smrg- vulkan/wsi/x11: deadlock with Xwayland when compositor holds multiple buffers 1067ec681f3Smrg- [RADV/ACO] Death Stranding cause a GPU hung (*ERROR* Waiting for fences timed out!) 1077ec681f3Smrg- lp_bld_init.c:172:7: error: implicit declaration of function ‘LLVMAddConstantPropagationPass’; did you mean ‘LLVMAddCorrelatedValuePropagationPass’? [-Werror=implicit-function-declaration] 1087ec681f3Smrg- Intel Vulkan driver crash with alpha-to-coverage 1097ec681f3Smrg- EGL_KHR_swap_buffers_with_damage support on X11 1107ec681f3Smrg- radv: blitting 3D images with linear filter 1117ec681f3Smrg- [ACO] Compiling pipelines from RPCS3's shader interpreter spins forever in ACO code 1127ec681f3Smrg- Intel Vulkan driver assertion with small xfb buffer 1137ec681f3Smrg- [spirv-fuzz] SPIR-V parsing failed "src->type->type == dest->type->type" 1147ec681f3Smrg- radeonsi: radeonsi crashes in Chrome on chromeos 1157ec681f3Smrg- [RADV] commit d19bc94e4eb94 broke gamescope with Navi 1167ec681f3Smrg- 4e3a7dcf6ee4946c46ae8b35e7883a49859ef6fb breaks Gamescope showing windows properly. 1177ec681f3Smrg- anv: crashes in CTS test dEQP-VK.subgroups.*.framebuffer.*_tess_eval 1187ec681f3Smrg- Intel Vuikan (anv) crash in copy_non_dynamic_state() when using validation layer 1197ec681f3Smrg- Mafia 3: Trees get rendered incorrectly 1207ec681f3Smrg- radv: dEQP-VK.synchronization.op.multi_queue.timeline_semaphore.write_clear_attachments_*_concurrent fail when forcing DCC. 1217ec681f3Smrg- Crash on GTA 5 through proton 5.0.9 and GE versions 1227ec681f3Smrg- Mesa 20.2.0-rc1 fails to build for AMD 1237ec681f3Smrg- Assertion failure compiling shader from Zigguart 1247ec681f3Smrg- Panfrost locks for waiting fence when running Source engine games 1257ec681f3Smrg- ci: `-Dtools=panfrost` should be build-tested 1267ec681f3Smrg- panfrost: Register allocation fails for Firefox WebRender shaders 1277ec681f3Smrg- VRAM leak with vuilkan external memory + opengl memory objects 1287ec681f3Smrg- [vulkan/build] Recent build system changes made VK_EXT_acquire_xlib_display unnecessarily depend on GBM 1297ec681f3Smrg- ci: Capture devcoredumps on chezas 1307ec681f3Smrg- Possible array out of bounds in brw_vec4_nir.cpp 1317ec681f3Smrg- freedreno/a6xx: incorrect rendering in asphalt 9 1327ec681f3Smrg- [tgl][bisected][regression][iris] failure on dEQP-EGL.functional.wide_color.pbuffer_8888_colorspace_default 1337ec681f3Smrg- Multiply defined symbols compiling with gcc@10.1.0 1347ec681f3Smrg- shrinking descriptor pool on intel+vulkan 1357ec681f3Smrg- dEQP-VK.renderpass2.dedicated_allocation.attachment.1.12 fails on NAVI14 1367ec681f3Smrg- turnip: binning and indirect dependency 1377ec681f3Smrg- Amber test leads to NIR validation failed after nir_opt_if (on spirv-fuzz shader) 1387ec681f3Smrg- Unable to compile mesa-git from b559d26c 1397ec681f3Smrg- Ambient light too bright with ACO in AC: Odyssey 1407ec681f3Smrg- Multiple issues with Detroit Become Human 1417ec681f3Smrg- ci: Capture artifacts in baremetal mode 1427ec681f3Smrg- turnip/ir3: fine derivatives 1437ec681f3Smrg- panfrost: regression: Major stuttering and low compositor FPS with glmark2 1447ec681f3Smrg- khr_debug-push-pop-group_gl: ../src/util/simple_mtx.h:86: simple_mtx_lock: Assertion \`c != _SIMPLE_MTX_INVALID_VALUE' failed. 1457ec681f3Smrg- freedreno/a6xx: skai/skqp fails 1467ec681f3Smrg- SPIR-V parsing fails in src/compiler/spirv/spirv_to_nir.c 1477ec681f3Smrg- SPIR-V parsing fails in src/compiler/spirv/vtn_cfg.c 1487ec681f3Smrg- Weird GLSL bug 1497ec681f3Smrg- iris driver is broken in Freedesktop 19.08 1507ec681f3Smrg- LLVM not properly shutdown in `si_pipe.c`? 1517ec681f3Smrg- Panfrost: add current status to docs/features.txt 1527ec681f3Smrg- Opengl incorrect rendering on yuzu Amd 1537ec681f3Smrg- RADV: VK_ACCESS_MEMORY_READ/WRITE_BIT is not implemented 1547ec681f3Smrg- [bisected][regression][all platforms] multiple deqp-gles31/glescts/piglit failures 1557ec681f3Smrg- 7406ea37, "ac/surface: require that gfx8 doesn't have DCC in order to be displayable", breaks Gamescope being able to launch games on RX580, and possibly other gfx8 cards 1567ec681f3Smrg- vkGetSemaphoreCounterValue doesn't update without vkWaitSemaphores calls on Intel UHD 620 1577ec681f3Smrg- [RADV] System crash when playing XCOM Chimera Squad because of commit #7a5e6fd2 1587ec681f3Smrg- [RADV] Non-precise occlusion queries return non-zero when all fragments are discarded 1597ec681f3Smrg- [DXVK] Project Cars rendering problems 1607ec681f3Smrg- ADDRLIB ODR Violation 1617ec681f3Smrg- Build fails with current mesa from git "undefinierter Verweis auf »nir_lower_clip_disable«" 1627ec681f3Smrg- KDE Compositor stuttering after Check for window destruction in dri3_wait_for_event_locked 1637ec681f3Smrg- Add fallthrough to prevent errors caused by missing break 1647ec681f3Smrg- i965/20.1: gray rendering with torcs racing 1657ec681f3Smrg- glBindBufferRange call seems to be ignored by one of two shader-programs on radeon cards 1667ec681f3Smrg- [bisected][g33] piglit.spec.ext_framebuffer_object.fbo-cubemap failure 1677ec681f3Smrg- Increase GL_MAX_COMPUTE_SHADER_STORAGE_BLOCKS to greater value. 1687ec681f3Smrg- nir: st_nir_lower_builtin fails for gl_LightSource[i] 1697ec681f3Smrg- Sometimes VLC player process gets stuck in memory after closure if video output used is Auto or OpenGL 1707ec681f3Smrg- Double unlock in rbug_context.c 1717ec681f3Smrg- Double copy for TexSubImage 1727ec681f3Smrg- [v3d] corruption when GS omits some vertices 1737ec681f3Smrg- Iris crashes when reading from multisampled front buffer on platforms without front buffer 1747ec681f3Smrg- freedreno: subway surfers crash when repeatedly toggling fullscreen 1757ec681f3Smrg- [RADV/GFX8] Performance drop in DOOM Eternal when "Present from compute" is enabled 1767ec681f3Smrg- freedreno: multiple applications crash on a5xx 1777ec681f3Smrg- Use-after-free crash innv50_ir::GCRA::RIG_Node::init() 1787ec681f3Smrg- intel: Sample mask writes need to be honored in Vulkan 1797ec681f3Smrg- [RADV] - Path of Exile (238960) - Map outline, landscape and markers are missing with the Vulkan renderer. 1807ec681f3Smrg- ASTC texture decompression fails when using software fallback 1817ec681f3Smrg- [i965][iris][regression][bisected] multiple piglit and glcts failures on all platforms 1827ec681f3Smrg- please publish GPG keyring used to sign new releases 1837ec681f3Smrg- [BISECTED] compiling shader causes crash 1847ec681f3Smrg- Missing render Information on Stellaris 1857ec681f3Smrg- freedreno/ir3: allow copy-propagate from array 1867ec681f3Smrg- Zink + GALLIUM_HUD SIGSEGV 1877ec681f3Smrg- piglit spec@egl_ext_device_base@conformance fails LLVM 11 Git assertion since "llvmpipe/fs: add caching support" 1887ec681f3Smrg- llvmpipe: 1x1 framebuffer with a 2x2 viewport 1897ec681f3Smrg- [regression] nir build failure 1907ec681f3Smrg- ci: need to end baremetal tests after kernel panic/instaboot 1917ec681f3Smrg- If-statement body is executed for false condition 1927ec681f3Smrg- freedreno/a6xx: broken rendering in playcanvas "after the flood" 1937ec681f3Smrg- [regression] performance drop on Dota 2, CS:GO, and gfxbench GL benchmarks on ICL/Iris 1947ec681f3Smrg- [amd] C++ ODR violatation for union GB_ADDR_CONFIG 1957ec681f3Smrg- Zink reports incorrect amount of video memory 1967ec681f3Smrg- [RADV/LLVM]: void llvm::ICmpInst::AssertOK(): Assertion \`getOperand(0)->getType() == getOperand(1)->getType() && "Both operands to ICmp instruction are not of the same type!"' failed. 1977ec681f3Smrg- glsl-1.50-gs-max-output hangs on Navi10 + NGG 1987ec681f3Smrg- anv: Runs out of binding tables with PPSSPP during long runs 1997ec681f3Smrg- Segfault in Panfrost with waypipe 2007ec681f3Smrg- ci: Use rsync instead of rm -rf ; cp for baremetal rootfs 2017ec681f3Smrg- i965: Rendering problems replaying a trace of "Refunct" after mesa-20.1.0-rc1 release [bisected] 2027ec681f3Smrg- Panfrost (rk3399 NanoPi M4) hang/crash on playing video on Kodi/X11 2037ec681f3Smrg- gallium/winsys/radeon/drm fails assertion on 32bit 2047ec681f3Smrg- NIR validation failed after glsl to nir, before function inline, wrong {src,dst}->type ? 2057ec681f3Smrg- nir/spirv asin() function not precise enough 2067ec681f3Smrg- Mesa 20.0.7 / 20.1.0-rc4 regression, extremally long shader compilation time in NIR 2077ec681f3Smrg- Android build error after 689acc73 2087ec681f3Smrg- freedreno/a6xx: gpu hangs in google earth 2097ec681f3Smrg- Mesa-git build fails on Fedora Rawhide 2107ec681f3Smrg- Doom Eternal 1.1 performs very poorly on RADV 2117ec681f3Smrg- iris/i965: possible regression in 20.0.5 due to changes in buffer manager sharing across screens (firefox/mozilla#1634213) 2127ec681f3Smrg- iris/i965: possible regression in 20.0.5 due to changes in buffer manager sharing across screens (firefox/mozilla#1634213) 2137ec681f3Smrg- Incorrect _NetBSD__ macro inside execmem.c 2147ec681f3Smrg- Possible invalid sizeof in device.c 2157ec681f3Smrg- YUV FP16 lowering validation failing 2167ec681f3Smrg- GLSL compiler assertion is_float() failed in glsl/ir_validate.cpp, visit_leave on specific WebGL shader 2177ec681f3Smrg- [RADV] - Doom Eternal (782330) & Metro Exodus (412020) - Title requires 'RADV_DEBUG=zerovram' to eliminate colorful graphical aberrations. 2187ec681f3Smrg- [RADV] - Doom Eternal (782330) & Metro Exodus (412020) - Title requires 'RADV_DEBUG=zerovram' to eliminate colorful graphical aberrations. 2197ec681f3Smrg- mesa trunk master vulkan overlay-layer meson.build warning empty configuration_data() object 2207ec681f3Smrg- [meson] increase minimum required version 2217ec681f3Smrg- Kicad fails to render 3D PCB models. 2227ec681f3Smrg- freedreno: minetest: alpha channel issue on a6xx 2237ec681f3Smrg- Reproduceable i915 gpu hang Intel Iris Plus Graphics (Ice Lake 8x8 GT2) 2247ec681f3Smrg- 7 Days to Die - "Reflection Quality" setting broken, results in environment rendered black 2257ec681f3Smrg- glsl: regression affecting shader compilation time 2267ec681f3Smrg- freedreno: glamor issue with x11 desktops 2277ec681f3Smrg- finish converting from fnv1a to xxhash 2287ec681f3Smrg- Hang in iris_dri in kitty 2297ec681f3Smrg- Setting twice value to output_stream in radv_nir_to_llvm.c 2307ec681f3Smrg- Overwriting value of `jit_tex->sample_stride` in lp_setup.c 2317ec681f3Smrg- [AMDGPU][OpenGL] apitrace of kernel/firmware crash that requires a reboot 2327ec681f3Smrg- Flickering in Superposition benchmark 2337ec681f3Smrg- Double lock in fbobject.c 2347ec681f3Smrg- Possible typo in aco_insert_waitcnt.cpp 2357ec681f3Smrg- [bisected] Steam crashes when newest Iris built with LTO 2367ec681f3Smrg- Freeing null pointer inside radv_amdgpu_cs.c 2377ec681f3Smrg- Duplicated sub expression in radv_nir_to_llvm.c 2387ec681f3Smrg- i965/vec4: opt_cse_local cause the out of bound array access 2397ec681f3Smrg- NIR: Regression on shader using 8/16-bit integers 2407ec681f3Smrg- ACO: Compiler segfault on 8/16-bit integers. 2417ec681f3Smrg- lp_bld_intr.c:70:16: error: use of undeclared identifier 'LLVMFixedVectorTypeKind'; did you mean 'LLVMVectorTypeKind'? 2427ec681f3Smrg- recent seqno changes causing surfaceflinger crash 2437ec681f3Smrg- [radeonsi] [glthread] Crash with glthread enabled 2447ec681f3Smrg- Deadlock in anv_timelines_wait() 2457ec681f3Smrg- [gles3] supertuxkart: some textures are incorrect 2467ec681f3Smrg- post_version.py does not work with release candidates 2477ec681f3Smrg- post_version.py does not work with release candidates 2487ec681f3Smrg- radv regression on android 2497ec681f3Smrg- ogl: Set mesa_glthread=true as default on the RPCS3 emulator 2507ec681f3Smrg- [iris] android deqp dEQP-EGL.functional.robustness.negative_context#invalid_notification_strategy_enum fails 2517ec681f3Smrg- zink: conditional rendering 2527ec681f3Smrg- [RadeonSI] Glitches on VEGA8 + RX 560X after MR 4863 2537ec681f3Smrg- RadeonSI OpenGL broken for GFX8 after unify code for overriding offset 2547ec681f3Smrg- freedreno/turnip: Don't request fragcoord components we don't use 2557ec681f3Smrg- Make check fails in ANV 2567ec681f3Smrg- src\util\meson.build:294:4: ERROR: Program or command 'winepath' not found or not executable 2577ec681f3Smrg- Please add Zink to features.txt 2587ec681f3Smrg- llvmpipe: assert triggers in LLVM 2597ec681f3Smrg- debug builds are massively broken on Windows 2607ec681f3Smrg- ci: Report flakes on IRC from baremetal tests 2617ec681f3Smrg- heavy glitches on amd ryzen 5 since version 20.x 2627ec681f3Smrg- zink asserts with 32-bit boolean 2637ec681f3Smrg- OpenGL: Surviving Mars black screen late-game (possible shader problem) 2647ec681f3Smrg- Kerbal Space Program (KSP) hangs entire Navi system 2657ec681f3Smrg- Dirt: Showdown bad performance and broken rendering with enabled advanced lightning 2667ec681f3Smrg- gravit & Firefox WebGL broken since 3dc2ccc14c0e035368fea6ae3cce8c481f3c4ad2 "ac/surface: replace RADEON_SURF_OPTIMIZE_FOR_SPACE with !FORCE_SWIZZLE_MODE" 2677ec681f3Smrg- mesa 20.0.5 causing kitty to crash 2687ec681f3Smrg- radeonsi: "Torchlight II" trace showing regression on mesa-20.0.6 [bisected] 2697ec681f3Smrg- [RADV/LLVM/ACO/Regression] After mesa commit a3dc7fffbb7be0f1b2ac478b16d3acc5662dff66 all games stucks at start 2707ec681f3Smrg- Android building error after commit 2ab45f41 2717ec681f3Smrg- freedreno/a6xx: pubg rendering glitches 2727ec681f3Smrg- iris: Crash when trying to capture window in OBS Studio 2737ec681f3Smrg- lp_test_format failure with llvm-11 2747ec681f3Smrg 2757ec681f3Smrg 2767ec681f3SmrgChanges 2777ec681f3Smrg------- 2787ec681f3Smrg 2797ec681f3SmrgAbhishek Kumar (1): 2807ec681f3Smrg 2817ec681f3Smrg- egl: Limit the EGL ver for android 2827ec681f3Smrg 2837ec681f3SmrgAdam Jackson (1): 2847ec681f3Smrg 2857ec681f3Smrg- glx: Fix build and warnings with -Dglx=dri -Dglx-direct=false 2867ec681f3Smrg 2877ec681f3SmrgAlejandro Piñeiro (9): 2887ec681f3Smrg 2897ec681f3Smrg- v3d/tex: only look up the 2nd texture gather offset for 1d non-arrays 2907ec681f3Smrg- v3d/tex: set up default values for Configuration Parameter 1 if possible 2917ec681f3Smrg- v3d/tex: use TMUSLOD register if possible 2927ec681f3Smrg- v3d: moving v3d simulator to src/broadcom 2937ec681f3Smrg- v3d/tex: handle correctly coordinates for cube/cubearrays images 2947ec681f3Smrg- vulkan/util: add struct vk_pipeline_cache_header 2957ec681f3Smrg- nir/lower_tex: handle query lod with nir_lower_tex_packing_16 at lower_tex_packing 2967ec681f3Smrg- v3d/packet: fix typo on Set InstanceID/PrimitiveID packet 2977ec681f3Smrg- v3d: set instance id to 0 at start of tile 2987ec681f3Smrg 2997ec681f3SmrgAlyssa Rosenzweig (475): 3007ec681f3Smrg 3017ec681f3Smrg- pan/mdg: Track more types 3027ec681f3Smrg- pan/mdg: Be a bit more pedantic in invert passes 3037ec681f3Smrg- panfrost: Enumify bifrost blend types 3047ec681f3Smrg- pan/bi: Add texture indices to IR 3057ec681f3Smrg- pan/bi: Pipe multiple textures through 3067ec681f3Smrg- pan/bi: Pack round opcodes (FMA, either 16 or 32) 3077ec681f3Smrg- pan/bit: Add framework forinterpreting double vs float 3087ec681f3Smrg- pan/bit: Interpret ROUND 3097ec681f3Smrg- pan/bit: Add round tests 3107ec681f3Smrg- panfrost: Fix texture field size 3117ec681f3Smrg- panfrost: Fix size of bifrost sampler descriptor 3127ec681f3Smrg- panfrost: Fix sampler wrap/filter field orders 3137ec681f3Smrg- panfrost: Fix norm coords on bifrost sampler 3147ec681f3Smrg- panfrost: Fix tiled texture "stride"s on Bifrost 3157ec681f3Smrg- pan/decode: Don't crash on missing payload 3167ec681f3Smrg- pan/bi: Enable lower_mediump_outputs NIR pass 3177ec681f3Smrg- panfrost: Update Bifrost fields in mali_shader_meta 3187ec681f3Smrg- pan/bi: Lower for now sincos 3197ec681f3Smrg- pan/mdg: Ingest actual isub ops 3207ec681f3Smrg- pan/mdg: Rename .one to .sat_signed 3217ec681f3Smrg- pan/mdg: Move constant switch opts to algebraic pass 3227ec681f3Smrg- pan/mdg: Drop forever todo 3237ec681f3Smrg- pan/mdg: Drop `opt` in name of midgard_opt_cull_dead_branch 3247ec681f3Smrg- pan/mdg: Enable nir_opt_algebraic_distribute_src_mods 3257ec681f3Smrg- panfrost: Update dEQP expectation list 3267ec681f3Smrg- panfrost: Setup gl_FragCoord as sysval on Bifrost 3277ec681f3Smrg- pan/bi: Add clause type for gl_FragCoord.zw load 3287ec681f3Smrg- pan/bi: Abort on unknown op packing 3297ec681f3Smrg- pan/bi: Abort on unhandled intrinsics 3307ec681f3Smrg- pan/bi: Futureproof COMBINE lowering against non-u32 3317ec681f3Smrg- pan/bi: Print bad instruction on src packing fail 3327ec681f3Smrg- pan/bi: Passthrough direct ld_var addresses 3337ec681f3Smrg- pan/bi: Lower gl_FragCoord 3347ec681f3Smrg- pan/bi: Set clause type for gl_FragCoord.z 3357ec681f3Smrg- pan/bi: Fix double-abs flipping 3367ec681f3Smrg- pan/bi: Fix missing swizzle 3377ec681f3Smrg- pan/bi: Fix incorrectly flipped swizzle 3387ec681f3Smrg- pan/bi: Disable CSEL4 emit for now 3397ec681f3Smrg- pan/bi: Fix DISCARD ops in disasm 3407ec681f3Smrg- pan/bi: Structify DISCARD 3417ec681f3Smrg- pan/bi: Remove BI_GENERIC 3427ec681f3Smrg- pan/bi: Unwrap BRANCH into CONDITIONAL class 3437ec681f3Smrg- pan/bi: Handle discard_if in NIR->BIR naively 3447ec681f3Smrg- pan/bi: Emit discard (not if) 3457ec681f3Smrg- pan/bi: Add float-only mode to condition fusing 3467ec681f3Smrg- pan/bi: Fuse conditions into discard_if 3477ec681f3Smrg- pan/bi: Handle discard/branch in get_component_count 3487ec681f3Smrg- pan/bi: Pack ADD.DISCARD 3497ec681f3Smrg- pan/bi: Structify ADD ICMP 16 3507ec681f3Smrg- pan/bi: Pack ADD ICMP 32 3517ec681f3Smrg- pan/bi: Pack ADD ICMP 16 3527ec681f3Smrg- pan/bi: Don't pack ICMP on FMA 3537ec681f3Smrg- pan/bit: Add swizzles to round tests 3547ec681f3Smrg- pan/bit: Add more 16-bit fmod tests 3557ec681f3Smrg- pan/bit: Add ICMP tests 3567ec681f3Smrg- pan/bi: Rename BI_ISUB to BI_IMATH 3577ec681f3Smrg- pan/bi: Use IMATH for nir_op_iadd 3587ec681f3Smrg- pan/bi: Pack FMA IADD/ISUB 32 3597ec681f3Smrg- pan/bi: Pack ADD IADD/ISUB for 8/16/32 3607ec681f3Smrg- pan/bi: Add SUB.v2i16/SUB.v4i8 opcodes to disasm 3617ec681f3Smrg- pan/bi: Don't schedule <32-bit IMATH to FMA 3627ec681f3Smrg- pan/bit: Interpret IMATH 3637ec681f3Smrg- pan/bit: Interpret v4i8 ops 3647ec681f3Smrg- pan/bit: Remove test names 3657ec681f3Smrg- pan/bit: Use swizzle helper for round 3667ec681f3Smrg- pan/bit: Factor out identity swizzle helper 3677ec681f3Smrg- pan/bit: Add IMATH packing tests 3687ec681f3Smrg- pan/decode: Fix flags_hi printing 3697ec681f3Smrg- pan/mdg: Explain helper invocations dataflow theory 3707ec681f3Smrg- pan/mdg: Analyze helper invocation termination 3717ec681f3Smrg- pan/mdg: Analyze helper execution requirements 3727ec681f3Smrg- pan/mdg: Use the helper invo analyze passes 3737ec681f3Smrg- pan/mdg: Use analysis to set .cont/.last flags 3747ec681f3Smrg- pan/mdg: Remove texture_op_count 3757ec681f3Smrg- pan/mdg: Set types for derivatives 3767ec681f3Smrg- pan/mdg: Fix derivative swizzle 3777ec681f3Smrg- panfrost: Run dEQP-GLES3.functional.shaders.derivate.* on CI 3787ec681f3Smrg- pan/decode: Use a page table for tracking mmaps 3797ec681f3Smrg- pan/decode: Fix min/max_tile_coord mixup 3807ec681f3Smrg- pan/mfbd: Add format codes for PIPE_FORMAT_B5G5R5A1_UNORM 3817ec681f3Smrg- panfrost: Switch formats to table 3827ec681f3Smrg- panfrost: Fix Z24 vs Z32 mixup 3837ec681f3Smrg- panfrost: Enable AFBC for Z24X8 3847ec681f3Smrg- nir: Add fsat_signed opcode 3857ec681f3Smrg- nir: Add fclamp_pos opcode 3867ec681f3Smrg- panfrost: Add modifier detection helpers 3877ec681f3Smrg- pan/mdg: Remove .pos propagation pass 3887ec681f3Smrg- pan/mdg: Drop nir_lower_to_source_mods 3897ec681f3Smrg- pan/mdg: Prepare for modifier helpers 3907ec681f3Smrg- pan/mdg: Ingest fsat_signed/fclamp_pos 3917ec681f3Smrg- pan/mdg: Apply abs/neg modifiers 3927ec681f3Smrg- pan/mdg: Treat inot as a modifier 3937ec681f3Smrg- pan/mdg: Remove invert optimizations 3947ec681f3Smrg- pan/mdg: Use helpers for branch/discard inversion 3957ec681f3Smrg- pan/mdg: Apply outmods 3967ec681f3Smrg- pan/mdg: Emit fcsel when beneficial 3977ec681f3Smrg- pan/mdg: Optimize pipelining logic 3987ec681f3Smrg- pan/mdg: Precompute mir_special_index 3997ec681f3Smrg- pan/mdg: Optimize liveness computation in DCE 4007ec681f3Smrg- pan/mdg: Handle comparisons in fp16 path 4017ec681f3Smrg- pan/mdg: Fix constant combining crash 4027ec681f3Smrg- pan/mdg: Remove mir_*size routines 4037ec681f3Smrg- pan/mdg: Remove mir_get_alu_src 4047ec681f3Smrg- pan/mdg: Include more types 4057ec681f3Smrg- pan/mdg: Handle dest up/lower correctly with swizzles 4067ec681f3Smrg- pan/mdg: Respect !32-bit sizes in RA 4077ec681f3Smrg- pan/mdg: Explain ld/st sign/zero extension 4087ec681f3Smrg- pan/mdg: Add abs/neg/shift modifiers to IR 4097ec681f3Smrg- pan/mdg: Use src_types to determine size in scheduling 4107ec681f3Smrg- pan/mdg: Use type to determine triviality of a move 4117ec681f3Smrg- pan/mdg: Identify scalar integer mods 4127ec681f3Smrg- pan/mdg: Promote imov to fmov on a NIR level 4137ec681f3Smrg- pan/mdg: Remove promote_float pass 4147ec681f3Smrg- pan/mdg: Defer modifier packing until emit time 4157ec681f3Smrg- pan/mdg: Remove redundant redundancy 4167ec681f3Smrg- pan/mdg: Streamline dest_override handling 4177ec681f3Smrg- pan/mdg: Implement b2f16 4187ec681f3Smrg- pan/mdg: Don't generate conversions for fp16 LUTs 4197ec681f3Smrg- pan/mdg: Ignore dest.type when offseting load swizzle 4207ec681f3Smrg- pan/lcra: Remove unused alignment parameters 4217ec681f3Smrg- pan/lcra: Allow per-variable bounds to be set 4227ec681f3Smrg- pan/mdg: Use type size to determine alignment 4237ec681f3Smrg- pan/mdg: Eliminate load_64 4247ec681f3Smrg- pan/mdg: Set RA bounds for fp16 4257ec681f3Smrg- pan/mdg: Print mask when dest=0 4267ec681f3Smrg- pan/mdg: Round up bytemasks when spilling 4277ec681f3Smrg- pan/mdg: Print constant vectors less wrong 4287ec681f3Smrg- pan/mdg: Factor out mir_adjust_constant 4297ec681f3Smrg- pan/mdg: Only combine 16-bit constants to lower half 4307ec681f3Smrg- pan/mdg: Separately pack constants to the upper half 4317ec681f3Smrg- pan/mdg: Fix type checking issues with compute 4327ec681f3Smrg- pan/mdg: Pack barriers correctly 4337ec681f3Smrg- pan/mdg: Use shifts instead of division for RA sizes 4347ec681f3Smrg- pan/mdg: Implement vector constant printing for 8-bit 4357ec681f3Smrg- pan/mdg: Implement condense_writemask for 8-bit 4367ec681f3Smrg- pan/mdg: Pack 8-bit swizzles in 16-bit ops 4377ec681f3Smrg- panfrost: Guard experimental fp16 behind debug flag 4387ec681f3Smrg- panfrost: Keep cached BOs mmap'd 4397ec681f3Smrg- panfrost: Remove deadcode 4407ec681f3Smrg- panfrost: Fill in SCALED formats to format table 4417ec681f3Smrg- panfrost: Don't set PIPE_CAP_VERTEX_BUFFER_STRIDE_4BYTE_ALIGNED_ONLY 4427ec681f3Smrg- panfrost: Don't zero staging buffer for tiling 4437ec681f3Smrg- panfrost: Allow bpp24 tiling 4447ec681f3Smrg- panfrost: Allow tiling on RECT textures 4457ec681f3Smrg- panfrost: Limit blend shader work count 4467ec681f3Smrg- panfrost: Remove dated comment about leaks 4477ec681f3Smrg- panfrost: Disable tib read/write when colourmask = 0x0 4487ec681f3Smrg- panfrost: Avoid redundant shader executions with mask=0x0 4497ec681f3Smrg- panfrost: Don't set CAN_DISCARD for MFBD 4507ec681f3Smrg- panfrost: Fix transform feedback types 4517ec681f3Smrg- pan/mdg: Cleanup comments that look like division 4527ec681f3Smrg- pan/mdg: Eliminate expand_writemask division 4537ec681f3Smrg- pan/mdg: Eliminate 64-bit swizzle packing division 4547ec681f3Smrg- pan/mdg: Avoid division in printing helpers 4557ec681f3Smrg- pan/mdg: Eliminate remaining divisions from compiler 4567ec681f3Smrg- panfrost: Fix dated comment 4577ec681f3Smrg- panfrost: Use _mesa_roundevenf when packing clear colours 4587ec681f3Smrg- panfrost: Handle !independent_blend for blend shaders 4597ec681f3Smrg- pan/mdg: Add pack_colour_32 opcode 4607ec681f3Smrg- pan/mdg: Lower shifts to 32-bit 4617ec681f3Smrg- pan/mdg: Ensure we don't DCE into impossible masks 4627ec681f3Smrg- pan/mdg: Allow DCE on ld_color_buffer masks 4637ec681f3Smrg- panfrost: Add debug print before query flushes 4647ec681f3Smrg- panfrost: Only run batch debug when specifically asked 4657ec681f3Smrg- nir: Add un/pack_32_4x8 opcodes 4667ec681f3Smrg- util: Add SATURATE macro 4677ec681f3Smrg- util/format: Use SATURATE 4687ec681f3Smrg- mesa: Use SATURATE 4697ec681f3Smrg- mesa/swrast: Use SATURATE 4707ec681f3Smrg- gallium/draw: Use SATURATE 4717ec681f3Smrg- glsl: Use SATURATE 4727ec681f3Smrg- panfrost: Use SATURATE 4737ec681f3Smrg- softpipe: Use SATURATE 4747ec681f3Smrg- intel: Use SATURATE 4757ec681f3Smrg- i965: Use SATURATE 4767ec681f3Smrg- iris: Use SATURATE 4777ec681f3Smrg- etnaviv: Use SATURATE 4787ec681f3Smrg- nouveau: Use SATURATE 4797ec681f3Smrg- pan/decode: Fix unused variable warning 4807ec681f3Smrg- pan/decode: Fix tiler warning 4817ec681f3Smrg- pan/decode: Dump missing field on Bifrost 4827ec681f3Smrg- pan/decode: Dump unknown2 4837ec681f3Smrg- panfrost: Fix Bifrost blending with depth-only FBO 4847ec681f3Smrg- panfrost: Adjust null_rt for Bifrost 4857ec681f3Smrg- panfrost: Tweak zsbuf magic numbers for Bifrost 4867ec681f3Smrg- panfrost: Tweak Bifrost colour buffer magic 4877ec681f3Smrg- panfrost: Force Z/S tiling on Bifrost 4887ec681f3Smrg- panfrost: Share MRT blend flag calculation with Bifrost 4897ec681f3Smrg- panfrost: Set unk2 to accomodate blending 4907ec681f3Smrg- panfrost: Identify Bifrost texture format swizzle 4917ec681f3Smrg- panfrost: Ensure nonlinear strides are 16-aligned 4927ec681f3Smrg- panfrost: Document Midgard Inf/NaN suppress bit 4937ec681f3Smrg- panfrost: Add defines for bifrost unk1 flags 4947ec681f3Smrg- panfrost: Identify MALI_BIFROST_EARLY_Z flag 4957ec681f3Smrg- panfrost: Set MALI_BIFROST_EARLY_Z as necessary 4967ec681f3Smrg- pan/decode: Decode Bifrost shader flags 4977ec681f3Smrg- pan/bi: Add TEX.vtx opcode for vertex texturing 4987ec681f3Smrg- pan/bi: Also add compact vertex texturing 4997ec681f3Smrg- pan/bi: Document compute_lod bit for compact tex 5007ec681f3Smrg- pan/bi: Allow vertex txl with lod=0 as compact 5017ec681f3Smrg- pan/bi: Add f16 TEXC.vtx op 5027ec681f3Smrg- pan/bi: Pack compact vertex texturing 5037ec681f3Smrg- pan/bi: Add CSEL.16 packing tests 5047ec681f3Smrg- pan/bi: Suppress inf/nan for now 5057ec681f3Smrg- panfrost: Don't generate gl_FragCoord varying on Bifrost 5067ec681f3Smrg- panfrost: Set reads_frag_coord as a sysval 5077ec681f3Smrg- panfrost: Preload gl_FragCoord on Bifrost 5087ec681f3Smrg- pan/bi: Remove FMA? parameter from get_src 5097ec681f3Smrg- pan/bi: Remove comment about old scheduler design 5107ec681f3Smrg- pan/bi: Move bi_registers to common IR structures 5117ec681f3Smrg- pan/bi: Move bi_registers to bi_bundle 5127ec681f3Smrg- pan/bi: Drop `struct` from bi_registers 5137ec681f3Smrg- pan/bi: Add FILE* argument to bi_print_registers 5147ec681f3Smrg- pan/bi: Move bi_flip_ports out of port assignment 5157ec681f3Smrg- pan/bi: Document constant count invariant 5167ec681f3Smrg- pan/bi: Disassemble pos=0xe 5177ec681f3Smrg- pan/bi: Add MUL.i32 to disasm 5187ec681f3Smrg- pan/bi: Remove more artefacts of 2-pass scheduling 5197ec681f3Smrg- pan/bi: Add bi_layout.c for clause layout helpers 5207ec681f3Smrg- pan/bi: Add helper to measure clause size 5217ec681f3Smrg- pan/bi: Remove schedule_barrier 5227ec681f3Smrg- pan/bi: Allow printing branches without targets 5237ec681f3Smrg- pan/bi: Fix emit_if successor assignment 5247ec681f3Smrg- pan/bi: Only rewrite COMBINE dest if not SSA 5257ec681f3Smrg- pan/bi: Fix CONVERT component counting 5267ec681f3Smrg- pan/bi: Fix branch condition typesize 5277ec681f3Smrg- pan/bi: Passthrough ZERO in branch packing 5287ec681f3Smrg- pan/bi: Add branch constant field to IR 5297ec681f3Smrg- pan/bi: Pack branch offset constants 5307ec681f3Smrg- pan/bi: Set branch_constant if there is a branch 5317ec681f3Smrg- pan/bi: Assign constant port for branch offsets 5327ec681f3Smrg- pan/bi: Preliminary branch packing 5337ec681f3Smrg- pan/bi: Link clauses back to their blocks 5347ec681f3Smrg- pan/bi: Add bi_foreach_clause_in_block_from{_rev} helpers 5357ec681f3Smrg- pan/bi: Measure distance between blocks 5367ec681f3Smrg- pan/bi: Pack proper clause offsets 5377ec681f3Smrg- pan/bi: Set branch_conditional if b2b is set 5387ec681f3Smrg- pan/bi: Set back-to-back bit more accurately 5397ec681f3Smrg- pan/bi: Set branch conditional bit 5407ec681f3Smrg- pan/bi: Pack unconditional branch 5417ec681f3Smrg- pan/bi: Defer block naming until after emit 5427ec681f3Smrg- pan/bi: Add bi_foreach_block_from_rev helper 5437ec681f3Smrg- pan/bi: Measure backwards branches as well 5447ec681f3Smrg- pan/bi: Allow two successors in header packing 5457ec681f3Smrg- pan/bi: Passthrough deps of the branch target 5467ec681f3Smrg- panfrost: Disable QUAD_STRIP/POLYGON on Bifrost 5477ec681f3Smrg- panfrost: Add GPU IDs for G31/G52 5487ec681f3Smrg- panfrost: Probe G31/G52 if PAN_MESA_DEBUG=bifrost 5497ec681f3Smrg- pan/mdg: Handle un/pack opcodes as moves 5507ec681f3Smrg- pan/mdg: Add pack_unorm_4x8 via 8-bit 5517ec681f3Smrg- pan/mdg: Treat packs "specially" 5527ec681f3Smrg- pan/mdg: Handle bitsize for packs 5537ec681f3Smrg- pan/mdg: Print 8-bit constants 5547ec681f3Smrg- pan/mdg: Drop the u8 from the colorbuf op names 5557ec681f3Smrg- pan/mdg: Implement raw colourbuf loads on T720 5567ec681f3Smrg- panfrost: Add theory for new framebuffer lowering 5577ec681f3Smrg- panfrost: Determine unpacked type for formats 5587ec681f3Smrg- panfrost: Add quirks for blend shader types 5597ec681f3Smrg- panfrost: Determine load classes for formats 5607ec681f3Smrg- panfrost: Determine classes for stores 5617ec681f3Smrg- panfrost: Stub out lowering boilerplate 5627ec681f3Smrg- panfrost: Un/pack pure 32-bit 5637ec681f3Smrg- panfrost: Un/pack pure 16-bit 5647ec681f3Smrg- panfrost: Un/pack pure 8-bit 5657ec681f3Smrg- panfrost: Un/pack 8-bit UNORM 5667ec681f3Smrg- panfrost: Flesh out dispatch 5677ec681f3Smrg- panfrost: Un/pack UNORM 4 5687ec681f3Smrg- panfrost: Un/pack RGB565 and RGB5A1 5697ec681f3Smrg- panfrost: Un/pack RGB10_A2_UNORM 5707ec681f3Smrg- panfrost: Un/pack RGB10_A2_UINT 5717ec681f3Smrg- panfrost: Un/pack R11G11B10 5727ec681f3Smrg- panfrost: Un/pack sRGB via NIR 5737ec681f3Smrg- panfrost: Switch to pan_lower_framebuffer 5747ec681f3Smrg- panfrost: Conditionally allow fp16 blending 5757ec681f3Smrg- panfrost: Account for differing types in blend lower 5767ec681f3Smrg- panfrost: Let Gallium pack colours 5777ec681f3Smrg- panfrost: Check for large tilebuffer requirements 5787ec681f3Smrg- panfrost: Add separate_stencil BO to batch 5797ec681f3Smrg- panfrost: Use internal_format throughout 5807ec681f3Smrg- panfrost: Update fails list 5817ec681f3Smrg- pan/mdg: Handle 16-bit ld_vary 5827ec681f3Smrg- pan/mdg: Fuse f2f16 into load_interpolated_input 5837ec681f3Smrg- panfrost: Fix PRESENT flag mix-up 5847ec681f3Smrg- panfrost: Permit AFBC of RGB8 5857ec681f3Smrg- panfrost: Use VTX tag for vertex texturing 5867ec681f3Smrg- panfrost: Don't flush explicitly when mipmapping 5877ec681f3Smrg- panfrost: Remove unused nir_lower_framebuffer pass 5887ec681f3Smrg- pan/mdg: Disassemble out-of-order bits 5897ec681f3Smrg- pan/mdg: Add quirk for missing out-of-order support 5907ec681f3Smrg- pan/mdg: Enable out-of-order execution after texture ops 5917ec681f3Smrg- nir: Fold f2f16(b2f32(x)) to b2f16(x) 5927ec681f3Smrg- pan/mdg: Don't double-replicate blend on T720 5937ec681f3Smrg- pan/mdg: Distinguish blend shaders in internal shader-db 5947ec681f3Smrg- pan/mdg: Add roundmode enum 5957ec681f3Smrg- pan/mdg: Add opcode roundmode property 5967ec681f3Smrg- pan/mdg: Lower roundmodes 5977ec681f3Smrg- pan/mdg: Implement \*_rtz conversions with roundmode 5987ec681f3Smrg- pan/mdg: Fold roundmode into applicable instructions 5997ec681f3Smrg- pan/mdg: Handle f2u8 6007ec681f3Smrg- pan/mdg: Allow f2u8 and friends thru 6017ec681f3Smrg- pan/mdg: Handle regular nir_intrinsic_load_output 6027ec681f3Smrg- panfrost: Passthrough NATIVE loads/stores 6037ec681f3Smrg- pan/bi: Handle SEL with vec3 16-bit 6047ec681f3Smrg- pan/bi: Fix SEL.16 swizzle 6057ec681f3Smrg- pan/bi: Pack second argument of F32_TO_F16 6067ec681f3Smrg- pan/bi: Passthrough second argument of F32_TO_F16 6077ec681f3Smrg- pan/bi: Handle vectorized load_const 6087ec681f3Smrg- panfrost: Update MALI_EARLY_Z description 6097ec681f3Smrg- panfrost: Document MALI_WRITES_GLOBAL bit 6107ec681f3Smrg- panfrost: Handle writes_memory correctly 6117ec681f3Smrg- panfrost: Readd MIDGARD_SHADERLESS quirk to t760 6127ec681f3Smrg- panfrost: Explicitly convert to 32-bit for logic-ops 6137ec681f3Smrg- pan/bi: Disassemble gl_PointCoord reads. 6147ec681f3Smrg- panfrost: Prefer sysval for gl_PointCoord on Bifrost 6157ec681f3Smrg- panfrost: Fix gl_PointSize out of GL_POINTS 6167ec681f3Smrg- panfrost: Mark point sprites as todo on Bifrost 6177ec681f3Smrg- pan/mdg: Legalize inverts with constants 6187ec681f3Smrg- pan/mdg: Ensure ld_vary_16 is aligned 6197ec681f3Smrg- panfrost: Ensure we have ro before using it 6207ec681f3Smrg- nir: Remove nir_intrinsic_output_u8_as_fp16_pan 6217ec681f3Smrg- pan/mdg: Avoid fusing ld_vary_16 with non-zero component 6227ec681f3Smrg- panfrost: Calculate varying size by format 6237ec681f3Smrg- panfrost: Add panfrost_streamout_offset helper 6247ec681f3Smrg- panfrost: Introduce bitfields for tracking varyings 6257ec681f3Smrg- panfrost: Determine varying buffer presence 6267ec681f3Smrg- panfrost: Emit unlinked varyings 6277ec681f3Smrg- panfrost: Emit special varyings 6287ec681f3Smrg- panfrost: Emit xfb records 6297ec681f3Smrg- panfrost: Add helper to determine if we are capturing 6307ec681f3Smrg- panfrost: Add high-level varying emit 6317ec681f3Smrg- panfrost: Use new varying linking 6327ec681f3Smrg- panfrost: Remove unused routines 6337ec681f3Smrg- panfrost: Allow R/RG/RGB varyings 6347ec681f3Smrg- panfrost: Only store varying formats 6357ec681f3Smrg- panfrost: Use shader_info harder 6367ec681f3Smrg- panfrost: Override varying format to minimal precision 6377ec681f3Smrg- panfrost: Demote mediump varyings to fp16 6387ec681f3Smrg- pan/mdg: Explicitly type 64-bit uniform moves 6397ec681f3Smrg- pan/mdg: Analyze types for 64-bitness in RA 6407ec681f3Smrg- pan/mdg: Prefer type over regmode for schedule constraints 6417ec681f3Smrg- pan/mdg: Precolour blend inputs 6427ec681f3Smrg- panfrost: Merge bifrost_bo/midgard_bo 6437ec681f3Smrg- panfrost: Update sampler view in Bifrost path 6447ec681f3Smrg- panfrost: Fix level_2 6457ec681f3Smrg- panfrost: Correctly calculate tiled stride 6467ec681f3Smrg- panfrost: Enable AFBC for RGB565 6477ec681f3Smrg- panfrost: Simplify AFBC format check 6487ec681f3Smrg- pan/mdg: Factor out unit check 6497ec681f3Smrg- pan/mdg: Allow scheduling "x + x" to multipliers 6507ec681f3Smrg- pan/mdg: Canonicalize (x * 2.0) to (x + x) 6517ec681f3Smrg- pan/mdg: Reassociate adds for multiply-by-two 6527ec681f3Smrg- nir: Propagate \*2*16 conversions into vectors 6537ec681f3Smrg- panfrost: Specify stack_shift on SFBD 6547ec681f3Smrg- pan/mdg: Defer nir_fuse_io_16 until after opts 6557ec681f3Smrg- pan/mdg: Don't assign destination in writeout block to r1 6567ec681f3Smrg- pan/mdg: Remove bundle interference code 6577ec681f3Smrg- pan/mdg: Schedule writeout to VLUT 6587ec681f3Smrg- pan/mdg: Defer smul, vlut until after writeout moves 6597ec681f3Smrg- pan/mdg: Allow Z/S writes to use any 2nd stage unit 6607ec681f3Smrg- pan/mdg: Prioritize non-moves on VADD/VLUT 6617ec681f3Smrg- pan/mdg: Skip r1.w write where possible 6627ec681f3Smrg- pan/mdg: Schedule based on liveness 6637ec681f3Smrg- pan/mdg: Respect type/mask in mir_lower_special_reads 6647ec681f3Smrg- pan/mdg: Fix indirect UBO swizzles 6657ec681f3Smrg- pan/decode: Fix MSAA texture decoding 6667ec681f3Smrg- pan/decode: Identify layered MSAA flag 6677ec681f3Smrg- pan/mdg: Allow ignoring move mode 6687ec681f3Smrg- pan/mdg: Handle GLSL_SAMPLER_DIM_MS 6697ec681f3Smrg- pan/mdg: Handle nir_tex_src_ms_index 6707ec681f3Smrg- pan/mdg: Handle nir_texop_txf_ms 6717ec681f3Smrg- pan/mdg: Use _VTX tag for texelFetch in frag shaders 6727ec681f3Smrg- panfrost: Set depth to sample_count for MSAA 2D 6737ec681f3Smrg- panfrost: Identify layer_stride 6747ec681f3Smrg- panfrost: Allocate space for multisampling 6757ec681f3Smrg- panfrost: Index texture by sample 6767ec681f3Smrg- panfrost: Include pointer for each sample 6777ec681f3Smrg- panfrost: Set layer_stride for multisampled rendering 6787ec681f3Smrg- panfrost: Don't advertise MSAA 2x 6797ec681f3Smrg- panfrost: Identify coverage_mask 6807ec681f3Smrg- panfrost: Pass sample_mask to the hardware 6817ec681f3Smrg- panfrost: Implement alpha-to-coverage 6827ec681f3Smrg- panfrost: Identify depth/stencil layer strides 6837ec681f3Smrg- panfrost: Set depth/stencil_layer_stride accordingly 6847ec681f3Smrg- panfrost: Enable MSAA if we render to such a surface 6857ec681f3Smrg- panfrost: Save sample_mask before blitting 6867ec681f3Smrg- panfrost: Expose MSAA 4x 6877ec681f3Smrg- glsl: Handle 16-bit types in loop analysis 6887ec681f3Smrg- docs/features: Track Panfrost 6897ec681f3Smrg- panfrost: Introduce pan_pool struct 6907ec681f3Smrg- panfrost: Allocate pool BOs against the pool 6917ec681f3Smrg- panfrost: Track the device through the pool 6927ec681f3Smrg- panfrost: Expose pool-based allocation API 6937ec681f3Smrg- panfrost: Move debug flags into the device 6947ec681f3Smrg- panfrost: Drop Gallium-local pan_bo_create wrapper 6957ec681f3Smrg- panfrost: Move pool routines to common code 6967ec681f3Smrg- panfrost: Factor out scoreboarding state 6977ec681f3Smrg- panfrost: Pass polygon_list to tiler init function 6987ec681f3Smrg- panfrost: Drop batch from scoreboard routines 6997ec681f3Smrg- panfrost: Move scoreboarding routines to common 7007ec681f3Smrg- panfrost: Handle PIPE_FORMAT_X24S8_UINT 7017ec681f3Smrg- panfrost: Handle PIPE_FORMAT_S8_UINT 7027ec681f3Smrg- panfrost: Move panfrost_translate_texture_type 7037ec681f3Smrg- panfrost: Report blend shader work count 7047ec681f3Smrg- panfrost: Clamp pure int pixels 7057ec681f3Smrg- panfrost: Generate shader variants on framebuffer bind 7067ec681f3Smrg- panfrost: Always use SOFTWARE for pure formats 7077ec681f3Smrg- panfrost: Extend fetched framebuffer results 7087ec681f3Smrg- panfrost: Fix fence leak 7097ec681f3Smrg- panfrost: Fix write to free'd memory 7107ec681f3Smrg- panfrost: Add a sparse array to map GEM handles to BOs 7117ec681f3Smrg- panfrost: Index BOs from the BO map sparse array 7127ec681f3Smrg- panfrost: Merge PAN_BO_IMPORTED/PAN_BO_EXPORTED 7137ec681f3Smrg- panfrost: Remove PAN_BO_COHERENT_LOCAL 7147ec681f3Smrg- panfrost: Remove PAN_BO_DONT_REUSE 7157ec681f3Smrg- panfrost: Remove panfrost_bo_access type 7167ec681f3Smrg- panfrost: Compact unused BO flag bits 7177ec681f3Smrg- panfrost: Add format codes for new compressed textures 7187ec681f3Smrg- panfrost: Pipe in compressed texture feature mask 7197ec681f3Smrg- panfrost: Filter compressed texture formats 7207ec681f3Smrg- panfrost: Map PIPE_{DXT, RGTC, BPTC} to MALI_BCn 7217ec681f3Smrg- docs/features: Update ASTC entries for Panfrost 7227ec681f3Smrg- pan/mdg: Bump compiler RT maximum 7237ec681f3Smrg- pan/mdg: Identify per-sample interpolation mode 7247ec681f3Smrg- pan/mdg: Implement gl_SampleID 7257ec681f3Smrg- panfrost: Force Z/S writeback 7267ec681f3Smrg- panfrost: Expose panfrost_get_blend_shader 7277ec681f3Smrg- panfrost: Add MALI_PER_SAMPLE bit 7287ec681f3Smrg- panfrost: Include sample count in payload estimates 7297ec681f3Smrg- panfrost: Identify zs_samples field 7307ec681f3Smrg- panfrost: Add rectangle subtraction algorithm 7317ec681f3Smrg- panfrost: Handle per-sample shading 7327ec681f3Smrg- panfrost: Set zs_samples as necessary 7337ec681f3Smrg- panfrost: Track surfaces drawn per-batch 7347ec681f3Smrg- panfrost: Extract panfrost_batch_reserve_framebuffer 7357ec681f3Smrg- panfrost: Use Midgard-specific reloads 7367ec681f3Smrg- panfrost: Call util_blitter_save_fragment_constant_buffer_slot 7377ec681f3Smrg- panfrost: Overhaul tilebuffer allocations 7387ec681f3Smrg- panfrost: Set PIPE_CAP_MIXED_COLORBUFFER_FORMATS 7397ec681f3Smrg- panfrost: Fix sRGB clear colour packing 7407ec681f3Smrg- panfrost: Implement Z32F_S8 blits 7417ec681f3Smrg- panfrost: Abort on unsupported blit 7427ec681f3Smrg- panfrost: Avoid integer underflow in rt_count_1 7437ec681f3Smrg- panfrost: Honour cso->compare_mode 7447ec681f3Smrg- panfrost: Fix faults with RASTERIZER_DISCARD 7457ec681f3Smrg- panfrost: Report CAPs more honestly 7467ec681f3Smrg- panfrost: Enable Chromium 7477ec681f3Smrg- panfrost: Revert "Disable frame throttling" 7487ec681f3Smrg- docs/features: Mark trivial missed feature 7497ec681f3Smrg- panfrost: Enable FP16 by default 7507ec681f3Smrg- panfrost: Avoid wait=true flushing all batches 7517ec681f3Smrg- panfrost: Remove wait parameter to flush_all_batches 7527ec681f3Smrg- panfrost: Skip specifying in_syncs 7537ec681f3Smrg- panfrost: Allocate syncobjs in panfrost_flush 7547ec681f3Smrg- panfrost: Remove unused batch_fence->signaled 7557ec681f3Smrg- panfrost: Remove unused batch_fence->ctx 7567ec681f3Smrg- pan/bit: Update f32->f16 convert test 7577ec681f3Smrg- pan/bit: Remove BI_SHIFT stub 7587ec681f3Smrg- pan/mdg: Mask spills from texture write 7597ec681f3Smrg- pan/mdg: Test for SSA before chasing addresses 7607ec681f3Smrg- docs/features: Add GL_EXT_multisampled_render_to_texture 7617ec681f3Smrg- panfrost: Add MSAA mode selection field 7627ec681f3Smrg- panfrost: Implement EXT_multisampled_render_to_texture 7637ec681f3Smrg- panfrost: Set STRIDE_4BYTE_ALIGNED_ONLY 7647ec681f3Smrg- panfrost: Fix WRITES_GLOBAL bit 7657ec681f3Smrg- pan/mdg: Ensure barrier op is set on texture 7667ec681f3Smrg- panfrost: Fix blend leak for render targets 5-8 7677ec681f3Smrg- panfrost: Free cloned NIR shader 7687ec681f3Smrg- panfrost: Free NIR of blit shaders 7697ec681f3Smrg- panfrost: Free hash_to_temp map 7707ec681f3Smrg- pan/mdg: Free previous liveness 7717ec681f3Smrg- panfrost: Use memctx for sysvals 7727ec681f3Smrg- panfrost: Free batch->dependencies 7737ec681f3Smrg- pan/mdg: Fix discard encoding 7747ec681f3Smrg- pan/mdg: Fix perspective combination 7757ec681f3Smrg- pan/bit: Set d3d=true for CMP tests 7767ec681f3Smrg 7777ec681f3SmrgAndreas Baierl (1): 7787ec681f3Smrg 7797ec681f3Smrg- nir/ lower_int_to_float: Handle umax and umin 7807ec681f3Smrg 7817ec681f3SmrgAndres Gomez (10): 7827ec681f3Smrg 7837ec681f3Smrg- .mailmap: add an alias for Iago Toral Quiroga 7847ec681f3Smrg- .mailmap: add an alias for Andres Gomez 7857ec681f3Smrg- gitlab-ci: update tracie README after changes in main script 7867ec681f3Smrg- scripts: remove unittest.mock dependency when not used 7877ec681f3Smrg- gitlab-ci: create always the "results" directory with tracie 7887ec681f3Smrg- gitlab-ci: correct tracie behavior with replay errors 7897ec681f3Smrg- gitlab-ci: build gfxreconstruct from the "dev" branch 7907ec681f3Smrg- gitlab-ci: get the last frame from a gfxr trace using gfxrecon-info 7917ec681f3Smrg- gitlab-ci/traces: updated paths and checksums for POLARIS10 traces 7927ec681f3Smrg- gitlab-ci: Test AMD's Raven with traces 7937ec681f3Smrg 7947ec681f3SmrgAndrey Vostrikov (1): 7957ec681f3Smrg 7967ec681f3Smrg- egl/x11: Free memory allocated for reply structures on error 7977ec681f3Smrg 7987ec681f3SmrgAndrii Simiklit (3): 7997ec681f3Smrg 8007ec681f3Smrg- glsl_type: don't serialize padding bytes from glsl_struct_field 8017ec681f3Smrg- i965/vec4: Ignore swizzle of VGRF for use by var_range_end() 8027ec681f3Smrg- glsl: fix crash on glsl macro redefinition 8037ec681f3Smrg 8047ec681f3SmrgAni (1): 8057ec681f3Smrg 8067ec681f3Smrg- drirc: Enable glthread for rpcs3 8077ec681f3Smrg 8087ec681f3SmrgAnuj Phogat (6): 8097ec681f3Smrg 8107ec681f3Smrg- intel/devinfo: Add is_dg1 to device info 8117ec681f3Smrg- intel/l3: Add DG1 L3 configuration 8127ec681f3Smrg- intel/ehl: Use GEN11_URB_MIN_MAX_ENTRIES in device info 8137ec681f3Smrg- intel/ehl: Use macro GEN11_LP_FEATURES in device info 8147ec681f3Smrg- intel/ehl: Rename gen_device_info struct 8157ec681f3Smrg- intel/ehl: Add new PCI-IDs 8167ec681f3Smrg 8177ec681f3SmrgArcady Goldmints-Orlov (4): 8187ec681f3Smrg 8197ec681f3Smrg- anv: increase minUniformBufferOffsetAlignment to 64 8207ec681f3Smrg- intel/compiler: fix alignment assert in nir_emit_intrinsic 8217ec681f3Smrg- nir/spirv/glsl450: increase asin(x) precision 8227ec681f3Smrg- intel/compiler: Always apply sample mask on Vulkan. 8237ec681f3Smrg 8247ec681f3SmrgAxel Davy (19): 8257ec681f3Smrg 8267ec681f3Smrg- st/nine: Set correctly blend max_rt 8277ec681f3Smrg- gallium/util: Fix leak in the live shader cache 8287ec681f3Smrg- ttn: Add new allow_disk_cache parameter 8297ec681f3Smrg- ttn: Implement disk cache 8307ec681f3Smrg- st/nine: Enable ttn cache 8317ec681f3Smrg- radeonsi: Enable tgsi to nir disk cache 8327ec681f3Smrg- st/nine: Add checks for pure device 8337ec681f3Smrg- st/nine: Return error when setting invalid depth buffer 8347ec681f3Smrg- st/nine: Do not return invalidcall on getrenderstate 8357ec681f3Smrg- st/nine: Pass more adapter formats for CheckDepthStencilMatch 8367ec681f3Smrg- st/nine: Improve return error code in CheckDeviceFormat 8377ec681f3Smrg- st/nine: Fix uninitialized variable in BEM() 8387ec681f3Smrg- st/nine: Fix a crash if the state is not initialized 8397ec681f3Smrg- st/nine: Add missing NULL checks 8407ec681f3Smrg- st/nine: Increase available GPU memory 8417ec681f3Smrg- st/nine: Retry allocations after freeing some space 8427ec681f3Smrg- st/nine: Improve pDestRect handling 8437ec681f3Smrg- st/nine: Ignore pDirtyRegion 8447ec681f3Smrg- st/nine: Handle full pSourceRect better 8457ec681f3Smrg 8467ec681f3SmrgBas Nieuwenhuizen (80): 8477ec681f3Smrg 8487ec681f3Smrg- radv: Fix implicit sync with recent allocation changes. 8497ec681f3Smrg- radv: Extend tiling flags to 64-bit. 8507ec681f3Smrg- radv: Provide a better error for permission issues with priorities. 8517ec681f3Smrg- radv: Support VK_PIPELINE_COMPILE_REQUIRED_EXT. 8527ec681f3Smrg- radv: Support VK_PIPELINE_CREATE_EARLY_RETURN_ON_FAILURE_BIT_EXT. 8537ec681f3Smrg- radv: Support VK_PIPELINE_CACHE_CREATE_EXTERNALLY_SYNCHRONIZED_BIT_EXT. 8547ec681f3Smrg- radv: Expose VK_EXT_pipeline_creation_cache_control. 8557ec681f3Smrg- radv/winsys: Finish mapping for sparse residency. 8567ec681f3Smrg- radv/winsys: Remove extra sizeof multiply. 8577ec681f3Smrg- radv: Handle failing to create .cache dir. 8587ec681f3Smrg- radv: Remove dead code. 8597ec681f3Smrg- radv: Do not close fd -1 when NULL-winsys creation fails. 8607ec681f3Smrg- radv: Implement vkGetSwapchainGrallocUsage2ANDROID. 8617ec681f3Smrg- frontend/dri: Implement mapping individual planes. 8627ec681f3Smrg- util/format: Add VK_FORMAT_D16_UNORM_S8_UINT. 8637ec681f3Smrg- util/format: Use correct pipe format for VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM. 8647ec681f3Smrg- util/format: Add more multi-planar formats. 8657ec681f3Smrg- gallium/dri: Remove lowered_yuv tracking for plane mapping. 8667ec681f3Smrg- radeonsi: Explicitly map Z16_UNORM_S8_UINT to None for GFX10. 8677ec681f3Smrg- amd/common,radeonsi: Move gfx10_format_table to common. 8687ec681f3Smrg- radeonsi: Define gfx10_format in the common header. 8697ec681f3Smrg- radv: Include gfx10_format_table.h only from a single source file. 8707ec681f3Smrg- radv: Use common gfx10_format_table.h 8717ec681f3Smrg- radv: Use ac_surface to determine fmask enable. 8727ec681f3Smrg- radv: Pass no_metadata_planes info in to ac_surface. 8737ec681f3Smrg- radv: Enforce the contiguous memory for DCC layers in ac_surface. 8747ec681f3Smrg- radv: Rely on ac_surface for avoiding cmask for linear images. 8757ec681f3Smrg- radv: Use offsets in surface struct. 8767ec681f3Smrg- radv: Disable DCC in ac_surface. 8777ec681f3Smrg- radv: Disable HTILE in ac_surface. 8787ec681f3Smrg- radv: Allocate values/predicates at the end of the image. 8797ec681f3Smrg- amd/common: Add total alignment calculation. 8807ec681f3Smrg- radv: Use ac_surface to allocate aux surfaces. 8817ec681f3Smrg- vulkan/wsi/x11: Ensure we create at least minImageCount images. 8827ec681f3Smrg- radv/winsys: Deal with realloc failures in BO lists. 8837ec681f3Smrg- radv: Handle mmap failures. 8847ec681f3Smrg- radv/winsys: Distinguish device/host memory errors. 8857ec681f3Smrg- radv: Make radv_alloc_shader_memory static. 8867ec681f3Smrg- turnip: semaphore support. 8877ec681f3Smrg- meson: Do not require shader cache for radv. 8887ec681f3Smrg- amd/addrlib: fix another C++ one definition rule violation 8897ec681f3Smrg- radv: Set handle types in Android semaphore/fence import. 8907ec681f3Smrg- radv: Always enable PERFECT_ZPASS_COUNTS. 8917ec681f3Smrg- Revert "radv: add support for MRTs compaction to avoid holes" 8927ec681f3Smrg- radv: Use correct semaphore handle type for Android import. 8937ec681f3Smrg- amd/llvm: Mark pointer function arguments as 32-byte aligned. 8947ec681f3Smrg- amd/common: Cache intra-tile addresses for retile map. 8957ec681f3Smrg- amd/addrlib: Clean up unused colorFlags argument 8967ec681f3Smrg- amd/registers: add RLC_PERFMON_CLK_CNTL for pre-GFX10 8977ec681f3Smrg- radeonsi: Inhibit clock-gating for perf counters. 8987ec681f3Smrg- meson: Add mising git_sha1.h dependency. 8997ec681f3Smrg- amd: Add detection of timeline semaphore support. 9007ec681f3Smrg- radv/winsys: Add binary syncobj ABI changes for timeline semaphores. 9017ec681f3Smrg- radv: Add thread for timeline syncobj submission. 9027ec681f3Smrg- radv: Add winsys support for submitting timeline syncobj. 9037ec681f3Smrg- radv: Add winsys functions for timeline syncobj. 9047ec681f3Smrg- radv: Add timeline syncobj for timeline semaphores. 9057ec681f3Smrg- radv: Fix uninitialized variable in renderpass. 9067ec681f3Smrg- vulkan/wsi/x11: report device-group present rectangles with prime. 9077ec681f3Smrg- vulkan/wsi: Convert usage of -1 to UINT32_MAX. 9087ec681f3Smrg- radv: Fix host->host signalling with legacy timeline semaphores. 9097ec681f3Smrg- mesa/st: Actually free the driver part of memory objects on destruction. 9107ec681f3Smrg- radv: Don't use both DCC and CMASK for single sample images. 9117ec681f3Smrg- radv: Fix assert that is too strict. 9127ec681f3Smrg- radv: Do not consider layouts fast-clearable on compute queue. 9137ec681f3Smrg- radv: When importing an image, redo the layout based on the metadata. 9147ec681f3Smrg- radv: Use getter instead of setter to extract value. 9157ec681f3Smrg- driconf: Support selection by Vulkan applicationName. 9167ec681f3Smrg- radv: Override the uniform buffer offset alignment for World War Z. 9177ec681f3Smrg- radv: Fix handling of attribs 16-31. 9187ec681f3Smrg- radv: Remove conformance warnings with ACO. 9197ec681f3Smrg- radv: Update CTS version. 9207ec681f3Smrg- radv: Fix 3d blits. 9217ec681f3Smrg- radv: Fix threading issue with submission refcounts. 9227ec681f3Smrg- radv: Avoid deadlock on bo_list. 9237ec681f3Smrg- spirv: Deal with glslang not setting NonUniform on constructors. 9247ec681f3Smrg- radeonsi: Work around Wasteland 2 bug. 9257ec681f3Smrg- spirv: Deal with glslang bug not setting the decoration for stores. 9267ec681f3Smrg- ac/surface: Fix depth import on GFX6-GFX8. 9277ec681f3Smrg- st/mesa: Deal with empty textures/buffers in semaphore wait/signal. 9287ec681f3Smrg 9297ec681f3SmrgBen Skeggs (38): 9307ec681f3Smrg 9317ec681f3Smrg- nir: use bitfield_insert instead of bfi in nir_lower_double_ops 9327ec681f3Smrg- nvir: bump max encoding size of instructions 9337ec681f3Smrg- nvir: introduce OP_LOP3_LUT 9347ec681f3Smrg- nvir: introduce OP_WARPSYNC 9357ec681f3Smrg- nvir: introduce OP_BREV with lowering to EXTBF_REV for current GPUs 9367ec681f3Smrg- nvir: introduce OP_SHF 9377ec681f3Smrg- nvir: introduce OP_BMSK 9387ec681f3Smrg- nvir: introduce OP_SGXT 9397ec681f3Smrg- nvir: introduce OP_FINAL 9407ec681f3Smrg- nvir: add constant folding for OP_PERMT 9417ec681f3Smrg- nvir: run replaceZero() before replaceCvt() 9427ec681f3Smrg- nvir/nir: fix fragment program output when using MRT 9437ec681f3Smrg- nvir/nir: move nir options to codegen 9447ec681f3Smrg- nvir/nir: flesh out options 9457ec681f3Smrg- nvir/nir: turn on lower_rotate 9467ec681f3Smrg- nvir/nir: implement nir_op_extract_u8 9477ec681f3Smrg- nvir/nir: implement nir_op_extract_i8 9487ec681f3Smrg- nvir/nir: implement nir_op_extract_u16 9497ec681f3Smrg- nvir/nir: implement nir_op_extract_i16 9507ec681f3Smrg- nvir/nir: implement nir_op_urol 9517ec681f3Smrg- nvir/nir: implement nir_op_uror 9527ec681f3Smrg- nvir/nir: nir expects the shift amount to wrap, rather than clamp 9537ec681f3Smrg- nvir/nir: use nir_lower_idiv 9547ec681f3Smrg- nvir/gm107: implement OP_PERMT 9557ec681f3Smrg- nvir/gm107: replace SHR+AND+AND with PRMT+PRMT in PFETCH lowering 9567ec681f3Smrg- nvir/gm107: separate out header for sched data calculator 9577ec681f3Smrg- nvir/nir/gm107: split nir shader compiler options from gf100 9587ec681f3Smrg- nvir/nir/gm107: turn on nir_lower_extract64 9597ec681f3Smrg- nvir/nir/gm107: switch off lower_extract_byte 9607ec681f3Smrg- nvir/nir/gm107: switch off lower_extract_word 9617ec681f3Smrg- nvir/gv100: initial support 9627ec681f3Smrg- nvir/gv100: enable support for tu1xx 9637ec681f3Smrg- nvc0: use NVIDIA headers for GK104->GM2xx compute QMD 9647ec681f3Smrg- nvc0: use NVIDIA headers for GP100- compute QMD 9657ec681f3Smrg- nvc0: move setting of entrypoint for a shader stage to a function 9667ec681f3Smrg- nvc0: remove hardcoded blitter vertprog 9677ec681f3Smrg- nvc0: initial support for gv100 9687ec681f3Smrg- nvc0: initial support for tu1xx 9697ec681f3Smrg 9707ec681f3SmrgBenjamin Cheng (1): 9717ec681f3Smrg 9727ec681f3Smrg- drirc: Add picom to adaptive_sync exclusion list 9737ec681f3Smrg 9747ec681f3SmrgBenjamin Tissoires (3): 9757ec681f3Smrg 9767ec681f3Smrg- CI: reduce bandwidth for git pull 9777ec681f3Smrg- gitlab-ci: update ci-fairy minio to latest upstream 9787ec681f3Smrg- gitlab-ci: do not run full CI on scheduled pipelines 9797ec681f3Smrg 9807ec681f3SmrgBlaž Tomažič (1): 9817ec681f3Smrg 9827ec681f3Smrg- radeonsi: Fix omitted flush when moving suballocated texture 9837ec681f3Smrg 9847ec681f3SmrgBoris Brezillon (14): 9857ec681f3Smrg 9867ec681f3Smrg- spirv: Split the vtn_emit_scoped_memory_barrier() logic 9877ec681f3Smrg- nir: Replace the scoped_memory barrier by a scoped_barrier 9887ec681f3Smrg- intel/compiler: Extract control barriers from scoped barriers 9897ec681f3Smrg- spirv: Use scoped barriers for SpvOpControlBarrier 9907ec681f3Smrg- nir: Add new rules to optimize NOOP pack/unpack pairs 9917ec681f3Smrg- nir: Use a switch in build_deref_offset()/deref_instr_get_const_offset() 9927ec681f3Smrg- nir: Allow casts in nir_deref_instr_get[_const]_offset() 9937ec681f3Smrg- freedreno: Initialize lower_int64_options to a proper value 9947ec681f3Smrg- nir: Stop passing an options arg to nir_lower_int64() 9957ec681f3Smrg- nir: Extend nir_lower_int64() to support i2f/f2i lowering 9967ec681f3Smrg- intel: Set int64_options to ~0 when lowering 64b ops 9977ec681f3Smrg- nir: Get rid of __[u]int64_to_fp32() and __fp32_to_[u]int64() 9987ec681f3Smrg- nir: Fix i64tof32 lowering 9997ec681f3Smrg- spirv: Add a vtn_get_mem_operands() helper 10007ec681f3Smrg 10017ec681f3SmrgBoyuan Zhang (2): 10027ec681f3Smrg 10037ec681f3Smrg- radeon/vcn/enc: Re-write PPS encoding for HEVC 10047ec681f3Smrg- radeon/vcn: bump vcn3.0 encode major version to 1 10057ec681f3Smrg 10067ec681f3SmrgBrian Ho (14): 10077ec681f3Smrg 10087ec681f3Smrg- turnip: Execute ir3_nir_lower_gs pass again 10097ec681f3Smrg- turnip: Fill out VkPhysicalDeviceSubgroupProperties 10107ec681f3Smrg- nir: Support sysval tess levels in SPIR-V to NIR 10117ec681f3Smrg- nir: Add an option for lowering TessLevelInner/Outer to vecs 10127ec681f3Smrg- turnip: Lower shaders for tessellation 10137ec681f3Smrg- turnip: Offset by component when lowering gl_TessLevel* 10147ec681f3Smrg- turnip: Parse tess state and support PATCH primtype 10157ec681f3Smrg- turnip: Allocate tess BOs as a function of draw size 10167ec681f3Smrg- turnip: Update VFD_CONTROL with tess system values 10177ec681f3Smrg- turnip: Emit HS/DS user consts as draw states 10187ec681f3Smrg- turnip: Support tess for draws 10197ec681f3Smrg- turnip: Force sysmem for tessellation 10207ec681f3Smrg- ir3: Unconditionally enable MERGEDREGS on a6xx 10217ec681f3Smrg- turnip: Enable tessellationShader physical device feature 10227ec681f3Smrg 10237ec681f3SmrgCaio Marcelo de Oliveira Filho (32): 10247ec681f3Smrg 10257ec681f3Smrg- intel/dev: Bail when INTEL_DEVID_OVERRIDE is not valid 10267ec681f3Smrg- intel/fs: Clean up variable group size handling in backend 10277ec681f3Smrg- intel/fs: Add an option to lower variable group size in backend 10287ec681f3Smrg- intel/fs: Add and use a new load_simd_width_intel intrinsic 10297ec681f3Smrg- intel: Let drivers call brw_nir_lower_cs_intrinsics() 10307ec681f3Smrg- iris: Implement ARB_compute_variable_group_size 10317ec681f3Smrg- util/list: Add list_foreach_entry_from_safe 10327ec681f3Smrg- nir: Use deref intrinsics to set writes_memory when gathering info 10337ec681f3Smrg- intel/fs: Use writes_memory from shader_info 10347ec681f3Smrg- nir: Consider atomic counter intrinsics when setting writes_memory 10357ec681f3Smrg- intel/fs: Remove unused emission of load_simd_with_intel 10367ec681f3Smrg- intel/fs: Remove unused state from brw_nir_lower_cs_intrinsics 10377ec681f3Smrg- intel/fs: Early return when can't satisfy explicit group size 10387ec681f3Smrg- intel/fs: Remove redundant assert() 10397ec681f3Smrg- intel/fs: Remove min_dispatch_width spilling decision from RA 10407ec681f3Smrg- intel/fs: Support INTEL_DEBUG=no8,no32 in compute shaders 10417ec681f3Smrg- intel/fs: Add helper to get prog_offset and simd_size 10427ec681f3Smrg- i965: Use new helper functions to pick SIMD variant for CS 10437ec681f3Smrg- iris: Set CS KernelStatePointer at dispatch 10447ec681f3Smrg- iris: Use new helper functions to pick SIMD variant for CS 10457ec681f3Smrg- anv: Use new helper functions to pick SIMD variant for CS 10467ec681f3Smrg- intel/fs: Generate multiple CS SIMD variants for variable group size 10477ec681f3Smrg- iris, i965: Drop max_variable_local_size 10487ec681f3Smrg- iris, i965: Update limits for ARB_compute_variable_group_size 10497ec681f3Smrg- intel: Add helper to calculate GPGPU_WALKER::RightExecutionMask 10507ec681f3Smrg- nir: Fix printing execution scope of a scoped barrier 10517ec681f3Smrg- spirv: Memory semantics is optional for OpControlBarrier 10527ec681f3Smrg- intel/fs: Add Fall-through comment 10537ec681f3Smrg- nir: Fix logic that ends combine barrier sequence 10547ec681f3Smrg- spirv: Handle most execution modes earlier 10557ec681f3Smrg- nir: Filter modes of scoped memory barrier in nir_opt_load_store_vectorize 10567ec681f3Smrg- spirv: Propagate explicit layout only in types that need it 10577ec681f3Smrg 10587ec681f3SmrgCharmaine Lee (1): 10597ec681f3Smrg 10607ec681f3Smrg- llvmpipe: do not enable tessellation shader without llvm coroutines support 10617ec681f3Smrg 10627ec681f3SmrgChris Forbes (12): 10637ec681f3Smrg 10647ec681f3Smrg- bifrost: Set RTZ rounding mode for f2i conversion 10657ec681f3Smrg- bifrost: Lower x->bool conversions to != 0 10667ec681f3Smrg- bifrost: Emit "d3d" variant of comparison instructions 10677ec681f3Smrg- bifrost: Document d3d/gl comparison control bit 10687ec681f3Smrg- bifrost: Add lowering for b2i32 10697ec681f3Smrg- bifrost: Add support for nir_op_inot 10707ec681f3Smrg- bifrost: Add support for nir_op_ishl 10717ec681f3Smrg- bifrost: Add support for nir_op_uge 10727ec681f3Smrg- bifrost: Add support for nir_op_imul 10737ec681f3Smrg- bifrost: Add support for nir_op_iabs 10747ec681f3Smrg- bifrost: Honor src swizzle in special math ops 10757ec681f3Smrg- bifrost: Fix packing of ADD_FEXP2_FAST 10767ec681f3Smrg 10777ec681f3SmrgChris Wilson (6): 10787ec681f3Smrg 10797ec681f3Smrg- iris: Place a seqno at the end of every batch 10807ec681f3Smrg- iris: Convert fences to using lightweight seqno 10817ec681f3Smrg- iris: Store a seqno for each batch in the fence 10827ec681f3Smrg- iris: Initialise stub iris_seqno to 0 10837ec681f3Smrg- iris: Rename iris_seqno to iris_fine_fence 10847ec681f3Smrg- iris: Fixup copy'n'paste mistake in Makefile.sources 10857ec681f3Smrg 10867ec681f3SmrgChristian Gmeiner (31): 10877ec681f3Smrg 10887ec681f3Smrg- etnaviv: fix SAMP_ANISOTROPY register value 10897ec681f3Smrg- etnaviv: do not use int filter when anisotropic filtering is used 10907ec681f3Smrg- ci: bare-metal: make it possible to use a script for serial 10917ec681f3Smrg- ci: extend expect-output.sh 10927ec681f3Smrg- ci: add U-Boot specific fetch strings 10937ec681f3Smrg- etnaviv: drop translate_blend(..) 10947ec681f3Smrg- ci: add arm_test-base docker image 10957ec681f3Smrg- ci: use separate docker images for baremetal builds 10967ec681f3Smrg- ci: fix possible spuriously run of jobs 10977ec681f3Smrg- etnaviv: delete not used struct 10987ec681f3Smrg- etnaviv: convert enums 10997ec681f3Smrg- etnaviv: move etna_lower_io(..) to etnaviv_nir.c 11007ec681f3Smrg- etnaviv: get rid of etna_compile dependency 11017ec681f3Smrg- etnaviv: move etna_lower_alu(..) to etnaviv_nir.c 11027ec681f3Smrg- etnaviv: drop OPT_V define 11037ec681f3Smrg- etnaviv: make more use of compile_error(..) 11047ec681f3Smrg- etnaviv: move liveness related stuff into own file 11057ec681f3Smrg- etnaviv: merge struct etna_compile and etna_state 11067ec681f3Smrg- etnaviv: drop emit macro 11077ec681f3Smrg- etnaviv: move functions that generate asm to own file 11087ec681f3Smrg- etnaviv: move nir compiler related stuff into .c file 11097ec681f3Smrg- etnaviv: move ra into own file 11107ec681f3Smrg- etnaviv: replace prims-emitted query 11117ec681f3Smrg- ci: bare-metal: use nginx to get results from DUT 11127ec681f3Smrg- etnaviv: explicitly set nir_variable_mode 11137ec681f3Smrg- etnaviv: introduce struct etna_compiler 11147ec681f3Smrg- etnaviv: move shader_count to etna_compiler 11157ec681f3Smrg- etnaviv: do register setup only once 11167ec681f3Smrg- etnaviv: fix nir validation problem 11177ec681f3Smrg- etnaviv: call nir_lower_bool_to_bitsize 11187ec681f3Smrg- etnaviv: completely turn off MSAA 11197ec681f3Smrg 11207ec681f3SmrgChristopher Egert (2): 11217ec681f3Smrg 11227ec681f3Smrg- radv: use util_float_to_half_rtz 11237ec681f3Smrg- r600: Use TRUNC_COORD on samplers 11247ec681f3Smrg 11257ec681f3SmrgClément Guérin (1): 11267ec681f3Smrg 11277ec681f3Smrg- radv: Always expose non-visible local memory type on dedicated GPUs 11287ec681f3Smrg 11297ec681f3SmrgCon Kolivas (1): 11307ec681f3Smrg 11317ec681f3Smrg- Linux: Change minimum priority threads from SCHED_IDLE to nice 19 SCHED_BATCH. 11327ec681f3Smrg 11337ec681f3SmrgConnor Abbott (88): 11347ec681f3Smrg 11357ec681f3Smrg- tu: Support pipelines without a fragment shader 11367ec681f3Smrg- tu: Add a "scratch bo" allocation mechanism 11377ec681f3Smrg- tu: Add noubwc debug flag to disable UBWC 11387ec681f3Smrg- tu: Implement fallback linear staging blit for CopyImage 11397ec681f3Smrg- freedreno/a6xx: Document dual-src blending enable bits 11407ec681f3Smrg- ir3: Fixup dual-source blending slot 11417ec681f3Smrg- tu: Move RENDER_COMPONENTS setting to pipeline state 11427ec681f3Smrg- tu: Implement dual-src blending 11437ec681f3Smrg- tu: Advertise COLOR_ATTACHMENT_BLEND_BIT for blendable formats 11447ec681f3Smrg- tu: Always initialize image_view fields for blit sources 11457ec681f3Smrg- tu: Fall back to 3d blit path for BC1_RGB_* formats 11467ec681f3Smrg- tu: Fix buffer compressed pitch calculation with unaligned sizes 11477ec681f3Smrg- tu: Support VK_FORMAT_FEATURE_BLIT_SRC_BIT for texture-only formats 11487ec681f3Smrg- tu: Fix IBO descriptor for cubes 11497ec681f3Smrg- tu: Respect VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT 11507ec681f3Smrg- tu: Add missing storage image/texel buffer bits 11517ec681f3Smrg- tu: Remove useless post-binning flushes 11527ec681f3Smrg- tu: Don't actually track seqno's for events 11537ec681f3Smrg- tu: Remove useless event_write helpers 11547ec681f3Smrg- tu: Rewrite flushing to use barriers 11557ec681f3Smrg- tu: Fix context faults loading unused descriptor sets 11567ec681f3Smrg- ir3: Pass reserved_user_consts to ir3_shader_from_nir() 11577ec681f3Smrg- tu: Remove num_samp hack 11587ec681f3Smrg- tu: Use the ir3 shader API 11597ec681f3Smrg- tu: Remove tu_shader_compile_options 11607ec681f3Smrg- tu: Set num_components to 0 when building bindless intrinsics 11617ec681f3Smrg- ir3: Don't calculate num_samp ourselves 11627ec681f3Smrg- tu: Actually remove dead variables after io lowering 11637ec681f3Smrg- ir3: Split out variant-specific lowering and optimizations 11647ec681f3Smrg- ir3, freedreno: Round up constlen earlier 11657ec681f3Smrg- ir3: Include ir3_compiler from ir3_shader 11667ec681f3Smrg- ir3: Support variants with different constlen's 11677ec681f3Smrg- ir3: Add ir3_trim_constlen() 11687ec681f3Smrg- tu: Share constlen between different stages properly 11697ec681f3Smrg- freedreno: Refactor ir3_cache shader compilation 11707ec681f3Smrg- freedreno: Share constlen between different stages properly 11717ec681f3Smrg- freedreno: On a5xx+ INDX_SIZE is MAX_INDICES 11727ec681f3Smrg- freedreno/registers: Label firstIndex field in CP_DRAW_INDX_OFFSET 11737ec681f3Smrg- tu: Pass firstIndex directly to CP_DRAW_INDX_OFFSET 11747ec681f3Smrg- freedreno/a6xx: use firstIndex field 11757ec681f3Smrg- nir: Refactor load/store intrinsic helper 11767ec681f3Smrg- nir: add vec2_index_32bit_offset address format 11777ec681f3Smrg- tu: Rewrite variable lowering 11787ec681f3Smrg- tu: Enable KHR_variable_pointers 11797ec681f3Smrg- ir3: Add layer_zero variant bit 11807ec681f3Smrg- tu: Force gl_Layer to 0 when necessary 11817ec681f3Smrg- freedreno/a6xx: Force gl_Layer to 0 when necessary 11827ec681f3Smrg- freedreno: Include adreno_pm4.xml.h before adreno_a6xx.xml.h 11837ec681f3Smrg- freedreno: Sync registers with envytools 11847ec681f3Smrg- freedreno/a6xx: Rename and document HLSQ_UPDATE_CNTL 11857ec681f3Smrg- freedreno/a6xx: Add some documentation for shared consts 11867ec681f3Smrg- tu: Don't invalidate irrelevant state when changing pipeline 11877ec681f3Smrg- freedreno/a6xx: Add stencilref register info 11887ec681f3Smrg- ir3: Handle gl_FragStencilRefARB 11897ec681f3Smrg- tu: Enable VK_EXT_shader_stencil_export 11907ec681f3Smrg- freedreno: Add a helper for computing guardband sizes 11917ec681f3Smrg- tu: Use common guardband helper 11927ec681f3Smrg- freedreno: Use common guardband helper 11937ec681f3Smrg- freedreno/ir3: Fix SSBO size for bindless SSBO's 11947ec681f3Smrg- tu: Enable VK_EXT_depth_clip_enable 11957ec681f3Smrg- freedreno: Clean up CP_DRAW_MULTI_INDIRECT definition 11967ec681f3Smrg- freedreno: Add INDIRECT_COUNT CP_DRAW_INDIRECT_MULTI variants 11977ec681f3Smrg- tu: Integrate WFI/WAIT_FOR_ME/WAIT_MEM_WRITES with cache tracking 11987ec681f3Smrg- tu: Add missing wfi to tu6_emit_hw() 11997ec681f3Smrg- tu: Implement VK_KHR_draw_indirect_count 12007ec681f3Smrg- tu: Fix empty blit scissor case 12017ec681f3Smrg- tu: Fix hangs for DS with no output 12027ec681f3Smrg- tu: Detect invalid-for-binning renderpass dependencies 12037ec681f3Smrg- tu: Enable vertex & fragment stores & atomics 12047ec681f3Smrg- tu: Fix descriptor update templates with input attachments 12057ec681f3Smrg- ir3: Validate bindless samp_tex correctly 12067ec681f3Smrg- ir3: Remove redundant samp_tex validation 12077ec681f3Smrg- ir3: Fix incorrect src flags for samp_tex 12087ec681f3Smrg- tu: Enable resource dynamic indexing 12097ec681f3Smrg- freedreno/rnn: Return success when parsing addvariant 12107ec681f3Smrg- tu: Dump CP_DRAW_INDIRECT_MULTI draw BO's 12117ec681f3Smrg- freedreno/rnn: Support stripes in rnndec_decodereg 12127ec681f3Smrg- freedreno/cffdec: Handle CP_DRAW_INDIRECT_MULTI like other draws 12137ec681f3Smrg- freedreno: Add trace for CP_DRAW_INDIRECT_MULTI 12147ec681f3Smrg- freedreno/a6xx: Fix CP_BIN_SIZE_ADDRESS name 12157ec681f3Smrg- freedreno/rnn: Make rnn_decode_enum() respect variants 12167ec681f3Smrg- freedreno/cffdec: Stop open-coding enum parsing 12177ec681f3Smrg- freedreno/afuc: Add missing rnn_prepdb() 12187ec681f3Smrg- freedreno/afuc: Fix PM4 enum parsing 12197ec681f3Smrg- tu: Fix DST_INCOHERENT_FLUSH copy/paste error 12207ec681f3Smrg- freedreno: Document draw predication packets 12217ec681f3Smrg- tu: Reset has_tess after renderpass 12227ec681f3Smrg- tu: Implement VK_EXT_conditional_rendering 12237ec681f3Smrg 12247ec681f3SmrgD Scott Phillips (4): 12257ec681f3Smrg 12267ec681f3Smrg- intel/fs: Update location of Render Target Array Index for gen12 12277ec681f3Smrg- anv,iris: Fix input vertex max for tcs on gen12 12287ec681f3Smrg- intel/dump_gpu: Fix name of LD_PRELOAD in env append logic 12297ec681f3Smrg- anv/gen11+: Disable object level preemption 12307ec681f3Smrg 12317ec681f3SmrgDaniel Schürmann (54): 12327ec681f3Smrg 12337ec681f3Smrg- aco: either copy-propagate or inline create_vector operands 12347ec681f3Smrg- aco: coalesce parallelcopies during register allocation 12357ec681f3Smrg- nir: add nir_intrinsic_elect to divergence analysis 12367ec681f3Smrg- nir: refactor divergence analysis state 12377ec681f3Smrg- nir: rework phi handling in divergence analysis 12387ec681f3Smrg- nir: simplify phi handling in divergence analysis 12397ec681f3Smrg- nir: reset ssa-defs as non-divergent during divergence analysis instead of upfront 12407ec681f3Smrg- aco: fix WQM coalescing 12417ec681f3Smrg- aco: restrict copying of create_vector operands to GFX9+ 12427ec681f3Smrg- aco: don't move create_vector subdword operands to unsupported register offsets 12437ec681f3Smrg- aco: fix corner case in register allocation 12447ec681f3Smrg- aco: don't allow unaligned subdword accesses on GFX6/7 12457ec681f3Smrg- aco: fix register assignment for p_create_vector on GFX6/7 12467ec681f3Smrg- aco: simplify statistics collection for copies 12477ec681f3Smrg- aco: use full-register instructions to implement subdword packing on GFX6/7 12487ec681f3Smrg- aco: Workarounds subdword lowering on GFX6/7 12497ec681f3Smrg- aco: adjust GFX6 subdword lowering workarounds for 8bit 12507ec681f3Smrg- aco: add and use scratch SGPR to lower subdword p_create_vector on GFX6/7 12517ec681f3Smrg- aco: coalesce copies more aggressively when lowering to hw 12527ec681f3Smrg- aco: skip partial copies on first iteration when lowering to hw 12537ec681f3Smrg- aco: optimize packing of 16bit subdword registers on GFX6/7 12547ec681f3Smrg- aco: remove unnecessary split- and create_vector instructions for subdword loads 12557ec681f3Smrg- aco: fix shared subdword loads 12567ec681f3Smrg- aco: reorder calls to aco_validate() and cleanup aco_compile_shader() 12577ec681f3Smrg- aco: don't allow SGPRs on logical phis 12587ec681f3Smrg- aco: fix WQM handling in nested loops 12597ec681f3Smrg- radv/aco: implement logic64 instead of lowering 12607ec681f3Smrg- aco: align swap operations to 4 bytes on GFX6/7 12617ec681f3Smrg- aco: don't allow partial copies on GFX6/7 12627ec681f3Smrg- radv: introduce RADV_DEBUG=llvm option 12637ec681f3Smrg- radv: change use_aco -> use_llvm 12647ec681f3Smrg- radv: enable ACO by default 12657ec681f3Smrg- aco: fix partial copies on GFX6/7 12667ec681f3Smrg- aco: remove superflous (bool & exec) if the result comes from VOPC 12677ec681f3Smrg- nir: also move vecN in case of nir_move_copies 12687ec681f3Smrg- nir: refactor nir_can_move_instr 12697ec681f3Smrg- nir/algebraic: optimize bcsel(a, 0, 1) to b2i 12707ec681f3Smrg- nir: also move b2i in case of nir_move_copies 12717ec681f3Smrg- nir/algebraic: optimize iand/ior of (n)eq zero 12727ec681f3Smrg- nir/algebraic: add optimizations for fsign/isign 12737ec681f3Smrg- nir/algebraic: add some more unop + bcsel optimizations 12747ec681f3Smrg- nir/algebraic: optimize fmul(x, bcsel(c, -1.0, 1.0)) -> bcsel(c, -x, x) 12757ec681f3Smrg- nir/algebraic: optimize (a < 0.0) ? -a : a -> fabs(a) 12767ec681f3Smrg- nir/algebraic: add distributive rules for ior/iand 12777ec681f3Smrg- nir/algebraic: propagate b2i out of ior/iand 12787ec681f3Smrg- nir/algebraic: fold some nested bcsel 12797ec681f3Smrg- aco: fix scratch loads which cross element_size boundaries 12807ec681f3Smrg- aco: ensure to not extract more components than have been fetched 12817ec681f3Smrg- aco: don't split store data if it was already split into more elements 12827ec681f3Smrg- aco: prevent infinite recursion in RA for subdword variables 12837ec681f3Smrg- aco: ensure readfirstlane subdword operands are always dword aligned 12847ec681f3Smrg- radv: call radv_nir_lower_ycbcr_textures after first optimizations 12857ec681f3Smrg- aco: add GFX6/7 subdword lowering tests 12867ec681f3Smrg- aco: execute branch instructions in WQM if necessary 12877ec681f3Smrg 12887ec681f3SmrgDaniel Stone (13): 12897ec681f3Smrg 12907ec681f3Smrg- CI: Disable Panfrost T7x0 jobs 12917ec681f3Smrg- CI: Re-enable Panfrost T7x0 jobs 12927ec681f3Smrg- llvmpipe: Expect increased exp precision on Windows 12937ec681f3Smrg- CI: Windows: Build LLVM and llvmpipe 12947ec681f3Smrg- CI: Disable Panfrost T720/T760 12957ec681f3Smrg- Revert "CI: Disable Panfrost T720/T760" 12967ec681f3Smrg- CI: Enable assertions on Windows 12977ec681f3Smrg- CI: Try shared libraries on Windows 12987ec681f3Smrg- CI: Correct build-directory path on Windows, and keep it 12997ec681f3Smrg- CI: Re-enable the Windows VS2019 build job 13007ec681f3Smrg- CI: Temporarily disable Panfrost T860 jobs 13017ec681f3Smrg- CI: Re-enable Panfrost T860 jobs 13027ec681f3Smrg- CI: Disable Windows build due to unstable infrastructure 13037ec681f3Smrg 13047ec681f3SmrgDanylo Piliaiev (25): 13057ec681f3Smrg 13067ec681f3Smrg- glsl: rename has_implicit_uint_to_int_conversion to *_int_to_uint_* 13077ec681f3Smrg- i965: Fix out-of-bounds access to brw_stage_state::surf_offset 13087ec681f3Smrg- anv: Translate relative timeout to absolute when calling anv_timelines_wait 13097ec681f3Smrg- anv: Fix deadlock in anv_timelines_wait 13107ec681f3Smrg- meson: Disable GCC's dead store elimination for memory zeroing custom new 13117ec681f3Smrg- mesa: Fix double-lock of Shared->FrameBuffers and usage of wrong mutex 13127ec681f3Smrg- st/mesa: Clear texture's views when texture is removed from Shared->TexObjects 13137ec681f3Smrg- intel/fs: Work around dual-source blending hangs in combination with SIMD16 13147ec681f3Smrg- glsl: Don't replace lrp pattern with lrp if arguments are not floats 13157ec681f3Smrg- glsl: inline functions with unsupported return type before converting to nir 13167ec681f3Smrg- i965: Work around incorrect usage of glDrawRangeElements in UE4 13177ec681f3Smrg- st/mesa: account for "loose", per-mipmap level textures in CopyImageSubData 13187ec681f3Smrg- iris: Honor scanout requirement from DRI 13197ec681f3Smrg- iris: Fix fast-clearing of depth via glClearTex(Sub)Image 13207ec681f3Smrg- nir/opt_if: Fix opt_if_simplification when else branch has jump 13217ec681f3Smrg- nir/tests: Add tests for opt_if_simplification 13227ec681f3Smrg- st/mesa: Treat vertex outputs absent in outputMapping as zero in mesa_to_tgsi 13237ec681f3Smrg- anv/nir: Unify inputs_read/outputs_written between geometry stages 13247ec681f3Smrg- spirv: Only require bare types to match when copying variables 13257ec681f3Smrg- glsl: Eliminate out-of-bounds triop_vector_insert 13267ec681f3Smrg- intel/compiler: Fix pointer arithmetic when reading shader assembly 13277ec681f3Smrg- glsl: Eliminate assigments to out-of-bounds elements of vector 13287ec681f3Smrg- nir/lower_io: Eliminate oob writes and return zero for oob reads 13297ec681f3Smrg- nir/large_constants: Eliminate out-of-bounds writes to large constants 13307ec681f3Smrg- nir/lower_samplers: Clamp out-of-bounds access to array of samplers 13317ec681f3Smrg 13327ec681f3SmrgDaryl W. Grunau (1): 13337ec681f3Smrg 13347ec681f3Smrg- prevent multiply defined symbols 13357ec681f3Smrg 13367ec681f3SmrgDave Airlie (199): 13377ec681f3Smrg 13387ec681f3Smrg- i965: add support for gen 5 pipelined pointers to dump 13397ec681f3Smrg- i965: disable shadow batches when batch debugging. 13407ec681f3Smrg- draw/tess: free tessellation control shader i/o memory. 13417ec681f3Smrg- llvmpipo/nir: free compute shader NIR 13427ec681f3Smrg- llvmpipe: simple texture barrier implementation. 13437ec681f3Smrg- gallivm/sample: add multisample support for texel fetch 13447ec681f3Smrg- gallivm/sample: add multisample image operation support 13457ec681f3Smrg- gallivm/nir/tgsi: add multisample texture sampling. 13467ec681f3Smrg- gallivm/nir: add multisample support to image size 13477ec681f3Smrg- gallivm/nir: add multisample image operations 13487ec681f3Smrg- draw: introduce sampler num samples + stride members 13497ec681f3Smrg- draw: add support for num_samples + sample_stride to the image paths 13507ec681f3Smrg- llvmpipe: add num_samples/sample_stride support to jit textures 13517ec681f3Smrg- llvmpipe: add samples support to image jit 13527ec681f3Smrg- util: add a resource wrapper to get resource samples 13537ec681f3Smrg- llvmpipe: add multisample support to texture allocator. 13547ec681f3Smrg- llvmpipe: add a max samples define set to 4. 13557ec681f3Smrg- gallium/util: split out zstencil clearing code. 13567ec681f3Smrg- llvmpipe: fix race between draw and setting fragment shader. 13577ec681f3Smrg- llvmpipe: add get_sample_position support (v2) 13587ec681f3Smrg- llvmpipe/jit: pass fragment sample mask via jit context. 13597ec681f3Smrg- llvmpipe: pass incoming sample_mask into fragment shader context. 13607ec681f3Smrg- llvmpipe: add internal multisample texture mapping path. 13617ec681f3Smrg- llvmpipe: add multisample resource copy region support. 13627ec681f3Smrg- llvmpipe: add clear texture support for multisample textures. 13637ec681f3Smrg- llvmpipe: handle multisample render target clears 13647ec681f3Smrg- draw: disable point/line smoothing for multisample (v2) 13657ec681f3Smrg- llvmpipe: pass color and depth sample strides into fragment shader. 13667ec681f3Smrg- llvmpipe: record sample info for color/depth buffers in scene 13677ec681f3Smrg- llvmpipe/rast: fix tile clearing for multisample color and depth tiles 13687ec681f3Smrg- llvmpipe: plumb multisample state bit into setup code. 13697ec681f3Smrg- llvmpipe: add multisample bit to fragment shader key. 13707ec681f3Smrg- llvmpipe: change mask input to fragment shader to 64-bit. 13717ec681f3Smrg- llvmpipe: add cbuf/zsbuf + coverage samples to the fragment shader key. 13727ec681f3Smrg- gallivm: add sample id/pos intrinsic support 13737ec681f3Smrg- gallivm: add mask api to force mask 13747ec681f3Smrg- nir/tgsi: translate the interp location 13757ec681f3Smrg- llvmpipe: pass interp location into interpolation code. 13767ec681f3Smrg- llvmpipe: add centroid interpolation support. 13777ec681f3Smrg- llvmpipe: add per-sample interpolation. 13787ec681f3Smrg- llvmpipe: move getting mask value out of depth code. (v2) 13797ec681f3Smrg- llvmpipe: add per-sample depth/stencil test 13807ec681f3Smrg- llvmpipe: move some fs code around 13817ec681f3Smrg- llvmpipe: multisample sample mask + early/late depth pass 13827ec681f3Smrg- llvmpipe: handle multisample early depth test/late depth write 13837ec681f3Smrg- llvmpipe: interpolate Z at sample points for early depth test. 13847ec681f3Smrg- llvmpipe: handle multisample color stores. 13857ec681f3Smrg- llvmpipe: hook up sample position system value 13867ec681f3Smrg- llvmpipe: add multisample alpha to coverage support. 13877ec681f3Smrg- llvmpipe: add multisample alpha to one support 13887ec681f3Smrg- llvmpipe: handle gl_SampleMask writing. 13897ec681f3Smrg- llvmpipe: don't allow branch to end for early Z with multisample 13907ec681f3Smrg- llvmpipe: pass mask store into interp for centroid interpolation 13917ec681f3Smrg- llvmpipe: move color storing earlier in frag shader 13927ec681f3Smrg- llvmpipe: fix multisample occlusion queries. 13937ec681f3Smrg- llvmpipe: disable opaque variant for multisample 13947ec681f3Smrg- llvmpipe: add new rast api to pass full 64-bit mask. 13957ec681f3Smrg- llvmpipe: add fixed point sample positions to scene. 13967ec681f3Smrg- llvmpipe: build 64-bit coverage mask in rasterizer 13977ec681f3Smrg- llvmpipe: fixup multisample coverage masks for covered tiles 13987ec681f3Smrg- llvmpipe: generate multisample triangle rasterizer functions (v2) 13997ec681f3Smrg- llvmpipe: choose multisample rasterizer functions per triangle (v2) 14007ec681f3Smrg- llvmpipe: choose correct position for multisample 14017ec681f3Smrg- llvmpipe: don't choose pixel centers for multisample 14027ec681f3Smrg- drisw: add multisample support to sw dri layer. 14037ec681f3Smrg- llvmpipe: enable 4x sample MSAA + texture multisample 14047ec681f3Smrg- gallivm/sample: add num samples query for txqs (v2) 14057ec681f3Smrg- gallivm/nir: hooks up texture samples queries 14067ec681f3Smrg- llvmpipe: enable GL_ARB_shader_texture_image_samples 14077ec681f3Smrg- llvmpipe: add min samples support to the fragment shader. 14087ec681f3Smrg- llvmpipe: enable ARB_sample_shading 14097ec681f3Smrg- llvmpipe: make sample position a global array. 14107ec681f3Smrg- zink: enable conditional rendering if available 14117ec681f3Smrg- r600: enable TEXCOORD semantic for TGSI. 14127ec681f3Smrg- r600/sfn: plumb the chip class into the instruction emission 14137ec681f3Smrg- r600/sfn: fix cayman float instruction emission. 14147ec681f3Smrg- r600/sfn: cayman fix int trans op2 14157ec681f3Smrg- r600/sfn: add callstack non-evergreen support 14167ec681f3Smrg- r600/sfn: add emit if start cayman support 14177ec681f3Smrg- llvmpipe: don't use sample mask with 0 samples 14187ec681f3Smrg- llvmpipe: use per-sample position not sample id for interp 14197ec681f3Smrg- llvmpipe/interp: fix interpolating frag pos for sample shading 14207ec681f3Smrg- llvmpipe: remove non-simple interpolation paths. 14217ec681f3Smrg- gallivm/nir: add an interpolation interface. 14227ec681f3Smrg- llvmpipe/interp: refactor out use of pixel center offset 14237ec681f3Smrg- llvmpipe/interp: refactor out centroid calculations 14247ec681f3Smrg- llvmpipe: add interp instruction support 14257ec681f3Smrg- llvmpipe/fs: hook up the interpolation APIs. 14267ec681f3Smrg- gallivm/nir: add sample_mask_in support 14277ec681f3Smrg- llvmpipe: add gl_SampleMaskIn support. 14287ec681f3Smrg- r600/sfn: fix nop channel assignment. 14297ec681f3Smrg- llvmpipe: compute shaders work better with all the threads. 14307ec681f3Smrg- llvmpipe: move coroutines out of noopt case 14317ec681f3Smrg- ci: bump virglrenderer to latest version 14327ec681f3Smrg- util/disk_cache: add fallback for disk_cache_get_function_identifier 14337ec681f3Smrg- llvmpipe/cs: overhaul cs variant key state. 14347ec681f3Smrg- llvmpipe/draw: drop variant number from function names. 14357ec681f3Smrg- gallivm: rework coroutine malloc/free callouts. 14367ec681f3Smrg- gallivm: rework debug printf hook to use global mapping. 14377ec681f3Smrg- gallivm: add support for a cache object 14387ec681f3Smrg- gallivm: skip operations if we have a cached object. 14397ec681f3Smrg- gallivm: add cache interface to mcjit 14407ec681f3Smrg- llvmpipe: add infrastructure for disk cache support 14417ec681f3Smrg- gallivm: don't cache shaders that use fetch functions. 14427ec681f3Smrg- llvmpipe/fs: add caching support 14437ec681f3Smrg- llvmpipe/cs: add shader caching 14447ec681f3Smrg- draw: add disk cache callbacks for draw shaders 14457ec681f3Smrg- llvmpipe: hook draw disk cache up 14467ec681f3Smrg- draw: add disk caching for draw shaders 14477ec681f3Smrg- draw/gs: fix emitting inactive primitives crash 14487ec681f3Smrg- draw/gs: add more info to debugging. 14497ec681f3Smrg- gallivm/nir: add group barrier support 14507ec681f3Smrg- llvmpipe: fix subpixel bits reporting. 14517ec681f3Smrg- gallivm/format: convert unsigned values to float properly. 14527ec681f3Smrg- gallivm/conv: enable conversion min code. (v2) 14537ec681f3Smrg- gallivm/sample: fix texel type for stencil 8-bit 14547ec681f3Smrg- llvmpipe/setup: add planes for draw regions if no scissor. 14557ec681f3Smrg- gallivm/cache: don't require a null terminator for cache data. 14567ec681f3Smrg- mesa/gles3: add support for GL_EXT_shader_group_vote 14577ec681f3Smrg- virgl: change vendor id to reflect reality more. 14587ec681f3Smrg- llvmpipe: change vendor to be more generic. 14597ec681f3Smrg- softpipe: change vendor name to something more generic. 14607ec681f3Smrg- gallivm/nir: fix const loading on big endian systems 14617ec681f3Smrg- glsl: fix constant packing for 64-bit big endian. 14627ec681f3Smrg- gallivm/nir: fix big-endian 64-bit splitting/merging. 14637ec681f3Smrg- llvmpipe: fix occlusion queries on big-endian. 14647ec681f3Smrg- mesa/get: fix enum16 big-endian getting. 14657ec681f3Smrg- draw/llvm: fix big-endian mask adjusting 14667ec681f3Smrg- draw: pass nr_samplers into llvm sample state creation. 14677ec681f3Smrg- llvmpipe: pass number of samplers into llvm sampler code. 14687ec681f3Smrg- gallivm/sample: change texture function generator api 14697ec681f3Smrg- gallivm: add indirect texture switch statement builder. 14707ec681f3Smrg- draw: add support for indirect texture access 14717ec681f3Smrg- llvmpipe: add support for indirect texture access. 14727ec681f3Smrg- gallivm/nir: add texture unit indexing 14737ec681f3Smrg- gallivm/nir: handle non-uniform texture offsets 14747ec681f3Smrg- gallivm/sample: pass indirect offset into texture/image units 14757ec681f3Smrg- llvmpipe/draw: wire up indirect offset 14767ec681f3Smrg- gallivm/sample: handle size unit offset 14777ec681f3Smrg- llvmpipe: enable ARB_gpu_shader5 14787ec681f3Smrg- draw: pass number of images to image soa create 14797ec681f3Smrg- llvmpipe: pass number of images into image soa create 14807ec681f3Smrg- gallivm/nir: support passing image index into image code. 14817ec681f3Smrg- gallivm/nir: refactor image operations for indirect support. 14827ec681f3Smrg- gallivm/img: refactor out the texel return type (v2) 14837ec681f3Smrg- gallivm/nir: add support for indirect image loading 14847ec681f3Smrg- draw/sample: add support for indirect images 14857ec681f3Smrg- llvmpipe: handle indirect images properly 14867ec681f3Smrg- ci: fixup tests after all indirect images fixes. 14877ec681f3Smrg- docs: update llvmpipe GL 4.0 status 14887ec681f3Smrg- draw/clip: cleanup viewport index handling code. 14897ec681f3Smrg- draw/clip: fix viewport index for geometry shaders 14907ec681f3Smrg- mesa/version: only enable GL4.1 with correct limits. 14917ec681f3Smrg- llvmpipe: bump texture/scene limits to enable GL 4.1 14927ec681f3Smrg- llvmpipe: bump to GL support to GL 4.1 14937ec681f3Smrg- llvmpipe: enable GL 4.2 14947ec681f3Smrg- gallivm/nir: call end prim at end on all GS streams. 14957ec681f3Smrg- draw: emit so primitives before ending empty pipeline. 14967ec681f3Smrg- draw/gs: fix up current verts in output fetching. 14977ec681f3Smrg- gallivm/draw/gs: pass vertex stream count into shader build 14987ec681f3Smrg- draw/gs: only allocate memory for streams needed. 14997ec681f3Smrg- gallivm/gs_iface: pass stream into end primitive interface. 15007ec681f3Smrg- gallivm/nir: don't access stream var outside bounds 15017ec681f3Smrg- gallivm/nir: end primitive for all streams. 15027ec681f3Smrg- draw: account primitive lengths for all streams. 15037ec681f3Smrg- draw/gs: reverse the polarity of the invocation/prims execution 15047ec681f3Smrg- draw: use common exit path in pipeline finish. 15057ec681f3Smrg- draw: free vertex info from geometry streams. 15067ec681f3Smrg- draw/gs: use mask to limit vertex emission. 15077ec681f3Smrg- ci/virgl: update results after streams fixes. 15087ec681f3Smrg- llvmpipe: add ARB_post_depth_coverage support. 15097ec681f3Smrg- llvmpipe: denote NEW fs when images change. 15107ec681f3Smrg- llvmpipe: flush resources on sampler view binding 15117ec681f3Smrg- llvmpipe/cs: fix image/sampler binding for compute 15127ec681f3Smrg- nouveau: avoid LTO ODR warning (v2) 15137ec681f3Smrg- gallivm/sample: always square rho before fast log2 15147ec681f3Smrg- llvmpipe/format: fix snorm conversion 15157ec681f3Smrg- mesa: change dsa texture error codes for GL 4.6 15167ec681f3Smrg- ci: bump piglit checkout for dsa tests 15177ec681f3Smrg- llvmpipe: fix stencil only formats. 15187ec681f3Smrg- llvmpipe: fix position offset interpolation 15197ec681f3Smrg- llvmpipe/cs: respect render condition 15207ec681f3Smrg- llvmpipe: add framebuffer fetching support (v1.1) 15217ec681f3Smrg- ci/llvmpipe: reenable gpu shader5 tests 15227ec681f3Smrg- llvmpipe: enable EXT_texture_shadow_lod 15237ec681f3Smrg- llvmpipe/draw: handle constant buffer limits and robustness (v1.1) 15247ec681f3Smrg- drisw: add robustness extension support. 15257ec681f3Smrg- glx/drisw: add robustness support 15267ec681f3Smrg- llvmpipe: add device reset query context hook. 15277ec681f3Smrg- llvmpipe: enable robust buffer access + GL 4.3, GLES 3.2 and robust buffer access behaviour 15287ec681f3Smrg- llvmpipe/ms: fix sign extension bug in rasterizer. 15297ec681f3Smrg- Revert "llvmpipe: Use the default behavior of ALLOW_MAPPED_BUFFERS." 15307ec681f3Smrg- radv: cleanup locking around timeline waiting. 15317ec681f3Smrg- llvmpipe: only read 0 for channels being read 15327ec681f3Smrg- llvmpipe/blit: for 32-bit unorm depth blits just copy 32-bit 15337ec681f3Smrg- llvmpipe: enable GL 4.5 15347ec681f3Smrg- llvmpipe/cs: update compute counters not fragment shader. 15357ec681f3Smrg- llvmpipe: include gallivm perf flags in shader cache. 15367ec681f3Smrg- gallivm: disable brilinear for lod bias and explicit lod. 15377ec681f3Smrg 15387ec681f3SmrgDavid McFarland (1): 15397ec681f3Smrg 15407ec681f3Smrg- radv: link with ld_args_build_id 15417ec681f3Smrg 15427ec681f3SmrgDavid Stevens (2): 15437ec681f3Smrg 15447ec681f3Smrg- nir: Add colorspace support to YUV lowering pass 15457ec681f3Smrg- i965/i915: Add colorspace support to YUV sampling 15467ec681f3Smrg 15477ec681f3SmrgDenys (1): 15487ec681f3Smrg 15497ec681f3Smrg- gitlab: Ask about reproduction rate in the issue template 15507ec681f3Smrg 15517ec681f3SmrgDmitriy Nester (8): 15527ec681f3Smrg 15537ec681f3Smrg- mesa: check draw buffer completeness on glClearBufferfv/glClearBufferuiv 15547ec681f3Smrg- nir: replace fnv1a hash function with xxhash 15557ec681f3Smrg- freedreno: replace fnv1a hash function with xxhash 15567ec681f3Smrg- i965: replace fnv1a hash function with xxhash 15577ec681f3Smrg- util/hash_table: replace fnv1a hash function with xxhash 15587ec681f3Smrg- r600: replace fnv1a hash function with xxhash 15597ec681f3Smrg- zink: replace fnv1a hash function with xxhash 15607ec681f3Smrg- util: delete fnv1a hash function 15617ec681f3Smrg 15627ec681f3SmrgDuncan Hopkins (1): 15637ec681f3Smrg 15647ec681f3Smrg- zink. Changed sampler default name. 15657ec681f3Smrg 15667ec681f3SmrgDylan Baker (41): 15677ec681f3Smrg 15687ec681f3Smrg- docs: Add release notes for 20.0.6 15697ec681f3Smrg- docs: Add SHA256 sums for 20.0.6 15707ec681f3Smrg- docs: update calendar, add news item, and link releases notes for 20.0.6 15717ec681f3Smrg- docs: Add release notes for 20.0.7 15727ec681f3Smrg- docs/relnotes Add sha256 sums to 20.0.7 15737ec681f3Smrg- docs: update calendar, add news item, and link releases notes for 20.0.7 15747ec681f3Smrg- tests: Make tests aware of meson test wrapper 15757ec681f3Smrg- meson: Bump required version to 0.52.0 15767ec681f3Smrg- meson: Use the check_header function 15777ec681f3Smrg- meson: Use build_always_stale instead of build_always 15787ec681f3Smrg- meson: Use builtins for checking gnu __attributes__ 15797ec681f3Smrg- drm-shim/meson: The name of the target is a string not a list 15807ec681f3Smrg- drm-shim/meson: Use portable override_options for setting C standard 15817ec681f3Smrg- meson: use gnu_symbol_visibility argument 15827ec681f3Smrg- meson: use 2 space not 3 space indent 15837ec681f3Smrg- meson: deprecated 'true' and 'false' in combo options for 'enabled' and 'disabled' 15847ec681f3Smrg- vulkan-overlay/meson: use install_data instead of configure_file 15857ec681f3Smrg- docs: Add release notes for 20.0.8 15867ec681f3Smrg- docs: Add sha256sums for 20.0.8 15877ec681f3Smrg- docs: update calendar, add news item, and link releases notes for 20.0.8 15887ec681f3Smrg- mesa/swrast: use logf2 instead of util_fast_log2 15897ec681f3Smrg- VERSION: bump for 20.2.0-rc1 15907ec681f3Smrg- .pick_status.json: Update to 9333a8570d2174b73da63c3ee6f1a740ae487ab8 15917ec681f3Smrg- .pick_status.json: Update to 1e28745bc0d3528c1dfc25459456849feb58d407 15927ec681f3Smrg- meson/freedreno: Fix lua requirement 15937ec681f3Smrg- .pick_status.json: Update to fdb97d3d2914c8f887a7968432db4fdbd35d8376 15947ec681f3Smrg- bump version for 20.2.0-rc2 15957ec681f3Smrg- .pick_status.json: Update to 61042b1bdb199f98dd34085ed29a8c492ed9b2a3 15967ec681f3Smrg- .pick_status.json: Update to 6d28270968e0728bf8bdf48a6abd261c50d9ef07 15977ec681f3Smrg- .pick_status.json: Update to ca7d66e847d08914cec0a5e003b400da9c0a2695 15987ec681f3Smrg- VERSION: bump for 20.2.0-rc3 15997ec681f3Smrg- .pick_status.json: Update to 7fbded8b5821a47c26245b181446f972f920a96e 16007ec681f3Smrg- .pick_status.json: Mark e93979ba599355c42df01a89073362b970489a3a as denominated 16017ec681f3Smrg- .pick_status.json: Update to b9927c8c8d0c105699306a68773c015930ff9509 16027ec681f3Smrg- VERSION: bump for 20.2.0-rc4 16037ec681f3Smrg- .pick_status.json: Update to ef980ac0c1cd65993ba0c1d20e1c09b45bfef99d 16047ec681f3Smrg- fix: gallivm: disable brilenear for lod bias and explicit lod. 16057ec681f3Smrg- .pick_status.json: Update to a1f46d7b6943699e5efb60fbcfdd1450db85adb1 16067ec681f3Smrg- amd/ac_surface: convert tabs to 3 spaces 16077ec681f3Smrg- .pick_status.json: Update to 90b98c06493f8a9759e5496d5ec91fb60edf7b92 16087ec681f3Smrg- .pick_status.json: Update to 472a20c5fc0feda0f074b4ff95fd7c7a6305c8cd 16097ec681f3Smrg 16107ec681f3SmrgEduardo Lima Mitev (2): 16117ec681f3Smrg 16127ec681f3Smrg- freedreno: Centralize UUID generation into new files freedreno_uuid.c/h 16137ec681f3Smrg- freedreno/uuid: Generate meaningful device and driver UUID 16147ec681f3Smrg 16157ec681f3SmrgElie Tournier (12): 16167ec681f3Smrg 16177ec681f3Smrg- virgl: implement ARB_clear_texture 16187ec681f3Smrg- virgl: Enable CAP_CLEAR_TEXTURE if host supports it 16197ec681f3Smrg- docs/features: Add ARB_clear_texture to virgl 16207ec681f3Smrg- gallium: add TGSI_PROPERTY_FS_BLEND_EQUATION_ADVANCED 16217ec681f3Smrg- glsl_to_tgsi: Set TGSI_PROPERTY_FS_BLEND_EQUATION_ADVANCED 16227ec681f3Smrg- virgl: Reserved last caps of capability_bits 16237ec681f3Smrg- gallium: Add PIPE_CAP_BLEND_EQUATION_ADVANCED 16247ec681f3Smrg- st: expose KHR_blend_equation_advanced if PIPE_CAP_BLEND_EQUATION_ADVANCED 16257ec681f3Smrg- glsl_to_ir: do lower_blend_equation if PIPE_CAP_FBFETCH 16267ec681f3Smrg- virgl: Use alpha_src_factor to store blend_equation_advenced value 16277ec681f3Smrg- virgl: Encode barrier for blend_equation_advanced 16287ec681f3Smrg- virgl: set PIPE_CAP_BLEND_EQUATION_ADVANCED 16297ec681f3Smrg 16307ec681f3SmrgEmmanuel (3): 16317ec681f3Smrg 16327ec681f3Smrg- meson: Do not enable USE_ELF_TLS for FreeBSD 16337ec681f3Smrg- iris: Explicitly cast value to uint64_t 16347ec681f3Smrg- i965: Explicitly cast value to uint64_t 16357ec681f3Smrg 16367ec681f3SmrgEmmanuel Gil Peyrot (2): 16377ec681f3Smrg 16387ec681f3Smrg- util/rand_xor: use getrandom() when available 16397ec681f3Smrg- Expose EGL_KHR_platform_* when EXT is supported 16407ec681f3Smrg 16417ec681f3SmrgEmmanuel Vadot (1): 16427ec681f3Smrg 16437ec681f3Smrg- meson: Add versioning for xvmc tracker 16447ec681f3Smrg 16457ec681f3SmrgEric Anholt (228): 16467ec681f3Smrg 16477ec681f3Smrg- freedreno/ir3: Initialize the unused dwords of the immediates consts. 16487ec681f3Smrg- freedreno/ir3: Drop redundant IR3_REG_HALF setup in ALU ops. 16497ec681f3Smrg- freedreno/ir3: Leave bools as 1-bit, storing them in full regs. 16507ec681f3Smrg- freedreno/ir3: Set up the block predecessors for a3xx TF 16517ec681f3Smrg- freedreno/ir3: Fix the a3xx TF outputs stores. 16527ec681f3Smrg- freedreno/ir3: Fix register allocation assertion failures. 16537ec681f3Smrg- freedreno: Stop doing binning shaders other than the VS in shader-db. 16547ec681f3Smrg- freedreno/ir3: Skip tess epilogue if the program is missing stores. 16557ec681f3Smrg- freedreno: Fix assertion failures on GS/tess shaders with shader-db enabled. 16567ec681f3Smrg- freedreno/ir3: Remove unused half precision shader key flag. 16577ec681f3Smrg- freedreno: Emit debug messages when doing draw-time recompiles of shaders. 16587ec681f3Smrg- freedreno/ir3: Improve shader key normalization. 16597ec681f3Smrg- freedreno/ir3: Stop initializing regid of so->outputs during setup. 16607ec681f3Smrg- freedreno/ir3: Set up outputs for multi-slot varyings. 16617ec681f3Smrg- freedreno: Immediately compile a default variant of shaders. 16627ec681f3Smrg- freedreno/ir3: Set the FS .msaa flag to true during precompiles. 16637ec681f3Smrg- freedreno/ir3: Add some more tests of cat6 disasm. 16647ec681f3Smrg- freedreno/ir3: Sync some new changes from envytools. 16657ec681f3Smrg- freedreno/ir3: Define the bindful uniform/nonuniform desc modes for cat6 a6xx. 16667ec681f3Smrg- freedreno/ir3: Disable sin/cos range reduction for mediump. 16677ec681f3Smrg- ci: Clean up setup of the job-specific env vars in baremetal testing. 16687ec681f3Smrg- ci: Enable IRC flake reporting on freedreno baremetal boards. 16697ec681f3Smrg- ci: Improve the flakes reports on IRC. 16707ec681f3Smrg- ci: Fix the nick used in IRC reporting. 16717ec681f3Smrg- freedreno: Deduplicate ringbuffer macros with computerator/fdperf 16727ec681f3Smrg- freedreno: Clean up tests around ORing in the reloc flags. 16737ec681f3Smrg- freedreno: Rename append_bo() in case it doesn't get inlined. 16747ec681f3Smrg- freedreno: Initialize the bo's iova at creation time. 16757ec681f3Smrg- freedreno: Start moving relocs flags into the BOs. 16767ec681f3Smrg- freedreno: Replace OUT_RELOCD with permanently flagging shader BOs for it. 16777ec681f3Smrg- freedreno: Mark all ringbuffer BOs as to be dumped on crash. 16787ec681f3Smrg- freedreno: Tell the kernel that all BOs are for writing. 16797ec681f3Smrg- freedreno: Replace OUT_RELOCW with OUT_RELOC. 16807ec681f3Smrg- freedreno: Drop the "write" arg to emit_const_bo now relocs don't care. 16817ec681f3Smrg- nir: Fix count when we didn't lower load_uniforms but did shift load_ubos. 16827ec681f3Smrg- freedreno: Fix non-constbuf-upload UBO block indices and count. 16837ec681f3Smrg- freedreno: Add a nohw flag to skip submitting to the kernel. 16847ec681f3Smrg- freedreno: Split the fd_batch_resource_used by read vs write. 16857ec681f3Smrg- freedreno: Add an early out for preparing to read a resource. 16867ec681f3Smrg- freedreno: Move the resource_read early out to an inline. 16877ec681f3Smrg- freedreno: Skip taking the lock for resource usage if it's already flagged. 16887ec681f3Smrg- freedreno/a4xx+: Increase max texture size to 16384. 16897ec681f3Smrg- freedreno/a6xx: Improve layout testcase logging for UBWC fails. 16907ec681f3Smrg- freedreno/a6xx: Add a testcase for UBWC buffer sharing. 16917ec681f3Smrg- freedreno: Pull the tile_alignment lookup for a layout to a helper. 16927ec681f3Smrg- freedreno/a6xx: Fix UBWC blockheight for RG8. 16937ec681f3Smrg- freedreno/a6xx: Fix UBWC mipmap sizing. 16947ec681f3Smrg- freedreno/a6xx: Fix UBWC mipmapping height alignment. 16957ec681f3Smrg- nir: Include num_ubos in the printed shader (if nonzero). 16967ec681f3Smrg- freedreno/ir3: Clean up a silly nir_src_for_ssa(src.ssa). 16977ec681f3Smrg- freedreno/ir3: Leave the cursor alone during ir3_nir_try_propagate_bit_shift. 16987ec681f3Smrg- freedreno/ir3: Move i/o offset lowering after analyze_ubo_ranges. 16997ec681f3Smrg- freedreno: Trim num_ubos to just the ones we haven't lowered to constbuf. 17007ec681f3Smrg- freedreno/a6xx: Use LDC for UBO loads. 17017ec681f3Smrg- freedreno: Drop the noubo fails list for CI, since there aren't any now. 17027ec681f3Smrg- freedreno: Fix attempts to push UBO contents past the constlen on pre-a6xx. 17037ec681f3Smrg- freedreno: Fix resource layout dump loop. 17047ec681f3Smrg- freedreno: Avoid duplicate BO relocs in FD_RINGBUFFER_OBJECTs. 17057ec681f3Smrg- ci: Move cross file generation to a shared script. 17067ec681f3Smrg- ci: Autodetect whether we need cross setup in lava_arm builds. 17077ec681f3Smrg- ci: Make cmake toolchain file for deqp cross build setup. 17087ec681f3Smrg- ci: Make the create-rootfs more resilient. 17097ec681f3Smrg- ci: Update versions of packages to remove from rootfses. 17107ec681f3Smrg- ci: Switch the baremetal runner to be an x86 docker image. 17117ec681f3Smrg- ci: Disable SMP on the a5xx boards. 17127ec681f3Smrg- ci: Make a530's GLES3/31 fractional runs much more complete. 17137ec681f3Smrg- freedreno/a5xx: Move resource layout to fdl. 17147ec681f3Smrg- freedreno/fdl: Separate the list of a6xx testcases from the the test code. 17157ec681f3Smrg- freedreno/a5xx: Add the outline of a unit test for a5xx layout. 17167ec681f3Smrg- freedreno/a5xx: Set MIN_LAYERSZ on 3D textures like we do on a6xx. 17177ec681f3Smrg- freedreno/a5xx: Define the 2D blit UBWC pitch fields 17187ec681f3Smrg- ci: Fix DEQP_CASELIST_FILTER (used by a630 noubo run) 17197ec681f3Smrg- ci: Do an explicit NIR validation-enabled pass on freedreno a630. 17207ec681f3Smrg- ci: Don't forget to set NIR_VALIDATE in baremetal runs. 17217ec681f3Smrg- ci: Enable a fractional run with UBO-to-constbuf disabled on a3xx. 17227ec681f3Smrg- ci: Improve baremetal's logging of the job env var passthrough. 17237ec681f3Smrg- freedreno/a6xx: Fix the size of buffer image views. 17247ec681f3Smrg- freedreno: Fix printing of unused src in disasm of cat6 RESINFO. 17257ec681f3Smrg- freedreno: Add more resinfo/ldgb testcases. 17267ec681f3Smrg- freedreno: Fix resinfo asm, which doesn't have srcs besides IBO number. 17277ec681f3Smrg- freedreno: Set the immediate flag in a4/a5xx resinfos. 17287ec681f3Smrg- freedreno/ir3: Refactor out IBO source references. 17297ec681f3Smrg- freedreno/ir3: Move handle_bindless_cat6 to compiler_nir and reuse. 17307ec681f3Smrg- freedreno/ir3: Use RESINFO for a6xx image size queries. 17317ec681f3Smrg- ci: Drop double ".txt" suffix on the unexpected results file. 17327ec681f3Smrg- ci: Drop old comment about enabling --deqp-watchdog. 17337ec681f3Smrg- ci: Auto-detect the architecture for VK ICD filenames. 17347ec681f3Smrg- ci: Add DEQP_EXPECTED_RENDERER support for VK tests. 17357ec681f3Smrg- ci: Move baremetal DEQP_NO_SAVE_RESULTS setup to the yml. 17367ec681f3Smrg- ci: Quick exit qpa extraction for non-matching qpas. 17377ec681f3Smrg- ci: Disable the firmware loader user helper option in arm64 kernels. 17387ec681f3Smrg- ci: Build a cheza kernel. 17397ec681f3Smrg- ci: Add scripts for controlling bare-metal chezas. 17407ec681f3Smrg- ci: Switch cheza (freedreno a630) testing to baremetal. 17417ec681f3Smrg- ci: Don't build an arm_test container now that the last user is gone. 17427ec681f3Smrg- ci: Rename x86_cross_arm_test to just arm_test. 17437ec681f3Smrg- turnip: Move vertex buffer bindings to SET_DRAW_STATE. 17447ec681f3Smrg- turnip: Don't bother clamping VB size. 17457ec681f3Smrg- turnip: Simplify vertex buffer bindings. 17467ec681f3Smrg- turnip: Use tu_cs_emit_regs() for BLEND_CONTROL. 17477ec681f3Smrg- turnip: Add support for alphaToOne. 17487ec681f3Smrg- freedreno/a6xx: Add support for ALPHA_TO_ONE. 17497ec681f3Smrg- freedreno: Upload gallium constbufs as needed when referenced as a UBO. 17507ec681f3Smrg- freedreno/ir3: Refactor ir3_cp's lower_immed(). 17517ec681f3Smrg- freedreno/ir3: Stop pushing immediates once we've filled the constbuf. 17527ec681f3Smrg- freedreno/ir3: Drop unnecessary alignment of pushed UBO size. 17537ec681f3Smrg- freedreno/ir3: Stop shifting UBO 1 down to be UBO 0. 17547ec681f3Smrg- freedreno/ir3: Account for driver params in UBO max const upload. 17557ec681f3Smrg- freedreno/ir3: Drop the max_const on a6xx to 512. 17567ec681f3Smrg- freedreno/ir3: Handle cases where we decide not to lower UBO 0 loads. 17577ec681f3Smrg- turnip: Fix crashes in compute with no descriptors to load. 17587ec681f3Smrg- ci: Bump up to the current version of the VK CTS. 17597ec681f3Smrg- ci: Disable shader cache on vulkan CI runs. 17607ec681f3Smrg- ci: Build the full VK CTS for baremetal testing. 17617ec681f3Smrg- ci: Enable pre-merge fractional vulkan CTS runs on the turnip driver. 17627ec681f3Smrg- ci: Use rsync for initial nfsroot population on cheza. 17637ec681f3Smrg- turnip: Expose robustBufferAccess. 17647ec681f3Smrg- freedreno/a6xx: Fix clip_halfz support. 17657ec681f3Smrg- ci: Leave a note as to what might be going on with a test. 17667ec681f3Smrg- ci: Fix weird filesystem globs appearing in failed test .qpa files. 17677ec681f3Smrg- ci: Disable some flaky tests on turnip. 17687ec681f3Smrg- ci/bare-metal: Reword the final output of the init script on the board. 17697ec681f3Smrg- ci/bare-metal: Make which test to run configurable. 17707ec681f3Smrg- ci/bare-metal: Use the deqp-runner bits straight out of the artifacts. 17717ec681f3Smrg- ci/bare-metal: Stop fetching the git tree. 17727ec681f3Smrg- ci/bare-metal: Terminate the job with an error on kernel panic. 17737ec681f3Smrg- docs: Replace ancient swrast conformance docs with more current information. 17747ec681f3Smrg- docs: Add dri-devel to the mailing lists and drop the DRI wiki link. 17757ec681f3Smrg- ci: disable the windows tests until the runner can be stabilized again 17767ec681f3Smrg- ci: Bump vulkan CTS to 1.2.3.0. 17777ec681f3Smrg- ci: Enable NIR validation on a630 GLES2 and VK tests. 17787ec681f3Smrg- ci/bare-metal: Skip setting of unset variables at startup. 17797ec681f3Smrg- ci/bare-metal: Don't include dev packages in arm*test. 17807ec681f3Smrg- ci/tracie: Print the path if the trace isn't found. 17817ec681f3Smrg- ci/tracie: Fix apitrace dump using "less" which isn't in the ARM rootfs. 17827ec681f3Smrg- ci: Add a freedreno a630 tracie run. 17837ec681f3Smrg- freedreno/a6xx: Define the register fields for polygon fill mode. 17847ec681f3Smrg- turnip: Add support for polygon fill modes. 17857ec681f3Smrg- freedreno/a6xx: Add support for polygon fill mode (as long as front==back). 17867ec681f3Smrg- ci: Remove a stray "always" on the freedreno traces job. 17877ec681f3Smrg- ci/bare-metal: Fail early when we get stuck powering on a cheza. 17887ec681f3Smrg- ci/baremetal: Bump the kernel to a recent drm-msm-fixes for msm semaphores. 17897ec681f3Smrg- turnip: Do better TU_DEBUG=startup logging of drmGetDevices2() failure. 17907ec681f3Smrg- turnip: Fix error handling of DRM_MSM_GEM_INFO ioctls. 17917ec681f3Smrg- turnip: Properly return VK_DEVICE_LOST on queuesubmit failures. 17927ec681f3Smrg- gallium/util: Add a helper function for point sprite handling. 17937ec681f3Smrg- vc4: Enable PIPE_CAP_TGSI_TEXCOORD. 17947ec681f3Smrg- v3d: Enable PIPE_CAP_TGSI_TEXCOORD. 17957ec681f3Smrg- v3d: Fix -Wmaybe-uninitialized compiler warning in the v33 code. 17967ec681f3Smrg- ci: Disable pixmark-piano trace on a630 due to GPU hangs. 17977ec681f3Smrg- util: Avoid strict aliasing bugs in xxhash. 17987ec681f3Smrg- util: Mark util_format_description() as a const function. 17997ec681f3Smrg- softpipe: Clean up softpipe's SSBO load/store interpreting instructions. 18007ec681f3Smrg- util: Remove unused util_format_planar_is_supported(). 18017ec681f3Smrg- etnaviv: Use the util_pack_color_union() helper. 18027ec681f3Smrg- gallium/util: Fix location of the comment about S8_UINT handling. 18037ec681f3Smrg- gallium/util: Clean up the Z/S tile write path. 18047ec681f3Smrg- gallium/util: Move the Z/S handling to the outside of get_tile(). 18057ec681f3Smrg- svga: Reuse util_format_unpack_rgba(). 18067ec681f3Smrg- util: Merge util_format_write_4* functions. 18077ec681f3Smrg- util: Merge util_format_read_4* functions. 18087ec681f3Smrg- util: Use designated initializers to clean up the format tables' pack/unpack. 18097ec681f3Smrg- llvmpipe: Generalize "could llvmpipe fetch this format" check in unit testing. 18107ec681f3Smrg- util: Remove the stub pack/unpack functions for YUV formats. 18117ec681f3Smrg- util: Share a single function pointer for the 4-byte rgba unpack function. 18127ec681f3Smrg- docs: Move the current CI .rst doc to docs/ci/ and link to it from .gitlab-ci. 18137ec681f3Smrg- docs: Move the conformance and the CI docs to a top level Testing section. 18147ec681f3Smrg- docs: Move the gitlab-ci docs to RST. 18157ec681f3Smrg- docs: Relax the expectations of HW CI farms. 18167ec681f3Smrg- docs: Document how to interact with docker containers. 18177ec681f3Smrg- freedreno/ir3_cmdline: Fix an uninit var warning. 18187ec681f3Smrg- freedreno/ir3: Fix uninit var warning. 18197ec681f3Smrg- intel: Fix release-build warnings about sf_entry_size. 18207ec681f3Smrg- intel/perf: Fix unused var warning in release builds. 18217ec681f3Smrg- intel/perf: Move perf query register programming to static tables. 18227ec681f3Smrg- freedreno/a2xx: Fix compiler warning in disasm. 18237ec681f3Smrg- meson: Enable GCing of functions and data from compilation units by default. 18247ec681f3Smrg- freedreno/ir3: Fix duplicated fine derivatives instructions. 18257ec681f3Smrg- freedreno/ir3: Add unit tests for derivatives disasm. 18267ec681f3Smrg- ci: Use FDO_CI_CONCURRENT as our -j flags when present in the runner env. 18277ec681f3Smrg- freedreno/ir3: Add a note about the instructions in the disasm test. 18287ec681f3Smrg- freedreno/ir3: Add a bunch more tests for cat6 opcodes. 18297ec681f3Smrg- freedreno/ir3: Refactor cat6 general dst printing. 18307ec681f3Smrg- freedreno/ir3: Fix disasm of register offsets in ldp/stp. 18317ec681f3Smrg- freedreno/ir3: Add missing ld_args_build_id to the ir3_delay unit test. 18327ec681f3Smrg- ci: Set XDG_CACHE_HOME to tmpfs for bare-metal runners to avoid NFS. 18337ec681f3Smrg- ci: Update checksums for freedreno traces. 18347ec681f3Smrg- llvmpipe: Remove a bunch of default handling of pipe caps. 18357ec681f3Smrg- llvmpipe: Use the default behavior of ALLOW_MAPPED_BUFFERS. 18367ec681f3Smrg- softpipe: Remove a bunch of default handling of pipe caps. 18377ec681f3Smrg- softpipe: Use the default behavior of ALLOW_MAPPED_BUFFERS. 18387ec681f3Smrg- virgl: Remove a bunch of default handling of pipe caps. 18397ec681f3Smrg- swr: Remove a bunch of default handling of pipe caps. 18407ec681f3Smrg- swr: Use the default behavior of ALLOW_MAPPED_BUFFERS. 18417ec681f3Smrg- svga: Remove a bunch of default handling of pipe caps. 18427ec681f3Smrg- i915: Remove a bunch of default handling of pipe caps. 18437ec681f3Smrg- softpipe: Refactor pipe_shader_state setup. 18447ec681f3Smrg- softpipe: Convert to comma-separated SOFTPIPE_DEBUG for debug options. 18457ec681f3Smrg- softpipe: Add support for reporting shader-db output. 18467ec681f3Smrg- softpipe: Enable PIPE_CAP_TGSI_TEXCOORD. 18477ec681f3Smrg- softpipe: Enable PIPE_CAP_TGSI_ANY_REG_AS_ADDRESS; 18487ec681f3Smrg- ci/bare-metal: Capture the first devcoredump a job produces. 18497ec681f3Smrg- drm-shim: Return -EINVAL instead of abort()ing on unknown ioctls. 18507ec681f3Smrg- docs: Explain how to set up a personal gitlab runner. 18517ec681f3Smrg- nir: Add a pass to cut the trailing ends of vectors. 18527ec681f3Smrg- i965: Enable vector shrinking in the vec4 backend. 18537ec681f3Smrg- amd: Swap from nir_opt_shrink_load() to nir_opt_shrink_vectors(). 18547ec681f3Smrg- nir: Remove the old nir_opt_shrink_load. 18557ec681f3Smrg- freedreno: Fix "Offset of packed bitfield changed" warnings: 18567ec681f3Smrg- nir/lower_amul: Use num_ubos/ssbos instead of recomputing it. 18577ec681f3Smrg- nir: Add a little more docs about NIR's constant_data. 18587ec681f3Smrg- nir: Print the constant data size associated with a shader. 18597ec681f3Smrg- freedreno/ir3: Fix the type of half-float indirect uniform loads. 18607ec681f3Smrg- freedreno/a6xx: Document the bit for the magic 32bit-uniforms-as-16b mode. 18617ec681f3Smrg- freedreno/computerator: Set SP_MODE_CONTROL to the same value as vulkan/GL 18627ec681f3Smrg- freedreno/ir3: Merge the redundant immediate_idx/immediates_count fields 18637ec681f3Smrg- freedreno/ir3: Simpify the immediates from an array of vec4 to array of dwords. 18647ec681f3Smrg- freedreno: Rename emit_const_bo() to emit_const_ptrs(). 18657ec681f3Smrg- freedreno: Split ir3_const's user buffer and indirect upload APIs. 18667ec681f3Smrg- freedreno/ir3: Clean up instrlen setup. 18677ec681f3Smrg- freedreno: Increase the NUM_UNIT on compute's consts in indirect dispatch. 18687ec681f3Smrg- freedreno: Add more asserts for DST_OFF/NUM_UNIT in indirect const uploads. 18697ec681f3Smrg- freedreno/ir3: Fix assertion failures dumping CS high full regs. 18707ec681f3Smrg- turnip: Make sure we include the build id. 18717ec681f3Smrg- gallium/tgsi_exec: Fix up NumOutputs counting 18727ec681f3Smrg- freedreno: Make the pack struct have a .qword for wide addresses. 18737ec681f3Smrg- turnip: Fix truncation of CS shader iovas to 32 bits. 18747ec681f3Smrg- turnip: Fix truncation of iovas to 32 bits in queries. 18757ec681f3Smrg 18767ec681f3SmrgEric Engestrom (146): 18777ec681f3Smrg 18787ec681f3Smrg- cut 20.1 branch 18797ec681f3Smrg- docs: update calendar for 20.1.0-rc2 18807ec681f3Smrg- post_version.py: fix branch name construction for release candidates 18817ec681f3Smrg- post_version.py: invert `is_point` into `is_first_release` to make its purpose clearer 18827ec681f3Smrg- post_version.py: stop adding release candidates to the index and relnotes 18837ec681f3Smrg- docs: update calendar for 20.1.0-rc3 18847ec681f3Smrg- gitlab-ci: exclude scripts that don't affect the build 18857ec681f3Smrg- util/rand_xor: make it clear that {,s_}rand_xorshift128plus take *exactly 2* uint64_t 18867ec681f3Smrg- util/rand_xor: drop unused header 18877ec681f3Smrg- util/rand_xor: fallback Linux to time-based instead of fixed seed 18887ec681f3Smrg- util/rand_xor: extend the urandom path to all non-Windows platforms 18897ec681f3Smrg- docs: update calendar for 20.1.0-rc4 18907ec681f3Smrg- anv: pass the fd directly to anv_gem_reg_read() 18917ec681f3Smrg- anv: replace magic `| 1` with already #define'd name 18927ec681f3Smrg- anv: disable VK_EXT_calibrated_timestamps when the timestamp register is unreadable 18937ec681f3Smrg- git_sha1_gen.py: fix out-of-date comment 18947ec681f3Smrg- git_sha1_gen.py: fix code style 18957ec681f3Smrg- git_sha1_gen.py: fix whitespace 18967ec681f3Smrg- compiler: delete leftover autotools test wrapper 18977ec681f3Smrg- no_extern_c.h: fix typo in comment 18987ec681f3Smrg- tree-wide: fix deprecated GitLab URLs 18997ec681f3Smrg- docs: drop no-longer-relevant comment about bugzilla 19007ec681f3Smrg- docs: Add release notes for 20.1.0 19017ec681f3Smrg- docs: update calendar, add news item, and link releases notes for 20.1.0 19027ec681f3Smrg- meson: remove "empty array"/"array of an empty string" confusion 19037ec681f3Smrg- glapi: remove deprecated .getchildren() that has been replace with an iterator 19047ec681f3Smrg- intel/genxml: drop sort_xml.sh and move the loop directly in gen_sort_tags.py 19057ec681f3Smrg- intel: fix gen_sort_tags.py 19067ec681f3Smrg- docs: Add release notes for 20.1.1 19077ec681f3Smrg- docs: update calendar, add news item, and link releases notes for 20.1.1 19087ec681f3Smrg- v3d: add missing unlock() in error path 19097ec681f3Smrg- intel/genxml: drop python 2 support for gen_sort_tags.py 19107ec681f3Smrg- intel/genxml: replace gen_sort_tags.py MIT licence with SPDX equivalent 19117ec681f3Smrg- docs: update the blocks of unused EGL enums assigned to us 19127ec681f3Smrg- i965: drop dead #include "config.h" 19137ec681f3Smrg- iris: drop dead #include "config.h" 19147ec681f3Smrg- gen_release_notes.py: update script to the new rST way of things 19157ec681f3Smrg- post_version.py: update script to the new rST way of things 19167ec681f3Smrg- intel/tools: rewrite run-test.sh in python 19177ec681f3Smrg- intel/tools: make test aware of the meson test wrapper 19187ec681f3Smrg- khronos-update.py: add script to simplify update of Khronos headers & xml files 19197ec681f3Smrg- docs: remove plain-text copy of versions.rst 19207ec681f3Smrg- util/os_file: replace broken windows-detection code with detect_os.h 19217ec681f3Smrg- util: introduce os_dupfd_cloexec() helper 19227ec681f3Smrg- replace all F_DUPFD_CLOEXEC with os_dupfd_cloexec() 19237ec681f3Smrg- vulkan/wsi: replace all dup() with os_dupfd_cloexec() 19247ec681f3Smrg- radv: replace all dup() with os_dupfd_cloexec() 19257ec681f3Smrg- anv: replace all dup() with os_dupfd_cloexec() 19267ec681f3Smrg- iris: replace all dup() with os_dupfd_cloexec() 19277ec681f3Smrg- i965: replace all dup() with os_dupfd_cloexec() 19287ec681f3Smrg- egl: replace all dup() with os_dupfd_cloexec() 19297ec681f3Smrg- etnaviv: replace all dup() with os_dupfd_cloexec() 19307ec681f3Smrg- freedreno: replace all dup() with os_dupfd_cloexec() 19317ec681f3Smrg- svga: replace all dup() with os_dupfd_cloexec() 19327ec681f3Smrg- virgl: replace all dup() with os_dupfd_cloexec() 19337ec681f3Smrg- docs: publish our release maintainers' keys 19347ec681f3Smrg- docs: remind release maintainers to sign the tarballs and publish their key 19357ec681f3Smrg- docs: suggest alternative installation methods for meson 19367ec681f3Smrg- docs: stop considering `Cc: mesa-stable` as an email address 19377ec681f3Smrg- docs: reword "sending a patch revision" to "updating a merge request" 19387ec681f3Smrg- docs: drop `git sendemail` instructions 19397ec681f3Smrg- docs: prefer `Fixes:` over `Cc: mesa-stable` 19407ec681f3Smrg- docs: add some formatting to the "backport merge request" option 19417ec681f3Smrg- docs: reword a sentence a bit 19427ec681f3Smrg- docs: make it clear that the tags needs to be in the commit message 19437ec681f3Smrg- docs: move `Fixes:` tag explanation to its own section 19447ec681f3Smrg- docs: move "stable" tag explanation next to `Fixes:` 19457ec681f3Smrg- driconf: drop 28% catalan translation 19467ec681f3Smrg- driconf: drop 15% german translation 19477ec681f3Smrg- driconf: drop 26% spanish translation 19487ec681f3Smrg- driconf: drop 6% french translation 19497ec681f3Smrg- driconf: drop 8% dutch translation 19507ec681f3Smrg- driconf: drop 9% swedish translation 19517ec681f3Smrg- driconf: drop now unused translation facility 19527ec681f3Smrg- util: rename xmlpool.h to driconf.h 19537ec681f3Smrg- gitlab-ci: drop gettext from the build images 19547ec681f3Smrg- docs: drop deleted file from extra sphinx files 19557ec681f3Smrg- docs: cat maintainer keys to a single file 19567ec681f3Smrg- docs: add some padding to the release calendar 19577ec681f3Smrg- docs: add planning for 20.2 19587ec681f3Smrg- bin/symbols-check: explain C++ symbols workaround 19597ec681f3Smrg- docs: Add release notes for 20.1.2 19607ec681f3Smrg- docs: update calendar and link releases notes for 20.1.2 19617ec681f3Smrg- docs: fix 20.1.2 relnotes 19627ec681f3Smrg- docs: add a page explaining the GitLab CI and the Intel CI 19637ec681f3Smrg- mesa/glformats: make _mesa_gles_error_check_format_and_type() more consistent 19647ec681f3Smrg- docs: add release notes for 20.1.3 19657ec681f3Smrg- docs: update calendar and link releases notes for 20.1.3 19667ec681f3Smrg- docs: fix a bunch of typos 19677ec681f3Smrg- egl: always compile surfaceless 19687ec681f3Smrg- vulkan: automatically compile the `display` platform when available 19697ec681f3Smrg- meson: move xlib-lease block further down 19707ec681f3Smrg- egl: automatically compile the `drm` platform when available 19717ec681f3Smrg- introduce `commit_in_branch.py` script to help devs figure this out 19727ec681f3Smrg- bin/gen_release_notes.py: drop new_features.txt when we release XX.Y.0 19737ec681f3Smrg- egl/wayland: add missing newline between functions 19747ec681f3Smrg- glx: drop always-true #ifdef 19757ec681f3Smrg- docs/submittingpatches: add more than one `Cc: mesa-stable` example to the examples list 19767ec681f3Smrg- meson/intel: add missing dep on git_sha1.h 19777ec681f3Smrg- meson: fix android vulkan build 19787ec681f3Smrg- egl: inline fallback for create_pixmap_surface 19797ec681f3Smrg- egl: inline fallback for create_pbuffer_surface 19807ec681f3Smrg- egl: drop unused fallback function 19817ec681f3Smrg- egl: inline fallback for swap_buffers_with_damage 19827ec681f3Smrg- egl: inline fallback for swap_buffers_region 19837ec681f3Smrg- egl: inline fallback for post_sub_buffer 19847ec681f3Smrg- egl: inline fallback for copy_buffers 19857ec681f3Smrg- egl: inline fallback for query_buffer_age 19867ec681f3Smrg- egl: inline fallback for create_wayland_buffer_from_image 19877ec681f3Smrg- egl: inline fallback for get_sync_values 19887ec681f3Smrg- egl: drop now empty egl_dri2_fallbacks.h 19897ec681f3Smrg- egl: mark the rest of the callbacks as mandatory or optional 19907ec681f3Smrg- egl: inline _EGLAPI into _EGLDriver 19917ec681f3Smrg- docs: add release notes for 20.1.4 19927ec681f3Smrg- docs: update calendar and link releases notes for 20.1.4 19937ec681f3Smrg- post_version.py: don't generate relnotes twice 19947ec681f3Smrg- post_version.py: drop incorrect conf.py changes 19957ec681f3Smrg- post_version.py: stop using non-existent functions and fix commit message 19967ec681f3Smrg- post_version.py: update the files in the current worktree, not the one with the script that we run 19977ec681f3Smrg- post_version.py: fix relnotes links 19987ec681f3Smrg- bin/gen_release_notes: automatically commit release notes 19997ec681f3Smrg- docs/releasing: improve wording 20007ec681f3Smrg- bin/khronos-update: having a folder in include/ is not a requirement 20017ec681f3Smrg- bin/khronos-update: add support for the SPIRV files 20027ec681f3Smrg- bin/khronos-update: add workaround for python bug 9625 20037ec681f3Smrg- egl: replace _eglInitDriver() with a simple variable 20047ec681f3Smrg- egl: drop unnecessary _eglGetDriver() 20057ec681f3Smrg- egl: fix _eglMatchDriver() return type 20067ec681f3Smrg- egl: inline _eglMatchAndInitialize() and refactor _eglMatchDriver() 20077ec681f3Smrg- egl: rename _eglMatchDriver() to _eglInitializeDisplay() 20087ec681f3Smrg- egl: drop left-over function prototype 20097ec681f3Smrg- egl: const _eglDriver 20107ec681f3Smrg- egl/haiku: drop overwritten preset of EGL version 20117ec681f3Smrg- egl: consistently use dri2_egl_display() helper macro 20127ec681f3Smrg- meson: fix `-D xlib-lease=auto` detection 20137ec681f3Smrg- docs: add release notes for 20.1.5 20147ec681f3Smrg- docs: update calendar and link releases notes for 20.1.5 20157ec681f3Smrg- pick-ui: specify git commands in "resolve cherry pick" message 20167ec681f3Smrg- egl/entrypoint-check: split sort-check into a function 20177ec681f3Smrg- egl/entrypoint-check: add check that GLVND and plain EGL have the same entrypoints 20187ec681f3Smrg- driconf: fix force_gl_vendor description 20197ec681f3Smrg- meson: bump required glvnd version 20207ec681f3Smrg- egl/x11_dri3: enable & require xfixes 2.0 20217ec681f3Smrg- egl/x11_dri3: implement EGL_KHR_swap_buffers_with_damage 20227ec681f3Smrg- meson: don't advertise TLS support if glx wasn't build with it 20237ec681f3Smrg- meson: drop leftover PTHREAD_SETAFFINITY_IN_NP_HEADER 20247ec681f3Smrg 20257ec681f3SmrgErico Nunes (16): 20267ec681f3Smrg 20277ec681f3Smrg- lima/ppir: introduce liveness internal live set 20287ec681f3Smrg- lima/ppir: fix lod bias register codegen 20297ec681f3Smrg- lima/ppir: do not assume single src for pipeline outputs 20307ec681f3Smrg- lima/ppir: combine varying loads in node_to_instr 20317ec681f3Smrg- lima/ppir: duplicate intrinsics in nir 20327ec681f3Smrg- lima/ppir: duplicate consts in nir 20337ec681f3Smrg- lima/ppir: remove unused clone functions 20347ec681f3Smrg- lima/ppir: rework emit nir to ppir 20357ec681f3Smrg- lima/ppir: rework store output 20367ec681f3Smrg- lima/ppir: add fallback mov option for const scheduler 20377ec681f3Smrg- lima/ppir: rework select conditions 20387ec681f3Smrg- lima/ppir: handle failures on all ppir_emit_cf_list paths 20397ec681f3Smrg- lima/ppir: improve handling for successors in other blocks 20407ec681f3Smrg- lima/ppir: rework tex lowering 20417ec681f3Smrg- lima/ppir: optimize tex loads with single successor 20427ec681f3Smrg- lima/ppir: use a ready list in node_to_instr 20437ec681f3Smrg 20447ec681f3SmrgErik Faye-Lund (124): 20457ec681f3Smrg 20467ec681f3Smrg- compiler/nir: move tan-calculation to helper 20477ec681f3Smrg- vtn/opencl: add native_tan-support 20487ec681f3Smrg- vtn/opencl: native variants of sin/cos 20497ec681f3Smrg- vtn/opencl: native divide support 20507ec681f3Smrg- vtn/opencl: native powr support 20517ec681f3Smrg- vtn/opencl: native recip support 20527ec681f3Smrg- vtn/opencl: native rsqrt support 20537ec681f3Smrg- vtn/opencl: native sqrt support 20547ec681f3Smrg- compiler/glsl: explicitly store NumUniformBlocks 20557ec681f3Smrg- mesa/st: consider NumUniformBlocks instead of num_ubos when binding 20567ec681f3Smrg- zink: use nir_lower_uniforms_to_ubo 20577ec681f3Smrg- zink: lower b2b to b2i 20587ec681f3Smrg- util/os_memory: never use os_memory_debug.h 20597ec681f3Smrg- st/wgl: pass st_context_iface into stw_st_framebuffer_present_locked 20607ec681f3Smrg- st/wgl: allocate and resolve msaa-textures 20617ec681f3Smrg- docs/features: add zink features 20627ec681f3Smrg- zink: load vk_GetMemoryFdKHR while creating screen 20637ec681f3Smrg- zink: add a GET_PROC_ADDR macro to simplify load_device_extensions 20647ec681f3Smrg- docs/features: mark GL_NV_conditional_render as done for zink 20657ec681f3Smrg- zink: disable vkCmdResolveImage when respecting render-condition 20667ec681f3Smrg- zink: do not expose real value for PIPE_CAP_MAX_VIEWPORTS 20677ec681f3Smrg- zink: correct PIPE_SHADER_CAP_MAX_SHADER_IMAGES 20687ec681f3Smrg- zink: mark depth-component cube-maps as done 20697ec681f3Smrg- zink: implement i2b1 20707ec681f3Smrg- docs: fix broken release-calendar 20717ec681f3Smrg- zink: hammer in an explicit wait when retrieving buffer contents for reading 20727ec681f3Smrg- zink: use samples from state 20737ec681f3Smrg- zink: do not dig into resource for nr_samples 20747ec681f3Smrg- zink: pass batch instead of context for queries 20757ec681f3Smrg- zink: implement nir_texop_txf_ms 20767ec681f3Smrg- zink: expose PIPE_CAP_TEXTURE_MULTISAMPLE 20777ec681f3Smrg- docs/features: mark GL_ARB_texture_multisample as done for zink 20787ec681f3Smrg- zink: use general-layout when blitting to/from same resource 20797ec681f3Smrg- zink: Use store_dest_raw instead of storing an uint 20807ec681f3Smrg- nir: reuse existing psiz-variable 20817ec681f3Smrg- zink: emulate B8G8R8X8_SRGB with B8G8R8A8_SRGB 20827ec681f3Smrg- zink: assert that image-view format isn't undefined 20837ec681f3Smrg- zink: only report device-local memory as video-memory 20847ec681f3Smrg- gallium/hud: do not specify potentially invalid depth-range 20857ec681f3Smrg- TEMP: add rst-conversion scripts 20867ec681f3Smrg- docs: convert articles to reructuredtext 20877ec681f3Smrg- TEMP: remove rst-conversion scripts 20887ec681f3Smrg- docs: delete no longer needed file 20897ec681f3Smrg- docs: fixup botched table 20907ec681f3Smrg- docs: escape double colons 20917ec681f3Smrg- docs: escape asterisks 20927ec681f3Smrg- docs: escape trailing underscores properly 20937ec681f3Smrg- docs: fixup broken rst 20947ec681f3Smrg- docs: fixup heading-levels 20957ec681f3Smrg- docs: use sphinx 20967ec681f3Smrg- docs: disable syntax-highlighting by default 20977ec681f3Smrg- docs: use code-block with caption instead of table 20987ec681f3Smrg- docs: format notes as rst-notes 20997ec681f3Smrg- docs: use code-blocks 21007ec681f3Smrg- docs: drop open-coded toc for articles 21017ec681f3Smrg- docs: add xlibdriver to table-of-contents 21027ec681f3Smrg- docs: do not copy source-files to site 21037ec681f3Smrg- docs: use rst footnotes instead of manual ones 21047ec681f3Smrg- docs: reformat license table as rst table 21057ec681f3Smrg- docs: use rst-note for highlighted text 21067ec681f3Smrg- docs: bundle extra files 21077ec681f3Smrg- docs: include specs into the generated docs 21087ec681f3Smrg- gitlab-ci: build and deploy docs 21097ec681f3Smrg- docs: drop news in favour of the introduction as index-page 21107ec681f3Smrg- README: update references to internal docs 21117ec681f3Smrg- docs: update internal references 21127ec681f3Smrg- docs/relnotes: update internal references 21137ec681f3Smrg- radv: update internal reference 21147ec681f3Smrg- bin/perf-annotate-jit.py: update internal reference 21157ec681f3Smrg- docs/release-calendar: restore missing id 21167ec681f3Smrg- nir: do not try to merge xfb-outputs 21177ec681f3Smrg- Revert "gallium/hud: don't use user vertex buffers" 21187ec681f3Smrg- gallium/hud: don't use user vertex buffers 21197ec681f3Smrg- zink: enable cull-distance if supported 21207ec681f3Smrg- zink: expose GLSL 1.30 21217ec681f3Smrg- docs: update internal references 21227ec681f3Smrg- docs/relnotes: update internal references 21237ec681f3Smrg- docs: fixup relnotes after rst-conversion 21247ec681f3Smrg- docs/features: mark GL3 as complete for zink 21257ec681f3Smrg- docs/features: update ARB_texture_buffer_object line 21267ec681f3Smrg- docs/features: remove driver-list for forward-compatible context 21277ec681f3Smrg- mesa/main: fix inverted condition 21287ec681f3Smrg- gallium/os: call "ANSI" version of GetCommandLine 21297ec681f3Smrg- graw/gdi: do not depend on UNICODE macro 21307ec681f3Smrg- gallium/util: limit STACK_LEN on Windows 21317ec681f3Smrg- gallium/util: add missing include 21327ec681f3Smrg- docs: update favicon 21337ec681f3Smrg- docs: remove non-existent reference 21347ec681f3Smrg- docs: restore accidentally dropped labels 21357ec681f3Smrg- docs: fix internal references 21367ec681f3Smrg- docs: use ref-links for internal references 21377ec681f3Smrg- gallium/docs: update to recent sphinx 21387ec681f3Smrg- gallium/docs: fixup formatting of numbered lists 21397ec681f3Smrg- gallium/docs: remove reference to non-existent label 21407ec681f3Smrg- gallium/docs: use none for highlight_language 21417ec681f3Smrg- gallium/docs: prefix exts dir with underscore 21427ec681f3Smrg- gallium/docs: remove non-existent static dir 21437ec681f3Smrg- gallium/docs: remove unused imgmath extension 21447ec681f3Smrg- ci: only build docs in the upstream-repo 21457ec681f3Smrg- ci: only build docs if any docs changed 21467ec681f3Smrg- ci: test docs for non-master builds 21477ec681f3Smrg- ci: move deploy-stage later in the pipeline 21487ec681f3Smrg- ci: move test-docs to container stage 21497ec681f3Smrg- ci: add graphviz to the .docs-base template 21507ec681f3Smrg- merge gallium docs into main docs 21517ec681f3Smrg- docs: clean up gallium index-file 21527ec681f3Smrg- docs: add an extension to generate redirects 21537ec681f3Smrg- docs: move gallium specific docs into gallium folder 21547ec681f3Smrg- docs: use svg for graphviz output 21557ec681f3Smrg- docs: fixup envvar output 21567ec681f3Smrg- zink: expose depth-clip if supported 21577ec681f3Smrg- mesa/main: factor out one-time-init into a helper 21587ec681f3Smrg- mesa/main: use call_once instead of open-coding 21597ec681f3Smrg- gallium/util: do not use _MTX_INITIALIZER_NP on Windows 21607ec681f3Smrg- mesa/main: use p_atomic_inc_return instead of locking 21617ec681f3Smrg- mesa: do not use bitfields for advanced-blend state 21627ec681f3Smrg- mesa: treat Color._AdvancedBlendMode as enum 21637ec681f3Smrg- zink: use ralloc in nir-to-spirv 21647ec681f3Smrg- zink: use ralloc for plain malloc-calls 21657ec681f3Smrg- zink: pass mem_ctx to ralloc_size-call 21667ec681f3Smrg- zink: use ralloc for spirv_builder as well 21677ec681f3Smrg- mesa/program: fix shadow property for samplers 21687ec681f3Smrg- docs: add some very basic documentation about zink 21697ec681f3Smrg- mesa: handle GL_FRONT after translating to it 21707ec681f3Smrg 21717ec681f3SmrgFrancisco Jerez (23): 21727ec681f3Smrg 21737ec681f3Smrg- intel/ir: Update performance analysis parameters for memory fence codegen changes. 21747ec681f3Smrg- iris: Simplify iris_batch_prepare_noop(). 21757ec681f3Smrg- iris: Extend iris_context dirty state flags to 128 bits. 21767ec681f3Smrg- iris: Add batch-local synchronization book-keeping to iris_bo. 21777ec681f3Smrg- iris: Add infrastructure to partition batch into sync boundaries. 21787ec681f3Smrg- iris: Bracket batch operations which access memory within sync regions. 21797ec681f3Smrg- iris: Annotate all BO uses with domain and sequence number information. 21807ec681f3Smrg- iris: Drop redundant iris_address::write flag. 21817ec681f3Smrg- iris: Report use of any in-flight buffers on first draw call after sync boundary. 21827ec681f3Smrg- iris: Introduce cache coherency matrix for batch-local memory ordering. 21837ec681f3Smrg- iris: Update cache coherency matrix on PIPE_CONTROL. 21847ec681f3Smrg- iris: Implement buffer-local memory barrier based on cache coherency matrix. 21857ec681f3Smrg- iris: Insert buffer barrier in existing cache flush helpers. 21867ec681f3Smrg- iris: Remove batch argument of iris_resource_prepare_access() and friends. 21877ec681f3Smrg- iris: Perform compute predraw flushes from compute batch. 21887ec681f3Smrg- iris: Remove depth cache set tracking and synchronization. 21897ec681f3Smrg- iris: Remove render cache hash table-based synchronization. 21907ec681f3Smrg- iris: Open-code iris_cache_flush_for_read() and iris_cache_flush_for_depth(). 21917ec681f3Smrg- iris: Emit single render target flush PIPE_CONTROL on format mismatch. 21927ec681f3Smrg- iris: Remove iris_flush_depth_and_render_caches(). 21937ec681f3Smrg- OPTIONAL: iris: Perform BLORP buffer barriers outside of iris_blorp_exec() hook. 21947ec681f3Smrg- iris/icl+: Report same caching domain as main surface for clear color BO. 21957ec681f3Smrg- intel/ir/gen12+: Work around FS performance regressions due to SIMD32 discard divergence. 21967ec681f3Smrg 21977ec681f3SmrgFrank Binns (2): 21987ec681f3Smrg 21997ec681f3Smrg- docs: change "Fixes:" tag example to match git fixes output 22007ec681f3Smrg- egl/dri2: only take a dri2_dpy reference when binding a new context/surfaces 22017ec681f3Smrg 22027ec681f3SmrgFrédéric Bonnard (2): 22037ec681f3Smrg 22047ec681f3Smrg- clover: Fix types collision between c++ and altivec 22057ec681f3Smrg- meson: Revert commit overriding C++ standard with gnu++11 on ppc64el 22067ec681f3Smrg 22077ec681f3SmrgGert Wollny (66): 22087ec681f3Smrg 22097ec681f3Smrg- r600: Annotate some case fallthroughs 22107ec681f3Smrg- r600: remove unused static functions 22117ec681f3Smrg- r600/sb: replace memset by using member initialization/assignment 22127ec681f3Smrg- r600: remove some unused variables to silence warnings 22137ec681f3Smrg- r600: Fix warning regarding mixing enums and unsigned in ?: expression 22147ec681f3Smrg- r600: Fix nir compiler options, i.e. don't lower IO to temps for TESS 22157ec681f3Smrg- r600/sfn: Unify semantic name and index query and use TEXCOORD semantic 22167ec681f3Smrg- r600/sfn: Fix printing vertex fetch instruction flags 22177ec681f3Smrg- r600: Lower int64 ops from TGSI-to-NIR shaders too 22187ec681f3Smrg- r600: Lower lerp after tgsi_to_nir 22197ec681f3Smrg- r600: Add support for loading index register from other than chan X 22207ec681f3Smrg- r600/sfn: Handle CF index loading from non-X channel 22217ec681f3Smrg- r600/sfn: rework getting a vector and uniforms from the value pool 22227ec681f3Smrg- r600/sfn: Skip move instructions if they are only ssa and without modifiers 22237ec681f3Smrg- r600/sfn: re-use an allocated register in lookup 22247ec681f3Smrg- r600/sfn: skip copying LOD if the target register is is the same 22257ec681f3Smrg- r600/sfn: Fix memring print output 22267ec681f3Smrg- r600/sfn: Fix RING instruction assembly emission 22277ec681f3Smrg- r600/sfn: Fix GDS assembly emission 22287ec681f3Smrg- r600/sfn: Fix RAT instruction assembly emission 22297ec681f3Smrg- r600/sfn: Make allocate_reserved_registers forward to a virtual function 22307ec681f3Smrg- r600/sfn: Fix handling of output register index 22317ec681f3Smrg- r600/sfn: Make 3vec loads skip possible moves 22327ec681f3Smrg- r600/sfn: Add support for viewport index output 22337ec681f3Smrg- r600/sfn: Take FOGC, and backcolors into account im GS outputs 22347ec681f3Smrg- r600/sfn: Handle loading sample_pos 22357ec681f3Smrg- r600/sfn: Add FS output sample_mask 22367ec681f3Smrg- r600/sfn: Don't reject VARYING_SLOT_PCNT 22377ec681f3Smrg- r600/sfn: remove pointless check 22387ec681f3Smrg- r600/sfn: assert when alu dest is missing 22397ec681f3Smrg- r600/sfn: support indirect sampler buffer reads. 22407ec681f3Smrg- r600/sfn: Add support for texture_samples 22417ec681f3Smrg- r600/sfn: use the per shader atomic base 22427ec681f3Smrg- r600/sfn: SSBO: Fix query of dest components 22437ec681f3Smrg- r600/sfn: Fix clip vertex output as possible stream variable 22447ec681f3Smrg- r600/sfn: Fix splitting constants that come from different kcache banks. 22457ec681f3Smrg- r600/sfn: Don't reorder outputs by location 22467ec681f3Smrg- r600/sfn: Fix printing ALU op without dest 22477ec681f3Smrg- r600: Fix duplicated subexpression in r600_asm.c 22487ec681f3Smrg- r600/sfn: Fix mapping for f32tof64 and f64tof32 22497ec681f3Smrg- r600/sfn: use modern c++ in printing LDS read instruction 22507ec681f3Smrg- r600/sfn: Correctly update the number of literals when forcing a new group 22517ec681f3Smrg- r600/sfn: remove debug output leftover 22527ec681f3Smrg- nir: lower_tex: Don't normalize coordinates for TXF with RECT 22537ec681f3Smrg- r600/sfn: lower image derefs 22547ec681f3Smrg- r600/sfn: Add imageio support 22557ec681f3Smrg- r600/sfn: Add support for image_size 22567ec681f3Smrg- r600/sfn: Add support for reading cube image array dim. 22577ec681f3Smrg- r600/sfn: Take SSBO buffer ID offset into account 22587ec681f3Smrg- r600/sfn: Handle memory_barrier 22597ec681f3Smrg- r600/sfn: Add lowering pass for shared IO 22607ec681f3Smrg- r600/sfn: Add support for shared atomics 22617ec681f3Smrg- r600/sfn: Don't set num_components on TESS sysvalue intrinsics 22627ec681f3Smrg- r600/sfn: lower rotate ALU ops 22637ec681f3Smrg- r600/sfn: Pipe through requesting a register at a given channel 22647ec681f3Smrg- r600/sfn: emit texture instructions in one block 22657ec681f3Smrg- r600/sfn: Add option to get a temp value for a specific channel 22667ec681f3Smrg- r600/sfn: correct handling of loading vec4 with fetching constants 22677ec681f3Smrg- r600/sfn: Add a forced output swizzle for depth write 22687ec681f3Smrg- r600/sfn: Fix Ring output swizzle masks 22697ec681f3Smrg- r600/sfn: Fix default z swizzle for GDS instructions 22707ec681f3Smrg- r600: Add shader key item to identify when the sample mask should be used 22717ec681f3Smrg- r600/sfn: Only use sample mask if the according shader key is set 22727ec681f3Smrg- r600/sfn: Make the pin_to_channel generic 22737ec681f3Smrg- d600/sfn: write stream outputs to correct mem ring 22747ec681f3Smrg- gallivm/nir: Lower uniforms to UBOs in llvm draw if the driver didn't request this already 22757ec681f3Smrg 22767ec681f3SmrgGreg V (1): 22777ec681f3Smrg 22787ec681f3Smrg- gallium,util: undef ALIGN on FreeBSD to prevent name clash 22797ec681f3Smrg 22807ec681f3SmrgGuido Günther (2): 22817ec681f3Smrg 22827ec681f3Smrg- etnaviv: drm: Use NSEC_PER_SEC 22837ec681f3Smrg- etnaviv: drm: Normalize nano seconds 22847ec681f3Smrg 22857ec681f3SmrgGurchetan Singh (1): 22867ec681f3Smrg 22877ec681f3Smrg- virgl: apply bgra dest swizzle and add Portal 2 22887ec681f3Smrg 22897ec681f3SmrgHanno Böck (1): 22907ec681f3Smrg 22917ec681f3Smrg- Properly check mmap return value 22927ec681f3Smrg 22937ec681f3SmrgHyunjun Ko (6): 22947ec681f3Smrg 22957ec681f3Smrg- freedreno,tu: Don't request fragcoord components not being read. 22967ec681f3Smrg- tu,radv: fix potentially wrong offset of flexible array. 22977ec681f3Smrg- vulkan: Adds helpers for vk_object (de)alloation and (de)initialization. 22987ec681f3Smrg- tu: Fix wrong copies of sampler descriptor. 22997ec681f3Smrg- turnip: Use the common base object type and struct. 23007ec681f3Smrg- turnip: implement VK_EXT_private_data 23017ec681f3Smrg 23027ec681f3SmrgIago Toral Quiroga (7): 23037ec681f3Smrg 23047ec681f3Smrg- v3d/compiler: don't rewrite unused temporaries to point to NOP register 23057ec681f3Smrg- v3d/compiler: fix spill offset 23067ec681f3Smrg- v3d/compiler: fix image size for 1D arrays 23077ec681f3Smrg- nir/lower_clip: make the pass compatible with Vulkan semantics 23087ec681f3Smrg- v3d/compiler: handle compact varyings 23097ec681f3Smrg- v3d/compiler: request fragment shader clip lowering to be vulkan compatible. 23107ec681f3Smrg- nir/lower_tex: skip lower_tex_packing for the texture samples query 23117ec681f3Smrg 23127ec681f3SmrgIan Romanick (24): 23137ec681f3Smrg 23147ec681f3Smrg- nir/algebraic: Recognize open-coded byte or word extract from bfe 23157ec681f3Smrg- nir/algebraic: Split ibfe and ubfe with two constant sources 23167ec681f3Smrg- nir/algebraic: Optimize some bfe patterns 23177ec681f3Smrg- nir/algebraic: Optimize ushr of pack_half, not ishr 23187ec681f3Smrg- nir/algebraic: Add some half packing optimizations for pack_half_2x16_split 23197ec681f3Smrg- nir/algebraic: Eliminate useless extract before unpack 23207ec681f3Smrg- i965: Assert that blorp always handles color blits 23217ec681f3Smrg- meta: Make _mesa_meta_texture_object_from_renderbuffer static 23227ec681f3Smrg- meta: Make _mesa_meta_setup_sampler static 23237ec681f3Smrg- meta: Remove support for clearing integer buffers 23247ec681f3Smrg- mesa: Add matrix utility functions to load matrices 23257ec681f3Smrg- mesa: Add function to calculate an orthographic projection 23267ec681f3Smrg- meta: Stop frobbing MatrixMode 23277ec681f3Smrg- meta: Use same vertex coordinates for GLSL and FF clears 23287ec681f3Smrg- meta: Coalesce the GLSL and FF paths in meta_clear 23297ec681f3Smrg- meta: Remove support for multisample blits 23307ec681f3Smrg- anv/tests: Don't rely on assert or changing NDEBUG in tests 23317ec681f3Smrg- anv/tests: Silence unused parameter warnings in main 23327ec681f3Smrg- anv: Silence unused parameter warning in anv_image_get_clear_color_addr 23337ec681f3Smrg- intel: Silence unused parameter warning in __intel_log_use_args 23347ec681f3Smrg- intel/drm-shim: Add noop ioctl handler for set_tiling 23357ec681f3Smrg- intel/drm-shim: Return correct values for I915_PARAM_HAS_ALIASING_PPGTT 23367ec681f3Smrg- glsl: Remove integer matrix support from ir_dereference_array::constant_expression_value 23377ec681f3Smrg- nir/algebraic: Don't distrubte absolute-value into dot-products 23387ec681f3Smrg 23397ec681f3SmrgIcecream95 (78): 23407ec681f3Smrg 23417ec681f3Smrg- pan/midgard: Fix old style shadows 23427ec681f3Smrg- panfrost: Fix background showing when using discard 23437ec681f3Smrg- panfrost: Enable PIPE_CAP_VERTEX_COLOR_UNCLAMPED 23447ec681f3Smrg- panfrost: Decode AFBC flag bits 23457ec681f3Smrg- panfrost: Only use AFBC YTR with RGB and RGBA 23467ec681f3Smrg- pan/midgard: Use a signed value for checking inline constants 23477ec681f3Smrg- Revert "panfrost: Keep cached BOs mmap'd" 23487ec681f3Smrg- panfrost: Mark PIPE_BUFFER BOs as not renderable 23497ec681f3Smrg- pan/mdg: Add a macro for printing instruction source information 23507ec681f3Smrg- pan/mdg: Move r1.w writeout to branch->dest 23517ec681f3Smrg- pan/mdg: Remove old zs store lowering 23527ec681f3Smrg- pan/mdg: Remove old depth writeout code 23537ec681f3Smrg- pan/mdg: Remove writeout case from bytemask_of_read_components 23547ec681f3Smrg- nir: Replace the zs_output_pan intrinsic with combined_output_pan 23557ec681f3Smrg- pan/mdg: Replace writeout booleans with a single value 23567ec681f3Smrg- pan/mdg: Add new depth writeout code 23577ec681f3Smrg- pan/mdg: Move search_var to earlier in midgard_compile.c 23587ec681f3Smrg- pan/mdg: Add depth/stencil support to emit_fragment_store 23597ec681f3Smrg- pan/mdg: Add new depth store lowering 23607ec681f3Smrg- pan/mdg: Print writeout sources in mir_print_instruction 23617ec681f3Smrg- panfrost: Add writes_stencil to the EARLY_Z disable list 23627ec681f3Smrg- panfrost: Move sampler view bo creation to a separate function 23637ec681f3Smrg- panfrost: Create a new sampler view bo when the layout changes 23647ec681f3Smrg- panfrost: Tiled to linear layout conversion 23657ec681f3Smrg- panfrost: Clean up panfrost_frag_meta_rasterizer_update 23667ec681f3Smrg- panfrost: Implement ARB_depth_clamp 23677ec681f3Smrg- pan/decode: Fix helper invocations when tracing 23687ec681f3Smrg- pan/decode: Add missing wrap modes 23697ec681f3Smrg- pan/mdg: Fix max_comp calculation for constant printing 23707ec681f3Smrg- panfrost: RGBA4 and RGB5_A1 framebuffer support 23717ec681f3Smrg- panfrost: Update sampler views when the texture bo changes 23727ec681f3Smrg- panfrost: Copy resources when mapping to avoid waiting for readers 23737ec681f3Smrg- panfrost: Only copy resources when they are in a pending batch 23747ec681f3Smrg- panfrost: Add PAN_MESA_DEBUG=gl3 flag 23757ec681f3Smrg- panfrost: Do fine-grained flushing for occlusion query results 23767ec681f3Smrg- pan/mdg: Vectorize vlut operations 23777ec681f3Smrg- pan/decode: Make mapped memory read-only while decoding 23787ec681f3Smrg- nir: Add a base value to load_raw_output_pan 23797ec681f3Smrg- panfrost: Fix MALI_READS_TILEBUFFER 23807ec681f3Smrg- pan/mdg: Handle tilebuffer wait loops 23817ec681f3Smrg- pan/mdg: Use the writeout tag for tilebuffer wait loops 23827ec681f3Smrg- panfrost: Add rt formats to shader state 23837ec681f3Smrg- panfrost: Add a bitset of render targets read by shaders 23847ec681f3Smrg- pan/mdg: Do the pan_lower_framebuffer pass later 23857ec681f3Smrg- pan/mdg: Emit a tilebuffer wait loop when needed 23867ec681f3Smrg- pan/mdg: Handle non-blend framebuffer lowering 23877ec681f3Smrg- pan/mdg: Support MRT in output load lowering 23887ec681f3Smrg- pan/mdg: Set the z/s store intrinsic base correctly 23897ec681f3Smrg- pan/mdg: Use a 32-bit ld_color_buffer op when needed 23907ec681f3Smrg- panfrost: Implement texture_barrier 23917ec681f3Smrg- panfrost: Stop keying on rt format when using native loads 23927ec681f3Smrg- panfrost: Use f2fmp for framebuffer lowering conversions 23937ec681f3Smrg- panfrost: Enable framebuffer fetch 23947ec681f3Smrg- pan/mdg: Fix non-debug compiliation 23957ec681f3Smrg- compiler: Add dual-source factors to blend_factor 23967ec681f3Smrg- gallium: Dual source support in blend_factor_to_shader 23977ec681f3Smrg- pan/mdg: Add a nir pass to reorder store_output intrinsics 23987ec681f3Smrg- pan/mdg: Dual source blend input/writeout support 23997ec681f3Smrg- pan/mdg: Skip z/s combining for dual-source writes 24007ec681f3Smrg- panfrost: Dual source blend support 24017ec681f3Smrg- pan/decode: Open the dump file later 24027ec681f3Smrg- pan/mdg: Don't disassemble blit shaders 24037ec681f3Smrg- panfrost: Rename lower_store to is_blend in pan_lower_framebuffer 24047ec681f3Smrg- pan/mdg: Do per-sample framebuffer loads 24057ec681f3Smrg- panfrost: Do per-sample shading when outputs are read 24067ec681f3Smrg- nir: Add a face_sysval argument to nir_lower_two_sided_color 24077ec681f3Smrg- nir: Fix lower_two_sided_color when the face is an input 24087ec681f3Smrg- panfrost: Report TEXTURE_BUFFER_OBJECTS cap when gl3 flag set 24097ec681f3Smrg- panfrost: Set depth_enabled when stencil is enabled 24107ec681f3Smrg- nir: Set the alignment for SSBO lowering 24117ec681f3Smrg- panfrost: Make panfrost_bo_wait take a wait_readers bool 24127ec681f3Smrg- panfrost: Fix calls to panfrost_flush_batches_accessing_bo 24137ec681f3Smrg- panfrost: Fake RGTC support 24147ec681f3Smrg- panfrost: Use more tilebuffer sizes 24157ec681f3Smrg- panfrost: 8x MRT support 24167ec681f3Smrg- pan/mdg: Use the blend RT for blend shader framebuffer fetches 24177ec681f3Smrg- panfrost: Allow PIPE_TEXTURE_1D_ARRAY textures 24187ec681f3Smrg- pan/mdg: Fix spilling of non-32-bit types 24197ec681f3Smrg 24207ec681f3SmrgIcenowy Zheng (1): 24217ec681f3Smrg 24227ec681f3Smrg- panfrost: signal syncobj if nothing is going to be flushed 24237ec681f3Smrg 24247ec681f3SmrgIlia Mirkin (14): 24257ec681f3Smrg 24267ec681f3Smrg- freedreno/a3xx: there's no r8i/ui rb format, only rg8i/rg8ui 24277ec681f3Smrg- freedreno/a3xx: reinstate rgb10_a2ui texture format 24287ec681f3Smrg- freedreno/ir3: avoid applying (sat) on bary.f 24297ec681f3Smrg- freedreno/a3xx: fix const footprint 24307ec681f3Smrg- freedreno: fix off-by-one in assertions checking for const sizes 24317ec681f3Smrg- freedreno/a3xx: parameterize ubo optimization 24327ec681f3Smrg- freedreno/a3xx: fix rasterizer discard 24337ec681f3Smrg- nouveau: allow invalidating coherent/persistent buffer backings 24347ec681f3Smrg- st/mesa: allow R8 to not be exposed as renderable by driver 24357ec681f3Smrg- a4xx: add noperspective interpolation support 24367ec681f3Smrg- a4xx: add polygon offset clamp, fix units 24377ec681f3Smrg- ir3: mark ucp_enables as allowed values on all keys 24387ec681f3Smrg- a4xx: hook up centroid ij coords 24397ec681f3Smrg- ir3: use empirical size for params as used by the shader 24407ec681f3Smrg 24417ec681f3SmrgIndrajit Kumar Das (2): 24427ec681f3Smrg 24437ec681f3Smrg- st/mesa: use fragment shader to copy stencil buffer 24447ec681f3Smrg- st/mesa: optimize DEPTH_STENCIL copies using fragment shader 24457ec681f3Smrg 24467ec681f3SmrgItalo Nicola (17): 24477ec681f3Smrg 24487ec681f3Smrg- panfrost: Fix outmods on int to float conversions 24497ec681f3Smrg- pan/mdg: fix src_type in instructions that need a implicit zero 24507ec681f3Smrg- pan/mdg: prepare effective_writemask() 24517ec681f3Smrg- pan/mdg: eliminate references to ins->alu.op 24527ec681f3Smrg- pan/mdg: eliminate references to ins->alu.reg_mode 24537ec681f3Smrg- pan/mdg: fix comment 24547ec681f3Smrg- pan/mdg: eliminate references to ins->alu.outmod 24557ec681f3Smrg- pan/mdg: apply float outmods to textures 24567ec681f3Smrg- pan/mdg: eliminate references to ins->texture.op 24577ec681f3Smrg- pan/mdg: eliminate references to ins->load_store.op 24587ec681f3Smrg- pan/mdg: defer register packing 24597ec681f3Smrg- pan/mdg: externalize mir_pack_mod 24607ec681f3Smrg- pan/mdg: remove ins->alu 24617ec681f3Smrg- pan/mdg: refactor emit_alu_bundle 24627ec681f3Smrg- pan/mdg: defer branch packing 24637ec681f3Smrg- pan/mdg: remove ins->br_compact and ins->branch_extended 24647ec681f3Smrg- pan/mdg: emit REGISTER_UNUSED on unused ALU src2 24657ec681f3Smrg 24667ec681f3SmrgIván Briano (9): 24677ec681f3Smrg 24687ec681f3Smrg- anv: use the correct format on Android 24697ec681f3Smrg- anv: Disable B5G6R5_UNORM_PACK16 24707ec681f3Smrg- anv: Add a way to reserve states from a pool 24717ec681f3Smrg- anv: Implement VK_EXT_custom_border_color 24727ec681f3Smrg- anv: support externally synchronized pipeline caches 24737ec681f3Smrg- anv: implement VK_PIPELINE_CREATE_FAIL_ON_PIPELINE_COMPILE_REQUIRED_BIT_EXT 24747ec681f3Smrg- anv: enable VK_EXT_pipeline_creation_cache_control 24757ec681f3Smrg- anv: Add VK_EXT_custom_border_color to relnotes 24767ec681f3Smrg- anv: fix allocation of custom border color pool 24777ec681f3Smrg 24787ec681f3SmrgJames Park (1): 24797ec681f3Smrg 24807ec681f3Smrg- amd/llvm: Reorder LLVM headers 24817ec681f3Smrg 24827ec681f3SmrgJames Zhu (1): 24837ec681f3Smrg 24847ec681f3Smrg- ac/gpu_info: Correct Acturus cu bitmap 24857ec681f3Smrg 24867ec681f3SmrgJan Beich (5): 24877ec681f3Smrg 24887ec681f3Smrg- drm-uapi: Add sync_file.h 24897ec681f3Smrg- anv,iris: unbreak on BSDs after 812cf5f522ab,abf8aed68047 24907ec681f3Smrg- util: enable futex usage on BSDs after 7dc2f4788288 24917ec681f3Smrg- meson: unbreak sysctl.h detection on BSDs 24927ec681f3Smrg- anv: disable i915_perf warning on non-Linux 24937ec681f3Smrg 24947ec681f3SmrgJan Palus (1): 24957ec681f3Smrg 24967ec681f3Smrg- targets/opencl: fix build against LLVM>=10 with Polly support 24977ec681f3Smrg 24987ec681f3SmrgJan Zielinski (1): 24997ec681f3Smrg 25007ec681f3Smrg- gallium/swr: Fix crashes in sampling code 25017ec681f3Smrg 25027ec681f3SmrgJason Ekstrand (167): 25037ec681f3Smrg 25047ec681f3Smrg- intel/eu: Use non-coherent mode (BTI=253) for stateless A64 messages 25057ec681f3Smrg- Revert "anv/gen12: Temporarily disable VK_KHR_buffer_device_address (and EXT)" 25067ec681f3Smrg- vulkan: Allow destroying NULL debug report callbacks 25077ec681f3Smrg- vulkan,anv: Add a common base object type for VkDevice 25087ec681f3Smrg- anv: Stop clflushing events 25097ec681f3Smrg- anv: Allocate CPU-side memory for events 25107ec681f3Smrg- vulkan,anv: Add a base object struct type 25117ec681f3Smrg- vulkan,anv: Move the DEFINE_HANDLE_CASTS macros to vk_object.h 25127ec681f3Smrg- anv: Refactor setting descriptors with immutable sampler 25137ec681f3Smrg- vulkan: Add run-time object type asserts in handle casts 25147ec681f3Smrg- vulkan/wsi: Make wsi_swapchain inherit from vk_object_base 25157ec681f3Smrg- anv/allocator: Add a start_offset to anv_state_pool 25167ec681f3Smrg- vulkan/object: Always include the type 25177ec681f3Smrg- anv,vulkan: Implement VK_EXT_private_data 25187ec681f3Smrg- vulkan: Handle vkGet/SetPrivateDataEXT on Android swapchains 25197ec681f3Smrg- nir: Make "divergent" a property of an SSA value 25207ec681f3Smrg- util/list: Add a list pair iterator 25217ec681f3Smrg- util/vma: Add an option to configure high/low preference 25227ec681f3Smrg- util/vma: Add a debug print helper 25237ec681f3Smrg- util/ra: Add [de]serialization support 25247ec681f3Smrg- anv: Set 3DSTATE_VF_INSTANCING on the SVGS element 25257ec681f3Smrg- anv: Set MOCS in 3DSTATE_CONSTANT_* on Gen9+ 25267ec681f3Smrg- nir: Add some docs to the metadata types 25277ec681f3Smrg- anv: Call vk_object_base_finish for image views 25287ec681f3Smrg- anv: Fix descriptor set clean-up on BO allocation failure 25297ec681f3Smrg- nir: Use 8-bit types for most info fields 25307ec681f3Smrg- anv:gpu_memcpy: Emit 3DSTATE_VF_INDEXING on Gen8+ 25317ec681f3Smrg- nir: Validate jump instructions as an instruction type 25327ec681f3Smrg- nir: Use a switch statement in nir_handle_add_jump 25337ec681f3Smrg- nir: Add documentation for each jump instruction type 25347ec681f3Smrg- nir/clone: Re-use clone_alu for nir_alu_instr_clone 25357ec681f3Smrg- nir: Add a new helper for iterating phi sources leaving a block 25367ec681f3Smrg- nir: Add a store_reg helper and use the builder in phis_to_regs 25377ec681f3Smrg- nir: Add const to nir_intrinsic_src_components 25387ec681f3Smrg- nir/lower_double_ops: Rework the if (progress) tree 25397ec681f3Smrg- nir/opt_deref: Report progress if we remove a deref 25407ec681f3Smrg- nir/copy_prop_vars: Record progress in more places 25417ec681f3Smrg- nir: Fix sources for image atomic fadd 25427ec681f3Smrg- intel/vec4: Stomp the return type of RESINFO to UINT32 25437ec681f3Smrg- intel/fs: Fix unused texture coordinate zeroing on Gen4-5 25447ec681f3Smrg- intel/fs: Emit HALT for discard on Gen4-5 25457ec681f3Smrg- anv/allocator: Compare to start_offset in state_pool_free_no_vg 25467ec681f3Smrg- nir: Add a nir_metadata_all enum value 25477ec681f3Smrg- nir: Add a nir_shader_preserve_all_metadata helper 25487ec681f3Smrg- nir: Call nir_metadata_preserve on !progress 25497ec681f3Smrg- nir: Properly preserve metadata in more cases 25507ec681f3Smrg- intel/nir: Call nir_metadata_preserve on !progress 25517ec681f3Smrg- iris: Better handle metadata in NIR passes 25527ec681f3Smrg- anv: Add an anv_batch_set_storage helper 25537ec681f3Smrg- anv: Add anv_pipeline_init/finish helpers 25547ec681f3Smrg- nir/intrinsics: Put the _intel intrinsics together at the end 25557ec681f3Smrg- anv: Use resolve_device_entrypoint for dispatch init 25567ec681f3Smrg- vulkan: Update Vulkan XML and headers to 1.2.145 25577ec681f3Smrg- anv: Bump the advertised patch version to 145 25587ec681f3Smrg- intel/fs: Expose a couple of NIR lowering helpers 25597ec681f3Smrg- intel/fs: Break wm_prog_data setup into a helper 25607ec681f3Smrg- intel/fs: Move more prog_data setup into populate_wm_prog_data 25617ec681f3Smrg- intel/compiler: Expose brw_texture_offset to C 25627ec681f3Smrg- intel/eu: Add a brw_urb_dest_msg_type helper 25637ec681f3Smrg- intel/eu: Set the right subnr for ALIGN16 destinations 25647ec681f3Smrg- intel/eu: Add the RNDU opcode 25657ec681f3Smrg- vulkan/wsi: Don't consider VK_SUBOPTIMAL_KHR to be an error condition 25667ec681f3Smrg- wsi/x11: Log swapchain status changes 25677ec681f3Smrg- freedreno: Only call nir_lower_io on shader_in/out 25687ec681f3Smrg- lima: Only call nir_lower_io on shader_in/out 25697ec681f3Smrg- nouveau: Only call nir_lower_io on shader_in/out 25707ec681f3Smrg- vc4: Only call nir_lower_io on shader_in/out 25717ec681f3Smrg- v3d: Only call nir_lower_io on shader_in/out 25727ec681f3Smrg- panfrost: Only call nir_lower_io on shader_in/out 25737ec681f3Smrg- nir: Assert that nir_lower_io is only called with allowed modes 25747ec681f3Smrg- nir: Remove shared support from lower_io 25757ec681f3Smrg- nir: Add docs to nir_lower[_explicit]_io 25767ec681f3Smrg- anv: Handle clamping of inverted depth ranges 25777ec681f3Smrg- nir/validate: Don't abort() until after the shader has printed 25787ec681f3Smrg- spirv: Skip phis in unreachable blocks in the second phi pass 25797ec681f3Smrg- spirv: Allow block-decorated struct types for constants 25807ec681f3Smrg- vulkan: Update Vulkan XML and headers to 1.2.148 25817ec681f3Smrg- anv: Advertise VK_EXT_image_robustness 25827ec681f3Smrg- spirv: Update headers and grammar json 25837ec681f3Smrg- spirv: Add support for SPV_EXT_shader_atomic_float 25847ec681f3Smrg- intel/fs: Use the correct logical op for global float atomics 25857ec681f3Smrg- anv: Advertise support for VK_EXT_shader_atomic_float 25867ec681f3Smrg- nir: Allow for system values with variable numbers of destination components 25877ec681f3Smrg- nir/lower_io: Choose to set access based on intrinsic metadata 25887ec681f3Smrg- nir/lower_io: Use b2b for shader and function temporaries 25897ec681f3Smrg- nir/lower_io: Add support for global scratch addressing 25907ec681f3Smrg- spirv: Simplify our handling of NonUniform 25917ec681f3Smrg- spirv: Drop the void \*ptr from vtn_value 25927ec681f3Smrg- spirv: Fix indentation in vtn_handle_ptr 25937ec681f3Smrg- spirv: Clean up OpSignBitSet 25947ec681f3Smrg- spirv: Use nir_bany/ball for OpAny/All 25957ec681f3Smrg- spirv: Add a helpers for getting types of values 25967ec681f3Smrg- spirv: Rename push_value_pointer to push_pointer 25977ec681f3Smrg- spirv: Add a vtn_push_nir_ssa helper 25987ec681f3Smrg- spirv/amd: Use vtn_push_nir_ssa 25997ec681f3Smrg- spirv: Add a vtn_get_nir_ssa helper 26007ec681f3Smrg- spirv: Use the new helpers in OpConvertUToPtr/PtrToU 26017ec681f3Smrg- spirv: Refactor vtn_push_ssa 26027ec681f3Smrg- spirv/alu: Use vtn_push_ssa_value 26037ec681f3Smrg- spirv/glsl450: Use vtn_push_ssa_value 26047ec681f3Smrg- spirv/subgroups: Stop incrementing w 26057ec681f3Smrg- spirv/subgroups: Refactor to use vtn_push_ssa 26067ec681f3Smrg- spirv: Simplify vtn_ssa_value creation 26077ec681f3Smrg- spirv: Hand-roll fewer vtn_ssa_value creations 26087ec681f3Smrg- spirv: Add better checks for SSA value types 26097ec681f3Smrg- spirv: Drop the sampled boolean from vtn_type 26107ec681f3Smrg- spirv: Give atomic counters their own variable mode 26117ec681f3Smrg- spirv: Add a helper for getting the NIR type of a vtn_type 26127ec681f3Smrg- spirv: Remove a dead case in function parameter handling 26137ec681f3Smrg- spirv: More heavily use vtn_ssa_value in function parameter handling 26147ec681f3Smrg- anv,turnip,radv,clover,glspirv: Run nir_copy_prop before nir_opt_deref 26157ec681f3Smrg- spirv: Rework our handling of images and samplers 26167ec681f3Smrg- spirv: Also copy over binding information for atomic counters 26177ec681f3Smrg- nir: Take a mode in remove_unused_io_vars 26187ec681f3Smrg- nir/dead_variables: Respect the modes passed to remove_dead_vars 26197ec681f3Smrg- nir: Add nir_foreach_shader_in/out_variable helpers 26207ec681f3Smrg- nir: Add a nir_foreach_function_temp_variable helper 26217ec681f3Smrg- nir: Add a nir_foreach_uniform_variable helper 26227ec681f3Smrg- nir: Add a nir_foreach_gl_uniform_variable helper for GL linking 26237ec681f3Smrg- nir: Add and use a nir_variable_list_for_mode helper 26247ec681f3Smrg- nir: Take a nir_shader and variable mode in assign_var_locations 26257ec681f3Smrg- nir: Take a shader and variable mode in nir_assign_io_var_locations 26267ec681f3Smrg- nir/linking: Rework some internal helpers 26277ec681f3Smrg- st/nir: Rework fixup_varying_slots 26287ec681f3Smrg- nir/split_vars: Add mode checks to list walks 26297ec681f3Smrg- nir: Split nir_index_vars into two functions 26307ec681f3Smrg- nir/lower_amul: Add a variable mode check 26317ec681f3Smrg- nir: Use a nir_shader and mode in lower_clip_cull_distance_arrays 26327ec681f3Smrg- nir/lower_io_to_temporaries: Use a separate list for new inputs 26337ec681f3Smrg- nir/io_to_vector: Use nir_foreach_variable_with_modes 26347ec681f3Smrg- nir/lower_two_sided_color: Use nir_variable_create 26357ec681f3Smrg- nir/lower_uniforms_to_ubo: Use nir_foreach_variable_with_modes 26367ec681f3Smrg- nir/split_per_member_structs: Use nir_variable_with_modes_safe 26377ec681f3Smrg- nir/lower_variable_initializers: Restrict the modes we lower 26387ec681f3Smrg- nir/gl_nir_linker: Use nir_foreach_variable_with_modes 26397ec681f3Smrg- freedreno/ir3_lower_tess: Rework var list helpers 26407ec681f3Smrg- lima/standalone: Rework i/o variable fixup 26417ec681f3Smrg- freedreno/ir3_cmdline: Rework i/o variable fixup 26427ec681f3Smrg- r600/sfn/lower_tess_io: Rework get_tcs_varying_offset 26437ec681f3Smrg- r600/sfn/lower_tex: Get rid of the lower_sampler vector 26447ec681f3Smrg- r600/sfn: Use nir_foreach_variable_with_modes in IO vectorization 26457ec681f3Smrg- panfrost/midgard: Make search_var take a nir_shader and mode 26467ec681f3Smrg- panfrost: Use nir_foreach_variable_with_modes in pan_compile 26477ec681f3Smrg- aco: Use nir_foreach_variable_with_modes to walk SSBOs 26487ec681f3Smrg- mesa/ptn: Use nir_variable_create 26497ec681f3Smrg- gallium/ttn: Use variable create/add helpers 26507ec681f3Smrg- nir: Use a single list for all shader variables 26517ec681f3Smrg- nir/split_per_member_structs: Inline split_variables_in_list 26527ec681f3Smrg- nir/gl_nir_linker: Call add_vars_with_modes once for GL_PROGRAM_INPUT 26537ec681f3Smrg- nir: Add a find_variable_with\_[driver\_]location helper 26547ec681f3Smrg- vulkan: Update Vulkan XML and headers to 1.2.149 26557ec681f3Smrg- anv: Implement VK_EXT_4444_formats 26567ec681f3Smrg- nir/deref: Don't try to compare derefs containing casts 26577ec681f3Smrg- compiler/types: Add a struct_type_is_packed wrapper 26587ec681f3Smrg- spirv: Do more complex unwrapping in get_nir_type 26597ec681f3Smrg- anv: Advertise shaderIntegerFunctions2 26607ec681f3Smrg- spirv: Don't emit RMW for vector indexing in shared or global 26617ec681f3Smrg- clover/spirv: Don't call llvm::regularizeLlvmForSpirv 26627ec681f3Smrg- intel/nir: Pass the nir_builder by reference in lower_alpha_to_coverage 26637ec681f3Smrg- intel/nir: Rewrite the guts of lower_alpha_to_coverage 26647ec681f3Smrg- intel/fs: Fix MOV_INDIRECT and BROADCAST of Q types on Gen11+ 26657ec681f3Smrg- intel/fs: Don't copy-propagate stride=0 sources into ddx/ddy 26667ec681f3Smrg- iris: Re-emit push constants if we have a varying workgroup size 26677ec681f3Smrg- spirv: Run repair_ssa if there are discard instructions 26687ec681f3Smrg- nir: More NIR_MAX_VEC_COMPONENTS fixes 26697ec681f3Smrg- intel/fs/swsb: SCHEDULING_FENCE only emits SYNC_NOP 26707ec681f3Smrg- radeonsi: Only call nir_lower_var_copies at the end of the opt loop 26717ec681f3Smrg 26727ec681f3SmrgJesse Natalie (10): 26737ec681f3Smrg 26747ec681f3Smrg- nir_lower_io: Add addr_format_is_offset helper 26757ec681f3Smrg- nir: When nir_lower_vars_to_explicit_types is run on temps, update scratch_size 26767ec681f3Smrg- nir: Support load/store of temps as scratch in nir_lower_explicit_io 26777ec681f3Smrg- nir: Support vec8/vec16 in nir_lower_bit_size 26787ec681f3Smrg- nir: Support algebraic opts on vectors larger than 4 26797ec681f3Smrg- nir: Support 8 and 16 component vectors for reduceable intrinsics 26807ec681f3Smrg- nir/vtn: Add support for 8 and 16 vector ball/bany 26817ec681f3Smrg- u_debug_stack_test: Fix MSVC compiling by using ATTRIBUTE_NOINLINE 26827ec681f3Smrg- nir: More NIR_MAX_VEC_COMPONENTS fixes 26837ec681f3Smrg- glsl_type: Add packed to structure type comparison for hash map 26847ec681f3Smrg 26857ec681f3SmrgJibbityJobbity (1): 26867ec681f3Smrg 26877ec681f3Smrg- drirc: Enable glthread for PCSX2 26887ec681f3Smrg 26897ec681f3SmrgJon Turney (1): 26907ec681f3Smrg 26917ec681f3Smrg- glthread: Fix use of alloca() without #include "c99_alloca.h" 26927ec681f3Smrg 26937ec681f3SmrgJonathan Gray (13): 26947ec681f3Smrg 26957ec681f3Smrg- util: unbreak endian detection on OpenBSD 26967ec681f3Smrg- util/anon_file: add OpenBSD shm_mkstemp() path 26977ec681f3Smrg- meson: build with _ISOC11_SOURCE on OpenBSD 26987ec681f3Smrg- meson: don't build with USE_ELF_TLS on OpenBSD 26997ec681f3Smrg- meson: conditionally include -ldl in gbm pkg-config file 27007ec681f3Smrg- util: futex fixes for OpenBSD 27017ec681f3Smrg- util/u_thread: include pthread_np.h if found 27027ec681f3Smrg- anv: use os_get_total_physical_memory() 27037ec681f3Smrg- util/os_misc: add os_get_available_system_memory() 27047ec681f3Smrg- anv: use os_get_available_system_memory() 27057ec681f3Smrg- util/os_misc: os_get_available_system_memory() for OpenBSD 27067ec681f3Smrg- radv: remove seccomp includes 27077ec681f3Smrg- vulkan: make VK_TIME_DOMAIN_CLOCK_MONOTONIC_RAW_EXT conditional 27087ec681f3Smrg 27097ec681f3SmrgJonathan Marek (135): 27107ec681f3Smrg 27117ec681f3Smrg- turnip: update "fetchsize" value to match fdl6_layout changes 27127ec681f3Smrg- turnip: enable tiling for compressed formats 27137ec681f3Smrg- util/format: translate 422_UNORM and 420_UNORM vulkan formats 27147ec681f3Smrg- freedreno/registers: document 422_UNORM and 420_UNORM formats 27157ec681f3Smrg- turnip: implement VK_KHR_sampler_ycbcr_conversion 27167ec681f3Smrg- turnip: enable 422_UNORM formats 27177ec681f3Smrg- freedreno: move a4xx specific layout code to a4xx code 27187ec681f3Smrg- freedreno/a5xx: remove unused reference to gmem_alignw in layout code 27197ec681f3Smrg- freedreno/a6xx: don't use gmem_alignw for imported buffers 27207ec681f3Smrg- freedreno/a6xx: split up gmem/tile alignment requirements 27217ec681f3Smrg- freedreno: reduce extra height alignment in a6xx layout 27227ec681f3Smrg- freedreno/a6xx: use RESOLVE_TS event 27237ec681f3Smrg- freedreno: add adreno 650 27247ec681f3Smrg- freedreno/layout: add explicit offset/pitch argument to fdl6_layout 27257ec681f3Smrg- turnip: support VkImageDrmFormatModifierExplicitCreateInfoEXT 27267ec681f3Smrg- turnip: fix RENDER_COMPONENTS value 27277ec681f3Smrg- turnip: move HLSQ_UPDATE_CNTL write to before xs config writes 27287ec681f3Smrg- turnip: update some properties based on blob driver 27297ec681f3Smrg- turnip: clamp sampler minLod/maxLod 27307ec681f3Smrg- freedreno/a6xx: use nonbinning VS when GS is used 27317ec681f3Smrg- turnip: correctly emit non-binning vs in transform feedback case 27327ec681f3Smrg- turnip: fix HW binning with geometry shader 27337ec681f3Smrg- turnip: use common emit_xs_cntl to fill a6xx_sp_xs_ctrl_reg0 27347ec681f3Smrg- turnip: fix VFD_CONTROL for binning pass 27357ec681f3Smrg- turnip: pipeline program state refactor 27367ec681f3Smrg- turnip: share code between 3D blit/clear path and tu_pipeline 27377ec681f3Smrg- turnip: add layered 3D path clear for CmdClearAttachments 27387ec681f3Smrg- turnip: add emit renderpass cache flushes for sysmem 3D CmdClearAttachments 27397ec681f3Smrg- turnip: remove some dead/redundant code 27407ec681f3Smrg- freedreno/ir3: fix ir3_nir_move_varying_inputs 27417ec681f3Smrg- turnip: remove duplicated stage2opcode and stage2shaderdb 27427ec681f3Smrg- turnip: simplify stage2 helpers 27437ec681f3Smrg- turnip: set VFD_INDEX_OFFSET in 3D clear/blit path 27447ec681f3Smrg- turnip: fix 3D path always being used for CmdBlitImage 27457ec681f3Smrg- turnip: fix cubic filtering with CmdBlitImage 27467ec681f3Smrg- turnip: compute and graphics have completely separate state 27477ec681f3Smrg- turnip: move descriptor set BO tracking to CmdBindDescriptorSets 27487ec681f3Smrg- turnip: improve dirty bit handling a bit 27497ec681f3Smrg- turnip: delete dead dynamic state code 27507ec681f3Smrg- turnip: refactor draw states and dynamic states 27517ec681f3Smrg- turnip: input attachment descriptor set rework 27527ec681f3Smrg- turnip: use draw states for input attachments 27537ec681f3Smrg- turnip: use u_format for packing gmem clear values 27547ec681f3Smrg- freedreno/a6xx: FETCHSIZE is PITCHALIGN 27557ec681f3Smrg- freedreno/fdl6: rework layout code a bit (reduce linear align to 64 bytes) 27567ec681f3Smrg- turnip: fix a crash when rasterizerDiscardEnable is set 27577ec681f3Smrg- turnip: fix a sample shading case 27587ec681f3Smrg- turnip: fix renderpass gmem configs when there are too many attachments 27597ec681f3Smrg- turnip: set the API version 27607ec681f3Smrg- turnip: move enum translation functions to a common header 27617ec681f3Smrg- freedreno/a6xx: VSC "STRM_ARRAY_PITCH" is "STRM_LIMIT" 27627ec681f3Smrg- freedreno/a6xx: remove unnecessary OVERFLOW_FLAG_REG check 27637ec681f3Smrg- turnip: remove unnecessary OVERFLOW_FLAG_REG check 27647ec681f3Smrg- freedreno/a4xx: restore pitch to bytes change to layout code 27657ec681f3Smrg- freedreno/a4xx: simplify setup_slices 27667ec681f3Smrg- turnip: rework streamout state and add missing counter buffer read/writes 27677ec681f3Smrg- turnip: refactor CmdDraw* functions (and a few fixes) 27687ec681f3Smrg- turnip: enable VK_EXT_index_type_uint8 27697ec681f3Smrg- turnip: implement CmdDrawIndirectByteCountEXT 27707ec681f3Smrg- turnip: fix ts_cs_memory typo 27717ec681f3Smrg- turnip: use pipeline cs for shader programs instead of separate bo 27727ec681f3Smrg- freedreno/registers: a6xx depth bounds test registers 27737ec681f3Smrg- turnip: implement depthBounds 27747ec681f3Smrg- turnip: translate CreateRenderPass to CreateRenderPass2 27757ec681f3Smrg- turnip: replace a memset(0) with zalloc in CreateRenderPass 27767ec681f3Smrg- turnip: use RenderPassCreateInfo for render_pass_add_implicit_deps 27777ec681f3Smrg- turnip: move some logic out of create_render_pass_common 27787ec681f3Smrg- turnip: implement VK_EXT_vertex_attribute_divisor 27797ec681f3Smrg- turnip: fix empty scissor case 27807ec681f3Smrg- turnip: fix update_stencil_mask 27817ec681f3Smrg- turnip: disable early_z for VK_FORMAT_S8_UINT 27827ec681f3Smrg- freedreno/registers: add CP_DRAW_INDIRECT_MULTI 27837ec681f3Smrg- freedreno/ir3: add support for load_draw_id 27847ec681f3Smrg- turnip: implement VK_KHR_shader_draw_parameters 27857ec681f3Smrg- turnip: fix VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_VULKAN_1_1_FEATURES 27867ec681f3Smrg- turnip: fix huge scissor min/max case 27877ec681f3Smrg- freedreno/ir3: fix resinfo wrmask 27887ec681f3Smrg- freedreno/regs: add extra bits for UBWC array pitch 27897ec681f3Smrg- turnip: enable largePoints 27907ec681f3Smrg- turnip: enable depthBiasClamp 27917ec681f3Smrg- freedreno/registers: update varying-related registers 27927ec681f3Smrg- freedreno/a3xx: support LINEAR_PIXEL/PERSP_CENTROID/LINEAR_CENTROID sysvals 27937ec681f3Smrg- freedreno/a4xx: fake LINEAR_PIXEL varying support for u_blitter 27947ec681f3Smrg- freedreno/ir3: add generic get_barycentric() 27957ec681f3Smrg- freedreno/a5xx: set missing bary sysvals 27967ec681f3Smrg- freedreno/a6xx: set missing bary sysvals 27977ec681f3Smrg- turnip: set missing bary sysvals 27987ec681f3Smrg- freedreno/ir3: add support for INTERP_MODE_NOPERSPECTIVE 27997ec681f3Smrg- turnip: make tiling config part of framebuffer state 28007ec681f3Smrg- turnip: rework render_tiles loop 28017ec681f3Smrg- turnip: vsc improvements 28027ec681f3Smrg- turnip: fix tess param bo size calculation 28037ec681f3Smrg- turnip: clear_blit: pass aspect mask to setup function 28047ec681f3Smrg- turnip: support multi-image layouts 28057ec681f3Smrg- turnip: enable 420_UNORM formats 28067ec681f3Smrg- freedreno/layout: fix explicit layout offset not added to slice offset 28077ec681f3Smrg- freedreno/ir3: fix/rework tess levels 28087ec681f3Smrg- Revert "nir: Add an option for lowering TessLevelInner/Outer to vecs" 28097ec681f3Smrg- Revert "nir: Support sysval tess levels in SPIR-V to NIR" 28107ec681f3Smrg- freedreno/regs: document SS6_UBO state src 28117ec681f3Smrg- turnip: use global bo for clear blit shaders 28127ec681f3Smrg- freedreno/ir3: add support for a650 tess shared storage 28137ec681f3Smrg- freedreno/regs: document CS shared storage size bit 28147ec681f3Smrg- freedreno/a2xx: fix compressed textures 28157ec681f3Smrg- freedreno: add a fd_resource_pitch helper 28167ec681f3Smrg- freedreno/layout: layout simplifications and pitch from level 0 pitch 28177ec681f3Smrg- turnip: fix active_desc_sets not being set for compute pipeline 28187ec681f3Smrg- freedreno/ir3: fix setup_input for sparse vertex inputs 28197ec681f3Smrg- freedreno/ir3: run nir_opt_loop_unroll in optimization loop 28207ec681f3Smrg- freedreno: fix layout pitchalign field not being set for imported buffers 28217ec681f3Smrg- freedreno/regs: update primitive output related registers 28227ec681f3Smrg- turnip: clean up primitive output state 28237ec681f3Smrg- turnip: drop GS clear path 28247ec681f3Smrg- turnip: use DIRTY SDS bit to avoid making copies of pipeline load state ib 28257ec681f3Smrg- turnip: emit compute pipeline directly in CmdBindPipeline 28267ec681f3Smrg- turnip: fix inconsistencies with tu6_load_state_size 28277ec681f3Smrg- turnip: remove use of tu_cs_entry for draw states 28287ec681f3Smrg- gitlab-ci: re-enable arm64_a630_vk 28297ec681f3Smrg- freedreno/regs: update a6xx GRAS registers 28307ec681f3Smrg- freedreno/regs: update a6xx RB regs 28317ec681f3Smrg- freedreno/regs: update a6xx VPC regs 28327ec681f3Smrg- freedreno/regs: update a6xx PC regs 28337ec681f3Smrg- turnip: disable tiling for NV12/IYUV formats 28347ec681f3Smrg- turnip: remove extra gmem alignment 28357ec681f3Smrg- freedreno/ir3: fix wrong local_primitive_id_start type 28367ec681f3Smrg- turnip: move WFI out of draw state to fix a650 hangs 28377ec681f3Smrg- turnip: use patchControlPoints for HS_INPUT_SIZE value 28387ec681f3Smrg- turnip: fix SP_HS_UNKNOWN_A831 value for A650 28397ec681f3Smrg- turnip: workaround for a630 d24_unorm_s8_uint fails 28407ec681f3Smrg- turnip: fix sysmem CmdClearAttachments 3D fallback breaking GMEM path flush 28417ec681f3Smrg- turnip: delete tu_clear_sysmem_attachments_2d 28427ec681f3Smrg- turnip: add support for D32_SFLOAT_S8_UINT 28437ec681f3Smrg- turnip: rework extended formats to allow more extended formats 28447ec681f3Smrg- util/format: translate A4R4G4B4_UNORM and A4B4G4R4_UNORM vulkan formats 28457ec681f3Smrg- turnip: implement VK_EXT_4444_formats 28467ec681f3Smrg 28477ec681f3SmrgJordan Justen (17): 28487ec681f3Smrg 28497ec681f3Smrg- intel/dev: Split .num_subslices out of GEN12_FEATURES macro 28507ec681f3Smrg- intel/dev: Add device info for RKL 28517ec681f3Smrg- intel/l3: Don't rely on cfg entry URB size being 0 as a sentinal 28527ec681f3Smrg- intel/l3: Allow platforms to have no l3 configurations 28537ec681f3Smrg- iris/l3: Enable L3 full way allocation when L3 config is NULL 28547ec681f3Smrg- anv: Set L3 full way allocation at context init if L3 cfg is NULL 28557ec681f3Smrg- intel/dev: Add device info for DG1 28567ec681f3Smrg- iris: Make use of devinfo has_aux_map field 28577ec681f3Smrg- anv: Make use of devinfo has_aux_map field 28587ec681f3Smrg- anv/pipeline: Split VFE/INTERFACE_DESCRIPTOR out to emit_media_cs_state 28597ec681f3Smrg- anv/cmd_buffer: Split GPGPU_WALKER out to emit_gpgpu_walker 28607ec681f3Smrg- iris: Split walker and state update into iris_upload_gpgpu_walker 28617ec681f3Smrg- iris/compute: Split out iris_load_indirect_location 28627ec681f3Smrg- intel/compiler/cs: Allow simd32 in some more cases with no8 and/or no16 28637ec681f3Smrg- intel/compiler/fs: Still attempt simd32 when INTEL_DEBUG=no16 is used 28647ec681f3Smrg- iris: Add missing break in switch in modifier_is_supported 28657ec681f3Smrg- anv, iris: Set MediaSamplerDOPClockGateEnable for gen12+ 28667ec681f3Smrg 28677ec681f3SmrgJose Maria Casanova Crespo (4): 28687ec681f3Smrg 28697ec681f3Smrg- v3d: Fix swizzle in DXT3 and DXT5 formats 28707ec681f3Smrg- v3d: Include supported DXT formats to enable s3tc/dxt extensions 28717ec681f3Smrg- vc4: don't relay on intr->num_components for non-vectorized intrinsics 28727ec681f3Smrg- nir: only uniforms with dynamically_uniform offset are dynamically_uniform 28737ec681f3Smrg 28747ec681f3SmrgJoshua Ashton (7): 28757ec681f3Smrg 28767ec681f3Smrg- anv: Remove RANGE_SIZE usage 28777ec681f3Smrg- radv: Remove RANGE_SIZE usage 28787ec681f3Smrg- turnip: Remove RANGE_SIZE usage 28797ec681f3Smrg- vulkan: Update Vulkan XML and headers to 1.2.140 28807ec681f3Smrg- radv: Implement VK_EXT_custom_border_color 28817ec681f3Smrg- radeonsi: Use TRUNC_COORD on samplers 28827ec681f3Smrg- radv: Implement VK_EXT_4444_formats 28837ec681f3Smrg 28847ec681f3SmrgJosé Fonseca (3): 28857ec681f3Smrg 28867ec681f3Smrg- glthread: Add GLAPIENTRY to _mesa_marshal_MultiDrawArrays. 28877ec681f3Smrg- appveyor: Upgrade pip. 28887ec681f3Smrg- appveyor: Use Python3. 28897ec681f3Smrg 28907ec681f3SmrgKarol Herbst (50): 28917ec681f3Smrg 28927ec681f3Smrg- nir/deref: copy ptr_stride when rematerializing 28937ec681f3Smrg- nir/validate: validate the stride for deref_ptr_as_array 28947ec681f3Smrg- Revert "nir/validate: validate the stride for deref_ptr_as_array" 28957ec681f3Smrg- nvir/nir: use component helpers instead of insn->num_components 28967ec681f3Smrg- st/mesa: lower images when needed 28977ec681f3Smrg- nir/lower_images: fix for array of arrays 28987ec681f3Smrg- nir/lower_images: handle dec and inc 28997ec681f3Smrg- nv50/ir/nir: move away from image_deref intrinsics 29007ec681f3Smrg- nv50/ir/nir: handle image atomic inc and dec 29017ec681f3Smrg- nv50/ir/nir: remove image uniform hack 29027ec681f3Smrg- gv100/ir: fix atom cas 29037ec681f3Smrg- gv100/ir: fix shift lowering 29047ec681f3Smrg- gv100/ir: fix OP_TXG for shadow textures 29057ec681f3Smrg- nv50/ir/nir: add workaround for double vertex attribs 29067ec681f3Smrg- nv50/ir/print: add missing VIEWPORT_MASK handling 29077ec681f3Smrg- nv50/ir/nir: fix ext_demote_to_helper_invocation 29087ec681f3Smrg- nv50/ir/nir: fix nv_viewport_array2 29097ec681f3Smrg- nvc0: enable spirv caps with nir 29107ec681f3Smrg- nv50/ir/nir: don't emit a restart with set a stream_id 29117ec681f3Smrg- nv50/ir/nir: handle clip vertex for tess eval shaders 29127ec681f3Smrg- nv50/ir/nir: rework input output handling 29137ec681f3Smrg- nv50/ir/nir: rework CFG handling 29147ec681f3Smrg- nv50/ir/ra: convert some for loops to Range-based for loops 29157ec681f3Smrg- nv50/ir/ra: fix memory corruption when spilling 29167ec681f3Smrg- nv50/ir/nir: fix interpolation on explicit operations 29177ec681f3Smrg- gv100/ir: implement sample shading 29187ec681f3Smrg- gv100/ir: fix coherent and volatile memory access 29197ec681f3Smrg- nv50/ir/nir: fix cache mode conversion 29207ec681f3Smrg- nv50/ir: fix memset on non trivial types warning 29217ec681f3Smrg- nv50/ir/tgsi: move call to tgsi_scan_shader inside Source constructor 29227ec681f3Smrg- nvc0: set local mem size for compute on gv100 29237ec681f3Smrg- nvc0: set sampler index mode to independently on gv100 compute 29247ec681f3Smrg- gv100/ir: set ftz bit on floating point operations 29257ec681f3Smrg- ci: bump libdrm to 2.4.102 29267ec681f3Smrg- nouveau: enable HMM 29277ec681f3Smrg- gallium: add PIPE_CAP_RESOURCE_FROM_USER_MEMORY_COMPUTE_ONLY 29287ec681f3Smrg- nvc0: support PIPE_CAP_RESOURCE_FROM_USER_MEMORY_COMPUTE_ONLY 29297ec681f3Smrg- nouveau: expose HMM 29307ec681f3Smrg- ci: need to install wget in order to download libdrm 29317ec681f3Smrg- ci: bump libdrm to 2.4.102 29327ec681f3Smrg- nouveau: enable HMM 29337ec681f3Smrg- gallium: add PIPE_CAP_RESOURCE_FROM_USER_MEMORY_COMPUTE_ONLY 29347ec681f3Smrg- nvc0: support PIPE_CAP_RESOURCE_FROM_USER_MEMORY_COMPUTE_ONLY 29357ec681f3Smrg- nouveau: expose HMM 29367ec681f3Smrg- st/mesa: fix st_CopyPixels without support for stencil exports 29377ec681f3Smrg- nv50/ir/tgsi: silence warning about unhandled GS_INPUT_PRIM property 29387ec681f3Smrg- nv50/ir: initialize persampleInvocation to false 29397ec681f3Smrg- nir/lower_io: assert that offsets are used for shader_in 29407ec681f3Smrg- nv50/ir/nir: fix global_atomic_comp_swap 29417ec681f3Smrg- spirv: extract switch parsing into its own function 29427ec681f3Smrg 29437ec681f3SmrgKenneth Graunke (20): 29447ec681f3Smrg 29457ec681f3Smrg- iris: Include linux/sync_file.h instead of cut and pasting contents 29467ec681f3Smrg- anv: Include linux/sync_file.h instead of cut and pasting contents 29477ec681f3Smrg- iris: Rename iris_syncpt to iris_syncobj for clarity. 29487ec681f3Smrg- iris: Give up on not passing ice to iris_init_batch 29497ec681f3Smrg- iris: Destroy transfer slab after batches 29507ec681f3Smrg- iris: Flush any current work in iris_fence_await before adding deps 29517ec681f3Smrg- intel: Move anv_gem_supports_syncobj_wait to common code. 29527ec681f3Smrg- iris: Detect DRM_SYNCOBJ_WAIT_FLAGS_WAIT_FOR_SUBMIT kernel support 29537ec681f3Smrg- iris: Implement PIPE_FLUSH_DEFERRED support. 29547ec681f3Smrg- intel: Delete hardcoded devinfo->urb.size values for Gen7+ (sans DG1). 29557ec681f3Smrg- iris: Delete useless #define 29567ec681f3Smrg- intel/eu: Add a brw_urb_desc helper 29577ec681f3Smrg- CI: Disable Panfrost Mali-T820, Lima Mali-400 and Lima Mali-450 jobs 29587ec681f3Smrg- intel: Disable loading drivers on DG1 devices for now 29597ec681f3Smrg- nir: Fix divergence analysis for tessellation input/outputs 29607ec681f3Smrg- iris: Implement pipe->texture_subdata directly 29617ec681f3Smrg- iris: Fix CCS check in iris_texture_subdata(). 29627ec681f3Smrg- iris: Delete shader variants when deleting the API-facing shader 29637ec681f3Smrg- iris: Reorder the loops in iris_fence_await() for clarity. 29647ec681f3Smrg- iris: Drop stale syncobj references in fence_server_sync 29657ec681f3Smrg 29667ec681f3SmrgKristian Høgsberg (73): 29677ec681f3Smrg 29687ec681f3Smrg- freedreno/ir3: Pass stream output info to ir3_shader_from_nir 29697ec681f3Smrg- freedreno/ir3: Rename ir3_nir_lower_to_explicit_io 29707ec681f3Smrg- freedreno/ir3: Add ir3_nir_lower_to_explicit_input() pass 29717ec681f3Smrg- freedreno/ir3: Lower GS builtins before lowering IO 29727ec681f3Smrg- freedreno/ir3: Drop hack to clean up split vars 29737ec681f3Smrg- freedreno/fdl: Align after dividing by block size 29747ec681f3Smrg- freedreno/a6xx: Set tfetch correctly for compressed formats 29757ec681f3Smrg- freedreno/ir3: Drop wrmask for ir3 local and global store intrinsics 29767ec681f3Smrg- freedreno/a6xx: Create shader dependent streamout state at compile time 29777ec681f3Smrg- freedreno/a6xx: Map inputs to VFD entries up front 29787ec681f3Smrg- freedreno/a6xx: Allocate ringbuffer based on VFD count 29797ec681f3Smrg- freedreno/a6xx: Emit VFD setup as array writes 29807ec681f3Smrg- freedreno/a6xx: Avoid stalling for occlusion queries 29817ec681f3Smrg- freedreno: Use the right amount of &'s 29827ec681f3Smrg- freedreno: Use explicit \*_NONE enum for undefined formats 29837ec681f3Smrg- turnip: Use hw enum when emitting A6XX_RB_STENCIL_CONTROL 29847ec681f3Smrg- turnip: Use tu6_reduction_mode() to avoid warning 29857ec681f3Smrg- turnip: Use {} initializer to silence warning 29867ec681f3Smrg- freedreno/ir3: Avoid {0} initializer for struct reginfo 29877ec681f3Smrg- src/util: Remove out-of-range comparison 29887ec681f3Smrg- mapi: Fix a couple of warning in generated code 29897ec681f3Smrg- mesa/st: Use memset to zero out struct 29907ec681f3Smrg- egl/android: Move get_format under HAVE_DRM_GRALLOC guard where it's used 29917ec681f3Smrg- egl/android: Drop unused variable 29927ec681f3Smrg- freedreno/a6xx: Move per element offset to VFD_DECODE 29937ec681f3Smrg- freedreno/a6xx: Decouple VFD_FETCH and VFD_DECODE 29947ec681f3Smrg- freedreno/a6xx: Create stateobj for VFD_DECODE 29957ec681f3Smrg- freedreno/a6xx: Program VFD_DEST_CNTL from program stateobj 29967ec681f3Smrg- freedreno/a6xx: Turn on robustness extensions 29977ec681f3Smrg- docs/features.txt: Update for freedreno 29987ec681f3Smrg- freedreno/a6xx: Fix VFD_CONTROL emit 29997ec681f3Smrg- freedreno/a6xx: Don't write REG_A6XX_RB_SRGB_CNTL in restore 30007ec681f3Smrg- freedreno/a6xx: Set index buffer size to bo size 30017ec681f3Smrg- freedreno: Handle DRM_FORMAT_MOD_INVALID in shared code 30027ec681f3Smrg- turnip: Put VK_KHR_external_fence_fd stubs back 30037ec681f3Smrg- freedreno/a6xx: Don't blit with R2D_RAW 30047ec681f3Smrg- freedreno/a6xx: Move fd6_ifmt into fd6_blitter.c 30057ec681f3Smrg- freedreno/a6xx: Split out src and dst setup helpers for blit 30067ec681f3Smrg- freedreno/a6xx: Don't set unknown bit when tiling differs 30077ec681f3Smrg- freedreno/a6xx: Set src and dst rects outside blit loop 30087ec681f3Smrg- freedreno/a6xx: Program SP_2D_SRC_FORMAT outside blit loop 30097ec681f3Smrg- freedreno/a6xx: Consolidate computing blit_cntl 30107ec681f3Smrg- freedreno/a6xx: Don't emit src state when clearing 30117ec681f3Smrg- freedreno/a6xx: Separate stencil sysmem clear fix 30127ec681f3Smrg- freedreno/a6xx: Enable FMT6_10_10_10_2_UNORM blitting 30137ec681f3Smrg- freedreno/a6xx: Make blit_control helper a little more helpful 30147ec681f3Smrg- freedreno/a6xx: Program A6XX_SP_2D_SRC_FORMAT_COLOR_FORMAT based on dst format 30157ec681f3Smrg- freedreno/a6xx: Move REG_A6XX_SP_2D_SRC_FORMAT programming to helper 30167ec681f3Smrg- freedreno/a6xx: Move CP_SET_MARKER to setup helper 30177ec681f3Smrg- freedreno/a6xx: Program RB_UNKNOWN_8C01 in setup helper 30187ec681f3Smrg- freedreno/a6xx: Don't take pipe_blit_info in emit_blit_dst 30197ec681f3Smrg- freedreno/a6xx: Split clear and blit texture into different functions 30207ec681f3Smrg- freedreno/registers: Rename SP_2D_SRC_FORMAT 30217ec681f3Smrg- turnip: Move device enumeration and feature discovery to tu_drm.c 30227ec681f3Smrg- turnip: Move tu_bo functions to tu_drm.c 30237ec681f3Smrg- turnip: Collapse some tu_drm wrappers 30247ec681f3Smrg- turnip: Move remaining drm code to tu_drm.c 30257ec681f3Smrg- turnip: Only include msm_drm in tu_drm.c 30267ec681f3Smrg- egl/android: Remove unused variable 30277ec681f3Smrg- mapi/test: Change type to unsigned for offset 30287ec681f3Smrg- gallium: Switch u_debug_stack/symbol.c to util/hash_table.h 30297ec681f3Smrg- util: Move stack debug functions to src/util 30307ec681f3Smrg- util: Add unit test for stack backtrace caputure 30317ec681f3Smrg- gallium/android: Rewrite backtrace helper for android 30327ec681f3Smrg- ci: Include enough Android headers to let us compile test EGL 30337ec681f3Smrg- mapi: Mark TLS symbols as optional in glapi-symbols.txt 30347ec681f3Smrg- turnip: Make tu_android.c compile again 30357ec681f3Smrg- meson: Define ANDROID and ANDROID_API_LEVEL when compiling for Android 30367ec681f3Smrg- anv: Pass device to setup_gralloc0_usage for error reporting 30377ec681f3Smrg- anv: Add stub for anv_gem_get_tiling() for Android 30387ec681f3Smrg- vulkan: Allow global symbol HMI for Android 30397ec681f3Smrg- radv/android: Remove unused variable 30407ec681f3Smrg- ci: Add a build test for the Android platform 30417ec681f3Smrg 30427ec681f3SmrgKrzysztof Raszkowski (1): 30437ec681f3Smrg 30447ec681f3Smrg- gallium/swr: Fix building swr with MSVC 30457ec681f3Smrg 30467ec681f3SmrgLaura Ekstrand (3): 30477ec681f3Smrg 30487ec681f3Smrg- docs: include meson in the toctree 30497ec681f3Smrg- docs: Remove version. 30507ec681f3Smrg- docs: Add the favicon to the new page. 30517ec681f3Smrg 30527ec681f3SmrgLeo Liu (3): 30537ec681f3Smrg 30547ec681f3Smrg- radeon/vcn: reset the decode flags from message buffer 30557ec681f3Smrg- radeon/vcn: add Sienna to use internal register offset 30567ec681f3Smrg- radeon/vcn/dec: add db_aligned_height to message buffer 30577ec681f3Smrg 30587ec681f3SmrgLepton Wu (3): 30597ec681f3Smrg 30607ec681f3Smrg- mapi: x86: Fix dynamic entries in x86 tsd stubs. 30617ec681f3Smrg- mapi: Return NULL function pointers for GL_EXT_debug_marker 30627ec681f3Smrg- egl: Allow software rendering for vgem/virtio_gpu in platform_device 30637ec681f3Smrg 30647ec681f3SmrgLionel Landwerlin (60): 30657ec681f3Smrg 30667ec681f3Smrg- drm-shim: move handle lock to shim_fd 30677ec681f3Smrg- drm-shim: don't create a memfd per BO 30687ec681f3Smrg- drm-shim: silence warnings 30697ec681f3Smrg- intel/dev: print out error when platform is not found by name 30707ec681f3Smrg- intel: add stub_gpu tool 30717ec681f3Smrg- ci: Add intel to shaderdb runs 30727ec681f3Smrg- iris: don't assert on unfinished aux import in copy paths 30737ec681f3Smrg- anv: don't expose VK_INTEL_performance_query without kernel support 30747ec681f3Smrg- anv: fix alignments for uniform buffers 30757ec681f3Smrg- genxml: run sorting script 30767ec681f3Smrg- genxml: fix invalid end value for video fields 30777ec681f3Smrg- genxml: factor out utility functions 30787ec681f3Smrg- genxml: pack: deal with default field not being simple integers 30797ec681f3Smrg- intel/genxml: fix bits generation for MI_LOAD_REGISTER_IMM 30807ec681f3Smrg- intel/mi-builder: add framework for self modifying batches 30817ec681f3Smrg- anv: don't reserve a particular register for draw count 30827ec681f3Smrg- anv: add a new execution mode for secondary command buffers 30837ec681f3Smrg- intel/genxml: add PIPE_CONTROL command cache invalidate bit 30847ec681f3Smrg- intel/perf: make pipeline statistic query loading optional 30857ec681f3Smrg- intel/perf: store the appropriate OA formats in queries 30867ec681f3Smrg- intel/perf: update generated code to ralloc all data 30877ec681f3Smrg- intel/perf: create a unique list of counters 30887ec681f3Smrg- intel/perf: compute number of passes for a set of counters 30897ec681f3Smrg- intel/perf: emit counter units in generated code 30907ec681f3Smrg- intel/perf: add helper to compute metrics from counters 30917ec681f3Smrg- intel/perf: add counter category to generated code 30927ec681f3Smrg- intel/perf: report whether the platform supported 30937ec681f3Smrg- anv: use a query filled by the perf code 30947ec681f3Smrg- intel/perf: reuse offset specified in the query 30957ec681f3Smrg- anv: Implement VK_KHR_performance_query 30967ec681f3Smrg- intel/perf: repurpose INTEL_DEBUG=no-oaconfig 30977ec681f3Smrg- anv: fixup unwinding of device create failure 30987ec681f3Smrg- blorp: rename workaround address function 30997ec681f3Smrg- anv: store the workaround address 31007ec681f3Smrg- iris: store workaround address 31017ec681f3Smrg- i965: store workaround_bo offset 31027ec681f3Smrg- intel: add identifier for debug purposes 31037ec681f3Smrg- iris: add identifier BO 31047ec681f3Smrg- i965: add identifier BO 31057ec681f3Smrg- anv: add identifier BO 31067ec681f3Smrg- intel/aub_error_decoder: print driver identifier if found 31077ec681f3Smrg- iris: fix BO destruction in error path 31087ec681f3Smrg- i965: don't forget to set screen on duped image 31097ec681f3Smrg- iris: fix export of GEM handles 31107ec681f3Smrg- i965: fix export of GEM handles 31117ec681f3Smrg- anv: add an option to disable secondary command buffer calls 31127ec681f3Smrg- anv: garbage collect timeline semaphore when querying value 31137ec681f3Smrg- iris: fix fallback to swrast driver 31147ec681f3Smrg- anv: fix uninitialized variable access 31157ec681f3Smrg- anv: properly handle fence import of sync_fd = -1 31167ec681f3Smrg- anv: fix descriptor set free 31177ec681f3Smrg- anv: fix incorrect realloc failure handling 31187ec681f3Smrg- anv: centralize vk to gen arrays 31197ec681f3Smrg- anv: fix up dynamic clip emission 31207ec681f3Smrg- anv: don't fail userspace relocation with perf queries 31217ec681f3Smrg- anv: fix transform feedback surface size 31227ec681f3Smrg- anv: VK_INTEL_performance_query interaction with VK_EXT_private_data 31237ec681f3Smrg- intel/perf: store query symbol name 31247ec681f3Smrg- intel/perf: fix raw query kernel metric selection 31257ec681f3Smrg- intel/compiler: fixup Gen12 workaround for array sizes 31267ec681f3Smrg 31277ec681f3SmrgLiviu Prodea (1): 31287ec681f3Smrg 31297ec681f3Smrg- util: Make process_test path compatible with mingw native toolchains 31307ec681f3Smrg 31317ec681f3SmrgLouis-Francis Ratté-Boulianne (1): 31327ec681f3Smrg 31337ec681f3Smrg- nir: Always create UBO variable when lowering uniforms to ubo 31347ec681f3Smrg 31357ec681f3SmrgLucas Stach (3): 31367ec681f3Smrg 31377ec681f3Smrg- etnaviv: generalize FE stall before loading shader and sampler states 31387ec681f3Smrg- etnaviv: retarget transfer to render resource when necessary 31397ec681f3Smrg- etnaviv: don't expose timer queries 31407ec681f3Smrg 31417ec681f3SmrgLuigi Santivetti (3): 31427ec681f3Smrg 31437ec681f3Smrg- dri2: dri2_make_current() fold multiple if blocks 31447ec681f3Smrg- dri2: do not conflate unbind and bindContext() failure 31457ec681f3Smrg- egl/dri2: try to bind old context if bindContext failed 31467ec681f3Smrg 31477ec681f3SmrgMarcin Ślusarz (24): 31487ec681f3Smrg 31497ec681f3Smrg- i965: remove unused variable 31507ec681f3Smrg- glsl_to_tgsi: add fallthrough comments 31517ec681f3Smrg- glsl: cleanup vertex shader input checks 31527ec681f3Smrg- iris: remove unused iris_bo->swizzle_mode 31537ec681f3Smrg- intel/compiler: fix Android build 31547ec681f3Smrg- st/mesa: fix reporting of float perf counters max value 31557ec681f3Smrg- iris: return max counter value for AMD_performance_monitor 31567ec681f3Smrg- iris: remove iris_monitor_config 31577ec681f3Smrg- intel/perf: move query_mask and location out of gen_perf_query_counter 31587ec681f3Smrg- iris: propagate error from gen_perf_begin_query to glBeginPerfQueryINTEL 31597ec681f3Smrg- i965: propagate error from gen_perf_begin_query to glBeginPerfQueryINTEL 31607ec681f3Smrg- util: fix possible fd leaks in os_socket_listen_abstract 31617ec681f3Smrg- glsl: catch out of bounds access in the debug version 31627ec681f3Smrg- util: fix possible buffer overflow in util_get_process_exec_path 31637ec681f3Smrg- util/format: initialize non-important components to 0 31647ec681f3Smrg- mesa: fix out of bounds access in glGetFramebufferParameterivEXT 31657ec681f3Smrg- mesa: quiet down static analyzers 31667ec681f3Smrg- iris: quiet down static analyzers 31677ec681f3Smrg- intel/vec4: fix out of bounds read 31687ec681f3Smrg- intel/perf: fix performance counters availability after glFinish 31697ec681f3Smrg- anv: refresh cached current batch bo after emitting some commands 31707ec681f3Smrg- anv: fix minor gen_ioctl(I915_PERF_IOCTL_CONFIG) error handling issue 31717ec681f3Smrg- intel/perf: split load_oa_metrics 31727ec681f3Smrg- intel/perf: export performance counters sorted by [group|set] and name 31737ec681f3Smrg 31747ec681f3SmrgMarek Olšák (226): 31757ec681f3Smrg 31767ec681f3Smrg- mesa: optimize glPush/PopClientAttrib by removing malloc overhead 31777ec681f3Smrg- mesa: don't call _mesa_update_state for _mesa_get_clamp_fragment_color 31787ec681f3Smrg- mesa: don't set unnecessary program flags in _mesa_update_state 31797ec681f3Smrg- mesa: don't update shaders on fixed-func state changes if user shaders are bound 31807ec681f3Smrg- mesa,st/mesa: add a fast path for non-static VAOs 31817ec681f3Smrg- mesa: inline vbo_context inside gl_context to remove vbo_context dereferences 31827ec681f3Smrg- mesa: add glInternalBufferSubDataCopyMESA for glthread 31837ec681f3Smrg- mesa: add _mesa_InternalBind{ElementBuffer,VertexBuffers} for glthread 31847ec681f3Smrg- glthread: do glBufferSubData as unsynchronized upload + GPU copy 31857ec681f3Smrg- glthread: don't use atomics for refcounting to decrease overhead on AMD Zen 31867ec681f3Smrg- glthread: track pointers and strides for Pointer & EXT_dsa attrib functions 31877ec681f3Smrg- glthread: track instance divisor changes 31887ec681f3Smrg- glthread: track primitive restart state 31897ec681f3Smrg- glthread: initialize VAOs properly 31907ec681f3Smrg- glthread: handle POS vs GENERIC0 aliasing 31917ec681f3Smrg- glthread: handle gl{Push,Pop}ClientAttrib{DefaultEXT} for glthread states 31927ec681f3Smrg- glthread: upload non-VBO vertices and indices for non-Indirect non-IBM draws 31937ec681f3Smrg- tgsi_to_nir: handle TGSI_SEMANTIC_BLOCK_SIZE 31947ec681f3Smrg- tgsi_to_nir: handle TGSI_OPCODE_BARRIER 31957ec681f3Smrg- radeonsi: unify and align down the max SSBO/TBO/UBO buffer binding size 31967ec681f3Smrg- radeonsi: clean up and deduplicate code around internal compute dispatches 31977ec681f3Smrg- radeonsi: bind shader images after DCC is disabled for image stores 31987ec681f3Smrg- radeonsi: add SI_IMAGE_ACCESS_DCC_OFF to ignore DCC for shader images 31997ec681f3Smrg- radeonsi: implement and use compute-based DCC decompression on gfx9-10 32007ec681f3Smrg- radeonsi: add a workaround to fix KHR-GL45.texture_view.view_classes on gfx9 32017ec681f3Smrg- radeonsi: fix si_compute_clear_render_target with render condition enabled 32027ec681f3Smrg- radeonsi: revert an accidental change in si_clear_buffer 32037ec681f3Smrg- Revert "ac/surface: remove RADEON_SURF_TC_COMPATIBLE_HTILE and assume it's always set" 32047ec681f3Smrg- Revert "ac: reassociate FP expressions for inexact instructions for radeonsi" 32057ec681f3Smrg- ac/surface: fix MSAA crash with FORCE_SWIZZLE_MODE on gfx9 32067ec681f3Smrg- radeonsi: don't wait for idle at the end of gfx IBs 32077ec681f3Smrg- ac/surface: unset RADEON_SURF_TC_COMPATIBLE_HTILE if HTILE hasn't been computed 32087ec681f3Smrg- radeonsi/gfx9: always use IMG_DATA_FORMAT_S8_32 for 8-bit stencil 32097ec681f3Smrg- radeonsi: allow tc_compatible_htile to be mutable 32107ec681f3Smrg- radeonsi: enable TC-compatible HTILE on demand for best Z/S performance 32117ec681f3Smrg- tgsi_to_nir: translate non-vec4 image stores correctly 32127ec681f3Smrg- radeonsi: fix compilation of monolithic PS 32137ec681f3Smrg- amd: update amdgpu_drm.h 32147ec681f3Smrg- amd: remove duplicated definitions from amdgpu_drm.h 32157ec681f3Smrg- amd: assume CMASK is always rb/pipe_aligned, remove ac_surface.u.gfx9.cmask 32167ec681f3Smrg- amd: assume HTILE is always rb/pipe_aligned, remove ac_surface.u.gfx9.htile 32177ec681f3Smrg- ac/surface,radeonsi: move the set/get_bo_metadata code to ac_surface.c 32187ec681f3Smrg- ac/surface,radeonsi: move the set/get_umd_metadata code into ac_surface.c 32197ec681f3Smrg- amd: unify code for overriding offset and stride for imported buffers 32207ec681f3Smrg- ac/surface: override all offsets including metadata offsets 32217ec681f3Smrg- ac/surface: fix broken pitch override on gfx8 32227ec681f3Smrg- gallium: rename 'state tracker' to 'frontend' 32237ec681f3Smrg- gallium: change comments to remove 'state tracker' 32247ec681f3Smrg- gallium: rename PIPE_RESOURCE_FLAG_ST_PRIV to FRONTEND_PRIV 32257ec681f3Smrg- gallium: remove more "state tracker" occurences 32267ec681f3Smrg- radeonsi: also enable tgsi_to_nir caching for compute shaders 32277ec681f3Smrg- glthread: stop using GLenum16 to get correct GL errors for out-of-bounds enums 32287ec681f3Smrg- radeonsi: don't expose 16xAA on chips with 1 RB due to an occlusion query issue 32297ec681f3Smrg- ac/nir: honor ACCESS_STREAM_CACHE_POLICY for L1 and L0 caches too 32307ec681f3Smrg- radeonsi: use correct clear value size for EQAA in expand_fmask 32317ec681f3Smrg- radeonsi: optimize access pattern for compute blits with linear textures 32327ec681f3Smrg- radeonsi: tweak clear/copy_buffer limits when to use compute 32337ec681f3Smrg- radeonsi: simplify setting resource usage for si_init_temp_resource_from_box 32347ec681f3Smrg- radeonsi: rename SI_RESOURCE_FLAG_TRANSFER to FORCE_LINEAR 32357ec681f3Smrg- radeonsi: use vi_dcc_enabled instead of using tex->surface.dcc_offset directly 32367ec681f3Smrg- radeonsi: use display_dcc_offset for setting displayable_dcc_cb_mask 32377ec681f3Smrg- winsys/amdgpu: add RADEON_FLAG_UNCACHED for faster blits over PCIe 32387ec681f3Smrg- radeonsi: disable the L2 cache for most CPU mappings of textures 32397ec681f3Smrg- radeonsi: disable the L2 cache for CPU read mappings of buffers 32407ec681f3Smrg- radeonsi: compute perf tests - don't test 1 wave/SA limit, test no limit first 32417ec681f3Smrg- radeonsi: test uncached clear/copy buffer performance with compute shaders 32427ec681f3Smrg- gallium/u_threaded: execute transfer_unmap with THREAD_SAFE directly 32437ec681f3Smrg- ac/gpu_info: compute the best safe IB alignment 32447ec681f3Smrg- ac/surface: don't compute single-sample CMASK if it's unaligned 32457ec681f3Smrg- radeonsi: don't use INDIRECT_BUFFER within IBs 32467ec681f3Smrg- radeonsi: decrease the max GS invocation count to 32 32477ec681f3Smrg- Revert "radeonsi: don't wait for idle at the end of gfx IBs" 32487ec681f3Smrg- ac: update register and packet definitions for preemption 32497ec681f3Smrg- radeonsi: move resetting tracked registers into a new function 32507ec681f3Smrg- radeonsi: split si_all_descriptors_begin_new_cs and rename functions 32517ec681f3Smrg- radeonsi: don't enable TC-compatible HTILE for stencil if stencil doesn't use it 32527ec681f3Smrg- radeonsi/gfx8: enable TC-compatible HTILE from the beginning as before 32537ec681f3Smrg- radeonsi: don't hardcode most perf counter block counts 32547ec681f3Smrg- ac/gpu_info: replace num_good_cu_per_sh with min/max_good_cu_per_sa 32557ec681f3Smrg- amd: replace SH -> SA (shader array) in comments 32567ec681f3Smrg- radeonsi/gfx10: implement most performance counters 32577ec681f3Smrg- glthread: don't upload for glDraw inside a display list and always sync 32587ec681f3Smrg- nir: add i2imp and u2ump opcodes for conversions to mediump 32597ec681f3Smrg- nir: add int16 and uint16 type helpers 32607ec681f3Smrg- nir: lower int16 and uint16 in nir_lower_mediump_outputs 32617ec681f3Smrg- nir: fix lower_wpos for 16-bit fddy 32627ec681f3Smrg- nir: add options::vectorize_vec2_16bit to limit vectorization to vec2 16 32637ec681f3Smrg- glsl: treat lowp as mediump when lowering builtins 32647ec681f3Smrg- glsl: handle int16 and uint16 types and add instructions for mediump 32657ec681f3Smrg- glsl: lower mediump integer types to int16 and uint16 32667ec681f3Smrg- glsl: lower mediump partial derivatives 32677ec681f3Smrg- glsl: lower the precision of imageLoad 32687ec681f3Smrg- glsl: lower samplers with highp coordinates correctly 32697ec681f3Smrg- gallium: add shader caps INT16 and FP16_DERIVATIVES 32707ec681f3Smrg- ac: rename has_double_rate_fp16 -> has_packed_math_16bit 32717ec681f3Smrg- ac/nir: use more types from ac_llvm_context 32727ec681f3Smrg- ac/nir: support vector types in the type suffix of overloaded intrinsics 32737ec681f3Smrg- ac/nir: remove type and num_channels args from ac_build_buffer_store_common 32747ec681f3Smrg- ac/nir: support 16-bit data in buffer_load_format opcodes 32757ec681f3Smrg- ac/nir: support 16-bit data in image opcodes 32767ec681f3Smrg- ac/nir: handle nir_op_[fiu]2[fiu]mp opcodes 32777ec681f3Smrg- ac/nir: select v_cvt_pkrtz for all conversions from f32 to f16 for radeonsi 32787ec681f3Smrg- ac/nir: set the second v_cvt_pkrtz argument to undef if it's unused 32797ec681f3Smrg- ac/nir: support v2f16 derivatives 32807ec681f3Smrg- nir: don't count samplers and images in interface blocks 32817ec681f3Smrg- nir: gather which images are buffers 32827ec681f3Smrg- nir: gather which images are MSAA 32837ec681f3Smrg- radeonsi: remove unused leftover code for INDIRECT_BUFFER inside IBs 32847ec681f3Smrg- radeonsi: remove const_buffers_declared hacks 32857ec681f3Smrg- radeonsi: pass at most 3 images and/or shader buffers via user SGPRs for compute 32867ec681f3Smrg- radeonsi: add a hack to disable TRUNC_COORD for shadow samplers 32877ec681f3Smrg- gallium/u_vbuf: get rid of some pointer dereferences 32887ec681f3Smrg- gallium/u_vbuf: add a faster path for uploading non-interleaved attribs 32897ec681f3Smrg- glthread: sync in glFlush for multiple contexts 32907ec681f3Smrg- radeonsi: enable ARB_sparse_buffer 32917ec681f3Smrg- ac,radeonsi: replace == GFX10 with >= GFX10 where it's needed 32927ec681f3Smrg- ac,radeonsi: start adding support for gfx10.3 32937ec681f3Smrg- ac/surface: add displayable DCC code for gfx10.3 32947ec681f3Smrg- radeonsi: honor a user-specified pitch on gfx10.3 32957ec681f3Smrg- radeonsi: enable larger SDMA clears and copies on gfx10.3 32967ec681f3Smrg- radeonsi: implement R9G9B9E5 render target and image store support on gfx10.3 32977ec681f3Smrg- radeonsi: move L2_CACHE_CONTROL registers into si_emit_framebuffer_state 32987ec681f3Smrg- radeonsi: set BIG_PAGE fields on gfx10.3 32997ec681f3Smrg- radeonsi: don't set any XNACK options on gfx10.3 33007ec681f3Smrg- ac: align num_vgprs for gfx10.3 33017ec681f3Smrg- radeonsi: add support for Sienna Cichlid 33027ec681f3Smrg- radeonsi: require LLVM 11 for gfx10.3 33037ec681f3Smrg- ac/surface: don't recompute the DCC retile map for imported textures 33047ec681f3Smrg- amd/addrlib: don't recompute DCC info for every ComputeDccAddrFromCoord call 33057ec681f3Smrg- amd/addrlib: remove unused members of ADDR2_COMPUTE_DCC_ADDRFROMCOORD_INPUT 33067ec681f3Smrg- ac/surface: add a wrapper structure to hold ADDR_HANDLE 33077ec681f3Smrg- ac/surface: cache DCC retile maps (v2) 33087ec681f3Smrg- amd/addrlib: fix the C++ one definition rule violation 33097ec681f3Smrg- ac/surface: don't set is_displayable if displayable DCC is missing 33107ec681f3Smrg- ac/surface: require that gfx8 doesn't have DCC in order to be displayable 33117ec681f3Smrg- ac/surface: enable DCC for the first level in the mip tail on gfx10 33127ec681f3Smrg- ac/surface: don't free dcc_retile_map on failure 33137ec681f3Smrg- radeonsi: compact MRTs to save PS export memory space 33147ec681f3Smrg- ac/nir: fix 64-bit division for GL CTS 33157ec681f3Smrg- glapi: fix incorrect param names in ARB_vertex_attrib_binding functions 33167ec681f3Smrg- glthread: rename non_vbo_attrib_mask -> user_buffer_mask, attribs -> buffers 33177ec681f3Smrg- glthread: handle ARB_vertex_attrib_binding 33187ec681f3Smrg- radeonsi: don't wait for idle at the end of gfx IBs 33197ec681f3Smrg- radeonsi: replace ctx->screen with sscreen in si_flush_gfx_cs 33207ec681f3Smrg- glsl,driconf: add allow_glsl_120_subset_in_110 for SPECviewperf13 33217ec681f3Smrg- driconf: add workarounds for SPECviewperf13 33227ec681f3Smrg- amd: add proper definitions for NOP packets 33237ec681f3Smrg- ac,winsys/amdgpu: align IBs the same as the kernel 33247ec681f3Smrg- radeonsi: don't add the border color buffer into the init_config state 33257ec681f3Smrg- radeonsi: rename init_config states to cs_preamble states 33267ec681f3Smrg- radeonsi: don't add the tess ring buffers into the cs_preamble state 33277ec681f3Smrg- radeonsi: make wait_mem_scratch unmappable 33287ec681f3Smrg- radeonsi: disallow adding BOs into si_pm4_state except 1 shader BO per state 33297ec681f3Smrg- radeonsi: make si_pm4_cmd_begin/end static and simplify all usages 33307ec681f3Smrg- radeonsi: clear per-context buffers at the end of si_create_context 33317ec681f3Smrg- radeonsi: remove tabs 33327ec681f3Smrg- radeonsi: don't flush in fence_server_sync 33337ec681f3Smrg- ac/gpu_info: fix num_physical_sgprs_per_simd for gfx10 33347ec681f3Smrg- radeonsi: fix NGG culling for Wave64 33357ec681f3Smrg- radeonsi: always use Wave32 for GS fast launch, because Wave64 hangs 33367ec681f3Smrg- radeonsi: always use Wave64 for HS/GS/VS shader stages (except GS fast launch) 33377ec681f3Smrg- radeonsi: don't try to enable NGG culling for GS 33387ec681f3Smrg- radeonsi: add a debug option to enable NGG culling for tessellation 33397ec681f3Smrg- glsl: make print_type non-static for debugging 33407ec681f3Smrg- glsl: print precision qualifiers in IR dumps 33417ec681f3Smrg- glsl: print constant initializers 33427ec681f3Smrg- glsl: fix the type of ir_constant_data::u16 33437ec681f3Smrg- glsl: fix evaluating float16 constant expression matrices 33447ec681f3Smrg- glsl: run validate_ir_tree if GLSL_VALIDATE=1 regardless of the build config 33457ec681f3Smrg- glsl: validate more stuff 33467ec681f3Smrg- glsl: convert reusable lower_precision util code into helper functions 33477ec681f3Smrg- glsl: remove the return type from lower_precision 33487ec681f3Smrg- glsl: cleanups in lower_precision 33497ec681f3Smrg- glsl: flatten a tautological conditional in lower_precision 33507ec681f3Smrg- glsl: don't lower precision of textureSize 33517ec681f3Smrg- glsl: don't lower builtins to mediump that don't allow it 33527ec681f3Smrg- glsl: lower builtins to mediump that ignore precision of certain parameters 33537ec681f3Smrg- glsl: lower builtins to mediump that always return mediump or lowp 33547ec681f3Smrg- glsl: add capability to lower mediump array types 33557ec681f3Smrg- glsl: lower mediump temporaries to 16 bits except structures (v2) 33567ec681f3Smrg- gallium: add PIPE_SHADER_CAP_GLSL_16BIT_TEMPS for LowerPrecisionTemporaries 33577ec681f3Smrg- Revert "ac/surface: require that gfx8 doesn't have DCC in order to be displayable" 33587ec681f3Smrg- glsl: don't validate array types in ir_dereference_variable 33597ec681f3Smrg- radeonsi: prevent a gfx10_ngg_calculate_subgroup_info failure for TES+NGG GS 33607ec681f3Smrg- radeonsi: add missing initialization of registers 33617ec681f3Smrg- radeonsi/gfx10: set the correct value for OFFCHIP_BUFFERING 33627ec681f3Smrg- radeonsi: sort registers in si_emit_initial_compute_regs according to GPU gen 33637ec681f3Smrg- radeonsi: sort registers in si_init_cs_preamble_state according to GPU gen 33647ec681f3Smrg- ac: add helper ac_get_register_name 33657ec681f3Smrg- ac: add tables for CP register shadowing 33667ec681f3Smrg- winsys/amdgpu: make amdgpu_bo_unmap non-static 33677ec681f3Smrg- radeonsi: make cs_preamble_state optional 33687ec681f3Smrg- radeonsi: reorder code in update_gs_ring_buffers and init_tess_factor_ring 33697ec681f3Smrg- radeonsi: implement CP register shadowing 33707ec681f3Smrg- radeonsi: add reg shadowing codepaths to GS and tess ring setup 33717ec681f3Smrg- radeonsi: add debug code for register shadowing 33727ec681f3Smrg- radeonsi: don't restore states at the beginning of IBs if they're shadowed 33737ec681f3Smrg- radeonsi: set up IBs for preemption 33747ec681f3Smrg- radeonsi: enable preemption if the kernel enabled it 33757ec681f3Smrg- amd: rename SIENNA -> SIENNA_CICHLID 33767ec681f3Smrg- amd: add support for Navy Flounder 33777ec681f3Smrg- amd: enable displayable DCC for everything newer than Navi1x 33787ec681f3Smrg- radeonsi: disable SDMA on gfx9 33797ec681f3Smrg- radeonsi: reorder NIR optimizations 33807ec681f3Smrg- radeonsi: call nir_split_array_vars/shrink_vec_array_vars/opt_find_array_copies 33817ec681f3Smrg- glsl: lower_precision - fix assertion failure with dereferences of constants 33827ec681f3Smrg- glsl: fix constant expression evaluation for 16-bit types 33837ec681f3Smrg- glsl: don't lower atomic functions to mediump 33847ec681f3Smrg- glsl: don't create conversion opcodes for array types 33857ec681f3Smrg- glsl: don't lower to mediump for desktop OpenGL 33867ec681f3Smrg- glsl: improve precision determination for calls 33877ec681f3Smrg- Revert "radeonsi: honor a user-specified pitch on gfx10.3" 33887ec681f3Smrg- radeonsi: use correct wave size in gfx10_ngg_calculate_subgroup_info 33897ec681f3Smrg- radeonsi: use the same units for esgs_ring_size and ngg_emit_size 33907ec681f3Smrg- radeonsi: increase minimum NGG vertex count requirement per workgroup on gfx 10.3 33917ec681f3Smrg- radeonsi: fix applying the NGG minimum vertex count requirement 33927ec681f3Smrg- radeonsi: don't count unusable vertices to the NGG LDS size 33937ec681f3Smrg- radeonsi: add a common function for getting the size of gs_ngg_scratch 33947ec681f3Smrg- radeonsi: remove the NGG hack decreasing LDS usage to deal with overflows 33957ec681f3Smrg- radeonsi: various fixes for gfx10.3 33967ec681f3Smrg- radeonsi: disable NGG culling on gfx10.3 because of hangs 33977ec681f3Smrg- st/mesa: don't generate NIR for ARB_vp/fp if NIR is not preferred 33987ec681f3Smrg- radeonsi: fix tess levels coming as scalar arrays from SPIR-V 33997ec681f3Smrg- gallivm: fix build on LLVM 12 due to LLVMAddConstantPropagationPass removal 34007ec681f3Smrg- ac/llvm: fix unaligned VS input loads on gfx10.3 34017ec681f3Smrg- Revert "ac: generate FMA for inexact instructions for radeonsi" 34027ec681f3Smrg 34037ec681f3SmrgMarek Vasut (3): 34047ec681f3Smrg 34057ec681f3Smrg- etnaviv: Disable seamless cube map on GC880 34067ec681f3Smrg- etnaviv: Remove etna_resource_get_status() 34077ec681f3Smrg- etnaviv: Add lock around pending_ctx 34087ec681f3Smrg 34097ec681f3SmrgMario Kleiner (1): 34107ec681f3Smrg 34117ec681f3Smrg- vulkan/wsi: Really terminate DRM lease in wsi_release_display(). 34127ec681f3Smrg 34137ec681f3SmrgMathias Fröhlich (2): 34147ec681f3Smrg 34157ec681f3Smrg- st/mesa: Move _NEW_FRAG_CLAMP to NewFragClamp driver flag. 34167ec681f3Smrg- mesa: set _NEW_FRAG_CLAMP only when needed 34177ec681f3Smrg 34187ec681f3SmrgMatt Turner (22): 34197ec681f3Smrg 34207ec681f3Smrg- intel/compiler: Drop opt_sampler_eot() 34217ec681f3Smrg- intel/tools: Remove unnecessary reg number checking 34227ec681f3Smrg- intel/tools: Drop srctype from ipreg 34237ec681f3Smrg- intel/tools: Require explicit regions/types for special regs 34247ec681f3Smrg- intel/tools: Disallow control subregisters > 3 34257ec681f3Smrg- intel/tools: Add assembler tests for the cr0 register 34267ec681f3Smrg- intel/compiler: Add assert that set bits are within mask 34277ec681f3Smrg- intel/compiler: Don't emit no-op cr0 changes 34287ec681f3Smrg- intel/tools: Fix typos 34297ec681f3Smrg- intel/tools: Remove stray newline 34307ec681f3Smrg- intel/tools: Don't allow empty type specifier 34317ec681f3Smrg- intel/tools: Simplify register type handling 34327ec681f3Smrg- intel/tools: Make swizzle an integer 34337ec681f3Smrg- intel/tools: Make writemask an integer 34347ec681f3Smrg- intel/tools: Simplify immediate handling 34357ec681f3Smrg- intel/tools: Simplify dstregion 34367ec681f3Smrg- intel/compiler: Relax SENDS regioning assertions 34377ec681f3Smrg- intel/tools: Pass integers, not enums, to stride() 34387ec681f3Smrg- intel/tools: Manually set ARF register file/nr/subnr 34397ec681f3Smrg- intel/tools: Don't hardcode notification register 34407ec681f3Smrg- intel/tools: Simplify notification register handling 34417ec681f3Smrg- intel/tools: Test notification subregisters 34427ec681f3Smrg 34437ec681f3SmrgMauro Rossi (17): 34447ec681f3Smrg 34457ec681f3Smrg- android: iris: add iris_seqno.{c,h} to Makefile.sources 34467ec681f3Smrg- freedreno/drm: android: add libfreedreno_registers static dependency 34477ec681f3Smrg- freedreno: android: add adreno-pm4-pack.xml.h generation to android build 34487ec681f3Smrg- android: util: fix build for GL4.1 support 34497ec681f3Smrg- android: svga: fix build for GL4.1 support 34507ec681f3Smrg- android: aco: add aco_ir.cpp to Makefile.sources 34517ec681f3Smrg- android: nvir/gv100: update sources in Makefile.sources 34527ec681f3Smrg- android: freedreno: add fd5_layout.c to Makefile.sources 34537ec681f3Smrg- android: freedreno/ir3: add missing generated sources and rules 34547ec681f3Smrg- android: freedreno/ir3: simplify generated sources rules 34557ec681f3Smrg- android: panfrost/encoder: add libmesa_nir static dependency 34567ec681f3Smrg- radv: fix build on Android 7 (v2) 34577ec681f3Smrg- android: freedreno/registers: fix generated headers rules 34587ec681f3Smrg- android: freedreno/ir3: fix include paths 34597ec681f3Smrg- android: freedreno/common: add support for libfreedreno_common static 34607ec681f3Smrg- android: freedreno: move a2xx disasm out of gallium 34617ec681f3Smrg- android: freedreno/common: add libmesa_git_sha1 static dependency 34627ec681f3Smrg 34637ec681f3SmrgMichel Dänzer (38): 34647ec681f3Smrg 34657ec681f3Smrg- gitlab-ci: Use YAML anchor for llvmpipe paths in virgl rules 34667ec681f3Smrg- gitlab-ci: Update to current templates 34677ec681f3Smrg- gitlab-ci: Move down container_pre_build.sh invocation in x86_build.sh 34687ec681f3Smrg- gitlab-ci: Add Debian testing repository for x86_build image 34697ec681f3Smrg- gitlab-ci: Install WINE from Debian testing 34707ec681f3Smrg- gitlab-ci: Move lib{drm,pciaccess}-dev cross packages out of loop 34717ec681f3Smrg- gitlab-ci: Install g++-mingw-w64-x86-64-win32 instead of mingw-w64 34727ec681f3Smrg- Revert "ac,radeonsi: fix compilations issues with LLVM 11" 34737ec681f3Smrg- Revert "gallium/gallivm: fix compilation issues with llvm 11" 34747ec681f3Smrg- gitlab-ci: Enable -Werror in `meson-s390x` job 34757ec681f3Smrg- gitlab-ci: Also list arm/x86_build in needs: of test jobs 34767ec681f3Smrg- gitlab-ci: x86_test-base image as common base for x86_test-gl/vk 34777ec681f3Smrg- gitlab-ci: Pull in GCC 9 from Debian testing in x86_test-gl/vk images 34787ec681f3Smrg- gitlab-ci: Move LLVM/clang 6/7 packages to the x86_build_old image 34797ec681f3Smrg- gitlab-ci: Use Debian 10 wine-development packages 34807ec681f3Smrg- gitlab-ci: Stop using packages from Debian testing 34817ec681f3Smrg- gitlab-ci: Move meson back to x86_test-gl/vk ephemeral packages lists 34827ec681f3Smrg- gitlab-ci: Add x86_build-base docker image 34837ec681f3Smrg- gitlab-ci: Use separate docker images for cross builds 34847ec681f3Smrg- loader/dri3: Add dri3_wait_for_event_locked full_sequence out parameter 34857ec681f3Smrg- loader/dri3: Use dri3_wait_for_event_locked in loader_dri3_wait_for_msc 34867ec681f3Smrg- loader/dri3: Check for window destruction in dri3_wait_for_event_locked 34877ec681f3Smrg- gitlab-ci: Automatically run pipelines for Marge Bot pre-merge only 34887ec681f3Smrg- gitlab-ci: Use rules: instead of except:/only: for test-docs job 34897ec681f3Smrg- gitlab-ci: Extend .ci-run-policy template for docs jobs 34907ec681f3Smrg- gitlab-ci: Do not create the "success" job when the test-docs job exists 34917ec681f3Smrg- ci: Use "when: always" for pages job 34927ec681f3Smrg- ci: Move deploy stage between container & build stages 34937ec681f3Smrg- Revert "loader/dri3: Check for window destruction in dri3_wait_for_event_locked" 34947ec681f3Smrg- gitlab-ci: Remove indirect dependencies from needs: 34957ec681f3Smrg- gitlab-ci: Drop dependencies: 34967ec681f3Smrg- Revert https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4580 34977ec681f3Smrg- gitlab-ci: Fix "triggered by Marge for a merge request" rule 34987ec681f3Smrg- gitlab-ci: Only trigger test-docs job automatically for MRs 34997ec681f3Smrg- ci: Use FDO_CI_CONCURRENT in run-shader-db.sh as well 35007ec681f3Smrg- ci: Do not mark container / pages jobs as interruptible 35017ec681f3Smrg- ci: Use half as many parallel softpipe / virgl test jobs 35027ec681f3Smrg- ci: Use ignore_scheduled_pipelines anchor in .radeonsi-rules 35037ec681f3Smrg 35047ec681f3SmrgMichel Zou (1): 35057ec681f3Smrg 35067ec681f3Smrg- swr: fix build with mingw 35077ec681f3Smrg 35087ec681f3SmrgMike Blumenkrantz (73): 35097ec681f3Smrg 35107ec681f3Smrg- zink: explicitly zero some arrays in ntv 35117ec681f3Smrg- zink: add SpvId returns to a couple ntv functions 35127ec681f3Smrg- zink: flush active queries on destroy and free query object 35137ec681f3Smrg- zink: fix vkCmdResetQueryPool usage 35147ec681f3Smrg- zink: reset query on-demand when beginning a new query from resume 35157ec681f3Smrg- zink: always use logical eq ops in ntv with 1bit inputs 35167ec681f3Smrg- zink: track program usages for each shader 35177ec681f3Smrg- zink: emit interpolation decorations for ntv outputs 35187ec681f3Smrg- zink: handle more glsl->spirv builtin translation 35197ec681f3Smrg- zink: rework input/output location emission 35207ec681f3Smrg- zink: use '2' variants for device props/feats, check features for ext enabling 35217ec681f3Smrg- zink: add spirv builder util functions for emitting xfb decorations 35227ec681f3Smrg- zink: add spirv_builder methods for OpVectorExtractDynamic and OpVectorInsertDynamic 35237ec681f3Smrg- zink: implement streamout and xfb handling in ntv 35247ec681f3Smrg- zink: implement transform feedback support to finish off opengl 3.0 35257ec681f3Smrg- zink: set PIPE_CAP_VIEWPORT_TRANSFORM_LOWERED and remove POS special casing 35267ec681f3Smrg- zink: switch to passing VkPhysicalDeviceFeatures2 in VkDeviceCreateInfo 35277ec681f3Smrg- zink: enable xfb extension in screen creation 35287ec681f3Smrg- zink: use int assignment for vk int type 35297ec681f3Smrg- zink: use correct define value for reserved slot count in ntv 35307ec681f3Smrg- zink: clamp VkImageCreateInfo.arrayLayers to 1 for image resource creation 35317ec681f3Smrg- zink: unify code for setting resource barriers 35327ec681f3Smrg- zink: handle signed and unsigned min/max ops in ntv 35337ec681f3Smrg- zink: add ult handling for ntv 35347ec681f3Smrg- zink: add bitfield_reverse handling to ntv 35357ec681f3Smrg- zink: lower byte/word extract ops in nir 35367ec681f3Smrg- zink: handle ixor in ntv 35377ec681f3Smrg- zink: handle isign alu in ntv 35387ec681f3Smrg- zink: set lower_mul_high and lower_rotate in ntv compiler options 35397ec681f3Smrg- zink: use OpFUnordNotEqual for nir_op_fne 35407ec681f3Smrg- zink: set lower_uadd_carry in nir options 35417ec681f3Smrg- zink: implement Vk_EXT_index_type_uint8 35427ec681f3Smrg- nir: add lowering pass for clip plane enabling 35437ec681f3Smrg- st/program: use nir_lower_clip_disable instead of nir_lower_clip_vs conditionally 35447ec681f3Smrg- nir: add lowering pass for fragcolor -> fragdata 35457ec681f3Smrg- zink: translate gl_FragColor to gl_FragData before ntv to fix multi-rt output 35467ec681f3Smrg- u_prim_restart: handle user buffers in util_translate_prim_restart_ib() 35477ec681f3Smrg- nir: allow nir_lower_point_size_mov to run in geometry shader 35487ec681f3Smrg- nir: allow nir_lower_clip_halfz to run in geometry shaders 35497ec681f3Smrg- zink: rework query handling 35507ec681f3Smrg- zink: use #define for number of queries per-pool 35517ec681f3Smrg- zink: only stall during query destroy for xfb queries 35527ec681f3Smrg- zink: properly handle query pool overflows 35537ec681f3Smrg- zink: only reset query pool on query end if current batch isn't in renderpass 35547ec681f3Smrg- zink: use right vulkan type for GL_PRIMITIVES_GENERATED queries 35557ec681f3Smrg- zink: handle ntv case of nested loop instructions more permissively 35567ec681f3Smrg- zink: add lengthy comment and remove assert from discard_if ntv pass 35577ec681f3Smrg- zink: use type of src[0] for ntv store and load ops 35587ec681f3Smrg- zink: try copy_region hook for blits where we can't do a regular blit or resolve 35597ec681f3Smrg- zink: block vkCmdBlitImage usage for multi sampled blits 35607ec681f3Smrg- zink: block resolve blits for depth/stencil buffers 35617ec681f3Smrg- zink: handle empty attachments 35627ec681f3Smrg- zink: try to handle multisampled null buffers 35637ec681f3Smrg- zink: enable tgsi texcoord pipe cap 35647ec681f3Smrg- zink: destroy gfx program when a shader is freed 35657ec681f3Smrg- zink: destroy descriptor pools on context destroy 35667ec681f3Smrg- zink: free pipeline cache during program destroy 35677ec681f3Smrg- zink: free all ntv allocations after creating shader module 35687ec681f3Smrg- zink: use helper function to handle uvec/bvec types 35697ec681f3Smrg- zink: handle texelFetchOffset with offsets 35707ec681f3Smrg- zink: add some asserts for building access chains in ntv 35717ec681f3Smrg- zink: omit Lod image operand in ntv when not using an image texture dim 35727ec681f3Smrg- nir: allow lower_psiz_mov to run in tessellation stages 35737ec681f3Smrg- nir\_ allow nir_lower_clip_halfz to run in tess eval shader 35747ec681f3Smrg- u_prim_restart: handle indirect draws 35757ec681f3Smrg- zink: add extension loading framework for spirv builder 35767ec681f3Smrg- zink: implement VK_EXT_robustness2 35777ec681f3Smrg- zink: clamp PIPE_SHADER_CAP_MAX_SHADER_BUFFERS to PIPE_MAX_SHADER_BUFFERS 35787ec681f3Smrg- zink: handle VK_EXT_vertex_attribute_divisor setup 35797ec681f3Smrg- zink: store valid timestamp bits onto zink_screen 35807ec681f3Smrg- zink: implement handling for VK_EXT_calibrated_timestamps 35817ec681f3Smrg- u_prim_restart: add inline function for getting restart index based on index size 35827ec681f3Smrg- zink: reorder create_stream_output_target to fix failure case leak 35837ec681f3Smrg 35847ec681f3SmrgMiklós Máté (1): 35857ec681f3Smrg 35867ec681f3Smrg- docs: add some missing stuff to sourcetree.rst 35877ec681f3Smrg 35887ec681f3SmrgNanley Chery (18): 35897ec681f3Smrg 35907ec681f3Smrg- iris: Drop can_fast_clear_color's format parameter 35917ec681f3Smrg- iris: Remove the CCS_D fallback 35927ec681f3Smrg- iris: Avoid fast-clear with incompatible view 35937ec681f3Smrg- iris: Disable sRGB fast-clears for non-0/1 values 35947ec681f3Smrg- intel: Add ISL_AUX_USAGE_GEN12_CCS_E 35957ec681f3Smrg- iris: Don't support sRGB + Y_TILED_CCS on gen9 35967ec681f3Smrg- iris: Use ISL_AUX_USAGE_GEN12_CCS_E on gen12 35977ec681f3Smrg- isl/drm: Support I915_FORMAT_MOD_Y_TILED_GEN12_RC_CCS 35987ec681f3Smrg- gallium/dri2: Support I915_FORMAT_MOD_Y_TILED_GEN12_RC_CCS 35997ec681f3Smrg- iris: Handle importing aux-enabled surfaces on TGL 36007ec681f3Smrg- iris: Refactor modifier_is_supported for gen12 36017ec681f3Smrg- iris: Support I915_FORMAT_MOD_Y_TILED_GEN12_RC_CCS 36027ec681f3Smrg- iris: Zero the add-on clear color BO on import 36037ec681f3Smrg- dri_util: Update internal_format to GL_RGB8 for MESA_FORMAT_B8G8R8X8_UNORM 36047ec681f3Smrg- iris: Don't call SET_TILING for dmabuf imports 36057ec681f3Smrg- gallium/dri2: Report correct YUYV and UYVY plane count 36067ec681f3Smrg- iris: Fix aux assertion in resource_get_handle 36077ec681f3Smrg- blorp: Fix alignment test for HIZ_CCS_WT fast-clears 36087ec681f3Smrg 36097ec681f3SmrgNataraj Deshpande (3): 36107ec681f3Smrg 36117ec681f3Smrg- anv: Limit vulkan version to 1.1 for Android 36127ec681f3Smrg- anv: Disable extensions based on Android versions 36137ec681f3Smrg- dri_util: Update internal_format to GL_RGB8 for MESA_FORMAT_R8G8B8X8_UNORM 36147ec681f3Smrg 36157ec681f3SmrgNeha Bhende (6): 36167ec681f3Smrg 36177ec681f3Smrg- util: Initialize pipe_shader_state for passthrough and transform shaders 36187ec681f3Smrg- util: Add util functionality for GL4.1 support 36197ec681f3Smrg- winsys/drm: Add GL4.1 support in drm winsys 36207ec681f3Smrg- svga/include: Headers for GL4.1 support 36217ec681f3Smrg- svga: Add GL4.1(compatibility profile) support in svga driver 36227ec681f3Smrg- svga: Performance fixes 36237ec681f3Smrg 36247ec681f3SmrgNeil Armstrong (2): 36257ec681f3Smrg 36267ec681f3Smrg- Revert "CI: Disable Lima jobs due to lab unhealthiness" 36277ec681f3Smrg- Revert "CI: Disable Panfrost Mali-T820 jobs" 36287ec681f3Smrg 36297ec681f3SmrgNeil Roberts (26): 36307ec681f3Smrg 36317ec681f3Smrg- nir/scheduler: Handle nir_intrinsic_load_per_vertex_input 36327ec681f3Smrg- v3d: Remove unused member of v3d_compile 36337ec681f3Smrg- nir/schedule: Store a pointer to the scoreboard in nir_deps_state 36347ec681f3Smrg- nir/scheduler: Add an option to specify what stages share memory for I/O 36357ec681f3Smrg- v3d: Let scheduler know GS doesn’t have shared I/O memory 36367ec681f3Smrg- gallium: Add pipe cap for primitive restart with fixed index 36377ec681f3Smrg- mesa: Add PrimitiveRestartFixedIndex to gl_constants 36387ec681f3Smrg- v3d: Disable PIPE_CAP_PRIMITIVE_RESTART 36397ec681f3Smrg- v3d: Add missing macro for stvpmd instruction 36407ec681f3Smrg- v3d: Use stvpmd for non-uniform offsets in GS 36417ec681f3Smrg- compiler: Add a system value for the line coord 36427ec681f3Smrg- v3d: Implement the line coord intrinsic 36437ec681f3Smrg- nir: Add intrinsics for the line width 36447ec681f3Smrg- v3d: Handle the line width intrinsics 36457ec681f3Smrg- v3d: Add a lowering pass for line smoothing 36467ec681f3Smrg- v3d: Enable perpendicular line caps when line smoothing 36477ec681f3Smrg- broadcom/qpu: set VC5_QPU_RADDR_A out of the switch at _pack_branch 36487ec681f3Smrg- v3d/compiler: Fix sorting the gs and fs inputs 36497ec681f3Smrg- v3d/compiler: Lower geometry output store base into offset src 36507ec681f3Smrg- nir/scheduler: Move nir_scheduler to its own header 36517ec681f3Smrg- nir/schedule: Store a pointer to the options struct in scoreboard 36527ec681f3Smrg- nir/schedule: Add a callback for backend-specific dependencies 36537ec681f3Smrg- v3d: Mark scheduling dependency for prim id and first output 36547ec681f3Smrg- nir/schedule: Add an option for a fallback scheduling algorithm 36557ec681f3Smrg- v3d: Changed v3d_compile:failed to an enum 36567ec681f3Smrg- v3d: Retry with the fallback scheduler when RA fails 36577ec681f3Smrg 36587ec681f3SmrgOschowa (5): 36597ec681f3Smrg 36607ec681f3Smrg- radv: Don't take absolute value of unsigned type. 36617ec681f3Smrg- aco: Don't declare 'Block' as class, but define as struct. 36627ec681f3Smrg- aco: Don't std::move temporary object. 36637ec681f3Smrg- aco: Use correct reference type in for-range-loop. 36647ec681f3Smrg- radv: Explicitly cast TIMESTAMP_NOT_READY value to uin32_t where needed. 36657ec681f3Smrg 36667ec681f3SmrgPablo Saavedra (5): 36677ec681f3Smrg 36687ec681f3Smrg- ci: TRACES_DB_PATH and RESULTS_PATH defined as relative paths 36697ec681f3Smrg- ci: ArgumentParser receives the args from the main parameters 36707ec681f3Smrg- ci: Migrate tracie tests done in shell script to pytest 36717ec681f3Smrg- ci: Split test_tracie_skips_traces_without_checksum in separate cases 36727ec681f3Smrg- ci: Fix TypoError error when traces in traces.yml is an empty list 36737ec681f3Smrg 36747ec681f3SmrgPavel Asyutchenko (1): 36757ec681f3Smrg 36767ec681f3Smrg- vulkan/overlay: fix crash on destroying NULL swapchain 36777ec681f3Smrg 36787ec681f3SmrgPeter Seiderer (3): 36797ec681f3Smrg 36807ec681f3Smrg- vc4_bufmgr: fix time_t printf 36817ec681f3Smrg- pan_bo.h: add time.h include for time_t 36827ec681f3Smrg- v3d_bufmgr: fix time_t printf 36837ec681f3Smrg 36847ec681f3SmrgPierre Moreau (4): 36857ec681f3Smrg 36867ec681f3Smrg- clover/nir: Check the result of spirv_to_nir 36877ec681f3Smrg- clover/api: Address missing braces for subobj init 36887ec681f3Smrg- clover: Address unnecessary copy warnings 36897ec681f3Smrg- clover/spirv: Remove unused tuple header 36907ec681f3Smrg 36917ec681f3SmrgPierre-Eric Pelloux-Prayer (62): 36927ec681f3Smrg 36937ec681f3Smrg- radeonsi: fix export count 36947ec681f3Smrg- mesa: add gl_coontext::ForceIntegerTexNearest 36957ec681f3Smrg- driconf: add force_integer_tex_nearest option 36967ec681f3Smrg- radeonsi: add workaround for issue 2647 36977ec681f3Smrg- radeonsi: don't print gs_copy_shader stats for shaderdb 36987ec681f3Smrg- glsl: init gl_FragColor if zero_init=true 36997ec681f3Smrg- glsl: rework zero initialization 37007ec681f3Smrg- glsl: add a is_implicit_initializer flag 37017ec681f3Smrg- mesa: extend GLSLZeroInit semantics 37027ec681f3Smrg- gallium: add a new cap PIPE_CAP_GLSL_ZERO_INIT 37037ec681f3Smrg- ac/nir: export some undef as zero 37047ec681f3Smrg- ac/surface: remove shadowing declaration 37057ec681f3Smrg- amdgpu/radeon: add secure api 37067ec681f3Smrg- radeonsi: add AMD_DEBUG=tmz option 37077ec681f3Smrg- radeon: add RADEON_CREATE_ENCRYPTED flag 37087ec681f3Smrg- radeonsi: allocate framebuffer texture as secure when using tmz 37097ec681f3Smrg- amdgpu: add encrypted slabs support 37107ec681f3Smrg- radeonsi: force using staging texture when uploading to secure texture 37117ec681f3Smrg- radeonsi/sdma: implement tmz support 37127ec681f3Smrg- gallium: PIPE_RESOURCE_FLAG_ENCRYPTED 37137ec681f3Smrg- radeonsi: add support for PIPE_RESOURCE_FLAG_ENCRYPTED 37147ec681f3Smrg- amdgpu: use AMDGPU_IB_FLAGS_SECURE when requested 37157ec681f3Smrg- radeonsi: determine secure flag must be set for gfx IB 37167ec681f3Smrg- radeonsi: do not use cmask with encrypted texture 37177ec681f3Smrg- amd/addrlib: fix forgotten char -> enum conversions 37187ec681f3Smrg- radeonsi: fix inversed arguments in si_test_gds_memory_management 37197ec681f3Smrg- amdgpu: fix unitialized variable 37207ec681f3Smrg- radeonsi/sdma: remove useless compare 37217ec681f3Smrg- radeonsi/drirc: enable zerovram option for 7 Days to Die 37227ec681f3Smrg- winsys/radeon: do not cast bo->va as void* 37237ec681f3Smrg- radeonsi: add return value to gfx10_ngg_calculate_subgroup_info 37247ec681f3Smrg- radeonsi/ngg: try GS multi-cycling mode if default mode failed 37257ec681f3Smrg- ac/surface: set SCANOUT if surf->is_displayable 37267ec681f3Smrg- ac/surface: fix epitch when modifying surf_pitch 37277ec681f3Smrg- ac/llvm: load 1 byte at a time if unaligned on gfx10 37287ec681f3Smrg- st/mesa: make texture views inherit compressed_data storage 37297ec681f3Smrg- radeonsi: bump SI_NUM_SHADER_BUFFERS to 32 37307ec681f3Smrg- st/mesa: do not clear NewDriverState for inactive states 37317ec681f3Smrg- glsl: reject size1x8 for image variable with floating-point data types 37327ec681f3Smrg- ac/llvm: remove the -1 hack from ac_atomic_inc_wrap 37337ec681f3Smrg- glsl: don't expose imageAtomicIncWrap for signed image 37347ec681f3Smrg- glsl: only allow 32 bits atomic operations on images 37357ec681f3Smrg- glsl: declare gl_Layer/gl_ViewportIndex/gl_ViewportMask as vs builtins 37367ec681f3Smrg- st/mesa: set compressed_data to NULL when freed 37377ec681f3Smrg- bin/symbols-check.py: add --ignore-symbol argument 37387ec681f3Smrg- ac/llvm: export ac_init_llvm_once in targets 37397ec681f3Smrg- mesa: rename _mesa_free_errors_data 37407ec681f3Smrg- mesa: add bool param to _mesa_free_context_data 37417ec681f3Smrg- mesa/st: release debug_output after destroying the context 37427ec681f3Smrg- ac/surface: adapt surf_size when modifying surf_pitch 37437ec681f3Smrg- radeonsi: adjust epitch for PIPE_FORMAT_R8G8_R8B8_UNORM 37447ec681f3Smrg- radeonsi: extend workaround for KHR-GL45.texture_view.view_classes on gfx9 37457ec681f3Smrg- ac/llvm: handle static/shared llvm init separately 37467ec681f3Smrg- mesa/st: introduce PIPE_CAP_NO_CLIP_ON_COPY_TEX 37477ec681f3Smrg- radeonsi: enable PIPE_CAP_NO_CLIP_ON_COPY_TEX 37487ec681f3Smrg- ac/llvm: add option to clamp division by zero 37497ec681f3Smrg- radeonsi,driconf: add clamp_div_by_zero option 37507ec681f3Smrg- radeonsi: use radeonsi_clamp_div_by_zero for SPECviewperf13, Road Redemption 37517ec681f3Smrg- glsl: fix per_vertex_accumulator::fields size 37527ec681f3Smrg- r600/uvd: set dec->bs_ptr = NULL on unmap 37537ec681f3Smrg- radeon/vcn: set dec->bs_ptr = NULL on unmap 37547ec681f3Smrg- mesa: fix glUniform* when a struct contains a bindless sampler 37557ec681f3Smrg 37567ec681f3SmrgPierre-Loup A. Griffais (2): 37577ec681f3Smrg 37587ec681f3Smrg- radv: fix null descriptor for dynamic buffers 37597ec681f3Smrg- radv: fix vertex buffer null descriptors 37607ec681f3Smrg 37617ec681f3SmrgQiang Yu (6): 37627ec681f3Smrg 37637ec681f3Smrg- radeonsi: remove emacs style config file 37647ec681f3Smrg- panfrost: don't always build bifrost_compiler 37657ec681f3Smrg- radeonsi: fix syncobj wait timeout 37667ec681f3Smrg- radeonsi: fix user fence space when MCBP is enabled 37677ec681f3Smrg- radeonsi: fix max syncobj wait timeout 37687ec681f3Smrg- radeonsi: fix user fence GPU address 37697ec681f3Smrg 37707ec681f3SmrgRafael Antognolli (8): 37717ec681f3Smrg 37727ec681f3Smrg- intel: Store the aperture size in devinfo. 37737ec681f3Smrg- intel/isl: Update mocs for DG1 37747ec681f3Smrg- intel/l3: Return the URB size from devinfo for DG1 37757ec681f3Smrg- intel/devinfo: Add function to check for DRM_I915_GEM_GET_TILING. 37767ec681f3Smrg- iris/bufmgr: Do not use map_gtt or use set/get_tiling on DG1 37777ec681f3Smrg- anv/dg1: Don't use SET_TILING kernel uapi. 37787ec681f3Smrg- iris: Align last_seqnos to 64 bits. 37797ec681f3Smrg- anv: Align "used" attribute to 64 bits. 37807ec681f3Smrg 37817ec681f3SmrgRhys Kidd (5): 37827ec681f3Smrg 37837ec681f3Smrg- nv50_2d: regenerate envytools-based rnndb headers 37847ec681f3Smrg- nv50_2d,nvc0_2d: Document SET_PIXELS_FROM_MEMORY_SAFE_OVERLAP from rnndb 37857ec681f3Smrg- nvc0_2d: Document SET_PIXELS_FROM_MEMORY_CORRAL_SIZE from rnndb 37867ec681f3Smrg- nvc0: fix macro define for NVE4_COPY() 37877ec681f3Smrg- nvc0: add documentation for nve4+ (Kepler) COPY class 37887ec681f3Smrg 37897ec681f3SmrgRhys Perry (174): 37907ec681f3Smrg 37917ec681f3Smrg- aco: remove use of f-strings 37927ec681f3Smrg- aco: add message to static_assert 37937ec681f3Smrg- nir: add missing group_memory_barrier handling 37947ec681f3Smrg- compiler/spirv: flag nclamp/nmin/nmax as exact 37957ec681f3Smrg- nir: make fsat return 0.0 with NaN instead of passing it through 37967ec681f3Smrg- docs: add src/amd/ to sourcetree.html 37977ec681f3Smrg- docs/envvars: document ACO_DEBUG 37987ec681f3Smrg- docs/envvars: update RADV_FORCE_FAMILY 37997ec681f3Smrg- aco: simplify consecutive ordered vmem/lds writes optimization 38007ec681f3Smrg- aco: fix consecutively written vgprs from vmem instructions 38017ec681f3Smrg- aco: mark phi definitions as last-seen phi operands 38027ec681f3Smrg- aco: consider affinities when creating v_mac_f32 38037ec681f3Smrg- aco: improve phi affinities with p_split_vector 38047ec681f3Smrg- aco: split operations that use a swap's definition 38057ec681f3Smrg- aco: fix disassembly with LLVM 11 38067ec681f3Smrg- nir/opt_if: run opt_peel_loop_initial_if after all other optimizations 38077ec681f3Smrg- nir/opt_if: use nir_src_as_bool in opt_peel_loop_initial_if helper 38087ec681f3Smrg- aco: fix typo in insert_waitcnt's kill() 38097ec681f3Smrg- nir: fix lowering to scratch with boolean access 38107ec681f3Smrg- aco: fix interaction with 3f branch workaround and p_constaddr 38117ec681f3Smrg- aco: consider SDWA during value numbering 38127ec681f3Smrg- aco: check instruction format before waiting for a previous SMEM store 38137ec681f3Smrg- aco: preserve more fields when combining additions into SMEM 38147ec681f3Smrg- aco: don't reorder barriers in the scheduler 38157ec681f3Smrg- aco: fix 64-bit shared_atomic_exchange 38167ec681f3Smrg- docs: add missing "shader\_" in VK_KHR_shader_subgroup_extended_types 38177ec681f3Smrg- radv: set keep_statistic_info with RADV_DEBUG=shaderstats 38187ec681f3Smrg- ac/gpu_info, radv: set max_wave64_per_simd to 20 on GFX10 38197ec681f3Smrg- aco: use v_xor3_b32 38207ec681f3Smrg- aco: validate instructions reading/writing upper halves/bytes 38217ec681f3Smrg- aco: p_extract_vector in 64-bit u2f16/i2f16 38227ec681f3Smrg- aco: allow reading/writing upper halves/bytes when possible 38237ec681f3Smrg- aco: prefer 4-byte aligned definitions 38247ec681f3Smrg- aco: add Info::{operand_size,definition_size} 38257ec681f3Smrg- aco: use Info::definition_size instead of definition's regclass 38267ec681f3Smrg- aco: fix moving sub-dword values out of a register for a fixed definition 38277ec681f3Smrg- aco: use num_opcodes instead of last_opcode 38287ec681f3Smrg- aco: improve code for f2{i,u}{8,16} 38297ec681f3Smrg- aco: use p_as_uniform in emit_vop1_instruction 38307ec681f3Smrg- aco: add and set precise flag 38317ec681f3Smrg- aco: create mads when signed zeros should be preserved 38327ec681f3Smrg- aco: try to use fma instead of mad when denormals are enabled 38337ec681f3Smrg- aco: create 16-bit mad/fma 38347ec681f3Smrg- aco: update comment about preserving fp16/fp64 denormals 38357ec681f3Smrg- aco: create 16-bit input and output modifiers 38367ec681f3Smrg- aco: improve sub-dword check for sgpr/constant propagation 38377ec681f3Smrg- aco: fix half_pi constant for 16-bit fsin/fcos 38387ec681f3Smrg- aco: use 32-bit inline constants for 16-bit integer instructions 38397ec681f3Smrg- aco: improve 8/16-bit constants 38407ec681f3Smrg- aco: copy-propagate constants through p_extract_vector/p_split_vector 38417ec681f3Smrg- aco: optimize 16-bit and 64-bit float comparisons 38427ec681f3Smrg- aco: validate sub-dword pseudo instructions 38437ec681f3Smrg- aco: add more opcodes to can_swap_operands 38447ec681f3Smrg- aco: allow GFX9 partial writes with instructions which use opsel 38457ec681f3Smrg- aco: improve check for moving temporaries out of fixed definitions 38467ec681f3Smrg- aco: fix encoding of certain s_setreg_imm32_b32 instructions 38477ec681f3Smrg- aco: fix validation error from vgpr spill/restore code 38487ec681f3Smrg- aco: fix sub-dword opsel/sdwa checks 38497ec681f3Smrg- aco: fix validation of opsel when set for the definition 38507ec681f3Smrg- aco: shrink ssa_info 38517ec681f3Smrg- aco: make ssa_info::label 64-bit 38527ec681f3Smrg- aco: shrink mad_info 38537ec681f3Smrg- aco: fix edge check with sub-dword temporaries 38547ec681f3Smrg- aco: use the same regclass as the definition for undef phi operands 38557ec681f3Smrg- radv: add new drirc option radv_no_dynamic_bounds 38567ec681f3Smrg- radv: enable radv_no_dynamic_bounds for Path of Exile 38577ec681f3Smrg- radv: enable radv_no_dynamic_bounds for more Path of Exile executables 38587ec681f3Smrg- nir: slight correction to cube_face_coord constant folding 38597ec681f3Smrg- spirv: set variables to restrict by default 38607ec681f3Smrg- radv: fix image variable types in meta shaders 38617ec681f3Smrg- aco: only use SMEM if we can prove it's safe 38627ec681f3Smrg- aco: allow SMEM for some sub-dword accesses 38637ec681f3Smrg- radv/aco,aco: allow SMEM SSBO loads on GFX6/7 38647ec681f3Smrg- aco: fix copy+paste error in split_buffer_store 38657ec681f3Smrg- aco: don't store byte-aligned short stores 38667ec681f3Smrg- aco: add missing bld.scc() in byte_align_scalar() 38677ec681f3Smrg- aco: don't create byte-aligned short loads 38687ec681f3Smrg- aco: fix when sub-dword create_vector operand cannot be placed perfectly 38697ec681f3Smrg- aco: improve vectorization of 8/16-bit loads/stores 38707ec681f3Smrg- aco: ignore blocked registers when checking edges in get_reg_impl() 38717ec681f3Smrg- aco: remove outdated assert in handle_operands() 38727ec681f3Smrg- radv: enable zerovram for Quantic Dream games 38737ec681f3Smrg- aco: use VOP2 version of v_mbcnt_hi_u32_b32 on GFX6/7 38747ec681f3Smrg- aco: rework boolean phi pass 38757ec681f3Smrg- aco: create better code for boolean phis with constant operands 38767ec681f3Smrg- aco: optimize boolean phis with uniform selections 38777ec681f3Smrg- aco: don't create phis with undef operands in the boolean phi pass 38787ec681f3Smrg- aco: read 0 from inactive lanes when using dpp 38797ec681f3Smrg- aco: optimize some masked swizzles to DPP 38807ec681f3Smrg- aco: implement <32-bit masked_swizzle_amd 38817ec681f3Smrg- nir/lower_subgroups: pass options struct to lower_shuffle 38827ec681f3Smrg- nir/lower_subgroups: add lower_shuffle_to_swizzle_amd 38837ec681f3Smrg- radv: use lower_shuffle_to_swizzle_amd 38847ec681f3Smrg- aco: add 32-bit integer addition to can_swap_operands 38857ec681f3Smrg- aco: fix underestimated pressure in spiller when a phi has a killed def 38867ec681f3Smrg- aco: rewrite graph coloring in spiller 38877ec681f3Smrg- aco: use unordered_set for spill id interferences 38887ec681f3Smrg- aco: add add_interference() helper 38897ec681f3Smrg- aco: use s_round_mode/s_denorm_mode 38907ec681f3Smrg- aco: flush denormals before fp16 fabs/fneg if needed 38917ec681f3Smrg- aco: fix nir_op_f2f16_rtne with non-default rounding modes 38927ec681f3Smrg- aco: set tcs_in_out_eq=false if float controls of VS and TCS stages differ 38937ec681f3Smrg- radv: enable more float_controls features 38947ec681f3Smrg- aco: properly recognize that s_waitcnt mitigates VMEMtoScalarWriteHazard 38957ec681f3Smrg- aco: use s_waitcnt_depctr to mitigate VMEMtoScalarWriteHazard 38967ec681f3Smrg- spirv: don't split memory barriers 38977ec681f3Smrg- nir/lower_int64: lower 64-bit amul 38987ec681f3Smrg- aco: always set FI on GFX10 38997ec681f3Smrg- radv: replace discard with demote for Quantic Dream games 39007ec681f3Smrg- aco: implement b2i8/b2i16 39017ec681f3Smrg- aco: be more careful combining additions that could wrap into loads/stores 39027ec681f3Smrg- aco: allow overflow for some SMEM instructions 39037ec681f3Smrg- aco: add NUW flag 39047ec681f3Smrg- nir: add nir_unsigned_upper_bound and nir_addition_might_overflow 39057ec681f3Smrg- aco: use nir_addition_might_overflow to combine additions into SMEM 39067ec681f3Smrg- aco: move some setup code into helpers 39077ec681f3Smrg- aco: make validate() usable in tests 39087ec681f3Smrg- aco: print ACO IR before scheduling instead of after 39097ec681f3Smrg- radv: fix invalid conversion warnings in vk_format.h 39107ec681f3Smrg- aco: fix copy of uninitialized boolean 39117ec681f3Smrg- aco: fix includes in aco_ir.cpp 39127ec681f3Smrg- aco: add missing add_to_hazard_query 39137ec681f3Smrg- aco: rework barriers and replace can_reorder 39147ec681f3Smrg- radv/aco,aco: use scoped barriers 39157ec681f3Smrg- aco: consider intrinsic access in visit_{load,store}_image 39167ec681f3Smrg- nir,radv/aco: add and use pass to lower make available/visible barriers 39177ec681f3Smrg- aco: enable value numbering of s_buffer_load_* 39187ec681f3Smrg- aco: use storage_scratch 39197ec681f3Smrg- aco: improve sync_info for TCS output stores 39207ec681f3Smrg- aco: improve workgroup-scope and lower vmem/smem barriers 39217ec681f3Smrg- aco: create acq+rel barriers instead of acq/rel 39227ec681f3Smrg- nir/load_store_vectorize: fix indentation 39237ec681f3Smrg- ac/nir: implement scoped_barrier 39247ec681f3Smrg- radv: use scoped barriers 39257ec681f3Smrg- aco: remove isel for GLSL-style barriers 39267ec681f3Smrg- aco: add framework for unit testing 39277ec681f3Smrg- aco: add a few tests for the assembler and optimizer 39287ec681f3Smrg- aco: add framework for testing isel and integration tests 39297ec681f3Smrg- ci: enable ACO tests 39307ec681f3Smrg- aco/tests: add tests for sub-dword swaps 39317ec681f3Smrg- aco: optimize swizzled SALU 8/16-bit conversions 39327ec681f3Smrg- aco: fix waitcnt insertion on GFX10.3 39337ec681f3Smrg- aco: don't create v_mad_f32 on GFX10.3 39347ec681f3Smrg- aco: update bug workarounds for GFX10_3 39357ec681f3Smrg- aco: fix max_waves_per_simd on Polaris, VegaM and GFX10.3 39367ec681f3Smrg- aco: update vgpr_alloc_granule for GFX10.3 39377ec681f3Smrg- aco: implement subgroup shader_clock on GFX10.3 39387ec681f3Smrg- aco: update aco_opcodes.py for GFX10.3 39397ec681f3Smrg- aco: disable SMEM stores on GFX10.3 39407ec681f3Smrg- aco: replace MADs in isel with FMA on GFX10.3 39417ec681f3Smrg- spirv: set ACCESS_COHERENT for ssbo/global/image atomic load/store 39427ec681f3Smrg- radv/aco: enable VK_KHR_memory_model 39437ec681f3Smrg- ac/nir: consider an image load/store intrinsic's access 39447ec681f3Smrg- ac/nir: fix coherent global loads/stores 39457ec681f3Smrg- radv/llvm: enable VK_KHR_memory_model 39467ec681f3Smrg- aco: fix C++11/C++14 compilation 39477ec681f3Smrg- aco: set constant_data_offset correctly in the case of merged shaders 39487ec681f3Smrg- aco: don't move memory accesses to before control barriers 39497ec681f3Smrg- aco: fix non-rtz pack_half_2x16 39507ec681f3Smrg- aco: consider branch definitions in spiller 39517ec681f3Smrg- aco: don't consider the first partial spill if it's the wrong type 39527ec681f3Smrg- aco: don't fix break condition for break+discard to exec 39537ec681f3Smrg- aco: fix regclass checks when fixing to vcc/exec with Builder 39547ec681f3Smrg- aco: fix spills_entry heuristic for branch blocks in init_live_in_vars() 39557ec681f3Smrg- aco: keep loop live-through variables spilled 39567ec681f3Smrg- aco: reserve 2 sgprs for each branch 39577ec681f3Smrg- aco: create long jumps 39587ec681f3Smrg- aco: fix byte_align_scalar for 3 dword vectors 39597ec681f3Smrg- aco: fix one-off error in Operand(uint16_t) 39607ec681f3Smrg- nir/opt_if: fix opt_if_merge when destination branch has a jump 39617ec681f3Smrg- aco: fix v_writelane_b32 with two sgprs 39627ec681f3Smrg- aco: don't apply constant to SDWA on GFX8 39637ec681f3Smrg- radv: initialize with expanded cmask if the destination layout needs it 39647ec681f3Smrg- radv,aco: fix reading primitive ID in FS after TES 39657ec681f3Smrg 39667ec681f3SmrgRob Clark (265): 39677ec681f3Smrg 39687ec681f3Smrg- util/simple_mtx: add assert_locked() 39697ec681f3Smrg- freedreno: add screen lock wrappers 39707ec681f3Smrg- freedreno: switch to simple_mtx 39717ec681f3Smrg- freedreno: fix buffer import 39727ec681f3Smrg- gallium: extract out logicop helper 39737ec681f3Smrg- freedreno/drm: drop atomic refcnts 39747ec681f3Smrg- freedreno/drm: inline the things 39757ec681f3Smrg- freedreno/a6xx: small query cleanup 39767ec681f3Smrg- freedreno/a6xx: avoid unnecessary clearing VS DP state 39777ec681f3Smrg- freedreno/a6xx: move const state to single stateobj 39787ec681f3Smrg- freedreno/a6xx: move scissor state to stateobj 39797ec681f3Smrg- freedreno/a6xx: limit PROG_FB_RAST state emit 39807ec681f3Smrg- freedreno/a6xx: limit LRZ state emit 39817ec681f3Smrg- freedreno/a6xx: move blend-color to stateobj 39827ec681f3Smrg- freedreno/a6xx: combine sample mask into blend state 39837ec681f3Smrg- freedreno/a6xx: skip unnecessary MRT blend state 39847ec681f3Smrg- freedreno/a6xx: add OUT_PKT() 39857ec681f3Smrg- freedreno/a6xx: convert draw packet to OUT_PKT() 39867ec681f3Smrg- freedreno/a6xx: split out const emit 39877ec681f3Smrg- freedreno/ir3: inline const emit 39887ec681f3Smrg- freedreno/a6xx: convert const emit to OUT_PKT() 39897ec681f3Smrg- freedreno: scissor vs disabled scissor micro-opt 39907ec681f3Smrg- freedreno/a6xx: more OUT_REG() 39917ec681f3Smrg- freedreno: sync registers with envytools 39927ec681f3Smrg- freedreno/a6xx: don't set SP_FS_CTRL_REG0.VARYING for fragcoord 39937ec681f3Smrg- freedreno/a6xx: fix LRZ hang 39947ec681f3Smrg- freedreno/a6xx: add some more formats 39957ec681f3Smrg- freedreno: we don't need aligned vbo's 39967ec681f3Smrg- freedreno/a6xx: compressed blit fixes 39977ec681f3Smrg- freedreno/a6xx: enable tiled compressed textures 39987ec681f3Smrg- freedreno/gmem: don't assume scissor opt when estimating # of bins 39997ec681f3Smrg- freedreno: initialize max_scissor 40007ec681f3Smrg- freedreno/gmem: add div_align() helper 40017ec681f3Smrg- freedreno/gmem: add helper to dump GMEM layout 40027ec681f3Smrg- freedreno: add gmemtool 40037ec681f3Smrg- freedreno/gmem: relax alignment on a6xx 40047ec681f3Smrg- freedreno/gmem: rework gmem layout algo 40057ec681f3Smrg- freedreno/ir3: don't allow negative const_offset 40067ec681f3Smrg- freedreno/ir3: fix indirect cb0 load_ubo lowering 40077ec681f3Smrg- freedreno/ir3: limit # of tex prefetch by shader size 40087ec681f3Smrg- freedreno/ir3/postsched: reset sfu_delay on sync 40097ec681f3Smrg- freedreno/ir3/postsched: try to avoid (sy) syncs 40107ec681f3Smrg- freedreno/ir3/sched: avoid scheduling outputs 40117ec681f3Smrg- freedreno/ir3/sched: try to avoid syncs 40127ec681f3Smrg- freedreno/a6xx: fix max-scissor opt 40137ec681f3Smrg- freedreno/ir3: use const_index accessors 40147ec681f3Smrg- nir: fix indices for ir3 ssbo_atomic intrinsics 40157ec681f3Smrg- nir: add helper to copy const_index[] 40167ec681f3Smrg- nir: add pass to lower disjoint wrmask's 40177ec681f3Smrg- freedreno/ir3: use lower_wrmasks pass 40187ec681f3Smrg- freedreno/fdperf: add dependency on generated headers 40197ec681f3Smrg- freedreno/drm: don't pass thru 'DUMP' flag on older kernels 40207ec681f3Smrg- freedreno/drm: handle ancient kernels 40217ec681f3Smrg- freedreno/ir3: remove Sethi-Ullman numbering pass 40227ec681f3Smrg- freedreno/ir3: juggle around ir3_debug_print() 40237ec681f3Smrg- freedreno/ir3/dce: report progress 40247ec681f3Smrg- freedreno/cf: report progress 40257ec681f3Smrg- freedreno/ir3/cp: report progress 40267ec681f3Smrg- freedreno/ir3/deps: report progress 40277ec681f3Smrg- freedreno/ir3/group: report progress 40287ec681f3Smrg- freedreno/ir3/legalize: report progress 40297ec681f3Smrg- freedreno/ir3/postsched: report progress 40307ec681f3Smrg- freedreno/ir3: add IR3_PASS() macro 40317ec681f3Smrg- freedreno/ir3: move where we preserve binning pass inputs 40327ec681f3Smrg- freedreno/ir3: be iterative 40337ec681f3Smrg- freedreno/ir3: make foreach_src declare cursor ptr 40347ec681f3Smrg- freedreno/ir3: make foreach_ssa_src declar cursor ptr 40357ec681f3Smrg- freedreno/ir3: make input/output iterators declare cursor ptr 40367ec681f3Smrg- freedreno/ir3/group: fix for half-regs 40377ec681f3Smrg- freedreno/ir3: fix mismatched flags on split 40387ec681f3Smrg- freedreno/ir3/cf: handle multiple cov's properly 40397ec681f3Smrg- freedreno/ir3: fix immed type in create_addr0() 40407ec681f3Smrg- freedreno/ir3/print: print cat2 condition 40417ec681f3Smrg- freedreno/ir3/cp: fix cmps folding 40427ec681f3Smrg- freedreno/ir3: fix mismatched wrmask for overlapping VS inputs 40437ec681f3Smrg- freedreno/ir3: add simple validate pass 40447ec681f3Smrg- freedreno/ir3: add helpers to deal with src/dst types 40457ec681f3Smrg- freedreno/ir3/validate: add checking for types and opcodes 40467ec681f3Smrg- freedreno/drm: disallow exported buffers in bo cache 40477ec681f3Smrg- freedreno: add batch debugging 40487ec681f3Smrg- freedreno: clear last_fence after resource tracking 40497ec681f3Smrg- freedreno: handle PIPE_TRANSFER_MAP_DIRECTLY 40507ec681f3Smrg- freedreno/gmem: make noscis debug actually do something on a6xx 40517ec681f3Smrg- freedreno/gmemtool: make GMEM alignment per-gen 40527ec681f3Smrg- freedreno/gmemtool: add a405 40537ec681f3Smrg- freedreno/gmemtool: add verbose mode 40547ec681f3Smrg- freedreno/gmem: add some asserts 40557ec681f3Smrg- freedreno/gmem: fix nbins_x/y mismatch 40567ec681f3Smrg- freedreno/gmem: split out helper to calc # of bins 40577ec681f3Smrg- freedreno/a6xx: LRZ fix for alpha-test 40587ec681f3Smrg- freedreno/a6xx: document LRZ flag buffer 40597ec681f3Smrg- freedreno/a6xx: fix vsc assert 40607ec681f3Smrg- nir: get_base_type() should return enum type 40617ec681f3Smrg- nir: extract out convert_to_bitsize() helper 40627ec681f3Smrg- nir/builder: add bitsize conversion helpers 40637ec681f3Smrg- nir/lower_tex: fixes for fp16 yuv lowering 40647ec681f3Smrg- freedreno/ir3: split kill from no_earlyz 40657ec681f3Smrg- freedreno/a6xx: sync registers from envytools 40667ec681f3Smrg- freedreno/a6xx: update depth-plane control regs 40677ec681f3Smrg- freedreno/a6xx: re-work LRZ state tracking 40687ec681f3Smrg- freedreno/a6xx: add early-lrz-late-z mode 40697ec681f3Smrg- freedreno/a6xx: also consider alpha-test for ztest-mode 40707ec681f3Smrg- freedreno/a6xx: more early-z 40717ec681f3Smrg- freedreno/computerator: fix missing dependency on generated header 40727ec681f3Smrg- nir/print: print tex dest type 40737ec681f3Smrg- freedreno/ir3: add debug code to print conflicting half-regs 40747ec681f3Smrg- freedreno/ir3: respect tex prefetch limits 40757ec681f3Smrg- freedreno/ir3: remove RA "q-values" optimization 40767ec681f3Smrg- freedreno/ir3: limit pre-fetched tex dest 40777ec681f3Smrg- freedreno/ir3: unify shader create/delete paths 40787ec681f3Smrg- freedreno/ir3: move the libdrm dependency out of shared code 40797ec681f3Smrg- turnip: drop linking libfreedreno_drm 40807ec681f3Smrg- freedreno/ir3: don't rely on intr->num_components 40817ec681f3Smrg- radv: don't set num_components for non-vectorized intrinsics 40827ec681f3Smrg- nir/builder: don't set intr->num_components 40837ec681f3Smrg- nir/lower-atomics-to-ssbo: don't set num_components 40847ec681f3Smrg- spriv: don't set num_components for non-vectorised intrinsics 40857ec681f3Smrg- v3d: don't use intr->num_components for non-vectorized intrinsics 40867ec681f3Smrg- nir/validate: validate intr->num_components 40877ec681f3Smrg- freedreno/log-parser: fix compute times 40887ec681f3Smrg- freedreno/sched: reset delay counters at start of block 40897ec681f3Smrg- freedreno/ir3/validate: also check instr->address 40907ec681f3Smrg- freedreno/ir3/cp: properly handle already-folded RELATIV 40917ec681f3Smrg- freedreno: splitup emit_string_marker 40927ec681f3Smrg- freedreno/a6xx: emit shader names in debug builds 40937ec681f3Smrg- freedreno/ir3/legalize: don't allow (nopN) if (rptN) 40947ec681f3Smrg- freedreno/ir3/print: print (r) flag 40957ec681f3Smrg- freedreno/ir3: add test for delay slot calculation 40967ec681f3Smrg- freedreno/ir3/delay: calculate delay properly for (rptN)'d instructions 40977ec681f3Smrg- freedreno/ir3: add helpers to move instructions 40987ec681f3Smrg- freedreno/ir3: delay test support for vectorish instructions 40997ec681f3Smrg- freedreno/ir3/cp: extract valid_flags 41007ec681f3Smrg- freedreno/ir3: add post-scheduler cp pass 41017ec681f3Smrg- freedreno/ir3: convert regmask_t to struct 41027ec681f3Smrg- freedreno/ir3: move mergedreg state out of reg 41037ec681f3Smrg- freedreno/ir3: decouple regset from gpu gen 41047ec681f3Smrg- freedreno/ir3: pass variant to postsched 41057ec681f3Smrg- freedreno/ir3: re-work assembler API 41067ec681f3Smrg- freedreno/ir3: make mergedregs a property of the variant 41077ec681f3Smrg- freedreno/a6xx: set .MERGEREGS based on variant 41087ec681f3Smrg- turnip: set .MERGEDREGS based on variant 41097ec681f3Smrg- freedreno/computerator: MERGEDREGS update 41107ec681f3Smrg- freedreno/ir3: update obsolete comment 41117ec681f3Smrg- spirv: atomic_counter_read_deref is not vectorized 41127ec681f3Smrg- spirv: drop some dead code 41137ec681f3Smrg- glsl_to_nir: fix is_helper_invocation 41147ec681f3Smrg- glsl_to_nir: fix shader_clock 41157ec681f3Smrg- glsl_to_nir: fix vote_any/vote_all 41167ec681f3Smrg- freedreno/ir3: refactor out helper to compile shader from asm 41177ec681f3Smrg- freedreno/ir3: add accessor for const_state 41187ec681f3Smrg- freedreno/a6xx: defer userconst cmdstream size calculation 41197ec681f3Smrg- freedreno/ir3: move ubo_state into const_state 41207ec681f3Smrg- freedreno/ir3: drop shader->num_ubos 41217ec681f3Smrg- freedreno/ir3: constify shader key 41227ec681f3Smrg- freedreno/ir3: pass variant to ir3_create() 41237ec681f3Smrg- freedreno/ir3: convert over to ralloc 41247ec681f3Smrg- freedreno/ir3: move num_reserved_user_consts out of const_state 41257ec681f3Smrg- freedreno/ir3: un-embed const_state 41267ec681f3Smrg- freedreno/ir3: move const_state back to variant 41277ec681f3Smrg- freedreno/ir3: move output_loc to variant 41287ec681f3Smrg- freedreno/ir3: split out ubo info from range 41297ec681f3Smrg- freedreno/ir3: splitup get_existing_range() 41307ec681f3Smrg- freedreno/ir3: split ubo analysis/lowering passes 41317ec681f3Smrg- ci: remove some freedreno a6xx skips 41327ec681f3Smrg- freedreno/ir3: add helper to determine point-coord inputs 41337ec681f3Smrg- freedreno/a6xx: de-duplicate vinterp/vpsrepl state building 41347ec681f3Smrg- freedreno/a6xx: use point-coord helper 41357ec681f3Smrg- freedreno/a5xx: use point-coord helper 41367ec681f3Smrg- freedreno/a4xx: use point-coord helper 41377ec681f3Smrg- freedreno/a3xx: use point-coord helper 41387ec681f3Smrg- freedreno: convert builtin blit VS prog to ureg builder 41397ec681f3Smrg- freedreno/ir3: switch PIPE_CAP_TGSI_TEXCOORD 41407ec681f3Smrg- freedreno: make foreach_bit() declare it's cursor 41417ec681f3Smrg- freedreno: split out batch draw tracking helper 41427ec681f3Smrg- freedreno: split out batch clear tracking helper 41437ec681f3Smrg- freedreno: handle batch flush in resource tracking 41447ec681f3Smrg- freedreno/ir3/ra: fix pre-color edge case 41457ec681f3Smrg- freedreno/ir3: add ir3_finalize_nir() 41467ec681f3Smrg- freedreno/ir3: move finalize_nir to pscreen hook 41477ec681f3Smrg- freedreno/ir3: add ir3_compiler_destroy() 41487ec681f3Smrg- freedreno/ir3: shuffle some variant fields 41497ec681f3Smrg- freedreno/a6xx+ir3: stop generating pointless binning shaders 41507ec681f3Smrg- freedreno/ir3: build binning variant at same time as draw variant 41517ec681f3Smrg- freedreno/ir3: disk-cache support 41527ec681f3Smrg- freedreno/ir3: move nir finalization to after cache miss 41537ec681f3Smrg- freedreno/fdperf: fix print of base address 41547ec681f3Smrg- freedreno/fdperf: better compatible string matching 41557ec681f3Smrg- freedreno/fdperf: prefer render node 41567ec681f3Smrg- gitlab-ci: reduce a630 runner load 41577ec681f3Smrg- freedreno/ir3: add missing VS driver params 41587ec681f3Smrg- freedreno/ir3: make compile fails more visible 41597ec681f3Smrg- freedreno/a6xx: bail instead of crash for compile fails 41607ec681f3Smrg- freedreno/ir3/ra: be better at failing 41617ec681f3Smrg- freedreno/a6xx: don't enable early-z/lrz if no z-test 41627ec681f3Smrg- freedreno/ir3: DCE unused arrays 41637ec681f3Smrg- driconf: allowlist/denylist 41647ec681f3Smrg- gitlab-ci: re-enable all a630 jobs 41657ec681f3Smrg- freedreno: small comment re-word 41667ec681f3Smrg- freedreno: whitespace fix 41677ec681f3Smrg- freedreno/ir3/parser: half-precision relative regs 41687ec681f3Smrg- freedreno/ir3: set array precision on creation 41697ec681f3Smrg- freedreno/ir3: fix half-reg array stores 41707ec681f3Smrg- freedreno/ir3/ra: debug msgs tweak 41717ec681f3Smrg- freedreno/ir3/ra: assign vreg names to all array elements 41727ec681f3Smrg- freedreno/ir3/ra: fix array conflicts for split/merged 41737ec681f3Smrg- freedreno: sync registers from envytools 41747ec681f3Smrg- freedreno: make gen_header.py check parent directory 41757ec681f3Smrg- freedreno: slurp in rnndb 41767ec681f3Smrg- freedreno: slurp in rnn 41777ec681f3Smrg- freedreno: slurp in decode tools 41787ec681f3Smrg- freedreno: slurp in afuc 41797ec681f3Smrg- freedreno/rnn: warnings cleanup 41807ec681f3Smrg- freedreno/decode: warnings cleanup 41817ec681f3Smrg- freedreno/afuc: warnings cleanup 41827ec681f3Smrg- freedreno: add CI for envytools tools 41837ec681f3Smrg- freedreno/ir3: split out regmask 41847ec681f3Smrg- freedreno: drop shader_t 41857ec681f3Smrg- freedreno: deduplicate a3xx+ disasm 41867ec681f3Smrg- freedreno: move a2xx disasm out of gallium 41877ec681f3Smrg- freedreno: deduplicate a2xx disasm 41887ec681f3Smrg- freedreno/ci: add a2xx trace to CI job 41897ec681f3Smrg- freedreno/tools: check rnn parse status 41907ec681f3Smrg- freedreno/rnn: split out helper to find files 41917ec681f3Smrg- freedreno/rnn: add error helper 41927ec681f3Smrg- freedreno/rnn: rename schema file 41937ec681f3Smrg- freedreno/rnn: update schema for 'pos' 41947ec681f3Smrg- freedreno/rnn: add relaxed boolean type 41957ec681f3Smrg- freedreno/rnn: add high/low/pos to registers 41967ec681f3Smrg- freedreno/rnn: add radix/align 41977ec681f3Smrg- freedreno/rnn: relax Hexadecimal to HexOrNumber 41987ec681f3Smrg- freedreno/rnn: add variants/varset to domain 41997ec681f3Smrg- freedreno/registers/a2xx: fix validation error 42007ec681f3Smrg- freedreno/registers/a4xx: fix validation error 42017ec681f3Smrg- freedreno/registers/adreno_pm4: fix validation errors 42027ec681f3Smrg- freedreno/rnn: describe copyright element in schema 42037ec681f3Smrg- freedreno/rnn: add "addvariant" to schema 42047ec681f3Smrg- freedreno/rnn: allow name to be optional in arrays 42057ec681f3Smrg- freedreno/rnn: fix use-group 42067ec681f3Smrg- freedreno/registers/mdp5: fix validation error 42077ec681f3Smrg- freedreno/rnn: schema updates for dynamic/irregular offsets 42087ec681f3Smrg- freedreno/rnn: add schema validation 42097ec681f3Smrg- freedreno/rnn: headergen2 warnings cleanup 42107ec681f3Smrg- freedreno/decode: cffdec warnings cleanup 42117ec681f3Smrg- freedreno/ir3: add missing track_ubo_use() 42127ec681f3Smrg- freedreno/a6xx: don't emit a bogus size for empty cb slots 42137ec681f3Smrg- freedreno/a6xx: fixup draw state earlier 42147ec681f3Smrg- freedreno/rnn: also look for .xml.gz 42157ec681f3Smrg- freedreno/rnn: rework RNN_DEF_PATH construction 42167ec681f3Smrg- freedreno/registers: add .gitignore 42177ec681f3Smrg- freedreno/registers: split header build into subdirs 42187ec681f3Smrg- freedreno/registers: install gzip'd register database 42197ec681f3Smrg- freedreno/decode: move dependencies up a level 42207ec681f3Smrg- freedreno: allow fence_fd fences to be recycled 42217ec681f3Smrg- freedreno/ir3: ir3_cmdline updates 42227ec681f3Smrg- freedreno/ir3: lower local_index using local_id 42237ec681f3Smrg- glsl/lower_precision: split out const lowering 42247ec681f3Smrg- gallium: replace 16BIT_TEMPS cap with 16BIT_CONSTS 42257ec681f3Smrg- glsl: remove LowerPrecisionTemporaries 42267ec681f3Smrg- glsl: don't inline intrinsics for mediump 42277ec681f3Smrg- glsl_to_nir: fix bitfield_extract with 16-bit operands 42287ec681f3Smrg- freedreno/registers: add some missing regs to build 42297ec681f3Smrg- freedreno/crashdec: handle section name typos 42307ec681f3Smrg- freedreno/a6xx: fix occlusion query with more than one tile 42317ec681f3Smrg- freedreno: handle case of shadowing current render target 42327ec681f3Smrg- freedreno/gmemtool: add tile_alignw/h and a650 42337ec681f3Smrg 42347ec681f3SmrgRohan Garg (3): 42357ec681f3Smrg 42367ec681f3Smrg- iris: Fix documentation for _iris_batch_flush 42377ec681f3Smrg- ci: Include trace replay support in ARM rootfses. 42387ec681f3Smrg- gitlab-ci: Replay traces on lava devices 42397ec681f3Smrg 42407ec681f3SmrgRoland Scheidegger (1): 42417ec681f3Smrg 42427ec681f3Smrg- gallivm: fix half to float conversions with llvm 11 42437ec681f3Smrg 42447ec681f3SmrgRoman Gilg (2): 42457ec681f3Smrg 42467ec681f3Smrg- vulkan/wsi/x11: add sent image counter 42477ec681f3Smrg- vulkan/wsi/x11: wait for acquirable images in FIFO mode 42487ec681f3Smrg 42497ec681f3SmrgRoman Stratiienko (5): 42507ec681f3Smrg 42517ec681f3Smrg- egl: Build surfaceless platform on Android 42527ec681f3Smrg- Android: Fixes for Q and R 42537ec681f3Smrg- panfrost: Android build fixes 2020 week 31 42547ec681f3Smrg- lima: Fix lima_screen_query_dmabuf_modifiers() 42557ec681f3Smrg- android: freedreno: Another build fix 42567ec681f3Smrg 42577ec681f3SmrgSagar Ghuge (3): 42587ec681f3Smrg 42597ec681f3Smrg- iris: Use modfiy disables for 3DSTATE_WM_DEPTH_STENCIL command 42607ec681f3Smrg- intel/compiler: Optimize integer add with 0 into mov 42617ec681f3Smrg- intel/compiler: Remove unnecessary optimization for MUL 42627ec681f3Smrg 42637ec681f3SmrgSamuel Pitoiset (235): 42647ec681f3Smrg 42657ec681f3Smrg- ci: fix reporting the number of unexpected/flakes 42667ec681f3Smrg- ci: add lists of expected failures & skipped tests for RAVEN with ACO 42677ec681f3Smrg- aco: remove unecessary p_split_vector with v2b reg class 42687ec681f3Smrg- radv: enable shaderInt16 unconditionally with LLVM and only GFX8+ with ACO 42697ec681f3Smrg- radv: cleanup radv_CreateInstance() 42707ec681f3Smrg- radv: rename radv_devices() to radv_enumerate_physical_devices() 42717ec681f3Smrg- radv: fix a memleak if the physical device initialization failed 42727ec681f3Smrg- radv: report INITIALIZATION_FAILED when the amdgpu winsys init failed 42737ec681f3Smrg- radv: don't report error with other vendor DRM devices 42747ec681f3Smrg- radv: use a linked list for physical devices 42757ec681f3Smrg- radv: display an error message if the winsys init failed 42767ec681f3Smrg- radv/winsys: do not count visible VRAM buffers twice in the budget 42777ec681f3Smrg- ci: remove unused .test-radv-fossilize rule 42787ec681f3Smrg- ci: set ACO_DEBUG=validateir,validatera global for RADV testing 42797ec681f3Smrg- ci: run radv-fossils with Pitcairn (GFX6) and Bonaire (GFX7) too 42807ec681f3Smrg- radv: remove the LLVM version string when ACO is used 42817ec681f3Smrg- radv: do not print the LLVM version string twice in hang reports 42827ec681f3Smrg- radv: report correct backend IR in hang reports when ACO is used 42837ec681f3Smrg- aco: fix 64-bit trunc with negative exponents on GFX6 42847ec681f3Smrg- nir: do not vectorize load/store if offset can overflow and robustness enabled 42857ec681f3Smrg- aco: prevent invalid loads/stores vectorization if robustness is enabled 42867ec681f3Smrg- radv: limit the Vulkan version to 1.1 for Android 42877ec681f3Smrg- radv: handle different Vulkan API versions correctly 42887ec681f3Smrg- radv: update the list of allowed Android extensions 42897ec681f3Smrg- aco: optimize add/sub(a, cndmask(b, 0, 1, cond)) -> addc/subbrev_co(0, a, b) 42907ec681f3Smrg- radv: use the common base object type for VkDevice 42917ec681f3Smrg- radv: use the base object struct types 42927ec681f3Smrg- radv: implement VK_EXT_private_data 42937ec681f3Smrg- vulkan: import common code for generating extensions 42947ec681f3Smrg- radv: use the common code for generating extensions and dispatch tables 42957ec681f3Smrg- anv: use the common code for generating extensions and dispatch tables 42967ec681f3Smrg- turnip: use the common code for generating extensions and dispatch tables 42977ec681f3Smrg- radv: add a LLVM version string workaround for SotTR and ACO 42987ec681f3Smrg- aco: remove useless check for nir_tex_src_bias 42997ec681f3Smrg- aco: add support for texturing with clamped LOD 43007ec681f3Smrg- ac/llvm: add support for texturing with clamped LOD 43017ec681f3Smrg- radv: enable shaderResourceMinLod 43027ec681f3Smrg- spirv: handle OpCopyObject correctly with any types 43037ec681f3Smrg- radv: fix missing break in radv_GetPhysicalDeviceProperties2() 43047ec681f3Smrg- aco: store 16-bit temporary outputs as v2b 43057ec681f3Smrg- aco: convert 16-bit values before exporting MRTs 43067ec681f3Smrg- aco: allow to load/store 16-bit values in VMEM for tess and geom 43077ec681f3Smrg- aco: implement 8-bit/16-bit mov's with p_create_vector 43087ec681f3Smrg- aco: implement 16-bit vertex fetches with tbuffer_load_format_d16_* 43097ec681f3Smrg- aco: validate v_interp_*_f16 as VOP3 instructions instead of VINTRP 43107ec681f3Smrg- aco: emit v_interp_*_f16 instructions as VOP3 instead of VINTRP 43117ec681f3Smrg- aco: implement 16-bit interp 43127ec681f3Smrg- aco: fix off-by-one error with 16-bit MTBUF opcodes on GFX10 43137ec681f3Smrg- radv/aco: enable storageInputOutput16 on GFX9+ 43147ec681f3Smrg- aco: fix missing break in label_instruction() 43157ec681f3Smrg- radv: fix missing break in radv_GetPhysicalDeviceFeatures2() 43167ec681f3Smrg- radv: fix duplicated expression in ac_setup_rings() 43177ec681f3Smrg- radv/winsys: remove useless free in radv_amdgpu_create_bo_list() 43187ec681f3Smrg- aco: declare 8-bit/16-bit reduce operations 43197ec681f3Smrg- aco: implement 8-bit/16-bit reductions 43207ec681f3Smrg- aco: validate 8-bit/16-bit VGPR operands for readfirstlane/readlane/writelane 43217ec681f3Smrg- aco: implement 8-bit/16-bit nir_intrinsic_read_first_invocation 43227ec681f3Smrg- aco: implement 8-bit/16-bit nir_intrinsic_{shuffle,_read_invocation} 43237ec681f3Smrg- aco: implement 8-bit/16-bit nir_intrinsic_quad_* 43247ec681f3Smrg- aco: use a temporary SGPR for 8-bit/16-bit literal reduction identities 43257ec681f3Smrg- aco: sign-extend the input and identity for 8-bit subgroup operations 43267ec681f3Smrg- radv: do not return from radv_GetPhysicalDeviceFeatures2() 43277ec681f3Smrg- radv: cleanup physical device features 43287ec681f3Smrg- radv: remove useless assignment in build_streamout_vertex() 43297ec681f3Smrg- spirv: add ReadClockKHR support with device scope 43307ec681f3Smrg- aco: implement nir_intrinsic_shader_clock with device scope 43317ec681f3Smrg- ac/nir: fix shader clock with subgroup scope 43327ec681f3Smrg- ac/nir: implement nir_intrinsic_shader_clock with device scope 43337ec681f3Smrg- radv: advertise shaderDeviceClock on GFX8+ 43347ec681f3Smrg- spirv: add SpvCapabilityImageGatherBiasLodAMD 43357ec681f3Smrg- spirv: add support for bias/lod with OpImageGather 43367ec681f3Smrg- ac/nir: add support for bias/lod with texture gather 43377ec681f3Smrg- aco: add support for bias/lod with texture gather 43387ec681f3Smrg- radv: add support for querying which formats support texture gather LOD 43397ec681f3Smrg- radv: advertise VK_AMD_texture_gather_bias_lod 43407ec681f3Smrg- spirv,radv,anv: implement no-op VK_GOOGLE_user_type 43417ec681f3Smrg- radv/aco: enable VK_EXT_subgroup_size_control 43427ec681f3Smrg- aco: fix register allocation for subdword instructions on GFX10 43437ec681f3Smrg- aco: implement 8-bit/16-bit reductions on GFX10 43447ec681f3Smrg- aco: allocate a temp VGPR for some 8-bit/16-bit reduction ops on GFX10 43457ec681f3Smrg- aco: allow gfx10_wave64_bpermute with 8-bit/16-bit input 43467ec681f3Smrg- aco: sign-extend input/indentity for 32-bit reduce ops on GFX10 43477ec681f3Smrg- radv/aco: enable VK_KHR_subgroup_extended_types on GFX8+ 43487ec681f3Smrg- radv: enable zero VRAM for Doom Eternal 43497ec681f3Smrg- radv: enable zero VRAM for all VKD3D (DX12->VK) games 43507ec681f3Smrg- aco: implement 16-bit reduce operations on GFX6-GFX7 43517ec681f3Smrg- aco: implement 16-bit nir_intrinsic_quad_* on GFX6-GFX7 43527ec681f3Smrg- aco: fix subdword copies on GFX6-GFX7 43537ec681f3Smrg- aco: sign-extend input/identity for 16-bit subgroup ops on GFX6-GFX7 43547ec681f3Smrg- radv/aco: enable 64-bit atomic features if RADV is linked with LLVM 8 43557ec681f3Smrg- aco: use v_bfe_u32 for unsigned reductions sign-extension on GFX6-GFX7 43567ec681f3Smrg- aco: fix sign-extend 8-bit subgroup operations on GFX6-GFX7 43577ec681f3Smrg- aco: fix nir_intrinsic_quad_* with 8-bit in GFX6-GFX7 43587ec681f3Smrg- radv/aco: enable VK_KHR_shader_subgroup_extended_types on GFX6-GFX7 43597ec681f3Smrg- ac/nir: adjust an assertion for D16 on GFX6-GFX7 43607ec681f3Smrg- nir/lower_explicit_io: fix NON_UNIFORM access for UBO loads 43617ec681f3Smrg- radv/llvm: expose VK_EXT_shader_demote_to_helper_invocation with LLVM 9+ 43627ec681f3Smrg- aco: implement 8-bit/16-bit conversions on GFX6-GFX7 43637ec681f3Smrg- aco: fix alignment of vectors with 4 elements 43647ec681f3Smrg- radv/aco: enable 8-bit/16-bit storage on GFX6-GFX7 43657ec681f3Smrg- radv/aco: enable shaderInt16 on GFX6-GFX7 43667ec681f3Smrg- radv/aco: enable shaderInt8 and VK_KHR_shader_float16_int8 on GFX6-GFX7 43677ec681f3Smrg- ac/nir: fix integer comparisons with pointers 43687ec681f3Smrg- radv: set DB_SHADER_CONTROL.CONSERVATIVE_Z_EXPORT correctly 43697ec681f3Smrg- radv: add new drirc option radv_enable_mrt_output_nan_fixup 43707ec681f3Smrg- aco: implement radv_enable_mrt_output_nan_fixup workaround 43717ec681f3Smrg- radv/llvm: implement radv_enable_mrt_output_nan_fixup workaround 43727ec681f3Smrg- radv: enable radv_enable_mrt_output_nan_fixup for RAGE 2 43737ec681f3Smrg- ac: add ac_choose_spi_color_formats() to common code 43747ec681f3Smrg- spirv: fix using OpSampledImage with OpUndef instead of OpType{Image,Sampler} 43757ec681f3Smrg- aco: allow to swap operands for some 16-bit float instructions 43767ec681f3Smrg- spirv: do not set num_components for non-vectorized mbcnt_amd intrinsic 43777ec681f3Smrg- radv/aco: enable FP16 features/extensions on GFX9+ 43787ec681f3Smrg- radv: lower discards to demote to workaround a RDR2 game bug 43797ec681f3Smrg- radv: make sure to set CB_SHADER_MASK correctly for internal CB operations 43807ec681f3Smrg- radv: compute CB_SHADER_MASK from the fragment shader outputs 43817ec681f3Smrg- radv: only requires LLVM 9 for GFX10 if not using ACO 43827ec681f3Smrg- radv: replace == GFX10 with >= GFX10 where it's needed 43837ec681f3Smrg- aco: replace == GFX10 with >= GFX10 where it's needed 43847ec681f3Smrg- radv: add support for Sienna Cichlid 43857ec681f3Smrg- radv: require LLVM 11+ for GFX 10.3 if not using ACO 43867ec681f3Smrg- aco: fix printing ASM on GFX6-7 if clrxdisasm is not found 43877ec681f3Smrg- aco: improve validation checks for readlane/writelane 43887ec681f3Smrg- aco: fix printing ASM on GFX6-7 again 43897ec681f3Smrg- gitlab-ci: stop testing RADV with LLVM 43907ec681f3Smrg- gitlab-ci: update the list of expected CTS failures for RADV/ACO 43917ec681f3Smrg- gitlab-ci: update the list of expected failures for Pitcairn 43927ec681f3Smrg- radv: fix checking the return value of cs_finalize() 43937ec681f3Smrg- gitlab-ci: add parallel-rdp fossils 43947ec681f3Smrg- radv: lower 64-bit drcp/dsqrt/drsq for fixing precision issues 43957ec681f3Smrg- radv: lower 64-bit dfloor on GFX6 for fixing precision issues 43967ec681f3Smrg- gitlab-ci: add a list of expected failures for RADV/ACO on NAVI14 43977ec681f3Smrg- gitlab-ci: set the number of Fossilize threads to 4 43987ec681f3Smrg- gitlab-ci: append Fossilize stdout/stderr to a file to reduce spam 43997ec681f3Smrg- gitlab-ci: attach the Fossilize log file as artifact on failure 44007ec681f3Smrg- radv: remove the shader ballot workaround for Youngblood with LLVM 44017ec681f3Smrg- radv: remove the load/store workaround for Monster Hunter World with LLVM 44027ec681f3Smrg- radv: enable VK_AMD_shader_ballot on GFX6-7 with both compiler backends 44037ec681f3Smrg- radv: adjust CB_SHADER_MASK for dual-source blending in the shader info pass 44047ec681f3Smrg- radv: rework 8/16-bit color attachment formats detection 44057ec681f3Smrg- radv: use SPI_SHADER_ZERO for non-written color attachments 44067ec681f3Smrg- radv: add support for MRTs compaction to avoid holes 44077ec681f3Smrg- radv: fix wide points and lines 44087ec681f3Smrg- radv: fix wide lines with multisample enabled 44097ec681f3Smrg- Revert "vulkan/wsi/x11: Ensure we create at least minImageCount images." 44107ec681f3Smrg- radv,vulkan: add a new x11 wsi drirc workaround for DOOM Eternal 44117ec681f3Smrg- radv: disable FMASK compression when drawing with GENERAL layout 44127ec681f3Smrg- radv: set depth/stencil enable values correctly for the meta clear path 44137ec681f3Smrg- radv: implement missing VK_ACCESS_MEMORY_{READ,WRITE}_BIT 44147ec681f3Smrg- radv: store the primitive topology hardware value in the pipeline 44157ec681f3Smrg- radv: adjust IA_MULTI_VGT_PARAM.WD_SWITCH_ON_EOP at draw time 44167ec681f3Smrg- radv: adjust IA_MULTI_VGT_PARAM.PARTIAL_VS_WAVE at draw time 44177ec681f3Smrg- radv: compute prim_vertex_count at draw time 44187ec681f3Smrg- aco: fix more validation errors from vgpr spill/restore code 44197ec681f3Smrg- radv: return VK_ERROR_DEVICE_LOST if wait-for-idle failed or expired 44207ec681f3Smrg- radv: remove the secure compile support feature 44217ec681f3Smrg- radv: rework dynamic viewports/scissors support 44227ec681f3Smrg- radv: add VK_EXT_extended_dynamic_state but leave it disabled 44237ec681f3Smrg- radv: declare new extended dynamic states 44247ec681f3Smrg- radv: add support for dynamic cull mode and front face 44257ec681f3Smrg- radv: add support for dynamic primitive topology 44267ec681f3Smrg- radv: add support for dynamic and scissor count 44277ec681f3Smrg- radv: add support for dynamic depth/stencil states 44287ec681f3Smrg- radv: add support for dynamic vertex input binding stride 44297ec681f3Smrg- radv: advertise VK_EXT_extended_dynamic_state 44307ec681f3Smrg- radv: add the custom border color BO to the list of buffers 44317ec681f3Smrg- radv: destroy the base object if VkCreateQueryPool() failed 44327ec681f3Smrg- radv: destroy the base object if VkCreateRenderPass*() failed 44337ec681f3Smrg- radv: destroy the base object if VkCreateImage() failed 44347ec681f3Smrg- radv: destroy the base object if VkCreateBuffer() failed 44357ec681f3Smrg- radv: destroy the base object if VkCreateEvent() failed 44367ec681f3Smrg- radv: destroy the base object if VkCreateSemaphore() failed 44377ec681f3Smrg- radv: destroy the base object if VkCreateFence() failed 44387ec681f3Smrg- radv: destroy the base object if VkAllocateCommandBuffers() failed 44397ec681f3Smrg- radv: destroy the base object if VkCreateInstance() failed 44407ec681f3Smrg- radv/winsys: replace alloca() by malloc() everywhere 44417ec681f3Smrg- radv/winsys: pass the buffer list via the CS ioctl for less CPU overhead 44427ec681f3Smrg- radv: fix destroying the syncobj when exporting a fence FD 44437ec681f3Smrg- radv: fix the error code when exporting a semaphore/fence fails 44447ec681f3Smrg- radv: fix the error code when allocating a fresh imported syncobj fails 44457ec681f3Smrg- radv: optimize creating signaled syncobj with amdgpu_cs_create_syncobj2() 44467ec681f3Smrg- radv: split fence into two parts as enum+union. 44477ec681f3Smrg- radv: remove one useless goto in radv_queue_submit_deferred() 44487ec681f3Smrg- radv: improve the error messages when a CS submission failed 44497ec681f3Smrg- radv: return better Vulkan error codes when VkQueueSubmit() fails 44507ec681f3Smrg- radv: disable CPU caching for IBS to reduce fetch latency 44517ec681f3Smrg- radv/winsys: always allow GTT placements on APUs 44527ec681f3Smrg- radv: advertise VK_EXT_image_robustness 44537ec681f3Smrg- radv: do not perform read-modify-write with the upload BO 44547ec681f3Smrg- radv: disable CPU caching for the upload BO to reduce fetch latency 44557ec681f3Smrg- aco: add support for nir_intrinsic_shared_atomic_fadd 44567ec681f3Smrg- ac/nir: add support for nir_intrinsic_shared_atomic_fadd 44577ec681f3Smrg- radv: advertise VK_EXT_shader_atomic_float 44587ec681f3Smrg- radv: add missing return values check for some winsys calls 44597ec681f3Smrg- radv/winsys: check more allocation failures 44607ec681f3Smrg- radv/winsys: remove useless check when binding virtual buffers/images 44617ec681f3Smrg- radv/winsys: return a Vulkan error code when binding virtual buffers/images 44627ec681f3Smrg- radv/winsys: be more robust when a CS failed during recording 44637ec681f3Smrg- radv: remove declared but unused radv_pipeline::is_dual_src 44647ec681f3Smrg- radv: remove set but unused radv_pipeline::vertex_elements 44657ec681f3Smrg- radv: remove outdated TODO related to PA_SU_VTX_CNTL.PIX_CENTER 44667ec681f3Smrg- radv: emit more invariant registers as part of the initial gfx state 44677ec681f3Smrg- radv: emit PA_SC_LINE_CNTL as part of the rasterization state 44687ec681f3Smrg- radv: clean up VGT_SHADER_STAGES_EN emission 44697ec681f3Smrg- radv: clean up PA_SC_CLIPRECT_RULE emission 44707ec681f3Smrg- radv: reduce the number of allocated dwords for compute CS 44717ec681f3Smrg- radv: clean up radv_compute_generate_pm4() 44727ec681f3Smrg- radv: remove unnecessary radv_tessellation_state::num_patches 44737ec681f3Smrg- radv: remove no-op si_multiwave_lds_size_workaround() 44747ec681f3Smrg- radv: remove one unnecessary param to radv_generate_graphics_pipeline_key() 44757ec681f3Smrg- radv: align the LDS size in calculate_tess_lds_size() 44767ec681f3Smrg- radv: set LDS TCS size at shaders creation for GFX9+ 44777ec681f3Smrg- radv: remove unnecessary radv_tessellation_state::lds_size 44787ec681f3Smrg- radv: clean up tessellation state emission 44797ec681f3Smrg- radv: add radv_pipeline_init_input_assembly_state() 44807ec681f3Smrg- radv: add radv_pipeline_generate_vgt_gs_out() 44817ec681f3Smrg- radv: clean up adjusting MSAA state if conservative rast is enabled 44827ec681f3Smrg- radv: clean up binning state initialization 44837ec681f3Smrg- radv: assign pipeline gfx fields before PM4 emission 44847ec681f3Smrg- radv: constify all radv_pipeline_generate_*() helpers 44857ec681f3Smrg- radv: add radv_pipeline_init_shader_stages_state() 44867ec681f3Smrg- radv: remove useless return value to radv_pipeline_scratch_init() 44877ec681f3Smrg- radv: clean up remaining pipeline init functions 44887ec681f3Smrg- radv: print warnings for famous RADV_PERFTEST options that no longer exist 44897ec681f3Smrg- radv: do not honor a user-specified pitch on GFX 10.3 44907ec681f3Smrg- radv: increase minimum NGG vertex count requirement per workgroup on GFX 10.3 44917ec681f3Smrg- radv: fix sample shading on GFX 10.3 44927ec681f3Smrg- radv: set BYPASS_VTX_RATE_COMBINER_GFX103 on GFX 10.3 44937ec681f3Smrg- radv/gfx10: add missing initialization of registers 44947ec681f3Smrg- radv: limit LATE_ALLOC_GS to prevent a GPU hang on GFX10 44957ec681f3Smrg- radv: fix emitting the border color pointer on the compute queue 44967ec681f3Smrg- nir/algebraic: mark some optimizations with fsat(NaN) as inexact 44977ec681f3Smrg- aco: handle unaligned loads on GFX10.3 44987ec681f3Smrg- spirv: fix emitting switch cases that directly jump to the merge block 44997ec681f3Smrg- radv: fix transform feedback crashes if pCounterBufferOffsets is NULL 45007ec681f3Smrg 45017ec681f3SmrgSatyajit Sahu (1): 45027ec681f3Smrg 45037ec681f3Smrg- frontends/va: Handle dynamic resolution/SVC for VP9 45047ec681f3Smrg 45057ec681f3SmrgSatyeshwar Singh (1): 45067ec681f3Smrg 45077ec681f3Smrg- intel/dev: Don't consider all TGL SKUs as GT1 only 45087ec681f3Smrg 45097ec681f3SmrgSerge Martin (3): 45107ec681f3Smrg 45117ec681f3Smrg- amd/common: Fix incorrect use of asprintf instead of vasprintf 45127ec681f3Smrg- clover: add more cl_mem_object_type to pipe_texture_target mapping 45137ec681f3Smrg- clover: implements clEnqueueFillBuffer 45147ec681f3Smrg 45157ec681f3SmrgShawn Guo (1): 45167ec681f3Smrg 45177ec681f3Smrg- freedreno/a4xx: fix \*_NONE enum conversion 45187ec681f3Smrg 45197ec681f3SmrgSimon Ser (3): 45207ec681f3Smrg 45217ec681f3Smrg- EGL: sync headers with Khronos 45227ec681f3Smrg- gbm: document that gbm_bo_map exposes a linear view 45237ec681f3Smrg- radv: use bitshifts for debug enum values 45247ec681f3Smrg 45257ec681f3SmrgSureshGuttula (1): 45267ec681f3Smrg 45277ec681f3Smrg- radeon/vcn: Corrected vp9 ref associated data incase of target->codec is NULL 45287ec681f3Smrg 45297ec681f3SmrgTapani Pälli (14): 45307ec681f3Smrg 45317ec681f3Smrg- st/mesa: destroy only own program variants when program is released 45327ec681f3Smrg- anv: call base finish only if pass given in DestroyRenderPass 45337ec681f3Smrg- anv: add VK_EXT_extended_dynamic_state but leave it disabled 45347ec681f3Smrg- anv: add new dynamic states 45357ec681f3Smrg- anv: consider dynamic state when creating pipeline 45367ec681f3Smrg- anv: handle dynamic viewport count 45377ec681f3Smrg- anv: add support for dynamic cull mode and winding order 45387ec681f3Smrg- anv: add support for dynamic viewport and scissor with count 45397ec681f3Smrg- anv: add support for dynamic primitive topology change 45407ec681f3Smrg- anv: depth/stencil dynamic state support 45417ec681f3Smrg- anv: dynamic vertex input binding stride and size support 45427ec681f3Smrg- anv: toggle on VK_EXT_extended_dynamic_state 45437ec681f3Smrg- anv: add a check for depthStencilState before using it 45447ec681f3Smrg- anv: null check for buffer before reading size 45457ec681f3Smrg 45467ec681f3SmrgThong Thai (8): 45477ec681f3Smrg 45487ec681f3Smrg- radeon: Fix whitespaces 45497ec681f3Smrg- gallium/auxiliary/vl: Fix compute shader scaling for non-square pixels 45507ec681f3Smrg- gallium/auxiliary/vl: Fix compute shader scale_y for interlaced videos 45517ec681f3Smrg- frontends/va: Fix deinterlace bottom field first flag 45527ec681f3Smrg- frontends/vdpau: Default destination rect to source rect 45537ec681f3Smrg- radeon/vcn: add vcn 3.0 encode support 45547ec681f3Smrg- radeonsi: use PIPE_FORMAT_P010 for 10-bit VP9 decoding 45557ec681f3Smrg- radeon/vcn: increase render_pic_list size 45567ec681f3Smrg 45577ec681f3SmrgTimothy Arceri (69): 45587ec681f3Smrg 45597ec681f3Smrg- glsl: stop cascading errors if process_parameters() fails 45607ec681f3Smrg- glsl: fix slow linking of uniforms in the nir linker 45617ec681f3Smrg- radv: fix regression with builtin cache 45627ec681f3Smrg- nir: add glsl_get_ifc_packing() helper 45637ec681f3Smrg- nir: add callback to nir_remove_dead_variables() 45647ec681f3Smrg- glsl: add can_remove_uniform() helper to the NIR linker 45657ec681f3Smrg- glsl: remove dead uniforms in the nir linker 45667ec681f3Smrg- glsl/spirv: remove dead uniforms in spirv nir linker 45677ec681f3Smrg- gitlab-ci: bump piglit checkout commit 45687ec681f3Smrg- i965: call brw_nir_lower_uniforms() after uniform linking is complete 45697ec681f3Smrg- util: add BITSET_LAST_BIT() helper 45707ec681f3Smrg- glsl: add struct to gather more info about uniform array access 45717ec681f3Smrg- glsl: add update_array_sizes() helper to the NIR uniform linker 45727ec681f3Smrg- glsl: gather uniform dereference info before main linking loop 45737ec681f3Smrg- glsl: when NIR linker enable use it to resize uniform arrays 45747ec681f3Smrg- glsl: fix potential slow compile times for GLSLOptimizeConservatively 45757ec681f3Smrg- glsl: fix incorrect optimisation in opt_constant_variable() 45767ec681f3Smrg- glsl: fix uniform array resizing in the nir linker 45777ec681f3Smrg- glsl: small optimisation fix for uniform array resizing 45787ec681f3Smrg- st_glsl_to_nir: fix potential use after free 45797ec681f3Smrg- mesa: remove _mesa prefix from static function 45807ec681f3Smrg- mesa: add _mesa_program_state_value_size() helper 45817ec681f3Smrg- glsl: define gl_LightSource members in ARB_vertex_program order 45827ec681f3Smrg- st/glsl_to_nir: disable st_nir_lower_builtin() when packing supported 45837ec681f3Smrg- glsl: remove stale FIXME 45847ec681f3Smrg- i965: add and fix fallthrough comments 45857ec681f3Smrg- llvmpipe: add missing fallthrough comments 45867ec681f3Smrg- gallivm: add missing break 45877ec681f3Smrg- anv: update fallthrough comment so gcc sees it 45887ec681f3Smrg- intel/compiler: add and fix up fallthrough comments for gcc warnings 45897ec681f3Smrg- iris: add missing fallthrough comment 45907ec681f3Smrg- egl: move fallthrough comment so gcc can see it 45917ec681f3Smrg- nir: add missing break to nir_opt_access() 45927ec681f3Smrg- mesa: fix fallthrough in glformats 45937ec681f3Smrg- mesa: add fallthrough comments to glformats.c 45947ec681f3Smrg- mesa: add fallthrough comments to get.c 45957ec681f3Smrg- nir: fix implicit fallthrough warnings 45967ec681f3Smrg- mesa: add fallthrough comments to COPY_SZ_4V() 45977ec681f3Smrg- radeonsi: add missing fallthrough comment 45987ec681f3Smrg- glx: add missing fallthrough comment 45997ec681f3Smrg- glsl: move fallthrough comment to where gcc can see it 46007ec681f3Smrg- radeon: add missing fallthrough comments 46017ec681f3Smrg- spirv: add missing fallthrough comments 46027ec681f3Smrg- mesa/vbo: add some missing fallthrough comments 46037ec681f3Smrg- mesa: add missing fallthrough comment to teximage.c 46047ec681f3Smrg- mesa: fix unintended fallthrough in glIsEnabled() 46057ec681f3Smrg- r300: add and fix up fallthrough comments 46067ec681f3Smrg- svga: add missing fallthrough comments 46077ec681f3Smrg- mesa: update fallthrough comment so gcc can see it 46087ec681f3Smrg- nv30: add missing fallthrough comment 46097ec681f3Smrg- meson: turn on Wimplicit-fallthrough project wide 46107ec681f3Smrg- nouveau: fix pointer-sign warning 46117ec681f3Smrg- gitlab-ci: Enable -Werror in `meson-classic` job 46127ec681f3Smrg- r600/radeonsi: silence zero-length-bounds gcc warnings 46137ec681f3Smrg- radeonsi: fix SI_NUM_ATOMS 46147ec681f3Smrg- iris: fix maybe-uninitialized warning for initial_state variable 46157ec681f3Smrg- iris: silence maybe-uninitialized for stc_dst_aux_usage variable 46167ec681f3Smrg- nouveau/nvc0: silence maybe-uninitialized warning 46177ec681f3Smrg- panfrost: add some missing fallthrough comments 46187ec681f3Smrg- panfrost: hide more unused code in bi_lower_combine.c 46197ec681f3Smrg- panfrost: add some missing fallthrough comments to bi_pack.c 46207ec681f3Smrg- freedreno: fix missing fallthrough comments 46217ec681f3Smrg- v3d: remove redefine of VG(x) 46227ec681f3Smrg- zink: fix missing fallthrough comment 46237ec681f3Smrg- nine: remove unused var 46247ec681f3Smrg- etnaviv: add missing fallthrough comments 46257ec681f3Smrg- lima: add missing fallthrough comments 46267ec681f3Smrg- lima: add missing break 46277ec681f3Smrg- gitlab-ci: Enable -Werror in `meson-gallium` job 46287ec681f3Smrg 46297ec681f3SmrgTimur Kristóf (4): 46307ec681f3Smrg 46317ec681f3Smrg- aco/gfx10: Refactor of GFX10 wave64 bpermute. 46327ec681f3Smrg- aco: Implement subgroup shuffle on GFX6-7. 46337ec681f3Smrg- radv/aco: Always enable subgroup shuffle. 46347ec681f3Smrg- aco: Fix emit_boolean_exclusive_scan in wave32 mode. 46357ec681f3Smrg 46367ec681f3SmrgTomeu Vizoso (55): 46377ec681f3Smrg 46387ec681f3Smrg- panfrost: Emit blend descriptors on Bifrost 46397ec681f3Smrg- panfrost: Don't leak temporary descriptors array 46407ec681f3Smrg- pan/decode: Check for correct unknown field 46417ec681f3Smrg- pan/decode: Use correct printf modifier for long int 46427ec681f3Smrg- panfrost: Split bit out of format.unk3 46437ec681f3Smrg- panfrost: Create additional BO for the checksum of imported BOs (Bifrost) 46447ec681f3Smrg- panfrost: Add a bit more info about some tiler fields 46457ec681f3Smrg- pan/bi: Print shaders only if BIFROST_MESA_DEBUG=shaders 46467ec681f3Smrg- pan/decode: Trace to stderr with PANDECODE_DUMP_FILE=stderr 46477ec681f3Smrg- panfrost: GPUs newer than G-71 don't have swizzles... 46487ec681f3Smrg- panfrost: mali_attr_meta.unknown1 is zero on Bifrost 46497ec681f3Smrg- panfrost: Add Bifrost texture trampoline BO to batch 46507ec681f3Smrg- pan/decode: Properly print tripped zeroes 46517ec681f3Smrg- virgl: Properly check for encode_stride when encoding transfers 46527ec681f3Smrg- panfrost: Add checksum BOs to batch 46537ec681f3Smrg- panfrost: Don't trample on top of Bifrost-specific unions 46547ec681f3Smrg- panfrost: Handle MALI_RGB8_UNORM in panfrost_format_to_bifrost_blend 46557ec681f3Smrg- gitlab-ci: Run more dEQP tests for virgl 46567ec681f3Smrg- gitlab-ci: Add manual tests for Virgl using GLES on the host 46577ec681f3Smrg- gitlab-ci: Test virgl with Khronos' OpenGL CTS 46587ec681f3Smrg- gitlab-ci: Update CTS runner 46597ec681f3Smrg- ci: Don't call renderdoc's ReplayController.Shutdown() 46607ec681f3Smrg- ci: Move ARM rootfses to stable 46617ec681f3Smrg- gitlab-ci: Build kernel drivers for a few ethernet USB dongles 46627ec681f3Smrg- gitlab-ci: More stable URL for kernel and ramdisk artifacts, for LAVA 46637ec681f3Smrg- gitlab-ci: Remove left-behind rules: 46647ec681f3Smrg- gitlab-ci: Don't rebuild kernels and rootfs if they have been already built in mainline 46657ec681f3Smrg- gitlab-ci: Run all of GLES3 tests for Panfrost 46667ec681f3Smrg- gitlab-ci: Re-add kernels for bare-metal 46677ec681f3Smrg- gitlab-ci: Download traces from MinIO 46687ec681f3Smrg- gitlab-ci: Upload tracie artifacts to MinIO 46697ec681f3Smrg- gitlab-ci: Fix needs: of the arm64 LAVA test jobs 46707ec681f3Smrg- ci: Upload images of failed replays to MinIO 46717ec681f3Smrg- ci: Use smaller glxgears trace 46727ec681f3Smrg- ci: Prefix tracie artifacts with the device name 46737ec681f3Smrg- ci: Test with more traces 46747ec681f3Smrg- ci: Disable trace testing on Mali T760 46757ec681f3Smrg- ci: Fix the overwriting of traces.yml for baremetal 46767ec681f3Smrg- ci: Namespace trace artifacts to the job number 46777ec681f3Smrg- ci: Always print status code of HTTP uploads in tracie 46787ec681f3Smrg- ci: Print load stats after running dEQP 46797ec681f3Smrg- ci: Fix URL for glslang 46807ec681f3Smrg- ci: Don't ship vk-build-programs after building dEQP 46817ec681f3Smrg- ci: Split building of libdrm to its own script 46827ec681f3Smrg- ci: Build kernels and rootfs for x86 devices 46837ec681f3Smrg- ci: Upload reference images for traces 46847ec681f3Smrg- ci: Print URL to image diff when a trace replay fails 46857ec681f3Smrg- ci: Generate MinIO credentials within LAVA jobs 46867ec681f3Smrg- ci: Set date in LAVA DUTs from NTP servers 46877ec681f3Smrg- ci: Build-test Panfrost tools 46887ec681f3Smrg- ci: Upload traces' reference and actual images to MinIO 46897ec681f3Smrg- ci: Download traces from MinIO in baremetal runs 46907ec681f3Smrg- ci: Remove kernel module build that slipped in 46917ec681f3Smrg- ci: Actually upload trace artifacts to MinIO for baremetal 46927ec681f3Smrg- ci: Use a rootfs tarball for NFS root, instead of a ramdisk (for LAVA) 46937ec681f3Smrg 46947ec681f3SmrgTony Wasserka (4): 46957ec681f3Smrg 46967ec681f3Smrg- nir/lower_idiv: Port recent LLVM fixes to emit_udiv 46977ec681f3Smrg- radv: Fix various non-critical integer overflows 46987ec681f3Smrg- aco: Fix integer overflows when emitting parallel copies during RA 46997ec681f3Smrg- amd/common: Fix various non-critical integer overflows 47007ec681f3Smrg 47017ec681f3SmrgVinson Lee (25): 47027ec681f3Smrg 47037ec681f3Smrg- freedreno: Add missing break statement. 47047ec681f3Smrg- llvmpipe: Fix variable name. 47057ec681f3Smrg- r600/sfn: Initialize VertexStageExportForGS m_num_clip_dist member variable. 47067ec681f3Smrg- panfrost: Ensure final.no_colour is initialized. 47077ec681f3Smrg- r600/sfn: Use correct setter method. 47087ec681f3Smrg- freedreno: Add missing va_end. 47097ec681f3Smrg- pan/bi: Initialize struct fma_op_info member extended. 47107ec681f3Smrg- zink: Check fopen result. 47117ec681f3Smrg- etnaviv: Fix memory leak on error path. 47127ec681f3Smrg- panfrost: Fix printf format specifier. 47137ec681f3Smrg- r300g: Remove extra printf format specifiers. 47147ec681f3Smrg- vdpau: Fix wrong calloc sizeof argument. 47157ec681f3Smrg- mesa: Fix NetBSD compiler macro. 47167ec681f3Smrg- Switch from cElementTree to ElementTree. 47177ec681f3Smrg- intel/genxml: Migrate from deprecated xml.etree.ElementTree getchildren. 47187ec681f3Smrg- rbug: Fix rbug_delete_vs_state lock acquisition. 47197ec681f3Smrg- nir: Add nir_lower_clip_disable.c to SCons build. 47207ec681f3Smrg- util: Fix SCons build. 47217ec681f3Smrg- util: Fix memory leaks in unit test. 47227ec681f3Smrg- meson: Fix lmsensors warning message. 47237ec681f3Smrg- vulkan: Fix memory leaks. 47247ec681f3Smrg- freedreno: Fix file descriptor leak. 47257ec681f3Smrg- svga: Fix unused printf argument. 47267ec681f3Smrg- freedreno: Check file descriptor before write. 47277ec681f3Smrg- panfrost: Delete debug allocated syncobj. 47287ec681f3Smrg 47297ec681f3SmrgYevhenii Kharchenko (1): 47307ec681f3Smrg 47317ec681f3Smrg- st/mesa: fix corrupted texture levels, when adding more levels than expected 47327ec681f3Smrg 47337ec681f3SmrgYevhenii Kolesnikov (5): 47347ec681f3Smrg 47357ec681f3Smrg- glsl: subroutine signatures must match exactly 47367ec681f3Smrg- nvir: don't use designated initialisers in C++ code 47377ec681f3Smrg- intel/compiler: don't propagate cmp to add if add is saturated 47387ec681f3Smrg- mesa: change error code of \*TextureSubImage\* for incorreect target 47397ec681f3Smrg- nine: fix incorrect calculation of layer count for 3D textures 47407ec681f3Smrg 47417ec681f3Smrgjzielins (2): 47427ec681f3Smrg 47437ec681f3Smrg- gallium/swr: Fix compilation warnings 47447ec681f3Smrg- swr: Bump maximum 2D texture size to 16kx16k 47457ec681f3Smrg 47467ec681f3Smrgmmenzyns (1): 47477ec681f3Smrg 47487ec681f3Smrg- nv50: Clear nv50_ir_prog_info of dead and codegen specific variables 4749