Cross Reference: /src/sys/crypto/aes/arch/arm/aes

History log of /src/sys/crypto/aes/arch/arm/aes_neon.c
Revision	Date	Author	Comments
1.6	21-Nov-2020	rin	Fix build with clang for earmv7hf; loadroundkey() is used only for __aarch64__.
1.5	08-Aug-2020	riastradh	branches: 1.5.2; Fix ARM NEON implementations of AES and ChaCha on big-endian ARM. New macros such as VQ_N_U32(a,b,c,d) for NEON vector initializers. Needed because GCC and Clang disagree on the ordering of lanes, depending on whether it's 64-bit big-endian, 32-bit big-endian, or little-endian -- and, bizarrely, both of them disagree with the architectural numbering of lanes. Experimented with using static const uint8_t x8[16] = {...}; uint8x16_t x = vld1q_u8(x8); which doesn't require knowing anything about the ordering of lanes, but this generates considerably worse code and apparently confuses GCC into not recognizing the constant value of x8. Fix some clang mistakes while here too.
1.4	28-Jul-2020	riastradh	Draft 2x vectorized neon vpaes for aarch64. Gives a modest speed boost on rk3399 (Cortex-A53/A72), around 20% in cgd tests, for parallelizable operations like CBC decryption; same improvement should probably carry over to rpi4 CPU which lacks ARMv8.0-AES.
1.3	30-Jun-2020	riastradh	New test sys/crypto/aes/t_aes. Runs aes_selftest on all kernel AES implementations supported on the current hardware, not just the preferred one.
1.2	29-Jun-2020	riastradh	Provide hand-written AES NEON assembly for arm32. gcc does a lousy job at compiling 128-bit NEON intrinsics on arm32; hand-writing it made it about 12x faster, by avoiding a zillion loads and stores to spill everything and the kitchen sink onto the stack. (But gcc does fine on aarch64, presumably because it has twice as many registers and doesn't have to deal with q2=d4/d5 overlapping.)
1.1	29-Jun-2020	riastradh	New permutation-based AES implementation using ARM NEON. Also derived from Mike Hamburg's public-domain vpaes code.
1.5.2.1	14-Dec-2020	thorpej	Sync w/ HEAD.

OpenGrok