forked from OSchip/llvm-project
![]() This moves v32i16/v64i8 to a model consistent with how we treat integer types with avx1. This does change the ABI for types vXi16/vXi8 vectors larger than 512 bits to pass in multiple zmms instead of multiple ymms. We'd already hacked some code to make v64i8/v32i16 pass in zmm. Cost model is still a bit of a mess. In some place I tried to match existing behavior. But really we need to account for splitting and concating costs. Cost model for shuffles is especially pessimistic. Differential Revision: https://reviews.llvm.org/D76212 |
||
---|---|---|
.. | ||
AArch64 | ||
AMDGPU | ||
ARM | ||
PowerPC | ||
RISCV | ||
SystemZ | ||
X86 | ||
no_info.ll |