llvm-project

Commit Graph

Author	SHA1	Message	Date
Sam Elliott	8a53a7375a	[RISCV][NFC] Regenerate Calling Convention Tests This regenerates these tests using utils/update_llc_test_checks.py so that future changes in this area don't have the noise of lots of `@plt` lines being added. I also removed the `nounwind`s from the stack-realignment.ll test to increase coverage on the generated call frame information.	2021-01-14 22:35:17 +00:00
Craig Topper	b894a9fb23	[RISCV] Optimize select_cc after fp compare expansion Some FP compares expand to a sequence ending with (xor X, 1) to invert the result. If the consumer is a select_cc we can likely get rid of this xor by fixing up the select_cc condition. This patch combines (select_cc (xor X, 1), 0, setne, trueV, falseV) - (select_cc X, 0, seteq, trueV, falseV) if we can prove X is 0/1. Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D94546	2021-01-14 13:41:40 -08:00
Sam Elliott	7c9c2a2ea5	Revert "[RISCV] Legalize select when Zbt extension available" We found issues with this patch in additional testing. Backing out while we work on a fix. This reverts commit `71ed4b6ce5`.	2021-01-14 16:44:34 +00:00
Craig Topper	dfc1901d51	[RISCV] Custom lower ISD::VSCALE. This patch custom lowers ISD::VSCALE into a csrr vlenb followed by a shift right by 3 followed by a multiply by the scale amount. I've added computeKnownBits support to indicate that the csrr vlenb always produces 3 trailng bits of 0s so the shift right is "exact". This allows the shift and multiply sequence to be nicely optimized into a single shift or removed completely when the scale amount is a power of 2. The non power of 2 case multiplying by 24 is still producing suboptimal code. We could remove the right shift and use a multiply by 3. Hopefully we can improve DAG combine to fix that since it's not unique to this sequence. This replaces D94144. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D94249	2021-01-13 17:14:49 -08:00
Hsiangkai Wang	350c0552c6	[NFC][RISCV] Add double type in RISC-V V CodeGen test cases for RV32. Differential Revision: https://reviews.llvm.org/D94584	2021-01-13 23:45:13 +08:00
Craig Topper	1730b0f66a	[RISCV] Remove '.mask' from vcompress intrinsic name. NFC It has a mask argument, but isn't a masked instruction. It doesn't use the mask policy of or the v0.t syntax.	2021-01-12 14:46:16 -08:00
Michael Munday	71ed4b6ce5	[RISCV] Legalize select when Zbt extension available The custom expansion of select operations in the RISC-V backend interferes with the matching of cmov instructions. Legalizing select when the Zbt extension is available solves that problem. Reviewed By: lenary, craig.topper Differential Revision: https://reviews.llvm.org/D93767	2021-01-12 21:24:38 +00:00
Craig Topper	7583ae48a3	[RISCV] Add double test cases to vfmerge-rv32.ll. NFC	2021-01-12 13:09:48 -08:00
Craig Topper	a14040bd4d	[RISCV] Use vmerge.vim for llvm.riscv.vfmerge with a 0.0 scalar operand. We can use a 0 immediate to avoid needing to materialize 0 into an FPR first. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D94459	2021-01-12 11:08:26 -08:00
Craig Topper	03c8d6a0c4	[LegalizeDAG][RISCV][PowerPC][AMDGPU][WebAssembly] Improve expansion of SETONE/SETUEQ on targets without SETO/SETUO. If SETO/SETUO aren't legal, they'll be expanded and we'll end up with 3 comparisons. SETONE is equivalent to (SETOGT \|\| SETOLT) so if one of those operations is supported use that expansion. We don't need both since we can commute the operands to make the other. SETUEQ can be implemented with !(SETOGT \|\| SETOLT) or (SETULE && SETUGE). I've only implemented the first because it didn't look like most of the affected targets had legal SETULE/SETUGE. Reviewed By: frasercrmck, tlively, nemanjai Differential Revision: https://reviews.llvm.org/D94450	2021-01-12 10:45:03 -08:00
Fraser Cormack	09db958e37	[RISCV] Improve scalable-vector shift tests (NFC) All i8/i16 and several i32 tests were testing immediate shift amounts which exceeded the bits in the vector elements, creating poison values. Amend the tests to test well-behaved shift amounts.	2021-01-12 11:40:21 +00:00
Evandro Menezes	7470017f24	[RISCV] Define the vfclass RVV intrinsics Define the `vfclass` IR intrinsics for the respective V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Evandro Menezes <evandro.menezes@sifive.com> Differential Revision: https://reviews.llvm.org/D94356	2021-01-11 17:40:09 -06:00
Craig Topper	278a3ea1b2	[RISCV] Use vmv.v.i vd, 0 instead of vmv.v.x vd, x0 for llvm.riscv.vfmv.v.f with 0.0 This matches what we use for integer 0. It's also consistent with the scalar 'mv' pseudo that uses addi rather than add with x0.	2021-01-11 15:08:05 -08:00
Fraser Cormack	9ecc991c55	[RISCV] Add scalable vector vselect ISel patterns Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94294	2021-01-11 22:41:34 +00:00
Fraser Cormack	7989684a2e	[RISCV] Add scalable vector fadd/fsub/fmul/fdiv ISel patterns Original patch by @rogfer01. This patch adds ISel patterns for the above operations to the corresponding vector/vector and vector/scalar RVV instructions, as well as extra patterns to match operand-swapped scalar/vector vfrsub and vfrdiv. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Fraser Cormack <fraser@codeplay.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94408	2021-01-11 21:19:48 +00:00
Fraser Cormack	37b41bd087	[RISCV] Add scalable vector fcmp ISel patterns Original patch by @rogfer01. All ordered comparisons except ONE are supported natively, and all unordered comparisons except UNE are expanded into sequences involving explicit NaN checks and mask arithmetic. Additionally, we expand GT,OGT,GE,OGE to their swapped-operand versions, and pattern-match those back to the "original", swapping operands once more. This way we catch both operations and both "vf" and "fv" forms with fewer patterns. Also add support for floating-point splat_vector, with an optimization for splatting fpimm0. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Fraser Cormack <fraser@codeplay.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94242	2021-01-11 19:38:56 +00:00
Fraser Cormack	b02eab9058	[RISCV] Add scalable vector icmp ISel patterns Original patch by @rogfer01. The RVV integer comparison instructions are defined in such a way that many LLVM operations are defined by using the "opposite" comparison instruction and swapping the operands. This is done in this patch in most cases, except for the mappings where the immediate range must be adjusted to accomodate: va < i --> vmsle{u}.vi vd, va, i-1, vm va >= i --> vmsgt{u}.vi vd, va, i-1, vm That is left for future optimization; this patch supports all operations but in the case of the missing mappings the immediate will be moved to a scalar register first. Since there are so many condition codes and operand cases to check, it was decided to reduce the test burden by only testing the "vscale x 8" vector types. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Fraser Cormack <fraser@codeplay.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94168	2021-01-09 20:54:34 +00:00
Fraser Cormack	41d06095b0	[SelectionDAG] Teach isConstOrConstSplat about ISD::SPLAT_VECTOR This improves llvm::isConstOrConstSplat by allowing it to analyze ISD::SPLAT_VECTOR nodes, in order to allow more constant-folding of operations using scalable vector types. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94168	2021-01-09 20:54:34 +00:00
Roger Ferrer Ibanez	524d8fa9a5	[RISCV] Do not grow the stack a second time when we need to realign the stack This is a first change needed to fix a crash in which the emergency spill splot ends being out of reach. This happens when we run the register scavenger after we have eliminated the frame indexes. The fix for the actual crash will come in a later change. This change removes an extra stack size increase we do in RISCVFrameLowering::determineFrameLayout. We don't have to change the size of the stack here as PEI::calculateFrameObjectOffsets is already doing this with the right size accounting the extra alignment. Differential Revision: https://reviews.llvm.org/D89237	2021-01-09 16:51:09 +00:00
Fraser Cormack	2c442629f0	[RISCV] Add tests for scalable constant-folding (NFC)	2021-01-09 11:31:22 +00:00
Ben Shi	55f0a1b066	[RISCV] Optimize multiplication with constant 1. Break MUL with specific constant to a SLLI and an ADD/SUB on riscv32 with the M extension. 2. Break MUL with specific constant to two SLLI and an ADD/SUB, if the constant needs a pair of LUI/ADDI to construct. Reviewed by: craig.topper Differential Revision: https://reviews.llvm.org/D93619	2021-01-09 10:37:21 +08:00
Evandro Menezes	946bc50e4c	[RISCV] Define the vfsqrt RVV intrinsics Define the `vfsqrt` IR intrinsics for the respective V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Evandro Menezes <evandro.menezes@sifive.com> Differential Revision: https://reviews.llvm.org/D93745	2021-01-07 17:29:29 -06:00
Fraser Cormack	c9154e8fa3	[RISCV] Add vector mask arithmetic ISel patterns The patterns that want to use 'vnot' use a custom PatFrag. This is because 'vnot' uses immAllOnesV which implicitly uses BUILD_VECTOR rather than SPLAT_VECTOR. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94078	2021-01-07 09:43:25 +00:00
Ben Shi	351a45ca73	[RISCV][NFC] Add new test cases for mul	2021-01-06 18:55:56 +08:00
Fraser Cormack	e130dea92a	[RISCV] Add vector integer mul/mulh/div/rem ISel patterns There is no test coverage for the mulhs or mulhu patterns as I can't get the DAGCombiner to generate them for scalable vectors. There are a few places in that still need updating for that to work. I left the patterns in regardless as they are correct. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94073	2021-01-06 09:24:07 +00:00
Craig Topper	249d7de119	[RISCV] Don't print zext.b alias. This alias for andi x, 255 was recently added to the spec. If we print it, code we output can't be compiled with -fno-integrated-as unless the GNU assembler is also a version that supports alias. Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D93826	2021-01-05 10:41:08 -08:00
Craig Topper	c707716c04	[RISCV] Match vmslt(u).vx intrinsics with a small immediate to vmsle(u).vx. There are vmsle(u).vx and vmsle(u).vi instructions, but there is only vmslt(u).vx and no vmslt(u).vi. vmslt(u).vi can be emulated for some immediates by decrementing the immediate and using vmsle(u).vi. To avoid the user needing to know about this, this patch does this conversion. The assembler does the same thing for vmslt(u).vi and vmsge(u).vi pseudoinstructions. There is no vmsge(u).vx intrinsic or instruction so this patch is limited to vmslt(u). Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D94070	2021-01-05 10:20:21 -08:00
Fraser Cormack	1d4411e9ea	[RISCV] Add vector integer min/max ISel patterns Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94012	2021-01-05 09:15:50 +00:00
Craig Topper	dc9ac0e820	[RISCV] Replace i32 with XLenVT in (add AddrFI, simm12) isel patterns. With the i32 these patterns will only fire on RV32, but they don't look RV32 specific. Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D93843	2021-01-04 10:53:27 -08:00
Michael Munday	e2d3d501ef	[RISCV][NFC] Add additional cmov tests One or more cmov instructions could be generated for these functions when the Zbt extension is present. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93768	2021-01-04 16:01:40 +00:00
Monk Chiang	1d04cbeb43	[RISCV] Define vector single-width type-convert intrinsic. Define intrinsics: 1. vfcvt.xu.f.v/vfcvt.x.f.v 2. vfcvt.rtz.xu.f.v/vfcvt.rtz.x.f.v 3. vfcvt.f.xu.v/vfcvt.f.x.v We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Monk Chiang <monk.chiang@sifive.com> Differential Revision: https://reviews.llvm.org/D93933	2020-12-31 11:49:30 +08:00
Monk Chiang	2aed9bc98a	[RISCV] Define vector narrowing type-convert intrinsic. Define intrinsics: 1. vfncvt.xu.f.w/vfncvt.x.f.w 2. vfncvt.rtz.xu.f.w/vfncvt.rtz.x.f.w 3. vfncvt.f.xu.w/vfncvt.f.x.w 4. vfncvt.f.f.w/vfncvt.rod.f.f.w We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Monk Chiang <monk.chiang@sifive.com> Differential Revision: https://reviews.llvm.org/D93932	2020-12-31 11:48:28 +08:00
Monk Chiang	fdd30faae5	[RISCV] Define vector widening type-convert intrinsic. Define intrinsics: 1. vfwcvt.xu.f.v/vfwcvt.x.f.v 2. vfwcvt.rtz.xu.f.v/vfwcvt.rtz.x.f.v 3. vfwcvt.f.xu.v/vfwcvt.f.x.v 4. vfwcvt.f.f.v We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Monk Chiang <monk.chiang@sifive.com> Differential Revision: https://reviews.llvm.org/D93855	2020-12-31 11:48:09 +08:00
Monk Chiang	ecc38eac76	Add intrinsic testcase for some missing widening reduction. Add vfredosum/vfredsum/vwredsum/vwredsumu testcase. Differential Revision: https://reviews.llvm.org/D93887	2020-12-31 11:15:15 +08:00
Fangrui Song	7e5508e6a8	[RISCV][test] Add explicit dso_local to definitions in ELF static relocation model tests	2020-12-30 15:28:11 -08:00
Craig Topper	253dc16f9e	[RISCV] Cleanup some V intrinsic names used in tests to match the type overloads used. Add some missing double tests on rv32. NFC The matching for intrinsic names is forgiving about types in the name being absent or wrong. Once the intrinsic is parsed its name will remangled to include the real types. This commit fixes the names to have at least enough correct types so that the name used in the test is a prefix of the canonical name. The big missing part is the type for the VL parameter which changes size between rv32 and rv64. While I was in here I noticed that we were missing some tests for double on rv32 so I fixed that by copying from rv64 and fixing up the VL argument type.	2020-12-30 12:37:11 -08:00
ShihPo Hung	096b02ebbf	[RISCV] Add intrinsics for vcompress instruction This patch defines vcompress intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential revision: https://reviews.llvm.org/D93809	2020-12-29 18:38:15 -08:00
Zakk Chen	6da0033624	[RISCV] Define vsext/vzext intrinsics. Define vsext/vzext intrinsics.and lower to V instructions. Define new fraction register class fields in LMULInfo and a NoReg to present invalid LMUL register classes. Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93893	2020-12-29 16:50:53 -08:00
Fraser Cormack	f7f09e2b1c	[RISCV] Fill out basic integer RVV ISel patterns This complements the existing RVV ISel patterns for arithmetic, bitwise and shifts with the remaining operations in those categories: sub, and, xor, sra. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93852	2020-12-29 19:32:18 +00:00
Craig Topper	79cbb003c5	[RISCV] Don't use tail agnostic policy on instructions where destination is tied to source If the destination is tied, then user has some control of the register used for input. They would have the ability to control the value of any tail elements. By using tail agnostic we take this option away from them. Its not clear that the intrinsics are defined such that this isn't supposed to work. And undisturbed is a valid implementation for agnostic so code wouldn't even fail to work on all systems if we always used agnostic. The vcompress intrinsic is defined to require tail undisturbed so at minimum we need this for that instruction or need to redefine the intrinsic. I've made an exception here for vmv.s.x/fmv.s.f and reduction instructions which only write to element 0 regardless of the tail policy. This allows us to keep the agnostic policy on those which should allow better redundant vsetvli removal. An enhancement would be to check for undef input and keep the agnostic policy, but we don't have good test coverage for that yet. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D93878	2020-12-29 10:37:58 -08:00
Craig Topper	2ae760e27e	[RISCV] Add earlyclobber of destination register to vmsbf.m/vmsif.m/vmsof.m instructions The spec for these instructions include this note. "The destination register cannot overlap either the source register or the mask register ('v0') if the instruction is masked." So we need earlyclobber to enforce this constraint. I've regenerated the tests with update_llc_test_checks.py to show the effects of the earlyclobber. Reviewed By: khchen, frasercrmck Differential Revision: https://reviews.llvm.org/D93867	2020-12-29 10:00:04 -08:00
Zakk Chen	f3f9ce3b79	[RISCV] Define vmclr.m/vmset.m intrinsics. Define vmclr.m/vmset.m intrinsics and lower to vmxor.mm/vmxnor.mm. Ideally all rvv pseudo instructions could be implemented in C header, but those two instructions don't take an input, codegen can not guarantee that the source register becomes the same as the destination. We expand pseduo-v-inst into corresponding v-inst in RISCVExpandPseudoInsts pass. Reviewed By: craig.topper, frasercrmck Differential Revision: https://reviews.llvm.org/D93849	2020-12-28 18:57:17 -08:00
Fraser Cormack	cf8f682c2d	[RISCV] Adjust tested vor ops for more stable tests. NFC.	2020-12-28 19:33:25 +00:00
Zakk Chen	e673d40199	[RISCV] Define vmsbf.m/vmsif.m/vmsof.m/viota.m/vid.v intrinsics. Define those intrinsics and lower to V instructions. Use update_llc_test_checks.py for viota.m tests to check earlyclobber is applied correctly. mask viota.m tests uses the same argument as input and mask for avoid dependency of D93364. We work with @rogfer01 from BSC to come out this patch. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D93823	2020-12-28 05:54:18 -08:00
Monk Chiang	622ea9cf74	[RISCV] Define vector widening reduction intrinsic. Define vwredsumu/vwredsum/vfwredosum/vfwredsum We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D93807	2020-12-26 21:42:30 +08:00
Zakk Chen	da4a637e99	[RISCV] Define vpopc/vfirst intrinsics. Define vpopc/vfirst intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93795	2020-12-24 19:44:34 -08:00
Zakk Chen	351c216f36	[RISCV] Define vector mask-register logical intrinsics. Define vector mask-register logical intrinsics and lower them to V instructions. Also define pseudo instructions vmmv.m and vmnot.m. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D93705	2020-12-24 18:59:05 -08:00
ShihPo Hung	912740a864	[RISCV] Add intrinsics for vrgather instruction This patch defines vrgather intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential revision: https://reviews.llvm.org/D93797	2020-12-24 18:16:02 -08:00
Monk Chiang	afd03cd335	[RISCV] Define vector single-width reduction intrinsic. integer group: vredsum/vredmaxu/vredmax/vredminu/vredmin/vredand/vredor/vredxor float group: vfredosum/vfredsum/vfredmax/vfredmin We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D93746	2020-12-25 09:56:01 +08:00
Fraser Cormack	1a7ac29a89	[RISCV] Add ISel support for RVV vector/scalar forms This patch extends the SDNode ISel support for RVV from only the vector/vector instructions to include the vector/scalar and vector/immediate forms. It uses splat_vector to carry the scalar in each case, except when XLEN<SEW (RV32 SEW=64) when a custom node `SPLAT_VECTOR_I64` is used for type-legalization and to encode the fact that the value is sign-extended to SEW. When the scalar is a full 64-bit value we use a sequence to materialize the constant into the vector register. The non-intrinsic ISel patterns have also been split into their own file. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Fraser Cormack <fraser@codeplay.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93312	2020-12-23 20:16:18 +00:00

1 2 3 4 5 ...

480 Commits