Handle cases where a forked pointer has an add or sub instruction
before reaching a select.
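A minimal IR sketch of the pattern now handled (names and types are
illustrative, not taken from the actual tests):

  %sel = select i1 %cmp, i64 %i, i64 %j
  %idx = add i64 %sel, 1
  %gep = getelementptr inbounds float, ptr %base, i64 %idx
  %val = load float, ptr %gep, align 4

Walking back from the GEP, the add is now looked through so the select
underneath it can still be recognised as the fork.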
Reviewed By: fhahn
Reviewed By: paulwalker-arm
Differential Revision: https://reviews.llvm.org/D130278
As extending from or truncating to a mask vector does not use the same
instructions as a normal cast, this patch changes the cost on that path to 2,
which is the number of instructions actually used.
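For example (illustrative types), a truncation to a mask vector such as

  %m = trunc <vscale x 4 x i32> %v to <vscale x 4 x i1>

is typically materialised as an and-with-1 followed by a compare-with-zero
rather than a single cast instruction, hence the cost of 2.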
Differential Revision: https://reviews.llvm.org/D131552
This intrinsic used a typed pointer for a call target operand. This
change updates the operand to be an opaque pointer and updates all
pointers in all test files that use the intrinsic.
Differential Revision: https://reviews.llvm.org/D131261
The cost of converting from or to a mask vector is different from the other cases, so it cannot be calculated with PowDiff. This patch sets it to 3, since three instructions are used to perform the conversion.
Differential Revision: https://reviews.llvm.org/D131149
This patch is the first of a two-patch series (D130188, D130179) that
resolves PR56275 (https://github.com/llvm/llvm-project/issues/56275),
a missed opportunity where a perfectly valid candidate for loop
interchange failed the interchange legality checks.
If the distance/direction vector produced by dependence analysis (DA) is
negative, it needs to be normalized (reversed). This patch provides helper
functions `isDirectionNegative()` and `normalize()` in DA that do the
normalization, so clients can query DA and normalize the result when needed.
A pass option `<normalized-results>` is added to DependenceAnalysisPrinterPass,
and we use it to update the DA test cases to ensure test coverage. The
test cases added in `Banerjee.ll` show that negative vectors are normalized
when run with `print<da><normalized-results>`.
Reviewed By: bmahjour, Meinersbur, #loopoptwg
Differential Revision: https://reviews.llvm.org/D130188
2xi64 is the legalized type for wide reductions (like 16xi64) and setting the
cost to 2 makes `load-reduce` and `load-zext-reduce` patterns profitable.
The few performance measurements I did on an AArch64 machine confirm that
these patterns are actually faster when vectorized.
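For reference, the load-zext-reduce pattern in question looks roughly like
this (illustrative types):

  %wide = load <16 x i8>, ptr %src, align 16
  %ext  = zext <16 x i8> %wide to <16 x i64>
  %red  = call i64 @llvm.vector.reduce.add.v16i64(<16 x i64> %ext)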
Differential Revision: https://reviews.llvm.org/D130740
getModRefInfo() queries currently track whether the result is a
MustAlias on a best-effort basis. The only user of this functionality
is the optimized memory access type in MemorySSA -- which in turn
has no users. Given that this functionality has not found a user
since it was introduced five years ago (in D38862), I think we
should drop it again.
The context is that I'm working to separate FunctionModRefBehavior
to track mod/ref for different location kinds (like argmem or
inaccessiblemem) separately, and the fact that ModRefInfo also has
an unrelated Must flag makes this quite awkward, especially as this
means that NoModRef is not a zero value. If we want to retain the
functionality, I would probably split getModRefInfo() results into
a part that just contains the ModRef information, and a separate
part containing a (best-effort) AliasResult.
Differential Revision: https://reviews.llvm.org/D130713
As the test in PR56672 shows, LAA produces different results, leading to either
positive or negative vectorization decisions, depending on the order of blocks
in the loop. The exact reason for this is not clear to me, but it makes the
investigation of related bugs extremely complex.
The current order of blocks in the loop is arbitrary. It may change, for example,
if loop info analysis is dropped and recomputed, and this seems to interfere
with LAA's logic. This patch fixes the traversal order of blocks in loops to RPOT.
Note: this is *not* a fix for the bug with the incorrect analysis result. It just
makes the answer more robust, to make the investigation easier.
Differential Revision: https://reviews.llvm.org/D130482
Reviewed By: aeubanks, fhahn
As my goal is to remove at least _some_ functions from the static list
in MemoryBuiltins.cpp, these tests either need to run inferattrs or
statically declare these attributes to keep passing. A couple of tests
had alternate cases which are no longer meaningful, e.g.
`malloc-load-removal.ll`.
Differential Revision: https://reviews.llvm.org/D123087
This patch adds a command-line flag to be able to test
the type-based cost-model analysis for intrinsics.
Differential Revision: https://reviews.llvm.org/D129109
Adds new tests for add and sub instructions before reaching a select.
Also adds tests using different bit widths for memory, including
non-power-of-two integers.
There is a problem in loop cache analysis that the types of SCEV variables
`Coeff` and `ElemSize` in function `isConsecutive()` may not match. The
mismatch would cause SCEV failures when `Coeff` is multiplied with `ElemSize`.
The fix in this patch is to extend both `Coeff` and `ElemSize` to whichever of
the two types is wider. As a clean-up, the duplicated calculation of `Stride`
in `computeRefCost()` is also removed.
Reviewed By: Meinersbur, #loopoptwg
Differential Revision: https://reviews.llvm.org/D128877
This builds on the previous forked pointers patch, which only accepted
a single select as the pointer to check. A recursive function to walk
through IR has been added, which searches for either a loop-invariant
or addrec SCEV.
This will only handle a single fork at present, so selects of selects
or a GEP with a select for both the base and offset will be rejected.
There is also a recursion limit, with a command-line option to change it.
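As a rough illustration (hypothetical IR, not taken from the tests), a single
fork like

  %ptr = select i1 %cmp, ptr %a, ptr %b
  %gep = getelementptr inbounds float, ptr %ptr, i64 %iv

is handled, whereas a GEP whose base and offset are both forked, e.g.

  %off  = select i1 %cmp2, i64 %i, i64 %j
  %gep2 = getelementptr inbounds float, ptr %ptr, i64 %off

is still rejected.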
Reviewed By: fhahn, david-arm
Differential Revision: https://reviews.llvm.org/D108699
This is a follow-up to D107082, which enabled vector support according to the psABI.
Reviewed By: skan
Differential Revision: https://reviews.llvm.org/D127982
Following some recent discussions, this changes the representation
of callbrs in IR. The current blockaddress arguments are replaced
with `!` label constraints that refer directly to callbr indirect
destinations:
; Before:
%res = callbr i8* asm "", "=r,r,i"(i8* %x, i8* blockaddress(@test8, %foo))
to label %asm.fallthrough [label %foo]
; After:
%res = callbr i8* asm "", "=r,r,!i"(i8* %x)
to label %asm.fallthrough [label %foo]
The benefit of this is that we can easily update the successors of
a callbr, without having to worry about also updating blockaddress
references. This should allow us to remove some limitations:
* Allow unrolling/peeling/rotation of callbr, or any other
clone-based optimizations
(https://github.com/llvm/llvm-project/issues/41834)
* Allow duplicate successors
(https://github.com/llvm/llvm-project/issues/45248)
This is just the IR representation change though; I will follow up
with patches to remove limitations in various transformation passes
that are no longer needed.
Differential Revision: https://reviews.llvm.org/D129288
* Converted tests to use opaque pointers
* Added suggested test for inbounds GEP
* Added a test for forks on both the base and offset terms of a GEP
* Added a test for a select of a select
* Added a test for a GEP with >2 operands
* Added a test for vector GEPs
Pre-commit the test cases (for D128302) to show that more accurate cost
estimation of extract-element could generate better code.
Differential Revision: https://reviews.llvm.org/D128945
These three subtarget features are meant to control whether MVE
instructions take 1 vs 2 vs 4 architectural beats. The mve1beat feature
is described as "Model MVE instructions as a 1 beat per tick
architecture", meaning an MVE instruction will execute over 4 cycles.
mve4beat is the opposite, where the entire 4 beats of an MVE instruction
execute in a single cycle. The costs for the two were backwards though,
not matching the cycle counts as they should. This patch swaps the
costs of the two to bring them in line with expectations.
Differential Revision: https://reviews.llvm.org/D129141
BasicAA will already call getModRefBehavior() on the Function of
the CallBase if there are no operand bundles. This happens through
getBestAAResults(), i.e. it is a recursive call that will query
other AA providers, not just the BasicAA implementation.
As such, there is no need to reimplement the same functionality
in GlobalsModRef; a combination of BasicAA and GlobalsModRef already
handles it. This does mean that this no longer works under
-disable-basic-aa, but that's a testing-only option.
This removes creation of udiv/sdiv/urem/srem constant expressions,
in preparation for their removal. I've added a
ConstantExpr::isDesirableBinOp() predicate to determine whether
an expression should be created for a certain operator.
With this patch, div/rem expressions can still be created through
explicit IR/bitcode; forbidding them entirely will be the next step.
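For example, a hypothetical module like

  @p = external global i32
  @g = global i64 sdiv (i64 ptrtoint (ptr @p to i64), i64 8)

still parses after this patch; rejecting such expressions outright is left to
the follow-up.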
Differential Revision: https://reviews.llvm.org/D128820
This extends the existing cost model for reductions to scalable vectors.
The existing cost model assumes that reductions are roughly logarithmic in cost for unordered variants and linear for ordered ones. This change keeps that same basic model, and extends it out to the maximum number of elements a scalable vector could possibly have.
This results in costs which aren't terribly high for unordered reductions, but are for ordered ones. This seems about right; we want to strongly bias away from using scalable ordered reductions if the cost might be linear in VL.
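As a rough illustration (types chosen arbitrarily), an in-order reduction such as

  %r = call float @llvm.vector.reduce.fadd.nxv4f32(float %acc, <vscale x 4 x float> %v)

has to be costed as if it chains up to (vscale_max * 4) scalar fadds, whereas
the unordered (reassociable) form only needs a logarithmic number of pairwise steps.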
Differential Revision: https://reviews.llvm.org/D127447
These intrinsics are now fundamental for SVE code generation and have been
present for a year and a half, so move them out of the experimental
namespace.
Differential Revision: https://reviews.llvm.org/D127976
Use poison instead of undef for SCEVUnknown of unreachable values.
This should be in line with the movement to replace undef with poison
when possible.
Suggested in D114650.
Reviewed By: nikic
Differential Revision: https://reviews.llvm.org/D128586
Support for the legacy pass manager in ArgPromotion causes
complications in D125485. As the legacy pass manager for middle-end
optimizations is unsupported, drop ArgPromotion from the legacy
pipeline, rather than introducing additional complexity to deal
with it.
Differential Revision: https://reviews.llvm.org/D128536
By using getPrimitiveSizeInBits, we were getting 0 for every pointer type. This code is trying to account for the cost of truncating a store or extending a load to convert from the source vector element type to the legal vector element type.
I'd originally seen this as a crash when trying to scalarize a <vscale x 1 x ptr> type coming from the vectorizer. Here's a minimal reproducer to exercise the code in question.
void e(int *argv[], int *p) {
  for (int i = 0; i < 1024; i++)
    argv[i] = p;
}
This was checked in as the splat_ptr test in 2cf320d. After bbf3fd, this no longer crashes since we correctly return invalid if the extending load/truncating store isn't legal.
Differential Revision: https://reviews.llvm.org/D128228
Patch was reverted in 4c5f10a due to buildbot failures, now being
reapplied with updated AArch64 and RISCV tests.
This patch adds handling for the llvm.powi.* intrinsics in
BasicTTIImplBase::getIntrinsicInstrCost() and improves vectorization.
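For reference (illustrative types), this covers calls such as

  %r = call float @llvm.powi.f32.i32(float %x, i32 7)

so that they get a dedicated cost instead of the generic fallback.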
Closes #53887.
Differential Revision: https://reviews.llvm.org/D128172
If the target has chosen to expand a scalable vector type, BasicTTI tries to scalarize and we'd crash. As a minimum, we should return an invalid cost instead.
The added tests provide coverage for the moment, but given that they show a number of gaps in RISC-V costing, they're likely not to cover this code path long term.