llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	660728acd4	[InstSimplify] ctlz({signbit} >>u x) --> x The motivating pattern was handled in `0a2d69480d` , but we should have this for symmetry. But this really highlights that we could generalize for any shifted constant if we match this in instcombine. https://alive2.llvm.org/ce/z/MrmVNt	2021-03-15 12:03:35 -04:00
Sanjay Patel	3c93852a78	[InstSimplify] add tests for ctlz of shifted constant; NFC	2021-03-15 12:03:35 -04:00
Sanjay Patel	0a2d69480d	[InstSimplify] cttz(1<<x) --> x https://alive2.llvm.org/ce/z/TDacYu https://alive2.llvm.org/ce/z/KF84S3	2021-03-08 16:30:14 -05:00
Sanjay Patel	afa443831b	[InstSimplify] add tests for cttz of shifted-1; NFC	2021-03-08 16:30:13 -05:00
Nikita Popov	d042f2db5b	[InstSimplify] Fold call null/undef to poison Calling null or undef results in immediate undefined behavior. Return poison instead of undef in this case, similar to what we do for immediate UB due to division by zero.	2021-01-06 21:09:30 +01:00
Nikita Popov	f094d65bea	[InstSimplify] Fix addo/subo with undef (PR43188) We can't fold the first result to undef, because not all values may be reachable under the constraint that no overflow occurred. Use the same folds we do for saturated math instead. Proofs: uaddo: https://alive2.llvm.org/ce/z/zf55N_ saddo: https://alive2.llvm.org/ce/z/a_xPgS usubo: https://alive2.llvm.org/ce/z/DmRqwt ssubo: https://alive2.llvm.org/ce/z/8ag7U-	2021-01-03 18:51:49 +01:00
Sanjay Patel	236c4524a7	[InstSimplify] remove ctpop of 1 (low) bit https://llvm.org/PR48608 As noted in the test comment, we could handle a more general case in instcombine and remove this, but I don't have evidence that we need to do that. https://alive2.llvm.org/ce/z/MRW9gD	2020-12-28 16:06:20 -05:00
Sanjay Patel	1351f719d4	[InstSimplify] add tests for ctpop; NFC (PR48608)	2020-12-28 16:06:19 -05:00
Juneyoung Lee	864dda5fd5	[InstSimplify] Add tests that fold instructions with poison operands (NFC)	2020-12-02 01:01:59 +09:00
Nikita Popov	7397a019b8	[InstSimplify] Extract abs intrinsic tests into separate file (NFC) Also move some tests from InstCombine to InstSimplify, as they are already handled by InstSimplify.	2020-08-12 22:28:46 +02:00
Craig Topper	85b5315dbe	[InstSimplify] Fold abs(abs(x)) -> abs(x) It's always safe to pick the earlier abs regardless of the nsw flag. We'll just lose it if it is on the outer abs but not the inner abs. Differential Revision: https://reviews.llvm.org/D85053	2020-08-01 13:25:00 -07:00
Sanjay Patel	04b99a4d18	[InstSimplify] simplify abs if operand is known non-negative abs() should be rare enough that using value tracking is not going to be a compile-time cost burden, so use it to reduce a variety of potential patterns. We do this in DAGCombiner too. Differential Revision: https://reviews.llvm.org/D85043	2020-08-01 07:47:06 -04:00
Sanjay Patel	1aa52d67d1	[InstSimplify] add abs test with assume; NFC	2020-08-01 07:47:06 -04:00
Sanjay Patel	77a02527dc	[InstSimplify] add tests for abs intrinsic; NFC	2020-07-31 18:49:13 -04:00
Sanjay Patel	e6c016420c	[ValueTracking] fix library to intrinsic mapping to respect 'nobuiltin' attribute This is another problem raised in: http://bugs.llvm.org/PR46627	2020-07-14 10:04:24 -04:00
Sanjay Patel	9300de4d1c	[InstSimplify] add test with nobuiltin attribute (PR46627); NFC	2020-07-14 10:04:24 -04:00
Nikita Popov	5a2265647e	Reapply [InstSimplify] Remove known bits constant folding No changes relative to last time, but after a mitigation for an AMDGPU regression landed. --- If SimplifyInstruction() does not succeed in simplifying the instruction, it will compute the known bits of the instruction in the hope that all bits are known and the instruction can be folded to a constant. I have removed a similar optimization from InstCombine in D75801, and would like to drop this one as well. On average, we spend ~1% of total compile-time performing this known bits calculation. However, if we introduce some additional statistics for known bits computations and how many of them succeed in simplifying the instruction we get (on test-suite): instsimplify.NumKnownBits: 216 instsimplify.NumKnownBitsComputed: 13828375 valuetracking.NumKnownBitsComputed: 45860806 Out of ~14M known bits calculations (accounting for approximately one third of all known bits calculations), only 0.0015% succeed in producing a constant. Those cases where we do succeed to compute all known bits will get folded by other passes like InstCombine later. On test-suite, only lencod.test and GCC-C-execute-pr44858.test show a hash difference after this change. On lencod we see an improvement (a loop phi is optimized away), on the GCC torture test a regression (a function return value is determined only after IPSCCP, preventing propagation from a noinline function.) There are various regressions in InstSimplify tests. However, all of these cases are already handled by InstCombine, and corresponding tests have already been added there. Differential Revision: https://reviews.llvm.org/D79294	2020-05-08 10:24:53 +02:00
Nikita Popov	46ee652c70	Revert "[InstSimplify] Remove known bits constant folding" This reverts commit `08556afc54`. This breaks some AMDGPU tests.	2020-05-03 20:45:10 +02:00
Nikita Popov	08556afc54	[InstSimplify] Remove known bits constant folding If SimplifyInstruction() does not succeed in simplifying the instruction, it will compute the known bits of the instruction in the hope that all bits are known and the instruction can be folded to a constant. I have removed a similar optimization from InstCombine in D75801, and would like to drop this one as well. On average, we spend ~1% of total compile-time performing this known bits calculation. However, if we introduce some additional statistics for known bits computations and how many of them succeed in simplifying the instruction we get (on test-suite): instsimplify.NumKnownBits: 216 instsimplify.NumKnownBitsComputed: 13828375 valuetracking.NumKnownBitsComputed: 45860806 Out of ~14M known bits calculations (accounting for approximately one third of all known bits calculations), only 0.0015% succeed in producing a constant. Those cases where we do succeed to compute all known bits will get folded by other passes like InstCombine later. On test-suite, only lencod.test and GCC-C-execute-pr44858.test show a hash difference after this change. On lencod we see an improvement (a loop phi is optimized away), on the GCC torture test a regression (a function return value is determined only after IPSCCP, preventing propagation from a noinline function.) There are various regressions in InstSimplify tests. However, all of these cases are already handled by InstCombine, and corresponding tests have already been added there. Differential Revision: https://reviews.llvm.org/D79294	2020-05-03 20:26:58 +02:00
Nikita Popov	0372768776	[InstCombine] Simplify calls with "returned" attribute If a call argument has the "returned" attribute, we can simplify the call to the value of that argument. This was already partially handled by InstSimplify/InstCombine for the case where the argument is an integer constant, and the result is thus known via known bits. The non-constant (or non-int) argument cases weren't handled though. This previously landed as an InstSimplify transform, but was reverted due to assertion failures when compiling the Linux kernel. The reason is that simplifying a call to another call breaks assumptions in call graph updating during inlining. As the code is not easy to fix, and there is no particularly strong motivation for having this in InstSimplify, the transform is only performed in InstCombine instead. Differential Revision: https://reviews.llvm.org/D75815	2020-03-20 10:23:39 +01:00
Nico Weber	623cb95eb3	Revert "[InstSimplify] Simplify calls with "returned" attribute" This reverts commit `45555c3819`. Causes clang crashes in some causes, see comments on https://reviews.llvm.org/D75815 for details (including repro steps).	2020-03-16 15:21:30 -04:00
Nikita Popov	45555c3819	[InstSimplify] Simplify calls with "returned" attribute If a call argument has the "returned" attribute, we can simplify the call to the value of that argument. The "-inst-simplify" pass already handled this for the constant integer argument case via known bits, which is invoked in SimplifyInstruction. However, non-constant (or non-int) arguments are not handled at all right now. This addresses one of the regressions from D75801. Differential Revision: https://reviews.llvm.org/D75815	2020-03-09 18:53:47 +01:00
Nikita Popov	829d377a98	[InstSimplify] Don't simplify musttail calls As pointed out by jdoerfert on D75815, we must be careful when simplifying musttail calls: We can only replace the return value if we can eliminate the call entirely. As we can't make this guarantee for all consumers of InstSimplify, this patch disables simplification of musttail calls. Without this patch, musttail simplification currently results in module verification errors. Differential Revision: https://reviews.llvm.org/D75824	2020-03-09 18:46:56 +01:00
Nikita Popov	d2dab92f01	[InstSimplify] Add tests for "returned" attribute; NFC	2020-03-07 17:17:21 +01:00
Sanjay Patel	1c4dd3ae2f	[InstSimplify] fold copysign with negated operand, part 2 This is another transform suggested in PR44153: https://bugs.llvm.org/show_bug.cgi?id=44153 Unlike rG12f39e0fede9, it doesn't look like the backend matches this variant.	2019-12-08 10:16:29 -05:00
Sanjay Patel	12f39e0fed	[InstSimplify] fold copysign with negated operand This is another transform suggested in PR44153: https://bugs.llvm.org/show_bug.cgi?id=44153 The backend for some targets already manages to get this if it converts copysign to bitwise logic.	2019-12-08 10:08:02 -05:00
Sanjay Patel	d5abaaf140	[InstSimplify] add tests for copysign with fneg operand; NFC	2019-12-06 16:23:44 -05:00
Sanjay Patel	e177c5a00d	[InstSimplify] fold copysign with same args to the arg This is correct for any value including NaN/inf. We don't have this fold directly in the backend either, but x86 manages to get it after converting things to bitops.	2019-11-26 17:35:10 -05:00
Sanjay Patel	48a3a1e090	[InstSimplify] add tests for copysign; NFC	2019-11-26 17:23:30 -05:00
Sanjay Patel	be21ceb565	[InstSimplify] fold fma/fmuladd with a NaN or undef operand This is intended to be similar to the constant folding results from D67446 and earlier, but not all operands are constant in these tests, so the responsibility for folding is left to InstSimplify. Differential Revision: https://reviews.llvm.org/D67721 llvm-svn: 373455	2019-10-02 12:12:02 +00:00
Sanjay Patel	1b40402aa2	[InstSimplify] add tests for fma/fmuladd with undef operand; NFC llvm-svn: 373109	2019-09-27 18:38:51 +00:00
Sanjay Patel	e406a3f2d6	[InstSimplify] add tests for fma/fmuladd; NFC llvm-svn: 372236	2019-09-18 17:27:02 +00:00
Roman Lebedev	5a663bd77a	[InstSimplify] Fix addo/subo undef folds (PR42209) Fix folds of addo and subo with an undef operand to be: `@llvm.{u,s}{add,sub}.with.overflow` all fold to `{ undef, false }`, as per LLVM undef rules. Same for commuted variants. Based on the original version of the patch by @nikic. Fixes [[ https://bugs.llvm.org/show_bug.cgi?id=42209 \| PR42209 ]] Differential Revision: https://reviews.llvm.org/D63065 llvm-svn: 363522	2019-06-16 20:39:45 +00:00
Eric Christopher	cee313d288	Revert "Temporarily Revert "Add basic loop fusion pass."" The reversion apparently deleted the test/Transforms directory. Will be re-reverting again. llvm-svn: 358552	2019-04-17 04:52:47 +00:00
Eric Christopher	a863435128	Temporarily Revert "Add basic loop fusion pass." As it's causing some bot failures (and per request from kbarton). This reverts commit r358543/ab70da07286e618016e78247e4a24fcb84077fda. llvm-svn: 358546	2019-04-17 02:12:23 +00:00
Sanjay Patel	9dada83d6c	[InstSimplify] remove zero-shift-guard fold for general funnel shift As discussed on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2019-February/130491.html We can't remove the compare+select in the general case because we are treating funnel shift like a standard instruction (as opposed to a special instruction like select/phi). That means that if one of the operands of the funnel shift is poison, the result is poison regardless of whether we know that the operand is actually unused based on the instruction's particular semantics. The motivating case for this transform is the more specific rotate op (rather than funnel shift), and we are preserving the fold for that case because there is no chance of introducing extra poison when there is no anonymous extra operand to the funnel shift. llvm-svn: 354905	2019-02-26 18:26:56 +00:00
Sanjay Patel	421c6e6864	[InstSimplify] add tests for rotate; NFC Rotate is a special-case of funnel shift that has different poison constraints than the general case. That's not visible yet in the existing tests, but it needs to be corrected. llvm-svn: 354894	2019-02-26 16:44:08 +00:00
Nikita Popov	f17421e595	[ConstantFolding] Consolidate and extend bitcount intrinsic tests; NFC Move constant folding tests into ConstantFolding/bitcount.ll and drop various tests in other places. Add coverage for undefs. llvm-svn: 349806	2018-12-20 19:46:52 +00:00
Sanjay Patel	14ab9170b8	[InstSimplify] fold funnel shifts with undef operands Splitting these off from the D54666. Patch by: nikic (Nikita Popov) llvm-svn: 347332	2018-11-20 17:34:59 +00:00
Sanjay Patel	2778f56a40	[InstSimplify] add tests for funnel shift with undef operands; NFC These are part of D54666, so adding them here before the patch to show the baseline (currently unoptimized) results. Patch by: @nikic (Nikita Popov) llvm-svn: 347331	2018-11-20 17:30:09 +00:00
Sanjay Patel	e98ec77a95	[InstSimplify] delete shift-of-zero guard ops around funnel shifts This is a problem seen in common rotate idioms as noted in: https://bugs.llvm.org/show_bug.cgi?id=34924 Note that we are not canonicalizing standard IR (shifts and logic) to the intrinsics yet. (Although I've written this before...) I think this is the last step before we enable that transform. Ie, we could regress code by doing that transform without this simplification in place. In PR34924, I questioned whether this is a valid transform for target-independent IR, but I convinced myself this is ok. If we're speculating a funnel shift by turning cmp+br into select, then SimplifyCFG has already determined that the transform is justified. It's possible that SimplifyCFG is not taking into account profile or other metadata, but if that's true, then it's a bug independent of funnel shifts. Also, we do have CGP code to restore a guard like this around an intrinsic if it can't be lowered cheaply. But that isn't necessary for funnel shift because the default expansion in SelectionDAGBuilder includes this same cmp+select. Differential Revision: https://reviews.llvm.org/D54552 llvm-svn: 346960	2018-11-15 14:53:37 +00:00
Sanjay Patel	4832ffee39	[InstSimplify] add more tests for funnel shift with select; NFC The cases are just different enough that we should have complete tests to avoid bugs from typos in the code. llvm-svn: 346902	2018-11-14 22:34:25 +00:00
Sanjay Patel	7d028670f6	[InstSimplify] add tests for funnel shift with select; NFC llvm-svn: 346881	2018-11-14 19:12:54 +00:00
Sanjay Patel	54421ce918	[InstSimplify] fold funnel shifts with 0-shift amount llvm-svn: 338218	2018-07-29 16:36:38 +00:00
Sanjay Patel	46af5835af	[InstSimplify] add tests for funnel shift intrinsics; NFC llvm-svn: 338217	2018-07-29 16:27:17 +00:00
Matt Arsenault	3ced3d90c3	InstSimplify: canonicalize is idempotent llvm-svn: 312685	2017-09-07 01:21:43 +00:00
Craig Topper	fe9ad82e44	[ConstantFolding] Properly support constant folding of vector powi intrinsic. The second argument is not a vector so needs special treatment. llvm-svn: 304679	2017-06-04 07:30:28 +00:00
Craig Topper	97f113e795	[InstSimplify] Add test case demonstrating that we fail to constant fold vector llvm.powi intrinsics due to the second argument not being a vector. llvm-svn: 304678	2017-06-04 07:30:23 +00:00
Craig Topper	7c553edced	[ConstantFolding] Fix constant folding for vector cttz and ctlz intrinsics to understand that the second argument is still a scalar. llvm-svn: 304668	2017-06-03 18:50:29 +00:00
Craig Topper	36fa2f0dee	[InstCombine][InstSimplify] Add various tests for ctlz/cttz with vectors, some showing missed optimizations. NFC llvm-svn: 304667	2017-06-03 18:50:26 +00:00

1 2

66 Commits