llvm-project

Commit Graph

Author	SHA1	Message	Date
Johannes Doerfert	6555558a80	Revert "[Attributor] Replace AAValueSimplify with AAPotentialValues" This reverts commit `da50dab1ae`. Patch broke AMD GPU OpenMP offload buildbots. https://lab.llvm.org/buildbot/#/builders/193/builds/13246	2022-06-09 17:04:01 +02:00
Johannes Doerfert	da50dab1ae	[Attributor] Replace AAValueSimplify with AAPotentialValues For the longest time we used `AAValueSimplify` and `genericValueTraversal` to determine "potential values". This was problematic for many reasons: - We recomputed the result a lot as there was no caching for the 9 locations calling `genericValueTraversal`. - We added the idea of "intra" vs. "inter" procedural simplification only as an afterthought. `genericValueTraversal` did offer an option but `AAValueSimplify` did not. Thus, we might end up with "too much" simplification in certain situations and then gave up on it. - Because `genericValueTraversal` was not a real `AA` we ended up with problems like the infinite recursion bug (#54981) as well as code duplication. This patch introduces `AAPotentialValues` and replaces the `AAValueSimplify` uses with it. `genericValueTraversal` is folded into `AAPotentialValues` as are the instruction simplifications performed in `AAValueSimplify` before. We further distinguish "intra" and "inter" procedural simplification now. `AAValueSimplify` was not deleted as we haven't ported the re-materialization of instructions yet. There are other differences over the former handling, e.g., we may not fold trivially foldable instructions right now, e.g., `add i32 1, 1` is not folded to `i32 2` but if an operand would be simplified to `i32 1` we would fold it still. We are also even more aware of function/SCC boundaries in CGSCC passes, which is good. Fixes: https://github.com/llvm/llvm-project/issues/54981	2022-06-09 16:48:53 +02:00
Simon Moll	b8c2781ff6	[NFC] format InstructionSimplify & lowerCaseFunctionNames Clang-format InstructionSimplify and convert all "FunctionName"s to "functionName". This patch does touch a lot of files but gets done with the cleanup of InstructionSimplify in one commit. This is the alternative to the less invasive clang-format only patch: D126783 Reviewed By: spatel, rengolin Differential Revision: https://reviews.llvm.org/D126889	2022-06-09 16:10:08 +02:00
Johannes Doerfert	982053e85e	[Attributor][NFC] Improve debug code and comments	2022-06-09 13:41:23 +02:00
Johannes Doerfert	0ece283f03	[Attributor] Add checks needed as we strengthen value simplify	2022-06-09 13:41:23 +02:00
Johannes Doerfert	393be12b74	[Attributor] Look at base values for align, nonnull, and deref Stripping bitcasts and 0-geps helps normalization and minimizes the impact of a follow up change.	2022-06-09 13:41:23 +02:00
Johannes Doerfert	14899bc43d	[Attributor] Generalize interface from ConstantInt to Constant We can use constant to allow undef and there is no need to force integers in the API anyway. The user can decide if a non integer constant is fine or not.	2022-06-09 12:00:26 +02:00
Johannes Doerfert	7a07b88f37	[Attributor][FIX] Replace call site argument uses, not values We need to be careful when replacing values, as a call site argument (IRPosition::IRP_CALL_SITE_ARGUMENT) represents a use and not a value. This patch changes the interface to take an IR position instead, making it harder to misuse accidentally. It does not change our tests right now, but a follow-up exposed the potential footgun.	2022-06-09 12:00:26 +02:00
Johannes Doerfert	1df6e171c3	[Attributor] Simplify (integer range) state handling We used to be very conservative when integer states were merged. Instead of adding the known range (which is large due to uncertainty) into the assumed range (which is hopefully small), we can instead only allow merging both ranges at the same time, each into its respective counterpart. This will ensure we keep the invariant that assumed is part of known.	2022-06-09 12:00:26 +02:00
Johannes Doerfert	481b8f31df	[Attributor][NFC] Introduce helper struct We often use a context associated with a value. For now only one use case has been changed.	2022-06-09 12:00:26 +02:00
Johannes Doerfert	4277c1be88	[Attributor][FIX] Avoid metadata and duplicate replication assertion When we recreate instructions as part of simplification we need to take care of debug metadata and replacing the value multiple times. For now, we handle both conservatively.	2022-06-09 12:00:26 +02:00
Aaron Ballman	86cdb2929c	Silence a "not all control paths return a value" warning; NFC	2022-04-18 08:54:08 -04:00
Johannes Doerfert	e87f10a771	[Attributor] CGSCC pass should not recompute results outside the SCC (reapply) When we run the CGSCC pass we should only invest time on the SCC. We can initialize AAs with information from the module slice but we should not update those AAs. We make an exception for call sites of the SCC, as they are helpful in providing information for the SCC. Minor modifications to pointer privatization allow us to perform it even in the CGSCC pass, similar to ArgumentPromotion.	2022-04-17 12:48:49 -05:00
Johannes Doerfert	04f3a224bc	[Attributor][NFC] Introduce a flag to distinguish the scope of a query	2022-04-15 14:56:10 -05:00
Johannes Doerfert	bd72acf4d8	[Attributor][NFC] Code cleanup to minimize follow up changes	2022-04-15 14:56:09 -05:00
Johannes Doerfert	2d8e7834b0	[Attributor][NFC] Rename AAPotentialValues to AAPotentialConstantValues	2022-04-15 14:56:09 -05:00
serge-sans-paille	fa5a4e1b95	[iwyu] Handle regressions in libLLVM header include Running iwyu-diff on LLVM codebase since `a96638e50e` detected a few regressions, fixing them.	2022-04-13 20:53:19 +02:00
Johannes Doerfert	af30de7788	[Attributor] Introduce AAInstanceInfo The Attributor, like many other parts of LLVM, uses pointer equivalence for `llvm::Value`s. This only works as long as `llvm::Value`s are dynamically unique, or, to be exact, we will never end up with the same `llvm::Value` representing two dynamic instances. We already provided a helper to check the former, namely `AA::isDynamicallyUnique`, however we could not check the latter. In this patch we move the logic into a separate AA which helps with the growing complexity and use cases. We also extend the interface to answer the second question rather than the first. So we do not determine dynamic uniqueness but whether we might end up with the `llvm::Value` describing a different dynamic instance. Note that the latter is very much tied to the Attributor's capabilities to look through memory, recursion, etc., so we need to update the logic as we go.	2022-04-05 23:07:13 -05:00
Johannes Doerfert	c42aa1be74	[Attributor] Keep loads feeding in `llvm.assume` if stores stays If a load is only used by an `llvm.assume` and the stores feeding into the load are not removable, keep the load.	2022-04-05 23:07:12 -05:00
Johannes Doerfert	857bf306d7	[Attributor] Remove broken and duplicated load simplification We look through loads in the "generic value traversal" and we consequently don't need to look through them again in AAValueSimplify*. The test changes stem from the fact that we allowed any simplified value, incl. non-dynamically unique ones, as long as the underlying memory was an alloca. This doesn't seem to make sense as allocas do not protect against dynamically non-unique values. We need to make the unique check better rather than excluding allocas. That in mind, we can remove a lot of code by simply relying on the generic value traversal load look through. To soften the blow some minor adjustments have been made that allow more simplification through the now used scheme and some tests have been given a `norecurse` for now.	2022-04-05 20:49:03 -05:00
Johannes Doerfert	a8610d7523	[Attributor] Move recursion reasoning into `AA::isPotentiallyReachable` With D106397 we ensured that `AAReachability` will not answer queries for potentially recursive functions. This was necessary as we did not treat recursion explicitly otherwise. Now that we have `AA::isPotentiallyReachable` we can make `AAReachability` a purely intra-procedural AA which does not care about recursion. `AA::isPotentiallyReachable`, however, does already deal with "going back" the call graph and can now do so for potentially recursive functions.	2022-04-05 20:49:03 -05:00
Johannes Doerfert	3e8c4366e2	[Attributor] Visit droppable uses in AAIsDead If we ignore droppable users everything only used in llvm.assume (among other things) is going to be deleted as dead. This is not helpful. Instead we want to only delete things we actually don't need anymore. A follow up will deal with loads in a smarter way.	2022-04-05 18:20:45 -05:00
Johannes Doerfert	79962df386	[Attributor] Allow to reproduce instructions for simplification When we simplify values we might end up with an instruction from a different scope or just one that does not dominate the use. If the instruction can be reproduced without side-effect (incl. UB) we can now do that. For now this is mostly used for speculatable (intrinsic) calls but as we learn to make things like arguments or loads available this will become more powerful. This will also allow us to remove dead stores more easily in a follow up.	2022-04-04 12:28:08 -05:00
Augie Fackler	603ae73146	AttributorAttributes: guard against TLI being nullptr I didn't dig into this very much because it appears to be totally valid (especially once these properties can come from attributes instead of only from hard-coded library functions) for TLI to not be defined, and nothing broke when I added this check, including with all my other patches applied. Differential Revision: https://reviews.llvm.org/D122917	2022-04-03 23:19:23 -04:00
Johannes Doerfert	7df2eba7fa	[Attributor][OpenMP] Add assumption for non-call assembly instructions Inline assembly is scary but we need to support it for the OpenMP GPU device runtime. The new assumption expresses the fact that it may not have call semantics, that is, it will not call another function but simply perform an operation or side-effect. This is important for reachability in the presence of inline assembly. Differential Revision: https://reviews.llvm.org/D109986	2022-03-28 20:57:52 -05:00
Johannes Doerfert	a81fff8afd	Reapply "[Intrinsics] Add `nocallback` to the default intrinsic attributes" This reverts commit `c5f789050d` and reapplies `7aea3ea8c3` with additional test changes.	2022-03-25 09:36:50 -05:00
Johannes Doerfert	c5f789050d	Revert "[Intrinsics] Add `nocallback` to the default intrinsic attributes" This reverts commit `7aea3ea8c3` as it breaks the buildbots. I didn't see these failures in the pre-merge checks, looking into it.	2022-03-24 14:04:41 -05:00
Johannes Doerfert	7aea3ea8c3	[Intrinsics] Add `nocallback` to the default intrinsic attributes Most intrinsics, especially "default" ones, will not call back into the IR module. `nocallback` encodes this nicely. As it was not used before, this patch also makes use of `nocallback` in the Attributor which results in many more `norecurse` deductions. Tablegen part is mechanical, test updates by script. Differential Revision: https://reviews.llvm.org/D118680	2022-03-24 13:50:54 -05:00
Johannes Doerfert	ee94a4a3d0	[Attributor][FIX] Avoid endless recursion, simple case There is potential for endless recursion if we try to determine the underlying objects of a load, just to end up with the load as underlying object. A proper solution will require us to pass a visited set around. This will happen as we cleanup genericValueTraversal soon.	2022-03-23 15:55:32 -05:00
serge-sans-paille	f1985a3f85	Cleanup includes: Transforms/IPO Preprocessor output diff: -238205 lines Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D122183	2022-03-22 10:06:28 +01:00
Johannes Doerfert	4308fdf83b	[Attributor] Remove more non-deterministic behavior and debug output	2022-03-17 17:42:32 -05:00
Johannes Doerfert	88ea86c369	[Attributor][FIX] Remove reference into map that might dangle The reference was taken and the map was modified after. This can (and did) lead to dangling pointers and all sorts of problems afterwards.	2022-03-17 17:42:32 -05:00
Johannes Doerfert	85daf6973d	[Attributor] Remove capture tracker usage and follow uses explicitly Before we used the capture tracker to follow pointer uses, now we do it explicitly ourselves through the Attributor API. There are multiple benefits: For one, the boilerplate is cut down by a lot. The class, potential copies vector, etc. are no longer needed. We also avoid explicitly looking through memory here, something that was duplicated and should only live in the `checkForAllUses` helper. More importantly, as we do simplifications we need to make sure all parties are in sync when they reason about uses. The old way did not allow us to do this but the new one does, as every use-visiting AA goes through `checkForAllUses` now.	2022-03-11 22:56:16 -06:00
Johannes Doerfert	f44f60a297	[Attributor] Avoid replacing return operands twice As replacements will become more complex it is better to have a single AA responsible for replacing a use. Before this patch AAValueSimplify* and AAValueSimplifyReturned could both try to replace the returned value. The latter was marginally better for the old pass manager when a function was already carrying a `returned` attribute and when the context of the return instruction was important. The second shortcoming was resolved by looking for return attributes in the AAValueSimplifyCallSiteReturned initialization. The old PM impact is not concerning. This is yet another step towards the removal of AAReturnedValues, the very first AA we should now try to eliminate due to the overlapping logic with value simplification.	2022-03-11 21:55:19 -06:00
Johannes Doerfert	f3ad8cf00e	[Attributor] Cleanup manifest and liveness for CGSCC passes There was some ad-hoc handling of liveness and manifest to avoid breaking CGSCC guarantees. Things always slipped through though. This cleanup will: 1) Prevent us from manifesting any "information" outside the CGSCC. This might be too conservative but we need to opt in to annotation, not try to avoid some problematic ones. 2) Avoid running any liveness analysis outside the CGSCC. We did have some AAIsDeadFunction handling to this end but we need this for all AAIsDead classes. The reason is that AAIsDead information is only correct if we actually manifest it; since we don't (see point 1), we cannot actually derive/use it at all. We are currently trying to avoid running any AA updates outside the CGSCC but that seems to impact things quite a bit. 3) Assert, don't check, that our modifications (during cleanup) modify only CGSCC functions.	2022-03-11 16:46:02 -06:00
Johannes Doerfert	9ddb1a49ac	[Attributor][FIX] Avoid double free (and useless state copy) In an attempt to remove the memory leak we introduced a double free. The problem was that we allowed a plain copy of the state and it was actually used. The use was useless, so it is gone now. The copy constructor is gone as well. The move constructor ensures the Accesses pointers are owned by a single state, I hope. Reported by: https://lab.llvm.org/buildbot/#/builders/16/builds/25820	2022-03-11 10:10:36 -06:00
Johannes Doerfert	3570b0c5c7	[Attributor][FIX] Remove memory leak The leak was introduced when we made things deterministic. It was reported by the sanitizer buildbot: https://lab.llvm.org/buildbot/#/builders/168	2022-03-11 09:52:44 -06:00
Johannes Doerfert	e8fadafe77	[Attributor][NFCI] Make AAPointerInfo deterministic The order in which we kept accesses was non-deterministic and a debug output was a pointer value. Fixed both.	2022-03-10 23:27:47 -06:00
Johannes Doerfert	7211dbd01d	[Attributor][NFCI] Remove non-deterministic behavior and debug output	2022-03-10 23:27:47 -06:00
Nikita Popov	f682a8386b	[Attributor] Use byval type instead of pointer element type For compatibility with opaque pointers, use the byval type rather than the pointer element type. Differential Review: https://reviews.llvm.org/D120983	2022-03-09 09:30:42 +01:00
Nikita Popov	0636c93d3e	[Attributor] Remove restriction on simplifying function pointers Dropping this restriction seems to work fine (there are no assertion failures), so it appears that either the updater got smarter or the problematic cases are restricted elsewhere. If doing this still causes issues, then the place to address it would probably be `8f5bdaf481/llvm/lib/Transforms/IPO/Attributor.cpp (L1856-L1859)`, which already prevents replacement outside the SCC, so I'm not quite sure what this check is intended to avoid. Differential Revision: https://reviews.llvm.org/D120987	2022-03-07 11:54:37 +01:00
Nikita Popov	a9b03d9e2e	[Attributor] Remove function pointer restriction for AAAlign This check is not compatible with opaque pointers. We can avoid it by adjusting the getPointerAlignment() implementation to avoid creating unnecessary ptrtoint expressions for bitcasted pointers. The code already uses OnlyIfReduced to not create an expression if it does not simplify, and this makes sure that folding a bitcast and ptrtoint into a ptrtoint doesn't count as a simplification. Differential Revision: https://reviews.llvm.org/D120904	2022-03-07 10:02:45 +01:00
Johannes Doerfert	5af11ec34b	[Attributor] Determine potentially loaded values through memory We already look through memory to determine where a value that is stored might pop up again (potential copies). This patch introduces the other direction with similar logic. If a value is loaded, we can follow all the accesses to the pointer (or better object) and try to determine what value might have been stored.	2022-03-06 23:26:37 -06:00
Johannes Doerfert	eb73af4af4	[Attributor] Handle undef and null in AAAlignFloating Both `undef` and `nullptr` are maximally aligned. This is especially important as we often see `undef` until a proper value has been identified during simplification.	2022-03-06 23:26:22 -06:00
Johannes Doerfert	ad26e199ff	[Attributor] Use CFG reasoning also for read accesses With D106397 we used CFG reasoning to filter out writes that will not interfere with a given load instruction. With this patch we use the same logic (modulo the reversal in reachability check order) for store instructions. As an example, we can now prove stores to shared memory are dead if all the loads of the shared memory are not reachable from them.	2022-03-06 23:26:22 -06:00
Johannes Doerfert	ff758372bd	[Attributor][NFCI] Introduce fine-grained anonymous namespaces	2022-03-06 21:28:38 -06:00
Johannes Doerfert	192a34ddb0	[Attributor][OpenMPOpt][FIX] Register simplification callbacks Heap-2-stack and heap-2-shared can replace an allocation call with something else. To avoid us deriving information from the allocator implementation we register a simplification callback now that will force us to stop at the call site. We probably should create the replacement memory eagerly and return that instead though.	2022-03-06 21:28:38 -06:00
Johannes Doerfert	5859ae6a5d	[Attributor][FIX] Use maximal access for dereferenceability deduction While we can use range information when we derive dereferenceability we must make sure to pick the right end of the range. Before we always went with the minimal offset, which is not correct if we want to combine the base dereferenceability with some offset. In that case it's the maximum that gives the correct result.	2022-03-06 21:28:38 -06:00
Johannes Doerfert	1fcd4d0e3b	[Attributor][FIX] Initialize stack variable	2022-03-06 21:28:38 -06:00
Johannes Doerfert	6158f4a466	[Attributor][NFCI] No repeated manifest of AAValueSimplifyReturned (CGSCC)	2022-03-06 19:59:23 -06:00
Johannes Doerfert	8fa839aa58	[Attributor][NFC] Improve debug messages	2022-03-06 19:59:22 -06:00
Simon Pilgrim	3b422455dd	[IPO] AAFunctionReachabilityFunction.updateImpl - reduce AAReachability scope. NFCI. We already have a check for !InstQueries.empty(), so move the for-range over InstQueries inside to avoid the AAReachability uninitialized variable static analysis warnings.	2022-02-25 14:42:31 +00:00
Augie Fackler	95f3cc222a	AttributorAttributes: avoid a crashing on bad alignments Prior to this change, LLVM would attempt to optimize an aligned_alloc(33, ...) call to the stack. This flunked an assertion when trying to emit the alloca, which crashed LLVM. Avoid that with extra checks. Differential Revision: https://reviews.llvm.org/D119604	2022-02-23 14:21:02 -05:00
Arthur Eubanks	1fd980de04	Revert "AttributorAttributes: avoid a crashing on bad alignments" This reverts commit `70ff6fbeb9`. Breaks bots, e.g. http://45.33.8.238/linux/69375/step_12.txt.	2022-02-23 09:08:03 -08:00
Augie Fackler	70ff6fbeb9	AttributorAttributes: avoid a crashing on bad alignments Prior to this change, LLVM would attempt to optimize an aligned_alloc(33, ...) call to the stack. This flunked an assertion when trying to emit the alloca, which crashed LLVM. Avoid that with extra checks. Differential Revision: https://reviews.llvm.org/D119604	2022-02-23 11:46:15 -05:00
Johannes Doerfert	254d6da020	[Attributor][FIX] Ensure stable iteration order With `668c5c688b` we introduced an ordering issue revealed by the reverse iteration buildbot. Depending on the order of the map that tracks the AAIsDead AAs we ended up with slightly different attributes. This is not totally unexpected and can happen. We should however be deterministic in our orderings to avoid such issues.	2022-02-17 12:53:10 -06:00
Johannes Doerfert	8ad39fbaf2	[Attributor][FIX] Heap2Stack needs to use the alloca AS When we move an allocation from the heap to the stack we need to allocate it in the alloca AS and then cast the result. This also prevents us from inserting the alloca after the allocation call but rather right before. Fixes https://github.com/llvm/llvm-project/issues/53858	2022-02-16 15:58:32 -06:00
Johannes Doerfert	668c5c688b	[Attributor][FIX] Use liveness information of the right function When we use liveness for edges during the `genericValueTraversal` we need to make sure to use the AAIsDead of the correct function. This patch adds the proper logic and some simple caching scheme. We also add an assertion to the `isEdgeDead` call to make sure future misuse is detected earlier. Fixes https://github.com/llvm/llvm-project/issues/53872	2022-02-16 15:58:32 -06:00
Johannes Doerfert	6ed1ef0643	[Attributor][FIX] Pipe UsedAssumedInformation through more interfaces `UsedAssumedInformation` is a return argument utilized to determine what information is known. Most APIs used it already but `genericValueTraversal` did not. This adds it to `genericValueTraversal` and replaces `AllCallSitesKnown` of `checkForAllCallSites` with the commonly used `UsedAssumedInformation`. This was supposed to be a NFC commit, then the test change appeared. Turns out, we had one user of `AllCallSitesKnown` (AANoReturn) and the way we set `AllCallSitesKnown` was wrong as we ignored the fact some call sites were optimistically assumed dead. Included a dedicated test for this as well now. Fixes https://github.com/llvm/llvm-project/issues/53884	2022-02-16 14:44:20 -06:00
Johannes Doerfert	dd75c0ea64	[Attributor][NFC] Expose new API in AAPointerInfo New users might want to check bins without a load or store instruction at hand. Since we use those instructions only to find the offset and size of the access anyway, we can expose an offset and size interface to the outside world as well. This commit mainly moves code around and exposes a class (OffsetAndSize) as well as a method forallInterferingAccesses in AAPointerInfo. Differential Revision: https://reviews.llvm.org/D119249	2022-02-10 13:52:24 -06:00
Johannes Doerfert	d1387a26a5	[Attributor][FIX] Reachability needs to account for readonly callees The oversight caused us to ignore call sites that are effectively dead when we computed reachability (or, more precisely, the call edges of a function). The problem is that loads in the readonly callee might depend on stores prior to the callee. If we do not track the call edge we mistakenly assume the store before the call cannot reach the load. The problem is nicely visible in: `llvm/test/Transforms/Attributor/ArgumentPromotion/basictest.ll` Caused by D118673. Fixes https://github.com/llvm/llvm-project/issues/53726	2022-02-10 13:52:24 -06:00
Johannes Doerfert	e39b419312	[Attributor][FIX] Honor alloca address space in AAPrivatizablePtr When we privatize a pointer (~argument promotion) we introduce new private allocas as replacement. These need to be placed in the alloca address space as later passes cannot properly deal with them otherwise. Fixes https://github.com/llvm/llvm-project/issues/53725	2022-02-10 13:52:24 -06:00
Johannes Doerfert	dd101c808b	[Attributor][FIX] Do not use assumed information for UB detection The helper `Attributor::checkForAllReturnedValuesAndReturnInsts` simplifies the returned value optimistically. In `AAUndefinedBehavior` we cannot use such optimistic values when deducing UB. As a result, we assumed UB for the return value of a function because we initially (=optimistically) thought the function return is `undef`. While we later adjusted this properly, the `AAUndefinedBehavior` was under the impression the return value is "known" (=fixed) and could never change. To correct this we use `Attributor::checkForAllInstructions` and then manually perform simplification of the return value, only allowing known values to be used. This actually matches the other UB deductions. Fixes #53647	2022-02-07 20:19:19 -06:00
Kazu Hirata	3a3cb929ab	[llvm] Use = default (NFC)	2022-02-06 22:18:35 -08:00
Kazu Hirata	cb13ebbf46	[Transforms] Use default member initialization in AAIsDeadCallSiteReturned (NFC)	2022-02-05 21:39:25 -08:00
Joseph Huber	6b78526b1b	[OpenMP] Emit remark on the captured call instead of the variable Changes the remark to emit on the function call that captures the globalized variable instead of the globalized variable itself. The user should be able to see which variable it was in the argument list of the function. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106980	2022-02-04 17:50:53 -05:00
Johannes Doerfert	a265cf22af	[Attributor] Introduce the `AA::isPotentiallyReachable` helper APIs To make usage easier (compared to the many reachability related AAs), this patch introduces a helper API, `AA::isPotentiallyReachable`, which performs all the necessary steps. It also does the "backwards" reachability (see D106720) as that simplifies the AA a lot (backwards queries were somewhat different from the other query resolvers), and ensures we use cached values in every stage. To test inter-procedural reachability in a reasonable way this patch includes an extension to `AAPointerInfo::forallInterferingWrites`. Basically, we can exclude writes if they cannot reach a load "during the lifetime" of the allocation. That is, we need to go up the call graph to determine reachability until we can determine the allocation would be dead in the caller. This leads to new constant propagations (through memory) in `value-simplify-pointer-info-gpu.ll`. Note: The new code contains plenty of debug output to determine how reachability queries are resolved. Parts extracted from D110078. Differential Revision: https://reviews.llvm.org/D118673	2022-02-01 01:40:45 -06:00
Johannes Doerfert	b51b83f68e	[Attributor] Introduce the concept of query AAs D106720 introduced features that did not work properly as we could add new queries after a fixpoint was reached and which could not be answered by the information gathered up to the fixpoint alone. As an alternative to D110078, which forced eager computation where we want to continue to be lazy, this patch fixes the problem. QueryAAs are AAs that allow lazy queries during their lifetime. They are never fixed if they have no outstanding dependences and always run as part of the updates in an iteration. To determine if we are done, all query AAs are asked if they received new queries, if not, we only need to consider updated AAs, as before. If new queries are present we go for another iteration. Differential Revision: https://reviews.llvm.org/D118669	2022-02-01 01:40:44 -06:00
Kuter Dinel	b2d1ae0611	[Attributor] AAFunctionReachability, Instruction reachability. This patch implements instruction reachability for the AAFunctionReachability attribute. It is used to tell if a certain instruction can reach a function transitively. NOTE: I created a new commit based on D106720 and set the author back to Kuter. Other metadata, etc. is wrong. I also addressed the remaining review comments and fixed the unit test. Differential Revision: https://reviews.llvm.org/D106720	2022-02-01 01:40:44 -06:00
Johannes Doerfert	ac3ec22df9	[Attributor] Use AAFunctionReachability to determine AANoRecurse We missed out on AANoRecurse in the module pass because we had no call graph. With AAFunctionReachability we can simply ask if the function may reach itself. Differential Revision: https://reviews.llvm.org/D110099	2022-02-01 01:40:44 -06:00
Johannes Doerfert	d1186ce7a9	[Attributor] Make interprocedural value explicit in genericValueTraversal genericValueTraversal can look through arguments and allow value simplification across function boundaries. In fact, the latter already happened unchecked. With this change we allow the user of genericValueTraversal to opt-out of interprocedural traversal if required. We explicitly look through arguments now which helps to do various things, incl. the propagation of constants into OpenMP parallel regions (on the host).	2022-02-01 01:40:44 -06:00
Johannes Doerfert	0f471710f8	[Attributor] Use edge liveness rather than block liveness We moved to the edge API a while back, not all uses were adjusted. Edge liveness is more precise.	2022-02-01 01:18:51 -06:00
Johannes Doerfert	53b6753bdd	[Attributor][FIX] Address two oversights in AAIsDead No tests as these were found browsing the code and I'm not sure how to test them properly.	2022-02-01 01:18:51 -06:00
Johannes Doerfert	cfabffb034	[Attributor][NFCI] Improve debug diagnostic	2022-02-01 01:18:51 -06:00
Johannes Doerfert	adf0d57f15	[Attributor] Provide convenient helpers for isAssumedRead{None,Only} We have two attributes that can answer readnone queries. While there is a dependence between them, it seems best to not force the users to know what AA to ask. The helpers also allow to check for readonly nicely. Test changes show where we now deduce readnone but haven't before, mostly because we only asked AAMemoryBehavior and not AAMemoryLocation. AANoAlias has not been ported to the new API yet.	2022-02-01 01:18:51 -06:00
Johannes Doerfert	e140d51319	[Attributor] Use CFG reasoning to filter potentially interfering writes Since D104432 we can look through memory by analyzing all writes that might interfere with a load. This patch provides some logic to exclude writes that cannot interfere with a location, due to CFG reasoning. We make sure to avoid multi-thread write-read situations properly while we ignore writes that cannot reach a load or writes that will be overwritten before the load is reached. Differential Revision: https://reviews.llvm.org/D106397	2022-02-01 01:18:51 -06:00
Johannes Doerfert	3f0e670498	[Attributor][NFCI] Expose some nosync reasoning to outside users. No-sync is a property that we need in more places as complex transformations emerge. To simplify the query we provide an `AA::isNoSyncInst` helper now and expose two existing helpers through the `AANoSync` class.	2022-02-01 01:07:50 -06:00
Johannes Doerfert	a5b6aef24e	[Attributor][NFCI] Remove anonymous namespaces The namespaces made it more complicated to implement static helpers, among other things. We should not need them at all.	2022-02-01 01:07:50 -06:00
Nikita Popov	67346b43e0	[Attributor] Use MemoryLocation to get pointer operand and accessed type (NFCI) This relies on existing APIs and avoids accessing the pointer element type. The alternative would be to extend getPointerOperand() to also return the accessed type, but I figured going through MemoryLocation would be cleaner. Differential Revision: https://reviews.llvm.org/D117868	2022-01-24 10:10:13 +01:00
Nikita Popov	e7762653d3	[Attributor] Avoid some pointer element type accesses	2022-01-21 11:20:10 +01:00
Johannes Doerfert	37e0c58559	[Attributor][FIX] AAValueConstantRange should not loop unconstrained The old method to avoid unconstrained expansion of the constant range in a loop did not work as soon as there were multiple instructions in between the phi and its input. We now take a generic approach and limit the number of updates as a fallback. The old method is kept as it catches "the common case" early.	2022-01-20 18:07:04 -06:00
Johannes Doerfert	7bf9065ad7	[Attributor][NFC] Clang format	2022-01-20 18:06:53 -06:00
Nikita Popov	a115bbea9b	[Attributor] Remove notional overindexing check AAPointerInfo currently bails on constant expression GEPs with notional overindexing. I don't think this is necessary, as the following code handling GEPOperator will deal with arbitrary indices appropriately. Differential Revision: https://reviews.llvm.org/D117203	2022-01-19 11:30:04 +01:00
Bryce Wilson	28b6e2cb3d	[Attributor] [NFC] Use canonical variable name Differential Revision: https://reviews.llvm.org/D117241	2022-01-13 23:06:00 -08:00
Philip Reames	5d5d4d94f0	[Attributor] Generalize heap to stack to any allocator with relevant properties This completes removal of the isXLike queries, and depends on a whole series of earlier patches which have already landed. Differential Revision: https://reviews.llvm.org/D117242	2022-01-13 15:33:24 -08:00
Philip Reames	cf66f01ec1	[Attributor] Share code for abstract interpretation of allocation sizes with getObjectSize [NFC-ish] The basic idea is that we can parameterize the getObjectSize implementation with a callback which lets us replace the operand before analysis if desired. This is what Attributor is doing during its abstract interpretation, and allows us to have one copy of the code. Note this is not NFC for two reasons: * The existing attributor code is wrong. (Well, this is under-specified to be honest, but at least inconsistent.) The intermediate math needs to be done in the index type of the pointer space. Imagine e.g. i64 arguments in a 32 bit address space. * I did not preserve the behavior in getAPInt where we return 0 for a partially analyzed value. This looks simply wrong in the original code, and nothing test wise contradicts that. Differential Revision: https://reviews.llvm.org/D117241	2022-01-13 15:33:24 -08:00
Philip Reames	9979299705	[Attributor] Simplify how we handle required alignment during heap-to-stack [NFC] The existing code duplicated the same concern in two places, and (weirdly) changed the inference of the allocation size based on whether we could meet the alignment requirement. Instead, just directly check the allocation requirement.	2022-01-12 17:34:17 -08:00
Philip Reames	d1f4c6a611	[Attributor] Generalize calloc handling in heap-to-stack for any init value [NFC] Rewrite the calloc specific handling in heap-to-stack to allow arbitrary init values. The basic problem being solved is that if an allocation is initialized to anything other than zero, this must be explicitly done for the formed alloca as well. This covers the calloc case today, but once a couple of earlier guards are removed in this code, downstream allocators with other init values could also be handled. Inspired by discussion on D116971	2022-01-12 16:58:39 -08:00
Philip Reames	8e76720cf2	[Attributor] Reuse object size evaluation code [NFC]	2022-01-12 16:58:39 -08:00
Philip Reames	db57065b36	[Attributor] Use getAllocAlignment where possible [NFC] Inspired by D116971.	2022-01-12 16:58:39 -08:00
Johannes Doerfert	7b39dccbe4	[Attributor][FIX] Ensure "IsExact" is false for non-exact accesses If we look at potentially interfering accesses we need to ensure the "IsExact" flag is set appropriately. Accesses that have an "unknown" size or offset cannot be exact matches and we failed to flag that. Error and test reported by Serguei N. Dmitriev.	2022-01-10 10:09:36 -06:00
Nikita Popov	92d55e7336	[MemoryBuiltins] Remove isNoAliasFn() in favor of isNoAliasCall() We currently have two similar implementations of this concept: isNoAliasCall() only checks for the noalias return attribute. isNoAliasFn() also checks for allocation functions. We should switch to only checking the attribute. SLC is responsible for inferring the noalias return attribute for non-new allocation functions (with a missing case fixed in `348bc76e35`). For new, clang is responsible for setting the attribute, if -fno-assume-sane-operator-new is not passed. Differential Revision: https://reviews.llvm.org/D116800	2022-01-10 09:18:15 +01:00
Johannes Doerfert	4e8a02e7f4	[Attributor][FIX] Remove assumption that doesn't have to hold There is no guarantee we strip all GEPOperators and the conservative handling doesn't even require us to.	2022-01-09 13:15:53 -06:00
Johannes Doerfert	6c745e04fa	[Attributor][FIX] Ensure order for multiple references into map If we have multiple references into a map we need to ensure the ones created late do not invalidate the ones created early. To do that we need to make sure all but the first are not modifying the map, hence for them the keys have to be present already. Fixes #52875.	2022-01-08 16:59:21 -06:00
Johannes Doerfert	5602c866c0	[Attributor] Look through allocated heap memory AAPointerInfo, and thereby other places, can already look through internal global and stack memory. This patch enables them to look through heap memory returned by functions with a `noalias` return. In the future we can look through `noalias` arguments as well but that will require AAIsDead to learn that such memory can be inspected by the caller later on. We also need to teach AAPointerInfo about dominance to actually deal with memory that might not be `null` or `undef` initialized. D106397 is a first step in that direction already. Reviewed By: kuter Differential Revision: https://reviews.llvm.org/D109170	2021-12-29 00:21:36 -06:00
Johannes Doerfert	6e2fcf8513	[Attributor][FIX] Ensure store uses are correlated with reloads While we skipped uses in stores if we can find all copies of the value when the memory is loaded, we did not correlate the use in the store with the use in the load. So far this led to less precise results in the offset calculations which prevented deductions. With the new EquivalentUseCB callback argument the user of checkForAllUses can be informed of the correlation and act on it appropriately. Differential Revision: https://reviews.llvm.org/D109662	2021-12-28 23:53:29 -06:00
Joseph Huber	38fc89623b	[Attributor][Fix] Add alignment return attribute to HeapToStack This patch changes the HeapToStack optimization to attach the return alignment attribute information to the created alloca instruction. This would cause problems when replacing the heap allocation with an alloca did not respect the alignment of the original heap allocation, which would typically be aligned on an 8 or 16 byte boundary. Malloc calls now contain alignment attributes, so we can use that information here. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D115888	2021-12-27 16:58:23 -05:00
Nikita Popov	69ffc3cee9	[Attributor] Directly call areTypesABICompatible() hook Instead of using the ArgumentPromotion implementation, we now walk call sites using checkForAllCallSites() and directly call areTypesABICompatible() using the replacement types. I believe that resolves the TODO in the code. Differential Revision: https://reviews.llvm.org/D116033	2021-12-24 09:20:31 +01:00
Matt Arsenault	a25111c9e2	Attributor: Fix typo in function name	2021-12-04 11:25:22 -05:00
Kazu Hirata	7505b7045f	[llvm] Use GetElementPtrInst::indices (NFC)	2021-11-13 21:43:28 -08:00

366 Commits