llvm-project

Author	SHA1	Message	Date
Tobias Grosser	922452285a	Keep track of ScopArrayInfo objects that model PHI node storage Summary: When translating PHI nodes into memory dependences during code generation we require two kinds of memory. 'Normal memory' as for all scalar dependences and 'PHI node memory' to store the incoming values of the PHI node. With this patch we now mark and track these two kinds of memories, which we previously incorrectly marked as a single memory object. Being aware of PHI node storage makes code generation easier, as we do not need to guess what kind of storage a scalar reference requires. This simplifies the code nicely. Reviewers: jdoerfert Subscribers: pollydev, llvm-commits Differential Revision: http://reviews.llvm.org/D11554 llvm-svn: 243420	2015-07-28 14:53:44 +00:00
Tobias Grosser	d4dd6ec74d	Simplify code in BlockGenerator::generateScalarLoads [NFC] We hoist statements that are used on both branches of an if-condition, shorten and unify some variable names and fold some variable declarations into their only uses. We also drop a comment which just describes the elements the loop iterates over. No functional change intended. llvm-svn: 243291	2015-07-27 17:57:58 +00:00
Johannes Doerfert	ecff11dcfb	Add scalar and phi code generation To reduce compile time and to allow more and better quality SCoPs in the long run we introduced scalar dependences and PHI-modeling. This patch will now allow us to generate code if one or both of those options are set. While the principle of demoting scalars as well as PHIs to memory in order to communicate their value stays the same, this allows to delay the demotion till the very end (the actual code generation). Consequently: - We __almost__ do not modify the code if we do not generate code for an optimized SCoP in the end. Thus, the early exit as well as the unprofitable option will now actually prevent us from introducing regressions in case we will probably not get better code. - Polly can be used as a "pure" analyzer tool as long as the code generator is set to none. - The original SCoP is almost not touched when the optimized version is placed next to it. Runtime regressions if the runtime checks chooses the original are not to be expected and later optimizations do not need to revert the demotion for that part. - We will generate direct accesses to the demoted values, thus there are no "trivial GEPs" that select the first element of a scalar we demoted and treated as an array. Differential Revision: http://reviews.llvm.org/D7513 llvm-svn: 238070	2015-05-22 23:43:58 +00:00
Tobias Grosser	ba0d09227c	Sort include directives Upcoming revisions of isl require us to include header files explicitly, which have previously been already transitively included. Before we add them, we sort the existing includes. Thanks to Chandler for sort_includes.py. A simple, but very convenient script. llvm-svn: 236930	2015-05-09 09:13:42 +00:00
Tobias Grosser	2873645c51	Drop -polly-vectorizer-unroll-only option This option was earlier used for experiments with the vectorizer, but to my knowledge is not really used anymore. If anybody needs this, we can always reintroduce this feature. llvm-svn: 232934	2015-03-23 07:00:36 +00:00
Tobias Grosser	bb4126470a	Drop option to prepare code for the BB vectorizer The BB vectorizer is deprecated and there is no point in generating code for it any more. This option was introduced when there was not yet any loop vectorizer in sight. Now being matured, Polly should target the loop vectorizer. llvm-svn: 232099	2015-03-12 20:47:58 +00:00
Tobias Grosser	c98950671d	Fix compilation after DataLayout was added to ScevExpander The corresponding LLVM commit is 231740. llvm-svn: 231793	2015-03-10 15:24:33 +00:00
Johannes Doerfert	f32d651df6	[Refactor] Include explicitly what is used llvm-svn: 230901	2015-03-01 18:45:58 +00:00
Johannes Doerfert	514f6efa2b	[FIX] Teach RegionGenerator to respect and update dominance When we generate code for a whole region we have to respect dominance and update it too. The first is achieved with multiple "BBMap"s. Each copied block in the region gets its own map. It is initialized only with values mapped in the immediate dominator block, if this block is in the region and was therefore already copied. This way no values defined in a block that doesn't dominate the current one will be used. To update dominance information we check if the immediate dominator of the original block we want to copy is in the region. If so we set the immediate dominator of the current block to the copy of the immediate dominator of the original block. llvm-svn: 230774	2015-02-27 18:29:04 +00:00
Johannes Doerfert	275a1756ad	Allow non-affine control flow -- Code Generation This is the code generation for region statements that are created when non-affine control flow was present in the input. A new generator, similar to the block or vector generator, for regions is used to traverse and copy the region statement and to adjust the control flow inside the new region in the end. llvm-svn: 230340	2015-02-24 16:16:32 +00:00
Johannes Doerfert	b4f08eb671	[REFACTOR] Replace Pass* from BlockGen by the DomTree llvm-svn: 230220	2015-02-23 13:51:35 +00:00
Johannes Doerfert	1ef52333c5	[Refactor] Use the LoopInfo object already present llvm-svn: 228540	2015-02-08 20:50:42 +00:00
Johannes Doerfert	be9c91173f	[Refactor] Use only one BlockGenerator for a SCoP This change has two main purposes: 1) We do not use a static interface to hide an object we create and destroy for every basic block we copy. 2) We allow the BlockGenerator to store information between calls to the copyBB method. This will ease scalar/phi code generation later on. While a lot of method signatures were changed this should not cause any real behaviour change. Differential Revision: http://reviews.llvm.org/D7467 llvm-svn: 228443	2015-02-06 21:39:31 +00:00
Johannes Doerfert	9e3a5db000	[FIX] Debug build + intrinsic handling The ignored intrinsics needed to be ignored in three other places as well. Tests and lnt pass now. llvm-svn: 227092	2015-01-26 15:55:54 +00:00
Johannes Doerfert	3f500fa2f6	Support for math/misc intrinsics The support is currently limited as we only allow them in the input but do not emit them in the transformed SCoP due to the possible semantic changes. Differential Revision: http://reviews.llvm.org/D5225 llvm-svn: 227054	2015-01-25 18:07:30 +00:00
Tobias Grosser	a8cd15249a	Remove redundant semicolon clang-format complained about llvm-svn: 226402	2015-01-18 15:59:16 +00:00
Chandler Carruth	5ec3333d24	[PM] Update Polly for LLVM r226394 and r226396 which changed some of the block splitting interfaces to accept specific analyses rather than a pass. llvm-svn: 226398	2015-01-18 10:52:23 +00:00
Chandler Carruth	f557987b15	[PM] Update Polly following LLVM r226373 which refactors LoopInfo in preparation for the new pass manager. llvm-svn: 226374	2015-01-17 14:16:56 +00:00
Tobias Grosser	683b8e4462	Remove -polly-codegen-scev option and related code SCEV based code generation has been the default for two weeks after having been tested for a long time. We now drop the support for the non-scev-based code generation. llvm-svn: 222978	2014-11-30 14:33:31 +00:00
Tobias Grosser	7031141009	Enable SCEV based code generation by default SCEV based code generation allows Polly to detect and generate code for loops that do not have an explicit induction variable, but only virtual induction variables given by SCEV. Being able to do so has two main benefits: - We can detect more scops by default - We require less canonicalization before Polly, which means we get closer to our goal of not touching the IR before analyzing its properties. Specifically, we do not need to run -polly-indvars to introduce explicit canonical induction variables. This switch became possible as both the isl code generation and -polly-parallel are LNT error free with SCEV based code generation and the isl ast generator. llvm-svn: 222113	2014-11-16 22:50:23 +00:00
Tobias Grosser	16371acdc4	BlockGenerator: Recompute values from SCEV before handing back the original values This patch moves the SCEV based (re)generation of values before the checking for scop-constant terms. It enables us to provide SCEV based replacements, which are necessary to correctly generate OpenMP subfunctions when using the SCEV based code generation. When recomputing a new value for a value used in the code of the original scop, we previously directly returned the same original value for all scop-constant expressions without even trying to regenerate these values using our SCEV expression. This is correct when the newly generated code remains fully in the same function, however in case we want to outline parts of the newly generated scop into subfunctions, this approach means we do not have any opportunity to update these values in the SCEV based code generation. (In the non-SCEV based code generation, we can provide such updates through the GlobalMap). To ensure we have this opportunity, we first try to regenerate scalar terms with our SCEV builder and will only return scop-constant expressions if SCEV based code generation was not possible. This change should not affect the results of the existing code generation passes. It only impacts the upcoming OpenMP based code generation. This commit also adds a test case. This test case passes before and after this commit. It was added to ensure test coverage for the changed code. llvm-svn: 221393	2014-11-05 20:48:56 +00:00
Tobias Grosser	d213a8b810	BlockGenerator: inline lookupAvailableValue into getValue [NFC] There was no good reason why this code was split across two functions. In subsequent changes we will change the order in which values are looked up. Doing so would make the split into two functions even more arbitrary. We also slightly improve the documentation. llvm-svn: 221388	2014-11-05 19:46:04 +00:00
Tobias Grosser	878aba4980	Remove incorrect value description for -enable-polly-aligned The description of the parameter value passed to -enable-polly-aligned did not make any sense at all, but was just a leftover coming from when this option was copied from -enable-polly-openmp. We just drop it as the option description gives sufficient information already. llvm-svn: 220445	2014-10-22 23:22:41 +00:00
Johannes Doerfert	a99130f042	[Refactor][NFC] Simplify and clean the handling of (new) access relations This patch does not change the semantics on its own. However, the dependence analysis as well as dce will now use the newest available access relation for each memory access, thus if at some point the json importer or any other pass will run before those two and set a new access relation the behaviour will be different. In general it is unclear if the dependence analysis and dce should be run on the old or new access functions anyway. If we need to access the original access function from the outside later, we can expose the getter again. Differential Revision: http://reviews.llvm.org/D5707 llvm-svn: 219612	2014-10-13 12:58:03 +00:00
Johannes Doerfert	1947f863ef	[Refactor][NFC] Simplify (Vector)BlockGenerator This removes an unnecessary cast and an unnecessary local variable. Differential Revision: http://reviews.llvm.org/D5662 llvm-svn: 219338	2014-10-08 20:18:32 +00:00
Johannes Doerfert	731685e6bc	Allow the VectorBlockGenerator to use the IslExprBuilder. This also enables the VectorBlockGenerator to build load store accesses according to the newAccessRelation of a MemoryAccess. llvm-svn: 219321	2014-10-08 17:25:30 +00:00
Johannes Doerfert	87901453d9	Align copied load/store instructions as the original. This also forbids the json importer to access other memory locations than the original instruction as we reuse the alignment of the original load/store. Differential Revision: http://reviews.llvm.org/D5560 llvm-svn: 218883	2014-10-02 16:22:19 +00:00
Johannes Doerfert	dcb5f1dcf6	Allow the IslExprBuilder to build address of expressions llvm-svn: 218045	2014-09-18 11:14:30 +00:00
Johannes Doerfert	2ef3f4fd23	Make used analysis passes explicit Use the explicit analysis if possible, only for splitBlock we will continue to use the Pass * argument. This change allows us to remove the getAnalysis calls from the code generation. llvm-svn: 215121	2014-08-07 17:14:54 +00:00
Johannes Doerfert	a63b2579c6	Fix the modifiable access creation + Remove the class IslGenerator which duplicates the functionality of IslExprBuilder. + Use the IslExprBuilder to create code for memory access relations. + Also handle array types during access creation. + Enable scev codegen for one of the transformed memory access tests, thus access creation without canonical induction variables available. + Update one test case to the new output. llvm-svn: 214659	2014-08-03 01:51:59 +00:00
Johannes Doerfert	d9e1dbd376	[Refactor] Use non-const MemoryAccess base addresses llvm-svn: 214168	2014-07-29 08:36:18 +00:00
Tobias Grosser	483a90d1bd	clang-format polly to avoid buildbot noise llvm-svn: 212609	2014-07-09 10:50:10 +00:00
Tobias Grosser	91f5b26268	Use range-based for loops llvm-svn: 210170	2014-06-04 08:06:40 +00:00
Tobias Grosser	954939842f	Really fix the load case. Commit r206510 falsely advertised to fix the load cases, even though it only fixed the store case. This commit adds the same fix for the load case including the missing test coverage. llvm-svn: 206577	2014-04-18 09:46:35 +00:00
Tobias Grosser	50fd7010d8	Ensure a scalar pointer when issuing a vector load Even though we may want to generate a vector load, the address from which to load still is a scalar. Make sure even if previous address computations may have been vectorized, that the addresses are also available as scalars. This fixes http://llvm.org/PR19469 Reported-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> llvm-svn: 206510	2014-04-17 23:13:49 +00:00
Tobias Grosser	5a56cbf496	[C++11] Use nullptr llvm-svn: 206361	2014-04-16 07:33:47 +00:00
Tobias Grosser	2f4529f864	clang-format: Remove empty lines llvm-svn: 204468	2014-03-21 14:04:25 +00:00
Tobias Grosser	0dd463facf	Support for generating vectors for loads with -1 stride This patch enables vectorization of loops containing backward array traversal (array stride is -1). Contributed-by: Chris Jenneisch <chrisj@codeaurora.org> llvm-svn: 204257	2014-03-19 19:27:24 +00:00
Tobias Grosser	5103ba7c91	Introduce PollyIRBuilder PollyIRBuilder is currently just a typedef to IRBuilder<>. Consequently, this change should not affect behavior. In subsequent patches we will extend its functionality to emit loop.parallel metadata. llvm-svn: 202853	2014-03-04 14:58:49 +00:00
Tobias Grosser	030237d0ff	Codegen: Do not crash when seeing debug intrinsics We now skip the debug intrinsics which is a lot better than crashing due to uncopied metadata references. We should step by step investigate which debug intrinsics we can copy without trouble. We still keep the debug location metadata. llvm-svn: 201860	2014-02-21 15:06:05 +00:00
Hongbin Zheng	5b463ceaf5	BlockGenerator: Split getNewValue. Split the old getNewValue into two parts: 1. The function "lookupAvailableValue" that return the new version of the instruction which is already available. 2. The function calls "lookupAvailableValue", and tries to generate the new version if it is not available yet. llvm-svn: 187114	2013-07-25 09:12:07 +00:00
Hongbin Zheng	fe11e287b4	BlockGenerator: Simplify the old value searching code. Originally, we first test if a ValueMap contains a Value, and then use the index operator to get the corresponding new value. This requires the ValueMap to lookup the key (i.e. the old value) twice. Now, we directly use the "lookup" function provided by DenseMap to implement the same functionality. llvm-svn: 185260	2013-06-29 13:22:15 +00:00
Hongbin Zheng	b5f24a665e	Refactor memory access getting code in ScopStmt. 1. Do not allow creating new memory access record in the InstructionToAccess map on the fly in function getAccessFor. 2. Do not allow user to modify the memory accesses returned by getAccessFor during the code generation process. llvm-svn: 185253	2013-06-29 06:31:39 +00:00
Tobias Grosser	58032cb029	Integrate latest clang-format changes llvm-svn: 184655	2013-06-23 01:29:29 +00:00
Tobias Grosser	edab1359a0	Use isl_val instead of isl_int in the core of Polly isl recently introduced isl_val as an abstract interface to represent arbitrary precision numbers. This interface supersedes the old isl_int interface. In contrast to the old interface which implemented arbitrary precision arithmetic using macros that forward to the gmp library, the new library hides the math library implementation in isl. This allows us to switch the math library used by isl without affecting users such as Polly. llvm-svn: 184529	2013-06-21 06:41:31 +00:00
Sebastian Pop	3d94fedf0b	add comments to clarify the use of a temporary variable in the map insertion llvm-svn: 182662	2013-05-24 18:46:02 +00:00
Sebastian Pop	753d43f974	fix insertion of values in BBMap In GDB when "step" through generateScalarLoad and "finish" the call, the returned value is non NULL, however when printing the value contained in BBMap[Load] after this stmt: BBMap[Load] = generateScalarLoad(...); the value in BBMap[Load] is NULL, and the BBMap.count(Load) is 1. The only intuitive idea that I have to explain this behavior is that we are playing with the undefined behavior of eval order of the params for the function standing for "BBMap[Load] = generateScalarLoad()". "BBMap[Load] = " may be executed before generateScalarLoad is called. Here are some other possible explanations from Will Dietz <w@wdtz.org>: The error is likely due to BBMap[Load] being evaluated first (creating a {Load -> uninitialized } entry in the DenseMap), then generateScalarLoad eventually accesses the same element and finds it to be NULL (DenseMap[Old]).. Offhand I'm not sure if this is guaranteed to be NULL or if it's uninitialized and happens to be NULL. The same issue can also go wrong in an even worse way: the second DenseMap access can trigger a rehash and invalidate the an earlier evaluated expression (for example LHS of the assignment), leading to a crash when performing the assignment store. llvm-svn: 182655	2013-05-24 17:16:02 +00:00
Tobias Grosser	83628182f7	Sort includes llvm-svn: 181297	2013-05-07 08:11:54 +00:00
Tobias Grosser	637bd63123	Move polly options into separate option category Use the new cl::OptionCategory support to move the Polly options into a separate option category. The aim is to hide most options and show by default only the options a user needs to influence '-O3 -polly'. The available options probably need some care, but here is the current status: Polly Options: Configure the polly loop optimizer -enable-polly-openmp - Generate OpenMP parallel code -polly - Enable the polly optimizer (only at -O3) -polly-no-tiling - Disable tiling in the scheduler -polly-only-func=<function-name> - Only run on a single function -polly-report - Print information about the activities of Polly -polly-vectorizer - Select the vectorization strategy =none - No Vectorization =polly - Polly internal vectorizer =unroll-only - Only grouped unroll the vectorize candidate loops =bb - The Basic Block vectorizer driven by Polly llvm-svn: 181295	2013-05-07 07:31:10 +00:00
Tobias Grosser	e602a07662	Reformat with clang-format clang-format become way more stable. This time we mainly reformat function signatures. llvm-svn: 181294	2013-05-07 07:30:56 +00:00
Tobias Grosser	e8df5bd92b	IndependentBlocks: We can only reconstruct PHI nodes that are within the ScoP In the classical (non -polly-codegen-scev) mode, we assume that we can always recreate PHI nodes during code generation. This is not true. We can only reconstruct them from the polyhedral information, in case the entire loop of the PHI node is part of the SCoP and consequently the PHI node was translated in the polyhedral description. llvm-svn: 179674	2013-04-17 07:20:36 +00:00
Tobias Grosser	d7e58640a5	Update formatting to latest version of clang-format llvm-svn: 179160	2013-04-10 06:55:45 +00:00
Tobias Grosser	4d96c8d714	clang-format: Many more files After this commit, polly is clang-format clean. This can be tested with 'ninja polly-check-format'. Updates to clang-format may change this, but the differences will hopefully be both small and general improvements to the formatting. We currently have some not very nice formatting for a couple of items, DEBUG() stmts for example. I believe the benefit of being clang-format clean outweighs the not perfect layout of this code. llvm-svn: 177796	2013-03-23 01:05:07 +00:00
Tobias Grosser	369430ffca	codegen: properly instantiate SCEVs to the place where they are used Given the following code for (i = 0; i < 10; i++) { ; } S: A[i] = 0 When code generating S using scev based code generation, we need to retrieve the scev of 'i' at the location of 'S'. If we do not do this the scev that we obtain will be expressed as {0,+,1}_for and will reference loop iterators that do not surround 'S' and that we consequently do not know how to code generate. What we really want is the scev to be instantiated to the value of 'i' after the loop. This value is {10} and it can be code generated without troubles. llvm-svn: 177777	2013-03-22 23:42:53 +00:00
Tobias Grosser	ecfe21b792	Remove dependence on canonical induction variable When using the scev based code generation, we now do not rely on the presence of a canonical induction variable any more. This commit prepares the path to (conditionally) disable the induction variable canonicalization pass. llvm-svn: 177548	2013-03-20 18:03:18 +00:00
Sebastian Pop	97cb813c29	Correct function to decide if a SCEV can be ignored When doing SCEV based code generation, we ignore instructions calculating values that are fully defined by a SCEV expression. The values that are calculated by this instructions are recalculated on demand. This commit improves the check to verify if certain instructions can be ignored and recalculated on demand. llvm-svn: 177313	2013-03-18 20:21:13 +00:00
Tobias Grosser	7242ad9226	CodeGen: clang-format llvm-svn: 175872	2013-02-22 08:07:06 +00:00
Sebastian Pop	47d4ee3ed4	capitalize SCEV to match the current naming convention llvm-svn: 175306	2013-02-15 21:26:53 +00:00
Sebastian Pop	637b23dc63	use apply and ScevParameterRewriter::rewrite instead of SCEVRewriter llvm-svn: 175296	2013-02-15 20:56:01 +00:00
Sebastian Pop	9d10fffa33	add LoopToScev maps llvm-svn: 175295	2013-02-15 20:55:59 +00:00
Tobias Grosser	c14582f276	CodeGen: clang-format goodness The changed files are not yet clang-format clean, but we are getting close. llvm-svn: 174403	2013-02-05 18:01:29 +00:00
Tobias Grosser	ae2d83ec41	Formatting: Break lines after binary operators such as '&&' assert(Condition && "Text"); -> assert(Condition && "Text"); This aligns Polly with the style used in LLVM. llvm-svn: 171242	2012-12-29 23:57:18 +00:00
Tobias Grosser	1bb59b0dcf	Fix obvious formatting problems. We fix the following formatting problems found by clang-format: - 80 cols violations - Obvious problems with missing or too many spaces - multiple new lines in a row clang-format suggests many more changes, most of them falling in the following two categories: 1) clang-format does not at all format a piece of code nicely 2) The style that clang-format suggests does not match the style used in Polly/LLVM I consider differences caused by reason 1) bugs, which should be fixed by improving clang-format. Differences due to 2) need to be investigated closer to understand the cause of the difference and the solution that should be taken. llvm-svn: 171241	2012-12-29 23:47:38 +00:00
Sebastian Pop	a00a029115	change interface for isStride isStride now takes a partial schedule as input. Patch from Tobias Grosser <tobias@grosser.es>. llvm-svn: 170419	2012-12-18 07:46:06 +00:00
Hongbin Zheng	6879421727	Allow polly ask bb-vectorizer to vectorize the loop body. llvm-svn: 156254	2012-05-06 10:22:19 +00:00
Tobias Grosser	e71c6ab54c	SCEV based code generation This is an incomplete implementation of the SCEV based code generation. When finished it will remove the need for -indvars -enable-iv-rewrite. For the moment it is still disabled. Even though it passes 'make polly-test', there are still loose ends especially in respect of OpenMP code generation. llvm-svn: 155717	2012-04-27 16:36:14 +00:00
Hongbin Zheng	8a8466106c	Refactor: Move the code generation related header files to include/polly/CodeGen. llvm-svn: 155547	2012-04-25 13:18:28 +00:00
Hongbin Zheng	3b11a16a44	Refactor: Move the declaration of the BlockGenerator/VectorBlockGenerator to standalone header and source files. llvm-svn: 155546	2012-04-25 13:16:49 +00:00
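
Note on commit 753d43f974 ("fix insertion of values in BBMap"): the message describes an assignment of the form BBMap[Load] = generateScalarLoad(...), where the called function accesses the same map. The pattern that avoids the problem, storing the result in a temporary before indexing the map, can be sketched as follows. This is only an illustration under stated assumptions: ValueMap, generateValue and insertGenerated are hypothetical names, and std::unordered_map stands in for LLVM's DenseMap. Unlike DenseMap, std::unordered_map keeps references to elements valid across a rehash, so the sketch shows the shape of the fix and does not reproduce the original failure.

```cpp
#include <string>
#include <unordered_map>

// Hypothetical stand-in for BBMap; the real code uses an LLVM DenseMap.
using ValueMap = std::unordered_map<std::string, int>;

// Hypothetical generator: like generateScalarLoad in the commit message, it
// accesses the same map while computing its result.
int generateValue(ValueMap &Map, const std::string &Key) {
  // Inserting helper entries may trigger a rehash of the container.
  for (int I = 0; I < 64; ++I)
    Map[Key + "." + std::to_string(I)] = I;
  return 42;
}

// Safe pattern: complete the call (and all of its map accesses) first,
// then index the map and store the result.
void insertGenerated(ValueMap &Map, const std::string &Key) {
  int NewValue = generateValue(Map, Key);
  Map[Key] = NewValue;
}
```

With the temporary, the order of the two map accesses no longer depends on how the compiler orders the operands of the assignment. Since C++17 the right operand of a built-in assignment is sequenced before the left one, but the explicit temporary expresses the intended order under any language standard.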

268 Commits