llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	6217e18a7d	Add preliminary implementation for GPGPU code generation. Translate the selected parallel loop body into a ptx string and run it with the cuda driver API. We limit this preliminary implementation to target the following special test cases: - Support only 2-dimensional parallel loops with or without only one innermost non-parallel loop. - Support write memory access to only one array in a SCoP. The patch was committed with smaller changes to the build system: There is now a flag to enable gpu code generation explictly. This was required as we need the llvm.codegen() patch applied on the llvm sources, to compile this feature correctly. Also, enabling gpu code generation does not require cuda. This requirement was removed to allow 'make polly-test' runs, even without an installed cuda runtime. Contributed by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 161239	2012-08-03 12:50:07 +00:00
Tobias Grosser	6cc23b07e6	Revert "Add preliminary implementation for GPGPU code generation." I did not take into account, that this patch fails to compile without the llvm.codegen patch applied. This breaks buildbots. I revert this until we found a solution to commit this without buildbots complaining. This reverts commit cb43ab80e94434e780a66be3b9a6ad466822fe33. llvm-svn: 160165	2012-07-13 07:44:56 +00:00
Tobias Grosser	b299d28181	Add preliminary implementation for GPGPU code generation. Translate the selected parallel loop body into a ptx string and run it with cuda driver API. We limit this preliminary implementation to target the following special test cases: - Support only 2-dimensional parallel loops with or without only one innermost non-parallel loop. - Support write memory access to only one array in a SCoP. Contributed by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 160164	2012-07-13 07:21:00 +00:00
Tobias Grosser	3a275d20dd	Move executeScopConditionally() into its own file We will reuse this function for the isl code generator. llvm-svn: 157605	2012-05-29 09:11:54 +00:00
Tobias Grosser	0a91f3220b	Move CLooG.h into include/polly/CodeGen/ llvm-svn: 157604	2012-05-29 09:11:46 +00:00
Tobias Grosser	e192b23f5e	Move isParallelFor into CodeGeneration This removes another include of CLooG header files. llvm-svn: 157242	2012-05-22 08:46:07 +00:00
Sebastian Pop	e1f6554ed8	add some more missing ifdef CLOOG_FOUND llvm-svn: 156306	2012-05-07 16:35:11 +00:00
Hongbin Zheng	6879421727	Allow polly ask bb-vectorizer to vectorize the loop body. llvm-svn: 156254	2012-05-06 10:22:19 +00:00
Hongbin Zheng	8a8466106c	Refactor: Move the code generation related header files to include/polly/CodeGen. llvm-svn: 155547	2012-04-25 13:18:28 +00:00
Hongbin Zheng	3b11a16a44	Refactor: Move the declaration of the BlockGenerator/VectorBlockGenerator to standalone header and source files. llvm-svn: 155546	2012-04-25 13:16:49 +00:00
Hongbin Zheng	4ac4e15582	Refactor: Pass the argument 'IRBuilder' and 'AfterBlock' of function 'createLoop' by reference, so that we do not need to type an extra '&' operator when calling the function. llvm-svn: 155349	2012-04-23 13:03:56 +00:00
Tobias Grosser	4cb5461dae	CodeGen: Generate scalar code if vector instructions cannot be generated This fixes two crashes that appeared in case of: - A load of a non vectorizable type (e.g. float**) - An instruction that is not vectorizable (e.g. call) llvm-svn: 154586	2012-04-12 10:46:55 +00:00
Tobias Grosser	84ecc47e1c	CodeGen: Allow Polly to do 'grouped unrolling', but no vector generation. Grouped unrolling means that we unroll a loop such that the different instances of a certain statement are scheduled right after each other, but we do not generate any vector code. The idea here is that we can schedule the bb vectorizer right afterwards and use it heuristics to decide when vectorization should be performed. llvm-svn: 154251	2012-04-07 06:16:08 +00:00
Tobias Grosser	8239a4cadf	CodeGen: Remove unused declaration llvm-svn: 153954	2012-04-03 12:37:14 +00:00
Tobias Grosser	0905a23806	CodeGen: Recreate old ivs with the original type To avoid overflows we still use a larger type (i64) while calculating the value of the old ivs. However, we truncate the result to the type of the old iv when providing it to the new code. A corresponding test case is added to the polly test suite. Also, a failing test case is fixed. This fixes PR12311. Contributed by: Tsingray Liu <tsingrayliu@gmail.com> llvm-svn: 153952	2012-04-03 12:24:32 +00:00
Tobias Grosser	f00bd96cde	CodeGen: Some style improvements llvm-svn: 153871	2012-04-02 12:26:13 +00:00
Tobias Grosser	d6d7d7e128	CodeGen: Remove unused variable llvm-svn: 153840	2012-04-01 16:55:30 +00:00
Tobias Grosser	89339067b0	CodeGen: Allow function parameters to be rewritten in getNewValue() When deriving new values for the statements of a SCoP, we assumed that parameter values are constant within the SCoP and consquently do not need to be rewritten. For OpenMP code generation this assumption is wrong, as such values are not available in the OpenMP subfunction and consequently also may need to be rewritten. Committed with some changes. Contributed-By: Johannes Doerfert <s9jodoer@stud.uni-saarland.de> llvm-svn: 153838	2012-04-01 16:49:45 +00:00
Hongbin Zheng	609089f254	Move the CodeGeneration.cpp to the CodeGen folder and update the build system. Patched by Tsingray. llvm-svn: 153736	2012-03-30 08:46:18 +00:00

19 Commits