llvm-project

Commit Graph

Author	SHA1	Message	Date
Yaxun Liu	8f66b4b44a	Add support for non-zero null pointer for C and OpenCL In amdgcn target, null pointers in global, constant, and generic address space take value 0 but null pointers in private and local address space take value -1. Currently LLVM assumes all null pointers take value 0, which results in incorrectly translated IR. To workaround this issue, instead of emit null pointers in local and private address space, a null pointer in generic address space is emitted and casted to local and private address space. Tentative definition of global variables with non-zero initializer will have weak linkage instead of common linkage since common linkage requires zero initializer and does not have explicit section to hold the non-zero value. Virtual member functions getNullPointer and performAddrSpaceCast are added to TargetCodeGenInfo which by default returns ConstantPointerNull and emitting addrspacecast instruction. A virtual member function getNullPointerValue is added to TargetInfo which by default returns 0. Each target can override these virtual functions to get target specific null pointer and the null pointer value for specific address space, and perform specific translations for addrspacecast. Wrapper functions getNullPointer is added to CodegenModule and getTargetNullPointerValue is added to ASTContext to facilitate getting the target specific null pointers and their values. This change has no effect on other targets except amdgcn target. Other targets can provide support of non-zero null pointer in a similar way. This change only provides support for non-zero null pointer for C and OpenCL. Supporting for other languages will be added later incrementally. Differential Revision: https://reviews.llvm.org/D26196 llvm-svn: 289252	2016-12-09 19:01:11 +00:00
Alexey Bader	a60db59d6f	[OpenCL] Added a LIT test for ensuring address space mangling is done the same both in OpenCL1.2 and OpenCL2.0. Patch by Egor Churaev (echuraev). Reviewers: Anastasia Subscribers: yaxunl, cfe-commits, bader Differential Revision: https://reviews.llvm.org/D27403 llvm-svn: 288891	2016-12-07 08:43:49 +00:00
Alexey Bader	b3190829e5	[OpenCL] Fix SPIR version generation. Patch by Egor Churaev (echuraev). Reviewers: Anastasia Subscribers: bader, yaxunl, cfe-commits Differential Revision: https://reviews.llvm.org/D27300 llvm-svn: 288890	2016-12-07 08:38:24 +00:00
Anastasia Stulova	e4a1c38109	[OpenCL] Prevent generation of globals in non-constant AS for OpenCL. Avoid using shortcut for const qualified non-constant address space aggregate variables while generating them on the stack such that the alloca object is used instead of a global variable containing initializer. Review: https://reviews.llvm.org/D27109 llvm-svn: 288163	2016-11-29 17:01:19 +00:00
Konstantin Zhuravlyov	62ae8f671c	[AMDGPU] Change frexp.exp builtin to return i16 for f16 input Differential Revision: https://reviews.llvm.org/D26863 llvm-svn: 287390	2016-11-18 22:31:51 +00:00
Stanislav Mekhanoshin	cd433d2811	[AMDGPU] Add wave barrier builtin The wave barrier represents the discardable barrier. Its main purpose is to carry convergent attribute, thus preventing illegal CFG optimizations. All lanes in a wave come to convergence point simultaneously with SIMT, thus no special instruction is needed in the ISA. The barrier is discarded during code generation. Differential Revision: https://reviews.llvm.org/D26584 llvm-svn: 287006	2016-11-15 18:58:03 +00:00
Anastasia Stulova	0df4ac3f94	[OpenCL] Fix for integer parameters of enqueue_kernel Make handling integer parameters more flexible: - For the number of events argument allow to pass larger integers than 32 bits as soon as compiler can prove that the range fits in 32 bits. If not, the diagnostic will be given. - Change type of the arguments specifying the sizes of the corresponding block arguments to be size_t. Review: https://reviews.llvm.org/D26509 llvm-svn: 286849	2016-11-14 17:39:58 +00:00
Anastasia Stulova	2b46120a09	[OpenCL] Change to clk_event parameter in enqueue_kernel. - Accept NULL pointer as a valid parameter value for clk_event. - Generate clk_event_t arguments of internal __enqueue_kernel_XXX function as pointers in generic address space. Review: https://reviews.llvm.org/D26507 llvm-svn: 286836	2016-11-14 15:34:01 +00:00
Pekka Jaaskelainen	5136dd81ad	Fix r286819 (accidentally patched multiple times. llvm-svn: 286821	2016-11-14 13:14:38 +00:00
Pekka Jaaskelainen	2a1cc587bf	[OpenCL] always use SPIR address spaces for kernel_arg_addr_space MD It doesn't make sense to use the target's address space ids in this context as this is metadata that should be referring to the "logical" OpenCL address spaces. For flat AS machines like all "CPUs" in general, the logical AS info gets lost as there's only one address space (0). This commit changes the logic such that we always use the SPIR address space ids for the argument metadata. It thus allows implementing the clGetKernelArgInfo() and the other detection needs. https://reviews.llvm.org/D26157 llvm-svn: 286819	2016-11-14 13:08:30 +00:00
Renato Golin	6a051ba614	Revert "Improve handling of floating point literals in OpenCL to only use double precision if the target supports fp64." This reverts commit r286815, as it broke all ARM and AArch64 bots. llvm-svn: 286818	2016-11-14 12:19:18 +00:00
Neil Hickey	f603672b5c	Improve handling of floating point literals in OpenCL to only use double precision if the target supports fp64. This change makes sure single-precision floating point types are used if the cl_fp64 extension is not supported by the target. Also removed the check to see whether the OpenCL version is >= 1.2, as this has been incorporated into the extension setting code. Differential Revision: https://reviews.llvm.org/D24235 llvm-svn: 286815	2016-11-14 11:15:51 +00:00
Konstantin Zhuravlyov	81a78bb864	[AMDGPU] Add f16 builtin functions (VI+) Differential Revision: https://reviews.llvm.org/D26476 llvm-svn: 286741	2016-11-13 02:37:05 +00:00
NAKAMURA Takumi	5a8949caa2	clang/test/CodeGenOpenCL/convergent.cl: Satisfy -Asserts with "opt -instnamer". llvm-svn: 285733	2016-11-01 20:08:17 +00:00
Yaxun Liu	7d07ae7c85	[OpenCL] Mark group functions as convergent in opencl-c.h Certain OpenCL builtin functions are supposed to be executed by all threads in a work group or sub group. Such functions should not be made divergent during transformation. It makes sense to mark them with convergent attribute. The adding of convergent attribute is based on Ettore Speziale's work and the original proposal and patch can be found at https://www.mail-archive.com/cfe-commits@lists.llvm.org/msg22271.html. Differential Revision: https://reviews.llvm.org/D25343 llvm-svn: 285725	2016-11-01 18:45:32 +00:00
Alexey Bader	abdcfc1809	[OpenCL] Setting constant address space for array initializers Summary: Setting constant address space for global constants used for memcpy-initialization of arrays. Patch by Alexey Sotkin. Reviewers: bader, yaxunl, Anastasia Subscribers: cfe-commits, AlexeySotkin Differential Revision: https://reviews.llvm.org/D25305 llvm-svn: 285557	2016-10-31 10:26:31 +00:00
Yaxun Liu	a91da4ba47	[OpenCL] Allow partial initializer for array and struct Currently Clang allows partial initializer for C99 but not for OpenCL, e.g. float a[16][16] = {1.0f, 2.0f}; is allowed in C99 but not allowed in OpenCL. This patch fixes that. Differential Revision: https://reviews.llvm.org/D25335 llvm-svn: 283891	2016-10-11 15:53:28 +00:00
Yaxun Liu	ea6b796e0e	[OpenCL] Fix bug in __builtin_astype causing invalid LLVM cast instructions __builtin_astype is used to cast OpenCL opaque types to other types, as such, it needs to be able to handle casting from and to pointer types correctly. Current it cannot handle 1) casting between pointers of different addr spaces 2) casting between pointer type and non-pointer types. This patch fixes that. Differential Revision: https://reviews.llvm.org/D25123 llvm-svn: 283114	2016-10-03 14:41:50 +00:00
Konstantin Zhuravlyov	5b48d725a0	[AMDGPU] Expose flat work group size, register and wave control attributes __attribute__((amdgpu_flat_work_group_size(<min>, <max>))) - request minimum and maximum flat work group size __attribute__((amdgpu_waves_per_eu(<min>[, <max>]))) - request minimum and/or maximum waves per execution unit Differential Revision: https://reviews.llvm.org/D24513 llvm-svn: 282371	2016-09-26 01:02:57 +00:00
Alexey Bader	465c18973d	[OpenCL] Augment pipe built-ins with pipe packet size and alignment. Reviewers: Anastasia, vpykhtin Subscribers: dmitry, cfe-commits Differential Revision: https://reviews.llvm.org/D23992 llvm-svn: 282252	2016-09-23 14:20:00 +00:00
Neil Hickey	eb62b17d8f	Reverting r281714 due to causing an assert when calling builtins that expect a double, from CL llvm-svn: 281899	2016-09-19 11:42:14 +00:00
Neil Hickey	ddfb093b72	Improve handling of floating point literals in OpenCL to only use double precision if the target supports fp64 https://reviews.llvm.org/D24235 llvm-svn: 281714	2016-09-16 10:15:06 +00:00
Yaxun Liu	d3e85b98be	AMDGPU: Fix target options fp32/64-denormals Fix target options for fp32/64-denormals so that +fp64-denormals is set if fp64 is supported -fp32-denormals if fp32 denormals is not supported, or -cl-denorms-are-zero is set +fp32-denormals if fp32 denormals is supported and -cl-denorms-are-zero is not set If target feature fp32/64-denormals is explicitly set, they will override default options and options deduced from -cl-denorms-are-zero. Differential Revision: https://reviews.llvm.org/D24512 llvm-svn: 281357	2016-09-13 17:37:09 +00:00
Alexey Bader	af17c7959e	[OpenCL] Fix pipe built-in functions return type. By default return type of call expressions calling built-in functions is set to bool. Fixes https://llvm.org/bugs/show_bug.cgi?id=30219. Reviewers: Anastasia Subscribers: dmitry, cfe-commits, yaxunl Differential Revision: https://reviews.llvm.org/D24136 llvm-svn: 280800	2016-09-07 10:32:03 +00:00
Alexey Bader	3e0b817b91	[OpenCL] Remove access qualifiers on images in arg info metadata. Summary: Remove access qualifiers on images in arg info metadata: * kernel_arg_type * kernel_arg_base_type Image access qualifiers are inseparable from type in clang implementation, but OpenCL spec provides a special query to get access qualifier via clGetKernelArgInfo with CL_KERNEL_ARG_ACCESS_QUALIFIER. Besides that OpenCL conformance test_api get_kernel_arg_info expects image types without access qualifier. Patch by Evgeniy Tyurin. Reviewers: bader, yaxunl, Anastasia Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D23915 llvm-svn: 280699	2016-09-06 10:10:28 +00:00
Matt Arsenault	88d7da01ca	AMDGPU: Handle structs directly in AMDGPUABIInfo Structs are currently handled as pointer + byval, which makes AMDGPU LLVM backend generate incorrect code when structs are used. This patch changes struct argument to be handled directly and without flattening, which Clover (Mesa 3D Gallium OpenCL state tracker) will be able to handle. Flattening would expand the struct to individual elements and pass each as a separate argument, which Clover can not handle. Furthermore, such expansion does not fit the OpenCL programming model which requires to explicitely specify each argument index, size and memory location. Patch by Vedran Miletić llvm-svn: 279463	2016-08-22 19:25:59 +00:00
Valery Pykhtin	4b5d9d16d3	[AMDGPU] add s_incperflevel/s_decperflevel builtins Differential revision: https://reviews.llvm.org/D23668 llvm-svn: 279235	2016-08-19 12:54:31 +00:00
Yaxun Liu	26f7566ff8	Re-commit [OpenCL] AMDGCN: Fix size_t type There was a premature cast to pointer type in emitPointerArithmetic which caused assertion in tests with assertion enabled. llvm-svn: 279206	2016-08-19 05:17:25 +00:00
Changpeng Fang	03bdd8f797	AMDGPU: Add clang builtin for ds_swizzle. Summary: int __builtin_amdgcn_ds_swizzle (int a, int imm); while imm is a constant. Differential Revision: http://reviews.llvm.org/D23682 llvm-svn: 279165	2016-08-18 22:04:54 +00:00
Yaxun Liu	dea5ccb04b	Revert [OpenCL] AMDGCN: Fix size_t type due to regressions in test/CodeGen/exprs.c on certain platforms. llvm-svn: 279127	2016-08-18 20:01:06 +00:00
Yaxun Liu	6305f8a351	[OpenCL] AMDGCN: Fix size_t type Pointers of certain GPUs in AMDGCN target in private address space is 32 bit but pointers in other address spaces are 64 bit. size_t type should be defined as 64 bit for these GPUs so that it could hold pointers in all address spaces. Also fixed issues in pointer arithmetic codegen by using pointer specific intptr type. Differential Revision: https://reviews.llvm.org/D23361 llvm-svn: 279121	2016-08-18 19:34:04 +00:00
Joey Gouly	b95e36027f	[OpenCL] Fix typo in test that I accidentally introduced in my previous commit. llvm-svn: 278235	2016-08-10 16:04:14 +00:00
Joey Gouly	ddbda40245	[OpenCL] Change block descriptor address space to constant. The block descriptor is a GlobalVariable in the LLVM IR, so it shouldn't be in the private address space. llvm-svn: 278234	2016-08-10 15:57:02 +00:00
Yaxun Liu	ffb60901fe	[OpenCL] Handle -cl-fp32-correctly-rounded-divide-sqrt Let the driver pass the option to frontend. Do not set precision metadata for division instructions when this option is set. Set function attribute "correctly-rounded-divide-sqrt-fp-math" based on this option. Differential Revision: https://reviews.llvm.org/D22940 llvm-svn: 278155	2016-08-09 20:10:18 +00:00
Yaxun Liu	2c17e82bc7	[OpenCL][AMDGPU] Add support for -cl-denorms-are-zero Adjust target features for amdgcn target when -cl-denorms-are-zero is set. Denormal support is controlled by feature strings fp32-denormals fp64-denormals in amdgcn target. If -cl-denorms-are-zero is not set and the command line does not set fp32/64-denormals feature string, +fp32-denormals +fp64-denormals will be on for GPU's supporting them. A new virtual function virtual void TargetInfo::adjustTargetOptions(const CodeGenOptions &CGOpts, TargetOptions &TargetOpts) const is introduced to allow adjusting target option by codegen option. Differential Revision: https://reviews.llvm.org/D22815 llvm-svn: 278151	2016-08-09 19:43:38 +00:00
Wei Ding	91c8450967	AMDGPU : Add Clang builtin intrinsics for compare with the full wavefront result. Differential Revision: http://reviews.llvm.org/D22934 llvm-svn: 277824	2016-08-05 15:38:46 +00:00
Yaxun Liu	c8acb4f37b	[OpenCL] Add the lit test for image size which was omitted by r277647. llvm-svn: 277756	2016-08-04 19:35:17 +00:00
Alexey Bader	d81623261a	[OpenCL] Added underscores to the names of 'to_addr' OpenCL built-ins. Summary: In order to re-define OpenCL built-in functions 'to_{private,local,global}' in OpenCL run-time library LLVM names must be different from the clang built-in function names. Reviewers: yaxunl, Anastasia Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D23120 llvm-svn: 277743	2016-08-04 18:06:27 +00:00
Yaxun Liu	0bc4b2d337	[OpenCL] Generate opaque type for sampler_t and function call for the initializer Currently Clang use int32 to represent sampler_t, which have been a source of issue for some backends, because in some backends sampler_t cannot be represented by int32. They have to depend on kernel argument metadata and use IPA to find the sampler arguments and global variables and transform them to target specific sampler type. This patch uses opaque pointer type opencl.sampler_t* for sampler_t. For each use of file-scope sampler variable, it generates a function call of __translate_sampler_initializer. For each initialization of function-scope sampler variable, it generates a function call of __translate_sampler_initializer. Each builtin library can implement its own __translate_sampler_initializer(). Since the real sampler type tends to be architecture dependent, allowing it to be initialized by a library function simplifies backend design. A typical implementation of __translate_sampler_initializer could be a table lookup of real sampler literal values. Since its argument is always a literal, the returned pointer is known at compile time and easily optimized to finally become some literal values directly put into image read instructions. This patch is partially based on Alexey Sotkin's work in Khronos Clang (`3d4eec6162`). Differential Revision: https://reviews.llvm.org/D21567 llvm-svn: 277024	2016-07-28 19:26:30 +00:00
Yaxun Liu	37ceedeabd	[OpenCL] AMDGCN target will generate images in constant address space Allows AMDGCN target to generate images (such as %opencl.image2d_t) in constant address space. Images will still be generated in global address space by default. Added tests to existing opencl-types.cl in test\CodeGenOpenCL. Patch by Aaron En Ye Shi. Differential Revision: https://reviews.llvm.org/D22523 llvm-svn: 276161	2016-07-20 19:21:11 +00:00
David Majnemer	24547108d6	Let FuncAttrs infer the 'returned' argument attribute This reverts commit r275756. llvm-svn: 276014	2016-07-19 19:59:24 +00:00
Yaxun Liu	f2e8ab2566	[OpenCL] Fixes bug of missing OCL version metadata on the AMDGCN target Added the opencl.ocl.version metadata to be emitted with amdgcn. Created a static function emitOCLVerMD which is shared between triple spir and target amdgcn. Also added new testcases to existing test file, spir_version.cl inside test/CodeGenOpenCL. Patch by Aaron En Ye Shi. Differential Revision: https://reviews.llvm.org/D22424 llvm-svn: 276010	2016-07-19 19:39:45 +00:00
NAKAMURA Takumi	966bde50c3	Revert r275678, "Revert "Revert r275027 - Let FuncAttrs infer the 'returned' argument attribute"" This reverts also r275029, "Update Clang tests after adding inference for the returned argument attribute" It broke LTO build. Seems miscompilation. llvm-svn: 275756	2016-07-18 03:23:25 +00:00
Hal Finkel	81cdef31e6	Revert "Revert r275029 - Update Clang tests after adding inference for the returned argument attribute" This reverts commit r275043 after reapplying the underlying LLVM commit. llvm-svn: 275679	2016-07-16 07:22:09 +00:00
Matt Arsenault	c7536a5d60	AMDGPU: Remove legacy ldexp builtin llvm-svn: 275623	2016-07-15 21:33:06 +00:00
Matt Arsenault	c86671da09	AMDGPU: Update for rsq intrinsic changes llvm-svn: 275622	2016-07-15 21:33:02 +00:00
Wei Ding	ea41f356bb	AMDGPU: Add Clang Builtin for v_lerp_u8 Differential Revision: http://reviews.llvm.org/D22380 llvm-svn: 275577	2016-07-15 16:43:03 +00:00
Alexey Bader	10e9e59898	[OpenCL] Fix code generation of kernel pipe parameters. Improved test with user define structure pipe type case. Reviewers: Anastasia, pxli168 Subscribers: yaxunl, cfe-commits Differential revision: http://reviews.llvm.org/D21744 llvm-svn: 275259	2016-07-13 10:28:13 +00:00
Hal Finkel	9a17d7ac6e	Revert r275029 - Update Clang tests after adding inference for the returned argument attribute The associated backend change is causing miscompiles from the AArch64 backend. llvm-svn: 275043	2016-07-11 04:52:07 +00:00
Jan Vesely	d7e03a5bd9	AMDGPU: Export workitem builtins Reviewers: tstellardAMD Differential Revision: http://reviews.llvm.org/D20299 llvm-svn: 275030	2016-07-10 22:38:04 +00:00

1 2 3 4

192 Commits