llvm-project

Commit Graph

Author	SHA1	Message	Date
Petar Jovanovic	c6d9b04cc3	[mips] Add runtime options to enable/disable madd.fmt and msub.fmt Add options to clang: -mmadd4 and -mno-madd4, use it to enable or disable generation of madd.fmt and similar instructions respectively, as per GCC. Patch by Stefan Maksimovic. Differential Revision: https://reviews.llvm.org/D33401 llvm-svn: 304929	2017-06-07 17:17:57 +00:00
Florian Hahn	cc524bf6ac	[CodeGen] Add thumb-mode to target-features for arm/thumb triples. Summary: The thumb-mode target feature is used to force Thumb or ARM code generation on a per-function basis. Explicitly adding +thumb-mode to functions for thumbxx triples enables mixed ARM/Thumb code generation in places where compilation units with thumbxx and armxx triples are merged together (e.g. the IR linker or LTO). For armxx triples, -thumb-mode is added in a similar fashion. Reviewers: echristo, t.p.northover, kristof.beyls, rengolin Reviewed By: echristo Subscribers: rinon, aemerson, mehdi_amini, javed.absar, cfe-commits Differential Revision: https://reviews.llvm.org/D33448 llvm-svn: 304897	2017-06-07 11:50:45 +00:00
Javed Absar	15b80a5b23	[ARM] Fix Neon vector type alignment to 64-bit This is restricted version of patch - https://reviews.llvm.org/D33205 that I reverted as it was leading to ABI breaks on darwin etc. This patch restricts the fix to AAPCS (Android remains 128-bit). Reviewed by: Renato Golin, Stephen Hines Differential Revision: https://reviews.llvm.org/D33786 llvm-svn: 304889	2017-06-07 10:02:02 +00:00
Dan Gohman	7dee171f69	[WebAssembly] Set MaxAtomicInlineWidth to 64. The WebAssembly threads proposal has changed such that C++ implementations can now declare that atomics up to 64 bits are "lock free" in C++'s terms. llvm-svn: 304859	2017-06-07 02:22:40 +00:00
Florian Hahn	28f03bbcbb	[ARM] Add support for target("arm") and target("thumb"). Summary: This patch adds support for the target("arm") and target("thumb") attributes, which can be used to force the compiler to generated ARM or Thumb code for a function. In LLVM, ARM or Thumb code generation can be controlled by the thumb-mode target feature. But GCC already uses target("arm") and target("thumb"), so we have to substitute "arm" with -thumb-mode and "thumb" with +thumb-mode. Reviewers: echristo, pcc, kristof.beyls Reviewed By: echristo Subscribers: ahatanak, aemerson, javed.absar, kristof.beyls, cfe-commits Differential Revision: https://reviews.llvm.org/D33721 llvm-svn: 304781	2017-06-06 09:26:15 +00:00
Vedant Kumar	a44a6ac81f	Revert "[AArch64] Add ARMv8.2-A FP16 vefctor intrinsics" This reverts commit r304493. It breaks all the Darwin bots: http://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental_check/37168 Failure: Failing Tests (2): Clang :: CodeGen/aarch64-v8.2a-neon-intrinsics.c Clang :: CodeGen/arm_neon_intrinsics.c llvm-svn: 304509	2017-06-02 01:22:14 +00:00
Abderrazek Zaafrani	a44e5f601d	[AArch64] Add ARMv8.2-A FP16 vefctor intrinsics llvm-svn: 304493	2017-06-01 23:22:29 +00:00
Pekka Jaaskelainen	fc2629a65a	[OpenCL] Makes kernels use the SPIR_KERNEL CC by default. Rationale: OpenCL kernels are called via an explicit runtime API with arguments set with clSetKernelArg(), not as normal sub-functions. Return SPIR_KERNEL by default as the kernel calling convention to ensure the fingerprint is fixed such way that each OpenCL argument gets one matching argument in the produced kernel function argument list to enable feasible implementation of clSetKernelArg() with aggregates etc. In case we would use the default C calling conv here, clSetKernelArg() might break depending on the target-specific conventions; different targets might split structs passed as values to multiple function arguments etc. https://reviews.llvm.org/D33639 llvm-svn: 304389	2017-06-01 07:18:49 +00:00
Javed Absar	089f678784	Reverting Neon vector type 64-alignment fix The patch caused ABI breaks on darwin/others. Reverting to come back with a more restrictive patch. llvm-svn: 304220	2017-05-30 17:09:47 +00:00
Javed Absar	3d92d7ab36	[ARM] Fix Neon vector type alignment to 64-bit The maximum alignment for ARM NEON data types should be 64-bits as specified in ARM procedure call standard document Sec. A.2 Notes. This patch fixes it from its current larger natural default values, except for Android (so as not to break existing ABI). Reviewed by: Stephen Hines, Renato Golin. Differential Revision: https://reviews.llvm.org/D33205 llvm-svn: 304201	2017-05-30 10:12:15 +00:00
Reid Kleckner	a40b38a637	Only define __SIZEOF_FLOAT128__ on x86 as intended in r304012 GCC only defines it on x86. llvm-svn: 304013	2017-05-26 17:48:01 +00:00
Reid Kleckner	9eabbb6066	Enable __float128 for mingw for GCC compatibility and define __SIZEOF_FLOAT128__ on x86 GCC defines __FLOAT128__ on Power and __SIZEOF_FLOAT128__ on x86. We're just following the inconsistency for now so users have some way to test. Effectively merges this patch as requested by Martell Malone: https://github.com/Alexpux/MINGW-packages/blob/master/mingw-w64-clang/0107-enable-__float128-for-X86-MinGW.patch llvm-svn: 304012	2017-05-26 17:38:15 +00:00
Oren Ben Simhon	140c1fb9ec	[X86] Adding avx512_vpopcntdq feature set and its intrinsics AVX512_VPOPCNTDQ is a new feature set that was published by Intel. The patch represents the Clang side of the addition of six intrinsics for two new machine instructions (vpopcntd and vpopcntq). It also includes the addition of the new feature set. Differential Revision: https://reviews.llvm.org/D33170 llvm-svn: 303857	2017-05-25 13:44:11 +00:00
Saleem Abdulrasool	b6e946b164	Basic: fix whitespace in file header (NFC) llvm-svn: 303804	2017-05-24 20:27:09 +00:00
Galina Kistanova	3677c0f1d8	Removed fallthrough annotation which does not directly precede switch label. llvm-svn: 303653	2017-05-23 16:56:10 +00:00
Galina Kistanova	078b301bcc	Added LLVM_FALLTHROUGH to address gcc warning: this statement may fall through. llvm-svn: 303649	2017-05-23 16:33:07 +00:00
Yaxun Liu	af3d4db64b	[AMDGPU] Do not require opencl triple environment for OpenCL A recent change requires opencl triple environment for compiling OpenCL program, which causes regressions in libclc. This patch fixes that. Instead of deducing language based on triple environment, it checks LangOptions. Differential Revision: https://reviews.llvm.org/D33445 llvm-svn: 303644	2017-05-23 16:15:53 +00:00
Yaxun Liu	6d96f16347	CodeGen: Cast alloca to expected address space Alloca always returns a pointer in alloca address space, which may be different from the type defined by the language. For example, in C++ the auto variables are in the default address space. Therefore cast alloca to the expected address space when necessary. Differential Revision: https://reviews.llvm.org/D32248 llvm-svn: 303370	2017-05-18 18:51:09 +00:00
James Y Knight	eb96e44aea	[SPARC] Support 'f' and 'e' inline asm constraints. Patch by Patrick Boettcher. Differential Revision: https://reviews.llvm.org/D29117 llvm-svn: 302913	2017-05-12 16:01:23 +00:00
Simon Pilgrim	3511348dbb	[X86][LWP] Add clang support for LWP instructions. This patch adds support for the the LightWeight Profiling (LWP) instructions which are available on all AMD Bulldozer class CPUs (bdver1 to bdver4). Differential Revision: https://reviews.llvm.org/D32770 llvm-svn: 302418	2017-05-08 12:09:45 +00:00
Weiming Zhao	be380c711c	[ARM] Limit the diagnose when an ISR calls a regular function Summary: When the function is compiled with soft-float or on CPU with no FPU, we don't need to diagnose for a call from an ISR to a regular function. Reviewers: jroelofs, eli.friedman Reviewed By: jroelofs Subscribers: aemerson, rengolin, javed.absar, cfe-commits Differential Revision: https://reviews.llvm.org/D32918 llvm-svn: 302274	2017-05-05 19:25:29 +00:00
Duncan P. N. Exon Smith	16b1ac950b	Darwin: Define __STDC_NO_THREADS__ on Darwin targets Darwin doesn't support C11 threads.h. Define `__STDC_NO_THREADS__` so that users can check for it. rdar://problem/18461003 llvm-svn: 301508	2017-04-27 01:47:22 +00:00
Oliver Stannard	233310f3e6	[ARM,AArch64] Define __ELF__ for arm-none-eabihf and AArch64 This macro is defined for arm-none-eabi as of r266625, but it should also be defined for eabihf and aarch64. llvm-svn: 300549	2017-04-18 13:12:36 +00:00
Eric Christopher	d26d8839d8	When we turn on vsx it should also turn on altivec explicitly, same with disabling it as well as disabling all vsx specific features when turning off altivec. Fixes PR32663. llvm-svn: 300395	2017-04-15 06:15:00 +00:00
Konstantin Zhuravlyov	e668b1cd1e	[AMDGPU][GFX9] Set +fp32-denormals for >=gfx900 unless -cl-denorms-are-zero is set Differential Revision: https://reviews.llvm.org/D31482 llvm-svn: 300306	2017-04-14 05:33:57 +00:00
Yaxun Liu	b34ec829be	[OpenCL] Map default address space to alloca address space For OpenCL, the private address space qualifier is 0 in AST. Before this change, 0 address space qualifier is always mapped to target address space 0. As now target private address space is specified by alloca address space in data layout, address space qualifier 0 needs to be mapped to alloca addr space specified by the data layout. This change has no impact on targets whose alloca addr space is 0. With contributions from Matt Arsenault, Tony Tye and Wen-Heng (Jack) Chung Differential Revision: https://reviews.llvm.org/D31404 llvm-svn: 299965	2017-04-11 17:24:23 +00:00
Yaxun Liu	b122ed9181	[AMDGPU] Temporarily change constant address space from 4 to 2 for the new address space mapping Change constant address space from 4 to 2 for the new address space mapping in Clang. Differential Revision: https://reviews.llvm.org/D31771 llvm-svn: 299691	2017-04-06 19:18:36 +00:00
Eric Christopher	fc6ffede65	Default enable the rtm feature only on skylake and later for now because Intel disabled the feature on some haswell and broadwell processors: http://www.intel.com/content/dam/www/public/us/en/documents/specification-updates/core-m-processor-family-spec-update.pdf the -mrtm option will still work normally. llvm-svn: 298956	2017-03-28 23:03:19 +00:00
Eric Christopher	1e6fedbb83	_CALL_LINUX is only defined on 64-bit ppc linux platforms, not 32-bit. Adjust and add a test for the 32-bit side. llvm-svn: 298778	2017-03-25 19:26:04 +00:00
Yaxun Liu	354097b05d	[AMDGPU] Make AMDGPUTargetInfo::AS private llvm-svn: 298773	2017-03-25 11:34:41 +00:00
Eric Christopher	19d8c1fed2	Update the comment on not yet generated preprocessor defines to remove __LONGDOUBLE128. llvm-svn: 298771	2017-03-25 06:38:57 +00:00
Eric Christopher	ee21410bdc	Add the __LONGDOUBLE128 define for ppc targets that have 128 bit long doubles. llvm-svn: 298770	2017-03-25 06:37:23 +00:00
Eric Christopher	0d36116394	Define __HAVE_BSWAP__ on ppc to match gcc since we support both builtins as well. llvm-svn: 298769	2017-03-25 05:40:13 +00:00
Yaxun Liu	3464f92e23	[AMDGPU] Switch address space mapping by triple environment amdgiz For target environment amdgiz and amdgizcl (giz means Generic Is Zero), AMDGPU will use new address space mapping where generic address space is 0 and private address space is 5. The data layout is also changed correspondingly. Differential Revision: https://reviews.llvm.org/D31210 llvm-svn: 298767	2017-03-25 03:46:25 +00:00
Eric Christopher	2fd2eda0ac	Add the _CALL_LINUX preprocessor define for ppc linux platforms. This typically is only for a new enough linker (bfd >= 2.16.2 or gold), but our ppc suppport post-dates this and it should work on all linux platforms. It is guaranteed to work on all elfv2 platforms. llvm-svn: 298765	2017-03-25 03:33:59 +00:00
Eric Christopher	153dad4ca9	__BIGGEST_ALIGNMENT__ has always been 16 on all power platforms rather than the default of 8 in clang, fix and update tests accordingly. llvm-svn: 298761	2017-03-25 02:55:21 +00:00
Eric Christopher	b70819e6a2	Add preprocessor defines for a bare powerpc64le triple/cpu. The le triple didn't exist until power8, so use that as a default (this also matches what gcc does). llvm-svn: 298759	2017-03-25 02:29:18 +00:00
Eric Christopher	3646e6289d	Move setting of LangOpts based on target flags out of CompilerInstance and into TargetInfo::adjust so that it gets called in more places throughout the compiler (AST serialization in particular). Should fix PPC modules after removing of faltivec. llvm-svn: 298487	2017-03-22 06:36:09 +00:00
Eric Christopher	758aad76d8	Remove the -faltivec alias option and replace it with -maltivec everywhere. The alias was only ever used on darwin and had some issues there, and isn't used in practice much. Also fixes a problem with -mno-altivec not turning off -maltivec. Also add a diagnostic for faltivec/fno-altivec that directs users to use maltivec options and include the altivec.h file explicitly. llvm-svn: 298449	2017-03-21 22:06:18 +00:00
Eric Christopher	74fa24ff27	Turn on HTM on power8 and later (including powerpc64le) since it's available by default on those cpus and configurations. llvm-svn: 298307	2017-03-20 21:12:53 +00:00
Krzysztof Parzyszek	ff7f6675e7	[Hexagon] Recognize hexagonv62 as a valid target CPU llvm-svn: 297778	2017-03-14 20:29:23 +00:00
Matt Arsenault	bf5e3e4391	AMDGPU: Make 0 the private nullptr value We can't actually pretend that 0 is valid for address space 0. r295877 added a workaround to stop allocating user objects there, so we can use 0 as the invalid pointer. Some of the tests seemed to be using private as the non-0 null test address space, so add copies using local to make sure this is still stressed. llvm-svn: 297659	2017-03-13 19:47:53 +00:00
Yaxun Liu	4d86799219	[AMDGPU] Add builtin functions readlane ds_permute mov_dpp Differential Revision: https://reviews.llvm.org/D30551 llvm-svn: 297436	2017-03-10 01:30:46 +00:00
Konstantin Zhuravlyov	d1ba16e762	[DebugInfo] Add address space when creating DIDerivedTypes Differential Revision: https://reviews.llvm.org/D29671 llvm-svn: 297321	2017-03-08 23:56:48 +00:00
Brad Smith	8e55bd5bbd	Set the Int64Type / IntMaxType types correctly for OpenBSD/mips64 llvm-svn: 297098	2017-03-06 23:48:31 +00:00
Eric Christopher	fb834a8278	Migrate all of aarch64-linux-gnu to \01_mcount instead of just when passing along gnueabi as this matches both gcc and what the kernel expects. More of PR27311 llvm-svn: 296490	2017-02-28 17:22:05 +00:00
Brad Smith	3d648b357a	Set ABIs correctly for OpenBSD/arm; soft float and aapcs-linux. llvm-svn: 296430	2017-02-28 03:20:26 +00:00
Matt Arsenault	a0c6dca15b	AMDGPU: Add fmed3 half builtin llvm-svn: 295874	2017-02-22 20:55:59 +00:00
Brad Smith	9aa2bf209b	Hook up OpenBSD AArch64 support llvm-svn: 295786	2017-02-21 23:13:09 +00:00
Simon Dardis	df827a7165	[mips] Define macros related to -mabicalls in the preprocessor Summary: Historically, NetBSD, FreeBSD and OpenBSD have defined the macro ABICALLS in the preprocessor when -mabicalls is in effect. Mainline GCC later defined __mips_abicalls when -mabicalls is in effect. This patch teaches the preprocessor to define these macros when appropriate. NetBSD does not require the ABICALLS macro. This resolves PR/31694. Thanks to Sean Bruno for highlighting this issue! Reviewers: slthakur, seanbruno Reviewed By: seanbruno Subscribers: joerg, brad, emaste, seanbruno, cfe-commits Differential Revision: https://reviews.llvm.org/D29032 llvm-svn: 295728	2017-02-21 16:01:00 +00:00
Brad Smith	0561a5a7fe	Enable support for __float128 in Clang on OpenBSD/X86 /usr/local/include/c++/4.9.4/type_traits:279:39: error: __float128 is not supported on this target llvm-svn: 295635	2017-02-20 03:18:15 +00:00
Matt Arsenault	eb952fd93b	AMDGPU: Add gfx900 and gfx901 processors llvm-svn: 295556	2017-02-18 19:02:41 +00:00
Eric Christopher	32ac5e41ef	Add a definition for __STRUCT_PARM_ALIGN__ for elfv2 and 64-bit darwin platforms to match what other compilers produce. llvm-svn: 295156	2017-02-15 07:50:11 +00:00
Dylan McKay	315edb0216	[AVR] Fix __AVR_xxx macro definitions; authored by Peter Wu Summary: The -mmcu option for GCC sets macros like __AVR_ATmega328P__ (with the trailing underscores), be sure to include these underscores for Clangs -mcpu option. See "AVR Built-in Macros" in https://gcc.gnu.org/onlinedocs/gcc/AVR-Options.html Reviewers: jroelofs, dylanmckay Reviewed By: jroelofs, dylanmckay Subscribers: efriedma, cfe-commits Differential Revision: https://reviews.llvm.org/D29817 llvm-svn: 294869	2017-02-11 21:06:07 +00:00
Eric Christopher	f6ee1f3d69	Temporarily revert "For X86-64 linux and PPC64 linux align int128 to 16 bytes." until we can get better TargetMachine::isCompatibleDataLayout to compare - otherwise we can't code generate existing bitcode without a string equality data layout. This reverts commit r294703. llvm-svn: 294708	2017-02-10 04:35:21 +00:00
Eric Christopher	4855ba8f24	For X86-64 linux and PPC64 linux align int128 to 16 bytes. For other platforms we should find out what they need and likely make the same change, however, a smaller additional change is easier for platforms we know have it specified in the ABI. clang support for r294702 llvm-svn: 294703	2017-02-10 03:32:34 +00:00
George Burgess IV	fc9705679e	Add support for armv7ve flag in clang (PR31358). This is a followup change to add v7ve support to clang for gcc compatibility. Please see r294661. Patch by Manoj Gupta. Differential Revision: https://reviews.llvm.org/D29773 llvm-svn: 294662	2017-02-09 23:30:10 +00:00
Craig Topper	4574226c3f	[X86] Clzero flag addition and inclusion under znver1 1. Adds the command line flag for clzero. 2. Includes the clzero flag under znver1. 3. Defines the macro for clzero. 4. Adds a new file which has the intrinsic definition for clzero instruction. Patch by Ganesh Gopalasubramanian with some additional tests from me. Differential revision: https://reviews.llvm.org/D29386 llvm-svn: 294559	2017-02-09 06:10:14 +00:00
Craig Topper	d2bf7b03e5	[X86] Add -mprefetchwt1/-mno-prefetchwt1 command line options and __PREFETCHWT1__ define to match gcc. llvm-svn: 294424	2017-02-08 08:23:40 +00:00
Craig Topper	204ecffdb4	[X86] Add -msgx/-mno-sgx command line options and __SGX__ define to match gcc. llvm-svn: 294423	2017-02-08 08:23:17 +00:00
Craig Topper	b16cb82c93	[X86] Add -mmpx/-mno-mpx command line options and __MPX__ define to match gcc. llvm-svn: 294419	2017-02-08 07:56:42 +00:00
Craig Topper	8c708cf6bc	[X86] Add -mclwb/-mno-clwb command line arguments and __CLWB__ define to match gcc. In the future, we should also add a clwb intrinsic to the backend, a frontend builtin, and an instrinsic header file. llvm-svn: 294416	2017-02-08 07:36:58 +00:00
Craig Topper	32c959dab3	[X86] Remove 'umip' feature flag. This feature flag indicates that the processor has support for removing certain instructions from user mode software. But the feature flag by itself doesn't indicate if the support is enabled in the OS. The affected instructions aren't even instructions the compiler would emit. So I don't think think this feature flag should be in the compiler. llvm-svn: 294414	2017-02-08 07:13:22 +00:00
Craig Topper	78b4787593	[X86] Add -mclflushopt/-mno-clflushopt command line support and __CLFLUSHOPT__ define to match gcc. llvm-svn: 294411	2017-02-08 06:48:58 +00:00
Craig Topper	7d494ef610	Use LLVM_FALLTHROUGH instead of FALLTHROUGH comments. llvm-svn: 294404	2017-02-08 05:44:30 +00:00
Craig Topper	c31d9e4a01	[X86] Remove PCOMMIT feature support since Intel has deprecated this instruction with no plans to release products with it. Intel's documentation for the deprecation https://software.intel.com/en-us/blogs/2016/09/12/deprecate-pcommit-instruction llvm-svn: 294403	2017-02-08 05:44:28 +00:00
Saleem Abdulrasool	5602709b98	Revert "Basic: match GCC behaviour for SuS macro" This reverts commit SVN r294148. Seems that it was mistaken, and GCC does still define `__unix` and `unix` when in GNU mode. llvm-svn: 294332	2017-02-07 19:00:06 +00:00
Dylan McKay	ecb6e7b83c	Revert "Revert "[AVR] Allow specifying the CPU on the command line"" This reverts commit 7ac30e0f839fdab6d723ce2ef6a5b7a4cf03d150. llvm-svn: 294282	2017-02-07 06:04:18 +00:00
Ulrich Weigand	82a86cb155	[SystemZ] Provide predefined __ARCH__ and __VX__ macros GCC 7 will predefine two new macros on s390x: - __ARCH__ indicates the ISA architecture level - __VX__ indicates that the vector facility is available This adds those macros to clang as well to ensure continued compatibility with GCC. llvm-svn: 294197	2017-02-06 17:04:22 +00:00
Diana Picus	37a2d6d699	Revert "[AVR] Allow specifying the CPU on the command line" This reverts commit r294177. It seems to have broken some buildbots. llvm-svn: 294180	2017-02-06 11:35:42 +00:00
Dylan McKay	8464c9b579	[AVR] Allow specifying the CPU on the command line Summary: This tells clang about all of the different AVR microcontrollers. It also adds code to define the correct preprocessor macros for each device. Reviewers: jroelofs, asl Reviewed By: asl Subscribers: asl, cfe-commits Differential Revision: https://reviews.llvm.org/D28346 llvm-svn: 294177	2017-02-06 09:07:56 +00:00
Dylan McKay	d31534cd3a	[AVR] Add support for the full set of inline asm constraints Summary: Previously the method would simply return false, causing every single inline assembly constraint to trigger a compile error. This adds inline assembly constraint support for the AVR target. This patch is derived from the code in AVRISelLowering::getConstraintType. More details can be found on the AVR-GCC reference wiki http://www.nongnu.org/avr-libc/user-manual/inline_asm.html Reviewers: jroelofs, asl Reviewed By: asl Subscribers: asl, ahatanak, saaadhu, cfe-commits Differential Revision: https://reviews.llvm.org/D28344 llvm-svn: 294176	2017-02-06 09:01:59 +00:00
Saleem Abdulrasool	c5cddc9fa8	Basic: match GCC behaviour for SuS macro GCC does not generate `__unix` nor `unix` macros. The latter already intrudes into the user's namespace and should be avoided. Use the canonical spelling of `__unix__` across all the targets. llvm-svn: 294148	2017-02-05 22:18:15 +00:00
Matt Arsenault	09cca093a3	AMDGPU: Update for changed subtarget feature name llvm-svn: 292838	2017-01-23 22:31:14 +00:00
Petr Hosek	8c69cffe15	[Basic] Remove the 32-bit x86 and ARM targets for Fuchsia The 32-bit architectures are no longer supported by Fuchsia. Differential Revision: https://reviews.llvm.org/D28974 llvm-svn: 292671	2017-01-20 22:53:38 +00:00
Dan Gohman	839f215e19	[WebAssembly] Add minimal support for the new wasm object format triple. llvm-svn: 292269	2017-01-17 21:46:38 +00:00
Joerg Sonnenberger	8450266425	Ensure that clang -pthread creates the right macro. -D_POSIX_THREADS seems to have been a C&P error from old GCC specs for OpenBSD. llvm-svn: 292119	2017-01-16 14:07:24 +00:00
Craig Topper	c45744ad93	AMD family 17h (znver1) enablement Summary: This patch enables the following 1. AMD family 17h architecture using "znver1" tune flag (-march, -mcpu). 2. ISAs that are enabled for "znver1" architecture. 3. Checks ADX isa from cpuid to identify "znver1" flag when -march=native is used. 4. ISAs FMA4, XOP are disabled as they are dropped from amdfam17. 5. For the time being, it uses the btver2 scheduler model. 6. Test file is updated to check this flag. This is linked to llvm review item https://reviews.llvm.org/D28017 Patch by Ganesh Gopalasubramanian. Additional test cases added by Craig Topper. Reviewers: RKSimon, craig.topper Subscribers: cfe-commits, RKSimon, ashutosh.nema, llvm-commits Differential Revision: https://reviews.llvm.org/D28018 llvm-svn: 291544	2017-01-10 06:02:12 +00:00
Justin Lebar	86c4e63ff9	[CUDA] Let NVPTX inherit the host's calling conventions. Summary: When compiling device code, we may still see host code with explicit calling conventions. NVPTX needs to claim that it supports these CCs, so that (a) we don't raise noisy warnings, and (b) we don't break existing code which relies on the existence of these CCs when specializing templates. (If a CC doesn't exist, clang ignores it, so two template specializations which are different only insofar as one specifies a CC are considered identical and therefore are an error if that CC is not supported.) Reviewers: tra Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28323 llvm-svn: 291136	2017-01-05 16:53:38 +00:00
Justin Lebar	b662659355	[CUDA] More correctly inherit primitive types from the host during device compilation. Summary: CUDA lets users share structs between the host and device, so for that and other reasons, primitive types such as ptrdiff_t should be the same on both sides of the compilation. Our code to do this wasn't entirely successful. In particular, we did a bunch of work during the NVPTXTargetInfo constructor, only to override it in the NVPTX{32,64}TargetInfo constructors. It worked well enough on Linux and Mac, but Windows is LLP64, which is different enough to break it. This patch removes the NVPTX{32,64}TargetInfo classes entirely and fixes the bug described above. Reviewers: tra Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28322 llvm-svn: 291135	2017-01-05 16:53:21 +00:00
Dylan McKay	95aa265a6e	[AVR] Revert the functional part of r291083 As Senthil points out, this is unnecessary as we already have these registers in AddlRegNames. llvm-svn: 291090	2017-01-05 07:17:46 +00:00
Dylan McKay	f0f33a8b1a	[AVR] Support r26 through r31 in inline assembly These are synonyms for the X,Y, and Z registers. llvm-svn: 291083	2017-01-05 05:31:12 +00:00
Dylan McKay	924fa3abdc	Add AVR target and toolchain to Clang Summary: Authored by Senthil Kumar Selvaraj This patch adds barebones support in Clang for the (experimental) AVR target. It uses the integrated assembler for assembly, and the GNU linker for linking, as lld doesn't know about the target yet. The DataLayout string is the same as the one in AVRTargetMachine.cpp. The alignment specs look wrong to me, as it's an 8 bit target and all types only need 8 bit alignment. Clang failed with a datalayout mismatch error when I tried to change it, so I left it that way for now. Reviewers: rsmith, dylanmckay, cfe-commits, rengolin Subscribers: rengolin, jroelofs, wdng Differential Revision: https://reviews.llvm.org/D27123 llvm-svn: 291082	2017-01-05 05:20:27 +00:00
Marina Yatsina	c42fd03bf8	[inline-asm]No error for conflict between inputs\outputs and clobber list According to extended asm syntax, a case where the clobber list includes a variable from the inputs or outputs should be an error - conflict. for example: const long double a = 0.0; int main() { char b; double t1 = a; __asm__ ("fucompp": "=a" (b) : "u" (t1), "t" (t1) : "cc", "st", "st(1)"); return 0; } This should conflict with the output - t1 which is st, and st which is st aswell. The patch fixes it. Commit on behald of Ziv Izhar. Differential Revision: https://reviews.llvm.org/D15075 llvm-svn: 290539	2016-12-26 12:23:42 +00:00
Tim Northover	c67803fb14	ARM: define a macro for the FPv5 FPU in ARM mode. FPv5 is in Cortex-M7 and the 64-bit CPUs when running in 32-bit mode. The name is from the Cortex-M7 TRM. llvm-svn: 290268	2016-12-21 20:49:43 +00:00
Yaxun Liu	5b74665a41	Recommit r289979 [OpenCL] Allow disabling types and declarations associated with extensions Fixed undefined behavior due to cast integer to bool in initializer list. llvm-svn: 290056	2016-12-18 05:18:55 +00:00
Yaxun Liu	35f6d66b0d	Revert r289979 due to regressions llvm-svn: 289991	2016-12-16 21:23:55 +00:00
Yaxun Liu	2e8331cab6	[OpenCL] Allow disabling types and declarations associated with extensions Added a map to associate types and declarations with extensions. Refactored existing diagnostic for disabled types associated with extensions and extended it to declarations for generic situation. Fixed some bugs for types associated with extensions. Allow users to use pragma to declare types and functions for supported extensions, e.g. #pragma OPENCL EXTENSION the_new_extension_name : begin // declare types and functions associated with the extension here #pragma OPENCL EXTENSION the_new_extension_name : end Differential Revision: https://reviews.llvm.org/D21698 llvm-svn: 289979	2016-12-16 19:22:08 +00:00
Yaxun Liu	402804b6d6	Re-commit r289252 and r289285, and fix PR31374 llvm-svn: 289787	2016-12-15 08:09:08 +00:00
Nico Weber	7849eeb035	Revert 289252 (and follow-up 289285), it caused PR31374 llvm-svn: 289713	2016-12-14 21:38:18 +00:00
Stephan Bergmann	17c7f70362	Replace APFloatBase static fltSemantics data members with getter functions At least the plugin used by the LibreOffice build (<https://wiki.documentfoundation.org/Development/Clang_plugins>) indirectly uses those members (through inline functions in LLVM/Clang include files in turn using them), but they are not exported by utils/extract_symbols.py on Windows, and accessing data across DLL/EXE boundaries on Windows is generally problematic. Differential Revision: https://reviews.llvm.org/D26671 llvm-svn: 289647	2016-12-14 11:57:17 +00:00
Yaxun Liu	8f66b4b44a	Add support for non-zero null pointer for C and OpenCL In amdgcn target, null pointers in global, constant, and generic address space take value 0 but null pointers in private and local address space take value -1. Currently LLVM assumes all null pointers take value 0, which results in incorrectly translated IR. To workaround this issue, instead of emit null pointers in local and private address space, a null pointer in generic address space is emitted and casted to local and private address space. Tentative definition of global variables with non-zero initializer will have weak linkage instead of common linkage since common linkage requires zero initializer and does not have explicit section to hold the non-zero value. Virtual member functions getNullPointer and performAddrSpaceCast are added to TargetCodeGenInfo which by default returns ConstantPointerNull and emitting addrspacecast instruction. A virtual member function getNullPointerValue is added to TargetInfo which by default returns 0. Each target can override these virtual functions to get target specific null pointer and the null pointer value for specific address space, and perform specific translations for addrspacecast. Wrapper functions getNullPointer is added to CodegenModule and getTargetNullPointerValue is added to ASTContext to facilitate getting the target specific null pointers and their values. This change has no effect on other targets except amdgcn target. Other targets can provide support of non-zero null pointer in a similar way. This change only provides support for non-zero null pointer for C and OpenCL. Supporting for other languages will be added later incrementally. Differential Revision: https://reviews.llvm.org/D26196 llvm-svn: 289252	2016-12-09 19:01:11 +00:00
Dan Gohman	c819d0169d	[WebAssembly] Revert r288447. Revert r288447 which introduced -mdirect. It turns out we don't need a custom flag for this, as the information we need is in the target triple. llvm-svn: 288604	2016-12-03 23:03:52 +00:00
Dan Gohman	d0eed81dc0	[WebAssembly] Add an -mdirect flag for the direct wasm object feature. Add a target flag for enabling the new direct wasm object emission feature. llvm-svn: 288447	2016-12-02 01:12:40 +00:00
Pekka Jaaskelainen	6735448751	Add a little endian variant of TCE. llvm-svn: 287112	2016-11-16 15:22:31 +00:00
Stephen Hines	a89d8ff0ad	Define __ANDROID_API__ when specified as part of an Android target. Summary: This macro should be defined only when the user directly specifies an API level as part of an Android target. For any regular Android target, we leave this macro undefined. Bug: https://llvm.org/bugs/show_bug.cgi?id=30940 Reviewers: eugenis, pirama Subscribers: tberghammer, cfe-commits, pirama, eugenis, danalbert Differential Revision: https://reviews.llvm.org/D26491 llvm-svn: 286543	2016-11-11 01:29:39 +00:00
Douglas Katzman	13f4a91a1f	[Sparc] LLONG is not lock-free atomic on v8 Differential Revision: https://reviews.llvm.org/D26286 llvm-svn: 286376	2016-11-09 15:43:51 +00:00
Craig Topper	6c7ecc5341	[AVX-512] Make VBMI instruction set enabling imply that the BWI instruction set is also enabled. Summary: This is needed to make the v64i8 and v32i16 types legal for the 512-bit VBMI instructions. Fixes PR30912. Reviewers: delena, zvi Subscribers: RKSimon, cfe-commits Differential Revision: https://reviews.llvm.org/D26306 llvm-svn: 286340	2016-11-09 04:51:03 +00:00
Stephen Hines	032a4be29f	Revert "Define __ANDROID_API__ for all Android builds." Summary: This reverts commit a8804ddd9fe71304b28e5b834d134fe93e568ee0. Subscribers: cfe-commits, pirama, eugenis, tberghammer, danalbert Differential Revision: https://reviews.llvm.org/D26422 llvm-svn: 286298	2016-11-08 21:54:49 +00:00
Stephen Hines	270dab2794	Define __ANDROID_API__ for all Android builds. Summary: Bug: https://llvm.org/bugs/show_bug.cgi?id=30940 This macro (along with __ANDROID__) should always be defined for Android targets. We set it to the major (only) version of the Android API being compiled for. The Android version is able to be set as an integer suffix for any valid Android target. Reviewers: danalbert, eugenis Subscribers: cfe-commits, pirama, eugenis, tberghammer, danalbert Differential Revision: https://reviews.llvm.org/D26385 llvm-svn: 286295	2016-11-08 21:23:26 +00:00
Erich Keane	757d317c24	regcall: Implement regcall Calling Conv in clang This patch implements the register call calling convention, which ensures as many values as possible are passed in registers. CodeGen changes were committed in https://reviews.llvm.org/rL284108. Differential Revision: https://reviews.llvm.org/D25204 llvm-svn: 285849	2016-11-02 18:29:35 +00:00
Alexey Bader	0ea075328e	[OpenCL] Override supported OpenCL extensions with -cl-ext option Summary: This patch adds a command line option '-cl-ext' to control a set of supported OpenCL extensions. Option accepts a comma-separated list of extensions prefixed with '+' or '-'. It can be used together with a target triple to override support for some extensions: // spir target supports all extensions, but we want to disable fp64 clang -cc1 -triple spir-unknown-unknown -cl-ext=-cl_khr_fp64 Special 'all' extension allows to enable or disable all possible extensions: // only fp64 will be supported clang -cc1 -triple spir-unknown-unknown -cl-ext=-all,+cl_khr_fp64 Patch by asavonic (Andrew Savonichev). Reviewers: joey, yaxunl Subscribers: yaxunl, bader, Anastasia, cfe-commits Differential Revision: https://reviews.llvm.org/D23712 llvm-svn: 285700	2016-11-01 15:50:52 +00:00
Michael Zuckerman	62f516f590	[x86][inline-asm][clang] accept 'v' constraint Commit on behalf of: Coby Tayree 1.'v' constraint for (x86) non-avx arch imitates the already implemented 'x' constraint, i.e. allows XMM{0-15} & YMM{0-15} depending on the apparent arch & mode (32/64). 2.for the avx512 arch it allows [X,Y,Z]MM{0-31} (mode dependent) This patch applies the needed changes to clang LLVM patch: https://reviews.llvm.org/D25005 Differential Revision: https://reviews.llvm.org/D25005 llvm-svn: 285688	2016-11-01 13:16:44 +00:00
Michael Zuckerman	849a6a5e5a	[x86][inline-asm][AVX512][clang][PART-1] Introducing "k" and "Yk" constraints for extended inline assembly, enabling use of AVX512 masked vectorized instructions. Commit on behalf of mharoush Extending inline assembly support, compatible with GCC as folowing: "k" constraint hints the compiler to select any of AVX512 k0-k7 registers. "Yk" constraint is a subset of "k" excluding k0 which is not allowd to be used as a mask. Reviewer: 1. rnk Differential Revision: https://reviews.llvm.org/D25063 llvm-svn: 285604	2016-10-31 17:23:52 +00:00
Ulrich Weigand	30354ebb00	[SystemZ] Add -march=archX aliases For compatibility with other compilers on the platform, allow specifying levels of the z/Architecture instead of model names with -march. In particular, the following aliases are now supported: -march=arch8 equals -march=z10 -march=arch9 equals -march=z196 -march=arch10 equals -march=zEC12 -march=arch11 equals -march=z13 This parallels the equivalent (and prerequisite) LLVM change in r285577. llvm-svn: 285578	2016-10-31 14:38:05 +00:00
Michael Zuckerman	15604b996f	second attempt at r285565. llvm-svn: 285573	2016-10-31 14:16:57 +00:00
Michael Zuckerman	7beec2e8bf	revert r285563 fail in test CodeGen/avx512-inline-asm-kregisters-basics.c llvm-svn: 285565	2016-10-31 12:49:36 +00:00
Michael Zuckerman	0d26eea609	[x86][inline-asm] Introducing (AVX512) k0-k7 registers for inline-asm usage Commit on behalf of mharoush After LGTM and check all: This patch enables usage of k registers in inline assembly syntax. Adding triple Reviewer: 1. rnk 2. delena Differential Revision: https://reviews.llvm.org/D25011 llvm-svn: 285563	2016-10-31 12:05:41 +00:00
Michael Zuckerman	56c85d2119	Revert reviosion 285555 llvm-svn: 285556	2016-10-31 10:12:36 +00:00
Michael Zuckerman	4fe34fa2ec	[x86][inline-asm] Introducing (AVX512) k0-k7 registers for inline-asm usage Commit on behalf of mharoush After LGTM and check all: This patch enables usage of k registers in inline assembly syntax. Reviewer: 1. rnk 2. delena Differential Revision: https://reviews.llvm.org/D25011 llvm-svn: 285555	2016-10-31 09:37:59 +00:00
Yaxun Liu	e59a0bcf54	AMDGPU: Add missing ISA versions gfx7.x.x and 8.x.x. Patch by Laurent Morichetti. Differential Revision: https://reviews.llvm.org/D25920 llvm-svn: 285211	2016-10-26 16:40:21 +00:00
Petr Hosek	9834fcd7ce	[Basic] Support 32-bit x86 and ARM targets for Fuchsia Fuchsia has experimental support for 32-bit x86 and ARM targets, add them to the list of supported targets. Differential Revision: https://reviews.llvm.org/D25838 llvm-svn: 285038	2016-10-24 22:55:57 +00:00
Albert Gutowski	2a0621e58a	Implement MS _BitScan intrinsics Summary: _BitScan intrinsics (and some others, for example _Interlocked and _bittest) are supposed to work on both ARM and x86. This is an attempt to isolate them, avoiding repeating their code or writing separate function for each builtin. Reviewers: hans, thakis, rnk, majnemer Subscribers: RKSimon, cfe-commits, aemerson Differential Revision: https://reviews.llvm.org/D25264 llvm-svn: 284060	2016-10-12 22:01:05 +00:00
Arnold Schwaighofer	4fc955e669	Declare WinX86_64ABIInfo to satisfy SwiftABI info This is minimal support that allows swift's test cases on non windows platforms to pass. rdar://28738985 llvm-svn: 284032	2016-10-12 18:59:24 +00:00
Albert Gutowski	0fd6e9608e	Move x86-64 builtins from SemaChecking.cpp to BuiltinsX86_64.def Summary: Follow-up to https://reviews.llvm.org/D24598 (separating builtins for x84-64 and i386). Reviewers: hans, thakis, rnk Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D25494 llvm-svn: 284026	2016-10-12 17:28:44 +00:00
Daniel Jasper	1eb779b5ae	Revert "[x86][inline-asm][clang] accept 'v' constraint" This reverts commit r283716. Breaks buildbot: http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto_check/9155/testReport/junit/Clang/CodeGen/x86_inline_asm_v_constraint_c/ llvm-svn: 283743	2016-10-10 11:40:28 +00:00
Michael Zuckerman	fe2b9b4fbf	[x86][inline-asm][clang] accept 'v' constraint Commit in the name of: Coby Tayree 1.'v' constraint for (x86) non-avx arch imitates the already implemented 'x' constraint, i.e. allows XMM{0-15} & YMM{0-15} depending on the apparent arch & mode (32/64). 2.for the avx512 arch it allows [X,Y,Z]MM{0-31} (mode dependent) This patch applies the needed changes to clang LLVM patch: https://reviews.llvm.org/D25005 Differential Revision: D25004 llvm-svn: 283716	2016-10-10 05:45:54 +00:00
Javed Absar	00b7444dc1	[ARM]: Add Cortex-R52 target This patch adds Cortex-R52, the new ARM real-time processor. Cortex-R52 implements the ARMv8-R architecture. llvm-svn: 283543	2016-10-07 12:08:41 +00:00
Mehdi Amini	a0016ec95f	Use StringReg in TargetParser APIs (NFC) llvm-svn: 283527	2016-10-07 08:37:29 +00:00
Petr Hosek	62e1d23986	[Driver] Add driver support for Fuchsia Provide toolchain and tool support for Fuchsia operating system. Fuchsia uses compiler-rt as the runtime library and libc++, libc++abi and libunwind as the C++ standard library. lld is used as a default linker. Differential Revision: https://reviews.llvm.org/D25117 llvm-svn: 283420	2016-10-06 06:08:09 +00:00
Albert Gutowski	f3a0bce155	Separate builtins for x84-64 and i386; implement __mulh and __umulh Summary: We need x86-64-specific builtins if we want to implement some of the MS intrinsics - winnt.h contains definitions of some functions for i386, but not for x86-64 (for example _InterlockedOr64), which means that we cannot treat them as builtins for both i386 and x86-64, because then we have definitions of builtin functions in winnt.h on i386. Reviewers: thakis, majnemer, hans, rnk Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D24598 llvm-svn: 283264	2016-10-04 22:29:49 +00:00
Richard Smith	591390284f	P0035R4: add predefined __STDCPP_DEFAULT_NEW_ALIGNMENT__ macro. By default, we assume that ::operator new provides no more alignment than is necessary for any primitive type, except when we're on a GNU OS, where glibc's malloc guarantees to provide 64-bit alignment on 32-bit systems and 128-bit alignment on 64-bit systems. This can be controlled by the command-line -fnew-alignment flag. llvm-svn: 282974	2016-09-30 22:41:36 +00:00
Artem Belevich	fda9905062	[CUDA] added __nvvm_atom_{sys\|cta}_* builtins. These builtins are available on sm_60+ GPU only. Differential Revision: https://reviews.llvm.org/D24944 llvm-svn: 282609	2016-09-28 17:47:35 +00:00
Nemanja Ivanovic	10e2b5dcaa	[Power9] Builtins for ELF v.2 ABI conformance - front end portion This patch corresponds to review: https://reviews.llvm.org/D24397 It adds the __POWER9_VECTOR__ macro and the -mpower9-vector option along with a number of altivec.h functions (refer to the code review for a list). llvm-svn: 282481	2016-09-27 10:45:22 +00:00
Renato Golin	0201a9e94e	[OpenBSD] Add type sign information for OpenBSD Like NetBSD, OpenBSD prefers having a consistent set of typedefs across the architectures it supports over strictly following the ARM ABIs. The diff below makes sure that clang's view of those types matches OpenBSD's system header files. It also adds a test that checks the relevant types on all OpenBSD platforms that clang works on. Hopefully we can add mips64 and powerpc to that list in the future. Patch by Mark Kettenis <mark.kettenis@xs4all.nl> llvm-svn: 282184	2016-09-22 19:28:20 +00:00
Albert Gutowski	727ab8a803	Add some MS aliases for existing intrinsics Reviewers: thakis, compnerd, majnemer, rsmith, rnk Subscribers: alexshap, cfe-commits Differential Revision: https://reviews.llvm.org/D24330 llvm-svn: 281540	2016-09-14 21:19:43 +00:00
Albert Gutowski	9918cb6573	Reverse commit 281375 (breaks building Chromium) llvm-svn: 281399	2016-09-13 21:24:51 +00:00
Albert Gutowski	ae3fb3113f	Add some MS aliases for existing intrinsics Reviewers: thakis, compnerd, majnemer, rsmith, rnk Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D24330 llvm-svn: 281375	2016-09-13 19:26:42 +00:00
Yaxun Liu	d3e85b98be	AMDGPU: Fix target options fp32/64-denormals Fix target options for fp32/64-denormals so that +fp64-denormals is set if fp64 is supported -fp32-denormals if fp32 denormals is not supported, or -cl-denorms-are-zero is set +fp32-denormals if fp32 denormals is supported and -cl-denorms-are-zero is not set If target feature fp32/64-denormals is explicitly set, they will override default options and options deduced from -cl-denorms-are-zero. Differential Revision: https://reviews.llvm.org/D24512 llvm-svn: 281357	2016-09-13 17:37:09 +00:00
Justin Lebar	5057f17716	[CUDA] Make __GCC_ATOMIC_XXX_LOCK_FREE macros the same on host/device. Summary: This fixes a bug where we were unable to compile the following CUDA file with libstdc++ (didn't try libc++): #include <future> void foo() { std::shared_future<int> x; } The problem is that <future> only defines std::shared_future if __GCC_ATOMIC_INT_LOCK_FREE > 1. When we compiled this file for device, the macro was set to 1, and then the class didn't exist at all. Reviewers: tra Subscribers: cfe-commits, jhen Differential Revision: https://reviews.llvm.org/D24407 llvm-svn: 281089	2016-09-09 20:35:43 +00:00
Matt Arsenault	f333de3752	OpenCL: Defining __ENDIAN_LITTLE__ and fix target endianness OpenCL requires __ENDIAN_LITTLE__ be set for little endian targets. The default for targets was also apparently big endian, so AMDGPU was incorrectly reported as big endian. Set this from the triple so targets don't have another place to set the endianness. llvm-svn: 280787	2016-09-07 07:08:02 +00:00
Ed Schouten	c6d1a730a6	Add support for targeting armv6-unknown-cloudabi-eabihf. I'm in the progress of adding ARMv6 support to CloudABI. On the compiler side, everything seems to work properly with this tiny change applied. llvm-svn: 280672	2016-09-05 18:38:34 +00:00
Niels Ole Salscheider	b28cc458c3	Replace the Radeon GCN GPU family names by more descriptive ones Differential Revision: https://reviews.llvm.org/D23957 llvm-svn: 280587	2016-09-03 07:13:54 +00:00
Derek Schuff	acdc8e6355	[WebAssembly] Change wasm SizeType to match asmjs Summary: We want wasm and asmjs to have matching ABIs, and right now asmjs uses unsigned int for its size_t. This causes exported symbols in libcxx to not match and can cause weird breakage where libcxx doesn't get linked as a result. Long-term we probably want wasm32, wasm64, and asmjs to all use unsigned long, but that would cause unnecessary ABI churn for asmjs so defer that until we can make all the ABI changes at once. Patch by Jacob Gravelle Differential Revision: https://reviews.llvm.org/D24134 llvm-svn: 280420	2016-09-01 22:38:37 +00:00
Niels Ole Salscheider	c37d69f1ee	Basic/Targets.cpp: Add polaris10 and polaris11 gpus Differential Revision: https://reviews.llvm.org/D23746 llvm-svn: 280120	2016-08-30 18:00:22 +00:00
Krzysztof Parzyszek	7cd30bd7e6	[Hexagon] Use handleTargetFeaturesGroup to process target-specific features llvm-svn: 280093	2016-08-30 14:17:10 +00:00
Krzysztof Parzyszek	1f26912e4c	Handle -mlong-calls on Hexagon Differential Revision:://reviews.llvm.org/D22766 llvm-svn: 280089	2016-08-30 13:57:50 +00:00
Yaxun Liu	26f7566ff8	Re-commit [OpenCL] AMDGCN: Fix size_t type There was a premature cast to pointer type in emitPointerArithmetic which caused assertion in tests with assertion enabled. llvm-svn: 279206	2016-08-19 05:17:25 +00:00
Yaxun Liu	dea5ccb04b	Revert [OpenCL] AMDGCN: Fix size_t type due to regressions in test/CodeGen/exprs.c on certain platforms. llvm-svn: 279127	2016-08-18 20:01:06 +00:00
Yaxun Liu	6305f8a351	[OpenCL] AMDGCN: Fix size_t type Pointers of certain GPUs in AMDGCN target in private address space is 32 bit but pointers in other address spaces are 64 bit. size_t type should be defined as 64 bit for these GPUs so that it could hold pointers in all address spaces. Also fixed issues in pointer arithmetic codegen by using pointer specific intptr type. Differential Revision: https://reviews.llvm.org/D23361 llvm-svn: 279121	2016-08-18 19:34:04 +00:00
Guy Blank	294cbbdb66	test commit llvm-svn: 279042	2016-08-18 08:44:33 +00:00
Yaxun Liu	9385680c24	[OpenCL] AMDGPU: add support of cl_khr_subgroups Patch by Aaron En Ye Shi. Differential Revision: https://reviews.llvm.org/D23573 llvm-svn: 278972	2016-08-17 20:39:49 +00:00
Zijiao Ma	56a837238d	Some missing usage of TargetParser. NFC. llvm-svn: 278890	2016-08-17 02:13:33 +00:00
Yaxun Liu	3317446301	[OpenCL] AMDGPU: Add extensions cl_amd_media_ops and cl_amd_media_ops2 Differential Revision: https://reviews.llvm.org/D23322 llvm-svn: 278851	2016-08-16 20:49:49 +00:00
Yaxun Liu	2c17e82bc7	[OpenCL][AMDGPU] Add support for -cl-denorms-are-zero Adjust target features for amdgcn target when -cl-denorms-are-zero is set. Denormal support is controlled by feature strings fp32-denormals fp64-denormals in amdgcn target. If -cl-denorms-are-zero is not set and the command line does not set fp32/64-denormals feature string, +fp32-denormals +fp64-denormals will be on for GPU's supporting them. A new virtual function virtual void TargetInfo::adjustTargetOptions(const CodeGenOptions &CGOpts, TargetOptions &TargetOpts) const is introduced to allow adjusting target option by codegen option. Differential Revision: https://reviews.llvm.org/D22815 llvm-svn: 278151	2016-08-09 19:43:38 +00:00
Yaxun Liu	99444cb860	[OpenCL] Fix size of image type The size of image type is reported incorrectly as size of a pointer to address space 0, which causes error when casting image type to pointers by __builtin_astype. The fix is to get image address space from TargetInfo then report the size accordingly. Differential Revision: https://reviews.llvm.org/D22927 llvm-svn: 277647	2016-08-03 20:38:06 +00:00
Yaxun Liu	3f9e91286e	[OpenCL] Add extension cl_khr_mipmap_image to clang Adding extension cl_khr_mipmap_image to clang's OpenCL Extensions and initiated inside AMDGPU Target. Patch by Aaron En Ye Shi. Differential Revision: https://reviews.llvm.org/D22637 llvm-svn: 277181	2016-07-29 17:52:34 +00:00
Zijiao Ma	33e9521399	[AArch64] Using AArch64TargetParser in Clang. This resubmit r270688 which broke some specific buildbots.That's because there is incorrect indexing problem in the targetparser,and the problem is fixed in r276957. Differential Revision: https://reviews.llvm.org/D21277 llvm-svn: 276958	2016-07-28 06:24:48 +00:00
Matthias Braun	5092dc010f	Basic/Targets.cpp: Reformat aarch64 CPU list. Having 1 entry per line and an alphabetical order is clearer and reduces the risk of invalid merges. llvm-svn: 276931	2016-07-27 22:47:09 +00:00
Pirama Arumuga Nainar	bb846a32e4	Adjust coercion of aggregates on RenderScript Summary: In RenderScript, the size of the argument or return value emitted in the IR is expected to be the same as the size of corresponding qualified type. For ARM and AArch64, the coercion performed by Clang can change the parameter or return value to a type whose size is different (usually larger) than the original aggregate type. Specifically, this can happen in the following cases: - Aggregate parameters of size <= 64 bytes and return values smaller than 4 bytes on ARM - Aggregate parameters and return values smaller than bytes on AArch64 This patch coerces the cases above to an integer array that is the same size and alignment as the original aggregate. A new field is added to TargetInfo to detect a RenderScript target and limit this coercion just to that case. Tests added to test/CodeGen/renderscript.c Reviewers: rsmith Subscribers: aemerson, srhines, llvm-commits Differential Revision: https://reviews.llvm.org/D22822 llvm-svn: 276904	2016-07-27 19:01:51 +00:00

1 2 3 4 5 ...

1473 Commits