match the feature set of the function that they're being called from.
This ensures that we can effectively diagnose some[1] code that would
instead ICE in the backend with a 'failure to select' message.
Example:
__m128d foo(__m128d a, __m128d b) {
return __builtin_ia32_addsubps(b, a);
}
compiled for normal x86_64 via:
clang -target x86_64-linux-gnu -c
would fail to compile in the back end because the normal subtarget
features for x86_64 only include sse2 and the builtin requires sse3.
[1] We're still not erroring on:
__m128i bar(__m128i const *p) { return _mm_lddqu_si128(p); }
where we should fail and error on an always_inline function being
inlined into a function that doesn't support the subtarget features
required.
llvm-svn: 250473
Add support for the `-fdebug-prefix-map=` option as in GCC. The syntax is
`-fdebug-prefix-map=OLD=NEW`. When compiling files from a path beginning with
OLD, change the debug info to indicate that the path starts with NEW. This is
particularly helpful if you are preprocessing in one path and compiling in
another (e.g. for a build cluster with distcc).
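For example (with made-up paths), a distcc worker building under /distcc/build
could emit debug info that refers back to the developer's checkout via:
clang -g -fdebug-prefix-map=/distcc/build/project=/home/user/project -c foo.c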
Note that the linear scan this implies is not as costly as it may seem: the
remapping is normally done once per file, and the map is expected to be small
(1-2 entries), so the cost is roughly linear in the number of input paths.
Addresses PR24619.
llvm-svn: 250094
This fixes a bug where one can take the address of a conditionally
enabled function to drop its enable_if guards. For example:
int foo(int a) __attribute__((enable_if(a > 0, "")));
int (*p)(int) = &foo;
int result = p(-1); // compilation succeeds; calls foo(-1)
Overloading logic has been updated to reflect this change, as well.
Functions with enable_if attributes that are always true are still
allowed to have their address taken.
Differential Revision: http://reviews.llvm.org/D13607
llvm-svn: 250090
Rationale :
// sse3
__m128d test_mm_addsub_pd(__m128d A, __m128d B) {
return _mm_addsub_pd(A, B);
}
// mmx
void shift(__m64 a, __m64 b, int c) {
_mm_slli_pi16(a, c);
_mm_slli_pi32(a, c);
_mm_slli_si64(a, c);
_mm_srli_pi16(a, c);
_mm_srli_pi32(a, c);
_mm_srli_si64(a, c);
_mm_srai_pi16(a, c);
_mm_srai_pi32(a, c);
}
clang -msse3 -mno-mmx file.c -c
For this code we should be able to explicitly turn off MMX
without affecting the compilation of the SSE3 function and then
diagnose and error on compiling the MMX function.
This is a preparatory patch to the actual diagnosis code which is
coming in a future patch. This sets us up to have the correct information
where we need it and verifies that it's being emitted for the backend
to handle.
llvm-svn: 249733
that we can build up an accurate set of features rather than relying on
TargetInfo initialization via handleTargetFeatures to munge the list
of features.
llvm-svn: 249732
Enums without an explicit, fixed, underlying type are implicitly given a
fixed 'int' type for ABI compatibility with MSVC. However, we can
enforce the standard-mandated rules on these types as-if we didn't know
this fact if the tag is not part of a definition.
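A minimal sketch of the case in question (hypothetical code; behavior depends
on the target ABI):
enum E;     // no definition and no explicit underlying type
enum E *p;  // fine everywhere: a pointer does not require a complete type
enum E e;   // needs a complete type: accepted for MSVC ABIs, where E is
            // implicitly 'int'-backed, but diagnosed as incomplete elsewhere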
llvm-svn: 249667
These test updates almost exclusively around the change in behavior
around enum: enums without a definition are considered incomplete except
when targeting MSVC ABIs. Since these tests are interested in the
'incomplete-enum' behavior, restrict them to %itanium_abi_triple.
llvm-svn: 249660
No ABI for C++ currently makes it possible to implement the standard
100% perfectly. We wrongly hid some of our compatible behavior behind
-fms-compatibility instead of tying it to the compiler ABI.
llvm-svn: 249656
With this change, most 'g' options are rejected by CompilerInvocation.
They remain only as Driver options. The new way to request debug info
from cc1 is with "-debug-info-kind={line-tables-only|limited|standalone}"
and "-dwarf-version={2|3|4}". In the absence of a command-line option
to specify Dwarf version, the Toolchain decides it, rather than placing
Toolchain-specific logic in CompilerInvocation.
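For example (illustrative), a cc1 invocation requesting standard ("limited")
debug info as DWARF version 4 now looks like:
clang -cc1 -debug-info-kind=limited -dwarf-version=4 -emit-obj foo.c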
Also fix a bug in the Windows compatibility argument parsing
in which the "rightmost argument wins" principle failed.
Differential Revision: http://reviews.llvm.org/D13221
llvm-svn: 249655
Currently FastISel doesn't know how to select vector bitcasts.
During instruction selection, fast-isel always falls back to SelectionDAG
every time it encounters a vector bitcast.
As a consequence of this, all the 'packed vector shift by immediate count'
test cases in avx2-builtins.c are optimized by the DAGCombiner.
In particular, the DAGCombiner would always fold trivial stack loads of
constant shift counts into the operands of packed shift builtins.
This behavior would start changing as soon as I reapply revision 249121.
That revision would teach x86 fast-isel how to select bitcasts between vector
types of the same size.
As a consequence of that change, fast-isel would less often fall back to
SelectionDAG. More importantly, DAGCombiner would no longer be able to
simplify the code by folding the stack reload of a constant.
No functional change.
llvm-svn: 249142
test that our intrinsics behave the same under -fsigned-char and
-funsigned-char.
This further testing uncovered that AVX-2 has a broken cmpgt for 8-bit
elements, and has for a long time. This is fixed in the same way as
SSE4 handles the case.
The other ISA extensions currently work correctly because they use
specific instruction intrinsics. As soon as they are rewritten in terms
of generic IR, they will need to add these special casts. I've added the
necessary testing to catch this however, so we shouldn't have to chase
it down again.
I considered changing the core typedef to be signed, but that seems like
a bad idea. Notably, it would be an ABI break if anyone is reaching into
the innards of the intrinsic headers and passing __v16qi on an API
boundary. I can't be completely confident that this wouldn't happen due
to a macro expanding in a lambda, etc., so it seems much better to leave
it alone. It also matches GCC's behavior exactly.
A fun side note is that for both GCC and Clang, -funsigned-char really
does change the semantics of __v16qi. To observe this, consider:
% cat x.cc
#include <smmintrin.h>
#include <iostream>
int main() {
__v16qi a = { 1, -1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0};
__v16qi b = _mm_set1_epi8(-1);
std::cout << (int)(a / b)[0] << ", " << (int)(a / b)[1] << '\n';
}
% clang++ -o x x.cc && ./x
-1, 1
% clang++ -funsigned-char -o x x.cc && ./x
0, 1
However, while this may be surprising, both Clang and GCC agree.
Differential Revision: http://reviews.llvm.org/D13324
llvm-svn: 249097
recently when we started using direct conversion to model sign
extension. The __v16qi type we use for SSE v16i8 vectors is defined in
terms of 'char' which may or may not be signed! This causes us to
generate pmovsx and pmovzx depending on the setting of -funsigned-char.
This patch just forms an explicitly signed type and uses that to
formulate the sign extension. While this gets the correct behavior
(which we now verify with the enhanced test) this is just the tip of the
iceberg. Now that I know what to look for, I have found errors of this
sort *throughout* our vector code. Fortunately, this is the only
specific place where I know of users actively having their code
miscompiled by Clang due to this, so I'm keeping the fix for those users
minimal and targeted.
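The fix follows this pattern (a sketch; the typedef name here is illustrative,
not necessarily what the shipped header uses):
typedef signed char __v16qs __attribute__((__vector_size__(16)));
/* Casting through the explicitly signed element type before converting, e.g.
   __builtin_convertvector(__builtin_shufflevector((__v16qs)__V, (__v16qs)__V,
                                                   0, 1, 2, 3), __v4si)
   yields sign extension (pmovsx) regardless of whether plain 'char' is signed. */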
I'll be sending a proper email for discussion of how to fix these
systematically, what the implications are, and just how widely broken
this is... From what I can tell, we have never shipped a correct set of
builtin headers for x86 when users rely on -funsigned-char. Oops.
llvm-svn: 248980
Summary: __nvvm_atom_cas_* returns the old value instead of whether the swap succeeds.
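In other words (a sketch of device-side usage with one of the affected builtins):
static int cas_succeeded(int *p, int expected, int desired) {
  // __nvvm_atom_cas_* returns the value previously stored at *p, not a flag;
  // success has to be derived by the caller
  return __nvvm_atom_cas_gen_i(p, expected, desired) == expected;
}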
Reviewers: eliben, tra
Subscribers: jholewinski, llvm-commits
Differential Revision: http://reviews.llvm.org/D13306
llvm-svn: 248951
This is the clang commit associated with llvm r248887.
This commit changes the interface of the vld[1234], vld[234]lane, and vst[1234],
vst[234]lane ARM neon intrinsics and associates an address space with the
pointer that these intrinsics take. This changes, e.g.,
<2 x i32> @llvm.arm.neon.vld1.v2i32(i8*, i32)
to
<2 x i32> @llvm.arm.neon.vld1.v2i32.p0i8(i8*, i32)
This change ensures that address spaces are fully taken into account in the ARM
target during lowering of interleaved loads and stores.
Differential Revision: http://reviews.llvm.org/D13127
llvm-svn: 248888
This patch corresponds to review:
http://reviews.llvm.org/D13190
Implemented the following interfaces to conform to ELF V2 ABI version 1.1.
vector signed __int128 vec_adde (vector signed __int128, vector signed __int128, vector signed __int128);
vector unsigned __int128 vec_adde (vector unsigned __int128, vector unsigned __int128, vector unsigned __int128);
vector signed __int128 vec_addec (vector signed __int128, vector signed __int128, vector signed __int128);
vector unsigned __int128 vec_addec (vector unsigned __int128, vector unsigned __int128, vector unsigned __int128);
vector signed int vec_addc(vector signed int __a, vector signed int __b);
vector bool char vec_cmpge (vector signed char __a, vector signed char __b);
vector bool char vec_cmpge (vector unsigned char __a, vector unsigned char __b);
vector bool short vec_cmpge (vector signed short __a, vector signed short __b);
vector bool short vec_cmpge (vector unsigned short __a, vector unsigned short __b);
vector bool int vec_cmpge (vector signed int __a, vector signed int __b);
vector bool int vec_cmpge (vector unsigned int __a, vector unsigned int __b);
vector bool char vec_cmple (vector signed char __a, vector signed char __b);
vector bool char vec_cmple (vector unsigned char __a, vector unsigned char __b);
vector bool short vec_cmple (vector signed short __a, vector signed short __b);
vector bool short vec_cmple (vector unsigned short __a, vector unsigned short __b);
vector bool int vec_cmple (vector signed int __a, vector signed int __b);
vector bool int vec_cmple (vector unsigned int __a, vector unsigned int __b);
vector double vec_double (vector signed long long __a);
vector double vec_double (vector unsigned long long __a);
vector bool char vec_eqv(vector bool char __a, vector bool char __b);
vector bool short vec_eqv(vector bool short __a, vector bool short __b);
vector bool int vec_eqv(vector bool int __a, vector bool int __b);
vector bool long long vec_eqv(vector bool long long __a, vector bool long long __b);
vector signed short vec_madd(vector signed short __a, vector signed short __b, vector signed short __c);
vector signed short vec_madd(vector signed short __a, vector unsigned short __b, vector unsigned short __c);
vector signed short vec_madd(vector unsigned short __a, vector signed short __b, vector signed short __c);
vector unsigned short vec_madd(vector unsigned short __a, vector unsigned short __b, vector unsigned short __c);
vector bool long long vec_mergeh(vector bool long long __a, vector bool long long __b);
vector bool long long vec_mergel(vector bool long long __a, vector bool long long __b);
vector bool char vec_nand(vector bool char __a, vector bool char __b);
vector bool short vec_nand(vector bool short __a, vector bool short __b);
vector bool int vec_nand(vector bool int __a, vector bool int __b);
vector bool long long vec_nand(vector bool long long __a, vector bool long long __b);
vector bool char vec_orc(vector bool char __a, vector bool char __b);
vector bool short vec_orc(vector bool short __a, vector bool short __b);
vector bool int vec_orc(vector bool int __a, vector bool int __b);
vector bool long long vec_orc(vector bool long long __a, vector bool long long __b);
vector signed long long vec_sub(vector signed long long __a, vector signed long long __b);
vector signed long long vec_sub(vector bool long long __a, vector signed long long __b);
vector signed long long vec_sub(vector signed long long __a, vector bool long long __b);
vector unsigned long long vec_sub(vector unsigned long long __a, vector unsigned long long __b);
vector unsigned long long vec_sub(vector bool long long __a, vector unsigned long long __b);
vector unsigned long long vec_sub(vector unsigned long long __a, vector bool long long __b);
vector float vec_sub(vector float __a, vector float __b);
unsigned char vec_extract(vector bool char __a, int __b);
signed short vec_extract(vector signed short __a, int __b);
unsigned short vec_extract(vector bool short __a, int __b);
signed int vec_extract(vector signed int __a, int __b);
unsigned int vec_extract(vector bool int __a, int __b);
signed long long vec_extract(vector signed long long __a, int __b);
unsigned long long vec_extract(vector unsigned long long __a, int __b);
unsigned long long vec_extract(vector bool long long __a, int __b);
double vec_extract(vector double __a, int __b);
vector bool char vec_insert(unsigned char __a, vector bool char __b, int __c);
vector signed short vec_insert(signed short __a, vector signed short __b, int __c);
vector bool short vec_insert(unsigned short __a, vector bool short __b, int __c);
vector signed int vec_insert(signed int __a, vector signed int __b, int __c);
vector bool int vec_insert(unsigned int __a, vector bool int __b, int __c);
vector signed long long vec_insert(signed long long __a, vector signed long long __b, int __c);
vector unsigned long long vec_insert(unsigned long long __a, vector unsigned long long __b, int __c);
vector bool long long vec_insert(unsigned long long __a, vector bool long long __b, int __c);
vector double vec_insert(double __a, vector double __b, int __c);
vector signed long long vec_splats(signed long long __a);
vector unsigned long long vec_splats(unsigned long long __a);
vector signed __int128 vec_splats(signed __int128 __a);
vector unsigned __int128 vec_splats(unsigned __int128 __a);
vector double vec_splats(double __a);
int vec_all_eq(vector double __a, vector double __b);
int vec_all_ge(vector double __a, vector double __b);
int vec_all_gt(vector double __a, vector double __b);
int vec_all_le(vector double __a, vector double __b);
int vec_all_lt(vector double __a, vector double __b);
int vec_all_nan(vector double __a);
int vec_all_ne(vector double __a, vector double __b);
int vec_all_nge(vector double __a, vector double __b);
int vec_all_ngt(vector double __a, vector double __b);
int vec_any_eq(vector double __a, vector double __b);
int vec_any_ge(vector double __a, vector double __b);
int vec_any_gt(vector double __a, vector double __b);
int vec_any_le(vector double __a, vector double __b);
int vec_any_lt(vector double __a, vector double __b);
int vec_any_ne(vector double __a, vector double __b);
vector unsigned char vec_sbox_be (vector unsigned char);
vector unsigned char vec_cipher_be (vector unsigned char, vector unsigned char);
vector unsigned char vec_cipherlast_be (vector unsigned char, vector unsigned char);
vector unsigned char vec_ncipher_be (vector unsigned char, vector unsigned char);
vector unsigned char vec_ncipherlast_be (vector unsigned char, vector unsigned char);
vector unsigned int vec_shasigma_be (vector unsigned int, const int, const int);
vector unsigned long long vec_shasigma_be (vector unsigned long long, const int, const int);
vector unsigned short vec_pmsum_be (vector unsigned char, vector unsigned char);
vector unsigned int vec_pmsum_be (vector unsigned short, vector unsigned short);
vector unsigned long long vec_pmsum_be (vector unsigned int, vector unsigned int);
vector unsigned __int128 vec_pmsum_be (vector unsigned long long, vector unsigned long long);
vector unsigned char vec_gb (vector unsigned char);
vector unsigned long long vec_bperm (vector unsigned __int128 __a, vector unsigned char __b);
Removed the following interfaces either because their signatures have changed
in version 1.1 of the ABI or because they were implemented for ELF V2 ABI but
have actually been deprecated in version 1.1.
vector signed char vec_eqv(vector bool char __a, vector signed char __b);
vector signed char vec_eqv(vector signed char __a, vector bool char __b);
vector unsigned char vec_eqv(vector bool char __a, vector unsigned char __b);
vector unsigned char vec_eqv(vector unsigned char __a, vector bool char __b);
vector signed short vec_eqv(vector bool short __a, vector signed short __b);
vector signed short vec_eqv(vector signed short __a, vector bool short __b);
vector unsigned short vec_eqv(vector bool short __a, vector unsigned short __b);
vector unsigned short vec_eqv(vector unsigned short __a, vector bool short __b);
vector signed int vec_eqv(vector bool int __a, vector signed int __b);
vector signed int vec_eqv(vector signed int __a, vector bool int __b);
vector unsigned int vec_eqv(vector bool int __a, vector unsigned int __b);
vector unsigned int vec_eqv(vector unsigned int __a, vector bool int __b);
vector signed long long vec_eqv(vector bool long long __a, vector signed long long __b);
vector signed long long vec_eqv(vector signed long long __a, vector bool long long __b);
vector unsigned long long vec_eqv(vector bool long long __a, vector unsigned long long __b);
vector unsigned long long vec_eqv(vector unsigned long long __a, vector bool long long __b);
vector float vec_eqv(vector bool int __a, vector float __b);
vector float vec_eqv(vector float __a, vector bool int __b);
vector double vec_eqv(vector bool long long __a, vector double __b);
vector double vec_eqv(vector double __a, vector bool long long __b);
vector unsigned short vec_nand(vector bool short __a, vector unsigned short __b);
llvm-svn: 248813
Currently it's 64-bit, which will lead to a mismatch between host and
device code if we compile for i386.
Differential Revision: http://reviews.llvm.org/D13181
llvm-svn: 248753
Currently, the availability of DSP instructions (ACLE 6.4.7) is handled in
a hand-rolled tricky condition block in lib/Basic/Targets.cpp, with a FIXME:
attached.
http://reviews.llvm.org/D12937 moved the handling of the DSP feature over to
ARMTargetParser.def in LLVM, to be in line with other architecture extensions.
This is the corresponding patch to clang, to clear the FIXME: and update
the tests.
Differential Revision: http://reviews.llvm.org/D12938
llvm-svn: 248521
Summary:
Strictly speaking, the MIPS*R2 ISAs should not permit -mnan=2008 since this
feature was added in MIPS*R3. However, other toolchains permit this and we
should do the same.
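For example, an invocation such as the following (illustrative) is now accepted:
clang -target mips-linux-gnu -march=mips32r2 -mnan=2008 -c file.c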
Reviewers: atanasyan
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D13057
llvm-svn: 248481
This commit fixes an assert that is triggered when optnone is being
added to an IR function that is already marked with minsize and optsize.
rdar://problem/22723716
Differential Revision: http://reviews.llvm.org/D13004
llvm-svn: 248191
Currently, the availability of DSP instructions (ACLE 6.4.7) is handled in
a hand-rolled tricky condition block in lib/Basic/Targets.cpp, with a FIXME:
attached.
http://reviews.llvm.org/D12937 moved the handling of +t2dsp over to
ARMTargetParser.def in LLVM, to be in line with other architecture extensions.
This is the corresponding patch to clang, to clear the FIXME: and update
the tests.
Differential Revision: http://reviews.llvm.org/D12938
llvm-svn: 248154
128-bit vector integer sign extensions correctly lower to the pmovsx instructions even for debug builds.
This patch removes the builtins and reimplements the _mm_cvtepi*_epi* intrinsics using __builtin_shufflevector (to extract the bottommost subvector) and __builtin_convertvector (to actually perform the sign extension).
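The reimplementation is roughly of this shape (a sketch, not the exact header
contents; the helper typedefs and the function name here are made up):
#include <emmintrin.h>
typedef signed char __sv16qi __attribute__((__vector_size__(16)));  // hypothetical helper type
typedef int __sv4si __attribute__((__vector_size__(16)));           // hypothetical helper type
static __inline__ __m128i cvtepi8_epi32_sketch(__m128i __V) {
  // take the bottommost four elements, then sign-extend each one to 32 bits
  return (__m128i)__builtin_convertvector(
      __builtin_shufflevector((__sv16qi)__V, (__sv16qi)__V, 0, 1, 2, 3), __sv4si);
}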
Differential Revision: http://reviews.llvm.org/D12835
llvm-svn: 248092
Summary:
This change adds support for `__builtin_ms_va_list`, a GCC extension for
variadic `ms_abi` functions. The existing `__builtin_va_list` support is
inadequate for this because `va_list` is defined differently in the Win64
ABI vs. the System V/AMD64 ABI.
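For illustration (a sketch, not taken from the patch itself), an ms_abi
variadic function can now use the dedicated va_list flavor even when the
default target ABI is System V:
int __attribute__((ms_abi)) sum_ints(int n, ...) {
  __builtin_ms_va_list ap;
  __builtin_ms_va_start(ap, n);
  int total = 0;
  for (int i = 0; i < n; ++i)
    total += __builtin_ms_va_arg(ap, int);
  __builtin_ms_va_end(ap);
  return total;
}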
Depends on D1622.
Reviewers: rsmith, rnk, rjmccall
CC: cfe-commits
Differential Revision: http://reviews.llvm.org/D1623
llvm-svn: 247941
Mingw generally wraps an old copy of msvcrt.dll which has these
personalities, so things should work out, or so I hear. I haven't tested
it.
llvm-svn: 247902
convert i64 to FP and vice versa
reduceps & reducepd
rangeps & rangepd
all in their 512-bit versions
Differential Revision: http://reviews.llvm.org/D11716
llvm-svn: 247881
The current implementation may end up emitting an undefined reference for
an "inline __attribute__((always_inline))" function by generating an
"available_externally alwaysinline" IR function for it and then failing to
inline all the calls. This happens when a call to such a function is in dead
code. As the inliner is an SCC pass, it does not process dead code.
Libc++ relies on the compiler never emitting such undefined reference.
With this patch, we emit a pair of
1. internal alwaysinline definition (called F.alwaysinline)
2a. A stub F() { musttail call F.alwaysinline }
-- or, depending on the linkage --
2b. A declaration of F.
The frontend ensures that F.alwaysinline is only used for direct
calls, and the stub is used for everything else (taking the address of
the function, really). Declaration (2b) is emitted in the case when
"inline" is meant for inlining only (like __gnu_inline__ and some
other cases).
This approach, among other nice properties, ensures that alwaysinline
functions are always internal, making it impossible for a direct call
to such function to produce an undefined symbol reference.
This patch is based on ideas by Chandler Carruth and Richard Smith.
llvm-svn: 247494
The current implementation may end up emitting an undefined reference for
an "inline __attribute__((always_inline))" function by generating an
"available_externally alwaysinline" IR function for it and then failing to
inline all the calls. This happens when a call to such a function is in dead
code. As the inliner is an SCC pass, it does not process dead code.
Libc++ relies on the compiler never emitting such undefined reference.
With this patch, we emit a pair of
1. internal alwaysinline definition (called F.alwaysinline)
2a. A stub F() { musttail call F.alwaysinline }
-- or, depending on the linkage --
2b. A declaration of F.
The frontend ensures that F.alwaysinline is only used for direct
calls, and the stub is used for everything else (taking the address of
the function, really). Declaration (2b) is emitted in the case when
"inline" is meant for inlining only (like __gnu_inline__ and some
other cases).
This approach, among other nice properties, ensures that alwaysinline
functions are always internal, making it impossible for a direct call
to such function to produce an undefined symbol reference.
This patch is based on ideas by Chandler Carruth and Richard Smith.
llvm-svn: 247465
-force-align-stack.
Also, make changes to the driver so that -mno-stack-realign is no longer
an option exposed to the end-user that disallows stack realignment in
the backend.
Differential Revision: http://reviews.llvm.org/D11815
llvm-svn: 247451
It seems that there is a small bug: we can't generate assume loads
when some virtual functions have internal visibility.
This reverts commit 982bb7d966947812d216489b3c519c9825cacbf2.
llvm-svn: 247332
This flag causes the compiler to emit bit set entries for functions as well
as runtime bitset checks at indirect call sites. Depends on the new function
bitset mechanism.
Differential Revision: http://reviews.llvm.org/D11857
llvm-svn: 247238
Generating call assume(icmp %vtable, %global_vtable) after constructor
call for devirtualization purposes.
For more info go to:
http://lists.llvm.org/pipermail/cfe-dev/2015-July/044227.html
Edit:
Fixed version because of PR24479.
This patch previously got reverted because of a ScalarEvolution bug (D12719).
Merged after John McCall's big patch (which added Address).
http://reviews.llvm.org/D11859
llvm-svn: 247199
The tests in test/CodeGen/arm-target-features.c are currently
passing but warning messages are suppressed. These tests are now
synchronized with the corresponding changes in Target Parser.
This patch will fix the regressions in clang caused by r247136
Differential Revision: http://reviews.llvm.org/D12722
llvm-svn: 247138
Summary:
Currently clang provides no general way to generate nontemporal loads/stores.
There are some architecture specific builtins for doing so (e.g. in x86), but
there is no way to generate non-temporal store on, e.g. AArch64. This patch adds
generic builtins which are expanded to a simple store with '!nontemporal'
attribute in IR.
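For example (a sketch), a target-independent streaming store can now be written as:
void stream_store(float *dst, float value) {
  // lowered to an ordinary store carrying the nontemporal hint in IR
  __builtin_nontemporal_store(value, dst);
}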
Differential Revision: http://reviews.llvm.org/D12313
llvm-svn: 247104
instruction used the ReturnValue as pointer operand or value operand. This
led to wrong code gen - in later stages (load-store elision code) the found
store and its operand would be erased, causing ReturnValue to become a <badref>.
The patch adds a check that makes sure that ReturnValue is a pointer operand of
store instruction. Regression test is also added.
This fixes PR24386.
Differential Revision: http://reviews.llvm.org/D12400
llvm-svn: 247003
Introduce an Address type to bundle a pointer value with an
alignment. Introduce APIs on CGBuilderTy to work with Address
values. Change core APIs on CGF/CGM to traffic in Address where
appropriate. Require alignments to be non-zero. Update a ton
of code to compute and propagate alignment information.
As part of this, I've promoted CGBuiltin's EmitPointerWithAlignment
helper function to CGF and made use of it in a number of places in
the expression emitter.
The end result is that we should now be significantly more correct
when performing operations on objects that are locally known to
be under-aligned. Since alignment is not reliably tracked in the
type system, there are inherent limits to this, but at least we
are no longer confused by standard operations like derived-to-base
conversions and array-to-pointer decay. I've also fixed a large
number of bugs where we were applying the complete-object alignment
to a pointer instead of the non-virtual alignment, although most of
these were hidden by the very conservative approach we took with
member alignment.
Also, because IRGen now reliably asserts on zero alignments, we
should no longer be subject to an absurd but frustrating recurring
bug where an incomplete type would report a zero alignment and then
we'd naively do an alignmentAtOffset on it and emit code using an
alignment equal to the largest power-of-two factor of the offset.
We should also now be emitting much more aggressive alignment
attributes in the presence of over-alignment. In particular,
field access now uses alignmentAtOffset instead of min.
Several times in this patch, I had to change the existing
code-generation pattern in order to more effectively use
the Address APIs. For the most part, this seems to be a strict
improvement, like doing pointer arithmetic with GEPs instead of
ptrtoint. That said, I've tried very hard to not change semantics,
but it is likely that I've failed in a few places, for which I
apologize.
ABIArgInfo now always carries the assumed alignment of indirect and
indirect byval arguments. In order to cut down on what was already
a dauntingly large patch, I changed the code to never set align
attributes in the IR on non-byval indirect arguments. That is,
we still generate code which assumes that indirect arguments have
the given alignment, but we don't express this information to the
backend except where it's semantically required (i.e. on byvals).
This is likely a minor regression for those targets that did provide
this information, but it'll be trivial to add it back in a later
patch.
I partially punted on applying this work to CGBuiltin. Please
do not add more uses of the CreateDefaultAligned{Load,Store}
APIs; they will be going away eventually.
llvm-svn: 246985
Apparently there are many cast kinds that may cause implicit pointer
arithmetic to happen. In light of this, the cast ignoring logic
introduced in r246877 has been changed to only ignore a small set of
cast kinds, and a test for this behavior has been added.
Thanks to Richard for catching this before it became a bug report. :)
llvm-svn: 246890
Improvements:
- For all types, we would give up in a case such as:
__builtin_object_size((char*)&foo, N);
even if we could provide an answer to
__builtin_object_size(&foo, N);
We now provide the same answer for both of the above examples in all
cases.
- For type=1|3, we now support subobjects with unknown bases, as long
as the designator is valid.
Thanks to Richard Smith for the review + design planning.
Review: http://reviews.llvm.org/D12169
llvm-svn: 246877
This implements basic support for compiling (though not yet assembling
or linking) for a WebAssembly target. Note that ABI details are not yet
finalized, and may change.
Differential Revision: http://reviews.llvm.org/D12002
llvm-svn: 246814
The ACLE (ARM C Language Extensions) 2.0 allows the __fp16 type to be
used as a function argument or return type (ACLE 1.1 did not).
The current public release of the AAPCS (2.09) states that __fp16 values
should be converted to single-precision before being passed or returned,
but AAPCS 2.10 (to be released shortly) changes this, so that they are
passed in the least-significant 16 bits of either a GPR (for base AAPCS)
or a single-precision register (for AAPCS-VFP). This does not change how
arguments are passed if they get passed on the stack.
This patch brings clang up to compliance with the latest versions of
both of these specs.
We can now set the __ARM_FP16_ARGS ACLE predefine, and we have always
been able to set the __ARM_FP16_FORMAT_IEEE predefine (we do not support
the alternative format).
llvm-svn: 246764
Original commit message:
[ARM] Allow passing/returning of __fp16 arguments
The ACLE (ARM C Language Extensions) 2.0 allows the __fp16 type to be
used as a function argument or return type (ACLE 1.1 did not).
The current public release of the AAPCS (2.09) states that __fp16 values
should be converted to single-precision before being passed or returned,
but AAPCS 2.10 (to be released shortly) changes this, so that they are
passed in the least-significant 16 bits of either a GPR (for base AAPCS)
or a single-precision register (for AAPCS-VFP). This does not change how
arguments are passed if they get passed on the stack.
This patch brings clang up to compliance with the latest versions of
both of these specs.
We can now set the __ARM_FP16_ARGS ACLE predefine, and we have always
been able to set the __ARM_FP16_FORMAT_IEEE predefine (we do not support
the alternative format).
llvm-svn: 246760
The ACLE (ARM C Language Extensions) 2.0 allows the __fp16 type to be
used as a function argument or return type (ACLE 1.1 did not).
The current public release of the AAPCS (2.09) states that __fp16 values
should be converted to single-precision before being passed or returned,
but AAPCS 2.10 (to be released shortly) changes this, so that they are
passed in the least-significant 16 bits of either a GPR (for base AAPCS)
or a single-precision register (for AAPCS-VFP). This does not change how
arguments are passed if they get passed on the stack.
This patch brings clang up to compliance with the latest versions of
both of these specs.
We can now set the __ARM_FP16_ARGS ACLE predefine, and we have always
been able to set the __ARM_FP16_FORMAT_IEEE predefine (we do not support
the alternative format).
llvm-svn: 246755
This patch depends on r246688 (D12341).
The goal is to make LLVM generate different code for these functions for a target that
has cheap branches (see PR23827 for more details):
int foo();
int normal(int x, int y, int z) {
if (x != 0 && y != 0) return foo();
return 1;
}
int crazy(int x, int y) {
if (__builtin_unpredictable(x != 0 && y != 0)) return foo();
return 1;
}
Differential Revision: http://reviews.llvm.org/D12458
llvm-svn: 246699
GCC 4.8+ has a PowerPC-specific intrinsic, __builtin_ppc_get_timebase, to do
what Clang's __builtin_readcyclecounter does. For compatibility with code that
uses GCC's spelling (including glibc), support it as well.
Partially fixes PR23681.
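A minimal usage sketch:
unsigned long long read_timebase(void) {
  // same spelling and behavior as the GCC builtin: read the PPC time base register
  return __builtin_ppc_get_timebase();
}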
llvm-svn: 246510
In release builds labels are numbers. Matching just the number may result
in false matches where the label is contained in other numbers, such as
14 inside [114 x i8]. A stricter match requiring start of line or > character
before the label avoids these false matches.
llvm-svn: 246385
Without this, 64-byte vector types (__m512), specified to be 64-byte
aligned in the AVX512 draft SysV ABI, will only be 32-byte aligned.
This is analogous to AVX, for which we accept 32-byte max alignment.
Differential Revision: http://reviews.llvm.org/D10724
llvm-svn: 246230
There's no point in using a larger alignment if we have no instructions
that would benefit from it.
Differential Revision: http://reviews.llvm.org/D12389
llvm-svn: 246229
A couple of changes here:
a) Do less work in the case where we don't have a target attribute on the
function. We've already canonicalized the attributes for the function -
no need to do more work.
b) Use the newer canonicalized feature adding functions from TargetInfo
to do the work when we do have a target attribute. This enables us to diagnose
some warnings in the case of conflicting written attributes (only ppc does
this today) and also make sure to get all of the features for a cpu that's
listed rather than just change the cpu.
Updated all testcases accordingly and added a new testcase to verify that we'll
error out on ppc if we have some incompatible options using the existing diagnosis
framework there.
llvm-svn: 246195
We agreed for r245605 that, as long as we don't affect -O0 codegen
too much, it's OK to use native constructs rather than intrinsics.
Let's test that, starting with AVX2 here.
See PR24580.
Differential Revision: http://reviews.llvm.org/D12212
llvm-svn: 245987
As discussed in PR23648 - the intrinsics _m_from_int, _m_to_int and _m_prefetch are defined in mmintrin.h and prfchwintrin.h, so we don't need to define them again in Intrin.h
Added tests for _m_from_int and _m_to_int
D11338 already added a test for _m_prefetch
Differential Revision: http://reviews.llvm.org/D12272
llvm-svn: 245975
_rotl, _rotwl and _lrotl (and their right-shift counterparts) are official x86
intrinsics, and should be supported regardless of environment. This is in contrast
to _rotl8, _rotl16, and _rotl64 which are MS-specific.
Note that the MS documentation for _lrotl is different from the Intel
documentation. Intel explicitly documents it as a 64-bit rotate, while for MS,
since sizeof(unsigned long) for MSVC is always 4, a 32-bit rotate is implied.
Differential Revision: http://reviews.llvm.org/D12271
llvm-svn: 245923
This is important in the case that the LLVM-inferred llvm-struct
alignment is not the same as the clang-known C-struct alignment.
Differential Revision: http://reviews.llvm.org/D12243
llvm-svn: 245719
This lets us optimize them better. We agreed to remove the intrinsics,
instead of combining them later, as, at -O0, we generate the expected
instructions. Plus, it's a nice cleanup.
Differential Revision: http://reviews.llvm.org/D10556
llvm-svn: 245605
alignment is ignored, and they always allocate a complete
storage unit.
Also, change the dumping of AST record layouts: use the more
readable C++-style dumping even in C, include bitfield offset
information in the dump, and don't print sizeof/alignof
information for fields of record type, since we don't do so
for bases or other kinds of field.
rdar://22275433
llvm-svn: 245514
__builtin_object_size would return incorrect answers for many uses where
type=3. This fixes the inaccuracy by making us emit 0 instead of LLVM's
objectsize intrinsic.
Additionally, there are many cases where we would emit suboptimal (but
correct) answers, such as when arrays are involved. This patch fixes
some of these cases (please see new tests in test/CodeGen/object-size.c
for specifics on which cases are improved)
Resubmit of r245323 with PR24493 fixed.
Patch mostly by Richard Smith.
Differential Revision: http://reviews.llvm.org/D12000
This fixes PR15212.
llvm-svn: 245403
__builtin_object_size would return incorrect answers for many uses where
type=3. This fixes the inaccuracy by making us emit 0 instead of LLVM's
objectsize intrinsic.
Additionally, there are many cases where we would emit suboptimal (but
correct) answers, such as when arrays are involved. This patch fixes
some of these cases (please see new tests in test/CodeGen/object-size.c
for specifics on which cases are improved)
Patch mostly by Richard Smith.
Differential Revision: http://reviews.llvm.org/D12000
This fixes PR15212.
llvm-svn: 245323
The fix for this is in LLVM but it depends on how clang handles the alias
attribute, so add a test to the clang tests to make sure everything works
together as expected.
Differential Revision: http://reviews.llvm.org/D11980
llvm-svn: 244756
Summary:
float_cast_overflow is the only UBSan check without a source location attached.
This patch propagates SourceLocations where necessary to get them to the
EmitCheck() call.
Reviewers: rsmith, ABataev, rjmccall
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D11757
llvm-svn: 244568
Summary:
NaCl is a platform where long double is the same as double.
Its mangling is spelled with "long double" but its ABI lowering is the same
as double.
Reviewers: rnk, chh
Subscribers: jfb, cfe-commits, dschuff
Differential Revision: http://reviews.llvm.org/D11922
llvm-svn: 244541
A test was recently (r244468) added to cover long double calling convention
codegen, distinguishing between Android and GNU conventions (where long doubles
are fp128 and x86_fp80, respectively). Native Client is a target where long
doubles are the same as doubles. This change augments the test to cover
that case.
Also rename the test to test/CodeGen/X86_64-longdouble.c
Differential Revision: http://reviews.llvm.org/D11921
llvm-svn: 244524
When clang is built with -DLLVM_ENABLE_ASSERTIONS=Off,
it does not create names for IR values.
Differential Revision: http://reviews.llvm.org/D11437
llvm-svn: 244502
These changes are for Android x86_64 targets to be compatible
with current Android g++ and conform to AMD64 ABI.
https://llvm.org/bugs/show_bug.cgi?id=23897
* Return type of long double (fp128) should be fp128, not x86_fp80.
* Vararg of long double (fp128) could be in register and overflowed to memory.
https://llvm.org/bugs/show_bug.cgi?id=24111
* Return value of long double (fp128) _Complex should be in memory like a structure of {fp128,fp128}.
Differential Revision: http://reviews.llvm.org/D11437
llvm-svn: 244468
Function types without prototypes can arise when mangling a function type
within an overloadable function in C. We mangle these as the absence of
any parameter types (not even an empty parameter list).
Differential Revision: http://reviews.llvm.org/D11848
llvm-svn: 244374
Summary:
By default, 'clang' emits dwarf and 'clang-cl' emits codeview. You can
force emission of one or both by passing -gcodeview and -gdwarf to
either driver.
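Illustrative invocations: clang -g -gcodeview -c foo.c (force CodeView from the
clang driver) and clang-cl -gdwarf /c foo.c (force DWARF from the clang-cl driver).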
Reviewers: dblaikie, hans
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D11742
llvm-svn: 244097
Support for emitting libcalls for __atomic_fetch_nand and
__atomic_{add,sub,and,or,xor,nand}_fetch was missing; add it, and some
test cases.
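For example (sketch), these now compile on targets without native support by
emitting calls into libatomic:
void nand_example(int *p) {
  __atomic_fetch_nand(p, 1, __ATOMIC_SEQ_CST);  // returns the old value
  __atomic_nand_fetch(p, 1, __ATOMIC_SEQ_CST);  // returns the new value
}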
Differential Revision: http://reviews.llvm.org/D10847
llvm-svn: 244063
Update testcases after LLVM change r243774.
Most of these had no need to check `tag:` field, but did so as a way of
getting to the `name:` field. In a few cases I've converted the `tag:`
checks to `arg:` or `CHECK-NOT: arg:`.
llvm-svn: 243775
This patch adds support for the System Z vector built-in functions.
The API-defined header file has the name vecintrin.h.
The user-level functions are defined in the same style as the clang
version of altivec.h, making heavy use of the __overloadable__ and
__always_inline__ attributes. Where possible the functions expand to
generic operations rather than specific built-in functions, in the hope
that that form can be optimised better.
Where a built-in routine is specified to require an immediate integer
argument, the __enable_if__ attribute is used to verify the argument is
in fact constant and in the appropriate range.
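As a rough illustration of that style (a hypothetical declaration, not the
actual header contents, assuming the z vector extension is enabled):
static inline __attribute__((__overloadable__, __always_inline__))
vector signed char
vec_sld_sketch(vector signed char __a, vector signed char __b, int __c)
    __attribute__((__enable_if__(__c >= 0 && __c <= 15,
                                 "argument must be a constant integer in [0, 15]")));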
Based on a patch by Richard Sandiford.
llvm-svn: 243643
The z13 vector facility has an associated language extension,
closely modeled on AltiVec/VSX. The main differences are:
- vector long, vector float and vector pixel are not supported
- vector long long and vector double are supported (like VSX)
- comparison operators return a vector rather than a scalar integer
- shift operators behave like the OpenCL shift operators
- vector bool is only supported as argument to certain operators;
some operators allow mixing a bool with a non-bool vector
This patch adds clang support for the extension. It is closely modelled
on the AltiVec support. Similarly to the -faltivec option, there's a
new -fzvector option to enable the extensions (as well as an -mzvector
alias for compatibility with GCC). There's also a separate LangOpt.
The extension as implemented here is intended to be compatible with
the -mzvector extension recently implemented by GCC.
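For example (illustrative):
clang -target s390x-linux-gnu -march=z13 -fzvector -c file.c
(or -mzvector in place of -fzvector for GCC compatibility).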
Based on a patch by Richard Sandiford.
Differential Revision: http://reviews.llvm.org/D11001
llvm-svn: 243642
This will be used for old targets like Android that do not
support ELF TLS models.
Differential Revision: http://reviews.llvm.org/D10524
llvm-svn: 243441
The 3DNOW/PRFCHW cpu targets define both the PREFETCHW (set cache line modified) and PREFETCH (set cache line exclusive) instructions but only the _m_prefetchw (PREFETCHW) intrinsic is included in the header. This patch adds the missing _m_prefetch intrinsic.
I'm basing this off AMD documentation - the intel docs on the support for PREFETCHW isn't clear whether Silvermont/Broadwell properly support PREFETCH but given that the intrinsic implementation is a default __builtin_prefetch call, it is safe whatever.
Fix for PR23648
Differential Revision: http://reviews.llvm.org/D11338
llvm-svn: 243305
__builtin_frame_address requires its argument to be a constant
expression which already implies that it cannot have undefined behavior.
However, we used EmitScalarExpr to emit the argument causing UBSan to
try to check for overflow.
Instead, use the constant expression emission system.
This fixes PR24256.
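A minimal sketch of the kind of call affected:
void *current_frame(void) {
  // the argument must be a constant expression, so it is now emitted as a
  // constant rather than via EmitScalarExpr (no UBSan overflow check attached)
  return __builtin_frame_address(0);
}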
llvm-svn: 243206
This is the PS4 counterpart to r229376, which quotes the library name if the
name contains space. It was discovered that if a library name contains both
double-quote and space characters, quoting the name might produce unexpected
results, but we are mostly concerned with a Windows host environment, which
does not allow double-quote or slashes in file/folder names.
Differential Revision: http://reviews.llvm.org/D11275
llvm-svn: 242689
Currently, -save-temps will cause the ObjCARC optimization to be dropped, the
sanitizer passes to run early in the pipeline, and profiling
instrumentation to run twice.
Fix the issue by properly disabling all passes in the optimization
pipeline when generating bitcode output, and by parsing some of the Language
Options even when the input is bitcode so that the passes can be set up
correctly.
llvm-svn: 242565
"-arm-use-movt=0".
This change is needed since backend options do not make it to the backend
when doing LTO and are not capable of changing the behavior of code-gen
passes on a per-function basis.
rdar://problem/21529937
Differential Revision: http://reviews.llvm.org/D11025
llvm-svn: 242368
Revision 224297 modified the behavior of vec_sld for little endian so
that LLVM will generate the correct corresponding vsldoi instruction.
I neglected to update the existing tests, which continued to pass
because they were not specific enough. This patch adds enough
specificity to the tests to make them useful for BE and LE testing of
vec_sld.
llvm-svn: 242313
This patch corresponds to review:
http://reviews.llvm.org/D11184
A number of new interfaces for altivec.h (as mandated by the ABI):
vector float vec_cpsgn(vector float, vector float)
vector double vec_cpsgn(vector double, vector double)
vector double vec_or(vector bool long long, vector double)
vector double vec_or(vector double, vector bool long long)
vector double vec_re(vector double)
vector signed char vec_cntlz(vector signed char)
vector unsigned char vec_cntlz(vector unsigned char)
vector short vec_cntlz(vector short)
vector unsigned short vec_cntlz(vector unsigned short)
vector int vec_cntlz(vector int)
vector unsigned int vec_cntlz(vector unsigned int)
vector signed long long vec_cntlz(vector signed long long)
vector unsigned long long vec_cntlz(vector unsigned long long)
vector signed char vec_nand(vector bool signed char, vector signed char)
vector signed char vec_nand(vector signed char, vector bool signed char)
vector signed char vec_nand(vector signed char, vector signed char)
vector unsigned char vec_nand(vector bool unsigned char, vector unsigned char)
vector unsigned char vec_nand(vector unsigned char, vector bool unsigned char)
vector unsigned char vec_nand(vector unsigned char, vector unsigned char)
vector short vec_nand(vector bool short, vector short)
vector short vec_nand(vector short, vector bool short)
vector short vec_nand(vector short, vector short)
vector unsigned short vec_nand(vector bool unsigned short, vector unsigned short)
vector unsigned short vec_nand(vector unsigned short, vector bool unsigned short)
vector unsigned short vec_nand(vector unsigned short, vector unsigned short)
vector int vec_nand(vector bool int, vector int)
vector int vec_nand(vector int, vector bool int)
vector int vec_nand(vector int, vector int)
vector unsigned int vec_nand(vector bool unsigned int, vector unsigned int)
vector unsigned int vec_nand(vector unsigned int, vector bool unsigned int)
vector unsigned int vec_nand(vector unsigned int, vector unsigned int)
vector signed long long vec_nand(vector bool long long, vector signed long long)
vector signed long long vec_nand(vector signed long long, vector bool long long)
vector signed long long vec_nand(vector signed long long, vector signed long long)
vector unsigned long long vec_nand(vector bool long long, vector unsigned long long)
vector unsigned long long vec_nand(vector unsigned long long, vector bool long long)
vector unsigned long long vec_nand(vector unsigned long long, vector unsigned long long)
vector signed char vec_orc(vector bool signed char, vector signed char)
vector signed char vec_orc(vector signed char, vector bool signed char)
vector signed char vec_orc(vector signed char, vector signed char)
vector unsigned char vec_orc(vector bool unsigned char, vector unsigned char)
vector unsigned char vec_orc(vector unsigned char, vector bool unsigned char)
vector unsigned char vec_orc(vector unsigned char, vector unsigned char)
vector short vec_orc(vector bool short, vector short)
vector short vec_orc(vector short, vector bool short)
vector short vec_orc(vector short, vector short)
vector unsigned short vec_orc(vector bool unsigned short, vector unsigned short)
vector unsigned short vec_orc(vector unsigned short, vector bool unsigned short)
vector unsigned short vec_orc(vector unsigned short, vector unsigned short)
vector int vec_orc(vector bool int, vector int)
vector int vec_orc(vector int, vector bool int)
vector int vec_orc(vector int, vector int)
vector unsigned int vec_orc(vector bool unsigned int, vector unsigned int)
vector unsigned int vec_orc(vector unsigned int, vector bool unsigned int)
vector unsigned int vec_orc(vector unsigned int, vector unsigned int)
vector signed long long vec_orc(vector bool long long, vector signed long long)
vector signed long long vec_orc(vector signed long long, vector bool long long)
vector signed long long vec_orc(vector signed long long, vector signed long long)
vector unsigned long long vec_orc(vector bool long long, vector unsigned long long)
vector unsigned long long vec_orc(vector unsigned long long, vector bool long long)
vector unsigned long long vec_orc(vector unsigned long long, vector unsigned long long)
vector signed char vec_div(vector signed char, vector signed char)
vector unsigned char vec_div(vector unsigned char, vector unsigned char)
vector signed short vec_div(vector signed short, vector signed short)
vector unsigned short vec_div(vector unsigned short, vector unsigned short)
vector signed int vec_div(vector signed int, vector signed int)
vector unsigned int vec_div(vector unsigned int, vector unsigned int)
vector signed long long vec_div(vector signed long long, vector signed long long)
vector unsigned long long vec_div(vector unsigned long long, vector unsigned long long)
vector unsigned char vec_mul(vector unsigned char, vector unsigned char)
vector unsigned int vec_mul(vector unsigned int, vector unsigned int)
vector unsigned long long vec_mul(vector unsigned long long, vector unsigned long long)
vector unsigned short vec_mul(vector unsigned short, vector unsigned short)
vector signed char vec_mul(vector signed char, vector signed char)
vector signed int vec_mul(vector signed int, vector signed int)
vector signed long long vec_mul(vector signed long long, vector signed long long)
vector signed short vec_mul(vector signed short, vector signed short)
vector signed long long vec_mergeh(vector signed long long, vector signed long long)
vector signed long long vec_mergeh(vector signed long long, vector bool long long)
vector signed long long vec_mergeh(vector bool long long, vector signed long long)
vector unsigned long long vec_mergeh(vector unsigned long long, vector unsigned long long)
vector unsigned long long vec_mergeh(vector unsigned long long, vector bool long long)
vector unsigned long long vec_mergeh(vector bool long long, vector unsigned long long)
vector double vec_mergeh(vector double, vector double)
vector double vec_mergeh(vector double, vector bool long long)
vector double vec_mergeh(vector bool long long, vector double)
vector signed long long vec_mergel(vector signed long long, vector signed long long)
vector signed long long vec_mergel(vector signed long long, vector bool long long)
vector signed long long vec_mergel(vector bool long long, vector signed long long)
vector unsigned long long vec_mergel(vector unsigned long long, vector unsigned long long)
vector unsigned long long vec_mergel(vector unsigned long long, vector bool long long)
vector unsigned long long vec_mergel(vector bool long long, vector unsigned long long)
vector double vec_mergel(vector double, vector double)
vector double vec_mergel(vector double, vector bool long long)
vector double vec_mergel(vector bool long long, vector double)
vector signed int vec_pack(vector signed long long, vector signed long long)
vector unsigned int vec_pack(vector unsigned long long, vector unsigned long long)
vector bool int vec_pack(vector bool long long, vector bool long long)
llvm-svn: 242171
add 2 bits to ObjCOrBuiltinID (changed from 11 bits to 13 bits), see discussion in
Add support for new intrinsics that are already covered by the BE.
All the intrinsics are covered by tests.
Differential Revision: http://reviews.llvm.org/D10893
llvm-svn: 242144
Previously, clang/llvm treated inline-asm instructions conservatively,
choosing not to eliminate the instructions or hoisting them out of a loop
even when it was safe to do so. This commit makes changes to attach a
readonly or readnone attribute to an inline-asm instruction, which enables
passes such as LICM and EarlyCSE to move or optimize away the instruction.
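For example (an x86 sketch): the asm statement below has no memory operands, no
"memory" clobber, and is not volatile, so the corresponding IR call can now
carry readnone and be hoisted by LICM or removed by EarlyCSE:
int passthrough(int x) {
  int y;
  __asm__("movl %1, %0" : "=r"(y) : "r"(x));
  return y;
}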
rdar://problem/11358192
Differential Revision: http://reviews.llvm.org/D10546
llvm-svn: 241930
tools/clang/test/CodeGen/packed-nest-unpacked.c contains this test:
struct XBitfield {
unsigned b1 : 10;
unsigned b2 : 12;
unsigned b3 : 10;
};
struct YBitfield {
char x;
struct XBitfield y;
} __attribute((packed));
struct YBitfield gbitfield;
unsigned test7() {
// CHECK: @test7
// CHECK: load i32, i32* getelementptr inbounds (%struct.YBitfield, %struct.YBitfield* @gbitfield, i32 0, i32 1, i32 0), align 4
return gbitfield.y.b2;
}
The "align 4" is actually wrong. Accessing all of "gbitfield.y" as a single
i32 is of course possible, but that still doesn't make it 4-byte aligned as
it remains packed at offset 1 in the surrounding gbitfield object.
This alignment was changed by commit r169489, which also introduced changes
to bitfield access code in CGExpr.cpp. Code before that change used to take
into account *both* the alignment of the field to be accessed within the
current struct, *and* the alignment of that outer struct itself; this logic
was removed by the above commit.
Neglecting to consider both values can cause incorrect code to be generated
(I've seen an unaligned access crash on SystemZ due to this bug).
In order to always use the best known alignment value, this patch removes
the CGBitFieldInfo::StorageAlignment member and replaces it with a
StorageOffset member specifying the offset from the start of the surrounding
struct to the bitfield's underlying storage. This offset can then be combined
with the best-known alignment for a bitfield access lvalue to determine the
alignment to use when accessing the bitfield's storage.
Differential Revision: http://reviews.llvm.org/D11034
llvm-svn: 241916
This patch corresponds to review:
http://reviews.llvm.org/D10972
Fix for the handling of dependent features that are enabled by default
on some CPUs (such as -mvsx, -mpower8-vector).
Also provides a number of new interfaces or fixes existing ones in
altivec.h.
Changed signatures to conform to ABI:
vector short vec_perm(vector signed short, vector signed short, vector unsigned char)
vector int vec_perm(vector signed int, vector signed int, vector unsigned char)
vector long long vec_perm(vector signed long long, vector signed long long, vector unsigned char)
vector signed char vec_sld(vector signed char, vector signed char, const int)
vector unsigned char vec_sld(vector unsigned char, vector unsigned char, const int)
vector bool char vec_sld(vector bool char, vector bool char, const int)
vector unsigned short vec_sld(vector unsigned short, vector unsigned short, const int)
vector signed short vec_sld(vector signed short, vector signed short, const int)
vector signed int vec_sld(vector signed int, vector signed int, const int)
vector unsigned int vec_sld(vector unsigned int, vector unsigned int, const int)
vector float vec_sld(vector float, vector float, const int)
vector signed char vec_splat(vector signed char, const int)
vector unsigned char vec_splat(vector unsigned char, const int)
vector bool char vec_splat(vector bool char, const int)
vector signed short vec_splat(vector signed short, const int)
vector unsigned short vec_splat(vector unsigned short, const int)
vector bool short vec_splat(vector bool short, const int)
vector pixel vec_splat(vector pixel, const int)
vector signed int vec_splat(vector signed int, const int)
vector unsigned int vec_splat(vector unsigned int, const int)
vector bool int vec_splat(vector bool int, const int)
vector float vec_splat(vector float, const int)
Added a VSX path to:
vector float vec_round(vector float)
Added interfaces:
vector signed char vec_eqv(vector signed char, vector signed char)
vector signed char vec_eqv(vector bool char, vector signed char)
vector signed char vec_eqv(vector signed char, vector bool char)
vector unsigned char vec_eqv(vector unsigned char, vector unsigned char)
vector unsigned char vec_eqv(vector bool char, vector unsigned char)
vector unsigned char vec_eqv(vector unsigned char, vector bool char)
vector signed short vec_eqv(vector signed short, vector signed short)
vector signed short vec_eqv(vector bool short, vector signed short)
vector signed short vec_eqv(vector signed short, vector bool short)
vector unsigned short vec_eqv(vector unsigned short, vector unsigned short)
vector unsigned short vec_eqv(vector bool short, vector unsigned short)
vector unsigned short vec_eqv(vector unsigned short, vector bool short)
vector signed int vec_eqv(vector signed int, vector signed int)
vector signed int vec_eqv(vector bool int, vector signed int)
vector signed int vec_eqv(vector signed int, vector bool int)
vector unsigned int vec_eqv(vector unsigned int, vector unsigned int)
vector unsigned int vec_eqv(vector bool int, vector unsigned int)
vector unsigned int vec_eqv(vector unsigned int, vector bool int)
vector signed long long vec_eqv(vector signed long long, vector signed long long)
vector signed long long vec_eqv(vector bool long long, vector signed long long)
vector signed long long vec_eqv(vector signed long long, vector bool long long)
vector unsigned long long vec_eqv(vector unsigned long long, vector unsigned long long)
vector unsigned long long vec_eqv(vector bool long long, vector unsigned long long)
vector unsigned long long vec_eqv(vector unsigned long long, vector bool long long)
vector float vec_eqv(vector float, vector float)
vector float vec_eqv(vector bool int, vector float)
vector float vec_eqv(vector float, vector bool int)
vector double vec_eqv(vector double, vector double)
vector double vec_eqv(vector bool long long, vector double)
vector double vec_eqv(vector double, vector bool long long)
vector bool long long vec_perm(vector bool long long, vector bool long long, vector unsigned char)
vector double vec_round(vector double)
vector double vec_splat(vector double, const int)
vector bool long long vec_splat(vector bool long long, const int)
vector signed long long vec_splat(vector signed long long, const int)
vector unsigned long long vec_splat(vector unsigned long long, const int)
vector bool int vec_sld(vector bool int, vector bool int, const int)
vector bool short vec_sld(vector bool short, vector bool short, const int)
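A brief usage sketch of a couple of the interfaces listed above (the VSX/POWER8
feature requirements noted in the comments are assumptions based on the surrounding
text; function names are illustrative):
#include <altivec.h>
vector float splat_first(vector float v) {
  return vec_splat(v, 0);              /* second operand must be a constant */
}
vector unsigned int eqv_u32(vector unsigned int a, vector unsigned int b) {
  return vec_eqv(a, b);                /* assumed to require -mcpu=power8 */
}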
llvm-svn: 241904
Code in CGCall.cpp that loads up function arguments that need to be
coerced to a different type may in some cases ignore the fact that
the source of the argument is not naturally aligned. This may cause
incorrect code to be generated. In some places in CreateCoercedLoad,
we already have setAlignment calls to address this, but I ran into one
where it was missing, causing wrong code generation on SystemZ.
However, in that location, we do not actually know what alignment of
the source location we can rely on; the callers do not pass anything
to this routine. This is already an issue in other places in
CreateCoercedLoad; and the same problem exists for CreateCoercedStore.
To avoid pessimising code, and to fix the FIXMEs already in place,
this patch also adds an alignment argument to the CreateCoerced*
routines and uses it instead of forcing an alignment of 1. The
callers are changed to pass in the best information they have.
This actually requires changes in a number of existing test cases
since we now get better alignment in many places.
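A schematic illustration of the kind of situation involved (hypothetical, not the
exact case from the bug): an 8-byte packed aggregate whose storage is only 1-byte
aligned, so a coerced integer load of the argument must not assume natural alignment.
struct __attribute__((packed)) S { int a; int b; };  /* 8 bytes, alignment 1 */
long consume(struct S s);                            /* s may be coerced to an i64-sized type */
long call(struct S *p) { return consume(*p); }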
Differential Revision: http://reviews.llvm.org/D11033
llvm-svn: 241898
Move the diagnostic back to codegen so that we can compile ATL on the
self-host bot. We don't actually end up emitting code for the __try, so
the diagnostic won't be hit.
llvm-svn: 241761
This patch adds ObjectFilePCHContainerOperations uses the LLVM backend
to put the contents of a PCH into a __clangast section inside a COFF, ELF,
or Mach-O object file container.
This is done to facilitate module debugging by making it possible to
store the debug info for the types defined by a module alongside the AST.
rdar://problem/20091852
llvm-svn: 241620
"-arm-long-calls".
This change allows using -mlong-calls/-mno-long-calls for LTO and enabling or
disabling long call on a per-function basis.
rdar://problem/21529937
Differential Revision: http://reviews.llvm.org/D9414
llvm-svn: 241565
different function signatures. (Previously clang would emit all block
pointer types with the type of the first block pointer in the compile
unit.)
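For illustration (requires -fblocks; names are illustrative), two block pointers
with different signatures that previously would have shared one debug type:
int (^add_one)(int) = ^(int x) { return x + 1; };
double (^halve)(double) = ^(double x) { return x / 2; };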
rdar://problem/21602473
llvm-svn: 241534
This reverts commit r241244, but restricts SEH support to Win64.
This way, Chromium builds will still fall back on TUs with SEH, and
Clang developers can work on this incrementally upstream while patching
this small predicate locally. It'll also make it easier to review small
fixes.
llvm-svn: 241533
The patch is the same except for the addition of a new test for the
issue that required reverting the dependent llvm commit.
--Original Commit Message--
Pass down the -flto option to the -cc1 job, and from there into the
CodeGenOptions and onto the PassManagerBuilder. This enables gating
the new EliminateAvailableExternally module pass on whether we are
preparing for LTO.
If we are preparing for LTO (e.g. a -flto -c compile), the new pass is not
included as we want to preserve available externally functions for possible
link time inlining.
llvm-svn: 241467
instructions introduced in POWER8.
These are the Clang-related changes for http://reviews.llvm.org/D10704
All builtins are added in altivec.h and guarded with the POWER8_VECTOR macro.
Phabricator review: http://reviews.llvm.org/D10736
llvm-svn: 241293
32-bit finally funclets are intended to be called both directly from the
parent function and indirectly from the EH runtime. Because we aren't
contorting LLVM's X86 prologue to match MSVC's, calling the finally
block directly passes in a different value of EBP than the one that the
runtime provides. We need an adapter thunk to adjust EBP to the expected
value. However, WinEHPrepare already has to solve this problem when
cleanups are not pre-outlined, so we can go ahead and rely on it rather
than duplicating work.
Now we only do the llvm.x86.seh.recoverfp dance for 32-bit SEH filter
functions.
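A minimal shape of the construct in question (hypothetical helper names; requires
-fms-extensions on a Windows target): the __finally funclet is reached both directly
from f and from the EH runtime, so EBP must be adjusted when it is called directly.
void might_fault(void);
void do_cleanup(void);
void f(void) {
  __try {
    might_fault();
  } __finally {
    do_cleanup();
  }
}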
llvm-svn: 241187
This re-lands r236052 and adds support for __exception_code().
In 32-bit SEH, the exception code is not available in eax. It is only
available in the filter function, and now we arrange to load it and
store it into an escaped variable in the parent frame.
As a consequence, we have to disable the "catch i8* null" optimization
on 32-bit and always generate a filter function. We can re-enable the
optimization if we detect an __except block that doesn't use the
exception code, but this probably isn't worth optimizing.
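A sketch of a filter that does use the exception code, so the escaped-variable path
described above is exercised (function name illustrative):
int guarded_read(int *p) {
  __try {
    return *p;
  } __except (__exception_code() == 0xC0000005L) {   /* access violation */
    return -1;
  }
}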
Reviewers: majnemer
Differential Revision: http://reviews.llvm.org/D10852
llvm-svn: 241171
This reinstates part of the hack removed in r233223, by special
casing sse4 as part of the feature additions. The notable change
here is that we consider it only as part of setting the SSE level
and not as part of the actual target features set which handles
setting the rest of the masks.
llvm-svn: 241130
using a string map to canonicalize. Fix up a couple of testcases
that needed changing since we are no longer simply appending features
to the list, but all of their mask dependencies as well.
llvm-svn: 241129
Add intrinsics for the FXSR instructions (FXSAVE/FXSAVE64/FXRSTOR/FXRSTOR64)
These were previously declared in Intrin.h for MSVC compatibility, but now
that we have them implemented, these declarations can be removed.
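A usage sketch, assuming the intrinsics are reachable through the usual x86 intrinsic
headers and that -mfxsr is enabled where required:
#include <x86intrin.h>
static char __attribute__((aligned(16))) fx_area[512];  /* 512-byte, 16-byte aligned save area */
void checkpoint(void) {
  _fxsave(fx_area);
  _fxrstor(fx_area);
}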
llvm-svn: 241053
isTriviallyRecursive is a hack used to bridge a gap between the
expectations that source code assumes and the semantics that LLVM IR can
provide. Specifically, asm labels on functions are treated as an
explicit name for a GlobalObject in Clang but treated like an
output-processing step in GCC. Tweak this hack a little further to emit
calls to library functions instead of emitting an incorrect definition.
The definition in question would have available_externally linkage (this
is OK) but result in a call to itself which will either result in an
infinite loop or stack overflow.
This fixes PR23964.
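A hypothetical reduction of the shape involved (names are illustrative): the asm label
makes the callee resolve to the same symbol as the caller, so an emitted
available_externally body would end up calling itself.
extern int do_work(int) __asm__("work");
extern inline __attribute__((gnu_inline))
int work(int x) { return do_work(x); }   /* at the IR level this body calls "work" */
int use(int x) { return work(x); }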
llvm-svn: 241043
This matches the implementation of the gcc support for the same
feature, including checking the values set up by libgcc at runtime.
The structure looks like this:
unsigned int __cpu_vendor;
unsigned int __cpu_type;
unsigned int __cpu_subtype;
unsigned int __cpu_features[1];
with a set of enums to match the various fields that are filled out after
parsing the output of the cpuid instruction.
This also adds a set of errors checking for valid input (and cpu).
compiler-rt support for this and the other builtins in this family
(__builtin_cpu_init and __builtin_cpu_is) are forthcoming.
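Usage sketch of this builtin family (function name illustrative):
int have_avx2(void) {
  __builtin_cpu_init();
  return __builtin_cpu_supports("avx2");
}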
llvm-svn: 240994
We failed to see that we should have deferred the creation of a type
which references a type currently under construction because of atomic
sugar.
This fixes PR23985.
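A hypothetical reduction in the same spirit (not the exact test from the PR): a type
that refers back to the record under construction through _Atomic sugar.
struct node;
struct node {
  _Atomic(struct node *) next;
  int value;
};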
llvm-svn: 240989
Several tests wouldn't pass when executed on an armv7a_pc_linux triple
due to the non-default arm_aapcs calling convention produced on the
function definitions in the IR output. Account for this with the
application of a little regex.
Patch by Ying Yi.
llvm-svn: 240971
This patch corresponds to review:
http://reviews.llvm.org/D10637
This is the first round of additions of missing builtins listed in the ABI document.
More to come (this builds on what seurer already added). This patch adds:
vector signed long long vec_abs(vector signed long long)
vector double vec_abs(vector double)
vector signed long long vec_add(vector signed long long, vector signed long long)
vector unsigned long long vec_add(vector unsigned long long, vector unsigned long long)
vector double vec_add(vector double, vector double)
vector double vec_and(vector bool long long, vector double)
vector double vec_and(vector double, vector bool long long)
vector double vec_and(vector double, vector double)
vector signed long long vec_and(vector signed long long, vector signed long long)
vector double vec_andc(vector bool long long, vector double)
vector double vec_andc(vector double, vector bool long long)
vector double vec_andc(vector double, vector double)
vector signed long long vec_andc(vector signed long long, vector signed long long)
vector double vec_ceil(vector double)
vector bool long long vec_cmpeq(vector double, vector double)
vector bool long long vec_cmpge(vector double, vector double)
vector bool long long vec_cmpge(vector signed long long, vector signed long long)
vector bool long long vec_cmpge(vector unsigned long long, vector unsigned long long)
vector bool long long vec_cmpgt(vector double, vector double)
vector bool long long vec_cmple(vector double, vector double)
vector bool long long vec_cmple(vector signed long long, vector signed long long)
vector bool long long vec_cmple(vector unsigned long long, vector unsigned long long)
vector bool long long vec_cmplt(vector double, vector double)
vector bool long long vec_cmplt(vector signed long long, vector signed long long)
vector bool long long vec_cmplt(vector unsigned long long, vector unsigned long long)
llvm-svn: 240821
Attribute 'nodebug' means no llvm.dbg.* intrinsics, no !dbg
annotations, and no DISubprogram for the function.
Differential Revision: http://reviews.llvm.org/D10747
llvm-svn: 240747
Integer variants are implemented as atomicrmw or cmpxchg instructions.
Atomic add for floating point (__nvvm_atom_add_gen_f()) is implemented
as a call to an overloaded @llvm.nvvm.atomic.load.add.f32.* LVVM
intrinsic.
Differential Revision: http://reviews.llvm.org/D10666
llvm-svn: 240669
The Microsoft-extension _MoveToCoprocessor and _MoveToCoprocessor2
builtins take the register value to be moved as the first argument,
but the corresponding mcr and mcr2 LLVM intrinsics expect that value
to be the third argument. Handle this as a special case, while still
leaving those intrinsics as generic MSBuiltins. I considered the
alternative of handling these in EmitARMBuiltinExpr, but that does
not work well for the follow-up change that I'm going to make to improve
the error handling for PR22560 -- we need the GetBuiltinType() checks
for ICEArguments, and the ARM version of that code is only used for
Neon intrinsics where the last argument is special and not
checked in the normal way.
llvm-svn: 240462
As specified in the SysV AVX512 ABI drafts. It follows the same scheme
as AVX2:
Arguments of type __m512 are split into eight eightbyte chunks.
The least significant one belongs to class SSE and all the others
to class SSEUP.
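For example (assuming -mavx512f), an __m512 argument and return value now travel in a
single zmm register under this classification:
#include <immintrin.h>
__m512 scale(__m512 v, float f) {
  return _mm512_mul_ps(v, _mm512_set1_ps(f));
}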
This also means we change the OpenMP SIMD default alignment on AVX512.
Based on r240337.
Differential Revision: http://reviews.llvm.org/D9894
llvm-svn: 240338
This patch adds initial support for the -fsanitize=kernel-address flag to Clang.
Right now it's quite restricted: only out-of-line instrumentation is supported, globals are not instrumented, some GCC kasan flags are not supported.
Using this patch I am able to build and boot the KASan tree with LLVMLinux patches from github.com/ramosian-glider/kasan/tree/kasan_llvmlinux.
To disable KASan instrumentation for a certain function, __attribute__((no_sanitize("kernel-address"))) can be used.
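For example (function name illustrative):
__attribute__((no_sanitize("kernel-address")))
void early_init(void) {
  /* code that must not be instrumented */
}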
llvm-svn: 240131
Base type of attribute((mode)) can actually be a vector type.
The patch is to distinguish between base type and base element type.
This fixes http://llvm.org/PR17453.
Differential Revision: http://reviews.llvm.org/D10058
llvm-svn: 240125
This flag controls whether a given sanitizer traps upon detecting
an error. It currently only supports UBSan. The existing flag
-fsanitize-undefined-trap-on-error has been made an alias of
-fsanitize-trap=undefined.
This change also cleans up some awkward behavior around the combination
of -fsanitize-trap=undefined and -fsanitize=undefined. Previously we
would reject command lines containing the combination of these two flags,
as -fsanitize=vptr is not compatible with trapping. This required the
creation of -fsanitize=undefined-trap, which excluded -fsanitize=vptr
(and -fsanitize=function, but this seems like an oversight).
Now, -fsanitize=undefined is an alias for -fsanitize=undefined-trap,
and if -fsanitize-trap=undefined is specified, we treat -fsanitize=vptr
as an "unsupported" flag, which means that we error out if the flag is
specified explicitly, but implicitly disable it if the flag was implied
by -fsanitize=undefined.
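For example, the following combination is now accepted and implicitly drops -fsanitize=vptr:
clang -fsanitize=undefined -fsanitize-trap=undefined -c file.c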
Differential Revision: http://reviews.llvm.org/D10464
llvm-svn: 240105
This patch adds the -fsanitize=safe-stack command line argument for clang,
which enables the Safe Stack protection (see http://reviews.llvm.org/D6094
for the detailed description of the Safe Stack).
This patch is our implementation of the safe stack on top of Clang. The
patches make the following changes:
- Add -fsanitize=safe-stack and -fno-sanitize=safe-stack options to clang
to control safe stack usage (the safe stack is disabled by default).
- Add __attribute__((no_sanitize("safe-stack"))) attribute to clang that can be
used to disable the safe stack for individual functions even when enabled
globally.
Original patch by Volodymyr Kuznetsov and others at the Dependable Systems
Lab at EPFL; updates and upstreaming by myself.
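Example of the per-function opt-out when the feature is enabled globally (function name
illustrative):
clang -fsanitize=safe-stack -c file.c
__attribute__((no_sanitize("safe-stack")))
void hand_written_stack_switcher(void) {
  /* ... */
}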
Differential Revision: http://reviews.llvm.org/D6095
llvm-svn: 239762
in section 10.1, __arm_{w,r}sr{,p,64}.
This includes arm_acle.h definitions with builtins and codegen to support
these, the intrinsics are implemented by generating read/write_register calls
which get appropriately lowered in the backend based on the register string
provided. SemaChecking is also implemented to diagnose invalid parameters.
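A usage sketch; the special-register string below is an illustrative AArch32 coprocessor
encoding, not a prescribed value:
#include <arm_acle.h>
unsigned int read_reg(void) {
  return __arm_rsr("cp15:0:c13:c0:3");
}
void write_reg(unsigned int v) {
  __arm_wsr("cp15:0:c13:c0:3", v);
}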
Differential Revision: http://reviews.llvm.org/D9697
llvm-svn: 239737
Instead, just EvaluateAsInt().
Follow-up to r239549: rsmith points out that isICE() is expensive;
seems like it's not the right concept anyway, as it fails on
`static const' in C, and will actually trigger the assert below on:
test/Sema/inline-asm-validate-x86.c
llvm-svn: 239651
Summary:
In addition to easier syntax, IRBuilder makes sure to set correct
debug locations for newly added instructions (bitcast and
llvm.lifetime itself). This restores the original behavior, which
was modified by r234581 (reapplied as r235553).
Extend one of the tests to check for debug locations.
Test Plan: regression test suite
Reviewers: aadg, dblaikie
Subscribers: cfe-commits, majnemer
Differential Revision: http://reviews.llvm.org/D10418
llvm-svn: 239643
Right now we're ignoring the fpmath attribute since there's no
backend support for a feature like this and to do so would require
checking the validity of the strings and doing general subtarget
feature parsing of valid and invalid features with the target
attribute feature.
llvm-svn: 239582
Modeled after the gcc attribute of the same name, this feature
allows source level annotations to correspond to backend code
generation. In llvm particular parlance, this allows the adding
of subtarget features and changing the cpu for a particular function
based on source level hints.
This has been added into the existing support for function level
attributes without particular verification for any target outside
of whether or not the backend will support the features/cpu given
(similar to section, etc).
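A sketch of the gcc-style source annotations being modeled (function names and feature
strings are illustrative):
__attribute__((target("avx2")))
int fast_path(void) { return 1; }
__attribute__((target("arch=atom")))
int tuned_path(void) { return 2; }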
llvm-svn: 239579
For inline assembly immediate constraints, we currently always use
EmitScalarExpr, instead of directly emitting the constant. When the
overflow sanitizer is enabled, this generates overflow intrinsics
instead of constants.
Instead, emit a constant for constraints that either require an
immediate (e.g. 'I' on X86), or only accepts constants (immediate
or symbolic; i.e., don't accept registers or memory).
Fixes PR19763.
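A hypothetical reduction: with -fsanitize=signed-integer-overflow enabled, the 'I'
operand below must still be emitted as the constant 3 rather than routed through an
overflow-checking intrinsic.
int three(void) {
  int r;
  __asm__ ("movl %1, %0" : "=r"(r) : "I"(1 + 2));
  return r;
}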
Differential Revision: http://reviews.llvm.org/D10255
llvm-svn: 239549
This patch corresponds to review:
http://reviews.llvm.org/D10095
This is for just two instructions and related builtins:
vbpermq
vgbbd
llvm-svn: 239506
CodeGenOptions and onto the PassManagerBuilder. This enables gating
the new EliminateAvailableExternally module pass on whether we are
preparing for LTO.
If we are preparing for LTO (e.g. a -flto -c compile), the new pass is not
included as we want to preserve available externally functions for possible
link time inlining.
llvm-svn: 239481
Based on previous discussion on the mailing list, clang currently lacks support
for C99 partial re-initialization behavior:
Reference: http://lists.cs.uiuc.edu/pipermail/cfe-dev/2013-April/029188.html
Reference: http://www.open-std.org/jtc1/sc22/wg14/www/docs/dr_253.htm
This patch attempts to fix this problem.
Given the following code snippet,
struct P1 { char x[6]; };
struct LP1 { struct P1 p1; };
struct LP1 l = { .p1 = { "foo" }, .p1.x[2] = 'x' };
// this example is adapted from the example for "struct fred x[]" in DR-253;
// currently clang produces in l: { "\0\0x" },
// whereas gcc 4.8 produces { "fox" };
// with this fix, clang will also produce: { "fox" };
Differential Review: http://reviews.llvm.org/D5789
llvm-svn: 239446
This commit adds back the code that seems to have been dropped unintentionally
in r176985.
rdar://problem/13752163
Differential Revision: http://reviews.llvm.org/D10100
llvm-svn: 239426
The parameter types and return type do not need to be volatile just
because the pointer type's pointee type is volatile qualified. This is
an unnecessary pessimization.
llvm-svn: 238892
We catch most of the various other __fp16 implicit conversions to
float, but not this one:
__fp16 a;
int i;
...
a += i;
For which we used to generate something 'fun' like:
%conv = sitofp i32 %i to float
%1 = tail call i16 @llvm.convert.to.fp16.f32(float %conv)
%add = add i16 %0, %1
Instead, when we have an __fp16 LHS and an integer RHS, we should
use float as the result type.
While there, add a bunch of missing tests for mixed
__fp16/integer expressions.
llvm-svn: 238625
Folding IntToPtr or PtrToInt into Loads, due to r238452,
perturbs the mips-varargs test-case.
Patch by Philip Pfaffe!
Differential Revision: http://reviews.llvm.org/D9153
llvm-svn: 238455
Re-land the change r238200, but with modifications in the tests that should
prevent new failures in some environments as reported with the original
change on the mailing list.
llvm-svn: 238253
Note: __declspec is also temporarily enabled when compiling for a CUDA target because there are implementation details relying on __declspec(property) support currently. When those details change, __declspec should be disabled for CUDA targets.
llvm-svn: 238238
On MIPS unsigned int type should not be zero extended but sign-extended.
Patch by Strahinja Petrovic.
Differential Revision: http://reviews.llvm.org/D9198
llvm-svn: 238200
in POWER8.
These are the Clang-related changes for http://reviews.llvm.org/D9081
vadduqm
vaddeuqm
vaddcuq
vaddecuq
vsubuqm
vsubeuqm
vsubcuq
vsubecuq
All builtins are added in altivec.h, and guarded with the POWER8_VECTOR and
powerpc64 macros.
http://reviews.llvm.org/D9903
llvm-svn: 238145
This patch adds support for the following new instructions in the
Power ISA 2.07:
vpksdss
vpksdus
vpkudus
vpkudum
vupkhsw
vupklsw
These instructions are available through the vec_packs, vec_packsu,
vec_unpackh, and vec_unpackl built-in interfaces. These are
lane-sensitive instructions, so the built-ins have different
implementations for big- and little-endian, and the instructions must
be marked as killing the vector swap optimization for now.
The first three instructions perform saturating pack operations. The
fourth performs a modulo pack operation, which means it can be
represented with a vector shuffle, and conversely the appropriate
vector shuffles may cause this instruction to be generated. The other
instructions are only generated via built-in support for now.
I noticed during patch preparation that the macro __VSX__ was not
previously predefined when the power8-vector or direct-move features
are requested. This is an error, and I've corrected that here as
well.
Appropriate tests have been added.
There is a companion patch to llvm for the rest of this support.
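A brief usage sketch of the new interfaces (assuming -mcpu=power8 or -mpower8-vector;
function names are illustrative):
#include <altivec.h>
vector signed int pack_saturating(vector signed long long a, vector signed long long b) {
  return vec_packs(a, b);              /* vpksdss */
}
vector signed long long unpack_high(vector signed int v) {
  return vec_unpackh(v);               /* vupkhsw */
}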
llvm-svn: 237500
Summary:
r235215 enables support in LLVM for legalizing f16 type in the IR. AArch64
already had support for this. r235215 and some backend patches brought support
for ARM, X86, X86-64, Mips and Mips64.
This change exposes the LangOption 'NativeHalfType' in the command line, so the
backend legalization can be used if desired. NativeHalfType is enabled for
OpenCL (current behavior) or if '-fnative-half-type' is set.
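For example (shown as a -cc1 invocation, since the option is exposed as a frontend
language option):
clang -cc1 -triple armv7-linux-gnueabihf -fnative-half-type -emit-llvm file.c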
Reviewers: olista01, steven_wu, ab
Subscribers: cfe-commits, srhines, aemerson
Differential Revision: http://reviews.llvm.org/D9781
llvm-svn: 237406
Previously we were setting LangOptions::GNUInline (which controls whether we
use traditional GNU inline semantics) if the language did not have the C99
feature flag set. The trouble with this is that C++ family languages also
do not have that flag set, so we ended up setting this flag in C++ modes
(and working around it in a few places downstream by also checking CPlusPlus).
The fix is to check whether the C89 flag is set for the target language,
rather than whether the C99 flag is cleared. This also lets us remove most
CPlusPlus checks. We continue to test CPlusPlus when deciding whether to
pre-define the __GNUC_GNU_INLINE__ macro for consistency with GCC.
There is a change in semantics in two other places
where we weren't checking both CPlusPlus and GNUInline
(FunctionDecl::doesDeclarationForceExternallyVisibleDefinition and
FunctionDecl::isInlineDefinitionExternallyVisible), but this change seems to
put us back into line with GCC's semantics (test case: test/CodeGen/inline.c).
While at it, forbid -fgnu89-inline in C++ modes, as GCC doesn't support it,
it didn't have any effect before, and supporting it just makes things more
complicated.
Differential Revision: http://reviews.llvm.org/D9333
llvm-svn: 237299
GetOutputStream() owns the stream it returns pointer to and the
pointer should never be freed by us. When we fail to load and exit
early, unique_ptr still holds the pointer and frees it which leads to
compiler crash when CompilerInstance attempts to free it again.
Added regression test for failed bitcode linking.
Differential Revision: http://reviews.llvm.org/D9625
llvm-svn: 237159
Fix for codegen of static variables declared inside of captured statements. Captured
statements are actually transparent DeclContexts, so we have to skip them when trying
to get a mangled name for statics.
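A hypothetical reduction (requires -fopenmp): the static below sits inside a captured
region, and its mangled name must be derived from the enclosing function.
void count_calls(void) {
#pragma omp parallel
  {
    static int hits = 0;
    ++hits;
  }
}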
Differential Revision: http://reviews.llvm.org/D9522
llvm-svn: 236701
This adds low-level builtins to allow access to all of the z13 vector
instructions. Note that instructions whose semantics can be described
by standard C (including clang extensions) do not get any builtins.
For each instruction whose semantics *cannot* (fully) be described, we
define a builtin named __builtin_s390_<insn> that directly maps to this
instruction. These are intended to be compatible with GCC.
For instructions that also set the condition code, the builtin will take
an extra argument of type "int *" at the end. The integer pointed to by
this argument will be set to the post-instruction CC value.
For many instructions, the low-level builtin is mapped to the corresponding
LLVM IR intrinsic. However, a number of instructions can be represented
in standard LLVM IR without requiring use of a target intrinsic.
Some instructions require immediate integer operands within a certain
range. Those are verified at the Sema level.
Based on a patch by Richard Sandiford.
llvm-svn: 236532
This patch adds support for the z13 architecture type. For compatibility
with GCC, a pair of options -mvx / -mno-vx can be used to selectively
enable/disable use of the vector facility.
When the vector facility is present, we default to the new vector ABI.
This is characterized by two major differences:
- Vector types are passed/returned in vector registers
(except for unnamed arguments of a variable-argument list function).
- Vector types are at most 8-byte aligned.
The reason for the choice of 8-byte vector alignment is that the hardware
is able to efficiently load vectors at 8-byte alignment, and the ABI only
guarantees 8-byte alignment of the stack pointer, so requiring any higher
alignment for vectors would require dynamic stack re-alignment code.
However, for compatibility with old code that may use vector types, when
*not* using the vector facility, the old alignment rules (vector types
are naturally aligned) remain in use.
These alignment rules are not only implemented at the C language level,
but also at the LLVM IR level. This is done by selecting a different
DataLayout string depending on whether the vector ABI is in effect or not.
Based on a patch by Richard Sandiford.
llvm-svn: 236531
Cyclone actually supports all the goodies you'd expect to come with an AArch64
CPU, so it doesn't need its own clause. Also we should probably be testing
these clauses.
llvm-svn: 236349
by erasing the soft-float target feature if the rest of the front
end added it because of defaults or the soft float option.
Add some testing for some of the targets that implement this hack.
llvm-svn: 236179
LLVM r236120 renamed debug info IR constructs to use a `DI` prefix, now
that the `DIDescriptor` hierarchy has been gone for about a week. This
commit was generated using the rename-md-di-nodes.sh upgrade script
attached to PR23080, followed by running clang-format-diff.py on the
`lib/` portion of the patch.
llvm-svn: 236121
This is just the clang-side of 32-bit SEH. LLVM still needs work, and it
will deterministically fail to compile until it's feature complete.
On x86, all outlined handlers have no parameters, but they do implicitly
take the EBP value passed in and use it to address locals of the parent
frame. We model this with llvm.frameaddress(1).
This works (mostly), but __finally block inlining can break it. For now,
we apply the 'noinline' attribute. If we really want to inline __finally
blocks on 32-bit x86, we should teach the inliner how to untangle
frameescape and framerecover.
Promote the error diagnostic from codegen to sema. It now rejects SEH on
non-Windows platforms. LLVM doesn't implement SEH on non-x86 Windows
platforms, but there's nothing preventing it.
llvm-svn: 236052
When creating a global variable with a type of a struct with bitfields, we must
forcibly set the alignment of the global from the RecordDecl. We must do this so
that the proper bitfield alignment makes its way down to LLVM, since clang will
mangle the bitfields into one large type.
llvm-svn: 235976
This makes sure that the front end is specific about what they're expecting
the backend to produce. Update a FIXME with the idea that the target-features
could be more precise using backend knowledge.
llvm-svn: 235936
In r235553, Clang started emitting lifetime markers more often. This
caused false negative in MSan, because MSan only poisons all allocas
once at function entry. Eventually, MSan should poison allocas at
lifetime start and probably also lifetime end, but until then, let's not
emit markers that aren't going to be useful.
llvm-svn: 235613
Summary:
Make sure signed overflow in "x--" is checked with
llvm.ssub.with.overflow intrinsic and is reported as:
"-2147483648 - 1 cannot be represented in type 'int'"
instead of:
"-2147483648 + -1 cannot be represented in type 'int'"
, like we do for unsigned overflow.
Test Plan: clang + compiler-rt regression test suite
Reviewers: rsmith
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D8236
llvm-svn: 235568
This reverts commit r234700. It turns out that the lifetime markers
were not the cause of Chromium failing but a bug which was uncovered by
optimizations exposed by the markers.
llvm-svn: 235553
Code in CodeGenModule::GetOrCreateLLVMGlobal that sets up GlobalValue
object for LLVM external symbols has this comment:
// FIXME: This code is overly simple and should be merged with other global
// handling.
One part does seems to be "overly simple" currently is that this code
never sets any alignment info on the GlobalValue, so that the emitted
IR does not have any align attribute on external globals. This can
lead to unnecessarily inefficient code generation.
This patch adds a GV->setAlignment call to set alignment info.
llvm-svn: 235396
SystemZ prefers to align all global variables to two bytes, which is
implemented by setting the TargetInfo member MinGlobalAlign.
However, for compatibility with existing compilers this should *not*
change the ABI alignment value as retrieved via __alignof__, which
it currently does.
This patch fixes the issue by having ASTContext::getDeclAlign ignore
the MinGlobalAlign setting in the ForAlignof case.
Since SystemZ is the only platform setting MinGlobalAlign, this should
cause no change for any other target.
llvm-svn: 235395
Something like { void*, void * } would be passed to a function as a [2 x i64], but returned as an i128. This patch unifies the 2 behaviours so that we also return it as a [2 x i64].
This is better for the quality of the IR and the size of the final LLVM binary, as we
tend to want to insert/extract values from these types, and doing so with the
insert/extract instructions takes less IR than shifting, truncating, and or'ing values.
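A reduction matching the description (names illustrative):
struct pair { void *a; void *b; };
struct pair identity(struct pair p) { return p; }  /* now both passed and returned as [2 x i64] */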
Reviewed by Tim Northover.
llvm-svn: 235231
Things can't both be in comdats and have common linkage, so never give things
in comdats common linkage. Common linkage is only used in .c files, and the
only thing that can trigger a comdat in c is selectany from what I can tell.
Fixes PR23243.
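For example (C, compiled with -fms-extensions; name illustrative):
__declspec(selectany) int config_flag = 1;   /* goes in a comdat, so it must not be common */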
Also address an over-the-shoulder review comment from rnk by moving the
hasAttr<SelectAnyAttr>() in Decl.cpp around a bit. It only makes a minor
difference for selectany on global variables, so it goes well with the rest of
this patch.
http://reviews.llvm.org/D9042
llvm-svn: 235053
This patch generates a warning for invalid combination of '-mnan' and
'-march' options, it properly sets NaN encoding for a given '-march',
and it passes a proper NaN encoding to the assembler.
Patch by Vladimir Radosavljevic.
Differential Revision: http://reviews.llvm.org/D8170
llvm-svn: 234882
Even though these symbols are in a comdat group, the Microsoft linker
really wants them to have internal linkage.
I'm planning to tweak the mangling in a follow-up change. This is a
straight revert with a 1-line fix.
llvm-svn: 234613
Now that TailRecursionElimination has been fixed with r222354, the
threshold on size for lifetime marker insertion can be removed. This
only affects named temporary though, as the patch for unnamed temporaries
is still in progress.
My previous commit (r222993) was not handling debuginfo correctly, but
this could only be seen with some asan tests. Basically, lifetime markers
are just instrumentation for the compiler's usage and should not affect
debug information; however, the cleanup infrastructure was assuming it
contained only destructors, i.e. actual code to be executed, and was
setting the breakpoint for the end of the function to the closing '}', and
not the return statement, in order to show some destructors have been
called when leaving the function. This is wrong when the cleanups are only
lifetime markers, and this is now fixed.
llvm-svn: 234581
This patch corresponds to review:
http://reviews.llvm.org/D8398
It adds some builtin functions to access the extended divide and bit permute instructions.
llvm-svn: 234547
WinEHPrepare was going to have to pattern match the control flow merge
and split that the old lowering used, and that wasn't really feasible.
Now we can teach WinEHPrepare to pattern match this, which is much
simpler:
%fp = call i8* @llvm.frameaddress(i32 0)
call void @func(iN [01], i8* %fp)
This prototype happens to match the prototype used by the Win64 SEH
personality function, so this is really simple.
llvm-svn: 234532
The driver currently accepts but ignores the -freciprocal-math flag.
This patch passes the flag through and enables 'arcp' fast-math-flag
generation in IR.
Note that this change does not actually enable the optimization for
any target. The reassociation optimization that this flag specifies
was implemented by http://reviews.llvm.org/D6334 :
http://llvm.org/viewvc/llvm-project?view=revision&revision=222510
Because the optimization is done in the backend rather than IR,
the backend must be modified to understand instruction-level
fast-math-flags or a new function-level attribute must be created.
Also note that -freciprocal-math is independent of any target-specific
usage of reciprocal estimate hardware instructions. That requires
its own flag ('-mrecip').
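For example:
clang -O2 -freciprocal-math -S -emit-llvm file.c
which should mark the relevant floating-point division instructions in the emitted IR
with the 'arcp' fast-math flag.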
https://llvm.org/bugs/show_bug.cgi?id=20912
llvm-svn: 234493
Do the same thing as win64. If we're not using COFF, use the ELF
manglings. Maybe if we are targetting *-windows-msvc-macho, we should
use darwin manglings, but I don't need to stir that pot today.
llvm-svn: 233819
The zEC12 provides the transactional-execution facility. This is exposed
to users via a set of builtin routines on other compilers. This patch
adds clang support to enable those builtins. In particular, the patch:
- enables the transactional-execution feature by default on zEC12
- allows to override presence of that feature via the -mhtm/-mno-htm options
- adds a predefined macro __HTM__ if the feature is enabled
- adds support for the transactional-execution GCC builtins
- adds Sema checking to verify the __builtin_tabort abort code
- adds the s390intrin.h header file (for GCC compatibility)
- adds s390 sections to the htmintrin.h and htmxlintrin.h header files
Since this is first use of target-specific intrinsics on the platform,
the patch creates the include/clang/Basic/BuiltinsSystemZ.def file and
hooks it up in TargetBuiltins.h and lib/Basic/Targets.cpp.
An associated LLVM patch adds the required LLVM IR intrinsics.
For reference, the transactional-execution instructions are documented
in the z/Architecture Principles of Operation for the zEC12:
http://publibfp.boulder.ibm.com/cgi-bin/bookmgr/download/DZ9ZR009.pdf
The associated builtins are documented in the GCC manual:
http://gcc.gnu.org/onlinedocs/gcc/S_002f390-System-z-Built-in-Functions.html
The htmxlintrin.h intrinsics provided for compatibility with the IBM XL
compiler are documented in the "z/OS XL C/C++ Programming Guide".
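A usage sketch of the GCC-compatible builtins mentioned above (assuming -march=zEC12 or
-mhtm; function name illustrative):
#include <htmintrin.h>
int run_transaction(void) {
  if (__builtin_tbegin((void *)0) != 0)
    return 0;                 /* transaction did not start */
  /* transactional body */
  __builtin_tend();
  return 1;
}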
llvm-svn: 233804
Add Tool and ToolChain support for clang to target the NaCl OS using the NaCl
SDK for x86-32, x86-64 and ARM.
Includes nacltools::Assemble and Link which are derived from gnutools. They
are similar to Linux but different enough that they warrant their own class.
Also includes a NaCl_TC in ToolChains derived from Generic_ELF with library
and include paths suitable for an SDK and independent of the system tools.
Differential Revision: http://reviews.llvm.org/D8590
llvm-svn: 233594
The argument range checks for the HTM and Crypto builtins were implemented in
CGBuiltin.cpp, not in Sema. This change moves them to the appropriate location
in SemaChecking.cpp. It requires the creation of a new method in the Sema class
to do checks for PPC-specific builtins.
http://reviews.llvm.org/D8672
llvm-svn: 233586
Test cases must not check for symbolic variable names that are not
present in IR generated by no-assert builds.
Fixed by testing a more complete subset of the va_arg dataflow,
without relying on variable names.
llvm-svn: 233574
Running the GCC's inter-compiler ABI compatibility test suite uncovered
a couple of errors in clang's SystemZ ABI implementation. These all
affect only rare corner cases:
- Short vector types
GCC synthetic vector types defined with __attribute__ ((vector_size ...))
are always passed and returned by reference. (This is not documented in
the official ABI document, but is the de-facto ABI implemented by GCC.)
clang would do that only for vector sizes >= 16 bytes, but not for shorter
vector types.
- Float-like aggregates and empty bitfields
clang would consider any aggregate containing an empty bitfield as
first element to be a float-like aggregate. That's obviously wrong.
According to the ABI doc, the presence of an empty bitfield makes
an aggregate to be *not* float-like. However, due to a bug in GCC,
empty bitfields are ignored in C++; this patch changes clang to be
compatible with this "feature" of GCC.
- Float-like aggregates and va_arg
The va_arg implementation would mis-detect some aggregates as float-like
that aren't actually passed as such. This applies to aggregates that
have only a single element of type float or double, but using an aligned
attribute that increases the total struct size to more than 8 bytes.
This error occurred because the va_arg implementation used to have a copy
of the float-like aggregate detection logic (i.e. it would call the
isFPArgumentType routine, but not perform the size check).
To simplify the logic, this patch removes the duplicated logic and
instead simply checks the (possibly coerced) LLVM argument type as
already determined by classifyArgumentType.
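A hypothetical reduction of the first corner case above: an 8-byte synthetic vector
type, which under the de-facto GCC ABI is passed and returned by reference.
typedef int v2si __attribute__((vector_size(8)));
v2si add2(v2si a, v2si b) { return a + b; }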
llvm-svn: 233543
Eric Christopher pointed out that we have a check for assembly code
generation in a clang test, which isn't cool. We already have Driver
and back-end CodeGen tests for the .abiversion handling, so this
testing is unnecessary anyway. Make it go away.
llvm-svn: 233314
This patch adds Hardware Transaction Memory (HTM) support supported by ISA 2.07
(POWER8). The intrinsic support is based on GCC one [1], with both 'PowerPC HTM
Low Level Built-in Functions' and 'PowerPC HTM High Level Inline Functions'
implemented.
Along with builtins a new driver switch is added to enable/disable HTM
instruction support (-mhtm) and a header with common definitions (mostly to
parse the TFHAR register value). The HTM switch also defines the preprocessor macro
__HTM__.
The HTM usage requires a recent kernel with PPC HTM enabled. Tested on
powerpc64 and powerpc64le.
This is sent along with an llvm patch to enable the builtins and option switch.
[1]
https://gcc.gnu.org/onlinedocs/gcc/PowerPC-Hardware-Transactional-Memory-Built-in-Functions.html
Phabricator Review: http://reviews.llvm.org/D8248
llvm-svn: 233205
I'm about to commit a patch for:
http://reviews.llvm.org/D8567
That patch will break this one existing test case in Clang.
I'm not sure if this file is intending to create a Clang
dependency on the LLVM IR optimizer, but that's the
consequence of specifying -O3 on this test file.
My hope is to avoid buildbot rage by removing this check,
committing the LLVM patch, and then fixing this check.
I don't know how to make a simultaneous commit to Clang
and LLVM.
I will commit the correct CHECK line fix for this test
shortly.
llvm-svn: 233109
PS4 target recognizes the #pragma comment() syntax as in -fms-extensions, but
only handles the case of #pragma comment(lib). This patch adds a warning if any
other arguments are encountered.
This patch also refactors the code in ParsePragma.cpp a little bit to make it
more obvious that some codes are being shared between -fms-extensions and PS4.
llvm-svn: 233015
On AArch64, the -fallow-half-args-and-returns option is the default.
With it, the half type is considered legal (rather than the i16 used
normally for __fp16), but no operation is, except conversions and
load/stores and such.
The previous behavior was tantamount to saying LangOpts.NativeHalfType
was implied by LangOpts.HalfArgsAndReturns, which isn't true.
Instead, teach the various parts of CodeGen that already know about
half (using the intrinsics or not) about this weird in-between case,
where the "half" type is legal, but operations on it aren't.
This is a smaller intermediate step to the end-goal of removing the
intrinsic, always using "half", and letting the backend legalize.
Builds on r232968.
rdar://20045970, rdar://17468714
Differential Revision: http://reviews.llvm.org/D8367
llvm-svn: 232971
Fix the CodeGen so that for types bigger than float, instead of
converting to fp16 via the sequence "InTy -> float -> fp16", we
perform conversions in just one step. This avoids the double
rounding which potentially changes results from a natural
IEEE-754 operation.
rdar://17594379, rdar://17468714
Differential Revision: http://reviews.llvm.org/D4602
Part of: http://reviews.llvm.org/D8367
llvm-svn: 232968
the target-cpu, if different from the triple's cpu, and
target-features as they're written that are passed down from the
driver.
Together with LLVM r232885 this should allow the LTO'ing of binaries
that contain modules compiled with different code generation options
on a subset of architectures with full backend support (x86, powerpc,
aarch64).
llvm-svn: 232888
Somehow, we never managed to implement this fully. We could constant
fold it like crazy, including constant folding complex arguments, etc.
But if you actually needed to generate code for it, error.
I've implemented it using the somewhat obvious lowering. Happy for
suggestions on a more clever way to lower this.
Now, what you might ask does this have to do with modules? Fun story. So
it turns out that libstdc++ actually uses __builtin_isinf_sign to
implement std::isinf when in C++98 mode, but only inside of a template.
So if we're lucky, and we never instantiate that, everything is good.
But once we try to instantiate that template function, we need this
builtin. All of my customers at least are using C++11 and so they never
hit this code path.
But what does that have to do with modules? Fun story. So it turns out
that with modules we actually observe a bunch of bugs in libstdc++ where
their <cmath> header clobbers things exposed by <math.h>. To fix these,
we have to provide global function definitions to replace the macros
that C99 would have used. And it turns out that ::isinf needs to be
implemented using the exact semantics used by the C++98 variant of
std::isinf. And so I started to fix this bug in libstdc++ and ceased to
be able to compile libstdc++ with Clang.
The yaks are legion.
llvm-svn: 232778
location data is available. If pragma handling wants to look up the
position, it finds the LLVM buffer and wants to compare it with the
special built-in buffer, failing badly. Extend to the special handling
of the built-in buffer to also check for the inline asm buffer. Expect
only a single asm buffer. Sort it between the built-in buffers and the
normal file buffers.
Fixes the assert part of PR 22576.
llvm-svn: 232389
In preparation for recommit of revision 232190, change tests so that they
are resilient to operands being commuted by the reassociate pass.
llvm-svn: 232206
This is nearly identical to the v*f128_si256 parts of r231792 and r232052.
AVX2 introduced proper integer variants of the hacked integer insert/extract
C intrinsics that were created for this same functionality with AVX1.
This should complete the front end fixes for insert/extract128 intrinsics.
Corresponding LLVM patch to follow.
llvm-svn: 232109
This is very much like D8088 (checked in at r231792).
Now that we've replaced the vinsertf128 intrinsics,
do the same for their extract twins.
Differential Revision: http://reviews.llvm.org/D8275
llvm-svn: 232052
Support for the QPX vector instruction set, used on the IBM BG/Q supercomputer,
has recently been added to the LLVM PowerPC backend. This vector instruction
set requires some ABI modifications because the ABI on the BG/Q expects
<4 x double> vectors to be provided with 32-byte stack alignment, and to be
handled as native vector types (similar to how Altivec vectors are handled on
mainline PPC systems). I've named this ABI variant elfv1-qpx, have made this
the default ABI when QPX is supported, and have updated the ABI handling code
to provide QPX vectors with the correct stack alignment and associated
register-assignment logic.
llvm-svn: 231960
We want to replace as much custom x86 shuffling via intrinsics
as possible because pushing the code down the generic shuffle
optimization path allows for better codegen and less complexity
in LLVM.
This is the sibling patch for the LLVM half of this change:
http://reviews.llvm.org/D8086
Differential Revision: http://reviews.llvm.org/D8088
llvm-svn: 231792
This is a recommit of r231150, reverted in r231409. Turns out
that -fsanitize=shift-base check implementation only works if the
shift exponent is valid, otherwise it contains undefined behavior
itself.
Make sure we check that exponent is valid before we proceed to
check the base. Make sure that we actually report invalid values
of base or exponent if -fsanitize=shift-base or
-fsanitize=shift-exponent is specified, respectively.
llvm-svn: 231711
When passing a type with large alignment byval, we were specifying the type's
alignment rather than the alignment that the backend is actually capable of
producing (ABIAlign).
This would be OK (if odd) assuming the backend dealt with it properly,
unfortunately it doesn't and trying to pass types with "byval align 16" can
cause it to set fp incorrectly and trash the stack during the prologue. I'll be
fixing that in a separate patch, but Clang should still be emitting IR that's
as close to its intent as possible.
rdar://20059039
llvm-svn: 231706
It's not that easy. If we're only checking -fsanitize=shift-base we
still need to verify that exponent has sane value, otherwise
UBSan-inserted checks for base will contain undefined behavior
themselves.
llvm-svn: 231409
Opt in Win64 to supporting sjlj lowering. We have the backend lowering,
so I think this was just an oversight because WinX86_64TargetCodeGenInfo
doesn't inherit from X86_64TargetCodeGenInfo.
llvm-svn: 231280
This test doesn't provide any value (it just checks that the frontend
produces exactly one compile unit), and it certainly isn't doing what
the comment says. Noticed via IRC review of my update to it in r231083.
llvm-svn: 231152
-fsanitize=shift is now a group that includes both these checks, so
existing users should not be affected.
This change introduces two new UBSan kinds that sanitize only left-hand
side and right-hand side of shift operation. In practice, invalid
exponent value (negative or too large) tends to cause more portability
problems, including inconsistencies between different compilers, crashes
and inadequate results on non-x86 architectures etc. That is,
-fsanitize=shift-exponent failures should generally be addressed first.
As a bonus, this change simplifies CodeGen implementation for emitting left
shift (separate checks for base and exponent are now merged by the
existing generic logic in EmitCheck()), and LLVM IR for these checks
(the number of basic blocks is reduced).
llvm-svn: 231150
Originally we were using the same GCC builtins to lower this AVX2 vector
intrinsic. Instead we will now lower it directly to a vector shuffle.
This will not only allow LLVM to generate better code, but it will also allow us
to remove the GCC intrinsics.
Reviewed by Andrea
This is related to rdar://problem/18742778.
llvm-svn: 231081
For global reg lvalue - use regular store through global register.
For simple lvalue - use simple atomic store.
For bitfields, vector elements, and extended vector elements - the original value of
the whole storage (for vector elements) or of an aligned containing value (for
bitfields) is read atomically, the part of this value corresponding to the given
lvalue is modified, and an atomic compare-and-exchange operation is then used to try
to write the modified value back (succeeding only if the storage was not changed in
the meantime).
Also, changes in this patch fix the bug for '#pragma omp atomic read' applied to extended vector elements.
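A sketch of the simple-lvalue case (requires -fopenmp; names illustrative):
int g;
void set_value(int v) {
#pragma omp atomic write
  g = v;
}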
Differential Revision: http://reviews.llvm.org/D7369
llvm-svn: 230736
The __finally emission block tries to be clever by removing unused continuation
edges if there's an unconditional jump out of the __finally block. With
exception edges, the EH continuation edge isn't always unused though and we'd
crash in a few places.
Just don't be clever. That makes the IR for __finally blocks a bit longer in
some cases (hence small and behavior-preserving changes to existing tests), but
it makes no difference in general and it fixes the last crash from PR22553.
http://reviews.llvm.org/D7918
llvm-svn: 230697
Currently, the NaN values emitted for MIPS architectures do not cover
non-IEEE754-2008 compliant case. This change fixes the issue.
Patch by Vladimir Radosavljevic.
Differential Revision: http://reviews.llvm.org/D7882
llvm-svn: 230653
Original CL description:
Produce less broken basic block sequences for __finally blocks.
The way cleanups (such as PerformSEHFinally) get emitted is that codegen
generates some initialization code, then calls the cleanup's Emit() with the
insertion point set to a good place, then the cleanup is supposed to emit its
stuff, and then codegen might tack in a jump or similar to where the insertion
point is after the cleanup.
The PerformSEHFinally cleanup tries to just stash away the block it's supposed
to codegen into, and then does codegen later, into that stashed block. However,
after codegen'ing the __finally block, it used to set the insertion point to
the finally's continuation block (where the __finally cleanup goes when its body
is completed after regular, non-exceptional control flow). That's not correct,
as that block can (and generally does) already end in a jump. Instead,
remember the insertion point that was current before the __finally got emitted,
and restore that.
Fixes two of the crashes in PR22553.
llvm-svn: 230503
The way cleanups (such as PerformSEHFinally) get emitted is that codegen
generates some initialization code, then calls the cleanup's Emit() with the
insertion point set to a good place, then the cleanup is supposed to emit its
stuff, and then codegen might tack in a jump or similar to where the insertion
point is after the cleanup.
The PerformSEHFinally cleanup tries to just stash away the block it's supposed
to codegen into, and then does codegen later, into that stashed block. However,
after codegen'ing the __finally block, it used to set the insertion point to
the finally's continuation block (where the __finally cleanup goes when its body
is completed after regular, non-exceptional control flow). That's not correct,
as that block can (and generally does) already end in a jump. Instead,
remember the insertion point that was current before the __finally got emitted,
and restore that.
Fixes two of the crashes in PR22553.
llvm-svn: 230460
This is a necessary prerequisite for debugging with modules.
The .pcm files become containers that hold the serialized AST which allows
us to store debug information in the module file that can be shared by all
object files that were built importing the module.
This reapplies r230044 with a fixed configure+make build and updated
dependencies and testcase requirements. Over the last iteration this
version adds
- missing target requirements for testcases that specify an x86 triple,
- a missing clangCodeGen.a dependency to libClang.a in the make build.
rdar://problem/19104245
llvm-svn: 230423
The backend should now be able to handle all AAPCS rules based on argument
type, which means Clang no longer has to duplicate the register-counting logic
and the CodeGen can be significantly simplified.
llvm-svn: 230349
MSVC does not support C99 _Complex.
ICC, however, does support it on windows x86_64, and treats it, for purposes of parameter passing, as equivalent to a struct containing two fields (for the real and imaginary part).
Differential Revision: http://reviews.llvm.org/D7825
llvm-svn: 230315
llvm.eh.sjlj.setjmp / llvm.eh.sjlj.longjmp, if the backend is known to
support them outside the Exception Handling context. The default
handling in LLVM codegen doesn't work and will create incorrect code.
The ARM backend on the other hand will assert if the intrinsics are
used.
llvm-svn: 230255
For now -funique-section-names is the default, so no change in default behavior.
The total .o size in a build of llvm and clang goes from 241687775 to 230649031
bytes if -fno-unique-section-names is used.
llvm-svn: 230031
Summary:
The definition for _mm256_insert_epi64 was taking an int, which would get
truncated before being inserted in the vector.
Original patch by Joshua Magee!
Reviewers: bruno, craig.topper
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D7179
llvm-svn: 229811
Not all targets generate 'store atomic' instructions for
'_Atomic(_Complex int)'. Some targets use the __atomic_store builtin instead.
This commit makes the test accept either one.
llvm-svn: 229676
This is a patch for PR22563 ( http://llvm.org/bugs/show_bug.cgi?id=22563 ).
We were not correctly unwrapping a single 256-bit AVX vector that was defined as an array of 1 inside a struct.
We would generate a <4 x float> param/return value instead of <8 x float> and lose half of the vector.
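A reduction in the spirit of the PR (the exact test may differ; names illustrative):
typedef float v8f __attribute__((vector_size(32)));
struct wrap { v8f v[1]; };
struct wrap pass_through(struct wrap w) { return w; }   /* must use <8 x float>, not <4 x float> */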
Differential Revision: http://reviews.llvm.org/D7614
llvm-svn: 229408
For #pragma comment(linker, ...) MSVC expects the comment string to be quoted, but for #pragma comment(lib, ...) the compiler itself quotes the library name.
Since this distinction disappears by the time the directive reaches the backend, move quoting for the "lib" version to the frontend.
Differential Revision: http://reviews.llvm.org/D7653
llvm-svn: 229376
Bools are a little tricky, they are i8 in memory and must be coerced
back to i1 before further operations can be performed on them.
This fixes PR22577.
llvm-svn: 229204
The first change won't touch GEPOperators such as these, but the update
script only identifies them by the leading '(' after getelementptr or
'getelementptr inbounds', so update this test to at least have those
features to allow auto-migrating.
llvm-svn: 229198
The /volatile:ms semantics turn volatile loads and stores into atomic
acquire and release operations. This distinction is important because
volatile memory operations do not form a happens-before relationship
with non-atomic memory. This means that a volatile store is not
sufficient for implementing a mutex unlock routine.
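Illustration of the semantics (with /volatile:ms in effect; names illustrative):
volatile int guard;
void unlock(void) { guard = 0; }       /* becomes an atomic release store */
int  peek(void)   { return guard; }    /* becomes an atomic acquire load  */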
Differential Revision: http://reviews.llvm.org/D7580
llvm-svn: 229082
Summary:
This patch installs an InlineAsmDiagnosticsHandler to avoid the crash
report when the input is bitcode and the bitcode contains invalid inline
assembly. The handler will simply print the same error message that will
print from the backend.
Add CHECK in test-case
Reviewers: echristo, rafael
Reviewed By: rafael
Subscribers: rafael, cfe-commits
Differential Revision: http://reviews.llvm.org/D7568
llvm-svn: 228898
a non-uniqueable temporary node that is only turned into a permanent
unique or distinct node after it is finished.
Otherwise an intermediate node may get accidentally uniqued with another
node as illustrated by the testcase.
Paired commit with LLVM.
llvm-svn: 228855
Also removed unused builtins.
Original patch by Andrea Di Biagio!
Reviewers: craig.topper, nadav
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D7199
llvm-svn: 228481
modifiers on them. If we have a matching output constraint with
an early clobber make sure we don't propagate that to the input
constraint.
llvm-svn: 228422
After r228258, Clang started emitting C++ EH IR that LLVM wasn't ready
to deal with, even when exceptions were disabled with /EHs-. This time,
make /EHs- turn off -fexceptions while still emitting exceptional
constructs in functions using __try. Since Sema rejects C++ exception
handling constructs before CodeGen, landingpads should only appear in
such functions as the result of a __try.
llvm-svn: 228329
Previously we would simply double-emit the body of the __finally block,
but that doesn't work when it contains any kind of Decl, which we can't
double emit.
This fixes that by emitting the block once and branching into a shared
code region and then branching back out.
llvm-svn: 228222
Summary:
Named registers with the constraint "=&r" currently lose the early clobber flag
and turn into "=r" when converted to LLVM-IR. This patch correctly passes it on.
Reviewers: atanasyan
Reviewed By: atanasyan
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D7346
llvm-svn: 228143
Create a new TargetCodeGenInfo for Windows on ARM to permit annotating the
functions with stack-probe-size (for /Gs and -mstack-probe-support) for
generating the stack probe necessary for Windows targets. This will be used by
the backend when lowering the frame to generate the stack probe appropriately.
llvm-svn: 227641
On targets which use the MSVCRT, setjmp is a macro which expands to
_setjmp or _setjmpex.
_setjmp and _setjmpex have a secret, hidden argument which is not listed
in the function prototype on X64 and WoA. This hidden argument always
seems to be the frame pointer.
_setjmpex isn't used on X86; _setjmp is magically replaced with a call
to _setjmp3. The second argument is zero for 'normal' setjmp/longjmp
pairs, otherwise it is a count of additional variadic arguments. This
is used when setjmp appears inside of a try or __try.
It is not safe to use a pointer to setjmp because _setjmp, _setjmpex and
_setjmp3 are not compatible with setjmp.
llvm-svn: 227426
Summary:
It was used for interoperability with PNaCl's calling conventions, but
it's no longer needed.
Also remove NaCl*ABIInfo, which just existed to delegate to either the portable
or native ABIInfo, and remove checkCallingConvention which was now a no-op
override.
Reviewers: jvoung
Subscribers: jfb, llvm-commits
Differential Revision: http://reviews.llvm.org/D7206
llvm-svn: 227362
The backend won't run LowerExpect at -O0. In a debug LTO build, this results in llvm.expect intrinsics remaining in the LTO IR, which the LTO stage doesn't know how to optimize.
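A minimal sketch of code that produces the intrinsic:
int classify(int x) {
  // __builtin_expect lowers to llvm.expect; only the LowerExpect pass
  // (skipped at -O0) strips it out again.
  if (__builtin_expect(x == 0, 0))
    return -1;
  return 1;
}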
Thanks to Chandler for the suggestion and review.
Differential revision: http://reviews.llvm.org/D7183
llvm-svn: 227135
The lowering looks a lot like normal EH lowering, with the exception
that the exceptions are caught by executing filter expression code
instead of matching typeinfo globals. The filter expressions are
outlined into functions which are used in landingpad clauses where
typeinfo would normally go.
Major aspects that still need work:
- Non-call exceptions in __try bodies won't work yet. The plan is to
outline the __try block in the frontend to keep things simple.
- Filter expressions cannot use local variables until capturing is
implemented.
- __finally blocks will not run after exceptions. Fixing this requires
work in the LLVM SEH preparation pass.
The IR lowering looks like this:
// C code:
bool safe_div(int n, int d, int *r) {
__try {
*r = normal_div(n, d);
} __except(_exception_code() == EXCEPTION_INT_DIVIDE_BY_ZERO) {
return false;
}
return true;
}
; LLVM IR:
define i32 @filter(i8* %e, i8* %fp) {
%ehptrs = bitcast i8* %e to i32**
%ehrec = load i32** %ehptrs
%code = load i32* %ehrec
%matches = icmp eq i32 %code, u0xC0000094
%matches.i32 = zext i1 %matches to i32
ret i32 %matches.i32
}
define zeroext i1 @safe_div(i32 %n, i32 %d, i32* %r) {
%rr = invoke i32 @normal_div(i32 %n, i32 %d)
to label %normal unwind label %lpad
normal:
store i32 %rr, i32* %r
ret i1 1
lpad:
%ehvals = landingpad {i8*, i32} personality i32 (...)* @__C_specific_handler
catch i8* bitcast (i32 (i8*, i8*)* @filter to i8*)
%ehptr = extractvalue {i8*, i32} %ehvals, i32 0
%sel = extractvalue {i8*, i32} %ehvals, i32 1
%filter_sel = call i32 @llvm.eh.seh.typeid.for(i8* bitcast (i32 (i8*, i8*)* @filter to i8*))
%matches = icmp eq i32 %sel, %filter_sel
br i1 %matches, label %eh.except, label %eh.resume
eh.except:
ret i1 false
eh.resume:
resume { i8*, i32 } %ehvals
}
Reviewers: rjmccall, rsmith, majnemer
Differential Revision: http://reviews.llvm.org/D5607
llvm-svn: 226760
It fails on Windows due to another temporary being emitted first, so the
LLVM internal renaming scheme gives out the name
__block_descriptor_tmp1.
llvm-svn: 226757
Currently we emit DeferredDeclsToEmit in reverse order. This patch changes that.
The advantages of the change are that
* The output order is a bit closer to the source order. The change to
test/CodeGenCXX/pod-member-memcpys.cpp is a good example.
* If we decide to defer more, it will not cause changes in the
testcases as large as it would without this patch.
llvm-svn: 226751
Analogous to AVX2, these need to be implemented as macros to properly
propagate the immediate index operand.
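A minimal sketch of why a macro (rather than a function) is needed, using the generic __builtin_shufflevector; the wrapper name is made up:
typedef float v4sf __attribute__((vector_size(16)));
// The lane indices must be integer constant expressions, so a real
// function taking an index parameter would not compile; a macro keeps
// the immediate visible at the call site.
#define swap_halves(a) __builtin_shufflevector((a), (a), 2, 3, 0, 1)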
Part of <rdar://problem/17688758>
llvm-svn: 226496
Summary:
This fixes MultiSource/Applications/lemon on big-endian N32 by correcting the
handling of the argument to wait(). glibc defines it as a transparent union of
void* and int*. Such unions are passed according to the rules of the first
member so the argument must be passed as if it were a void* (sign extended from
i32 to i64) and not as a union (shifted to the upper bits of an i64).
wait() already behaves correctly on big-endian O32 and N64 since the union is
already the same size as an argument slot.
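A minimal sketch of a transparent union in the style of glibc's wait() argument (names are illustrative):
typedef union {
  int *int_ptr;
  void *void_ptr;
} wait_arg_t __attribute__((__transparent_union__));
// A call such as my_wait(&status) passes the argument according to the
// rules for the first member, i.e. as if it were a plain pointer.
int my_wait(wait_arg_t arg);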
Reviewers: atanasyan
Reviewed By: atanasyan
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D6963
llvm-svn: 225981
These are implemented with __builtin_shufflevector just like AVX.
We have some tests on the LLVM side to assert that these shufflevectors do
indeed generate the corresponding unpck instruction.
Part of <rdar://problem/17688758>
llvm-svn: 225922
Summary:
The Mips ABIs treat pointers in the same way as integers. They are
sign-extended to 32-bit for O32, and 64-bit for N32/N64. This doesn't matter
for O32 and N64 where pointers are already the correct width but it does matter
for big-endian N32, where pointers are 32-bit and need promoting.
The caller side is already passing pointers correctly. This patch corrects the
callee.
Reviewers: vmedic, atanasyan
Reviewed By: atanasyan
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D6812
llvm-svn: 225782
Introduce the following -fsanitize-recover flags:
- -fsanitize-recover=<list>: Enable recovery for selected checks or
group of checks. It is forbidden to explicitly list unrecoverable
sanitizers here (that is, "address", "unreachable", "return").
- -fno-sanitize-recover=<list>: Disable recovery for selected checks or
group of checks.
- -f(no-)?sanitize-recover is now a synonym for
-f(no-)?sanitize-recover=undefined,integer and will soon be deprecated.
These flags are parsed left to right, and the mask of "recoverable"
sanitizers is updated accordingly, much like what we do for -fsanitize= flags.
-fsanitize= and -fsanitize-recover= flag families are independent.
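For example, an illustrative invocation such as:
clang -fsanitize=undefined -fsanitize-recover=undefined -fno-sanitize-recover=alignment file.c -c
enables the UBSan checks, makes them recoverable, and then marks the
alignment check as unrecoverable again, since flags further to the right win.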
CodeGen change: If there is a single UBSan handler function responsible
for implementing multiple checks which have different recoverability
settings, then we emit two handler calls instead of one: the first for
the set of "unrecoverable" checks, and another for the set of
"recoverable" checks. If all checks implemented by a handler have the
same recoverability setting, then the generated code will be the same.
llvm-svn: 225719
The llvm IR until recently had no support for comdats. This was a problem when
targeting C++ on ELF/COFF as just using weak linkage would cause quite a bit of
dead bits to remain on the executable (unless -ffunction-sections,
-fdata-sections and --gc-sections were used).
To fix the problem, llvm's codegen will just assume that any weak or linkonce
that is not in an explicit comdat should be output in one with the same name as
the global.
This unfortunately breaks cases like pr19848 where a weak symbol is not
expected to be part of any comdat.
Now that we have explicit comdats in the IR, we can finally get both cases
right.
This first patch just makes clang give explicit comdats to GlobalValues where
it is allowed to.
A followup patch to llvm will then stop implicitly producing comdats.
llvm-svn: 225705
Between this behavior and that fixed by r225083/r225000, I'll take the
latter over the former for now, but I'm immediately working on
understanding/addressing this behavior too.
(the fact that the code change in r225083 caused this change in behavior
is a bit troubling anyway - given that it looks & claims to be just a
performance thing)
llvm-svn: 225086
This still lowers to the same intrinsics as before.
This is preparation for bounds checking the immediate on the AVX version of the builtin so we don't pass illegal immediates into the backend. Since SSE uses a smaller immediate, it's not possible to bounds check when using a shared builtin. Rather than creating a Clang-specific builtin for the different immediate, I decided (after consulting with Chandler) that it was better to match GCC.
llvm-svn: 224879
The lit.cfg files only add .cpp to suffixes, so these tests used to never run,
oops. (Also tweak two of these tests in minor ways to make them actually pass.)
llvm-svn: 224718
Fixed an assertion in type checking of arguments and parameters on a function call when the arguments are pointers to VLAs.
Differential Revision: http://reviews.llvm.org/D6655
llvm-svn: 224504
use clang -cc1 matching the front end and backend. Fix up a couple
of tests that were testing aapcs for arm-linux-gnu.
The test that removes the aapcs abi calling convention removes
them because the default triple matches what the backend uses
for the calling convention there and so it doesn't need to be
explicitly stated - see the code in TargetInfo.cpp.
llvm-svn: 224491
For MSVC compatibility, add the `__emit' builtin. This is used in the Windows
SDK headers, and must therefore be implemented as a builtin rather than an
intrinsic.
The `__emit' builtin provides a mechanism to emit a 16-bit opcode instruction
into the stream. The value must be a compile time constant expression. No
guarantees are made about the CPU and memory states after the execution of the
instruction.
Due to the unchecked nature of the builtin, only support this on Windows on ARM.
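A minimal usage sketch (the opcode value is illustrative):
void emit_trap(void) {
  // The argument must be a compile-time constant; the 16-bit value is
  // placed directly into the instruction stream. Windows on ARM only.
  __emit(0xDEFE);
}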
llvm-svn: 224438
Summary:
Because GCC doesn't use $1 for code generation, inline assembly code can use $1 without having to add it to the clobbers list.
LLVM, on the other hand, does not shy away from using $1, and this can cause conflicts with inline assembly which assumes GCC-like code generation.
A solution to this problem is to make Clang automatically clobber $1 for all MIPS inline assembly.
This is not the optimal solution, but it seems like a necessary compromise, for now.
Reviewers: dsanders
Reviewed By: dsanders
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D6638
llvm-svn: 224428
Currently clang fires assertions on x86-64 for any atomic operation on long double operands. This patch fixes codegen for such operations.
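A minimal sketch of an operation that used to assert (C11 atomics):
void bump(_Atomic long double *p) {
  // An atomic compound assignment on a long double; previously this
  // triggered an assertion during IR generation on x86-64.
  *p += 1.0L;
}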
Differential Revision: http://reviews.llvm.org/D6499
llvm-svn: 224230
having OptimizeNone remove them again, just don't add them in the
first place if the function already has OptimizeNone.
Note that MinSize can still appear due to attributes on different
declarations; a future patch will address that.
llvm-svn: 224047
Summary:
When -fsanitize-address-field-padding=1 is present
don't emit memcpy for copy constructor.
Thanks Nico for the extra test case.
Test Plan: regression tests
Reviewers: thakis, rsmith
Reviewed By: rsmith
Subscribers: rsmith, cfe-commits
Differential Revision: http://reviews.llvm.org/D6515
llvm-svn: 223563
is for each machine. Fix up darwin tests that were testing for
aapcs on armv7-ios when the actual ABI is apcs.
Should be no user visible change without -cc1.
llvm-svn: 223429
The ARM ABI specifies that all libcalls use the soft FP ABI
(even in hard FP binaries). These days clang emits _mulsc3 / _muldc3
calls with the default (C) calling convention, which would be translated
into the AAPCS_VFP LLVM calling convention, and thus the result of
complex multiplication would be bogus.
Introduce a way for a target to explicitly specify the calling
convention for libcalls. Right now this is a temporary correctness
fix. Ultimately, we'll end up with an intrinsic for complex
multiplication, and all calling convention decisions for libcalls
will be put into the backend.
llvm-svn: 223123
Now that LLVM can count the registers needed to implement AAPCS rules, we don't
need to duplicate that logic here. This means we can drop the explicit padding
and also use more natural types in many cases (e.g. "struct { float arr[3]; }"
used to end up as "[2 x double]" to avoid holes on the stack).
The one wrinkle is that AAPCS va_arg was also using the register counting
machinery. But the local replacement isn't too bad.
llvm-svn: 222904
Cygwin and MinGW fail to conform to the underlying system's structure passing
ABI. Make the check more precise to ensure that we correctly generate code for
the itanium environment.
llvm-svn: 222626
"global-init", "global-init-src" and "global-init-type" were originally
used to blacklist entities in ASan init-order checker. However, they
were never documented, and later were replaced by "=init" category.
Old blacklist entries should be converted as follows:
* global-init:foo -> global:foo=init
* global-init-src:bar -> src:bar=init
* global-init-type:baz -> type:baz=init
llvm-svn: 222401
This reverts commit r222144. Commit r222142 is being reverted due to
a spec2006/gcc execution-time regression.
Update mips-varargs test as well.
llvm-svn: 222397
Summary:
With this patch, passing a va_list to another function and reading 10 ints from
it works correctly on a big-endian target.
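A minimal sketch of the pattern being fixed:
#include <stdarg.h>
static int sum10(va_list ap) {
  // Reads ten ints from a va_list created in the caller.
  int s = 0;
  for (int i = 0; i < 10; ++i)
    s += va_arg(ap, int);
  return s;
}
int sum(int count, ...) {
  va_list ap;
  va_start(ap, count);
  int s = sum10(ap);
  va_end(ap);
  return s;
}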
Based on a pair of patches by David Chisnall, one of which I've reworked
for the current trunk.
Reviewers: theraven, atanasyan
Reviewed By: theraven, atanasyan
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D6248
llvm-svn: 222339
Summary:
This distinguishes between -fpic and -fPIC now, with the additions in LLVM for
PIC level support.
Test Plan: No regressions
Reviewers: echristo, rafael
Reviewed By: rafael
Subscribers: rnk, emaste, llvm-commits
Differential Revision: http://reviews.llvm.org/D5400
llvm-svn: 222227
used inside blocks. It fixes a crash in the naming code
for __func__ etc. when used in a block declared at global scope.
It also brings back the old naming convention for
predefined expressions, which was broken. rdar://18961148
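A minimal sketch of the previously crashing case (requires -fblocks):
// A block declared at global scope that refers to a predefined expression.
const char *(^name)(void) = ^{ return __func__; };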
llvm-svn: 222065
Summary:
Ok, here is a small addition to D6217 aiming to preserve the old Darwin behavior w.r.t. typedef'ed types. The actual change to SemaChecking turned out to be pretty gross; in particular:
1. We need to extract the typedef'ed type for proper diagnostics
2. We need to walk over paren expressions as well
Reviewers: chandlerc, rsmith
Reviewed By: rsmith
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D6256
llvm-svn: 222044
VSX makes the "vector long long" and "vector double" types available.
This patch enables the vec_perm interface for these types. The same
builtin is generated regardless of the specified type, so no
additional work or testing is needed in the back end. Tests are added
to ensure this builtin is generated by the front end.
llvm-svn: 221988
This patch adds builtin support for xvdivdp and xvdivsp, along with a
new test case. The builtins are accessed using vec_div in altivec.h.
Builtins are listed (mostly) alphabetically there, so inserting these
changed the line numbers for deprecation warnings tested in
test/Headers/altivec-intrin.c.
There is a companion patch for LLVM.
llvm-svn: 221984
Summary:
Consider the following nifty one-liner: (0 ? csqrtl(2.0f) : sqrtl(2.0f)). One can easily obtain such code from e.g. tgmath. Right now it triggers an assertion because we fail to do the promotion real => _Complex real.
The case was properly handled previously (in the old handleOtherComplexFloatConversion routine), but was forgotten in the current version. This seems to be fallout from r219557.
Reviewers: chandlerc, rsmith
Reviewed By: rsmith
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D6217
llvm-svn: 221821
This patch enables the vec_vsx_ld and vec_vsx_st intrinsics for
PowerPC, which provide programmer access to the lxvd2x, lxvw4x,
stxvd2x, and stxvw4x instructions.
New code in altivec.h defines these in terms of new builtins, which
are themselves defined in BuiltinsPPC.def. The builtins are converted
to LLVM intrinsics in CGBuiltin.cpp. Additional code is added to
builtins-ppc-vsx.c to verify the correct generation of the intrinsics.
Note that I moved the other VSX builtins so all VSX builtins will be
alphabetical in their own section in BuiltinsPPC.def.
There is a companion patch for LLVM.
llvm-svn: 221768
Summary: If we've added poisoned paddings to a type, do not emit memcpy for operator=.
Test Plan: regression tests.
Reviewers: majnemer, rsmith
Reviewed By: rsmith
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D6160
llvm-svn: 221739
Summary:
This change makes CodeGenFunction::EmitCheck() take several
conditions that need to be checked (all of them need to be true),
together with the sanitizer kinds these checks are for. This allows
splitting one call into the UBSan runtime into several calls in case
different sanitizer kinds have different recoverability
settings.
Tests should be fixed accordingly, I'm working on it.
Test Plan: regression test suite.
Reviewers: rsmith
Reviewed By: rsmith
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D6219
llvm-svn: 221716
Homogeneous aggregates on AAPCS_VFP ARM need to be passed *without* being
flattened (e.g. [2 x float] rather than "float, float") for various weird ABI
reasons. However, this isn't the case for anything else; further, we know at
the ABIArgInfo::getDirect callsites whether this flattening is allowed.
So, we can get more unified ARM code, with a simpler Clang, by just using that
knowledge directly.
llvm-svn: 221559
mingw64's headers implement fabs by calling __builtin_fabs, so using the
library call results in an infinite loop. If the backend legalizes
@llvm.fabs as a call to fabs later, things should work out, as the crt
provides a definition.
llvm-svn: 221206
It turns out that MinGW never dllimports or dllexports inline functions.
This means that code compiled with Clang would fail to link with
MinGW-compiled libraries since we might try to import functions that
are not imported.
To fix this, make Clang never dllimport inline functions when targeting
MinGW.
llvm-svn: 221154
The most complex aspect of the convention is the handling of homogeneous
vector and floating point aggregates. Reuse the homogeneous aggregate
classification code that we use on PPC64 and ARM for this.
This convention also has a C mangling, and we apparently implement that
in both Clang and LLVM.
Reviewed By: majnemer
Differential Revision: http://reviews.llvm.org/D6063
llvm-svn: 221006
Now that we have initial support for VSX, we can begin adding
intrinsics for programmer access to VSX instructions. This patch
performs the necessary enablement in the front end, and tests it by
implementing intrinsics for minimum and maximum using the vector
double data type.
The main change in the front end is to no longer disallow "vector" and
"double" in the same declaration (lib/Sema/DeclSpec.cpp), but "vector"
and "long double" must still be disallowed. The new intrinsics are
accessed via vec_max and vec_min with changes in
lib/Headers/altivec.h. Note that for v4f32, we already access
corresponding VMX builtins, but with VSX enabled we should use the
forms that allow all 64 vector registers.
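For illustration, with the AltiVec language extensions and VSX enabled (e.g. -maltivec -mvsx), the following is now accepted:
vector double vd = {1.0, 2.0};   // "vector long double" is still rejected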
The new built-ins are defined in include/clang/Basic/BuiltinsPPC.def.
I've added a new test in test/CodeGen/builtins-ppc-vsx.c that is
similar to, but much smaller than, builtins-ppc-altivec.c. This
allows us to test VSX IR generation without duplicating CHECK lines
for the existing bazillion Altivec tests.
Since vector double is now legal when VSX is available, I've modified
the error message, and changed where we test for it and for vector
long double, since the target machine isn't visible in the old place.
This serendipitously removed a not-pertinent warning about 'long'
being deprecated when used with 'vector', when "vector long double" is
encountered and we just want to issue an error. The existing tests
test/Parser/altivec.c and test/Parser/cxx-altivec.cpp have been
updated accordingly, and I've added test/Parser/vsx.c to verify that
"vector double" is now legitimate with VSX enabled.
There is a companion patch for LLVM.
llvm-svn: 220989
Summary:
When we are adding field paddings for ASan, even an empty dtor has to remain in the code,
so we ignore -mconstructor-aliases if the paddings are going to be added.
Test Plan: added a test
Reviewers: rsmith, rnk, rafael
Reviewed By: rafael
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D6038
llvm-svn: 220986
Reuse the PPC64 HVA detection algorithm for ARM and AArch64. This is a
nice code deduplication, since they are roughly identical. A few virtual
method extension points are needed to understand how big an HVA can be
and what element types it can have for a given architecture.
Also make the record expansion code work in the presence of non-virtual
bases.
Reviewed By: uweigand, asl
Differential Revision: http://reviews.llvm.org/D6045
llvm-svn: 220972
The Windows NT SDK uses __readfsdword and declares it as a compiler-provided
builtin (#pragma intrinsic(__readfsdword)). Because intrin.h is not referenced
by winnt.h, it is not possible to provide an out-of-line definition for the
intrinsic. Provide a proper compiler builtin definition.
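A minimal usage sketch (the offset is illustrative):
unsigned long read_teb_slot(void) {
  // Reads a 32-bit value at the given offset from the FS segment base.
  return __readfsdword(0x18);
}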
llvm-svn: 220859
Following the NVVM IR specifications, arguments of aggregate type should be
passed on the stack without splitting (byval).
http://reviews.llvm.org/D6020
Patch by Jacques Pienaar.
llvm-svn: 220854
As discussed in bug 21398, PowerPC ABI code needs to consider C++ base
classes when classifying a class as homogeneous aggregate (or not) for
ABI purposes.
llvm-svn: 220852
An updated implementation of VLA type capturing, based on the previously committed solution for lambdas.
This version captures the whole VLA type instead of the particular variables which are part of the VLA size expression, and allows using the previously calculated size of the VLA type in captured regions. Required for OpenMP.
Differential Revision: http://reviews.llvm.org/D5099
llvm-svn: 220850
Summary:
We should avoid tail padding not only if the last field
has zero size, but also if the last field is a struct with a flexible array.
If/when http://reviews.llvm.org/D5478 is committed,
this will also handle the case of structs with zero-sized arrays.
Reviewers: majnemer, rsmith
Reviewed By: rsmith
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D5924
llvm-svn: 220708
Wire it through everywhere we have support for fastcall, essentially.
This allows us to parse the MSVC "14" CTP headers, but we will
miscompile them because LLVM doesn't support __vectorcall yet.
Reviewed By: Aaron Ballman
Differential Revision: http://reviews.llvm.org/D5808
llvm-svn: 220573
Summary:
This allows us to easily identify them in the backend which in turn allows us
to handle them correctly for big-endian targets (where they must be shifted
into the upper bits of the register).
Depends on D5961
Reviewers: atanasyan
Reviewed By: atanasyan
Subscribers: cfe-commits, theraven
Differential Revision: http://reviews.llvm.org/D5962
llvm-svn: 220566
Summary:
Ensure all integral/enumeration types are appropriately annotated with
signext/zeroext. In particular, i32 now has these attributes when using the
N32/N64 ABI. This paves the way for accurately representing the way the
N32/N64 ABI's promotes integer arguments to i64.
Reviewers: atanasyan
Reviewed By: atanasyan
Subscribers: cfe-commits, theraven
Differential Revision: http://reviews.llvm.org/D5961
llvm-svn: 220563
When SanitizerBlacklist decides if the SourceLocation is blacklisted,
we need to first turn it into a SpellingLoc before fetching the filename
and scanning "src:" entries. Otherwise we will fail to fecth the
correct filename for function definitions coming from macro expansion.
llvm-svn: 220403
This reverts commit r220169 which reverted r220153. However, it also
contains additional changes:
- We may need to add padding *after* we've packed the struct. This
occurs when the aligned next field offset is greater than the new
field's offset. When this occurs, we make the struct packed.
*However*, once packed the next field offset might be less than the
new field's offset. It is in this case that we might further pad the
struct.
- We would pad structs which were perfectly sized! This behavior is
immensely old. This behavior came from blindly subtracting
NextFieldOffsetInChars from RecordSize. This doesn't take into
account the fact that the struct might have a greater overall
alignment than the last field.
llvm-svn: 220175
This commit caused two tests in LNT to regress. I'm able to reproduce on
any platform and will send reproduction steps to the original commit
log. This should restore the LNT bots that have been failing.
llvm-svn: 220169
a NaN-test prior to the call to the library function.
This should automatically make fastmath (including just non-NaNs) able to avoid
the expensive libcalls and also open the door to more advanced folding in LLVM
based on the rules for complex math.
Two important notes to remember: first is that this isn't yet a proper
limited range mode, it's still just improving the unlimited range mode.
Also, it isn't really perfect w.r.t. what an unlimited range mode
should be doing because it isn't quite handling the flags produced by
all the operations in the way desirable for that mode, but then neither
is compiler-rt's libcall. When the compiler-rt libcall is improved to
carefully manage flags, the code emitted here should be improved
correspondingly. And it is still a long-term desirable thing to add
a limited range mode to Clang that would be able to use direct math
without library calls here.
Special thanks to Steve Canon for the careful review on this patch and
teaching me about these issues. =D
Differential Revision: http://reviews.llvm.org/D5756
llvm-svn: 220167
Before, ConstStructBuilder::AppendBytes would check packed constraints
prior to padding being added before the field's offset. However, adding
this padding might force our struct to be packed. Because we wouldn't
check *after* adding padding, ConstStructBuilder would be in an
inconsistent state leading to a crash.
This fixes PR21300.
llvm-svn: 220153
This commit changes the way we blacklist global variables in ASan.
Now the global is excluded from instrumentation (either regular
bounds checking, or initialization-order checking) if:
1) Global is explicitly blacklisted by its mangled name.
This part is left unchanged.
2) SourceLocation of a global is in blacklisted source file.
This changes the old behavior, where instead of looking at the
SourceLocation of a variable we simply considered llvm::Module
identifier. This was wrong, as the identifier may not correspond to
the file name, and we incorrectly disabled instrumentation
for globals coming from #include'd files.
3) Global is blacklisted by type.
Now we build the type of a global variable using Clang machinery
(QualType::getAsString()), instead of llvm::StructType::getName().
After this commit, the active users of ASan blacklist files
may have to revisit them (this is a backwards-incompatible change).
llvm-svn: 220097
This commit changes the way we blacklist functions in ASan, TSan,
MSan and UBSan. We used to treat function as "blacklisted"
and turned off instrumentation in it in two cases:
1) Function is explicitly blacklisted by its mangled name.
This part is not changed.
2) Function is located in llvm::Module, whose identifier is
contained in the list of blacklisted sources. This is completely
wrong, as the llvm::Module may not correspond to the actual source
file the function is defined in. Also, a function can be defined in
a header, in which case the user had to blacklist the .cpp file
this header was #include'd into, not the header itself.
Such functions could cause other problems - for instance, if the
header was included in multiple source files, compiled
separately and linked into a single executable, we could end up
with both instrumented and non-instrumented versions of the same
function participating in the same link.
After this change we will make blacklisting decision based on
the SourceLocation of a function definition. If a function is
not explicitly defined in the source file (for example, the
function is compiler-generated and responsible for
initialization/destruction of a global variable), then it will
be blacklisted if the corresponding global variable is defined
in blacklisted source file, and will be instrumented otherwise.
After this commit, the active users of blacklist files may have
to revisit them. This is a backwards-incompatible change, but
I don't think it's possible or makes sense to support the
old incorrect behavior.
I plan to make similar change for blacklisting GlobalVariables
(which is ASan-specific).
llvm-svn: 219997
Summary:
The general approach is to add extra paddings after every field
in AST/RecordLayoutBuilder.cpp, then add code to CTORs/DTORs that poisons the paddings
(CodeGen/CGClass.cpp).
Everything is done under the flag -fsanitize-address-field-padding.
The blacklist file (-fsanitize-blacklist) makes it possible to avoid the
transformation for given classes or source files.
See also https://code.google.com/p/address-sanitizer/wiki/IntraObjectOverflow
Test Plan: run SPEC2006 and some of the Chromium tests with -fsanitize-address-field-padding
Reviewers: samsonov, rnk, rsmith
Reviewed By: rsmith
Subscribers: majnemer, cfe-commits
Differential Revision: http://reviews.llvm.org/D5687
llvm-svn: 219961
They cannot be written to, so marking them const makes sense and may improve
optimisation.
As a side-effect, SectionInfos has to be moved from Sema to ASTContext.
It also fixes this problem, which occurs when compiling ATL:
warning LNK4254: section 'ATL' (C0000040) merged into '.rdata' (40000040) with different attributes
The ATL headers are putting variables in a special section that's marked
read-only. However, Clang currently can't model that read-onlyness in the IR.
But, by making the variables const, the section does become read-only, and
the linker warning is avoided.
Differential Revision: http://reviews.llvm.org/D5812
llvm-svn: 219960
CodeGen wouldn't mark the aliasee as thread_local if the aliasee was a
tentative definition.
Even if the definition was already emitted, it would never mark the
alias as thread_local.
This fixes PR21288.
llvm-svn: 219859
Thumb1 has legitimate reasons for preferring 32-bit alignment of types
i1/i8/i16, since the 16-bit encoding of "add rD, sp, #imm" requires #imm to be
a multiple of 4. However, this is a trade-off between code size and RAM usage;
the DataLayout string is not the best place to represent it even if desired.
So this patch removes the extra Thumb requirements, hopefully making ARM and
Thumb completely compatible in this respect.
llvm-svn: 219735
Before, ARM and Thumb mode code had different preferred alignments, which could
lead to some rather unexpected results. There's justification for reducing it
from the default 64-bits (wasted space), but I don't think there is for going
below 32-bits.
There's no actual ABI change here, just to reassure people.
llvm-svn: 219720
This addresses a regression introduced with SVN r219393. A block may be
contained within another block. In such a scenario, we would end up within a
BlockDecl, which is not a NamedDecl (as the names are synthesised). The cast to
a NamedDecl of the DeclContext would then assert as the types are unrelated.
Restore the mangling behaviour to that prior to SVN r219393. If the current
block is contained within a BlockDecl, walk up to the parent DeclContext,
recursively, until we have a non-BlockDecl. This is expected to be a NamedDecl.
Add in a couple of asserts to ensure that the assumption that we only encounter
a block within a NamedDecl or a BlockDecl holds.
llvm-svn: 219696
Previously, loop hints such as #pragma loop vectorize_width(#) required a constant. This patch allows a constant expression to be used as well, such as a non-type template parameter or an expression like (2 * c + 1).
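A minimal sketch using a constant expression (assuming the #pragma clang loop spelling):
enum { C = 2 };
void scale(float *a, int n) {
// The width argument is a constant expression, not a bare literal.
#pragma clang loop vectorize_width(2 * C)
  for (int i = 0; i < n; ++i)
    a[i] *= 2.0f;
}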
Reviewed by Richard Smith
llvm-svn: 219589
and !=) to support mixed complex and real operand types.
This requires removing an assert from SemaChecking, and adding support
both to the constant evaluator and the code generator to synthesize the
imaginary part when needed. This seemed somewhat cleaner than having
just the comparison operators force real-to-complex conversions.
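A minimal sketch of the newly supported mixed comparison:
#include <complex.h>
int is_real_value(double _Complex z, double x) {
  // Compares a complex operand against a real one; the imaginary part
  // of the real operand is synthesized as zero.
  return z == x;
}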
I've added test cases for these operations. I'm really terrified that
there were *no* tests in-tree which exercised this.
This turned up when trying to build R after my change to the complex
type lowering.
llvm-svn: 219570
for complex math.
This should fix the windows build bots that started having trouble here
and generally fix complex libcall emission on targets which use sret for
complex data types. It also makes the code a bit simpler (despite
calling into a much more complex bucket of code).
llvm-svn: 219565
operators where one type is a C complex type, and to emit both the
efficient and correct implementation for complex arithmetic according to
C11 Annex G using this extra information.
For both multiply and divide the old code was writing a long-hand
reduced version of the math without any of the special handling of inf
and NaN recommended by the standard here. Instead of putting more
complexity here, this change does what GCC does which is to emit
a libcall for the fully general case.
However, the old code also failed to do the proper minimization of the
set of operations when there was a mixed complex and real operation. In
those cases, C provides a spec for much more minimal operations that are
valid. Clang now emits the exact suggested operations. This change isn't
*just* about performance though, without minimizing these operations, we
again lose the correct handling of infinities and NaNs. It is critical
that this happens in the frontend based on asymmetric type operands to
complex math operations.
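A minimal sketch of the two cases, assuming the usual libcall lowering:
#include <complex.h>
double _Complex scale(double _Complex z, double x) {
  // Mixed complex * real: only the minimal pair of multiplies is emitted,
  // which also preserves correct handling of infinities and NaNs.
  return z * x;
}
double _Complex mul(double _Complex a, double _Complex b) {
  // Fully general complex * complex: emitted as a call to __muldc3 so the
  // Annex G special cases are handled by the runtime.
  return a * b;
}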
The performance implications of this change aren't trivial either. I've
run a set of benchmarks in Eigen, an open source mathematics library
that makes heavy use of complex. While a few have slowed down due to the
libcall being introduced, most sped up, and some by a huge amount: up to
100% and 140%.
In order to make all of this work, also match the algorithm in the
constant evaluator to the one in the runtime library. Currently it is
a broken port of the simplifications from C's Annex G to the long-hand
formulation of the algorithm.
Splitting this patch up is very hard because none of this works without
the AST change to preserve non-complex operands. Sorry for the enormous
change.
Follow-up changes will include support for sinking the libcalls onto
cold paths in common cases and fastmath improvements to allow more
aggressive backend folding.
Differential Revision: http://reviews.llvm.org/D5698
llvm-svn: 219557