llvm-project

Commit Graph

Author	SHA1	Message	Date
Reid Kleckner	b04cb9ab7a	[MS] Add support for __ud2 and __int2c MSVC intrinsics This was requested in PR31958 and elsewhere. llvm-svn: 297057	2017-03-06 19:43:16 +00:00
Dean Michael Berris	418da3fe80	[XRay] [clang] Allow logging the first argument of a function call. Summary: Functions with the "xray_log_args" attribute will tell LLVM to emit a special XRay sled for compiler-rt to copy any call arguments to your logging handler. Reviewers: dberris Reviewed By: dberris Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D29704 llvm-svn: 296999	2017-03-06 07:08:21 +00:00
Erich Keane	2fe684bb14	Allow attributes before union definition permits typedef union __attribute__((transparent_union)) {...} Differential Revision: https://reviews.llvm.org/D28266 llvm-svn: 296518	2017-02-28 20:44:39 +00:00
Vedant Kumar	42de380765	[ubsan] Detect signed overflow UB in remainder operations Teach ubsan to diagnose remainder operations which have undefined behavior due to signed overflow (e.g INT_MIN % -1). Differential Revision: https://reviews.llvm.org/D29437 llvm-svn: 296214	2017-02-25 00:43:39 +00:00
Vedant Kumar	82ee16beb8	[ubsan] Omit superflous overflow checks for promoted arithmetic (PR20193) C requires the operands of arithmetic expressions to be promoted if their types are smaller than an int. Ubsan emits overflow checks when this sort of type promotion occurs, even if there is no way to actually get an overflow with the promoted type. This patch teaches clang how to omit the superflous overflow checks (addressing PR20193). Testing: check-clang and check-ubsan. Differential Revision: https://reviews.llvm.org/D29369 llvm-svn: 296213	2017-02-25 00:43:36 +00:00
Alex Lorenz	e3e8131460	NFC, Add a test that ensure that we don't emit helper code in copy/dispose routines for variables that are const-captured This is a preparation commit that improves code-coverage in code that emits block copy/dispose routines. llvm-svn: 296040	2017-02-23 23:41:50 +00:00
George Burgess IV	0d6592a899	[CodeGen] Don't reemit expressions for pass_object_size params. This fixes an assertion failure in cases where we had expression statements that declared variables nested inside of pass_object_size args. Since we were emitting the same ExprStmt twice (once for the arg, once for the @llvm.objectsize call), we were getting issues with redefining locals. This also means that we can be more lax about when we emit @llvm.objectsize for pass_object_size args: since we're reusing the arg's value itself, we don't have to care so much about side-effects. llvm-svn: 295935	2017-02-23 05:59:56 +00:00
George Burgess IV	8856aa9a54	Call the correct @llvm.objectsize. The following code would crash clang: void foo(unsigned const __attribute__((pass_object_size(0)))); void bar(unsigned i) { foo(i); } This is because we were always selecting the version of `@llvm.objectsize` that takes an i8* in CodeGen. Passing an i32* as an i8* makes LLVM very unhappy. (Yes, I'm surprised that this remained uncaught for so long, too. :) ) As an added bonus, we'll now also use the appropriate address space when emitting @llvm.objectsize calls. llvm-svn: 295805	2017-02-22 02:35:51 +00:00
Jacob Gravelle	40aefb5fe0	Declare lgamma library builtins as never being const Summary: POSIX requires lgamma writes to an external global variable, signgam. This prevents annotating lgamma with readnone, which is incorrect on targets that write to signgam. Reviewers: efriedma, rsmith Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D29778 llvm-svn: 295781	2017-02-21 22:37:27 +00:00
Tim Shen	d4ba2f2336	[XRay] Merge xray clang flag tests, and add powerpc64le. Summary: I'm not sure why they were in different files, but it's kind of harder to maintain. I create this patch partially for initiate a discussion. Reviewers: dberris Subscribers: nemanjai, cfe-commits Differential Revision: https://reviews.llvm.org/D30118 llvm-svn: 295778	2017-02-21 22:30:00 +00:00
Craig Topper	117892098a	[X86] Replace XOP vpcmov builtins with native vector logical operations. llvm-svn: 295570	2017-02-18 21:15:30 +00:00
Vedant Kumar	34b1fd6aaa	Retry^2: [ubsan] Reduce null checking of C++ object pointers (PR27581) This patch teaches ubsan to insert exactly one null check for the 'this' pointer per method/lambda. Previously, given a load of a member variable from an instance method ('this->x'), ubsan would insert a null check for 'this', and another null check for '&this->x', before allowing the load to occur. Similarly, given a call to a method from another method bound to the same instance ('this->foo()'), ubsan would a redundant null check for 'this'. There is also a redundant null check in the case where the object pointer is a reference ('Ref.foo()'). This patch teaches ubsan to remove the redundant null checks identified above. Testing: check-clang, check-ubsan, and a stage2 ubsan build. I also compiled X86FastISel.cpp with -fsanitize=null using patched/unpatched clangs based on r293572. Here are the number of null checks emitted: ------------------------------------- \| Setup \| # of null checks \| ------------------------------------- \| unpatched, -O0 \| 21767 \| \| patched, -O0 \| 10758 \| ------------------------------------- Changes since the initial commit: - Don't introduce any unintentional object-size or alignment checks. - Don't rely on IRGen of C labels in the test. Differential Revision: https://reviews.llvm.org/D29530 llvm-svn: 295515	2017-02-17 23:22:59 +00:00
Vedant Kumar	29ba8d9bfe	Revert "Retry: [ubsan] Reduce null checking of C++ object pointers (PR27581)" This reverts commit r295401. It breaks the ubsan self-host. It inserts object size checks once per C++ method which fire when the structure is empty. llvm-svn: 295494	2017-02-17 20:59:40 +00:00
Vedant Kumar	55875b9955	Retry: [ubsan] Reduce null checking of C++ object pointers (PR27581) This patch teaches ubsan to insert exactly one null check for the 'this' pointer per method/lambda. Previously, given a load of a member variable from an instance method ('this->x'), ubsan would insert a null check for 'this', and another null check for '&this->x', before allowing the load to occur. Similarly, given a call to a method from another method bound to the same instance ('this->foo()'), ubsan would a redundant null check for 'this'. There is also a redundant null check in the case where the object pointer is a reference ('Ref.foo()'). This patch teaches ubsan to remove the redundant null checks identified above. Testing: check-clang and check-ubsan. I also compiled X86FastISel.cpp with -fsanitize=null using patched/unpatched clangs based on r293572. Here are the number of null checks emitted: ------------------------------------- \| Setup \| # of null checks \| ------------------------------------- \| unpatched, -O0 \| 21767 \| \| patched, -O0 \| 10758 \| ------------------------------------- Changes since the initial commit: don't rely on IRGen of C labels in the test. Differential Revision: https://reviews.llvm.org/D29530 llvm-svn: 295401	2017-02-17 02:03:51 +00:00
Vedant Kumar	4f94a94bea	Revert "[ubsan] Reduce null checking of C++ object pointers (PR27581)" This reverts commit r295391. It breaks this bot: http://lab.llvm.org:8011/builders/clang-with-thin-lto-ubuntu/builds/1898 I need to not rely on labels in the IR test. llvm-svn: 295396	2017-02-17 01:42:36 +00:00
Vedant Kumar	3e5a9a6be8	[ubsan] Reduce null checking of C++ object pointers (PR27581) This patch teaches ubsan to insert exactly one null check for the 'this' pointer per method/lambda. Previously, given a load of a member variable from an instance method ('this->x'), ubsan would insert a null check for 'this', and another null check for '&this->x', before allowing the load to occur. Similarly, given a call to a method from another method bound to the same instance ('this->foo()'), ubsan would a redundant null check for 'this'. There is also a redundant null check in the case where the object pointer is a reference ('Ref.foo()'). This patch teaches ubsan to remove the redundant null checks identified above. Testing: check-clang and check-ubsan. I also compiled X86FastISel.cpp with -fsanitize=null using patched/unpatched clangs based on r293572. Here are the number of null checks emitted: ------------------------------------- \| Setup \| # of null checks \| ------------------------------------- \| unpatched, -O0 \| 21767 \| \| patched, -O0 \| 10758 \| ------------------------------------- Differential Revision: https://reviews.llvm.org/D29530 llvm-svn: 295391	2017-02-17 01:05:42 +00:00
Craig Topper	f0d1147fae	[AVX-512] Replace 512-bit masked packss/packus builtins and replace with new unmasked builtins. These new unmasked builtins will enable us to easily support optimizing these builtins in InstCombine in the backend. llvm-svn: 295291	2017-02-16 06:32:07 +00:00
Evgeniy Stepanov	287b04b000	Add missing regexp quantifiers in a test. llvm-svn: 295267	2017-02-16 01:35:23 +00:00
Sagar Thakur	9d0ed930ec	[XRAY][MIPS] Add -fxray-instrument for mips/mipsel/mips64/mips64el Summary: Adds xray instrument option for mips/mipsel/mips64/mips64el. Reviewed by sdardis, dberris Differential: D27698 llvm-svn: 295163	2017-02-15 10:41:38 +00:00
Reid Kleckner	fb9f647e5f	MS inline asm: Filter MXCSR out of the inferred clobber list Since r295004, LLVM has started modelling this new register, but we don't have GCC constraint inline asm spellings for it yet. llvm-svn: 295107	2017-02-14 21:38:17 +00:00
Dylan McKay	315edb0216	[AVR] Fix __AVR_xxx macro definitions; authored by Peter Wu Summary: The -mmcu option for GCC sets macros like __AVR_ATmega328P__ (with the trailing underscores), be sure to include these underscores for Clangs -mcpu option. See "AVR Built-in Macros" in https://gcc.gnu.org/onlinedocs/gcc/AVR-Options.html Reviewers: jroelofs, dylanmckay Reviewed By: jroelofs, dylanmckay Subscribers: efriedma, cfe-commits Differential Revision: https://reviews.llvm.org/D29817 llvm-svn: 294869	2017-02-11 21:06:07 +00:00
Richard Trieu	f11ab87ba3	Move test include file from include/ to Inputs/ The Inputs/ directory is the recommended location for extra files for test cases. No functional change. llvm-svn: 294815	2017-02-11 00:52:01 +00:00
George Burgess IV	f9013bf8f0	Don't let EvaluationModes dictate whether an invalid base is OK What we want to actually control this behavior is something more local than an EvalutationMode. Please see the linked revision for more discussion on why/etc. This fixes PR31843. Differential Revision: https://reviews.llvm.org/D29469 llvm-svn: 294800	2017-02-10 22:52:29 +00:00
Eric Christopher	f6ee1f3d69	Temporarily revert "For X86-64 linux and PPC64 linux align int128 to 16 bytes." until we can get better TargetMachine::isCompatibleDataLayout to compare - otherwise we can't code generate existing bitcode without a string equality data layout. This reverts commit r294703. llvm-svn: 294708	2017-02-10 04:35:21 +00:00
Eric Christopher	4855ba8f24	For X86-64 linux and PPC64 linux align int128 to 16 bytes. For other platforms we should find out what they need and likely make the same change, however, a smaller additional change is easier for platforms we know have it specified in the ABI. clang support for r294702 llvm-svn: 294703	2017-02-10 03:32:34 +00:00
Amjad Aboud	546bc1103b	[DebugInfo] Added support to Clang FE for generating debug info for preprocessor macros. Added "-fdebug-macro" flag (and "-fno-debug-macro" flag) to enable (and to disable) emitting macro debug info. Added CC1 "-debug-info-macro" flag that enables emitting macro debug info. Differential Revision: https://reviews.llvm.org/D16135 llvm-svn: 294637	2017-02-09 22:07:24 +00:00
Reid Kleckner	04f9f91da6	[MS] Implement the __fastfail intrinsic as a builtin __fastfail terminates the process immediately with a special system call. It does not run any process shutdown code or exception recovery logic. Fixes PR31854 llvm-svn: 294606	2017-02-09 18:31:06 +00:00
Craig Topper	41cb8ffcc1	[X86] Fix copy and paste bug in clzero test from r294559. llvm-svn: 294560	2017-02-09 06:22:43 +00:00
Craig Topper	4574226c3f	[X86] Clzero flag addition and inclusion under znver1 1. Adds the command line flag for clzero. 2. Includes the clzero flag under znver1. 3. Defines the macro for clzero. 4. Adds a new file which has the intrinsic definition for clzero instruction. Patch by Ganesh Gopalasubramanian with some additional tests from me. Differential revision: https://reviews.llvm.org/D29386 llvm-svn: 294559	2017-02-09 06:10:14 +00:00
Dylan McKay	e8232d73f5	[AVR] Add support for the 'interrupt' and 'naked' attributes Summary: This teaches clang how to parse and lower the 'interrupt' and 'naked' attributes. This allows interrupt signal handlers to be written. Reviewers: aaron.ballman Subscribers: malcolm.parsons, cfe-commits Differential Revision: https://reviews.llvm.org/D28451 llvm-svn: 294402	2017-02-08 05:09:26 +00:00
Dylan McKay	ecb6e7b83c	Revert "Revert "[AVR] Allow specifying the CPU on the command line"" This reverts commit 7ac30e0f839fdab6d723ce2ef6a5b7a4cf03d150. llvm-svn: 294282	2017-02-07 06:04:18 +00:00
Diana Picus	37a2d6d699	Revert "[AVR] Allow specifying the CPU on the command line" This reverts commit r294177. It seems to have broken some buildbots. llvm-svn: 294180	2017-02-06 11:35:42 +00:00
Dylan McKay	8464c9b579	[AVR] Allow specifying the CPU on the command line Summary: This tells clang about all of the different AVR microcontrollers. It also adds code to define the correct preprocessor macros for each device. Reviewers: jroelofs, asl Reviewed By: asl Subscribers: asl, cfe-commits Differential Revision: https://reviews.llvm.org/D28346 llvm-svn: 294177	2017-02-06 09:07:56 +00:00
Dylan McKay	d31534cd3a	[AVR] Add support for the full set of inline asm constraints Summary: Previously the method would simply return false, causing every single inline assembly constraint to trigger a compile error. This adds inline assembly constraint support for the AVR target. This patch is derived from the code in AVRISelLowering::getConstraintType. More details can be found on the AVR-GCC reference wiki http://www.nongnu.org/avr-libc/user-manual/inline_asm.html Reviewers: jroelofs, asl Reviewed By: asl Subscribers: asl, ahatanak, saaadhu, cfe-commits Differential Revision: https://reviews.llvm.org/D28344 llvm-svn: 294176	2017-02-06 09:01:59 +00:00
Coby Tayree	c0fb36f442	[X86][MS]Adjacent comments within multi-line inline assembly statement Allowing adjacent comments within MS inline assembly multi-line statement Differential Revision: https://reviews.llvm.org/D28989 llvm-svn: 294120	2017-02-05 10:23:06 +00:00
Sanjay Patel	3bd28f85d5	[x86] fix tests with wrong dependency to pass because they broke with r294049 llvm-svn: 294058	2017-02-03 22:03:47 +00:00
Davide Italiano	8236b73dee	[CodeGen] Update test after recent changes in llvm (r293846). llvm-svn: 293848	2017-02-02 00:47:53 +00:00
Davide Italiano	0449607738	[CodeGen] Update test after recent changes in llvm (r293799). llvm-svn: 293810	2017-02-01 20:43:28 +00:00
Nirav Dave	0c86ccf4b4	[X86] Teach Clang about -mfentry flag Replace mcount calls with calls to fentry. Reviewers: hfinkel, craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28001 llvm-svn: 293649	2017-01-31 17:00:35 +00:00
Vedant Kumar	d3a601b06b	Re-apply "[ubsan] Sanity-check shift amounts before truncation" This re-applies r293343 (reverts commit r293475) with a fix for an assertion failure caused by a missing integer cast. I tested this patch by using the built compiler to compile X86FastISel.cpp.o with ubsan. Original commit message: Ubsan does not report UB shifts in some cases where the shift exponent needs to be truncated to match the type of the shift base. We perform a range check on the truncated shift amount, leading to false negatives. Fix the issue (PR27271) by performing the range check on the original shift amount. Differential Revision: https://reviews.llvm.org/D29234 llvm-svn: 293572	2017-01-30 23:38:54 +00:00
Alex Lorenz	94c26be581	Revert "r293343 - [ubsan] Sanity-check shift amounts before truncation (fixes PR27271)" After r293343 clang fails to compile itself with -fsanitize=undefined ( http://lab.llvm.org:8080/green/job/clang-stage2-cmake-RgSan_build/). rdar://30259929 llvm-svn: 293475	2017-01-30 11:37:18 +00:00
Vedant Kumar	3db9974b2d	[ubsan] Sanity-check shift amounts before truncation (fixes PR27271) Ubsan does not report UB shifts in some cases where the shift exponent needs to be truncated to match the type of the shift base. We perform a range check on the truncated shift amount, leading to false negatives. Fix the issue (PR27271) by performing the range check on the original shift amount. Differential Revision: https://reviews.llvm.org/D29234 llvm-svn: 293343	2017-01-27 23:02:44 +00:00
Peter Collingbourne	7af93e31fe	Add missing x86 requirement. llvm-svn: 293210	2017-01-26 21:38:48 +00:00
Peter Collingbourne	f5d1712189	IRGen: When loading the main module in the distributed ThinLTO backend, look for the module containing the summary. Differential Revision: https://reviews.llvm.org/D29067 llvm-svn: 293209	2017-01-26 21:09:48 +00:00
Adam Nemet	1a89f25b6e	Further fixes to test from r293146 Require aarch64 and avoid filename in YAML since it may require quotation. llvm-svn: 293149	2017-01-26 04:34:07 +00:00
Adam Nemet	3071266f6f	Fix test from r293146 llvm-svn: 293147	2017-01-26 04:14:04 +00:00
Adam Nemet	7b796f825b	Support MIR opt-remarks with -fsave-optimization-record The handler that deals with IR passed/missed/analysis remarks is extended to also handle the corresponding MIR remarks. The more thorough testing in done via llc (rL293113, rL293121). Here we just make sure that the functionality is accessible through clang. llvm-svn: 293146	2017-01-26 04:07:11 +00:00
Akira Hatanaka	151d339125	Fix test case committed in r293106 so that it passes on targets whose pointers are 4-bytes instead of 8-bytes. llvm-svn: 293111	2017-01-25 23:36:15 +00:00
Akira Hatanaka	cb904604c9	Remove the return type from the check string in test case. Bots were failing because some targets emit signext before i32. llvm-svn: 293108	2017-01-25 23:16:32 +00:00
Akira Hatanaka	fdcd18b4c9	[CodeGen] Suppress emission of lifetime markers if a label has been seen in the current lexical scope. clang currently emits the lifetime.start marker of a variable when the variable comes into scope even though a variable's lifetime starts at the entry of the block with which it is associated, according to the C standard. This normally doesn't cause any problems, but in the rare case where a goto jumps backwards past the variable declaration to an earlier point in the block (see the test case added to lifetime2.c), it can cause mis-compilation. To prevent such mis-compiles, this commit conservatively disables emitting lifetime variables when a label has been seen in the current block. This problem was discussed on cfe-dev here: http://lists.llvm.org/pipermail/cfe-dev/2016-July/050066.html rdar://problem/30153946 Differential Revision: https://reviews.llvm.org/D27680 llvm-svn: 293106	2017-01-25 22:55:13 +00:00
Tim Shen	fd1e5aa8df	[APFloat] Switch from (PPCDoubleDoubleImpl, IEEEdouble) layout to (IEEEdouble, IEEEdouble) Summary: This patch changes the layout of DoubleAPFloat, and adjust all operations to do either: 1) (IEEEdouble, IEEEdouble) -> (uint64_t, uint64_t) -> PPCDoubleDoubleImpl, then run the old algorithm. 2) Do the right thing directly. 1) includes multiply, divide, remainder, mod, fusedMultiplyAdd, roundToIntegral, convertFromString, next, convertToInteger, convertFromAPInt, convertFromSignExtendedInteger, convertFromZeroExtendedInteger, convertToHexString, toString, getExactInverse. 2) includes makeZero, makeLargest, makeSmallest, makeSmallestNormalized, compare, bitwiseIsEqual, bitcastToAPInt, isDenormal, isSmallest, isLargest, isInteger, ilogb, scalbn, frexp, hash_value, Profile. I could split this into two patches, e.g. use 1) for all operatoins first, then incrementally change some of them to 2). I didn't do that, because 1) involves code that converts data between PPCDoubleDoubleImpl and (IEEEdouble, IEEEdouble) back and forth, and may pessimize the compiler. Instead, I find easy functions and use approach 2) for them directly. Next step is to implement move multiply and divide from 1) to 2). I don't have plans for other functions in 1). Differential Revision: https://reviews.llvm.org/D27872 llvm-svn: 292839	2017-01-23 22:39:35 +00:00
Tim Shen	867be0d14c	[Altivec] Change vec_sl to a << (b % (sizeof(a) * 8)) For a << b (as original vec_sl does), if b >= sizeof(a) * 8, the behavior is undefined. However, Power instructions do define the behavior, which is equivalent to a << (b % (sizeof(a) * 8)). This patch changes altivec.h to use a << (b % (sizeof(a) * 8)), to ensure the consistent semantic of the instructions. Then it combines the generated multiple instructions back to a single shift. This patch handles left shift only. Right shift, on the other hand, is more complicated, considering arithematic/logical right shift. Differential Revision: https://reviews.llvm.org/D28037 llvm-svn: 292659	2017-01-20 22:05:33 +00:00
Craig Topper	367c86ddbe	[AVX-512] Replace subvector broadcast builtins with shufflevectors and selects. Verified that the backend codegens this equally well. llvm-svn: 292329	2017-01-18 02:17:10 +00:00
Dan Gohman	0c5954195b	[WebAssembly] Update grow_memory's return type. The grow_memory instruction now returns the previous memory size. Add the return type to the clang intrinsic. llvm-svn: 292324	2017-01-18 01:03:35 +00:00
Dehao Chen	1ef69d8eb0	Temporarily revert the test change in 291870, which is broken in certain buildbots. llvm-svn: 291874	2017-01-13 01:09:43 +00:00
Dehao Chen	a1bd2d6585	Pass -fprofile-sample-use to lto backends. Summary: LTO backend will not invoke SampleProfileLoader pass even if -fprofile-sample-use is specified. This patch passes the flag down so that pass manager can add the SampleProfileLoader pass correctly. Reviewers: mehdi_amini, tejohnson Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28588 llvm-svn: 291870	2017-01-13 00:51:55 +00:00
Anna Zaks	e43b4fc0ae	[tsan] Do not report errors in __destroy_helper_block_ There is a synchronization point between the reference count of a block dropping to zero and it's destruction, which TSan does not observe. Do not report errors in the compiler-emitted block destroy method and everything called from it. This is similar to https://reviews.llvm.org/D25857 Differential Revision: https://reviews.llvm.org/D28387 llvm-svn: 291868	2017-01-13 00:50:50 +00:00
Chandler Carruth	9d25111a4a	Fix two test cases I missed updating in r291850. Sorry for the noise. llvm-svn: 291853	2017-01-12 22:48:28 +00:00
Chandler Carruth	7e8283ae6b	Replace some stray uses of the old spelling of the flag with the new spelling. NFC. llvm-svn: 291851	2017-01-12 22:43:37 +00:00
Eli Friedman	6503f24da8	Add additional testcases for nsw markings on ++ and --. clang has generated correct IR for char/short decrement since r126816, but we didn't have any test coverage for decrement. Patch by Andrew Rogers. llvm-svn: 291805	2017-01-12 19:51:44 +00:00
Dehao Chen	37c79c236d	Revert r291774 which caused buildbot failure. llvm-svn: 291775	2017-01-12 16:56:18 +00:00
Dehao Chen	bd3689de91	Pass -fprofile-sample-use to lto backends. Summary: LTO backend will not invoke SampleProfileLoader pass even if -fprofile-sample-use is specified. This patch passes the flag down so that pass manager can add the SampleProfileLoader pass correctly. Reviewers: mehdi_amini, tejohnson Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28588 llvm-svn: 291774	2017-01-12 16:29:25 +00:00
Tony Jiang	974e4c7899	[PowerPC] Fix the wrong implementation of builtin vec_rlnm. llvm-svn: 291702	2017-01-11 20:59:42 +00:00
Chad Rosier	c22abb3820	[ARM] Use generic bitreverse intrinsic, rather than ARM specific rbit. The backend already supports lowering this intrinsic to a rbit instruction. llvm-svn: 291582	2017-01-10 18:55:11 +00:00
Chad Rosier	5a4a1be690	[AArch64] Use generic bitreverse intrinsic, rather than AArch64 specific. Differential Revision: https://reviews.llvm.org/D28400 llvm-svn: 291574	2017-01-10 17:20:28 +00:00
Teresa Johnson	a8b5558f9c	[ThinLTO] Specify target triple in new test This should fix bot failures in this test. llvm-svn: 291310	2017-01-07 00:09:42 +00:00
Teresa Johnson	cffeb54fc9	[ThinLTO] Optionally ignore empty index file Summary: In order to simplify distributed build system integration, where actions may be scheduled before the Thin Link which determines the list of objects selected by the linker. The gold plugin currently will emit 0-sized index files for objects not selected by the link, to enable checking for expected output files by the build system. If the build system then schedules a backend action for these bitcode files, we want to be able to fall back to normal compilation instead of failing. Fallback is enabled under an option in LLVM (D28410), in which case a nullptr is returned from llvm::getModuleSummaryIndexForFile. Clang can just proceed with non-ThinLTO compilation in that case. I am investigating whether this can be addressed in our build system, but that is a longer term fix and so this enables a workaround in the meantime. Reviewers: mehdi_amini Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28362 llvm-svn: 291303	2017-01-06 23:37:33 +00:00
Mehdi Amini	7f873070c4	Add a cc1 option to force disabling lifetime-markers emission from clang Summary: This intended as a debugging/development flag only. Differential Revision: https://reviews.llvm.org/D28385 llvm-svn: 291300	2017-01-06 23:18:09 +00:00
Filipe Cabecinhas	fe5e5afd53	[ubsan] Minimize size of data for type_mismatch (Redo of D19667) Summary: This patch makes the type_mismatch static data 7 bytes smaller (and it ends up being 16 bytes smaller due to alignment restrictions, at least on some x86-64 environments). It revs up the type_mismatch handler version since we're breaking binary compatibility. I will soon post a patch for the compiler-rt side. Reviewers: rsmith, kcc, vitalybuka, pgousseau, gbedwell Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28242 llvm-svn: 291236	2017-01-06 14:40:12 +00:00
Saleem Abdulrasool	3f4ab5c0c6	CodeGen: address post commit review comments for r291123 This test would force the execution of the backend. However, the backend already has a test for this. Effectively, this was trying to test that an API call was made properly. We do not have a good way to really test this. The test itself tested very little. Addresses post-commit review comments from Eric Christopher. llvm-svn: 291208	2017-01-06 02:27:40 +00:00
Sean Fertile	222626564d	Remove the ppc insertword/extractword expected fail tests. llvm-svn: 291188	2017-01-05 22:54:34 +00:00
Sean Fertile	96d9e0ec05	Add vec_insert4b and vec_extract4b functions to altivec.h Add builtins for the functions and custom codegen mapping the builtins to their corresponding intrinsics and handling the endian related swapping. https://reviews.llvm.org/D26546 llvm-svn: 291179	2017-01-05 21:43:30 +00:00
Saleem Abdulrasool	16a6efe43d	test: add a requires registered target It seems that the ARM buildbots do not include x86 support. However, other x86 targets do not support the ARM target. Use a x86 triple and require the registered target. llvm-svn: 291142	2017-01-05 17:09:20 +00:00
Saleem Abdulrasool	7bf88b3c1f	test: add an explicit triple Not all targets use the integrated assembler. Specify a triple to ensure we use the integrated as for this. llvm-svn: 291125	2017-01-05 16:36:15 +00:00
Saleem Abdulrasool	888e289ed7	CodeGen: plumb header search down to the IAS inline assembly may use the `.include` directive to include other content into the file. Without the integrated assembler, the `-I` group gets passed to the assembler. Emulate this by collecting the header search paths and passing them to the IAS. Resolves PR24811! llvm-svn: 291123	2017-01-05 16:02:32 +00:00
Erich Keane	521ed960ed	Correct Vectorcall Register passing and HVA Behavior Front end component (back end changes are D27392). The vectorcall calling convention was broken subtly in two cases. First, it didn't properly handle homogeneous vector aggregates (HVAs). Second, the vectorcall specification requires that only the first 6 parameters be eligible for register assignment. This patch fixes both issues. Differential Revision: https://reviews.llvm.org/D27529 llvm-svn: 291041	2017-01-05 00:20:51 +00:00
George Burgess IV	7fb7e361bf	Re-add objectsize function/incomplete type checks. I accidentally omitted these when refactoring this code. This caused problems when building parts of the test-suite on MacOS. llvm-svn: 290916	2017-01-03 23:35:19 +00:00
Reid Kleckner	9bb64de0de	Relax CHECK line from r290906 llvm-svn: 290907	2017-01-03 21:29:51 +00:00
Reid Kleckner	d2ad9dfdb9	[Win64] Don't widen integer literal zero arguments to unprototyped function calls The special case to widen the integer literal zero when passed to variadic function calls should only apply to variadic functions, not unprototyped functions. This is consistent with what MSVC does. In this test case, MSVC uses a 4-byte store to pass the 5th argument to 'kr' and an 8-byte store to pass the zero to 'v': void v(int, ...); void kr(); void f(void) { v(1, 2, 3, 4, 0); kr(1, 2, 3, 4, 0); } Aaron Ballman discovered this issue in https://reviews.llvm.org/D28166 llvm-svn: 290906	2017-01-03 21:23:35 +00:00
Teresa Johnson	3f4c87d0b5	[ThinLTO] Add missing FileCheck invocation One of the intended checks was not being performed. llvm-svn: 290671	2016-12-28 16:45:37 +00:00
George Burgess IV	1a39b86d0f	[CodeGen] Unique constant CompoundLiterals. Our newly aggressive constant folding logic makes it possible for CGExprConstant to see the same CompoundLiteralExpr more than once. So, emitting a new GlobalVariable every time we see a CompoundLiteral is no longer correct. We had a similar issue with BlockExprs that was caught while testing said aggressive folding, so I applied the same style of fix (see D26410) here. If we find yet another case where this needs to happen, we should probably refactor this so we don't have a third DenseMap+getter+setter. As a design note: getAddrOfConstantCompoundLiteralIfEmitted is really only intended to be called by ConstExprEmitter::EmitLValue. So, returning a GlobalVariable* instead of a ConstantAddress costs us effectively nothing, and saves us either a few bytes per entry in our map or a bit of code duplication. llvm-svn: 290661	2016-12-28 07:27:40 +00:00
Michael Kuperstein	071345178b	Update test that relies on the optimizer to match new output. llvm-svn: 290642	2016-12-28 00:30:43 +00:00
Craig Topper	70536f4e47	[AVX-512] Replace masked 512-bit pmuldq and pmuludq builtins with the newly added unmasked versions and selects. llvm-svn: 290580	2016-12-27 04:04:57 +00:00
Craig Topper	c5ab78d4c3	Revert r290575 "[AVX-512] Replace masked 512-bit pmuldq and pmuludq builtins with the newly added unmasked versions and selects." I failed to merge this with r290574. llvm-svn: 290578	2016-12-27 04:03:25 +00:00
Craig Topper	6ad5bcc8ac	[AVX-512] Replace masked 512-bit pmuldq and pmuludq builtins with the newly added unmasked versions and selects. llvm-svn: 290575	2016-12-27 03:46:16 +00:00
Chandler Carruth	88c4ffb4e0	[PM] The new pass manager requires a registered target for these, and given that they hard code specific triples that seems reasonable so add the REQUIRES. llvm-svn: 290560	2016-12-27 00:31:34 +00:00
Chandler Carruth	6d1b83ef87	[PH] Teach the new PM code path to support -disable-llvm-passes. This is kind of funny because I specifically did work to make this easy and then it didn't actually get implemented. I've also ported a set of tests that rely on this functionality to run with the new PM as well as the old PM so that we don't mess this up in the future. llvm-svn: 290558	2016-12-27 00:13:09 +00:00
Amjad Aboud	e2aab8c30c	[DebugInfo] Added support for Checksum debug info feature. Differential Revision: https://reviews.llvm.org/D27641 llvm-svn: 290515	2016-12-25 10:12:27 +00:00
Chandler Carruth	b322d1e6f0	[PM] Fix up from r290449 to start requiring the x86 target to be available. It doesn't seem terribly important to test this with a specific target triple but without that target available. llvm-svn: 290451	2016-12-23 21:19:16 +00:00
Chandler Carruth	50f9e893f2	[PM] Introduce options to enable the (still experimental) new pass manager, and a code path to use it. The option is actually a top-level option but does contain 'experimental' in the name. This is the compromise suggested by Richard in discussions. We expect this option will be around long enough and have enough users towards the end that it merits not being relegated to CC1, but it still needs to be clear that this option will go away at some point. The backend code is a fresh codepath dedicated to handling the flow with the new pass manager. This was also Richard's suggested code structuring to essentially leave a clean path for development rather than carrying complexity or idiosyncracies of how we do things just to share code with the parts of this in common with the legacy pass manager. And it turns out, not much is really in common even though we use the legacy pass manager for codegen at this point. I've switched a couple of tests to run with the new pass manager, and they appear to work. There are still plenty of bugs that need squashing (just with basic experiments I've found two already!) but they aren't in this code, and the whole point is to expose the necessary hooks to start experimenting with the pass manager in more realistic scenarios. That said, I want to strongly caution anyone itching to play with this: it is still very shaky. Several large components have not yet been shaken down. For example I have bugs in both the always inliner and inliner that I have already spotted and will be fixing independently. Still, this is a fun milestone. =D One thing not in this patch (but that might be very reasonable to add) is some level of support for raw textual pass pipelines such as what Sean had a patch for some time ago. I'm mostly interested in the more traditional flow of getting the IR out of Clang and then running it through opt, but I can see other use cases so someone may want to add it. And of course, many features are not yet supported! - O1 is currently more like O2 - None of the sanitizers are wired up - ObjC ARC optimizer isn't wired up - ... So plenty of stuff still lef to do! Differential Revision: https://reviews.llvm.org/D28077 llvm-svn: 290450	2016-12-23 20:44:01 +00:00
Egor Churaev	28f00aab73	[OpenCL] Align fake address space map with the SPIR target maps. Summary: We compile user opencl kernel code with spir triple. But built-ins are written in OpenCL and we compile it with triple x86_64 to be able to use x86 intrinsics. And we need address spaces to match in both cases. So, we change fake address space map in OpenCL for matching with spir. On CPU address spaces are not really important but we'd like to preserve address space information in order to perform optimizations relying on this info like enhanced alias analysis. Reviewers: pekka.jaaskelainen, Anastasia Subscribers: pekka.jaaskelainen, yaxunl, bader, cfe-commits Differential Revision: https://reviews.llvm.org/D28048 llvm-svn: 290436	2016-12-23 16:11:25 +00:00
Chandler Carruth	fcd33149b4	Cleanup the handling of noinline function attributes, -fno-inline, -fno-inline-functions, -O0, and optnone. These were really, really tangled together: - We used the noinline LLVM attribute for -fno-inline - But not for -fno-inline-functions (breaking LTO) - But we did use it for -finline-hint-functions (yay, LTO is happy!) - But we didn't for -O0 (LTO is sad yet again...) - We had weird structuring of CodeGenOpts with both an inlining enumeration and a boolean. They interacted in weird ways and needlessly. - A lot of set smashing went on with setting these, and then got worse when we considered optnone and other inlining-effecting attributes. - A bunch of inline affecting attributes were managed in a completely different place from -fno-inline. - Even with -fno-inline we failed to put the LLVM noinline attribute onto many generated function definitions because they didn't show up as AST-level functions. - If you passed -O0 but -finline-functions we would run the normal inliner pass in LLVM despite it being in the O0 pipeline, which really doesn't make much sense. - Lastly, we used things like '-fno-inline' to manipulate the pass pipeline which forced the pass pipeline to be much more parameterizable than it really needs to be. Instead we can just use the optimization level to select a pipeline and control the rest via attributes. Sadly, this causes a bunch of churn in tests because we don't run the optimizer in the tests and check the contents of attribute sets. It would be awesome if attribute sets were a bit more FileCheck friendly, but oh well. I think this is a significant improvement and should remove the semantic need to change what inliner pass we run in order to comply with the requested inlining semantics by relying completely on attributes. It also cleans up tho optnone and related handling a bit. One unfortunate aspect of this is that for generating alwaysinline routines like those in OpenMP we end up removing noinline and then adding alwaysinline. I tried a bunch of other approaches, but because we recompute function attributes from scratch and don't have a declaration here I couldn't find anything substantially cleaner than this. Differential Revision: https://reviews.llvm.org/D28053 llvm-svn: 290398	2016-12-23 01:24:49 +00:00
Chandler Carruth	93786da2cb	Make '-disable-llvm-optzns' an alias for '-disable-llvm-passes'. Much to my surprise, '-disable-llvm-optzns' which I thought was the magical flag I wanted to get at the raw LLVM IR coming out of Clang deosn't do that. It still runs some passes over the IR. I don't want that, I really want the raw IR coming out of Clang and I strongly suspect everyone else using it is in the same camp. There is actually a flag that does what I want that I didn't know about called '-disable-llvm-passes'. I suspect many others don't know about it either. It both does what I want and is much simpler. This removes the confusing version and makes that spelling of the flag an alias for '-disable-llvm-passes'. I've also moved everything in Clang to use the 'passes' spelling as it seems both more accurate (all LLVM passes are disabled, not just optimizations) and much easier to remember and spell correctly. This is part of simplifying how Clang drives LLVM to make it cleaner to wire up to the new pass manager. Differential Revision: https://reviews.llvm.org/D28047 llvm-svn: 290392	2016-12-23 00:23:01 +00:00
George Burgess IV	e37633713d	Add the alloc_size attribute to clang, attempt 2. This is a recommit of r290149, which was reverted in r290169 due to msan failures. msan was failing because we were calling `isMostDerivedAnUnsizedArray` on an invalid designator, which caused us to read uninitialized memory. To fix this, the logic of the caller of said function was simplified, and we now have a `!Invalid` assert in `isMostDerivedAnUnsizedArray`, so we can catch this particular bug more easily in the future. Fingers crossed that this patch sticks this time. :) Original commit message: This patch does three things: - Gives us the alloc_size attribute in clang, which lets us infer the number of bytes handed back to us by malloc/realloc/calloc/any user functions that act in a similar manner. - Teaches our constexpr evaluator that evaluating some `const` variables is OK sometimes. This is why we have a change in test/SemaCXX/constant-expression-cxx11.cpp and other seemingly unrelated tests. Richard Smith okay'ed this idea some time ago in person. - Uniques some Blocks in CodeGen, which was reviewed separately at D26410. Lack of uniquing only really shows up as a problem when combined with our new eagerness in the face of const. llvm-svn: 290297	2016-12-22 02:50:20 +00:00
Chandler Carruth	d7738fe6ad	Revert r290149: Add the alloc_size attribute to clang. This commit fails MSan when running test/CodeGen/object-size.c in a confusing way. After some discussion with George, it isn't really clear what is going on here. We can make the MSan failure go away by testing for the invalid bit, but why things are invalid isn't clear. And yet, other code in the surrounding area is doing precisely this and testing for invalid. George is going to take a closer look at this to better understand the nature of the failure and recommit it, for now backing it out to clean up MSan builds. llvm-svn: 290169	2016-12-20 08:28:19 +00:00
Adrian Prantl	5f4740d3e5	Update for LLVM global variable debug info API change. This reapplies r289921. llvm-svn: 290155	2016-12-20 02:10:02 +00:00
George Burgess IV	a747027bc6	Add the alloc_size attribute to clang. This patch does three things: - Gives us the alloc_size attribute in clang, which lets us infer the number of bytes handed back to us by malloc/realloc/calloc/any user functions that act in a similar manner. - Teaches our constexpr evaluator that evaluating some `const` variables is OK sometimes. This is why we have a change in test/SemaCXX/constant-expression-cxx11.cpp and other seemingly unrelated tests. Richard Smith okay'ed this idea some time ago in person. - Uniques some Blocks in CodeGen, which was reviewed separately at D26410. Lack of uniquing only really shows up as a problem when combined with our new eagerness in the face of const. Differential Revision: https://reviews.llvm.org/D14274 llvm-svn: 290149	2016-12-20 01:05:42 +00:00
Peter Collingbourne	df86d1a432	Add explicit triple to test to fix arm bots. llvm-svn: 290008	2016-12-16 23:43:51 +00:00
Peter Collingbourne	b701363188	IRGen: Fix assertion failure when creating debug info for an integer constant wider than 64 bits. llvm-svn: 289996	2016-12-16 22:10:52 +00:00
Adrian Prantl	e34d9bc8af	Revert "Update for LLVM global variable debug info API change." This reverts commit r289921. llvm-svn: 289984	2016-12-16 19:39:18 +00:00
Adrian Prantl	db4c86f953	Update for LLVM global variable debug info API change. llvm-svn: 289921	2016-12-16 04:26:15 +00:00
Adrian Prantl	ed4eb86531	Revert "Update for LLVM global variable debug info API change." This reverts commit 289901 while investigating bot breakage. llvm-svn: 289908	2016-12-16 01:01:40 +00:00
Adrian Prantl	35bbcefb4b	Update for LLVM global variable debug info API change. llvm-svn: 289901	2016-12-16 00:35:42 +00:00
Mehdi Amini	9f10f34a6b	Fix printf specifier handling: invalid specifier should not be marked as "consuming data arguments" Reviewers: rsmith, bruno, dexonsmith Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D27796 llvm-svn: 289850	2016-12-15 18:54:00 +00:00
Alexey Bataev	3da2619b6f	Revert "[TESTS] Initial commit of tests, by Andrew Tischenko" This reverts commit 5898c713bee5e96aae87c73e11f3f4a7d19c74ed. llvm-svn: 289812	2016-12-15 12:24:20 +00:00
Alexey Bataev	70f090d568	[TESTS] Initial commit of tests, by Andrew Tischenko llvm-svn: 289809	2016-12-15 12:06:27 +00:00
Mehdi Amini	ba80a837ab	Revert "Fix printf specifier handling: invalid specifier should not be marked as "consuming data arguments"" This reverts commit r289762, wasn't ready to be pushed, it broke the printf tests. llvm-svn: 289763	2016-12-15 04:58:51 +00:00
Mehdi Amini	0dcbcb7eb8	Fix printf specifier handling: invalid specifier should not be marked as "consuming data arguments" llvm-svn: 289762	2016-12-15 04:51:22 +00:00
Mehdi Amini	ab11d83048	Fix os_log formating with arbitrary precision and field width llvm-svn: 289761	2016-12-15 04:02:31 +00:00
David Gross	ff06759ffb	[DebugInfo] Restore test case for long double constants. Summary: D27549 (partial fix for PR26619) emits a constant value in the debug metadata for a floating-point static const that does not exceed 64 bits in size. Whether or not a long double exceeds 64 bits in size depends on the target. Modify the test case so that it expects a constant value for long double if and only if the long double is no larger than 64 bits. Reviewers: cfe-commits, probinson Differential Revision: https://reviews.llvm.org/D27597 llvm-svn: 289686	2016-12-14 18:52:33 +00:00
Reid Kleckner	34a0f3dc2f	Improve our handling of tag decls in function prototypes r289225 broke AST invariants by reparenting enumerators into function decl contexts. This improves things by only reparenting TagDecls while also attempting to preserve the lexical declcontext chain. The interesting example here is: int f(struct S { enum E { a = 1 } b; } c); The semantic contexts of E and S should be f, and the lexical context of S should be f and the lexical context of E should be S. We didn't do that with r289225, but now we should. This change should also improve our behavior on this example: void f() { extern void ext(struct S { } o); // S injected here } Before r289225 we would only remove 'S' from the surrounding tag injection context if it was the TU, but now we properly reparent S from f to ext. Fixes PR31366 llvm-svn: 289678	2016-12-14 17:44:11 +00:00
Peter Collingbourne	1a0720e8c4	LTO: Add support for multi-module bitcode files. Differential Revision: https://reviews.llvm.org/D27313 llvm-svn: 289621	2016-12-14 01:17:59 +00:00
Evandro Menezes	ba17775c84	Add support for Samsung Exynos M3 (NFC) llvm-svn: 289614	2016-12-13 23:31:57 +00:00
Craig Topper	678b07fe3c	[AVX-512] Remove masking from 512-bit vpermil builtins. The backend now has versions without masking so wrap it with select. This will allow the backend to constant fold these to generic shuffle vectors like 128-bit and 256-bit without having to working about handling masking. llvm-svn: 289351	2016-12-11 01:26:52 +00:00
Craig Topper	cdd3603c04	[AVX-512] Remove masking from 512-bit pshufb builtin. The backend now has a version without masking so wrap it with select. This will allow the backend to constant fold these to generic shuffle vectors like 128-bit and 256-bit without having to working about handling masking. llvm-svn: 289345	2016-12-10 23:09:52 +00:00
Craig Topper	5391c98341	[AVX-512] Remove 128/256-bit masked vpermilvar builtins and replace with select and the avx unmasked builtins. llvm-svn: 289338	2016-12-10 20:27:39 +00:00
David Gross	bcc6cea748	[DebugInfo] Relax test case for long double constants. Summary: D27549 (partial fix for PR26619) emits a constant value in the debug metadata for a floating-point static const that does not exceed 64 bits in size. The regression test accompanying that fix assumes that a long double exceeds 64 bits in size and hence does not get a constant value in the debug metadata. However, for some targets -- such as "--target=hexagon-unknown-elf" -- a long double does not exceed 64 bits in size, and hence the test fails. As a temporary fix, modify the regression test to no longer inspect the debug metadata for a long double. Reviewers: cfe-commits, probinson Differential Revision: https://reviews.llvm.org/D27589 llvm-svn: 289103	2016-12-08 21:15:17 +00:00
David Gross	1118d591dc	[DebugInfo] Add support for __fp16, float, and double constants. Summary: Partial fix for PR26619. Prior to this change, a DIGlobalVariable corresponding to a static const was marked with an expression corresponding to its constant value only if it is of integral type. With this change, we now do the same if it is of __fp16, float, or double type (that is, floating-point types that do not exceed 64 bits in size, and hence are supported easily by the existing LLVM machinery for creating constant expressions in debug info). Reviewers: llvm-commits Differential Revision: https://reviews.llvm.org/D27549 llvm-svn: 289094	2016-12-08 20:02:46 +00:00
Reid Kleckner	fec0f32ea9	Use ${:uid} to generate unique MS asm labels, not {:uid} llvm-svn: 288093	2016-11-29 00:39:37 +00:00
Reid Kleckner	08ebbcebb9	[MS] Mangle a unique ID into all MS inline asm labels This solves PR23715 in a way that is compatible with LTO. MSVC supports jumping to source-level labels and between inline asm blocks, but we don't. Also revert the old solution, r255201, which was to mark these calls as noduplicate. llvm-svn: 288059	2016-11-28 20:52:19 +00:00
Ehsan Amiri	85f5bfcf0d	[PPC] support for arithmetic builtins in the FE (commit again after fixing the buildbot failures) This adds various overloads of the following builtins to altivec.h: vec_neg vec_nabs vec_adde vec_addec vec_sube vec_subec vec_subc Note that for vec_sub builtins on 32 bit integers, the semantics is similar to what ISA describes for instructions like vsubecuq that work on quadwords: the first operand is added to the one's complement of the second operand. (As opposed to two's complement which I expected). llvm-svn: 287872	2016-11-24 12:40:04 +00:00
Ehsan Amiri	9cce1ee88c	[PPC] revert r287795 A test that passed locally is failing on one of the build bots. llvm-svn: 287796	2016-11-23 18:55:17 +00:00
Ehsan Amiri	9b91cfa0b0	[PPC] support for arithmetic builtins in the FE (commit again after fixing the buildbot failures) This adds various overloads of the following builtins to altivec.h: vec_neg vec_nabs vec_adde vec_addec vec_sube vec_subec vec_subc Note that for vec_sub builtins on 32 bit integers, the semantics is similar to what ISA describes for instructions like vsubecuq that work on quadwords: the first operand is added to the one's complement of the second operand. (As opposed to two's complement which I expected). llvm-svn: 287795	2016-11-23 18:36:29 +00:00
Ehsan Amiri	ac10595b0d	[PPC] Reverting r287772 Due to buildbot failure, I revert. Will recommit after investigation. llvm-svn: 287775	2016-11-23 16:56:03 +00:00
Ehsan Amiri	5ea1054dab	[PPC] support for arithmetic builtins in the FE This adds various overloads of the following builtins to altivec.h: vec_neg vec_nabs vec_adde vec_addec vec_sube vec_subec vec_subc Note that for vec_sub builtins on 32 bit integers, the semantics is similar to what ISA describes for instructions like vsubecuq that work on quadwords: the first operand is added to the one's complement of the second operand. (As opposed to two's complement which I expected). llvm-svn: 287772	2016-11-23 16:32:05 +00:00
Simon Pilgrim	b243bbc87d	[X86][AVX512VL] Add missing _mm256_maskz_alignr_epi64 shufflevector check Missed in rL287733 llvm-svn: 287755	2016-11-23 11:38:52 +00:00
Craig Topper	6aefe00ccf	[X86] Replace valignd/q builtins with appropriate __builtin_shufflevector. llvm-svn: 287733	2016-11-23 01:47:12 +00:00
Craig Topper	37bf5c6a3f	[AVX-512] Replace masked 16-bit element variable shift builtins with new unmasked versions and selects. llvm-svn: 287313	2016-11-18 05:04:51 +00:00
Pekka Jaaskelainen	6aa07ee410	target-data test update for TCE and TCELE llvm-svn: 287115	2016-11-16 16:21:59 +00:00
Simon Pilgrim	698528d83b	[X86][AVX512] Replace lossless i32/u32 to f64 conversion intrinsics with generic IR Both the (V)CVTDQ2PD (i32 to f64) and (V)CVTUDQ2PD (u32 to f64) conversion instructions are lossless and can be safely represented as generic __builtin_convertvector calls instead of x86 intrinsics without affecting final codegen. This patch removes the clang builtins and their use in the headers - a future patch will deal with removing the llvm intrinsics. This is an extension patch to D20528 which dealt with the equivalent sse/avx cases. Differential Revision: https://reviews.llvm.org/D26686 llvm-svn: 287088	2016-11-16 09:27:40 +00:00
Mehdi Amini	dc9bf8fab6	Improve handling of __FUNCTION__ and other predefined expression for Objective-C Blocks Instead of always displaying the mangled name, try to do better and get something closer to regular functions. Recommit r287039 (that was reverted in r287039) with a tweak to be more generic, and test fixes! Differential Revision: https://reviews.llvm.org/D26522 llvm-svn: 287085	2016-11-16 07:07:28 +00:00
Mehdi Amini	f5f37ee546	Revert "Improve handling of __FUNCTION__ and other predefined expression for Objective-C Blocks" This reverts commit r287039, tests are broken. llvm-svn: 287043	2016-11-15 22:19:50 +00:00
Mehdi Amini	26168ad5c5	Improve handling of __FUNCTION__ and other predefined expression for Objective-C Blocks Instead of always displaying the mangled name, try to do better and get something closer to regular functions. Differential Revision: https://reviews.llvm.org/D26522 llvm-svn: 287039	2016-11-15 21:47:11 +00:00
Zaara Syeda	c1d2952388	vector load store with length (left justified) clang portion llvm-svn: 286994	2016-11-15 18:04:13 +00:00
Tony Jiang	6a49aad177	[PowerPC] Implement BE VSX load/store builtins - clang portion. This patch implements all the overloads for vec_xl_be and vec_xst_be. On BE, they behaves exactly the same with vec_xl and vec_xst, therefore they are simply implemented by defining a matching macro. On LE, they are implemented by defining new builtins and intrinsics. For int/float/long long/double, it is just a load (lxvw4x/lxvd2x) or store(stxvw4x/stxvd2x). For char/char/short, we also need some extra shuffling before or after call the builtins to get the desired BE order. For int128, simply call vec_xl or vec_xst. llvm-svn: 286971	2016-11-15 14:30:56 +00:00
Sean Fertile	a9548937d6	[PPC] altivec.h functions for converting half precision to single precision. Adds 2 vector functions for converting from a vector of unsigned short to a vector of float. One converts the low 4 halfwords and one converts the high 4 halfwords. Differential Revision: https://reviews.llvm.org/D26534 llvm-svn: 286863	2016-11-14 18:47:15 +00:00
Sean Fertile	193430fe51	[PPC] add extract sig/exp test data class for vec float and vec double. Add vector extract exponent/significand functions to altivec.h, as well as functions (and related constants) to test the data class of vector float and vector double. Differential Revision: https://reviews.llvm.org/D26271 llvm-svn: 286830	2016-11-14 14:43:27 +00:00
Craig Topper	5e0709d60b	[AVX-512] Replace masked dword and qword variable shift builtins with unmasked builtins and a select. This is part of a set of changes to allow InstCombine in the backend to optimize variable shifts without having to know about masking. llvm-svn: 286757	2016-11-13 07:26:34 +00:00
Craig Topper	2c8f49e67b	[AVX-512] Use scalar vfmsub/vfnmsub mask3 intrinsics instead of inverting the mask argument of a vfmadd intrinsic. Summary: Inverting the mask argument does not reflect the intended semantics of the intrinsic. Reviewers: igorb, delena Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D26019 llvm-svn: 286733	2016-11-12 23:24:34 +00:00
Craig Topper	1a44193afd	[AVX-512] Convert the rest of the masked shift by immediate and by single element builtins over to the newly added unmasked builtins and a select. This should also fix PR30691 since the new builtins are handled like the legacy builtins in the backend. llvm-svn: 286714	2016-11-12 07:16:59 +00:00
Anna Zaks	bcd35a8ec1	[tsan][clang] Introduce a function attribute to disable TSan checking at run time This introduces a function annotation that disables TSan checking for the function at run time. The benefit over attribute((no_sanitize("thread"))) is that the accesses within the callees will also be suppressed. The motivation for this attribute is a guarantee given by the objective C language that the calls to the reference count decrement and object deallocation will be synchronized. To model this properly, we would need to intercept all ref count decrement calls (which are very common in ObjC due to use of ARC) and also every single message send. Instead, we propose to just ignore all accesses made from within dealloc at run time. The main downside is that this still does not introduce any synchronization, which means we might still report false positives if the code that relies on this synchronization is not executed from within dealloc. However, we have not seen this in practice so far and think these cases will be very rare. (This problem is similar in nature to https://reviews.llvm.org/D21609; unfortunately, the same solution does not apply here.) Differential Revision: https://reviews.llvm.org/D25857 llvm-svn: 286672	2016-11-11 23:22:44 +00:00
Nemanja Ivanovic	4de0011b5c	[PowerPC] Implement remaining permute builtins in altivec.h - Clang portion This patch corresponds to review: https://reviews.llvm.org/D26479 It adds the remaining vector permute/rotate builtins to altivec.h. llvm-svn: 286650	2016-11-11 22:34:44 +00:00
Nemanja Ivanovic	4079fc8188	[PowerPC] Add vector conversion builtins to altivec.h - clang portion This patch corresponds to review: https://reviews.llvm.org/D26308 It adds a number of vector type conversion builtins to altivec.h. llvm-svn: 286627	2016-11-11 19:56:17 +00:00
Tony Jiang	7723f97d6a	[PowerPC] Implement plain VSX load/store builtins. Implement all the different 24 overloads for vec_xl and vec_xst. llvm-svn: 286455	2016-11-10 14:39:56 +00:00
Douglas Katzman	0f5cc9b6df	[Sparc] Unbreak test llvm-svn: 286380	2016-11-09 17:02:07 +00:00
Douglas Katzman	13f4a91a1f	[Sparc] LLONG is not lock-free atomic on v8 Differential Revision: https://reviews.llvm.org/D26286 llvm-svn: 286376	2016-11-09 15:43:51 +00:00
Adrian Prantl	338ef7a82c	Emit debug info for global constants whose address is taken exactly once. Add a check to the DeclCache before emitting debug info for a GlobalVariable a second time and just attach the previsously created one to it. <rdar://problem/26721101> llvm-svn: 286322	2016-11-09 00:42:03 +00:00
Ayman Musa	e60a41ca28	[X86][AVX512][Clang] Add support for mask_{move\|store\|load}_s{s/d} and int2mask/mask2int intrinsics. Differential Revision: https://reviews.llvm.org/D26021 llvm-svn: 286229	2016-11-08 12:00:30 +00:00
Tony Jiang	c6ddd7221c	[PowerPC] Implement remaining vector comparison builtins. vector bool char vec_cmpeq (vector bool char, vector bool char); vector bool int vec_cmpeq (vector bool int, vector bool int); vector bool long long vec_cmpeq (vector bool long long, vector bool long lon vector bool short vec_cmpeq (vector bool short, vector bool short); llvm-svn: 286205	2016-11-08 04:15:45 +00:00
Erich Keane	757d317c24	regcall: Implement regcall Calling Conv in clang This patch implements the register call calling convention, which ensures as many values as possible are passed in registers. CodeGen changes were committed in https://reviews.llvm.org/rL284108. Differential Revision: https://reviews.llvm.org/D25204 llvm-svn: 285849	2016-11-02 18:29:35 +00:00
Nemanja Ivanovic	05ce4ca0dd	[PowerPC] Implement vector shift builtins - clang portion This patch corresponds to review https://reviews.llvm.org/D26092. Committing on behalf of Tony Jiang. llvm-svn: 285694	2016-11-01 14:46:20 +00:00
Michael Zuckerman	62f516f590	[x86][inline-asm][clang] accept 'v' constraint Commit on behalf of: Coby Tayree 1.'v' constraint for (x86) non-avx arch imitates the already implemented 'x' constraint, i.e. allows XMM{0-15} & YMM{0-15} depending on the apparent arch & mode (32/64). 2.for the avx512 arch it allows [X,Y,Z]MM{0-31} (mode dependent) This patch applies the needed changes to clang LLVM patch: https://reviews.llvm.org/D25005 Differential Revision: https://reviews.llvm.org/D25005 llvm-svn: 285688	2016-11-01 13:16:44 +00:00
Nemanja Ivanovic	251f6dd93d	[PPC] Add vec_absd functions to altivec.h This patch corresponds to review https://reviews.llvm.org/D26073. Committing on behalf of Sean Fertile. llvm-svn: 285679	2016-11-01 08:39:56 +00:00
Craig Topper	08bf53ffda	[AVX-512] Remove masked vector insert builtins and replace with native shufflevectors and selects. Unfortunately, the backend currently doesn't fold masks into the instructions correctly when they come from these shufflevectors. I'll work on that in a future commit. llvm-svn: 285667	2016-11-01 05:47:56 +00:00
Evgeniy Stepanov	f75430963d	[cfi] Fix missing !type annotation. CFI (only in the cross-dso mode) fails to set !type annotations when a function is used before it is defined. llvm-svn: 285650	2016-10-31 22:28:10 +00:00
Victor Leschuk	0df19037c4	DebugInfo: support for DW_TAG_atomic_type Mark C11 _Atomic variables with DW_TAG_atomic_type tag. Differential Revision: https://reviews.llvm.org/D26145 llvm-svn: 285625	2016-10-31 19:09:47 +00:00
Nemanja Ivanovic	e5b62c83be	NFC - Reorder test case names in a PPC test case A few recent commits have messed up the order of some tests in a PPC test case. This just reorders them in a sensible way. llvm-svn: 285623	2016-10-31 19:02:54 +00:00
Michael Zuckerman	b3147e80a6	Fixing problem with CodeGen/avx512-kconstraints-att_inline_asm.c llvm-svn: 285617	2016-10-31 18:40:17 +00:00
Michael Zuckerman	849a6a5e5a	[x86][inline-asm][AVX512][clang][PART-1] Introducing "k" and "Yk" constraints for extended inline assembly, enabling use of AVX512 masked vectorized instructions. Commit on behalf of mharoush Extending inline assembly support, compatible with GCC as folowing: "k" constraint hints the compiler to select any of AVX512 k0-k7 registers. "Yk" constraint is a subset of "k" excluding k0 which is not allowd to be used as a mask. Reviewer: 1. rnk Differential Revision: https://reviews.llvm.org/D25063 llvm-svn: 285604	2016-10-31 17:23:52 +00:00
Michael Zuckerman	2460bada56	[x86][inline-asm] Add support for curly brackets escape using "%" in extended inline asm. Commit on behalf of mharoush After LGTM and check all: This patch is a compatibility fix for clang, matching GCC support for charter escape when using extended in-line assembly (i.e, "%{" ,"%}" --> "{" ,"}" ). It is meant to enable support for advanced features such as AVX512 conditional\masked vector instructions/broadcast assembly syntax. Reviewer: 1. rnk Differential Revision: https://reviews.llvm.org/D25012 llvm-svn: 285585	2016-10-31 15:27:54 +00:00
Ulrich Weigand	30354ebb00	[SystemZ] Add -march=archX aliases For compatibility with other compilers on the platform, allow specifying levels of the z/Architecture instead of model names with -march. In particular, the following aliases are now supported: -march=arch8 equals -march=z10 -march=arch9 equals -march=z196 -march=arch10 equals -march=zEC12 -march=arch11 equals -march=z13 This parallels the equivalent (and prerequisite) LLVM change in r285577. llvm-svn: 285578	2016-10-31 14:38:05 +00:00
Michael Zuckerman	15604b996f	second attempt at r285565. llvm-svn: 285573	2016-10-31 14:16:57 +00:00
Michael Zuckerman	7beec2e8bf	revert r285563 fail in test CodeGen/avx512-inline-asm-kregisters-basics.c llvm-svn: 285565	2016-10-31 12:49:36 +00:00
Michael Zuckerman	0d26eea609	[x86][inline-asm] Introducing (AVX512) k0-k7 registers for inline-asm usage Commit on behalf of mharoush After LGTM and check all: This patch enables usage of k registers in inline assembly syntax. Adding triple Reviewer: 1. rnk 2. delena Differential Revision: https://reviews.llvm.org/D25011 llvm-svn: 285563	2016-10-31 12:05:41 +00:00
Michael Zuckerman	56c85d2119	Revert reviosion 285555 llvm-svn: 285556	2016-10-31 10:12:36 +00:00
Michael Zuckerman	4fe34fa2ec	[x86][inline-asm] Introducing (AVX512) k0-k7 registers for inline-asm usage Commit on behalf of mharoush After LGTM and check all: This patch enables usage of k registers in inline assembly syntax. Reviewer: 1. rnk 2. delena Differential Revision: https://reviews.llvm.org/D25011 llvm-svn: 285555	2016-10-31 09:37:59 +00:00
Craig Topper	cc012b3a37	[AVX-512] Add a regular expression to a test that was missed in r285540. llvm-svn: 285547	2016-10-31 06:24:00 +00:00
Craig Topper	350729627a	[AVX-512] Use selectd instead of selectps for _mm256_mask_extracti32x4_epi32. llvm-svn: 285545	2016-10-31 05:49:11 +00:00
David Majnemer	5116993f8e	Add support for __builtin_alloca_with_align __builtin_alloca always uses __BIGGEST_ALIGNMENT__ for the alignment of the allocation. __builtin_alloca_with_align allows the programmer to specify the alignment of the allocation. This fixes PR30658. llvm-svn: 285544	2016-10-31 05:37:48 +00:00
Craig Topper	93ffabd28d	[AVX-512] Remove masked vector extract builtins and replace with native shufflevectors and selects. Unfortunately, the backend currently doesn't fold masks into the instructions correctly when they come from these shufflevectors. I'll work on that in a future commit. llvm-svn: 285540	2016-10-31 04:30:56 +00:00
Craig Topper	66b2fd1209	[AVX-512] Remove many of the masked 128/256-bit shift builtins and replace them with unmasked builtins and selects. llvm-svn: 285539	2016-10-31 04:30:51 +00:00
Michael Zuckerman	d343697f1e	Fixing "type" issue for (epi32) and replaceing hardcoded inf with clang builtin inf "__builtin_inff()" for float ({max\|min}_{pd\|ps}) llvm-svn: 285519	2016-10-30 14:54:05 +00:00
Craig Topper	312ff9d19d	[AVX-512] Remove masked 128/256-bit builtins for vpmaddwd and vpmaddubsw. Replace with unmasked builtins and select. llvm-svn: 285516	2016-10-30 07:11:34 +00:00
Craig Topper	4caf76bee2	[AVX-512] Remove 128/256-bit masked pmulhrsw/pmulhuw/pmulhw builtins and use unmasked builtins and select instead. llvm-svn: 285505	2016-10-29 19:02:14 +00:00
Craig Topper	2eadf1b67e	[AVX-512] Remove masked 128/256-bit sqrt builtins and replace them with unmasked builtins and a select. llvm-svn: 285504	2016-10-29 19:02:10 +00:00
Craig Topper	09e94007be	[AVX-512] Remove masked 128/256-bit pmuludq/pmuldq builtins and replace them with unmasked builtins and a select. llvm-svn: 285503	2016-10-29 19:02:07 +00:00
Craig Topper	160ca8420d	[AVX-512] Remove masked 128/256-bit floating point max/min builtins. Use unmasked builtins with select instead. llvm-svn: 285502	2016-10-29 19:02:03 +00:00
Michael Zuckerman	25eb420233	[X86][AVX512][Clang][Intrinsics][reduce] Adding missing reduce (max\|min) intrinsics to Clang . After LGTM and Check-all Vector-reduction arithmetic accepts vectors as inputs and produces scalars as outputs.This class of vector operation forms the basis of many scientific computations. In vector-reduction arithmetic, the evaluation off is independent of the order of the input elements of V. Reviewer: 1. craig.topper 2. igorb Differential Revision: https://reviews.llvm.org/D25988 llvm-svn: 285493	2016-10-29 10:29:20 +00:00
Nemanja Ivanovic	931bc548e6	[PPC] add float and double overloads for vec_orc and vec_nand in altivec.h This patch corresponds to review https://reviews.llvm.org/D25950. Committing on behalf of Sean Fertile. llvm-svn: 285439	2016-10-28 20:04:53 +00:00
Nemanja Ivanovic	4f69f924df	Implement vector count leading/trailing bytes with zero lsb and vector parity builtins - clang portion This patch corresponds to review: https://reviews.llvm.org/D26002 Committing on behalf of Zaara Syeda. llvm-svn: 285436	2016-10-28 19:49:03 +00:00
Michael Zuckerman	22a03e435a	Fixing small problem with avx512-reduceIntrin.c test on some OS. llvm-svn: 285419	2016-10-28 17:25:26 +00:00
Michael Zuckerman	edd99eb07a	1. Fixing small types issue (PD\|PS) (reduce) . 2. Cosmetic changes llvm-svn: 285405	2016-10-28 15:16:03 +00:00
David Majnemer	1878da43ea	[CodeGen] Provide an appropriate alignment for dynamic allocas GCC documents __builtin_alloca as aligning the storage to at least __BIGGEST_ALIGNMENT__. MSVC documents essentially the same for the x64 ABI: https://msdn.microsoft.com/en-us/library/x9sx5da1.aspx The 32-bit ABI follows the same rule: it emits a call to _alloca_probe_16 Differential Revision: https://reviews.llvm.org/D24378 llvm-svn: 285316	2016-10-27 17:18:24 +00:00
Nemanja Ivanovic	09dd423a7d	[PPC] add vector byte reverse functions to altivec.h This patch corresponds to review https://reviews.llvm.org/D25915. Committing on behalf of Sean Fertile. llvm-svn: 285268	2016-10-27 06:23:57 +00:00
Nemanja Ivanovic	3de0a385c9	[PowerPC] Implement vector_insert_exp builtins - clang portion This patch corresponds to review https://reviews.llvm.org/D25956. Committing on behalf of Zaara Syeda. llvm-svn: 285229	2016-10-26 19:27:11 +00:00
Nemanja Ivanovic	85a28dcc5d	[PPC] Implement vector reverse elements builtins (vec_reve) This patch corresponds to review https://reviews.llvm.org/D25906. Committing on behalf of Tony Jiang. llvm-svn: 285218	2016-10-26 18:25:45 +00:00
Vitaly Buka	64c80b4e39	[CodeGen] Don't emit lifetime intrinsics for some local variables Summary: Current generation of lifetime intrinsics does not handle cases like: ``` { char x; l1: bar(&x, 1); } goto l1; ``` We will get code like this: ``` %x = alloca i8, align 1 call void @llvm.lifetime.start(i64 1, i8* nonnull %x) br label %l1 l1: %call = call i32 @bar(i8* nonnull %x, i32 1) call void @llvm.lifetime.end(i64 1, i8* nonnull %x) br label %l1 ``` So the second time bar was called for x which is marked as dead. Lifetime markers here are misleading so it's better to remove them at all. This type of bypasses are rare, e.g. code detects just 8 functions building clang (2329 targets). PR28267 Reviewers: eugenis Subscribers: beanz, mgorny, cfe-commits Differential Revision: https://reviews.llvm.org/D24693 llvm-svn: 285176	2016-10-26 05:42:30 +00:00
Michael Zuckerman	facb37cabf	[X86][AVX512][Clang][Intrinsics][reduce] Adding missing reduce (Operators: +,*,&&,\|\|) intrinsics to Clang Committed after LGTM and check-all Vector-reduction arithmetic accepts vectors as inputs and produces scalars as outputs. This class of vector operation forms the basis of many scientific computations. In vector-reduction arithmetic, the evaluation off is independent of the order of the input elements of V. Used bisection method. At each step, we partition the vector with previous step in half, and the operation is performed on its two halves. This takes log2(n) steps where n is the number of elements in the vector. Reviwer: 1. igorb 2. craig.topper Differential Revision: https://reviews.llvm.org/D25527 llvm-svn: 285054	2016-10-25 07:56:04 +00:00
Mehdi Amini	9825ab0433	Fix handling of %% format specifier in os_log builtins. Returning `false` was stopping the parsing of further arguments, which wasn't intended. llvm-svn: 285047	2016-10-25 00:48:48 +00:00
Mehdi Amini	ebff247d41	test/CodeGen/builtins.c: reinstate #ifdef __x86_64__ around __builtin_longjmp Unadvertently removed in r285019 llvm-svn: 285041	2016-10-24 23:38:24 +00:00
Mehdi Amini	58567d71d0	Fix test on non-X86 platforms This is a fixup for r285019, adding an `#ifdef __x86_64__` since the os_log builtin is platform specific. llvm-svn: 285027	2016-10-24 21:22:01 +00:00
Mehdi Amini	06d367c6c6	Add support for __builtin_os_log_format[_buffer_size] This reverts commit r285007 and reapply r284990, with a fix for the opencl test that I broke. Original commit message follows: These new builtins support a mechanism for logging OS events, using a printf-like format string to specify the layout of data in a buffer. The _buffer_size version of the builtin can be used to determine the size of the buffer to allocate to hold the data, and then __builtin_os_log_format can write data into that buffer. This implements format checking to report mismatches between the format string and the data arguments. Most of this code was written by Chris Willmore. Differential Revision: https://reviews.llvm.org/D25888 llvm-svn: 285019	2016-10-24 20:39:34 +00:00
Mehdi Amini	9c39fdceda	Revert "Add support for __builtin_os_log_format[_buffer_size]" This reverts commit r284990, two opencl test are broken llvm-svn: 285007	2016-10-24 19:41:36 +00:00
Mandeep Singh Grang	be2ad8f36b	[clang] Remove redundant --check-prefix=CHECK from tests Reviewers: mkuper, rengolin, hans Subscribers: cfe-commits Tags: #clang-c Differential Revision: https://reviews.llvm.org/D25893 llvm-svn: 285001	2016-10-24 18:53:43 +00:00
Mehdi Amini	29034362ae	Add support for __builtin_os_log_format[_buffer_size] These new builtins support a mechanism for logging OS events, using a printf-like format string to specify the layout of data in a buffer. The _buffer_size version of the builtin can be used to determine the size of the buffer to allocate to hold the data, and then __builtin_os_log_format can write data into that buffer. This implements format checking to report mismatches between the format string and the data arguments. Most of this code was written by Chris Willmore. Differential Revision: https://reviews.llvm.org/D25888 llvm-svn: 284990	2016-10-24 16:56:23 +00:00
Michael Zuckerman	33bd5b235b	revert r284963 because new test file is failing in some OS. test/CodeGen/avx512-reduceIntrin.c llvm-svn: 284967	2016-10-24 11:30:23 +00:00
Michael Zuckerman	98cb041891	[X86][AVX512][Clang][Intrinsics][reduce] Adding missing reduce (Operators: +,*,&&,\|\|) intrinsics to Clang Committed after LGTM and check-all Vector-reduction arithmetic accepts vectors as inputs and produces scalars as outputs. This class of vector operation forms the basis of many scientific computations. In vector-reduction arithmetic, the evaluation off is independent of the order of the input elements of V. Used bisection method. At each step, we partition the vector with previous step in half, and the operation is performed on its two halves. This takes log2(n) steps where n is the number of elements in the vector. Differential Revision: https://reviews.llvm.org/D25527 llvm-svn: 284963	2016-10-24 10:53:20 +00:00
Craig Topper	531ce28311	[AVX-512] Replace 64-bit element and 512-bit vector pmin/pmax builtins with native IR like we do for 128/256-bit, but with the addition of masking. llvm-svn: 284956	2016-10-24 04:04:24 +00:00
Craig Topper	eee7c0520c	[AVX-512] Replace masked 128/256-bit byte, word, and dword min/max builtins with selects and the older unmasked builtins. llvm-svn: 284954	2016-10-23 23:57:30 +00:00
Craig Topper	0c5da26572	[AVX-512] Replace 512-bit pmovzx/sx builtins with native IR. llvm-svn: 284936	2016-10-23 07:35:47 +00:00

... 2 3 4 5 6 ...

4255 Commits