llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	88d73626cd	[ARM] Update long-test after r304201. llvm-svn: 304208	2017-05-30 12:44:48 +00:00
Benjamin Kramer	c796245431	[PPC] Make altivec conversion function macros. The second argument must be a constant, otherwise instruction selection will fail. always_inline is not enough for isel to always fold everything away at -O0. Sadly the overloading turned this into a big macro mess. Fixes PR33212. llvm-svn: 304205	2017-05-30 11:37:29 +00:00
Javed Absar	3d92d7ab36	[ARM] Fix Neon vector type alignment to 64-bit The maximum alignment for ARM NEON data types should be 64-bits as specified in ARM procedure call standard document Sec. A.2 Notes. This patch fixes it from its current larger natural default values, except for Android (so as not to break existing ABI). Reviewed by: Stephen Hines, Renato Golin. Differential Revision: https://reviews.llvm.org/D33205 llvm-svn: 304201	2017-05-30 10:12:15 +00:00
Benjamin Kramer	e524ddeef3	Unbreak long test after r304127. llvm-svn: 304167	2017-05-29 18:11:11 +00:00
Mehdi Amini	6aa9e9b41a	IRGen: Add optnone attribute on function during O0 Among other things, this will help LTO correctly handle/honor files compiled with O0, which helps when debugging failures. It also seems in line with how we handle other options, like how -fno-inline adds the appropriate attribute as well. Differential Revision: https://reviews.llvm.org/D28404 llvm-svn: 304127	2017-05-29 05:38:20 +00:00
Arnold Schwaighofer	634e320376	CodeGen: Define Swift's legal vector types for AArch64, ARM rdar://32401301 llvm-svn: 304017	2017-05-26 18:11:54 +00:00
Oren Ben Simhon	140c1fb9ec	[X86] Adding avx512_vpopcntdq feature set and its intrinsics AVX512_VPOPCNTDQ is a new feature set that was published by Intel. The patch represents the Clang side of the addition of six intrinsics for two new machine instructions (vpopcntd and vpopcntq). It also includes the addition of the new feature set. Differential Revision: https://reviews.llvm.org/D33170 llvm-svn: 303857	2017-05-25 13:44:11 +00:00
Krzysztof Parzyszek	5960a57ef7	[CodeGen] Pessimize aliasing for member unions (and may-alias) objects Use the TBAA info of the omnipotent char for these objects. Differential Revision: https://reviews.llvm.org/D33328 llvm-svn: 303851	2017-05-25 12:55:47 +00:00
Tony Jiang	f70a913e13	Fix one test case failure in commit 303766. It is clean when I build bootstrap and run make checkall on my machine. I guess it could be that I only build bootstrap with asserts, while the buildbots may build without asserts, which could cause the difference. llvm-svn: 303786	2017-05-24 18:12:11 +00:00
Tony Jiang	9aa2c0383d	[PowerPC] Implement vec_xxsldwi builtin. The vec_xxsldwi builtin is missing from altivec.h. This has been requested by developers working on libvpx for VP9 support for Google. The patch fixes PR: https://bugs.llvm.org/show_bug.cgi?id=32653 Differential Revision: https://reviews.llvm.org/D33236 llvm-svn: 303766	2017-05-24 15:54:13 +00:00
Tony Jiang	bbc48e9164	[PowerPC] Implement vec_xxpermdi builtin. The vec_xxpermdi builtin is missing from altivec.h. This has been requested by developers working on libvpx for VP9 support for Google. The patch fixes PR: https://bugs.llvm.org/show_bug.cgi?id=32653 Differential Revision: https://reviews.llvm.org/D33053 llvm-svn: 303760	2017-05-24 15:13:32 +00:00
Dean Michael Berris	170429e290	[XRay][clang] Allow imbuing arg1 logging attribute via -fxray-always-instrument= Summary: This change allows us to add arg1 logging support to functions through the special case list provided through -fxray-always-instrument=. This is useful for adding arg1 logging to functions that are either in headers that users don't have control over (i.e. cannot change the source) or would rather not do. It only takes effect when the pattern is matched through the "fun:" special case, as a category. As in: fun:*pattern=arg1 Reviewers: pelikan, rnk Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D33392 llvm-svn: 303719	2017-05-24 05:46:36 +00:00
Simon Dardis	657188ab00	[mips] Make checks in CodeGen/mips-varargs.c less fragile This test was failing on our fork of clang because it was not capturing [[ARG]] in the N32 case. Therefore it used the value from the last function which does not always have to be the same. All other cases were already capturing ARG so this appears to be an oversight. The test now uses -enable-var-scope to prevent such errors in the future. Reviewers: sdardis, atanasyan Patch by: Alexander Richardson Differential Revision: https://reviews.llvm.org/D32425 llvm-svn: 303619	2017-05-23 09:42:50 +00:00
Teresa Johnson	acf4b09fee	Adjust clang test for r303590 Forgot to commit this separately from the llvm change to use a new module flag type for pic and pie levels. Should fix the bot errors llvm-svn: 303593	2017-05-23 00:35:09 +00:00
Simon Atanasyan	2c87f5341d	[mips] Support `micromips` attribute This patch adds support for the `micromips` and `nomicromips` attributes for MIPS targets. Differential revision: https://reviews.llvm.org/D33363 llvm-svn: 303546	2017-05-22 12:47:43 +00:00
Matthias Braun	a451953224	CodeGenModule: Always output wchar_size, check LLVM assumptions. Re-commit r303463 now that LLVM is fixed and adjust some lit tests. llvm::TargetLibraryInfo needs to know the size of wchar_t to work on functions like `wcslen`. This patch changes clang to always emit the wchar_size module flag (it would only do so for ARM previously). This also adds an `assert()` to ensure the LLVM defaults based on the target triple are in sync with clang. Differential Revision: https://reviews.llvm.org/D32982 llvm-svn: 303478	2017-05-20 01:29:55 +00:00
Matthias Braun	421b63dd70	Revert "CodeGenModule: Always output wchar_size, check LLVM assumptions." Let's revert this for now (and with it the assert()) to get the bots back to green until I have LLVM synced up properly. This reverts commit r303463. llvm-svn: 303474	2017-05-20 00:38:27 +00:00
Matthias Braun	bf4a869dfb	CodeGenModule: Always output wchar_size, check LLVM assumptions. llvm::TargetLibraryInfo needs to know the size of wchar_t to work on functions like `wcslen`. This patch changes clang to always emit the wchar_size module flag (it would only do so for ARM previously). This also adds an `assert()` to ensure the LLVM defaults based on the target triple are in sync with clang. Differential Revision: https://reviews.llvm.org/D32982 llvm-svn: 303463	2017-05-19 22:37:15 +00:00
Yaxun Liu	6d96f16347	CodeGen: Cast alloca to expected address space Alloca always returns a pointer in alloca address space, which may be different from the type defined by the language. For example, in C++ the auto variables are in the default address space. Therefore cast alloca to the expected address space when necessary. Differential Revision: https://reviews.llvm.org/D32248 llvm-svn: 303370	2017-05-18 18:51:09 +00:00
Evgeniy Stepanov	5c3e07f78d	[asan] One more test for -fsanitize-address-globals-dead-stripping. llvm-svn: 303114	2017-05-15 20:43:48 +00:00
Teresa Johnson	517729fb20	Remove ignore-empty-index-file option Summary: Clang changes to remove this option and replace with a parameter always set in the context of a ThinLTO distributed backend. Depends on D33133. Reviewers: pcc Subscribers: mehdi_amini, eraman, cfe-commits Differential Revision: https://reviews.llvm.org/D33134 llvm-svn: 302940	2017-05-12 19:32:17 +00:00
James Y Knight	eb96e44aea	[SPARC] Support 'f' and 'e' inline asm constraints. Patch by Patrick Boettcher. Differential Revision: https://reviews.llvm.org/D29117 llvm-svn: 302913	2017-05-12 16:01:23 +00:00
Reid Kleckner	43bbeb4c9f	Issue diagnostics when returning FP values on x86_64 without SSE1/2 Avoid using report_fatal_error, because it will ask the user to file a bug. If the user attempts to disable SSE on x86_64 and then use floating point, that's a bug in their code, not a bug in the compiler. This is just a start. There are other ways to crash the backend in this configuration, but they should be updated to follow this pattern. Differential Revision: https://reviews.llvm.org/D27522 llvm-svn: 302835	2017-05-11 22:43:02 +00:00
Petar Jovanovic	6f4cdb8912	Reland: [mips] Impose a threshold for coercion of aggregates Modified MipsABIInfo::classifyArgumentType so that it now coerces aggregate structures only if the size of said aggregate is less than 16/64 bytes, depending on the ABI. Patch by Stefan Maksimovic. Differential Revision: https://reviews.llvm.org/D32900 with minor changes (use regexp instead of the hardcoded values) to the test. llvm-svn: 302670	2017-05-10 14:28:18 +00:00
Vedant Kumar	4b62b5cddd	[ubsan] Mark overflow checks with !nosanitize Sanitizer instrumentation generally needs to be marked with !nosanitize, but we're not doing this properly for ubsan's overflow checks. r213291 has more information about why this is needed. llvm-svn: 302598	2017-05-09 23:34:49 +00:00
Evgeniy Stepanov	d991cdd50b	[asan] A clang flag to enable ELF globals-gc. This feature is subtly broken when the linker is gold 2.26 or earlier. See the following bug for details: https://sourceware.org/bugzilla/show_bug.cgi?id=19002 Since the decision needs to be made at compilation time, we can not test the linker version. The flag is off by default on ELF targets, and on otherwise. llvm-svn: 302591	2017-05-09 21:57:43 +00:00
Petar Jovanovic	753267b750	Revert r302547 ([mips] Impose a threshold for coercion of aggregates) Reverting Modified MipsABIInfo::classifyArgumentType so that it now coerces aggregate structures only if the size of said aggregate is less than 16/64 bytes, depending on the ABI. as it broke clang-with-lto-ubuntu builder. llvm-svn: 302555	2017-05-09 17:20:06 +00:00
Petar Jovanovic	125c03070e	[mips] Impose a threshold for coercion of aggregates Modified MipsABIInfo::classifyArgumentType so that it now coerces aggregate structures only if the size of said aggregate is less than 16/64 bytes, depending on the ABI. Patch by Stefan Maksimovic. Differential Revision: https://reviews.llvm.org/D32900 llvm-svn: 302547	2017-05-09 16:24:03 +00:00
Dean Michael Berris	42af651358	[XRay] Add __xray_customevent(...) as a clang-supported builtin Summary: We define the `__xray_customevent` builtin that gets translated to IR calls to the correct intrinsic. The default implementation of this is a no-op function. The codegen side of this follows the following logic: - When `-fxray-instrument` is not provided in the driver, we elide all calls to `__xray_customevent`. - When `-fxray-instrument` is enabled and a function is marked as "never instrumented", we elide all calls to `__xray_customevent` in that function; if either marked as "always instrumented" or subject to threshold-based instrumentation, we emit a call to the `llvm.xray.customevent` intrinsic from LLVM for each `__xray_customevent` occurrence in the function. This change depends on D27503 (to land in LLVM first). Reviewers: echristo, rsmith Subscribers: mehdi_amini, pelikan, lrl, cfe-commits Differential Revision: https://reviews.llvm.org/D30018 llvm-svn: 302492	2017-05-09 00:45:40 +00:00
Simon Pilgrim	3511348dbb	[X86][LWP] Add clang support for LWP instructions. This patch adds support for the LightWeight Profiling (LWP) instructions which are available on all AMD Bulldozer class CPUs (bdver1 to bdver4). Differential Revision: https://reviews.llvm.org/D32770 llvm-svn: 302418	2017-05-08 12:09:45 +00:00
Tim Northover	23bcad226c	AArch64: fix weird edge case in ABI. It turns out there are some sort-of-but-not-quite empty structs that break all the rules. For example: struct SuperEmpty { int arr[0]; }; struct SortOfEmpty { struct SuperEmpty e; }; Both of these have sizeof == 0, even in C++ mode, for GCC compatibility. The first one also doesn't occupy a register when passed by value in GNU C++ mode, unlike everything else. On Darwin, we want to ignore the lot (and especially don't want to try to use an i0 as we were). llvm-svn: 302313	2017-05-05 22:36:06 +00:00
Reid Kleckner	6d2ea6ec80	[ms-inline-asm] Use the frontend size only for ambiguous instructions This avoids problems on code like this: char buf[16]; __asm { movups xmm0, [buf] mov [buf], eax } The frontend size in this case (1) is wrong, and the register makes the instruction matching unambiguous. There are also enough bytes available that we shouldn't complain to the user that they are potentially using an incorrectly sized instruction to access the variable. Supersedes D32636 and D26586 and fixes PR28266 llvm-svn: 302179	2017-05-04 18:19:52 +00:00
Peter Collingbourne	9667b91b13	Re-apply r302108, "IR: Use pointers instead of GUIDs to represent edges in the module summary. NFCI." with a fix for the clang backend. llvm-svn: 302176	2017-05-04 18:03:25 +00:00
Sam Parker	b9ea36f9c1	[ARM] ACLE Chapter 9 intrinsics Implemented the remaining integer data processing intrinsics from the ARM ACLE v2.1 spec, such as parallel arithmetic and DSP-style multiplications. Differential Revision: https://reviews.llvm.org/D32282 llvm-svn: 302131	2017-05-04 08:37:59 +00:00
Daniel Jasper	6e254b5f38	Fix tests after speculatable intrinsics patch These were relying on the attribute group numbering llvm-svn: 302009	2017-05-03 10:04:25 +00:00
Matt Arsenault	7c4c1cb2f5	Fix tests after speculatable intrinsics patch These were relying on the attribute group numbering llvm-svn: 301996	2017-05-03 03:04:40 +00:00
Vedant Kumar	d919115983	[ubsan] Skip overflow checks on safe arithmetic (fixes PR32874) Currently, ubsan emits overflow checks for arithmetic that is known to be safe at compile-time, e.g: 1 + 1 => CheckedAdd(1, 1) This leads to breakage when using the __builtin_prefetch intrinsic. LLVM expects the arguments to @llvm.prefetch to be constant integers, and when ubsan inserts unnecessary checks on the operands to the intrinsic, this contract is broken, leading to verifier failures (see PR32874). Instead of special-casing __builtin_prefetch for ubsan, this patch fixes the underlying problem, i.e that clang currently emits unnecessary overflow checks. Testing: I ran the check-clang and check-ubsan targets with a stage2, ubsan-enabled build of clang. I added a regression test for PR32874, and some extra checking to make sure we don't regress runtime checking for unsafe arithmetic. The existing ubsan-promoted-arithmetic.cpp test also provides coverage for this change. llvm-svn: 301988	2017-05-02 23:46:56 +00:00
Sanjay Patel	77f3b188a2	[CodeGen] remove/fix checks that will fail when r301923 is recommitted Don't test the optimizer as part of front-end verification. llvm-svn: 301928	2017-05-02 15:20:18 +00:00
Simon Pilgrim	96d02f5503	[X86][AVX] Added support for _mm256_zext* helper intrinsics (PR32839) llvm-svn: 301749	2017-04-29 17:17:06 +00:00
Simon Pilgrim	99ed27053d	[X86][SSE] Add _mm_set_pd1 (PR32827) Matches _mm_set_ps1 implementation llvm-svn: 301637	2017-04-28 10:28:32 +00:00
Rui Ueyama	0fcbb2893e	Revert r301487: Replace HashString algorithm with xxHash64 This reverts commit r301487 to make buildbots green. llvm-svn: 301491	2017-04-26 23:15:10 +00:00
Rui Ueyama	87b30ac9d3	Replace HashString algorithm with xxHash64 The previous algorithm processed one character at a time, which is very painful on a modern CPU. Replace it with xxHash64, which both already exists in the codebase and is fairly fast. Patch from Scott Smith! Differential Revision: https://reviews.llvm.org/D32509 llvm-svn: 301487	2017-04-26 22:45:04 +00:00
Vedant Kumar	e859ebbd06	[ubsan] Skip alignment checks on allocas with known alignment It's possible to determine the alignment of an alloca at compile-time. Use this information to skip emitting some runtime alignment checks. Testing: check-clang, check-ubsan. This significantly reduces the number of alignment checks we emit when compiling X86ISelLowering.cpp. Here are the numbers of alignment checks from patched/unpatched clangs based on r301361: unpatched, -O0: 47195; patched, -O0: 30876 (-34.6%). llvm-svn: 301377	2017-04-26 02:17:21 +00:00
Evgeniy Stepanov	c7b90947bd	[asan] Unconditionally enable GC of globals on COFF. This change restores pre-r301225 behavior, where linker GC compatible global instrumentation was used on COFF targets disregarding -f(no-)data-sections and/or /Gw flags. This instrumentation puts each global in a COMDAT with an ASan descriptor for that global. It effectively enables -fdata-sections, but limits it to ASan-instrumented globals. llvm-svn: 301374	2017-04-26 00:51:06 +00:00
Davide Italiano	e2ff98d9f8	[PGO/tests] Update comment to reflect reality. llvm-svn: 301344	2017-04-25 18:04:31 +00:00
Davide Italiano	44f6ea8818	[PGO] Update test now that we don't call IndirectCallPromotion. llvm-svn: 301339	2017-04-25 17:48:10 +00:00
Evgeniy Stepanov	df217a2f3c	[asan] Disable ASan global-GC depending on the target and compiler flags. llvm-svn: 301225	2017-04-24 19:34:12 +00:00
David Blaikie	8150355498	Move Split DWARF handling to an MC option/command line argument rather than using metadata Since Split DWARF needs to name the actual .dwo file that is generated, it can't be known at the time the llvm::Module is produced as it may be merged with other Modules before the object is generated and that object may be generated with any name. By passing the Split DWARF file name when LLVM is producing object code the .dwo file name in the object file can match correctly. The support for Split DWARF for implicit modules remains the same - using metadata to store the dwo name and dwo id so that potentially multiple skeleton CUs referring to different dwo files can be generated from one llvm::Module. llvm-svn: 301063	2017-04-21 23:35:36 +00:00
Adam Nemet	03af42444b	Don't pass FPOpFusion::Strict to the backend This restores the behavior prior to D31167 where the code-gen default was FPC_On which mapped to FPOpFusion::Standard. After merging the FE state (on/off) and the code-gen state (on/fast/off), the default became off to match the front-end. In other words, the front-end controls when to fuse along the language standards and the backend shouldn't override this by splitting fused intrinsics as FPOpFusion::Strict would imply. Differential Revision: https://reviews.llvm.org/D32301 llvm-svn: 300858	2017-04-20 17:09:35 +00:00
David Blaikie	6e2ec5f10e	Parse backend options during thinlto backend compile actions llvm-svn: 300741	2017-04-19 20:08:21 +00:00
Adrian Prantl	c3782a1a6f	Debug Info: Remove special-casing of indirect function argument handling. LLVM has changed the semantics of dbg.declare for describing function arguments. After this patch a dbg.declare always takes the address of a variable as the first argument, even if the argument is not an alloca. https://bugs.llvm.org/show_bug.cgi?id=32382 rdar://problem/31205000 llvm-svn: 300523	2017-04-18 01:22:01 +00:00
Simon Pilgrim	9f6e79c5e4	[X86][SSE] Update MOVNTDQA non-temporal loads to generic implementation (clang) MOVNTDQA non-temporal aligned vector loads can be correctly represented using generic builtin loads, allowing us to remove the existing x86 intrinsics. LLVM companion patch: D31767. Differential Revision: https://reviews.llvm.org/D31766 llvm-svn: 300326	2017-04-14 15:05:57 +00:00
Yaxun Liu	b34ec829be	[OpenCL] Map default address space to alloca address space For OpenCL, the private address space qualifier is 0 in AST. Before this change, 0 address space qualifier is always mapped to target address space 0. As now target private address space is specified by alloca address space in data layout, address space qualifier 0 needs to be mapped to alloca addr space specified by the data layout. This change has no impact on targets whose alloca addr space is 0. With contributions from Matt Arsenault, Tony Tye and Wen-Heng (Jack) Chung Differential Revision: https://reviews.llvm.org/D31404 llvm-svn: 299965	2017-04-11 17:24:23 +00:00
Reid Kleckner	eb9dd5b87f	Reland "[IR] Make AttributeSetNode public, avoid temporary AttributeList copies" This re-lands r299875. I introduced a bug in Clang code responsible for replacing K&R, no prototype declarations with a real function definition with a prototype. The bug was here: // Collect any return attributes from the call. - if (oldAttrs.hasAttributes(llvm::AttributeList::ReturnIndex)) - newAttrs.push_back(llvm::AttributeList::get(newFn->getContext(), - oldAttrs.getRetAttributes())); + newAttrs.push_back(oldAttrs.getRetAttributes()); Previously getRetAttributes() carried AttributeList::ReturnIndex in its AttributeList. Now that we return the AttributeSetNode* directly, it no longer carries that index, and we call this overload with a single node: AttributeList::get(LLVMContext&, ArrayRef<AttributeSetNode*>) That aborted with an assertion on x86_32 targets. I added an explicit triple to the test and added CHECKs to help find issues like this in the future sooner. llvm-svn: 299899	2017-04-10 23:31:05 +00:00
Matt Arsenault	d972949b10	Update for lifetime intrinsic signature change llvm-svn: 299877	2017-04-10 20:18:45 +00:00
Evgeniy Stepanov	1a8030e737	[cfi] Emit __cfi_check stub in the frontend. Previously __cfi_check was created in LTO optimization pipeline, which means LLD has no way of knowing about the existence of this symbol without rescanning the LTO output object. As a result, LLD fails to export __cfi_check, even when given --export-dynamic-symbol flag. llvm-svn: 299806	2017-04-07 23:00:38 +00:00
Hans Wennborg	f6388b182d	Attempt to fix ms-intrinsics.c test llvm-svn: 299785	2017-04-07 17:01:56 +00:00
Hans Wennborg	5c3c51fe05	Implement _interlockedbittestandset as a builtin It's used by MS headers in VS 2017 without including intrin.h, so we can't implement it in the header anymore. Differential Revision: https://reviews.llvm.org/D31736 llvm-svn: 299782	2017-04-07 16:41:47 +00:00
Adam Nemet	60d3264d5f	Add #pragma clang fp This adds the new pragma and the first variant, contract(on/off/fast). The pragma has the same block scope rules as STDC FP_CONTRACT, i.e. it can be placed at the beginning of a compound statement or at file scope. Similarly to STDC FP_CONTRACT there is no need to use attributes. First an annotate token is inserted with the parsed details of the pragma. Then the annotate token is parsed in the proper contexts and the Sema is updated with the corresponding FPOptions using the shared ActOn function with STDC FP_CONTRACT. After this the FPOptions from the Sema is propagated into the AST expression nodes. There is no change here. I was going to add a 'default' option besides 'on/off/fast' similar to STDC FP_CONTRACT but then decided against it. I think that we'd have to make the option uppercase then to avoid using 'default' the keyword. Also, because of the scoped activation of the pragma, I am not sure there is really a need for this. Differential Revision: https://reviews.llvm.org/D31276 llvm-svn: 299470	2017-04-04 21:18:36 +00:00
Adam Nemet	370d0877f6	Set FMF for -ffp-contract=fast With this, FMF(contract) becomes an alternative way to express the request to contract. These are currently only propagated for FMul, FAdd and FSub. The rest will be added as more FMFs are hooked up for this. This is toward fixing PR25721. Differential Revision: https://reviews.llvm.org/D31168 llvm-svn: 299469	2017-04-04 21:18:30 +00:00
Coby Tayree	b76fb84032	[fixup][X86][inline-asm] Add support for MS 'EVEN' directive refining tested targets resolution, to amend failures caused by rL299454 llvm-svn: 299459	2017-04-04 19:20:21 +00:00
Coby Tayree	b186493cc8	[X86][inline-asm] Add support for MS 'EVEN' directive MS assembly syntax provides the 'EVEN' directive as a synonym for the AT&T '.even'. This patch includes the (small, simple) changes needed to allow it. llvm-side: https://reviews.llvm.org/D27417 Differential Revision: https://reviews.llvm.org/D27418 llvm-svn: 299454	2017-04-04 17:58:28 +00:00
Michael Zuckerman	13bcf4944a	Fix problem with test. llvm-svn: 299442	2017-04-04 15:44:06 +00:00
Michael Zuckerman	755a13db3d	[X86][Clang] Converting __mm{|256|512}_movm_epi{8|16|32|64} LLVMIR call into generic intrinsics. This patch is part two of two reviews, one for clang and the other for LLVM. In this patch, I covered the clang side, by introducing the intrinsic to the front end. This is done by creating a generic replacement. Differential Revision: https://reviews.llvm.org/D31394a llvm-svn: 299431	2017-04-04 13:29:53 +00:00
Teresa Johnson	b637cb07ed	[ThinLTO] Handle -emit-llvm* in ThinLTO backends Summary: Use PreCodeGenModuleHook to invoke the correct writer when emitting LLVM IR, returning false to skip codegen from within thinBackend. Reviewers: pcc, mehdi_amini Subscribers: Prazek, cfe-commits Differential Revision: https://reviews.llvm.org/D31534 llvm-svn: 299274	2017-03-31 22:35:47 +00:00
Craig Topper	f771f79b2f	[Sema][X86] Update immediate check for gather/scatter prefetch instructions to match the _MM_HINT_T0/T1 constant definitions Our _MM_HINT_T0/T1 constant values are 3/2 which matches gcc, but not icc or Intel documentation. Interestingly gcc had this same bug on their implementation of the gather/scatter builtins at one point too. Fixes PR32411. llvm-svn: 299233	2017-03-31 17:22:30 +00:00
Petar Jovanovic	9b8b9e81dd	[mips][msa] Range adjustment for ldi_b builtin function operand Reasoning behind this change was allowing the function to accept all values from range [-128, 255] since all of them can be encoded in an 8bit wide value. This differs from the prior state where only range [-128, 127] was accepted, where values were assumed to be signed, whereas now the actual interpretation of the immediate is deferred to the consumer as required. Patch by Stefan Maksimovic. Differential Revision: https://reviews.llvm.org/D31082 llvm-svn: 299229	2017-03-31 16:16:43 +00:00
Teresa Johnson	3cfab4b934	Add back test for r299152 I am hoping the bot failures are addressed by using cc1 for the ThinLTO backend invocations as well. llvm-svn: 299217	2017-03-31 13:48:18 +00:00
Teresa Johnson	adeae05f2d	Revert test added in r299152 Removing the test until I can figure out how to get the ThinLTO backend invocation of clang to use the correct target. llvm-svn: 299181	2017-03-31 04:29:07 +00:00
Teresa Johnson	163e4992b7	Add target-cpu Sigh, another follow-on fix needed for test in r299152 causing bot failures. We also need the target-cpu for the ThinLTO BE clang invocation. llvm-svn: 299178	2017-03-31 03:49:52 +00:00
Teresa Johnson	0c835d21c0	Add more target triples to test Third and hopefully final fix to test for r299152 that is causing bot failures: make sure the target triple is specified for the ThinLTO backend clang invocations as well. llvm-svn: 299176	2017-03-31 03:27:47 +00:00
Teresa Johnson	fcb8989d72	Fix new compile command in test My previous attempt to fix bot failures from r299152 didn't add the necessary option to get bitcode out of the cc1 step. llvm-svn: 299173	2017-03-31 02:55:31 +00:00
Teresa Johnson	ae9c74280c	Add triple to new test Attempt to fix bot errors from r299152 by using clang_cc1 and specifying target triple to compile step. llvm-svn: 299170	2017-03-31 02:36:47 +00:00
Teresa Johnson	5ed6c10761	[ThinLTO] Set up lto::Config properly for codegen in ThinLTO backends Summary: This involved refactoring out pieces of EmitAssemblyHelper::CreateTargetMachine for use in runThinLTOBackend. Subsumes D31114. Reviewers: mehdi_amini, pcc Subscribers: Prazek, cfe-commits Differential Revision: https://reviews.llvm.org/D31508 llvm-svn: 299152	2017-03-31 02:05:15 +00:00
Dean Michael Berris	ac7a2f97d4	fixup: use CHECK for non-attribute sets llvm-svn: 299127	2017-03-30 22:46:49 +00:00
Dean Michael Berris	504fc2262a	[XRay][clang] Fix the -fxray-instruction-threshold flag processing Summary: The refactoring introduced a regression in the flag processing for -fxray-instruction-threshold which causes it to not get passed properly. This change should restore the previous behaviour. Reviewers: rnk, pelikan Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D31491 llvm-svn: 299126	2017-03-30 22:46:45 +00:00
Erich Keane	623efd8a75	Clang changes for alloc_align attribute GCC has the alloc_align attribute, which is similar to assume_aligned, except the attribute's parameter is the index of the integer parameter that needs aligning to. Differential Revision: https://reviews.llvm.org/D29599 llvm-svn: 299117	2017-03-30 21:48:55 +00:00
Dean Michael Berris	835832d37a	[XRay] Add -fxray-{always,never}-instrument= flags to clang Summary: The -fxray-always-instrument= and -fxray-never-instrument= flags take filenames that are used to imbue the XRay instrumentation attributes using a whitelist mechanism (similar to the sanitizer special cases list). We use the same syntax and semantics as the sanitizer blacklists files in the implementation. As implemented, we respect the attributes that are already defined in the source file (i.e. those that have the [[clang::xray_{always,never}_instrument]] attributes) before applying the always/never instrument lists. Reviewers: rsmith, chandlerc Subscribers: jfb, mgorny, cfe-commits Differential Revision: https://reviews.llvm.org/D30388 llvm-svn: 299041	2017-03-30 00:29:36 +00:00
Chandler Carruth	45bbe0117b	Revert r298491 and r298494 which changed Clang's handling of 'nonnull' attributes. These patches don't work because we can't currently access the parameter information in a reliable way when building attributes. I thought this would be relatively straightforward to fix, but it seems not to be the case. Fixing this will require a substantial re-plumbing of machinery to allow attributes to be handled in this location, and several other fixes to the attribute machinery should probably be made at the same time. All of this will make the patch .... substantially more complicated. Reverting for now as there are active miscompiles caused by the current version. llvm-svn: 298695	2017-03-24 09:11:57 +00:00
Dehao Chen	1240bd31e9	Update the SamplePGO test to verify that unroll/icp is not invoked in thinlto compile phase. Summary: This is the test added for https://reviews.llvm.org/D31217 Reviewers: tejohnson, mehdi_amini Reviewed By: tejohnson Subscribers: cfe-commits, Prazek Differential Revision: https://reviews.llvm.org/D31219 llvm-svn: 298647	2017-03-23 21:20:17 +00:00
Teresa Johnson	488d1dc0ed	[ThinLTO] Clang support for emitting minimized bitcode for thin link Summary: Clang companion patch to LLVM patch D31027, which adds support for emitting minimized bitcode file for use in the thin link step. Add a cc1 option -fthin-link-bitcode=<file> to trigger this behavior. Depends on D31027. Reviewers: mehdi_amini, pcc Subscribers: cfe-commits, Prazek Differential Revision: https://reviews.llvm.org/D31050 llvm-svn: 298639	2017-03-23 19:47:49 +00:00
Hans Wennborg	043f402586	[X86] Implement __readgsqword (and the rest) as builtins (PR32373) It seems MS headers have started using __readgsqword, and since it's used in a header that doesn't include intrin.h, we can't implement it as an inline function anymore. That was already the case for __readfsdword, which Saleem added support for in r220859. This patch reuses that codegen to implement all of __read[fg]s{byte,word,dword,qword}. Differential Revision: https://reviews.llvm.org/D31248 llvm-svn: 298538	2017-03-22 19:13:13 +00:00
Simon Pilgrim	0f3a52b8c9	[X86][MMX] Add tests for _mm_set_ intrinsics llvm-svn: 298511	2017-03-22 14:55:43 +00:00
Chandler Carruth	421fa6c9e2	Remove an overly aggressive assert in r298491 and leave a comment explaining why we have to ignore errors here even though in other parts of codegen we can be more strict with builtins. Also add a test case based on the code in a TSan test that found this issue. llvm-svn: 298494	2017-03-22 10:38:07 +00:00
Chandler Carruth	9b3607f0a6	[nonnull] Teach Clang to attach the nonnull LLVM attribute to declarations and calls instead of just definitions, and then teach it to not attach such attributes even if the source code contains them. This follows the design direction discussed on cfe-dev here: http://lists.llvm.org/pipermail/cfe-dev/2017-January/052066.html The idea is that for C standard library builtins, even if the library vendor chooses to annotate their routines with __attribute__((nonnull)), we will ignore those attributes which pertain to pointer arguments that have an associated size. This allows the widespread (and seemingly reasonable) pattern of calling these routines with a null pointer and a zero size. I have only done this for the library builtins currently recognized by Clang, but we can now trivially add to this set. This will be controllable with -fno-builtin if anyone should care to do so. Note that this does not change the AST. As a consequence, warnings, static analysis, and source code rewriting are not impacted. This isn't even a regression on any platform as neither Clang nor LLVM have ever put 'nonnull' onto these arguments for declarations. All this patch does is enable it on other declarations while preventing us from ever accidentally enabling it on these libc functions due to a library vendor. It will also allow any other libraries using this annotation to gain optimizations based on the annotation even when only a declaration is visible. llvm-svn: 298491	2017-03-22 09:09:13 +00:00
Adam Nemet	5827756e90	Remove -ffp-contract=fast from this test It does not need it and causes mismatch after -ffp-contract=fast is turned into an FMF. llvm-svn: 298469	2017-03-22 00:58:18 +00:00
Adam Nemet	3087c96348	Change -ffp-contract=fast test to run on Aarch64 (I don't have powerpc enabled in my build and I am changing how -ffp-contract=fast works.) llvm-svn: 298468	2017-03-22 00:58:15 +00:00
Eric Christopher	758aad76d8	Remove the -faltivec alias option and replace it with -maltivec everywhere. The alias was only ever used on darwin and had some issues there, and isn't used in practice much. Also fixes a problem with -mno-altivec not turning off -maltivec. Also add a diagnostic for faltivec/fno-altivec that directs users to use maltivec options and include the altivec.h file explicitly. llvm-svn: 298449	2017-03-21 22:06:18 +00:00
George Burgess IV	a63f91574f	Let llvm.objectsize be conservative with null pointers D28494 adds another parameter to @llvm.objectsize. Clang needs to be sure to pass that third arg whenever applicable. llvm-svn: 298431	2017-03-21 20:09:35 +00:00
Dehao Chen	ce39fdd6ee	Clang change: Do not inline hot callsites for samplepgo in thinlto compile phase. Summary: Because SamplePGO passes will be invoked twice in a ThinLTO build (once in the compile phase, once in the backend), we want to make sure the IR in the 2nd phase matches the hot part of the profile, so we do not want to inline hot callsites in the first phase. Reviewers: tejohnson, eraman Reviewed By: tejohnson Subscribers: mehdi_amini, cfe-commits, Prazek Differential Revision: https://reviews.llvm.org/D31202 llvm-svn: 298429	2017-03-21 19:55:46 +00:00
Coby Tayree	665b89bac5	[X86][MS-compatibility][clang] allow MS TYPE/SIZE/LENGTH operators as a part of a compound expression This patch introduces X86AsmParser with the ability to handle the aforementioned ops within compound "MS" arithmetical expressions. Currently - only supported as a standalone Operand, e.g.: "TYPE X" now allowed : "4 + TYPE X * 128" LLVM side: https://reviews.llvm.org/D31173 Differential Revision: https://reviews.llvm.org/D31174 llvm-svn: 298426	2017-03-21 19:33:32 +00:00
Simon Pilgrim	60e924985c	[X86][AVX512] Add _mm512_cvtsd_f64 and _mm512_cvtss_f32 intrinsics (PR32305) Differential Revision: https://reviews.llvm.org/D31155 llvm-svn: 298364	2017-03-21 12:46:13 +00:00
Igor Breger	f050b797ac	[X86][AVX512][Clang][Intrinsics] Adding missing intrinsics to Clang. Summary: Adding missing intrinsics: _mm512_set_epi16, _mm512_set_epi8, _mm512_permutevar_epi32, _mm512_mask_permutevar_epi32 Reviewers: zvi, guyblank, eladcohen, craig.topper Reviewed By: craig.topper Subscribers: craig.topper, cfe-commits Differential Revision: https://reviews.llvm.org/D31034 llvm-svn: 298208	2017-03-19 08:27:16 +00:00
Nirav Dave	8497ef4086	[X86] Add NumRegisterParameters Module Flag. Reviewers: rnk, mkuper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27051 llvm-svn: 298177	2017-03-18 00:43:39 +00:00
Craig Topper	208c80556c	[AVX-512] Fix test cases that were using the builtins directly without typecasts instead of the intrinsic header. llvm-svn: 298041	2017-03-17 05:59:22 +00:00
Simon Pilgrim	6c38e71476	[X86][XOP] Add codegen tests for vector integer comparison intrinsics (PR15844) We were testing for the generic _mm_com_* intrinsics, but not the specific comparison mode versions. llvm-svn: 297885	2017-03-15 20:20:43 +00:00
Sanjay Patel	e795daa55e	[x86] these aren't the undefs you're looking for (PR32176) x86 has undef SSE/AVX intrinsics that should represent a bogus register operand. This is not the same as LLVM's undef value which can take on multiple bit patterns. There are better solutions / follow-ups to this discussed here: https://bugs.llvm.org/show_bug.cgi?id=32176 ...but this should prevent miscompiles with a one-line code change. Differential Revision: https://reviews.llvm.org/D30834 llvm-svn: 297588	2017-03-12 19:15:10 +00:00
Petar Jovanovic	bc97ab28a4	[mips][msa] Remove range checks for non-immediate sld.[bhwd] instructions Removes immediate range checks for these instructions, since they have GPR rt as their input operand. Patch by Stefan Maksimovic. Differential Revision: https://reviews.llvm.org/D30693 llvm-svn: 297485	2017-03-10 17:51:01 +00:00
Roger Ferrer Ibanez	3fa38a14ac	Honor __unaligned in codegen for declarations and expressions This patch honors the unaligned type qualifier (currently available through the keyword __unaligned and -fms-extensions) in CodeGen. In the current form the patch affects declarations and expressions. It does not affect fields of classes. Differential Revision: https://reviews.llvm.org/D30166 llvm-svn: 297276	2017-03-08 14:00:44 +00:00
Sanjay Patel	3a8ec02743	fix test to not check optimized IR; NFCI This test broke with an LLVM instcombine patch (r297166). I changed the RUN line to only run -mem2reg (to save time checking this large chunk of tests) and updated the checks using the script attached to D17999: https://reviews.llvm.org/D17999 The goal is to make this test immune to optimizer changes. If there's something in these tests that was checking for an IR optimization, that should be tested in LLVM, not Clang. llvm-svn: 297189	2017-03-07 19:24:54 +00:00

4255 Commits