llvm-project

Commit Graph

Author	SHA1	Message	Date
Ivan Kosarev	be4eaf10ee	[Clang][CodeGen] Fix the cmse-clear-return.c test. Caught with D125604. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D126191	2022-05-24 12:49:42 +01:00
Vitaly Buka	4f89ff3fc7	[test][clang] Move -O3 in command line	2022-05-23 15:57:14 -07:00
Stephen Long	4f1e64b54f	[MSVC, ARM64] Add __readx18 intrinsics https://docs.microsoft.com/en-us/cpp/intrinsics/arm64-intrinsics?view=msvc-170 unsigned char __readx18byte(unsigned long) unsigned short __readx18word(unsigned long) unsigned long __readx18dword(unsigned long) unsigned __int64 __readx18qword(unsigned long) Given the lack of documentation of the intrinsics, we chose to align the offset with just `CharUnits::One()` when calling `IRBuilderBase::CreateAlignedLoad()` Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D126024	2022-05-23 10:59:12 -07:00
Stephen Long	3e0be5610f	[MSVC, ARM64] Add __writex18 intrinsics https://docs.microsoft.com/en-us/cpp/intrinsics/arm64-intrinsics?view=msvc-170 void __writex18byte(unsigned long, unsigned char) void __writex18word(unsigned long, unsigned short) void __writex18dword(unsigned long, unsigned long) void __writex18qword(unsigned long, unsigned __int64) Given the lack of documentation of the intrinsics, we chose to align the offset with just `CharUnits::One()` when calling `IRBuilderBase::CreateAlignedStore()`. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D126023	2022-05-23 07:01:11 -07:00
Stephen Long	ae80024fbe	[clang] Honor __attribute__((no_builtin("foo"))) on functions Support for `__attribute__((no_builtin("foo")))` was added in https://reviews.llvm.org/D68028, but builtins were still being used even when the attribute was placed on a function. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D124701	2022-05-20 06:41:47 -07:00
Aaron Ballman	d374b65f2d	Drop qualifiers from return types in C (DR423) WG14 DR423 (https://www.open-std.org/jtc1/sc22/wg14/www/docs/n2148.htm#dr_423), resolved during the C11 time frame, changed the way qualifiers are handled on function return types and in cast expressions after it was noted that these types are now directly observable via generic selection expressions. In C, the function declarator is adjusted to ignore all qualifiers (including _Atomic qualifiers). Clang already handles the cast expression case correctly (by performing the lvalue conversion, which drops the qualifiers as well), but with these changes it will now also handle function declarations appropriately. Fixes #39595 Differential Revision: https://reviews.llvm.org/D125919	2022-05-19 13:06:50 -04:00
Nuno Lopes	5fc9449c96	[DeadArgElim] Use poison instead of undef as placeholder for dead arguments It doesn't matter which value we use for dead args, so let's switch to poison, so we can eventually kill undef. Reviewed By: aeubanks, fhahn Differential Revision: https://reviews.llvm.org/D125983	2022-05-19 18:00:24 +01:00
Amy Kwan	c35ca3a1c7	[PowerPC] Implement XL compat __fnabs and __fnabss builtins. This patch implements the following floating point negative absolute value builtins that required for compatibility with the XL compiler: ``` double __fnabs(double); float __fnabss(float); ``` These builtins will emit : - fnabs on PWR6 and below, or if VSX is disabled. - xsnabsdp on PWR7 and above, if VSX is enabled. Differential Revision: https://reviews.llvm.org/D125506	2022-05-19 11:28:40 -05:00
Mitch Phillips	7aa1fa0a0a	Reland "[dwarf] Emit a DIGlobalVariable for constant strings." An upcoming patch will extend llvm-symbolizer to provide the source line information for global variables. The goal is to move AddressSanitizer off of internal debug info for symbolization onto the DWARF standard (and doing a clean-up in the process). Currently, ASan reports the line information for constant strings if a memory safety bug happens around them. We want to keep this behaviour, so we need to emit debuginfo for these variables as well. Reviewed By: dblaikie, rnk, aprantl Differential Revision: https://reviews.llvm.org/D123534	2022-05-18 13:56:45 -07:00
Zi Xuan Wu (Zeson)	dca37af061	[NFC][Clang] Modify expect of fail test or XFAIL because CSKY align is different CSKY is always in 4-byte align, no matter it's long long type. For global aggregate variable, it's 4-byte align if its size is bigger than or equal to 4 bytes. Differential Revision: https://reviews.llvm.org/D124977	2022-05-18 10:53:30 +08:00
David Green	4c6a070a2c	[AArch64] Teach perfect shuffles tables about D-lane movs Similar to D123386, this adds D-Movs to the AArch64 perfect shuffle tables, slightly lowering the costs a little more. This is a rough improvement in general, especially if you ignore mov v0.16b, v2.16b type moves that are often artefacts of the calling convention. The D register movs are encoded as (0x4 \| LaneIdx), and to generate a D register move we are required to bitcast into a higher type, but it is otherwise very similar to the S-lane mov's already supported. Differential Revision: https://reviews.llvm.org/D125477	2022-05-17 18:16:45 +01:00
Mitch Phillips	ed2c3218f5	Revert "[dwarf] Emit a DIGlobalVariable for constant strings." This reverts commit `4680982b36`. Broke a fuchsia windows bot. More details in the review: https://reviews.llvm.org/D123534	2022-05-16 19:07:38 -07:00
Mitch Phillips	4680982b36	[dwarf] Emit a DIGlobalVariable for constant strings. An upcoming patch will extend llvm-symbolizer to provide the source line information for global variables. The goal is to move AddressSanitizer off of internal debug info for symbolization onto the DWARF standard (and doing a clean-up in the process). Currently, ASan reports the line information for constant strings if a memory safety bug happens around them. We want to keep this behaviour, so we need to emit debuginfo for these variables as well. Reviewed By: dblaikie, rnk, aprantl Differential Revision: https://reviews.llvm.org/D123534	2022-05-16 16:52:16 -07:00
Stephen Long	b147717bb3	[MSVC] Add support for pragma alloc_text `#pragma alloc_text` is a MSVC pragma that names the code section where functions should be placed. It only applies to functions with C linkage. https://docs.microsoft.com/en-us/cpp/preprocessor/alloc-text?view=msvc-170 Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D125011	2022-05-16 07:00:17 -07:00
Egor Zhdan	2f04e703bf	[Clang] Add DriverKit support This is the second patch that upstreams the support for Apple's DriverKit. The first patch: https://reviews.llvm.org/D118046. Differential Revision: https://reviews.llvm.org/D121911	2022-05-13 20:34:57 +01:00
Stephen Long	3946de0456	[MSVC] Add support for pragma function MSVC pragma function tells the compiler to generate calls to functions in the pragma function list, instead of using the builtin. Needs https://reviews.llvm.org/D124701 https://docs.microsoft.com/en-us/cpp/preprocessor/function-c-cpp?view=msvc-170 Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D124702	2022-05-13 06:39:47 -07:00
Zakk Chen	7dfc56c107	[RISCV] Add the passthru operand for RVV unmasked segment load IR intrinsics. The goal is support tail and mask policy in RVV builtins. We focus on IR part first. If the passthru operand is undef, we use tail agnostic, otherwise use tail undisturbed. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D125323	2022-05-13 02:16:40 -07:00
Quentin Colombet	9766fed9c1	[DeadArgElim] Re-apply: Set unused arguments for internal functions The re-apply includes fixes to clang tests that were missed in the original commit. Original message: Prior to this patch we would only set to undef the unused arguments of the external functions. The rationale was that unused arguments of internal functions wouldn't need to be turned into undef arguments because they should have been simply eliminated by the time we reach that code. This is actually not true because there are plenty of cases where we can't remove unused arguments. For instance, if the internal function is used in an indirect call, it may not be possible to change the function signature. Yet, for statically known call-sites we would still like to mark the unused arguments as undef. This patch enables the "set undef arguments" optimization on internal functions when we encounter cases where internal functions cannot be optimized. I.e., whenever an internal function is marked "live". Differential Revision: https://reviews.llvm.org/D124699	2022-05-12 08:46:16 -07:00
David Tenty	d9c1d3cbcb	[clang][AIX] Don't ignore XCOFF visibility by default D87451 added -mignore-xcoff-visibility for AIX targets and made it the default (which mimicked the behaviour of the XL 16.1 compiler on AIX). However, ignoring hidden visibility has unwanted side effects and some libraries depend on visibility to hide non-ABI facing entities from user headers and reserve the right to change these implementation details based on this (https://libcxx.llvm.org/DesignDocs/VisibilityMacros.html). This forces us to use internal linkage fallbacks for these cases on AIX and creates an unwanted divergence in implementations on the plaform. For these reasons, it's preferable to not add -mignore-xcoff-visibility by default, which is what this patch does. Reviewed By: DiggerLin Differential Revision: https://reviews.llvm.org/D125141	2022-05-11 13:27:48 -04:00
Matt Devereau	75bb815231	[AArch64][SVE] Add aarch64_sve_pcs attribute to Clang Enable function attribute aarch64_sve_pcs at the C level, which correspondes to aarch64_sve_vector_pcs at the LLVM IR level. This requirement was created by this addition to the ARM C Language Extension: https://github.com/ARM-software/acle/pull/194 Differential Revision: https://reviews.llvm.org/D124998	2022-05-11 13:33:56 +00:00
Rosie Sumpter	131e6636f2	[Sema][SVE2] Move/simplify Sema testing for SVE2 ACLE builtins Currently for SVE2 ACLE builtins, single tests are used to verify both clang code generation (when the feature is available) and semantic error/warning messages (when the feature is unavailable). This patch moves the semantic testing for the target feature flag into dedicated Sema tests. Differential Revision: https://reviews.llvm.org/D124850	2022-05-10 13:24:17 +01:00
Rosie Sumpter	f635e63709	[Sema][SVE] Move/simplify Sema testing for SVE ACLE builtins Currently for SVE ACLE builtins, single tests are used to verify both clang code generation (when the feature is available) and semantic error/warning messages (when the feature is unavailable). This patch moves the semantic testing into dedicated Sema tests. Differential Revision: https://reviews.llvm.org/D124924	2022-05-10 13:17:51 +01:00
Simon Pilgrim	8a92c45e07	[Clang] Add integer mul reduction builtin Similar to the existing bitwise reduction builtins, this lowers to a llvm.vector.reduce.mul intrinsic call. For other reductions, we've tried to share builtins for float/integer vectors, but the fmul reduction intrinsic also take a starting value argument and can either do unordered or serialized, but not reduction-trees as specified for the builtins. However we address fmul support this shouldn't affect the integer case. Differential Revision: https://reviews.llvm.org/D117829	2022-05-09 12:12:53 +01:00
Simon Pilgrim	2cd080c884	[X86] rdrand-builtins.c - add 32-bit target coverage and enable -Wall/-Werror	2022-05-07 14:35:42 +01:00
Simon Pilgrim	6e345426de	[X86] Remove unused 'hint' argument from prefetch tests hint is a compile time constant and can't be passed in as a variable - we already hardcode	2022-05-07 13:38:40 +01:00
Simon Pilgrim	102824f048	[clang][X86] Rename some intrinsics tests to use the *-builtins.c naming convention	2022-05-06 14:49:46 +01:00
Richard Smith	c4f95ef86a	Reimplement `__builtin_dump_struct` in Sema. Compared to the old implementation: * In C++, we only recurse into aggregate classes. * Unnamed bit-fields are not printed. * Constant evaluation is supported. * Proper conversion is done when passing arguments through `...`. * Additional arguments are supported and are injected prior to the format string; this directly supports use with `fprintf`, for example. * An arbitrary callable can be passed rather than only a function pointer. In particular, in C++, a function template or overload set is acceptable. * All text generated by Clang is printed via `%s` rather than directly; this avoids issues where Clang's pretty-printing output might itself contain a `%` character. * Fields of types that we don't know how to print are printed with a `"%p"` format and passed by address to the print function. No return value is produced. Reviewed By: aaron.ballman, erichkeane, yihanaa Differential Revision: https://reviews.llvm.org/D124221	2022-05-05 14:55:47 -07:00
Aaron Ballman	2cb2cd242c	Change the behavior of implicit int diagnostics C89 allowed a type specifier to be elided with the resulting type being int, aka implicit int behavior. This feature was subsequently removed in C99 without a deprecation period, so implementations continued to support the feature. Now, as with implicit function declarations, is a good time to reevaluate the need for this support. This patch allows -Wimplicit-int to issue warnings in C89 mode (off by default), defaults the warning to an error in C99 through C17, and disables support for the feature entirely in C2x. It also removes a warning about missing declaration specifiers that really was just an implicit int warning in disguise and other minor related cleanups.	2022-05-04 08:35:47 -04:00
Phoebe Wang	b540ee5402	[X86] Fix redundant `%s` in RUN command. NFC	2022-05-04 20:29:50 +08:00
Simon Pilgrim	a23291b7db	[Clang] Add integer add reduction builtin Similar to the existing bitwise reduction builtins, this lowers to a llvm.vector.reduce.add intrinsic call. For other reductions, we've tried to share builtins for float/integer vectors, but the fadd reduction intrinsics also take a starting value argument and can either do unordered or serialized, but not reduction-trees as specified for the builtins. However we address fadd support this shouldn't affect the integer case. (Split off from D117829) Differential Revision: https://reviews.llvm.org/D124741	2022-05-02 11:03:25 +01:00
Kito Cheng	41b951c929	[RISCV] Fix int16 -> __fp16 conversion code gen clang emit wrong code sequence for `int16`(`short`) to `__fp16` conversion, and that should fix the code gen directly is the right way I think, but I found there is a FIXME comment in clang/Basic/TargetInfo.h say that's should be removed in future so I think just let swich to using generic LLVM IR rather than llvm.convert.to.fp16 intrinsics code gen path is enough. ``` /// Check whether llvm intrinsics such as llvm.convert.to.fp16 should be used /// to convert to and from __fp16. /// FIXME: This function should be removed once all targets stop using the /// conversion intrinsics. virtual bool useFP16ConversionIntrinsics() const { return true; } ``` Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D124509	2022-04-30 11:10:44 +08:00
Kito Cheng	02c7de3a4c	[RISCV] Precommit test for D124509 Test case to show the wrong code gen for `int16` -> `__fp16` conversion, clang just emit a load and store without did conversion in the case, and another case used for demonstrate the code gen change of `__fp16` -> `int16`. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D124510	2022-04-30 11:09:12 +08:00
joker881	19978e0874	[RISCV]Add CTZ Intrinsic for ZBB in Clang Add Intrinsics and test for B extension (updating coming soon (: Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D124348	2022-04-30 08:18:10 +08:00
David Candler	9e7c9967c3	Additionally set f32 mode with denormal-fp-math When the denormal-fp-math option is used, this should set the denormal handling mode for all floating point types. However, currently 32-bit float types can ignore this setting as there is a variant of the option, denormal-fp-math-f32, specifically for that type which takes priority when checking the mode based on type and remains at the default of IEEE. From the description, denormal-fp-math would be expected to set the mode for floats unless overridden by the f32 variant, and code in the front end only emits the f32 option if it is different to the general one, so setting just denormal-fp-math should be valid. This patch changes the denormal-fp-math option to also set the f32 mode. If denormal-fp-math-f32 is also specified, this is then overridden as expected, but if it is absent floats will be set to the mode specified by the former option, rather than remain on the default. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D122589	2022-04-29 15:06:32 +01:00
Joao Moreira	db1cec371c	[X86] Fix CodeGen Module Flag for -mibt-seal When assertions are enabled, clang will perform RoundTrip for CompilerInvocation argument generation. ibt-seal flags are currently missing in this argument generation, and because of that, the feature doesn't get enabled for these cases. Performing RoundTrip is the default for assert builds, rendering the feature broken in these scenarios. This patch fixes this and adds a test to properly verify that modules are being generated with the flag when -mibt-seal is used. Please, add any known relevant reviewer which I may have missed. [1] - https://reviews.llvm.org/D116070 Reviewed By: pengfei, gftg, aaron.ballman, nickdesaulniers Differential Revision: https://reviews.llvm.org/D118052	2022-04-29 15:37:28 +08:00
jonasyhwang	eaca933c59	[Clang][CodeGen]Fix __builtin_dump_struct missing record type field name Thanks for @rsmith to point this. I'm sorry for introducing this bug. See @rsmith 's comment in https://reviews.llvm.org/D122248 Eg:(By @rsmith ) https://godbolt.org/z/o7vcbWaEf I have added a test case struct: ``` struct U19A { int a; }; struct U19B { struct U19A a; }; struct U19B a = { .a.a = 2022 }; ``` Dump result: ``` struct U19B { struct U19A a = { int a = 2022 } } ``` Reviewed By: erichkeane Differential Revision: https://reviews.llvm.org/D122920	2022-04-29 12:58:53 +08:00
Craig Topper	bd30d4be23	[Driver] Add f16 support to -mrecip parsing. This is a followup to D120158 which added an 'h' suffix to the backend handling. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D124551	2022-04-28 08:33:52 -07:00
Rosie Sumpter	f7068c82a2	[Sema][SVE2] Move/simplify Sema testing for SVE2 ACLE builtins Currently for SVE2 ACLE builtins, single tests are used to verify both clang code generation (when the feature is available) and semantic error/warning messages (when the feature is unavailable). This WIP patch moves the semantic testing for the values of immediate arguments into dedicated Sema tests.	2022-04-28 13:41:09 +01:00
Jonas Paulsson	9b38e2efa0	[SystemZ] Fix C++ ABI for passing args of structs containing zero width bitfield. A struct like { float a; int :0; } should per the SystemZ ABI be passed in a GPR, but to match a bug in GCC it has been passed in an FPR (see `759449c`). GCC has now corrected the C++ ABI for this case, and this patch for clang follows suit. Reviewed By: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D122388	2022-04-26 17:16:14 +02:00
David Green	9727c77d58	[NFC] Rename Instrinsic to Intrinsic	2022-04-25 18:13:23 +01:00
Zakk Chen	ffe03ff75c	[RISCV] Fix incorrect policy implement for unmasked vslidedown and vslideup. vslideup works by leaving elements 0<i<OFFSET undisturbed. so it need the destination operand as input for correctness regardless of policy. Add a operand to indicate policy. We also add policy operand for unmaksed vslidedown to keep the interface consistent with vslideup because vslidedown have only undisturbed at 0<i<vstart but user have no way to control of vstart. Reviewed By: rogfer01, craig.topper Differential Revision: https://reviews.llvm.org/D124186	2022-04-25 09:18:41 -07:00
Xiang1 Zhang	afa536e33e	[x86] Support 3 builtin functions for 32-bits mode _mm_cvtsi128_si64, _mm_cvtsi64_si128, _mm_extract_epi64 Reviewed By:RKSimon, Topper Craig Differential Revision: https://reviews.llvm.org/D124067	2022-04-22 11:28:28 +08:00
Xiang1 Zhang	caf5ad5da7	Revert "[x86] Support 3 builtin functions for 32-bits mode" This reverts commit `a69c219a8c`.	2022-04-22 09:11:40 +08:00
Xiang1 Zhang	a69c219a8c	[x86] Support 3 builtin functions for 32-bits mode _mm_cvtsi128_si64, _mm_cvtsi64_si128, _mm_extract_epi64	2022-04-22 09:06:25 +08:00
Ulrich Weigand	1283ccb610	Support z16 processor name The recently announced IBM z16 processor implements the architecture already supported as "arch14" in LLVM. This patch adds support for "z16" as an alternate architecture name for arch14.	2022-04-21 19:58:22 +02:00
Pengxuan Zheng	38612fbc89	Reland "[COFF, ARM64] Add __break intrinsic" https://docs.microsoft.com/en-us/cpp/intrinsics/arm64-intrinsics?view=msvc-170 Reland after fixing the test failure. The failure was due to conflict with a change (D122983) which was merged right before this patch. Reviewed By: rnk, mstorsjo Differential Revision: https://reviews.llvm.org/D124032	2022-04-20 13:01:30 -07:00
Pengxuan Zheng	bff8356b19	Revert "[COFF, ARM64] Add __break intrinsic" This reverts commit `8a9b4fb4aa`.	2022-04-20 11:57:49 -07:00
Pengxuan Zheng	8a9b4fb4aa	[COFF, ARM64] Add __break intrinsic https://docs.microsoft.com/en-us/cpp/intrinsics/arm64-intrinsics?view=msvc-170 Reviewed By: rnk, mstorsjo Differential Revision: https://reviews.llvm.org/D124032	2022-04-20 11:20:26 -07:00
Aaron Ballman	7d644e1215	[C11/C2x] Change the behavior of the implicit function declaration warning C89 had a questionable feature where the compiler would implicitly declare a function that the user called but was never previously declared. The resulting function would be globally declared as extern int func(); -- a function without a prototype which accepts zero or more arguments. C99 removed support for this questionable feature due to severe security concerns. However, there was no deprecation period; C89 had the feature, C99 didn't. So Clang (and GCC) both supported the functionality as an extension in C99 and later modes. C2x no longer supports that function signature as it now requires all functions to have a prototype, and given the known security issues with the feature, continuing to support it as an extension is not tenable. This patch changes the diagnostic behavior for the -Wimplicit-function-declaration warning group depending on the language mode in effect. We continue to warn by default in C89 mode (due to the feature being dangerous to use). However, because this feature will not be supported in C2x mode, we've diagnosed it as being invalid for so long, the security concerns with the feature, and the trivial workaround for users (declare the function), we now default the extension warning to an error in C99-C17 mode. This still gives users an easy workaround if they are extensively using the extension in those modes (they can disable the warning or use -Wno-error to downgrade the error), but the new diagnostic makes it more clear that this feature is not supported and should be avoided. In C2x mode, we no longer allow an implicit function to be defined and treat the situation the same as any other lookup failure. Differential Revision: https://reviews.llvm.org/D122983	2022-04-20 11:30:12 -04:00
Simon Pilgrim	1226d276b4	[X86][AVX512] Rename avx512popcntdq intrinsics tests files to match *-builtins.c naming convention	2022-04-20 15:12:12 +01:00
Simon Pilgrim	72d4e3dc2d	[X86][AVX] Add i386 test coverage to avx2 intrinsic tests	2022-04-20 15:12:12 +01:00
Simon Pilgrim	0140a672a6	[X86][AVX] Add i386 test coverage to avx-vnni intrinsic tests	2022-04-20 15:12:12 +01:00
Simon Pilgrim	2c176f2f1e	[X86][AVX] Add i386 test coverage to avx intrinsic tests	2022-04-20 15:12:11 +01:00
Simon Pilgrim	3949c2de79	[X86][SSE] Add i386 test coverage to sse2 intrinsic tests	2022-04-20 12:05:10 +01:00
Simon Pilgrim	7c1bff3f7b	[X86][FMA4] Add i386 test coverage to fma4 intrinsic tests	2022-04-20 11:14:18 +01:00
Simon Pilgrim	ba2e567f04	[X86][SSE] Add i386 test coverage to sse42 intrinsic tests	2022-04-20 11:06:07 +01:00
Simon Pilgrim	90e5c690e2	[X86][SSE] Add i386 test coverage to sse41 intrinsic tests	2022-04-20 11:06:06 +01:00
Simon Pilgrim	b402ea55a8	[X86][SSE] Add i386 test coverage to sse4a intrinsic tests	2022-04-20 10:48:47 +01:00
Simon Pilgrim	c86588af65	[X86][SSE] Add i386 test coverage to ssse3 intrinsic tests	2022-04-20 10:44:28 +01:00
Simon Pilgrim	88d61cc6e9	[X86][SSE] Add i386 test coverage to sse3 intrinsic tests	2022-04-20 10:44:28 +01:00
Simon Pilgrim	6574d75b8d	[XOP] Add i386 test coverage to xop intrinsic tests	2022-04-20 10:44:27 +01:00
Chen Zheng	3c776c70a7	[PowerPC] add XLC compat builtin __abs Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D123372	2022-04-20 05:14:22 -04:00
Zakk Chen	bd0d126302	[RISCV][Clang][NFC] Update vid intrinsic tests. Re-run the update_cc_test_checks.py to update expected result. I'm not sure why those tests are passed before. Differential Revision: https://reviews.llvm.org/D124062	2022-04-20 01:35:53 -07:00
Jonas Paulsson	4aa5dc15f0	[SystemZ] Handle SystemZ specific inline assembly address operands. Handle ZQ, ZR, ZS and ZT inline assembly operand constraints. Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D110267	2022-04-19 16:55:45 +02:00
Eli Friedman	4802edd1ac	Fix size of flexible array initializers, and re-enable assertions. In D123649, I got the formula for getFlexibleArrayInitChars slightly wrong: the flexible array elements can be contained in the tail padding of the struct. Fix the formula to account for that. With the fixed formula, we run into another issue: in some cases, we were emitting extra padding for flexible arrray initializers. Fix CGExprConstant so it uses a packed struct when necessary, to avoid this extra padding. Differential Revision: https://reviews.llvm.org/D123826	2022-04-15 12:09:57 -07:00
Eli Friedman	6cf0b1b3da	Comment out assertions about initializer size added in D123649. They're causing failures in LLVM test-suite. Added some regression tests that explain the issue.	2022-04-14 13:58:17 -07:00
John Brawn	1b1466c346	[AArch64] Adjust aarch64 constrained intrinsics tests and un-XFAIL Remove the checking of the generated asm, as that's already tested elsewhere, and adjust some tests that were expecting the wrong intrinsic to be generated. Differential Revision: https://reviews.llvm.org/D118259	2022-04-14 16:51:22 +01:00
David Truby	66c44b20b4	[Clang][AArch64][SVE] Add shift operators for SVE vector types This patch enables shift operators on SVE vector types, as well as supporting vector-scalar shift operations. Shifts by a scalar that is wider than the contained type in the vector are permitted but as in the C standard if the value is larger than the width of the type the behavior is undefined. Differential Revision: https://reviews.llvm.org/D123303	2022-04-14 13:20:50 +01:00
David Truby	53fd8db791	[Clang][AArch64][SVE] Allow subscript operator for SVE types Undefined behaviour is just passed on to extract_element when the index is out of bounds. Subscript on svbool_t is not allowed as this doesn't really have meaningful semantics. Differential Revision: https://reviews.llvm.org/D122732	2022-04-14 13:20:50 +01:00
joker881	a4f47a99aa	RISCV] Add clang builtins for CLZ instruction. add intrinsic for CLZ Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D121915	2022-04-14 12:29:15 +08:00
Aaron Ballman	385e7df330	Correctly diagnose prototype redeclaration errors in C We did not implement C99 6.7.5.3p15 fully in that we missed the rule for compatible function types where a prior declaration has a prototype and a subsequent definition (not just declaration) has an empty identifier list or an identifier list with a mismatch in parameter arity. This addresses that situation by issuing an error on code like: void f(int); void f() {} // type conflicts with previous declaration (Note: we already diagnose the other type conflict situations appropriately, this was the only situation we hadn't covered that I could find.)	2022-04-13 08:21:31 -04:00
Jonas Paulsson	46f83caebc	[InlineAsm] Add support for address operands ("p"). This patch adds support for inline assembly address operands using the "p" constraint on X86 and SystemZ. This was in fact broken on X86 (see example at https://reviews.llvm.org/D110267, Nov 23). These operands should probably be treated the same as memory operands by CodeGenPrepare, which have been commented with "TODO" there. Review: Xiang Zhang and Ulrich Weigand Differential Revision: https://reviews.llvm.org/D122220	2022-04-13 12:50:21 +02:00
Daniel Kiss	b0343a38a5	Support the min of module flags when linking, use for AArch64 BTI/PAC-RET LTO objects might compiled with different `mbranch-protection` flags which will cause an error in the linker. Such a setup is allowed in the normal build with this change that is possible. Reviewed By: pcc Differential Revision: https://reviews.llvm.org/D123493	2022-04-13 09:31:51 +02:00
Arthur Eubanks	cbcdd5ff8a	[clang][test] Disable opaque pointers in test Was missed in opaque pointer switch due to not being run on x86.	2022-04-12 19:15:33 -07:00
Quinn Pham	7d7022fb0c	[PowerPC] Fix EmitPPCBuiltinExpr to emit arguments once This patch changes `EmitPPCBuiltinExpr` in `CGBuiltin.cpp` to remove the loop at the beginning of the function that emits the arguments and to delay emitting the arguments until inside the switch statement. These changes will put `EmitPPCBuiltinExpr` in line with the strategy of the target independent function `EmitBuiltinExpr`. Also, this patch ensures that arguments are only emitted once. Tests that included builtins affected by these changes have been modified to match expected behaviour. Reviewed By: #powerpc, nemanjai, amyk Differential Revision: https://reviews.llvm.org/D121637	2022-04-12 15:33:20 -05:00
Fangrui Song	c2dafea758	[CodeGen][test] Fix disable-tail-calls.c if CLANG_ENABLE_OPAQUE_POINTERS_INTERNAL is off	2022-04-11 22:27:40 -07:00
Fangrui Song	fe02896a79	[Driver] -fno-optimize-sibling-calls: use the same spelling for its -cc1 counterpart And remove a -no-opaque-pointers	2022-04-11 22:21:24 -07:00
Arthur Eubanks	29363f80a8	[test] Remove various legacy pass manager tests The new PM been the default for a while and we're in the process of removing the legacy PM optimization pipeline.	2022-04-11 14:02:17 -07:00
Arthur Eubanks	98574aa1d2	[test] Remove references to -fno-legacy-pass-manager in tests This has been the default for a while and we're in the process of removing the legacy PM optimization pipeline.	2022-04-11 13:40:27 -07:00
Arthur Eubanks	61d418f971	[test] Remove references to -fexperimental-new-pass-manager in tests This has been the default for a while and we're in the process of removing the legacy PM optimization pipeline.	2022-04-11 13:29:08 -07:00
Nikita Popov	918c5a9260	[Clang] Avoid legacy PM in some tests (NFC) Either remove legacy PM run lines or change them to use new PM.	2022-04-11 16:15:57 +02:00
Nikita Popov	b00d0fa2c7	[Clang] Add -no-opaque-pointers to native powerpc test (NFC) Does not run on x86, so I missed this before. The test currently has typed pointer check lines.	2022-04-11 13:15:46 +02:00
Nikita Popov	69f7f15683	[Clang] Add -no-opaque-pointers to recently added test (NFC)	2022-04-11 11:13:38 +02:00
Kai Luo	549e118e93	[PowerPC] Support 16-byte lock free atomics on pwr8 and up Make 16-byte atomic type aligned to 16-byte on PPC64, thus consistent with GCC. Also enable inlining 16-byte atomics on non-AIX targets on PPC64. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D122377	2022-04-08 23:25:56 +00:00
Mitch Phillips	fa34951fbc	Reland "[MTE] Add -fsanitize=memtag* and friends." Differential Revision: https://reviews.llvm.org/D118948	2022-04-08 14:28:33 -07:00
Aaron Ballman	11da1b53d8	[C89/C2x] Improve diagnostics around strict prototypes in C Functions without prototypes in C (also known as K&R C functions) were introduced into C89 as a deprecated feature and C2x is now reclaiming that syntax space with different semantics. However, Clang's -Wstrict-prototypes diagnostic is off-by-default (even in pedantic mode) and does not suffice to warn users about issues in their code. This patch changes the behavior of -Wstrict-prototypes to only diagnose declarations and definitions which are not going to change behavior in C2x mode, and enables the diagnostic in -pedantic mode. The diagnostic is now specifically about the fact that the feature is deprecated. It also adds -Wdeprecated-non-prototype, which is grouped under -Wstrict-prototypes and diagnoses declarations or definitions which will change behavior in C2x mode. This diagnostic is enabled by default because the risk is higher for the user to continue to use the deprecated feature. Differential Revision: https://reviews.llvm.org/D122895	2022-04-08 16:19:58 -04:00
Aaron Ballman	4aaf25b4f7	Revert "[MTE] Add -fsanitize=memtag* and friends." This reverts commit `8aa1490513`. Broke testing: https://lab.llvm.org/buildbot/#/builders/109/builds/36233	2022-04-08 16:15:58 -04:00
Mitch Phillips	8aa1490513	[MTE] Add -fsanitize=memtag* and friends. Currently, enablement of heap MTE on Android is specified by an ELF note, which signals to the linker to enable heap MTE. This change allows -fsanitize=memtag-heap to synthesize these notes, rather than adding them through the build system. We need to extend this feature to also signal the linker to do special work for MTE globals (in future) and MTE stack (currently implemented in the toolchain, but not implemented in the loader). Current Android uses a non-backwards-compatible ELF note, called ".note.android.memtag". Stack MTE is an ABI break anyway, so we don't mind that we won't be able to run executables with stack MTE on Android 11/12 devices. The current expectation is to support the verbiage used by Android, in that "SYNC" means MTE Synchronous mode, and "ASYNC" effectively means "fast", using the Kernel auto-upgrade feature that allows hardware-specific and core-specific configuration as to whether "ASYNC" would end up being Asynchronous, Asymmetric, or Synchronous on that particular core, whichever has a reasonable performance delta. Of course, this is platform and loader-specific. Differential Revision: https://reviews.llvm.org/D118948	2022-04-08 12:13:15 -07:00
Nikita Popov	692a147bf4	[CGCall] Make findDominatingStoreToReturnValue() more robust This was skipping specific lifetime + bitcast patterns, but with opaque pointers the bitcast will not be present, and we did not perform this fold. Instead skip over lifetime.end and bitcasts generally, without trying to correlate them.	2022-04-08 15:18:12 +02:00
serge-sans-paille	301e0d9135	[Clang][Fortify] drop inline decls when redeclared When an inline builtin declaration is shadowed by an actual declaration, we must reference the actual declaration, even if it's not the last, following GCC behavior. This fixes #54715 Differential Revision: https://reviews.llvm.org/D123308	2022-04-08 09:31:51 +02:00
Quinn Pham	fef56f79ac	Revert "[PowerPC] Fix EmitPPCBuiltinExpr to emit arguments once" This reverts commit `2aae5b1fac`. Because it breaks tests on windows.	2022-04-07 16:45:19 -05:00
Quinn Pham	2aae5b1fac	[PowerPC] Fix EmitPPCBuiltinExpr to emit arguments once This patch changes `EmitPPCBuiltinExpr` in `CGBuiltin.cpp` to remove the loop at the beginning of the function that emits the arguments and to delay emitting the arguments until inside the switch statement. These changes will put `EmitPPCBuiltinExpr` in line with the strategy of the target independent function `EmitBuiltinExpr`. Also, this patch ensures that arguments are only emitted once. Tests that included builtins affected by these changes have been modified to match expected behaviour. Reviewed By: #powerpc, nemanjai, amyk Differential Revision: https://reviews.llvm.org/D121637	2022-04-07 16:00:12 -05:00
Alex Brachet	50de659adc	[clang] Use -triple, not -target for %clang_cc1	2022-04-07 18:19:54 +00:00
Alex Brachet	3329dae5cb	[clang] Fix macos build broken after D120989	2022-04-07 18:17:29 +00:00
Nikita Popov	82d0f7bdb5	[Clang] Remove redundant -no-opaque-pointers flag in test (NFC) This was accidentally caught in an automated replacement. This test is testing the -opaque-pointers flag itself, so we shouldn't add -no-opaque-pointers here (though it doesn't hurt either). Also drop the line testing the default, as the default is now determined by a cmake option.	2022-04-07 13:53:37 +02:00
Kavitha Natarajan	b1ea0191a4	[clang][DebugInfo] Support debug info for alias variable clang to emit DWARF information for global alias variable as DW_TAG_imported_declaration. This change also handles nested (recursive) imported declarations. Reviewed by: dblaikie, aprantl Differential Revision: https://reviews.llvm.org/D120989	2022-04-07 17:15:40 +05:30
Nikita Popov	b16a3b4f3b	[Clang] Add -no-opaque-pointers to more tests (NFC) This adds the flag to more tests that were not caught by the mass-migration in `532dc62b90`.	2022-04-07 12:53:29 +02:00
Nikita Popov	532dc62b90	[OpaquePtrs][Clang] Add -no-opaque-pointers to tests (NFC) This adds -no-opaque-pointers to clang tests whose output will change when opaque pointers are enabled by default. This is intended to be part of the migration approach described in https://discourse.llvm.org/t/enabling-opaque-pointers-by-default/61322/9. The patch has been produced by replacing %clang_cc1 with %clang_cc1 -no-opaque-pointers for tests that fail with opaque pointers enabled. Worth noting that this doesn't cover all tests, there's a remaining ~40 tests not using %clang_cc1 that will need a followup change. Differential Revision: https://reviews.llvm.org/D123115	2022-04-07 12:09:47 +02:00
Ting Wang	b389354b28	[Clang][PowerPC] Add max/min intrinsics to Clang and PPC backend Add support for builtin_[max\|min] which has below prototype: A builtin_max (A1, A2, A3, ...) All arguments must have the same type; they must all be float, double, or long double. Internally use SelectCC to get the result. Reviewed By: qiucf Differential Revision: https://reviews.llvm.org/D122478	2022-04-05 22:43:48 -04:00
Tom Honermann	5531abaf71	[clang] Corrections for target_clones multiversion functions. This change merges code for emit of target and target_clones multiversion resolver functions and, in doing so, corrects handling of target_clones functions that are declared but not defined. Previously, a use of such a target_clones function would result in an attempted emit of an ifunc that referenced an undefined resolver function. Ifunc references to undefined resolver functions are not allowed and, when the LLVM verifier is not disabled (via '-disable-llvm-verifier'), resulted in the verifier issuing a "IFunc resolver must be a definition" error and aborting the compilation. With this change, ifuncs and resolver function definitions are always emitted for used target_clones functions regardless of whether the target_clones function is defined (if the function is defined, then the ifunc and resolver are emitted regardless of whether the function is used). This change has the side effect of causing target_clones variants and resolver functions to be emitted in a different order than they were previously. This is harmless and is reflected in the updated tests. Reviewed By: erichkeane Differential Revision: https://reviews.llvm.org/D122958	2022-04-05 19:50:22 -04:00
Tom Honermann	7c53fc4fe1	[clang] Emit target_clones resolver functions as COMDAT. Previously, resolver functions synthesized for target_clones multiversion functions were not emitted as COMDAT. Now fixed.	2022-04-05 15:34:35 -04:00
Tom Honermann	3531a4fa3b	[clang] NFC: Extend comdat validation in target multiversion function tests.	2022-04-05 15:34:35 -04:00
David Truby	4be1ec9fb5	[clang][AArc64][SVE] Add support for comparison operators on SVE types Comparison operators on SVE types return a signed integer vector of the same width as the incoming SVE type. This matches the existing behaviour for NEON types. Differential Revision: https://reviews.llvm.org/D122404	2022-04-05 13:56:27 +01:00
Zahira Ammarguellat	4d165ad7d9	In fast-math mode, when unsafe math optimizations are enabled, the compiler is allowed to use optimizations that allow reassociation and transformations that don’t guaranty accuracy. For example (x+y)+z is transformed into x+(y+z) . Although mathematically equivalent, these two expressions may not lead to the same final result due to errors of summation. Or x/x is transformed into 1.0 but x could be 0.0, INF or NaN. And so this transformation also may not lead to the same final result. Setting the eval method 'ffp-eval-method' or via '#pragma clang fp eval_method' in this mode, doesn’t have any effect. This patch adds code to warn the user of this. Differential Revision: https://reviews.llvm.org/D122155	2022-04-05 04:58:19 -07:00
Nikita Popov	f348ca51c7	[Tests] Use %clang_cc1 instead of %clang -cc1 in codegen tests (NFC)	2022-04-05 13:21:44 +02:00
Nikita Popov	b1f610fec5	[Test] Use cc1 interface in more tests (NFC) There doesn't seem to be any particular reason why these tests use the driver interface rather than the cc1 interface, which is typically used in CodeGen tests.	2022-04-05 13:16:24 +02:00
Nikita Popov	d69e9f9d89	[OpaquePtrs][Clang] Add -opaque-pointers/-no-opaque-pointers cc1 options This adds cc1 options for enabling and disabling opaque pointers on the clang side. This is not super useful now (because -mllvm -opaque-pointers and -Xclang -opaque-pointers have the same visible effect) but will be important once opaque pointers are enabled by default in clang. In that case, it will only be possible to disable them using the cc1 -no-opaque-pointers option. Differential Revision: https://reviews.llvm.org/D123034	2022-04-05 10:15:41 +02:00
Aaron Ballman	ee8a92b637	Require C99 for more tests; NFC intended This augments `5d90004874` which got all of the -verify lines, but accidentally missed all of the -verify= ones.	2022-04-04 15:58:26 -04:00
Aaron Ballman	5d90004874	Require C99 for these tests; NFC intended The tests are doing -verify and testing a diagnostic behavior, but that behavior is changing. This ensures the tests continue to run and check the diagnostic. The behavior of the tests is expected to remain identical as before.	2022-04-04 13:43:27 -04:00
Priyansh Singh	aa19500a4c	Correct a typo in a RUN line	2022-04-04 11:32:05 -04:00
Luo, Yuanke	979d876bb4	[X86][AMX] enable amx cast intrinsics in FE. We have some discission in D99152 and llvm-dev and finially come up with a solution to add amx specific cast intrinsics. We've support the intrinsics in llvm IR. This patch is to replace bitcast with amx cast intrinsics in code emitting in FE. Differential Revision: https://reviews.llvm.org/D122567	2022-04-02 14:02:35 +08:00
wangyihan	907d3acefc	[Clang][CodeGen]Beautify dump format, add indent for nested struct and struct members Beautify dump format, add indent for nested struct and struct members, also fix test cases in dump-struct-builtin.c for example: struct: ``` struct A { int a; struct B { int b; struct C { struct D { int d; union E { int x; int y; } e; } d; int c; } c; } b; }; ``` Before: ``` struct A { int a = 0 struct B { int b = 0 struct C { struct D { int d = 0 union E { int x = 0 int y = 0 } } int c = 0 } } } ``` After: ``` struct A { int a = 0 struct B { int b = 0 struct C { struct D { int d = 0 union E { int x = 0 int y = 0 } } int c = 0 } } } ``` Reviewed By: erichkeane Differential Revision: https://reviews.llvm.org/D122704	2022-03-31 07:38:37 +08:00
wangpc	cebbfd3d25	[RISCV] Add index check for vset/vget Index of vset/vget must be a constant integer and be located in right range. Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D122629	2022-03-30 19:29:13 +08:00
Zakk Chen	10b2760da0	Revert "[RISCV] Add policy operand for masked compare and vmsbf/vmsif/vmsof IR" This reverts commit `10fd2822b7`. I have a better implementation for those operations without the additional policy operand. masked compare and vmsbf/vmsif/vmsof are always tail agnostic so we could assume undef maskedoff is mask agnostic. Differential Revision: https://reviews.llvm.org/D122455	2022-03-29 18:05:33 -07:00
Phoebe Wang	cd26190a10	[X86][regcall] Support passing / returning structures Currently, the regcall calling conversion in Clang doesn't match with ICC when passing / returning structures. https://godbolt.org/z/axxKMKrW7 This patch tries to fix the problem to match with ICC. Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D122104	2022-03-29 11:29:57 +08:00
Chenbing Zheng	d9ef6ad05f	[RISCV] [NFC] add some tests for overloaded intrinsics of FP16 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D122564	2022-03-29 10:00:20 +08:00
Florian Hahn	8b245ab41d	[Clang,TBAA] Add test cases for nested pointers and TBAA data.	2022-03-27 19:59:37 +01:00
Florian Hahn	171cdba867	[Clang,TBAA] Use pattern for metadata reference in test. Update the single check line that still had a hard-coded metadata reference. This makes it more robust to slight changes in the metadata numbering.	2022-03-25 18:12:39 +00:00
Johannes Doerfert	a81fff8afd	Reapply "[Intrinsics] Add `nocallback` to the default intrinsic attributes" This reverts commit `c5f789050d` and reapplies `7aea3ea8c3` with additional test changes.	2022-03-25 09:36:50 -05:00
Hubert Tong	ce21c926f8	[Clang] Work with multiple pragmas weak before definition Update `WeakUndeclaredIdentifiers` to hold a collection of weak aliases per identifier instead of only one. This also allows the "used" state to be removed from `WeakInfo` because it is really only there as an alternative to removing processed map entries, and we can represent that using an empty set now. The serialization code is updated for the removal of the field. Additionally, a PCH test is added for the new functionality. The records are grouped by the "target" identifier, which was already being used as a key for lookup purposes. We also store only one record per alias name; combined, this means that diagnostics are grouped by the "target" and limited to one per alias (which should be acceptable). Fixes PR28611. Fixes llvm/llvm-project#28985. Reviewed By: aaron.ballman, cebowleratibm Differential Revision: https://reviews.llvm.org/D121927 Co-authored-by: Rachel Craik <rcraik@ca.ibm.com> Co-authored-by: Jamie Schmeiser <schmeise@ca.ibm.com>	2022-03-24 20:17:49 -04:00
wangyihan	7faa95624e	[clang][CodeGen]Fix clang crash and add bitfield support in __builtin_dump_struct Fix clang crash and add bitfield support in __builtin_dump_struct. In clang13.0.x, a struct with three or more members and a bitfield at the same time will cause a crash. In clang15.x, as long as the struct has one bitfield, it will cause a crash in clang. Open issue: https://github.com/llvm/llvm-project/issues/54462 Differential Revision: https://reviews.llvm.org/D122248	2022-03-24 12:23:29 -07:00
Johannes Doerfert	c5f789050d	Revert "[Intrinsics] Add `nocallback` to the default intrinsic attributes" This reverts commit `7aea3ea8c3` as it breaks the buildbots. I didn't see these failures in the pre-merge checks, looking into it.	2022-03-24 14:04:41 -05:00
Johannes Doerfert	7aea3ea8c3	[Intrinsics] Add `nocallback` to the default intrinsic attributes Most intrinsics, especially "default" ones, will not call back into the IR module. `nocallback` encodes this nicely. As it was not used before, this patch also makes use of `nocallback` in the Attributor which results in many more `norecurse` deductions. Tablegen part is mechanical, test updates by script. Differential Revision: https://reviews.llvm.org/D118680	2022-03-24 13:50:54 -05:00
Aaron Ballman	488c772920	Fix a crash with variably-modified parameter types in a naked function Naked functions have no prolog, so it's not valid to emit prolog code to evaluate the variably-modified type. This fixes Issue 50541.	2022-03-24 10:39:14 -04:00
Qiu Chaofan	895e5b2d80	[NFC] Format and uglify PowerPC intrinsics headers This change formats PowerPC intrinsics wrapper headers into LLVM style, and add extra prefix '__' to all variables to prevent conflict with user code.	2022-03-24 21:14:55 +08:00
Qiu Chaofan	406bde9a15	[PowerPC] [Clang] Add SSE4 and BMI intrinsics implementation Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D119407	2022-03-24 20:03:08 +08:00
Ben Shi	51585aa240	[clang][AVR] Implement standard calling convention for AVR and AVRTiny This patch implements avr-gcc's calling convention: https://gcc.gnu.org/wiki/avr-gcc#Calling_Convention Reviewed By: aykevl Differential Revision: https://reviews.llvm.org/D120720	2022-03-24 02:08:22 +00:00
Xiang1 Zhang	287dad13ab	[InlineAsm] Fix mangle problem when global variable used in inline asm (Add modifier P for ARR[BaseReg+IndexReg+..]) Reviewed By: skan Differential Revision: https://reviews.llvm.org/D120887	2022-03-24 09:41:23 +08:00
Xiang1 Zhang	8a6b644c79	[Inline asm] Fix mangle problem when variable used in inline asm. (Connect InlineAsm Memory Operand with its real value not just name) Revert 2 history bugfix patch: Revert "[X86][MS-InlineAsm] Make the constraint *m to be simple place holder" This patch revert https://reviews.llvm.org/D115225 which mainly fix problems intrduced by https://reviews.llvm.org/D113096 This reverts commit `d7c07f60b3`. Revert "Reland "[X86][MS-InlineAsm] Use exact conditions to recognize MS global variables"" This patch revert https://reviews.llvm.org/D116090 which fix problem intrduced by https://reviews.llvm.org/D115225 This reverts commit `24c68ea1eb`. Reviewed By: skan Differential Revision: https://reviews.llvm.org/D120886	2022-03-24 09:41:22 +08:00
Arthur Eubanks	9bd66b312c	[PassManager][Coroutine] Run passes under -O0 conditionally and run GlobalDCE CoroSplit lowers various coroutine intrinsics. It's a CGSCC pass and CGSCC passes don't run on unreachable functions. Normally GlobalDCE will come along and delete unreachable functions, but we don't run GlobalDCE under -O0, so an unreachable function with coroutine intrinsics may never have CoroSplit run on it. This patch adds GlobalDCE when coroutines intrinsics are present. It also now runs all coroutine passes conditional when coroutine intrinsics are present. This should also solve the -O0 regression reported in D105877 due to LazyCallGraph construction. Fixes https://github.com/llvm/llvm-project/issues/54117 Reviewed By: ChuanqiXu Differential Revision: https://reviews.llvm.org/D122275	2022-03-23 11:03:26 -07:00
Nick Desaulniers	5a2e56b70e	[Clang][NeonEmitter] emit ret decl first for -Wdeclaration-after-statement The generated arm_neon.h header isn't -Wdeclaration-after-statement compliant when targeting -mbig-endian. Update the generator to declare the return value, if any, first before any other arguments that might need to be "reversed" from little endian to big. Another approach would have been to try to ignore this warning in system headers, though that might not be precise for tokens involved in macro expansion. See also: https://reviews.llvm.org/D116833#3236209. Link: https://github.com/ClangBuiltLinux/linux/issues/1603 Fixes: https://github.com/llvm/llvm-project/issues/54062 Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D122189	2022-03-23 09:40:43 -07:00
David Truby	683fc6203c	[clang][AArc64][SVE] Implement vector-scalar operators This patch extends the support for C/C++ operators for SVE types to allow one of the arguments to be a scalar, in which case a vector splat is performed. Differential Revision: https://reviews.llvm.org/D121829	2022-03-23 14:20:48 +00:00
Zakk Chen	10fd2822b7	[RISCV] Add policy operand for masked compare and vmsbf/vmsif/vmsof IR intrinsics. Those operations are updated under a tail agnostic policy, but they could have mask agnostic or undisturbed. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D120228	2022-03-22 07:47:21 -07:00
Alan Zhao	8cd8bd4a5c	Implement __cpuid and __cpuidex as Clang builtins https://reviews.llvm.org/D23944 implemented the #pragma intrinsic from MSVC. This causes the statement #pragma intrinsic(cpuid) to fail [0] on Clang because cpuid is currently implemented in intrin.h instead of a Clang builtin. Reimplementing cpuid (as well as it's releated function, cpuidex) should resolve this. [0]: https://crbug.com/1279344 Differential revision: https://reviews.llvm.org/D121653	2022-03-18 18:13:52 +01:00
David Truby	f47e7e4a34	[clang][SVE] Add support for bitwise operators on SVE types This patch implements support for the &, \|, ^, and ~ operators on sizeless SVE types. Differential Revision: https://reviews.llvm.org/D121119	2022-03-18 14:06:47 +00:00
Kai Luo	9247145fba	[PowerPC][NFC] Add atomic alignments and ops tests for powerpc PowerPC is lacking tests checking `_Atomic` alignment in cfe. Adding these tests since we're going to make change to align with gcc on Linux. Reviewed By: hubert.reinterpretcast, jsji Differential Revision: https://reviews.llvm.org/D121441	2022-03-18 13:22:28 +08:00
Zahira Ammarguellat	bbf0d1932a	Currently the control of the eval-method is mixed with fast-math. FLT_EVAL_METHOD tells the user the precision at which, temporary results are evaluated but when fast-math is enabled, the numeric values are not guaranteed to match the source semantics, so the eval-method is meaningless. For example, the expression `x + y + z` has as source semantics `(x + y) + z`. FLT_EVAL_METHOD is telling the user at which precision `(x + y)` is evaluated. With fast-math enable the compiler can choose to evaluate the expression as `(y + z) + x`. The correct behavior is to set the FLT_EVAL_METHOD to `-1` to tell the user that the precision of the intermediate values is unknow. This patch is doing that. Differential Revision: https://reviews.llvm.org/D121122	2022-03-17 11:48:03 -07:00
Craig Topper	bbd2ecf9f0	[RISCV] Add +experimental-zvfh extension to cover half types in vectors. Currently we allow half types in vectors if the scalar Zfh extension is enabled. This behavior is not inline with the vector spec. For f32 and f64 types, the Zve32f, Zve64f, Zve64d, and V explicitly control the availablity of floating point types in vectors. In order to make our compiler compliant, we either need to remove all support for half in vectors or we need an extension to control it. Draft spec here https://github.com/riscv/riscv-v-spec/pull/780 Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D121345	2022-03-17 10:04:02 -07:00
Matt Devereau	a9e08bc7c1	[AArch64][SVE] InstCombine llvm.aarch64.sve.sel to select InstCombine llvm.aarch64.sve.sel to select. This allows an existing instCombine added in `20b0fa91c9` to fire. Differential Revision: https://reviews.llvm.org/D121792	2022-03-17 16:20:48 +00:00
Kazushi (Jam) Marukawa	9df395bb68	[Clang][VE] Add vector mask intrinsics to clang Add vector mask intrinsics instructions to clang. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D121816	2022-03-17 18:52:28 +09:00
Arthur Eubanks	2371c5a0e0	[OpaquePtr][ARM] Use elementtype on ldrex/ldaex/stlex/strex Includes verifier changes checking the elementtype, clang codegen changes to emit the elementtype, and ISel changes using the elementtype. Basically the same as D120527. Reviewed By: #opaque-pointers, nikic Differential Revision: https://reviews.llvm.org/D121847	2022-03-16 14:11:53 -07:00
Thomas Lively	7e8913d775	[WebAssembly] Fix names of SIMD instructions containing '_zero' Fix the instruction names to match the WebAssembly spec: - `i32x4.trunc_sat_zero_f64x2_{s,u}` => `i32x4.trunc_sat_f64x2_{s,u}_zero` - `f32x4.demote_zero_f64x2` => `f32x4.demote_f64x2_zero` Also rename related things like intrinsics, builtins, and test functions to match. Reviewed By: aheejin Differential Revision: https://reviews.llvm.org/D121661	2022-03-16 13:34:57 -07:00
David Truby	d38c9d3834	[NFC][clang][SVE] Auto-generate SVE operator tests.	2022-03-16 16:39:27 +00:00
Yonghong Song	3251ba2d0f	[Attr] Fix a btf_type_tag AST generation Current ASTContext.getAttributedType() takes attribute kind, ModifiedType and EquivType as the hash to decide whether an AST node has been generated or note. But this is not enough for btf_type_tag as the attribute might have the same ModifiedType and EquivType, but still have different string associated with attribute. For example, for a data structure like below, struct map_value { int __attribute__((btf_type_tag("tag1"))) __attribute__((btf_type_tag("tag3"))) a; int __attribute__((btf_type_tag("tag2"))) __attribute__((btf_type_tag("tag4"))) b; }; The current ASTContext.getAttributedType() will produce an AST similar to below: struct map_value { int __attribute__((btf_type_tag("tag1"))) __attribute__((btf_type_tag("tag3"))) a; int __attribute__((btf_type_tag("tag1"))) __attribute__((btf_type_tag("tag3"))) b; }; and this is incorrect. It is very difficult to use the current AttributedType as it is hard to get the tag information. To fix the problem, this patch introduced BTFTagAttributedType which is similar to AttributedType in many ways but with an additional BTFTypeTagAttr. The tag itself can be retrieved with BTFTypeTagAttr. With the new BTFTagAttributed type, the debuginfo code can be greatly simplified compared to previous TypeLoc based approach. Differential Revision: https://reviews.llvm.org/D120296	2022-03-16 08:46:52 -07:00
Kazushi (Jam) Marukawa	c2f62ab84b	[Clang][VE] Add the rest of intrinsics to clang Add the rest of intrinsics to clang except intrinsics using vector mask registers. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D121586	2022-03-17 00:17:21 +09:00
Simon Moll	0aab344104	[Clang] Allow "ext_vector_type" applied to Booleans This is the `ext_vector_type` alternative to D81083. This patch extends Clang to allow 'bool' as a valid vector element type (attribute ext_vector_type) in C/C++. This is intended as the canonical type for SIMD masks and facilitates clean vector intrinsic declarations. Vectors of i1 are supported on IR level and below down to many SIMD ISAs, such as AVX512, ARM SVE (fixed vector length) and the VE target (NEC SX-Aurora TSUBASA). The RFC on cfe-dev: https://lists.llvm.org/pipermail/cfe-dev/2020-May/065434.html Reviewed By: erichkeane Differential Revision: https://reviews.llvm.org/D88905	2022-03-16 11:10:32 +01:00
Keith Smiley	a2db7d5e9c	reland: [clang] Don't append the working directory to absolute paths This fixes a bug that happens when using -fdebug-prefix-map to remap an absolute path to a relative path. Since the path was absolute before remapping, it is safe to assume that concatenating the remapped working directory would be wrong. This was originally submitted as https://reviews.llvm.org/D113718, but reverted because when testing with dwarf 5 enabled, the tests were too strict. Differential Revision: https://reviews.llvm.org/D121663	2022-03-15 13:42:35 -07:00
Keith Smiley	cb22d71806	[clang] Fix DIFile directory root on Windows On unix systems this logic would not separate the file and directory of the DIFile unless they shared more components at the start than just the root path character. The logic to do this was unix specific so it didn't work on Windows. Now we check if the entire root_path is the same as what you were going to set as the Dir and use the full filepath in that case. Differential Revision: https://reviews.llvm.org/D111579	2022-03-14 20:07:01 -07:00
Dávid Bolvanský	003c0b9307	[Clang] always_inline statement attribute Motivation: ``` int test(int x, int y) { int r = 0; [[clang::always_inline]] r += foo(x, y); // force compiler to inline this function here return r; } ``` In 2018, @kuhar proposed "Introduce per-callsite inline intrinsics" in https://reviews.llvm.org/D51200 to solve this motivation case (and many others). This patch solves this problem with call site attribute. "noinline" statement attribute already landed in D119061. Also, some LLVM Inliner fixes landed so call site attribute is stronger than function attribute. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D120717	2022-03-14 21:45:31 +01:00
Arthur Eubanks	250620f76e	[OpaquePtr][AArch64] Use elementtype on ldxr/stxr Includes verifier changes checking the elementtype, clang codegen changes to emit the elementtype, and ISel changes using the elementtype. Reviewed By: #opaque-pointers, nikic Differential Revision: https://reviews.llvm.org/D120527	2022-03-14 10:09:59 -07:00
Arthur Eubanks	4fc7c55fff	[NewPM] Actually recompute GlobalsAA before module optimization pipeline RequireAnalysis<GlobalsAA> doesn't actually recompute GlobalsAA. GlobalsAA isn't invalidated (unless specifically invalidated) because it's self-updating via ValueHandles, but can be imprecise during the self-updates. Rather than invalidating GlobalsAA, which would invalidate AAManager and any analyses that use AAManager, create a new pass that recomputes GlobalsAA. Fixes #53131. Differential Revision: https://reviews.llvm.org/D121167	2022-03-14 09:42:34 -07:00
Erich Keane	dc152659b4	Have cpu-specific variants set 'tune-cpu' as an optimization hint Due to various implementation constraints, despite the programmer choosing a 'processor' cpu_dispatch/cpu_specific needs to use the 'feature' list of a processor to identify it. This results in the identified processor in source-code not being propogated to the optimizer, and thus, not able to be tuned for. This patch changes to use the actual cpu as written for tune-cpu so that opt can make decisions based on the cpu-as-spelled, which should better match the behavior expected by the programmer. Note that the 'valid' list of processors for x86 is in llvm/include/llvm/Support/X86TargetParser.def. At the moment, this list contains only Intel processors, but other vendors may wish to add their own entries as 'alias'es (or with different feature lists!). If this is not done, there is two potential performance issues with the patch, but I believe them to be worth it in light of the improvements to behavior and performance. 1- In the event that the user spelled "ProcessorB", but we only have the features available to test for "ProcessorA" (where A is B minus features), AND there is an optimization opportunity for "B" that negatively affects "A", the optimizer will likely choose to do so. 2- In the event that the user spelled VendorI's processor, and the feature list allows it to run on VendorA's processor of similar features, AND there is an optimization opportunity for VendorIs that negatively affects "A"s, the optimizer will likely choose to do so. This can be fixed by adding an alias to X86TargetParser.def. Differential Revision: https://reviews.llvm.org/D121410	2022-03-14 06:14:30 -07:00
Kazushi (Jam) Marukawa	b1b4b6f366	[Clang][VE] Add vector load intrinsics Add vector load intrinsic instructions for VE. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D121049	2022-03-12 09:09:57 +09:00
David Truby	058c92f2a4	[clang][SVE] Add aarch64-registered-target to sve vector op tests This fixes failing tests where aarch64 isn't available.	2022-03-11 16:01:00 +00:00
David Truby	3aca0ffd50	[clang][SVE] Add support for arithmetic operators on SVE types This patch implements support for the +, -, *, / and % operators on sizeless SVE types. Support for these operators on svbool_t is excluded. Differential Revision: https://reviews.llvm.org/D120323	2022-03-11 15:39:44 +00:00
Matt Devereau	6c5da880e0	[AArch64][SVE][Clang] Fix crash for incorrect svptrue and svcnt parameters Giving an int parameter to SVE intrinsics svptrue and svcnt caused Clang to crash on compilation. Changing their parameter types to void instead of omitting args results in a diagnostic error message instead. Differential Revision: https://reviews.llvm.org/D121294	2022-03-11 11:19:53 +00:00
4vtomat	25df633c24	Split up large test files(over 10k lines) under clang/test/CodeGen/RISCV including: The llvm pre-merge test got timeout due to large test files, this commit split up the files that have over 10k lines under clang/test/CodeGen/RISCV into even smaller ones. Differential Revision: https://reviews.llvm.org/D121431	2022-03-10 19:13:39 -08:00
Phoebe Wang	4de9a752d6	[X86] Add helper enum for ternary intrinsics Reviewed By: RKSimon, LuoYuanke Differential Revision: https://reviews.llvm.org/D120307	2022-03-08 11:19:05 +08:00
Qiu Chaofan	b2497e5435	[PowerPC] Add generic fnmsub intrinsic Currently in Clang, we have two types of builtins for fnmsub operation: one for float/double vector, they'll be transformed into IR operations; one for float/double scalar, they'll generate corresponding intrinsics. But for the vector version of builtin, the 3 op chain may be recognized as expensive by some passes (like early cse). We need some way to keep the fnmsub form until code generation. This patch introduces ppc.fnmsub.* intrinsic to unify four fnmsub intrinsics. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D116015	2022-03-07 13:00:06 +08:00
Shao-Ce SUN	fa9c8bab0c	[RISCV] Support k-ext clang intrinsics Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D112774	2022-03-05 13:57:18 +08:00
4vtomat	5a148869d3	[NFC] Divide tests into smaller files This commit divides the large test files(over 30k lines) under clang/test/CodeGen/RISCV including: rvv-intrinsics/vloxseg.c rvv-intrinsics/vluxseg.c rvv-intrinsics-overloaded/vloxseg.c rvv-intrinsics-overloaded/vluxseg.c into "non-masked" version and "masked" version which can reduce the test cases by 50% in a single file. Differential Revision: https://reviews.llvm.org/D120967	2022-03-04 04:16:52 -08:00
Tong Zhang	f76d3b800f	[clang][CGStmt] fix crash on invalid asm statement Clang is crashing on the following statement char var[9]; __asm__ ("" : "=r" (var) : "0" (var)); This is similar to existing test: crbug_999160_regtest The issue happens when EmitAsmStmt is trying to convert input to match output type length. However, that is not guaranteed to be successful all the time and if the statement itself is invalid like having an array type in the example, we should give a regular error message here instead of using assert(). Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D120596	2022-03-02 11:18:55 -08:00
Florian Mayer	1d730d80ce	[HWASAN] erase lifetime intrinsics if tag is outside. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D120437	2022-03-01 14:47:33 -08:00
Nicolas Miller	510fd283fd	[NVPTX] Add ex2.approx.f16/f16x2 support NOTE: this is a follow-up commit with the missing clang-side changes. This patch adds builtins and intrinsics for the f16 and f16x2 variants of the ex2 instruction. These two variants were added in PTX7.0, and are supported by sm_75 and above. Note that this isn't wired with the exp2 llvm intrinsic because the ex2 instruction is only available in its approx variant. Running ptxas on the assembly generated by the test f16-ex2.ll works as expected. Differential Revision: https://reviews.llvm.org/D119157	2022-03-01 11:07:11 -08:00
Jakub Chlanda	a895182302	[NVPTX] Add more FMA intriniscs/builtins This patch adds builtins/intrinsics for the following variants of FMA: NOTE: follow-up commit with the missing clang-side changes. - f16, f16x2 - rn - rn_ftz - rn_sat - rn_ftz_sat - rn_relu - rn_ftz_relu - bf16, bf16x2 - rn - rn_relu ptxas (Cuda compilation tools, release 11.0, V11.0.194) is happy with the generated assembly. Differential Revision: https://reviews.llvm.org/D118977	2022-03-01 11:07:11 -08:00
Jakub Chlanda	7a6d692b3b	[NVPTX] Expose float tys min, max, abs, neg as builtins Adds support for the following builtins: abs, neg: - .bf16, - .bf16x2 min, max - {.ftz}{.NaN}{.xorsign.abs}.f16 - {.ftz}{.NaN}{.xorsign.abs}.f16x2 - {.NaN}{.xorsign.abs}.bf16 - {.NaN}{.xorsign.abs}.bf16x2 - {.ftz}{.NaN}{.xorsign.abs}.f32 Differential Revision: https://reviews.llvm.org/D117887	2022-03-01 11:07:11 -08:00
Tong Zhang	17ce89fa80	[SanitizerBounds] Add support for NoSanitizeBounds function Currently adding attribute no_sanitize("bounds") isn't disabling -fsanitize=local-bounds (also enabled in -fsanitize=bounds). The Clang frontend handles fsanitize=array-bounds which can already be disabled by no_sanitize("bounds"). However, instrumentation added by the BoundsChecking pass in the middle-end cannot be disabled by the attribute. The fix is very similar to D102772 that added the ability to selectively disable sanitizer pass on certain functions. In this patch, if no_sanitize("bounds") is provided, an additional function attribute (NoSanitizeBounds) is attached to IR to let the BoundsChecking pass know we want to disable local-bounds checking. In order to support this feature, the IR is extended (similar to D102772) to make Clang able to preserve the information and let BoundsChecking pass know bounds checking is disabled for certain function. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D119816	2022-03-01 18:47:02 +01:00
Kristina Bessonova	57aaab3b17	[NVPTX] Fix nvvm.match.sync*.i64 intrinsics return type (i64 -> i32) NVVM IR specification defines them with i32 return type: declare i32 @llvm.nvvm.match.any.sync.i64(i32 %membermask, i64 %value) declare {i32, i1} @llvm.nvvm.match.all.sync.i64(i32 %membermask, i64 %value) ... The i32 return value is a 32-bit mask where bit position in mask corresponds to thread’s laneid. as well as PTX ISA: 9.7.12.8. Parallel Synchronization and Communication Instructions: match.sync match.any.sync.type d, a, membermask; match.all.sync.type d[\|p], a, membermask; ... Destination d is a 32-bit mask where bit position in mask corresponds to thread’s laneid. Additionally, ptxas doesn't accept intructions, produced by NVPTX backend. After this patch, it compiles with no issues. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D120499	2022-03-01 12:26:16 +02:00
Dávid Bolvanský	223b824022	[Clang] noinline call site attribute Motivation: ``` int foo(int x, int y) { // any compiler will happily inline this function return x / y; } int test(int x, int y) { int r = 0; [[clang::noinline]] r += foo(x, y); // for some reason we don't want any inlining here return r; } ``` In 2018, @kuhar proposed "Introduce per-callsite inline intrinsics" in https://reviews.llvm.org/D51200 to solve this motivation case (and many others). This patch solves this problem with call site attribute. The implementation is "smaller" wrt approach which uses new intrinsics and thanks to https://reviews.llvm.org/D79121 (Add nomerge statement attribute to clang), we have got some basic infrastructure to deal with attrs on statements with call expressions. GCC devs are more inclined to call attribute solution as well, as builtins are problematic for them - https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104187. But they have no patch proposal yet so.. We have free hands here. If this approach makes sense, next future steps would be support for call site attributes for always_inline / flatten. Reviewed By: aaron.ballman, kuhar Differential Revision: https://reviews.llvm.org/D119061	2022-02-28 21:21:17 +01:00
Itay Bookstein	f3480390be	[clang][CodeGen] Avoid emitting ifuncs with undefined resolvers The purpose of this change is to fix the following codegen bug: ``` // main.c __attribute__((cpu_specific(generic))) int foo(void) { static int z; return &z;} int main() { return foo() = 5; } // other.c __attribute__((cpu_dispatch(generic))) int foo(void); // run: clang main.c other.c -o main; ./main ``` This will segfault prior to the change, and return the correct exit code 5 after the change. The underlying cause is that when a translation unit contains a cpu_specific function without the corresponding cpu_dispatch the generated code binds the reference to foo() against a GlobalIFunc whose resolver is undefined. This is invalid: the resolver must be defined in the same translation unit as the ifunc, but historically the LLVM bitcode verifier did not check that. The generated code then binds against the resolver rather than the ifunc, so it ends up calling the resolver rather than the resolvee. In the example above it treats its return value as an int , therefore trying to write to program text. The root issue at the representation level is that GlobalIFunc, like GlobalAlias, does not support a "declaration" state. The object which provides the correct semantics in these cases is a Function declaration, but unlike Functions, changing a declaration to a definition in the GlobalIFunc case constitutes a change of the object type, as opposed to simply emitting code into a Function. I think this limitation is unlikely to change, so I implemented the fix by returning a function declaration rather than an ifunc when encountering cpu_specific, and upgrading it to an ifunc when emitting cpu_dispatch. This uses `takeName` + `replaceAllUsesWith` in similar vein to other places where the correct IR object type cannot be known locally/up-front, like in `CodeGenModule::EmitAliasDefinition`. Previous discussion in: https://reviews.llvm.org/D112349 Signed-off-by: Itay Bookstein <ibookstein@gmail.com> Reviewed By: erichkeane Differential Revision: https://reviews.llvm.org/D120266	2022-02-26 11:17:49 +02:00
Adrian Prantl	bc7aeea854	Revert "Don't append the working directory to absolute paths" This reverts commit `2cd9a86da5`.	2022-02-25 17:00:10 -08:00
Adrian Prantl	2cd9a86da5	Don't append the working directory to absolute paths This fixes a bug that happens when using -fdebug-prefix-map to remap an absolute path to a relative path. Since the path was absolute before remapping, it is safe to assume that concatenating the remapped working directory would be wrong. Differential Revision: https://reviews.llvm.org/D113718	2022-02-25 13:03:59 -08:00
Ben Shi	91c587b3b1	[clang][NFC] Move all avr CodeGen tests to avr specific directory Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D120468	2022-02-25 00:23:46 +00:00
Fangrui Song	da047445f7	[clang][test] Make mips-vector-return.c work with CLANG_DEFAULT_PIE_ON_LINUX=on	2022-02-24 15:25:35 -08:00
Aaron Ballman	df00a995f9	Fix typo in file name; NFC	2022-02-24 15:41:25 -05:00
Aaron Ballman	2ceee2f884	Add -Wno-strict-prototypes to C tests; NFC This patch adds -Wno-strict-prototypes to all of the test cases that use functions without prototypes, but not as the primary concern of the test. e.g., attributes testing whether they can/cannot be applied to a function without a prototype, etc. This is done in preparation for enabling -Wstrict-prototypes by default.	2022-02-24 15:30:30 -05:00
Aaron Ballman	1c2558021c	Use functions with prototypes when appropriate; NFC A significant number of our tests in C accidentally use functions without prototypes. This patch converts the function signatures to have a prototype for the situations where the test is not specific to K&R C declarations. e.g., void func(); becomes void func(void); This is the final batch of tests being updated to add prototypes, hopefully.	2022-02-24 15:30:13 -05:00
Aaron Ballman	003924963f	Remove useless RUN lines in the middle of the file and pipe to FileCheck; NFC	2022-02-24 11:01:33 -05:00
Nikita Popov	a266af7211	[InstCombine] Canonicalize SPF to min/max intrinsics Now that integer min/max intrinsics have good support in both InstCombine and other passes, start canonicalizing SPF min/max to intrinsic min/max. Once this sticks, we can stop matching SPF min/max in various places, and can remove hacks we have for preventing infinite loops and breaking of SPF canonicalization. Differential Revision: https://reviews.llvm.org/D98152	2022-02-24 09:01:20 +01:00
Fangrui Song	0477cac332	[asan] Allow -fsanitize-address-globals-dead-stripping with -fno-data-sections for ELF -fdata-sections decides whether global variables go into different sections. This is orthogonal to whether we place their metadata (`.data` or `asan_globals`) into different sections. With -fno-data-sections, `-fsanitize-address-globals-dead-stripping` can still: * deduplicate COMDAT `asan.module_ctor` and `asan.module_dtor` * (with ld --gc-sections): for a data section (e.g. `.data`), if all global variables defined relative to it are unreferenced, discard them and associated `asan_globals` sections (rare but no need to exclude this case) Similar to `c7b90947bd` for PE/COFF. Reviewed By: #sanitizers, kstoimenov, vitalybuka Differential Revision: https://reviews.llvm.org/D120394	2022-02-23 16:08:25 -08:00
Zahira Ammarguellat	1592d88aa7	Add support for floating-point option `ffp-eval-method` and for `pragma clang fp eval_method`. Differential Revision: https://reviews.llvm.org/D109239	2022-02-23 15:00:18 -08:00
Florian Hahn	09193f20a1	Revert "Add support for floating-point option `ffp-eval-method` and for" This reverts commit `32b73bc6ab`. This breaks builds on macOS in some configurations, because __FLT_EVAL_METHOD__ is set to an unexpected value. E.g. https://green.lab.llvm.org/green/job/clang-stage1-RA/28282/consoleFull#129538464349ba4694-19c4-4d7e-bec5-911270d8a58c More details available in the review thread https://reviews.llvm.org/D109239	2022-02-18 11:04:00 +00:00
hyeongyukim	35baa26747	[NFC][Clang/test] add target triple to CodeGen/analyze_noundef.cpp	2022-02-18 17:54:22 +09:00
Alexander Potapenko	c85a26454d	[asan] Add support for disable_sanitizer_instrumentation attribute For ASan this will effectively serve as a synonym for __attribute__((no_sanitize("address"))). Adding the disable_sanitizer_instrumentation to functions will drop the sanitize_XXX attributes on the IR level. This is the third reland of https://reviews.llvm.org/D114421. Now that TSan test is fixed (https://reviews.llvm.org/D120050) there should be no deadlocks. Differential Revision: https://reviews.llvm.org/D120055	2022-02-18 09:51:54 +01:00
hyeongyukim	5333447a00	[NFC] Fix a buildbot failure after `b529744`	2022-02-18 17:38:50 +09:00
hyeongyukim	b529744c29	[Clang] Rename `disable-noundef-analysis` flag to `-[no-]enable-noundef-analysis` This flag was previously renamed `enable_noundef_analysis` to `disable-noundef-analysis,` which is not a conventional name. (Driver and CC1's boolean options are using [no-] prefix) As discussed at https://reviews.llvm.org/D105169, this patch reverts its name to `[no-]enable_noundef_analysis` and enables noundef-analysis as default. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D119998	2022-02-18 17:02:41 +09:00
Peter Collingbourne	82e5f951fd	AST: Move __va_list tag back to std conditionally on AArch64. In post-commit feedback on D104830 Jessica Clarke pointed out that unconditionally adding __va_list to the std namespace caused namespace debug info to be emitted in C, which is not only inappropriate but turned out to confuse the dtrace tool. Therefore, move __va_list back to std only in C++ so that the correct debug info is generated. We also considered moving __va_list to the top level unconditionally but this would contradict the specification and be visible to AST matchers and such, so make it conditional on the language mode. To avoid breaking name mangling for __va_list, teach the Itanium name mangler to always mangle it as if it were in the std namespace when targeting ARM architectures. This logic is not needed for the Microsoft name mangler because Microsoft platforms define va_list as a typedef of char *. Depends on D116773 Differential Revision: https://reviews.llvm.org/D116774	2022-02-17 11:31:40 -08:00
Zakk Chen	ca78312407	[RISCV] Add the policy operand for nomask vector Multiply-Add IR intrinsics. The goal is support tail and mask policy in RVV builtins. We focus on IR part first. The nomask vector Multiply-Add need a policy operand because merge value could not be undef. Reviewed By: monkchiang Differential Revision: https://reviews.llvm.org/D119727	2022-02-17 09:12:46 -08:00
Zahira Ammarguellat	32b73bc6ab	Add support for floating-point option `ffp-eval-method` and for `pragma clang fp eval_method`. https://reviews.llvm.org/D109239	2022-02-17 08:59:21 -08:00
Zakk Chen	eeb7754f68	[RISCV] Add the passthru operand for vmv.vv/vmv.vx/vfmv.vf IR intrinsics. Add the passthru operand for VMV_V_X_VL, VFMV_V_F_VL and SPLAT_VECTOR_SPLIT_I64_VL also. The goal is support tail and mask policy in RVV builtins. We focus on IR part first. If the passthru operand is undef, we use tail agnostic, otherwise use tail undisturbed. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D119688	2022-02-17 06:38:14 -08:00
Zakk Chen	093ecccdab	[RISCV] Add the passthru operand for vadc/vsbc/vmerge/vfmerge IR intrinsics. The goal is support tail and mask policy in RVV builtins. We focus on IR part first. If the passthru operand is undef, we use tail agnostic, otherwise use tail undisturbed. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D119686	2022-02-17 02:21:39 -08:00
Aaron Ballman	46b0d0eef9	Use functions with prototypes when appropriate; NFC A significant number of our tests in C accidentally use functions without prototypes. This patch converts the function signatures to have a prototype for the situations where the test is not specific to K&R C declarations. e.g., void func(); becomes void func(void); This is the twelfth batch of tests being updated (the end may be in sight soon though).	2022-02-16 16:10:37 -05:00
Qiu Chaofan	e30efa0304	[NFC] Fix intel intrinsics test for PowerPC These tests are dumped without optimization, which makes them too lengthy and contain meaningless load/stores. Clean them up to prepare for future headers update.	2022-02-16 16:16:30 +08:00
Zakk Chen	e8973dd389	[RISCV] Add the passthru operand for some RVV nomask unary and nullary intrinsics. The goal is support tail and mask policy in RVV builtins. We focus on IR part first. If the passthru operand is undef, we use tail agnostic, otherwise use tail undisturbed. My plan is to handle more complex operations in follow-up patches. Reviewers: frasercrmck Differential Revision: https://reviews.llvm.org/D118253	2022-02-15 22:34:06 -08:00
Nico Weber	125abb61f7	Revert "Add support for floating-point option `ffp-eval-method` and for" This reverts commit `4bafe65c2b`. Breaks at least Misc/warning-flags.c, see comments on https://reviews.llvm.org/D109239	2022-02-15 22:02:25 -05:00
Zakk Chen	b784719904	[RISCV] Add the passthru operand for RVV nomask binary intrinsics. The goal is support tail and mask policy in RVV builtins. We focus on IR part first. If the passthru operand is undef, we use tail agnostic, otherwise use tail undisturbed. Add passthru operand for VSLIDE1UP_VL and VSLIDE1DOWN_VL to support i64 scalar in rv32. The masked VSLIDE1 would only emit mask undisturbed policy regardless of giving mask agnostic policy until InsertVSETVLI supports mask agnostic. Reviewed by: craig.topper, rogfer01 Differential Revision: https://reviews.llvm.org/D117989	2022-02-15 18:36:18 -08:00
Zahira Ammarguellat	4bafe65c2b	Add support for floating-point option `ffp-eval-method` and for `pragma clang fp eval_method`.	2022-02-15 13:59:27 -08:00
Aaron Ballman	adc402bf3d	Use functions with prototypes when appropriate; NFC A significant number of our tests in C accidentally use functions without prototypes. This patch converts the function signatures to have a prototype for the situations where the test is not specific to K&R C declarations. e.g., void func(); becomes void func(void); This is the eleventh batch of tests being updated (there are a significant number of other tests left to be updated).	2022-02-15 16:06:43 -05:00
Amy Kwan	5dc0a1657b	[PowerPC] Fix __builtin_pdepd and __builtin_pextd to be 64-bit and P10 only. The `__builtin_pdepd` and `__builtin_pextd` are P10 builtins that are meant to be used under 64-bit only. For instance, when the builtins are compiled under 32-bit mode: ``` $ cat t.c unsigned long long foo(unsigned long long a, unsigned long long b) { return __builtin_pextd(a,b); } $ clang -c t.c -mcpu=pwr10 -m32 ExpandIntegerResult #0: t31: i64 = llvm.ppc.pextd TargetConstant:i32<6928>, t28, t29 fatal error: error in backend: Do not know how to expand the result of this operator! ``` This patch adds sema checking for these builtins to compile under 64-bit mode only and on P10. The builtins will emit a diagnostic when they are compiled on non-P10 compilations and on 32-bit mode. Differential Revision: https://reviews.llvm.org/D118753	2022-02-15 12:30:50 -06:00
Aaron Ballman	ed509fe296	Use functions with prototypes when appropriate; NFC A significant number of our tests in C accidentally use functions without prototypes. This patch converts the function signatures to have a prototype for the situations where the test is not specific to K&R C declarations. e.g., void func(); becomes void func(void); This is the tenth batch of tests being updated (there are a significant number of other tests left to be updated).	2022-02-15 09:28:02 -05:00

... 2 3 4 5 6 ...

7691 Commits